[Lazarus] Some questions on Unicode wiki page

Hans-Peter Diettrich DrDiettrich1 at aol.com
Fri Jun 22 20:57:21 CEST 2012


Michael Schnell schrieb:
> On 06/22/2012 01:17 PM, Felipe Monteiro de Carvalho wrote:
>> Yes, but in practice the only place that I know that uses everything 
>> in decomposed unicode is the Mac OS X filesystem. So for dealing with 
>> Mac OS X filenames we need special care and for the rest just suppose 
>> composed mode (but remembering that some chars have no composed mode!). 
> 
> Anyway it might be helpful to note this in the wiki page at 
> http://wiki.lazarus.freepascal.org/LCL_Unicode_Support#Searching_a_substring. 
> 
> 
> The text "For searching in a substring they do work perfectly" might 
> cause misunderstanding on that behalf. (Of course it technically does 
> work fine, but the ambiguity of Unicode itself introduces potential 
> problems).

IMO the meaning is clear: searching for an sub*string* means an *exact* 
match. Who wants case insensitive or otherwise inexact matches, has to 
use specialized *text* functions.

Ambiguous character representations are a general problem with 
characters, where e.g. line endings or whitespace can be represented in 
different ways. The accent/umlaut composition is not a Unicode problem, 
it occurs in all codepages.

DoDi





More information about the Lazarus mailing list