[Lazarus] cwstring in arm-linux

Hans-Peter Diettrich DrDiettrich1 at aol.com
Fri Oct 21 10:19:35 CEST 2011


Graeme Geldenhuys schrieb:
> On 2011-10-21 00:20, Hans-Peter Diettrich wrote:
>> your legacy code can assume that every (visible) character is a Char, in 
>> an SBCS codepage, this is not different in UTF-16.
> 
> Rookie mistake!!! You forgot surrogate pairs in UTF-16.

Which Ansi characters translate into surrogate pairs?


> Now in UTF-8, this is not a problem at all. Finding a visible character
> in the BMP or Supplementary Plane is a identical process, no special
> checking is required. Thus making UTF-8 much easier and safer to use.

Please specify "Finding", a code snippet would be nice.


> I've ported enough Delphi code to FPC + fpGUI where UTF-8 is used for
> Unicode support. I fully agree with Felipe, using UTF-8 is much easier
> with legacy code that UTF-16.

This only demonstrates that UTF-16 has not been supported sufficiently 
in FPC, until now. Give an example of UTF-8 code, which would become 
*more* complicated with UTF-16.

DoDi





More information about the Lazarus mailing list