[Lazarus] GB18030 support in Lazarus

Martin Frb lazarus at mfriebe.de
Fri Oct 16 15:33:03 CEST 2015


On 16/10/2015 10:19, Tony Whyman wrote:
>
> In terms of "work", if I use functions such as UTF8Length and 
> ValidUTF8String on a GB18030 string should they always work, or are 
> there exceptions?

IIRC ... UTF8Length counts codepoints, not chars. So if the text you 
are interested in has chars that need more than one codepoint (e.g. a 
base letter followed by a combining accent), then the result is not 
the length in chars.
This can even happen with some western languages, but it is not likely 
with them.
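
A small sketch of the difference, in Python and not calling the LazUTF8
functions themselves. The helper names are made up for illustration, and
"not a combining mark" is only a rough stand-in for what a user sees as
one char (proper grapheme segmentation is more involved):

```python
import unicodedata

def codepoint_count(text: str) -> int:
    # len() on a Python str counts code points, which is the kind of
    # count the post attributes to UTF8Length.
    return len(text)

def approx_char_count(text: str) -> int:
    # Rough approximation of user-perceived chars: skip code points
    # that are combining marks (combining class != 0).
    return sum(1 for ch in text if unicodedata.combining(ch) == 0)

precomposed = "\u00e9"   # e-acute stored as a single code point
decomposed = "e\u0301"   # 'e' followed by COMBINING ACUTE ACCENT

print(codepoint_count(precomposed), approx_char_count(precomposed))  # 1 1
print(codepoint_count(decomposed), approx_char_count(decomposed))    # 2 1
print(len(decomposed.encode("utf-8")))                               # 3 bytes
```

Both strings display as the same single char, yet the codepoint count
differs.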

The same goes for the char-accessing functions (NextUtf8CharByteLen or 
similar). They only work on codepoints.
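
To illustrate what stepping by codepoint means, here is a hedged sketch
in Python of walking a UTF-8 byte string using only the lead byte of
each sequence. It is not the Lazarus implementation; the function names
are hypothetical, and treating a stray or invalid byte as a 1-byte step
is an assumption of this sketch:

```python
from typing import List

def utf8_codepoint_byte_len(lead: int) -> int:
    # Derive the sequence length from the lead byte's high bits.
    if lead < 0x80:
        return 1              # 0xxxxxxx: ASCII
    if lead >> 5 == 0b110:
        return 2              # 110xxxxx
    if lead >> 4 == 0b1110:
        return 3              # 1110xxxx
    if lead >> 3 == 0b11110:
        return 4              # 11110xxx
    return 1                  # assumption: skip one byte on invalid input

def split_codepoints(data: bytes) -> List[bytes]:
    # Cut the byte string into one slice per codepoint.
    pieces = []
    i = 0
    while i < len(data):
        n = utf8_codepoint_byte_len(data[i])
        pieces.append(data[i:i + n])
        i += n
    return pieces

# 'e' + combining acute accent: one visible char, two codepoints.
pieces = split_codepoints("e\u0301".encode("utf-8"))
print(pieces)  # [b'e', b'\xcc\x81']
```

The walk stops after the plain 'e' and again after the accent, so a
caller that wants whole chars has to group the pieces itself.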



