[Lazarus] UTF16 2 utf8
Hans-Peter Diettrich
DrDiettrich1 at aol.com
Thu May 5 12:10:56 CEST 2011
Graeme Geldenhuys schrieb:
> Quoted text from the Unicode wikipedia article. Note the last three lines.
>
> "Unicode can be implemented by different character encodings. The most
> commonly used encodings are UTF-8 (which uses one byte for any ASCII
> characters, which have the same code values in both UTF-8 and ASCII
> encoding, and up to four bytes for other characters), the now-obsolete
> UCS-2 (which uses two bytes for each character but cannot encode every
> character in the current Unicode standard), and UTF-16 (which extends
> UCS-2 to handle code points beyond the scope of UCS-2)."
IMO UCS-2 primarily is an ISO/IEC Unicode version, with 16 bit code
positions, as applicable to Unicode 1. Later Unicode versions require
more bits, so that UCS-4 (32 bit) had been introduced.
DoDi
More information about the Lazarus
mailing list