[Lazarus] UTF16 2 utf8

Hans-Peter Diettrich DrDiettrich1 at aol.com
Thu May 5 12:10:56 CEST 2011


Graeme Geldenhuys schrieb:
> Quoted text from the Unicode wikipedia article. Note the last three lines.
> 
> "Unicode can be implemented by different character encodings. The most
> commonly used encodings are UTF-8 (which uses one byte for any ASCII
> characters, which have the same code values in both UTF-8 and ASCII
> encoding, and up to four bytes for other characters), the now-obsolete
> UCS-2 (which uses two bytes for each character but cannot encode every
> character in the current Unicode standard), and UTF-16 (which extends
> UCS-2 to handle code points beyond the scope of UCS-2)."

IMO UCS-2 primarily is an ISO/IEC Unicode version, with 16 bit code 
positions, as applicable to Unicode 1.  Later Unicode versions require 
more bits, so that UCS-4 (32 bit) had been introduced.

DoDi





More information about the Lazarus mailing list