[Lazarus] logic bug in many (or most) TSynEdit

Marco van de Voort marcov at stack.nl
Sat Jun 5 13:31:50 CEST 2010


On Sat, Jun 05, 2010 at 11:29:10AM +0300, ik wrote:
> > Not really. UTF-8 is implemented in such a way that it is indeed
> > possible to treat it as Extended ASCII in this case.
> >
> 
> How so? Here is a multi-byte char: ?? . It takes more than a word to be
> used, so you cannot do S[i], because it will give you only part of the
> char (one byte).

Identifier scanning in Unicode-enabled languages is usually changed from
determining which chars are allowed to determining which ones are explicitly
not allowed (whitespace, a standard digit as the first char, etc.).
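
A minimal sketch of that idea, in Python rather than Pascal and only as an
illustration: the scanner walks the UTF-8 buffer one byte at a time and stops
at the first byte found in a deny-list. Every byte of a multi-byte UTF-8
sequence is >= $80, so it can never match an ASCII delimiter and a character
is never split. The names DELIMITERS and scan_identifier and the exact
delimiter set are assumptions made up for this example; they are not taken
from SynEdit or any highlighter.

```python
# Hypothetical deny-list: ASCII bytes that may never appear inside an identifier.
# Iterating a bytes literal yields ints, so this is a set of byte values.
DELIMITERS = frozenset(b" \t\r\n+-*/=<>()[]{}.,;:'\"^@#$&|!?%~\\`")


def scan_identifier(buf: bytes, start: int) -> int:
    """Return the index just past the identifier that begins at `start`.

    Returns `start` unchanged when no identifier begins there.
    """
    if start >= len(buf):
        return start
    first = buf[start]
    # The first byte must not be a delimiter or a standard ASCII digit 0-9.
    if first in DELIMITERS or 0x30 <= first <= 0x39:
        return start
    i = start + 1
    # Bytes >= 0x80 (parts of multi-byte characters) are never in the
    # deny-list, so the loop passes over them like any other identifier byte.
    while i < len(buf) and buf[i] not in DELIMITERS:
        i += 1
    return i


if __name__ == "__main__":
    src = "größe := 42".encode("utf-8")
    end = scan_identifier(src, 0)
    print(src[:end].decode("utf-8"))
```

Note that this sketch only excludes ASCII delimiters; non-ASCII whitespace
such as U+00A0 would be accepted as part of an identifier unless it is
handled separately.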
 



