[Lazarus] UTF-8 XML

Felipe Monteiro de Carvalho felipemonteiro.carvalho at gmail.com
Sun Jun 24 10:17:53 CEST 2012


Hello,

I am using xmlread and dom from FPC to read a XML file and I got an
unpleasant surprise.

It is converting things like this:

<mo>&#x00B1;<!-- ± --></mo>

Into:

<mo>±</mo>

But encoded in ISO 8859-1 which is awful as I don't want my program to
go into the dark ages of pre-unicode and all problems it has.

I know that we have a XML reading library in Lazarus which uses UTF-8,
but can it be utilized as a direct replacement for xmlread and dom?

To write my code I utilized this wiki page which I wrote most of a
long time ago (back then I wasn't using non-ASCII so didn't notice
this issue): http://wiki.freepascal.org/XML_Tutorial

Hopefully the Lazarus XML routines will be very similar...

Looking at our package LazUtils I found laz2_xmlread and laz_xmlread,
so I suppose I should just go for the version with 2 on it, correct?

thanks,
-- 
Felipe Monteiro de Carvalho




More information about the Lazarus mailing list