Re: a little bit offlist: charakter tables
Re: a little bit offlist: charakter tables
- Subject: Re: a little bit offlist: charakter tables
- From: Frank Miedreich <email@hidden>
- Date: Tue, 17 Dec 2002 14:49:40 +0100
Hi,
if you parse the xml file you should find that the first line
specifies the character encoding of the xml file. Something like:
<?xml version="1.0" encoding="iso-8859-1"?>
Often xml files are encoded in either Unicode or UTF-8, both can be
converted with TEC (Text Encoding Converter, part of the Mac OS). I
think there is an osax that provides an interface to TEC, search
osaxen.com.
If your high order ascii chars appear as 2 characters in the xml file
it probably is UTF-8. If you want to produce unicode strings on the
Mac side of things a complete conversion table is somewhat lengthy,
but you should be able to find one in perl.
cheers, Frank
At 14:16 Uhr +0100 17.12.2002, Michael Stoll wrote:
> i have to parse/write a xml and convert characters like "i" to ")".
ups, these characters got lost, so i'll describe it:
"e plus accent" should get converted to "radix plus copyright-sign".
the questions remain:
two questions:
what kind of conversion is that at all?
where can i find a complete conversion table?
thank you very much.
> michael.
--
Frank Miedreich
Motorstr 38 - 80809 M|nchen - (089) 3544841
_______________________________________________
applescript-users mailing list | email@hidden
Help/Unsubscribe/Archives:
http://www.lists.apple.com/mailman/listinfo/applescript-users
Do not post admin requests to the list. They will be ignored.