Re: Getting TextEdit to Recognise UTF-8 Output
Re: Getting TextEdit to Recognise UTF-8 Output
- Subject: Re: Getting TextEdit to Recognise UTF-8 Output
- From: Sixten Otto <email@hidden>
- Date: Fri, 21 May 2010 16:23:28 -0400
On Fri, May 21, 2010 at 3:42 PM, K.Darcy Otto <email@hidden> wrote:
> I've tried prefacing the text file with a BOM in this way:
You may well already know this, and that may be something you're
attempting out of desperation, but:
"The endian order entry for UTF-8 in Table 2-4 is marked N/A because
UTF-8 code units
are 8 bits in size, and the usual machine issues of endian order for
larger code units do not
apply. The serialized order of the bytes must not depart from the
order defined by the UTF-
8 encoding form. Use of a BOM is neither required nor recommended for
UTF-8, but may
be encountered in contexts where UTF-8 data is converted from other
encoding forms that
use a BOM or where the BOM is used as a UTF-8 signature. See the “Byte
Order Mark”
subsection in Section 16.8, Specials, for more information."
http://www.unicode.org/versions/Unicode5.0.0/ch02.pdf, p36
_______________________________________________
Cocoa-dev mailing list (email@hidden)
Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden