Re: Is there any support in Cocoa for stupidly encoded UTF-8 string?
- Subject: Re: Is there any support in Cocoa for stupidly encoded UTF-8 string?
- From: Andrew Farmer <email@hidden>
- Date: Thu, 20 Jan 2005 11:42:00 -0800
On 20 Jan 2005, at 09:26, Stephane Sudre wrote:
> In some e-mail subjects, people are sending text that is supposed to be
> UTF-8 encoded but is actually poorly encoded Unicode.
> For instance, instead of 0xC3A9 for eacute, you end up with 0xE9
> (where it should be 0x00E9).
> When you use NSString initWithBytes:length:encoding: with the UTF-8
> encoding as the parameter, you obtain nil. I understand this.
> Now, the question is: is there a method in Cocoa to deal with stupidly
> encoded UTF-8 strings?
What you're looking at is ISO 8859-1 (Latin-1) encoded text. Decode it as
such and you'll be fine.
I'm pretty sure that there *should* be some easy way to detect whether
text in the subject is encoded with ISO 8859-1 or UTF-8. The standard to
look up for non-ASCII text in mail headers is RFC 2047.
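
For unlabelled bytes like the ones in the question, one common heuristic is
to try UTF-8 first and fall back to ISO 8859-1 when that fails. The sketch
below is only an illustration of that idea, not a Cocoa facility:
DecodeSubjectBytes is a hypothetical helper name, the code assumes ARC, and
it assumes the only two candidate encodings are UTF-8 and ISO 8859-1.

```objc
#import <Foundation/Foundation.h>

// Hypothetical helper (not part of Cocoa): decode raw header bytes as
// UTF-8, falling back to ISO 8859-1 when the bytes are not valid UTF-8.
static NSString *DecodeSubjectBytes(NSData *rawBytes)
{
    // initWithData:encoding: returns nil if the bytes cannot be
    // interpreted in the requested encoding.
    NSString *decoded = [[NSString alloc] initWithData:rawBytes
                                              encoding:NSUTF8StringEncoding];
    if (decoded == nil) {
        // Assumption: anything that is not valid UTF-8 is Latin-1.
        // ISO 8859-1 assigns a character to every byte value, so this
        // fallback is expected to produce a string for any input.
        decoded = [[NSString alloc] initWithData:rawBytes
                                        encoding:NSISOLatin1StringEncoding];
    }
    return decoded;
}
```

With this, the bytes 'c' 'a' 'f' 0xE9 and the bytes 'c' 'a' 'f' 0xC3 0xA9
should both come out as the same four-character string ending in eacute.
The heuristic can misfire on Latin-1 text that happens to form a valid
UTF-8 sequence, so an explicit charset label, where one exists, should take
precedence.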