Re: Is there any support in Cocoa for stupidly encoded UTF-8 string?


  • Subject: Re: Is there any support in Cocoa for stupidly encoded UTF-8 string?
  • From: John Stiles <email@hidden>
  • Date: Thu, 20 Jan 2005 11:41:51 -0800


On Jan 20, 2005, at 11:42 AM, Andrew Farmer wrote:

> On 20 Jan 2005, at 09:26, Stephane Sudre wrote:
>> In some e-mail subjects, people send text that is supposed to be UTF-8 encoded but is actually poorly encoded Unicode.
>>
>> For instance, instead of 0xC3A9 for eacute, you end up with the single byte 0xE9 (where it should be 0x00E9).
>>
>> When you use NSString initWithBytes:length:encoding: with the UTF-8 encoding as the parameter, you obtain nil. I understand this.
>>
>> Now, the question is: is there a method in Cocoa to deal with stupidly encoded UTF-8 strings?
>
> What you're looking at is ISO 8859-1 encoded text. Decode it as such and you'll be fine.
>
> I'm pretty sure that there *should* be some easy way to detect whether text in the subject is encoded with ISO 8859-1 or UTF-8. Look up the standards (if they exist).

The easiest detection method would be that NSString initWithBytes:length:encoding: returned nil :) :)
Seriously, that's a pretty good clue that the text wasn't valid UTF-8. At that point you get to guess its format, and Windows Latin-1 is as good a guess as any.
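
A minimal sketch of that approach, assuming ARC and a modern Foundation: try UTF-8 first, and if the result is nil, guess Windows Latin-1. The helper name `DecodeSubjectBytes` is hypothetical and not something from Cocoa or this thread.

```objc
#import <Foundation/Foundation.h>

// Hypothetical helper (not a Cocoa API): decode raw header bytes as UTF-8,
// and fall back to Windows Latin-1 (CP1252) if the bytes are not valid UTF-8.
// The result can still be nil if both attempts fail.
static NSString *DecodeSubjectBytes(const void *bytes, NSUInteger length)
{
    NSString *decoded = [[NSString alloc] initWithBytes:bytes
                                                 length:length
                                               encoding:NSUTF8StringEncoding];
    if (decoded == nil) {
        // Invalid UTF-8: guess a single-byte encoding instead.
        decoded = [[NSString alloc] initWithBytes:bytes
                                           length:length
                                         encoding:NSWindowsCP1252StringEncoding];
    }
    return decoded;
}
```

With this sketch, the four bytes `63 61 66 E9` fail the UTF-8 attempt and are decoded as "café" by the fallback, while the five bytes `63 61 66 C3 A9` decode as "café" on the first attempt. Swapping in `NSISOLatin1StringEncoding` for the fallback would follow the ISO 8859-1 suggestion quoted above instead. Note that the guess is only a heuristic: text in some other single-byte encoding will decode without error but produce the wrong characters.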



