Re: AppleSingle/AppleDouble & character encoding



On 17.10.2005 21:07, "John Stiles" <email@hidden> wrote:

>>   MacCyrillic springs next ;-)
> 
> Well, I thought of Shift-JIS because it's easy to reject text which is
> malformed--if it parses as Shift-JIS without error, it's probably Shift-JIS.
> You won't have a ton of false positives. MacCyrillic, OTOH, isn't a
> double-byte character system, so all combinations of characters ought to be
> valid (much like MacRoman). This makes it a lot harder to reject false
> positives; you need to use the statistical analysis method, which takes a lot
> more effort to develop.

  Agreed, almost: Shift-JIS may give poor results for ASCII-clean text,
because byte 0x5C, which is a backslash in ASCII, is a yen sign in Shift-JIS.
As for MacRoman vs. MacCyrillic, the encoding returned by
CFStringGetSystemEncoding() is often the best last-resort guess for legacy
data.

- WBR, Alexey Proskuryakov
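
The cascade discussed in this thread can be sketched roughly as follows. This is a minimal illustration in Python, not code from either poster: the function names looks_like_shift_jis and guess_legacy_encoding are hypothetical, the byte ranges are the commonly documented Shift-JIS lead and trail ranges (vendor extensions above 0xEF are deliberately rejected), and the fallback parameter only stands in for whatever CFStringGetSystemEncoding() would report on a real system.

```python
def looks_like_shift_jis(data: bytes) -> bool:
    """Structural check only: True if the bytes parse as well-formed Shift-JIS."""
    i, n = 0, len(data)
    while i < n:
        b = data[i]
        if b <= 0x7F or 0xA1 <= b <= 0xDF:
            # Single byte: JIS X 0201 Roman (0x5C is a yen sign here, not a
            # backslash) or half-width katakana.
            i += 1
        elif 0x81 <= b <= 0x9F or 0xE0 <= b <= 0xEF:
            # Lead byte of a double-byte character; a valid trail byte must follow.
            if i + 1 >= n:
                return False
            t = data[i + 1]
            if not (0x40 <= t <= 0x7E or 0x80 <= t <= 0xFC):
                return False
            i += 2
        else:
            # 0x80, 0xA0 and 0xF0-0xFF are treated as malformed in this sketch.
            return False
    return True


def guess_legacy_encoding(data: bytes, fallback: str = "mac_roman") -> str:
    """Hypothetical helper: returns a Python codec name as a best guess."""
    if all(b < 0x80 for b in data):
        # ASCII-clean text: do not label it Shift-JIS, so 0x5C stays a backslash.
        return "ascii"
    if looks_like_shift_jis(data):
        return "shift_jis"
    # Single-byte encodings accept every byte sequence, so nothing can be
    # rejected structurally; fall back to the assumed system encoding.
    return fallback
```

For example, guess_legacy_encoding(b"\x93\xfa\x96\x7b") reports "shift_jis", while the MacRoman bytes b"caf\x8e" end in a lead byte with no trail byte and so fall through to the fallback. Short single-byte strings can still pass the structural check by coincidence, which is why the statistical approach mentioned in the quoted text remains the more robust (and more expensive) option.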



References: 
 >Re: AppleSingle/AppleDouble & character encoding (From: John Stiles <email@hidden>)


