Mailing Lists: Apple Mailing Lists

Image of Mac OS face in stamp
 
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: AppleSingle/AppleDouble & character encoding



On Oct 17, 2005, at 6:43 AM, Alexey Proskuryakov wrote:

On 17.10.2005 17:35, "John Stiles" <email@hidden> wrote:


Well, there are several quite popular encodings you could try before
falling back to MacRoman. Shift-JIS springs to mind right away :)


  MacCyrillic springs next ;-)

Well, I thought of Shift-JIS because it's easy to reject text which is malformed--if it parses as Shift-JIS without error, it's probably Shift-JIS. You won't have a ton of false positives. MacCyrillic, OTOH, isn't a double-byte character system, so all combinations of characters ought to be valid (much like MacRoman). This makes it a lot harder to reject false positives; you need to use the statistical analysis method, which takes a lot more effort to develop.

 _______________________________________________
Do not post admin requests to the list. They will be ignored.
Carbon-dev mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/carbon-dev/email@hidden

This email sent to email@hidden

References: 
 >Re: AppleSingle/AppleDouble & character encoding (From: Alexey Proskuryakov <email@hidden>)



Visit the Apple Store online or at retail locations.
1-800-MY-APPLE

Contact Apple | Terms of Use | Privacy Policy

Copyright © 2007 Apple Inc. All rights reserved.