Thread-topic: AppleSingle/AppleDouble & character encoding
User-agent: Microsoft-Entourage/11.2.0.050811
On 17.10.2005 21:07, "John Stiles" <email@hidden> wrote:
>> MacCyrillic springs next ;-)
>
> Well, I thought of Shift-JIS because it's easy to reject text which is
> malformed--if it parses as Shift-JIS without error, it's probably Shift-JIS.
> You won't have a ton of false positives. MacCyrillic, OTOH, isn't a
> double-byte character system, so all combinations of characters ought to be
> valid (much like MacRoman). This makes it a lot harder to reject false
> positives; you need to use the statistical analysis method, which takes a lot
> more effort to develop.
Agreed (almost - ShiftJIS may give poor results for ASCII clean text,
because it doesn't encode backslash like ASCII does). As for MacRoman vs.
MacCyrillic, CFStringGetSystemEncoding() is often the best last try for
legacy data.
- WBR, Alexey Proskuryakov
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Carbon-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/carbon-dev/email@hidden
This email sent to email@hidden