Re: Bad Characters from Unicode
Re: Bad Characters from Unicode
- Subject: Re: Bad Characters from Unicode
- From: Emmanuel <email@hidden>
- Date: Sun, 30 Sep 2007 11:53:43 +0200
As a complement to your thorough paper, here are a few remarks.
- the original post on this topic was about Subjects in mails. RFC
1342 defines how to transmit "high"-ASCII characters in Subjects
despite the fact that RFC 822 specifies that the headers should be
pure [7 bit-]ASCII. However RFC 1342 doesn't tell us about Unicode.
At first sight, the ellipsis in the culprit Subject was pasted as
UTF-16, and that might simply be labelled a bug of Mail - at least a
bad taste option: UTF-8 would make weird results less often.
- Smile has the Unicode character and Unicode number commands
(working on whole strings, by the way), which support the 2
encodings, and Smile has Unicode windows, so you can visualize the
characters you're working on.
- Making a list of strings into one string when non-ASCII characters
are present in the list and/or in the text item delimiters is broken.
Proobably we can expect it be fixed in the next big cat. In the
meanwhile you can use Smile's "coerceAsUnicodeText" command, which
does that (and a bit more.) If the latest public version does not
include the command, ask me privately for the latest beta.
Emmanuel
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden