Re: Bad Characters from Unicode
Re: Bad Characters from Unicode
- Subject: Re: Bad Characters from Unicode
- From: Philip Aker <email@hidden>
- Date: Fri, 05 Oct 2007 04:56:28 -0700
On 2007-04-10, at 19:03, Sander Tekelenburg wrote:
At 18:46 -0600 UTC, on 2007-10-04, Doug McNutt wrote:
.U08 for UTF-8 format
Maybe I misunderstand, but just "filename.U08" wouldn't tell you
whether it's a text file, a HTML file, a CSS file, a PHP, perl, sql
file, etc.
That's essentially correct, but lots of apps scan the first few lines
of the file and make a reasonable guess.
#!/usr/bin/perl
<!DOCTYPE … !>
There's also a considerable movement towards metadata which when
finally done should be more powerful than file type/creator codes.
§
But to comment further on Doug's notions for arbitrary text files on
Mac I believe the best convention would be to re-use the AppleEvent
text codes:
.utf8 /* may have the utf8 pseudo BOM */
.utxt /* 16 bit unicode - reader deduces byte order */
.ut16 /* big-endian 16 bit unicode with optional byte-order-mark, or
little-endian 16 bit unicode with required byte-order-mark. */
Philip Aker
email@hidden
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden