• Open Menu Close Menu
  • Apple
  • Shopping Bag
  • Apple
  • Mac
  • iPad
  • iPhone
  • Watch
  • TV
  • Music
  • Support
  • Search apple.com
  • Shopping Bag

Lists

Open Menu Close Menu
  • Terms and Conditions
  • Lists hosted on this site
  • Email the Postmaster
  • Tips for posting to public mailing lists
Re: Bad Characters from Unicode
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Bad Characters from Unicode


  • Subject: Re: Bad Characters from Unicode
  • From: Philip Aker <email@hidden>
  • Date: Fri, 05 Oct 2007 04:56:28 -0700

On 2007-04-10, at 19:03, Sander Tekelenburg wrote:

At 18:46 -0600 UTC, on 2007-10-04, Doug McNutt wrote:

.U08 for UTF-8 format

Maybe I misunderstand, but just "filename.U08" wouldn't tell you whether it's a text file, a HTML file, a CSS file, a PHP, perl, sql file, etc.

That's essentially correct, but lots of apps scan the first few lines of the file and make a reasonable guess.


	#!/usr/bin/perl
	<!DOCTYPE … !>

There's also a considerable movement towards metadata which when finally done should be more powerful than file type/creator codes.

§

But to comment further on Doug's notions for arbitrary text files on Mac I believe the best convention would be to re-use the AppleEvent text codes:

.utf8 /* may have the utf8 pseudo BOM */
.utxt /* 16 bit unicode - reader deduces byte order */
.ut16 /* big-endian 16 bit unicode with optional byte-order-mark, or little-endian 16 bit unicode with required byte-order-mark. */



Philip Aker email@hidden



_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users

This email sent to email@hidden
  • Follow-Ups:
    • Re: Bad Characters from Unicode
      • From: Sander Tekelenburg <email@hidden>
References: 
 >Re: Bad Characters from Unicode (From: has <email@hidden>)
 >Re: [OT] Bad Characters from Unicode (From: "Mark J. Reed" <email@hidden>)
 >Re: Bad Characters from Unicode (From: Sander Tekelenburg <email@hidden>)
 >Re: Bad Characters from Unicode (From: Doug McNutt <email@hidden>)
 >Re: Bad Characters from Unicode (From: Sander Tekelenburg <email@hidden>)

  • Prev by Date: Re: Bad Characters from Unicode
  • Next by Date: Re: Bad Characters from UnicodeŠ
  • Previous by thread: Re: Bad Characters from Unicode
  • Next by thread: Re: Bad Characters from Unicode
  • Index(es):
    • Date
    • Thread