• Open Menu Close Menu
  • Apple
  • Shopping Bag
  • Apple
  • Mac
  • iPad
  • iPhone
  • Watch
  • TV
  • Music
  • Support
  • Search apple.com
  • Shopping Bag

Lists

Open Menu Close Menu
  • Terms and Conditions
  • Lists hosted on this site
  • Email the Postmaster
  • Tips for posting to public mailing lists
Re: en-dash and em-dash
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: en-dash and em-dash


  • Subject: Re: en-dash and em-dash
  • From: Mitchell L Model <email@hidden>
  • Date: Tue, 21 Jun 2016 04:22:57 -0400

Further information. : it depends on which editor you view it through. It’s actually an HTML file, so Safari of course chokes. BBEdit and Emacs see it correctly, but TextEdit sees them with their problems:
PEG.js – Parser Generator for _javascript_
and
Examples overview — PyObjC — the Python ⟷ Objective-C bridge
So the problem is not in the output, it is in the editor’s (or Safari’s, of course, because the Unicode characters wouldn’t be encoded) interpretation of the Unicode characters that is at issue. I thought that’s what the file really contained because I looked at it through TextEdit and believed what it showed.

Is it really possible that TextEdit gets confused by Unicode characters? I thought everything (except browsers) could handle Unicode characters.

The above is with “write theText to theFile as «class utf8»”. Without the utf8 conversion, what comes out is viewed by Emacs as
PEG.js Ð Parser Generator for _javascript_
and
Examples overview Ñ PyObjC Ñ the Python ? Objective-C bridge
but by BBEdit as 
js – Parser Generator for _javascript_
and
Examples overview — PyObjC — the Python ? Objective-C bridge

and by TextEdit as
PEG.js Ð Parser Generator for _javascript_
and
Examples overview Ñ PyObjC Ñ the Python ? Objective-C bridge


so what I have learned that I didn’t know is that have to add “as «class utf8»” when writing the file.

Except that although without the utf8 conversion, TextEdit doesn’t understand unicode characters above 256, BBEdit gets 
Examples overview — PyObjC — the Python ⟷ Objective-C bridge
right for the dashes but not for the double arrow, and Emacs gets confused in the same way as TextEdit. (The dashes are high-end
Unicode dashes: BBEdit is somehow seeing #8211 and #8212 as en-dash and em-dash even with a file written by AppleScript
without the UTF8 conversions.

I am getting more, not slightly, confused.




 _______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users

This email sent to email@hidden

  • Follow-Ups:
    • Re: en-dash and em-dash
      • From: Christopher Stone <email@hidden>
    • Re: en-dash and em-dash
      • From: Shane Stanley <email@hidden>
  • Prev by Date: Re: What is Best Practice for Reading Files of Unknown Encoding?
  • Next by Date: Re: en-dash and em-dash
  • Previous by thread: Re: en-dash and em-dash
  • Next by thread: Re: en-dash and em-dash
  • Index(es):
    • Date
    • Thread