seeking webguru advice on html character encoding
seeking webguru advice on html character encoding
- Subject: seeking webguru advice on html character encoding
- From: has <email@hidden>
- Date: Thu, 20 Dec 2001 15:36:18 +0000
Hi all,
I'm working on a spanking new library for encoding ascii characters as html
character entities. I'm wondering if there's anyone who knows about
character sets, fonts and html character entities for discussing how to
handle some of the more awkward stuff. I'm avoiding some of the obvious
pratfalls (none of this –/— for en/em dashes crap as in
DreamWeaver, for example) but I'm lost elsewhere:
ASCII character 219 - in older Mac fonts this is used for the general
currency symbol; in newer ones it appears as the euro
Stuff I can't find codes for in html4:
ASCII character 240 - apple
ASCII character 215 - diamond
ASCII character 197 - wavy thing (what is that anyway? approx equal?)
ASCII character 222 - fi ligature
ASCII character 223 - fl ligature
Characters that appear in Mac fonts but not in Windows fonts - I bet there
are some, but what are they? Will a Windows-based browser make mince of
them, or will it manage to render them ok? (Ditto Unix.) e.g. I know
IE5/Mac handles things like Þ and Ð which aren't on my machine -
how does it manage to do that?
What about <cough> "foreign" character sets - I'm woefully ignorant of any
character set that doesn't speak in the Queen's English. How should I
support them (since I don't want this thing to be English-only, if
possible)?
This library will be for general consumption when done (via AppleMods), so
I want to make sure it's a damn good un. Anyone willing to provide me a
brain to pick can mail me off-list and I'll be happy to show them how it
currently stands and where I'm in need of advice.
Thanks in advance,
has