Mailing Lists: Apple Mailing Lists

Image of Mac OS face in stamp
 
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Internationalization help



Mike Dougherty wrote:

Sorry, my bad. I'm used to using UTF-8 on things so I wrote that, but meant to write unicode. I've run native2ascii on the file and ended up with the following unicode:

button.cancel=\u201e\u00c7\u2260\u201e\u00c9\u00a3\u201e \u00c9 \u2265\u201e\u00c7\u00aa\u201e\u00c9\u00b4\u201e \u00c5\u00f4\u201e \u00c7\u00e3

Referring to a Unicode reference, \u201e is a double low quotation mark, and \u00c7 is capital-C with cedilla. Since those are the first two things I see in the pushbutton's image, I think it's doing exactly what you told it to do. I think the problem occurs at some earlier point in the production pathway.


Exactly what was the input file to native2ascii that caused it to generate the \u201e\u00c7 etc. sequence? Are you certain it was encoded as UTF8? Using what program?

If someone sent you that file of Japanese text, you should confirm that what they sent you is what you received, i.e. that no data was mangled or distorted in transit, or at any other step along the way. See the man-page for the 'md5' command.

You should also use another tool to independently verify the actual binary data in the file of Japanese text. See the man-page for the 'hexdump' command, for example.

Also, exactly what was the native2ascii command-line you used for the conversion? If you omit the -encoding option, it will use the default from the "file.encoding" property, and on US-English configurations under Mac OS X, that's MacRoman, not UTF8. Frankly, it looks an awful lot like utf8 bytes were interpreted as MacRoman characters.

So to summarize, I think you need to carefully examine every step you took in the production and processing of the file that's supposed to contain the Japanese text. A little detective work should nail down the point where things are going wrong, even if it may not be clear why they're going wrong.

Also, you can look up Unicode chars and code-points at www.unicode.org or at www.fileformat.info.
Or just google:
unicode 201e
unicode 00c7
unicode 2260



-- GG

_______________________________________________
Do not post admin requests to the list. They will be ignored.
Java-dev mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/java-dev/email@hidden

This email sent to email@hidden


Visit the Apple Store online or at retail locations.
1-800-MY-APPLE

Contact Apple | Terms of Use | Privacy Policy

Copyright © 2007 Apple Inc. All rights reserved.