Re: NSString and Unicode and Japanese character encodings
Re: NSString and Unicode and Japanese character encodings
- Subject: Re: NSString and Unicode and Japanese character encodings
- From: Satoshi Matsumoto <email@hidden>
- Date: Fri, 26 Aug 2005 08:07:51 +0900
on 05.8.26 4:33 AM, Izidor Jerebic at email@hidden wrote:
> One of the problems is that Japanese Shift-JIS encoding does not map
> one-to-one to Unicode. There are Shift-JIS codes that do not survive
> round-trip to Unicode and back to Shift-JIS - you get back a
> different one than the starting one.
At first, in my personal opinion as a Japanese text editor developer, it may
not be a big problem for Japanese unicode editors that handle Shift-JIS.
But you should note that there are four types (dialects) of Shift-JIS
encodings. When you are going to convert to/from Shift-JIS. you need to
specify correctly which type of Shift-JIS encodings.
1 .Japanese (Shift JIS) contains the half-width alphanumeric characters,
symbols, and katakana, along with JIS X0208 (JIS 1&2) characters.
2. Japanese (Mac OS) contains all of Japanese (Shift JIS) along with some
Mac-specific symbols.
3. Japanese (Windows, DOS) contains all of Japanese (Shift JIS) along with
some Windows-specific symbols.
4. Japanese (Shift JIS X0213) contains all of Japanese (Shift JIS) along
with kanji specified in JIS 3&4 for a total of 13,000 characters total.
I think the problem are derived from the confusion among above types.
Japanese text created in Classic OS is encoded in Japanese (Mac OS), so this
may be the safest option among the various Shift JIS encodings.
-----------------------------------------------------
Satoshi Matsumoto <email@hidden>
816-5 Odake, Odawara, Kanagawa, Japan 256-0802
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Cocoa-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden