Re: UTF8 question.
Re: UTF8 question.
- Subject: Re: UTF8 question.
- From: Chris Ridd <email@hidden>
- Date: Wed, 24 Aug 2005 10:52:57 +0100
On 24/8/05 9:11, Manish shrikant shet <email@hidden>
wrote:
>
> Hi List,
> I am a Cocoa developer involved in writing application interoperable between
> Mac and Windows. While localizing the application in Japanese, I am observing
> some interoperable issues with some Japanese composite characters. Some of
> the these characters are resolving to different UTF8 characters on Mac and
> Windows. On Mac I am using the UTF8String method on a NSString object to get
> the UTF8 string. On Windows I am using the MultiByteToWideChar() function to
> convert a string in the local code page into Unicode. I am using the
> MB_COMPOSITE option here. The uncicode string is then being converted into
> UTF8 string by using the function WideCharToMultiByte(). The UTF8 characters
> obtained by following this process seem to differ on Mac and Windows.
> As an example, I am providing the wide char and UTF8 strings I am obtaining on
> both Mac and Windows for a couple of Japanese Characters. Any pointers on how
> to resolve this problem will be highly appreciated.
> 1) ?
> wide char values
> Win: C0 30 (little endian)
OK, you're encoding U+30C0 (KATAKANA LETTER DA)
> Mac: 30 BF 30 99 (big endian)
This becomes KATAKANA LETTER TA + COMBINING KATAKANA-HIRAGANA VOICED SOUND
MARK.
In other words, the conversion is splitting the original character into two
decomposed (is that the right term?) pieces.
Apologies if any of the terminology in the above's wrong, but it might give
you some insights.
Cheers,
Chris
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Cocoa-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden
References: | |
| >UTF8 question. (From: "Manish shrikant shet" <email@hidden>) |