Lists

Terms and Conditions
Lists hosted on this site
Email the Postmaster
Tips for posting to public mailing lists

Re: UTF8 question.

[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: UTF8 question.

Subject: Re: UTF8 question.
From: Chris Ridd <email@hidden>
Date: Wed, 24 Aug 2005 10:52:57 +0100

On 24/8/05 9:11, Manish  shrikant shet <email@hidden>
wrote:

>
> Hi List,
> I am a Cocoa developer involved in writing application interoperable between
> Mac and Windows.  While localizing the application in Japanese, I am observing
> some interoperable issues with some Japanese composite characters.  Some of
> the these characters are resolving to different UTF8 characters on Mac and
> Windows.  On Mac I am using the UTF8String method on a NSString object to get
> the UTF8 string.  On Windows I am using the MultiByteToWideChar() function to
> convert a string in the local code page into Unicode.  I am using the
> MB_COMPOSITE option here.   The uncicode string is then being converted into
> UTF8 string by using the function WideCharToMultiByte().  The UTF8 characters
> obtained by following this process seem to differ on Mac and Windows.
> As an example, I am providing the wide char and UTF8 strings I am obtaining on
> both Mac and Windows for a couple of Japanese Characters.  Any pointers on how
> to resolve this problem will be highly appreciated.
> 1) ?
> wide char values
> Win: C0 30 (little endian)

OK, you're encoding U+30C0 (KATAKANA LETTER DA)

> Mac: 30 BF 30 99 (big endian)

This becomes KATAKANA LETTER TA + COMBINING KATAKANA-HIRAGANA VOICED SOUND
MARK.

In other words, the conversion is splitting the original character into two
decomposed (is that the right term?) pieces.

Apologies if any of the terminology in the above's wrong, but it might give
you some insights.

Cheers,

Chris

 _______________________________________________
Do not post admin requests to the list. They will be ignored.
Cocoa-dev mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:

This email sent to email@hidden

Follow-Ups:
- Re: UTF8 question.
  - From: Andrew Thompson <email@hidden>

References:
	>UTF8 question. (From: "Manish shrikant shet" <email@hidden>)

Prev by Date: Superscript causing crash
Next by Date: Serious NSTextStorage subclassing troubles
Previous by thread: Re: UTF8 question.
Next by thread: Re: UTF8 question.
Index(es):
- Date
- Thread