• Open Menu Close Menu
  • Apple
  • Shopping Bag
  • Apple
  • Mac
  • iPad
  • iPhone
  • Watch
  • TV
  • Music
  • Support
  • Search apple.com
  • Shopping Bag

Lists

Open Menu Close Menu
  • Terms and Conditions
  • Lists hosted on this site
  • Email the Postmaster
  • Tips for posting to public mailing lists
Re: UTF8 question.
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: UTF8 question.


  • Subject: Re: UTF8 question.
  • From: Chris Ridd <email@hidden>
  • Date: Wed, 24 Aug 2005 10:52:57 +0100

On 24/8/05 9:11, Manish  shrikant shet <email@hidden>
wrote:

>
> Hi List,
> I am a Cocoa developer involved in writing application interoperable between
> Mac and Windows.  While localizing the application in Japanese, I am observing
> some interoperable issues with some Japanese composite characters.  Some of
> the these characters are resolving to different UTF8 characters on Mac and
> Windows.  On Mac I am using the UTF8String method on a NSString object to get
> the UTF8 string.  On Windows I am using the MultiByteToWideChar() function to
> convert a string in the local code page into Unicode.  I am using the
> MB_COMPOSITE option here.   The uncicode string is then being converted into
> UTF8 string by using the function WideCharToMultiByte().  The UTF8 characters
> obtained by following this process seem to differ on Mac and Windows.
> As an example, I am providing the wide char and UTF8 strings I am obtaining on
> both Mac and Windows for a couple of Japanese Characters.  Any pointers on how
> to resolve this problem will be highly appreciated.
> 1) ?
> wide char values
> Win: C0 30 (little endian)

OK, you're encoding U+30C0 (KATAKANA LETTER DA)

> Mac: 30 BF 30 99 (big endian)

This becomes KATAKANA LETTER TA + COMBINING KATAKANA-HIRAGANA VOICED SOUND
MARK.

In other words, the conversion is splitting the original character into two
decomposed (is that the right term?) pieces.

Apologies if any of the terminology in the above's wrong, but it might give
you some insights.

Cheers,

Chris


 _______________________________________________
Do not post admin requests to the list. They will be ignored.
Cocoa-dev mailing list      (email@hidden)
Help/Unsubscribe/Update your Subscription:

This email sent to email@hidden

  • Follow-Ups:
    • Re: UTF8 question.
      • From: Andrew Thompson <email@hidden>
References: 
 >UTF8 question. (From: "Manish shrikant shet" <email@hidden>)

  • Prev by Date: Superscript causing crash
  • Next by Date: Serious NSTextStorage subclassing troubles
  • Previous by thread: Re: UTF8 question.
  • Next by thread: Re: UTF8 question.
  • Index(es):
    • Date
    • Thread