Mac OS X stores file names as canonically decomposed character
sequences (Unicode Normalization Form D).
In a decomposed characters sequence, the character "a umlaut" รค is
stored as two unicode code points.
Java usually works with composed Unicode character sequences which
stores "a umlaut" in a single code point.
With J2SE6 you can convert between normalization forms using class
java.text.Normalizer.
To do the normalization with earlier Java versions, you can use the
Normalizer classes available from the Unicode consortium: http://www.unicode.org/reports/tr15/Normalizer.html
With best regards,
Werner
On 17.09.2006, at 16:40, Sciss wrote:
i have a problem with pathname strings from File objects. they
appear to be in a weird encoding which results in me being unable
to transcode them to other charsets and transfer them using
OpenSoundControl. the problem arises with characters not in the
lower 7 bit of standard ascii, for example umlauts. like the
following:
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Java-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
http://lists.apple.com/mailman/options/java-dev/email@hidden