Re: splitting CJK text into "words"
Re: splitting CJK text into "words"
- Subject: Re: splitting CJK text into "words"
- From: Martin Wierschin <email@hidden>
- Date: Wed, 26 Sep 2012 15:05:02 -0700
>> I'm trying to split CJK text using the kind of word boundaries detected by -[NSAttributedString doubleClickAtIndex:]. That method does the job correctly, but only if the system preferences have the Word Break mode set to Japanese. I need to ensure this kind of word splitting independent of the user's system preferences.
>
> Does -[NSString enumerateSubstringsInRange:options:usingBlock:] with NSStringEnumerationByWords as the option work any better?
Thanks for the suggestion Ken, but no, it doesn't produce the results I'm looking for. It also isn't sensitive to the system's Word Break mode, whether I pass NSStringEnumerationLocalized or not. So basically enumerateSubstringsInRange:etc: seems to behave as CFStringTokenizer, but without offering the option of specifying a locale.
~Martin
_______________________________________________
Cocoa-dev mailing list (email@hidden)
Please do not post admin requests or moderator comments to the list.
Contact the moderators at cocoa-dev-admins(at)lists.apple.com
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden