Re: RegEx libraries & unicode support
- Subject: Re: RegEx libraries & unicode support
- From: Mike Ferris <email@hidden>
- Date: Fri, 14 May 2004 07:43:42 -0700
I do not know the full Unicode-related details of the MOKit regex
package, but it uses unichar as its base character type and
reportedly has decent Unicode support. Unicode regular expression
handling is a deep topic, and I do not know how sophisticated MOKit's
implementation is, but it should handle basic Unicode matching
well. MOKit (and the underlying regex package) probably does not
canonically decompose the candidate strings itself, so you may
want to use NSString's -decomposedStringWithCanonicalMapping to
prepare candidate strings before matching. (Of course, be careful
about applying this to pattern strings, because it can change their
meaning. It is better to write your pattern strings in decomposed
form to begin with, so you know what to expect.)
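
To make the decomposition point concrete, here is a minimal sketch in Python rather than Objective-C. It is not MOKit code: Python's `re` module stands in for the regex engine, and `unicodedata.normalize("NFD", ...)` stands in for -decomposedStringWithCanonicalMapping (both produce canonical decomposition). The helper name `decompose` and the sample strings are made up for illustration.

```python
import re
import unicodedata


def decompose(text):
    """Return text in canonical decomposed form (NFD).

    Stand-in for NSString's -decomposedStringWithCanonicalMapping.
    """
    return unicodedata.normalize("NFD", text)


# The same visible word, encoded two ways.
precomposed = "caf\u00e9"    # e-acute as the single code point U+00E9
decomposed = "cafe\u0301"    # "e" followed by combining acute U+0301

# A pattern written in decomposed form from the start.
pattern = re.compile("cafe\u0301")


def matches(candidate):
    """Match against the candidate after decomposing it."""
    return pattern.search(decompose(candidate)) is not None


# Why decomposing a *pattern* is risky: a one-character class becomes a
# two-character class, so it now matches a bare "e" as well.
class_pattern = "[\u00e9]"
decomposed_class_pattern = decompose(class_pattern)  # "[e\u0301]"

if __name__ == "__main__":
    print(pattern.search(precomposed) is not None)  # raw candidate
    print(matches(precomposed))                     # decomposed candidate
    print(re.fullmatch(class_pattern, "e") is not None)
    print(re.fullmatch(decomposed_class_pattern, "e") is not None)
```

Without the decomposition step the precomposed candidate does not match the decomposed pattern, since this engine compares code points and does not normalize on its own. The character-class case shows how blindly decomposing a pattern can change what it matches.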
Mike Ferris