• Open Menu Close Menu
  • Apple
  • Shopping Bag
  • Apple
  • Mac
  • iPad
  • iPhone
  • Watch
  • TV
  • Music
  • Support
  • Search apple.com
  • Shopping Bag

Lists

Open Menu Close Menu
  • Terms and Conditions
  • Lists hosted on this site
  • Email the Postmaster
  • Tips for posting to public mailing lists
Re: RegEx libraries & unicode support
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: RegEx libraries & unicode support


  • Subject: Re: RegEx libraries & unicode support
  • From: Mike Ferris <email@hidden>
  • Date: Fri, 14 May 2004 07:43:42 -0700

I do not know the full Unicode-related details about the MOKit regex package, but it does use unichar as its base character type and it supposedly has pretty decent Unicode support. Unicode regular expression handling is a deep topic and I do not know how fancy MOKit's implementation is, but it should handle basic Unicode stuff pretty well. MOKit (and the underlying regex package) probably do not do canonical decomposition of the candidate strings themselves, so you may want to use NSString's -decomposedStringWithCanonicalMapping to prep candidate strings. (Of course, be careful using this on pattern strings because it can change their meaning. Better to compose your pattern strings decomposed to begin with so you know what to expect...)

Mike Ferris
_______________________________________________
cocoa-dev mailing list | email@hidden
Help/Unsubscribe/Archives: http://www.lists.apple.com/mailman/listinfo/cocoa-dev
Do not post admin requests to the list. They will be ignored.


References: 
 >RegEx libraries & unicode support (From: Allan Odgaard <email@hidden>)
 >Re: RegEx libraries & unicode support (From: Nicholas Riley <email@hidden>)
 >Re: RegEx libraries & unicode support (From: Allan Odgaard <email@hidden>)
 >Re: RegEx libraries & unicode support (From: Jim Correia <email@hidden>)

  • Prev by Date: Re: RegEx libraries & unicode support
  • Next by Date: Re: Screen trash with some fonts
  • Previous by thread: Re: RegEx libraries & unicode support
  • Next by thread: Re: RegEx libraries & unicode support
  • Index(es):
    • Date
    • Thread