Re: finding substring
Re: finding substring
- Subject: Re: finding substring
- From: Daniel Jalkut <email@hidden>
- Date: Sat, 1 Apr 2006 00:02:41 -0500
On Mar 31, 2006, at 10:26 PM, Chuck Soper wrote:
This answer makes sense from a programmer's perspective, but from a
user's perspective it might be confusing. For example, if someone
searches for "San Jose", the results include San Jose, California
but not San José, Costa Rica.
My English atlas shows San Jose, California and San José, Costa
Rica. I suspect that most users think of the two city names as
being the same, but they're not.
Do you think that striping diacritical marks makes sense when
comparing some geographical names/languages, but not all, such as
localized Japanese names? If so, is there a way to make a distinction?
It sounds to me like the problem you are dealing with is more general
than just diacritical markings. The identification of geographical
location names seems especially dependent on localization. A good
example was in the news lately in the US. The Winter Olympics were
held in Torino, Italy. But many US newspapers described the event as
being held in Turin. No amount of diacritical exemption will help the
user who searches on Turin find the name if it's indexed as Torino.
How you deal with this largely depends on where you're getting your
data from and what localizations you're targeting. If your
geographical names are all "basically English," and your target
localization is English, then maybe it makes sense to just do a
diacrtiical insensitive search. But if your geographical names come
from a more complicated source like the real world, you might have to
resign to identifying places by a number of search terms. So "San
Jose" matches "San José", "Turin" matches "Torino", etc.
It seems like this is a special situation where planning for multiple
matching keys might be the right choice from the get-go.
Daniel
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Cocoa-dev mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden