Search Kit on 10.4 and searching in Japanese
- Subject: Search Kit on 10.4 and searching in Japanese
- From: "Christian Moen" <email@hidden>
- Date: Wed, 11 Jul 2007 05:53:33 +0900
Dear cocoa-dev,
How does Search Kit in 10.4 work in non-Western languages such as Japanese,
where there's no blank character which acts as a token separator?
The documentation for SKIndexAddDocumentWithText() says "Search Kit will
index any size text string that you give it, up to its 4 GB index file size
limit", which I take to mean that no normalization is being done on the
text. However, the documentation for SKSearchCreate() explains that Search
Kit applies Unicode Normalization Form KC (http://unicode.org/reports/tr15). I
assume this means that the same NFKC normalization also happens prior to
indexing. Is this correct?
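To make the normalization question concrete, here is a minimal sketch of what NFKC does to Japanese text. It uses Python's standard unicodedata module rather than Search Kit, so it only illustrates the normalization form itself; whether Search Kit applies it at indexing time is exactly what I am asking. The helper name nfkc() is made up for this example.

```python
import unicodedata


def nfkc(text: str) -> str:
    """Return text in Unicode Normalization Form KC (hypothetical helper)."""
    return unicodedata.normalize("NFKC", text)


# Half-width katakana are folded to their full-width equivalents.
half_width = "ｽｼ"
# Full-width Latin letters and digits are folded to plain ASCII.
full_width_latin = "ＡＢＣ１２３"
# Ordinary kanji and hiragana are already in normalized form.
sentence = "明日寿司を食べたい"

for sample in (half_width, full_width_latin, sentence):
    print(repr(sample), "->", repr(nfkc(sample)))
```

If indexing and searching did not normalize the same way, a query typed in half-width katakana could miss text stored in full-width katakana, which is why the answer matters to me.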
If I index the sentence "明日寿司を食べたい" ("I would like to eat sushi tomorrow")
using SKIndexAddDocumentWithText(), will I get a match if I search for "寿司"
("sushi") as a substring search ("*寿司*") using SKSearchCreate()?
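The concern behind this question can be sketched in a few lines of plain Python. This is not Search Kit code and makes no claim about how Search Kit tokenizes Japanese; it only assumes a naive tokenizer that splits on whitespace, and the function names are made up for this example.

```python
def whitespace_tokens(text: str) -> list[str]:
    """Split text on whitespace, as a naive Western-style tokenizer would."""
    return text.split()


def matches_whole_term(text: str, query: str) -> bool:
    """True if the query equals one of the whitespace-separated tokens."""
    return query in whitespace_tokens(text)


def matches_substring(text: str, query: str) -> bool:
    """True if the query occurs inside any token, like a "*query*" search."""
    return any(query in token for token in whitespace_tokens(text))


sentence = "明日寿司を食べたい"
query = "寿司"

# The sentence has no whitespace, so it comes back as a single token.
print(whitespace_tokens(sentence))
print("whole-term match:", matches_whole_term(sentence, query))
print("substring match:", matches_substring(sentence, query))
```

Under that assumption the whole sentence is one term, so only the substring form of the query could match. What I would like to know is whether Search Kit behaves this way or does something smarter for Japanese.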
Any up-front feedback on this would be much appreciated before I start
experimenting.
Many thanks,
Christian
_______________________________________________
Cocoa-dev mailing list (email@hidden)