Re: Indexing text, pdf, .doc
Re: Indexing text, pdf, .doc
- Subject: Re: Indexing text, pdf, .doc
- From: Michael Johnston <email@hidden>
- Date: Sat, 2 Nov 2002 01:17:53 +0100
Not sure about an embeddable library, but here are a couple good
standalone indexing/search engines:
http://htdig.org/ is open source, and is perl
http://alkaline.vestris.com is very fast and inexpensive, but not open
source
PDF and Word should be converted to text or html and the resulting file
indexed. Everyone uses 
http://www.foolabs.com/xpdf/ for pdf; htdig has
a doc2html script for word.
Michael Johnston
On Friday, November 1, 2002, at 06:12 PM, Steve Ivy wrote:
I'm doing some research for an app and one of the things I need is the
ability to index (and subsequently search, obviously) a store of
content in text documents, pdf files, and Word documents. It can be
Java or Obj-C. I prefer not to use straight C simply due to my own
limitations in the language. I'm wondering if anyone has knowledge of
anything like this. What is Apple using in Sherlock/iTunes/etc?
Whatever became of AIAT?
TIA,
--Steve
_______________________________________________
cocoa-dev mailing list | email@hidden
Help/Unsubscribe/Archives:
http://www.lists.apple.com/mailman/listinfo/cocoa-dev
Do not post admin requests to the list. They will be ignored.
_______________________________________________
cocoa-dev mailing list | email@hidden
Help/Unsubscribe/Archives: 
http://www.lists.apple.com/mailman/listinfo/cocoa-dev
Do not post admin requests to the list. They will be ignored.