Re: Indexing text, pdf, .doc

  • Subject: Re: Indexing text, pdf, .doc
  • From: Michael Johnston <email@hidden>
  • Date: Sat, 2 Nov 2002 01:17:53 +0100

Not sure about an embeddable library, but here are a couple of good standalone indexing/search engines:

http://htdig.org/ (ht://Dig) is open source; the engine is written in C++, with some helper scripts in Perl
http://alkaline.vestris.com is very fast and inexpensive, but not open source

PDF and Word files should be converted to text or HTML first, and the resulting file indexed. For PDF, the usual choice is xpdf (http://www.foolabs.com/xpdf/), which includes the pdftotext command-line converter; htdig has a doc2html script for Word.
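
Since Java was one of the acceptable languages, here is a rough sketch of the convert-then-index approach. It is only an illustration under stated assumptions: TextIndex and its methods are made-up names, not part of htdig, Alkaline, or xpdf, and it assumes xpdf's pdftotext is installed and on the PATH. A real engine would add ranking, stemming, and on-disk storage.

```java
import java.io.IOException;
import java.util.Collections;
import java.util.HashMap;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical helper class: a tiny in-memory inverted index.
class TextIndex {
    // term -> names of the documents that contain it (sorted for stable output)
    private final Map<String, Set<String>> postings = new HashMap<>();

    // Index text that has already been extracted from a document.
    void add(String docName, String text) {
        // Lowercase, then split on anything that is not an ASCII letter or digit.
        for (String term : text.toLowerCase(Locale.ROOT).split("[^a-z0-9]+")) {
            if (term.isEmpty()) continue; // split() can yield a leading empty string
            postings.computeIfAbsent(term, k -> new TreeSet<>()).add(docName);
        }
    }

    // Return the names of the documents containing a single term.
    Set<String> search(String term) {
        return postings.getOrDefault(term.toLowerCase(Locale.ROOT),
                                     Collections.<String>emptySet());
    }

    // Convert a PDF to plain text by running: pdftotext <pdfPath> <txtPath>
    // Assumes the pdftotext binary from xpdf is on the PATH.
    static void extractPdf(String pdfPath, String txtPath)
            throws IOException, InterruptedException {
        Process p = new ProcessBuilder("pdftotext", pdfPath, txtPath)
                .inheritIO()
                .start();
        if (p.waitFor() != 0) {
            throw new IOException("pdftotext failed for " + pdfPath);
        }
    }
}
```

The idea would be to call extractPdf on each PDF, read the resulting text file, pass its contents to add, and then answer queries with search. Word documents would go through the same add step once something like doc2html has produced text or HTML for them.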

Michael Johnston


On Friday, November 1, 2002, at 06:12 PM, Steve Ivy wrote:

I'm doing some research for an app and one of the things I need is the ability to index (and subsequently search, obviously) a store of content in text documents, pdf files, and Word documents. It can be Java or Obj-C. I prefer not to use straight C simply due to my own limitations in the language. I'm wondering if anyone has knowledge of anything like this. What is Apple using in Sherlock/iTunes/etc? Whatever became of AIAT?

TIA,

--Steve
_______________________________________________
cocoa-dev mailing list | email@hidden
Help/Unsubscribe/Archives: http://www.lists.apple.com/mailman/listinfo/cocoa-dev
Do not post admin requests to the list. They will be ignored.