• Open Menu Close Menu
  • Apple
  • Shopping Bag
  • Apple
  • Mac
  • iPad
  • iPhone
  • Watch
  • TV
  • Music
  • Support
  • Search apple.com
  • Shopping Bag

Lists

Open Menu Close Menu
  • Terms and Conditions
  • Lists hosted on this site
  • Email the Postmaster
  • Tips for posting to public mailing lists
Re: Indexing text, pdf, .doc
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]

Re: Indexing text, pdf, .doc


  • Subject: Re: Indexing text, pdf, .doc
  • From: Marco Scheurer <email@hidden>
  • Date: Sat, 2 Nov 2002 02:06:35 +0100

On Saturday, November 2, 2002, at 01:17 AM, Michael Johnston wrote:

Not sure about an embeddable library, but here are a couple good standalone indexing/search engines:

http://htdig.org/ is open source, and is perl

ht//dig is quite nice, but it's C++, and there are some issues to make it work on Mac OS X with accented text, because of incompatibilities between the "locale" functions of various Unix:

On 10.2, a small change in the source code is needed to make it work (changing a LC_ALL to LC_CTYPE) in Configuration::AddParsed(). This change may have been included in the official distribution, I'm not sure.

On 10.1, setlocale() seems to be broken beyond repair, and an ugly hack to remove accents before indexing was needed.

Marco Scheurer
Sen:te, Lausanne, Switzerland http://www.sente.ch
_______________________________________________
cocoa-dev mailing list | email@hidden
Help/Unsubscribe/Archives: http://www.lists.apple.com/mailman/listinfo/cocoa-dev
Do not post admin requests to the list. They will be ignored.

  • Follow-Ups:
    • Re: Indexing text, pdf, .doc
      • From: Scott Anguish <email@hidden>
    • Re: Indexing text, pdf, .doc
      • From: Scott Anguish <email@hidden>
References: 
 >Re: Indexing text, pdf, .doc (From: Michael Johnston <email@hidden>)

  • Prev by Date: Re: class method list
  • Next by Date: Re: cocoa-dev digest, Vol 2 #1467 - 15 msgs
  • Previous by thread: Re: Indexing text, pdf, .doc
  • Next by thread: Re: Indexing text, pdf, .doc
  • Index(es):
    • Date
    • Thread