Re: AUGD: MUG Newsletter Indexing
- Subject: Re: AUGD: MUG Newsletter Indexing
- From: Jo Booth <email@hidden>
- Date: Tue, 07 Feb 2006 08:41:32 +1300
Jim,
I've used ht://Dig from http://www.htdig.org in the past to index
PDFs. It was running on a local Linux box, but
<http://forums.macosxhints.com/showthread.php?t=1087> suggests ht://Dig
is available in the iTools package and may run on your web server.
You can use a PHP interface if needed
<http://www.computerengineering.ca/a_way_to_use_htdig_with_php/>, and
<http://www.htdig.org/FAQ.html#q4.9> explains how to use Xpdf to
create the PDF indexes.
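To give a feel for what "indexing PDFs via Xpdf" involves, here is a rough standalone sketch in Python. It is not how ht://Dig does it internally. It assumes Xpdf's pdftotext command is installed and on the PATH, and the function names (extract_text, build_index) and the "1988-01" style issue labels are made up for illustration.

```python
import re
import subprocess
from collections import defaultdict


def extract_text(pdf_path):
    """Return the text of one PDF using Xpdf's pdftotext.

    Passing "-" as the output file sends the text to stdout.
    Assumes pdftotext is installed and on the PATH.
    """
    result = subprocess.run(
        ["pdftotext", pdf_path, "-"],
        capture_output=True, text=True, check=True,
    )
    return result.stdout


def build_index(texts):
    """Map each lowercase word to the sorted issue labels that contain it.

    `texts` is a dict of {issue_label: extracted_text}; the labels are
    whatever you choose, e.g. "1988-01" for the January 1988 issue.
    """
    index = defaultdict(set)
    for label, text in texts.items():
        for word in re.findall(r"[a-z0-9]+", text.lower()):
            index[word].add(label)
    return {word: sorted(labels) for word, labels in index.items()}
```

You would call extract_text() once per newsletter, collect the results in a dict keyed by issue, and pass that to build_index(). Looking up a word then gives the issues that mention it, though only per issue, not per article.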
A simpler way, which we use here at WelMac, is to leverage Google.
A Google search such as this:
meeting filetype:pdf site:mause.ca
will find all the PDF files on your website that mention "meeting".
An example Google link to all the PDFs that have "Jim Foster" in
them is:
<http://www.google.com/search?q="Jim+Foster"+filetype:pdf+site%3Amause.ca>
A simple form on your website can fill in the "filetype:pdf
site:mause.ca" bits, and let your users search as needed.
E.g.:
<form action="http://www.google.co.nz/search" method="get">
  <input name="q" size="40" value=" filetype:pdf site:mause.ca">
  <input type="submit" value="Google Search" name="submit">
</form>
With a slight modification you can get rid of the confusing
"filetype:pdf site:mause.ca" bit and let people search as wanted.
You may also want to look into customising your search to "look" the
same as your webpages; see
<http://www.google.com/services/siteflavored.html>
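One way to hide the "filetype:pdf site:mause.ca" bit from users, if you would rather do it on the server than in the form, is to build the Google URL yourself. This is only a sketch in Python: the function name pdf_search_url is made up, and it just reproduces the same kind of link as the "Jim Foster" example above, with the special characters percent-encoded.

```python
from urllib.parse import urlencode

GOOGLE_SEARCH = "http://www.google.com/search"


def pdf_search_url(terms, site="mause.ca"):
    """Build a Google search URL limited to PDF files on one site.

    The user's terms are passed through unchanged; the filetype: and
    site: operators are appended so the user never has to see them.
    """
    query = f"{terms.strip()} filetype:pdf site:{site}"
    # urlencode turns spaces into "+" and escapes quotes and colons.
    return GOOGLE_SEARCH + "?" + urlencode({"q": query})


print(pdf_search_url('"Jim Foster"'))
```

Your search page would then take whatever the user typed, pass it to pdf_search_url(), and redirect the browser to the result.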
Hope this helps :)
-Jo.
WelMac VP / NMGgite
http://forums.welmac.org.nz
On 6/02/2006, at 11:47 , Jim Foster wrote:
Hi All,
Our MUG has managed to store just about all of its monthly
newsletters on our club website, one of our more impressive
accomplishments given that the first issue was produced in January
of 1988.
One thing which limits the usefulness of retaining all those issues
on our server is the lack of any Index of the articles contained in
each issue.
A manual attempt to do this was completed for one year's issues,
but was never kept up. Even that attempt was more of a Table of
Contents than an Index, in the sense that it simply listed for each
Month the titles of the major articles contained in each issue.
These newsletters are retained in PDF format.
I am wondering if anyone has any suggestions on how one might be
able to develop an Index for all this material, preferably
something that would be accessible online. The idea would be that
someone could key in a search term and get back a listing of all
the articles that contain that search term and the month and year
of the issue in which that article appears.
Any suggestions or pointers to other sites would be appreciated.
Jim Foster
President
Macintosh Users East [MaUsE]
Oshawa, Ontario, Canada
Ph: (905) 263-4167
Email: email@hidden
http://www.mause.ca