Re: AUGD: MUG Newsletter Indexing
Re: AUGD: MUG Newsletter Indexing
- Subject: Re: AUGD: MUG Newsletter Indexing
- From: michael briney <email@hidden>
- Date: Mon, 6 Feb 2006 16:58:23 -0600
Hi all...
been a busy, busy day....(but that's good!)
Looks like Jo has "stolen the thunder" and provided you pretty much
about what I was going to....almost.
Its true that Google indexes PDFs, but so does Spotlight (if you are
using a Mac OS X server); and Excite (Search) and Go.com and
well....nearly all of the popular search engines have the ability to
search PDFs.
I was over at Devon (www.devon.com) yesterday and noticed that they
are giving away a Quick find utility that also indexes PDF files....
the nice thing about the Google tool, is that once you index your
PDFs then you can add a google search field on your site with the
options to search your site (using your index) and general Google
search...
sorry for the delay in getting back to all...but the party didn't
stop until 4am and then my phone didn't stop until a little bit ago...
michael [tired and slightly hung over]
On Feb 6, 2006, at 1:41 PM, Jo Booth wrote:
Jim,
I've used ht://Dig from http://www.htdig.org in the past to index
pdfs. It was running on a local linux box, but <http://
forums.macosxhints.com/showthread.php?t=1087> suggests ht://Dig is
available in the iTools package and may run on your webserver. You
can use a php interface if needed <http://
www.computerengineering.ca/a_way_to_use_htdig_with_php/> and
<http://www.htdig.org/FAQ.html#q4.9> explains how to use Xpdf to
create the pdf indexes.
A simpler way which we use here at WelMac is to leverage Google....
A google search such as this:
meeting filetype:pdf site:mause.ca
Will find all pdf files that mention meeting on your website.
An example google link to all the pages that have "Jim Foster" in
them is:
<http://www.google.com/search?q="Jim+Foster"+filetype:pdf+site
:mause.ca>
A simple form on your website can fill in the "filetype:pdf
site:mause.ca" bits, and let your users search as needed.
Eg:
<form action="http://www.google.co.nz/search" method="get">
<input name="q" size="40" value="
filetype:pdf site:mause.ca">
<input type="submit" value="Google Search"
name="submit">
</form>
With a slight modification you can get rid of the confusing
"filetype:pdf site:mause.ca" bit and let people search as wanted.
You may also want to look into customising your search to "look"
the same as your webpages -- see <http://www.google.com/services/
siteflavored.html>
Hope this helps :)
-Jo.
WelMac VP / NMGgite
http://forums.welmac.org.nz
On 6/02/2006, at 11:47 , Jim Foster wrote:
Hi All,
Our MUG has managed to store just about all of its monthly
newsletters on our club website, one of our more impressive
accomplishments given that the first issue was produced in January
of 1988.
One thing which limits the usefulness of retaining all those
issues on our server is the lack of any Index of the articles
contained in each issue.
A manual attempt to do this was completed for one year's issues,
but was never kept up. Even that attempt was more of a Table of
Contents than an Index, in the sense that it simply listed for
each Month the titles of the major articles contained in each issue.
These newsletters are retained in PDF format.
I am wondering if anyone has any suggestions on how one might be
able to develop an Index for all this material, preferably
something that would be accessible online. The idea would be that
someone could key in a search term and get back a listing of all
the articles that contain that search term and the month and year
of the issue in which that article appears.
Any suggestions or pointers to other sites would be appreciated.
Jim Foster
President
Macintosh Users East [MaUsE]
Oshawa, Ontario, Canada
Ph: (905) 263-4167
Email: email@hidden
http://www.mause.ca
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Augd mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Augd mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden