Re: Getting the info out of the images of a PDF
Re: Getting the info out of the images of a PDF
- Subject: Re: Getting the info out of the images of a PDF
- From: "Steven D. Majewski" <email@hidden>
- Date: Wed, 2 Nov 2005 18:02:50 -0500
On Nov 2, 2005, at 12:50 PM, Rich Morin wrote:
At 12:10 PM -0500 11/2/05, Chris Tangora wrote:
We are an independent newspaper and I am trying to come up
with a way to do some in house image tracking. We save the
PostScripts of output every day and I was wondering that if I turned
them into PDF's would there be some way to find out what either the
filename of an image is or the title inside of the image is.
Given that you are generating the output yourself, I'd look into the
possibility of modifying the generation process. If you can encode
metadata into the PS files such that (say) Spotlight can find it, the
problems will become MUCH simpler.
In general, a lot of optional information MAY be encoded in either PS
or PDF files -- The dictionary for PDF Image objects, for example,
has a metadata field -- so the question is really: do the specific tools
you use in your workflow preserve this information or, can they be
easily
modified or scripted to force them to preserve this info ?
It's already been noted that you can scan the text in the PS and PDF
files to answer the first question.
-- Steve Majewski
_______________________________________________
Do not post admin requests to the list. They will be ignored.
Applescript-users mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
This email sent to email@hidden