Re: collectdata
- Subject: Re: collectdata
- From: Christopher Stone <email@hidden>
- Date: Mon, 06 Mar 2017 14:45:35 -0600
On Mar 06, 2017, at 06:41, Yvan KOENIG <email@hidden> wrote:
No need for a third-party tool to extract text from a PDF. This script delivered by Shane STANLEY does the job.
That is indeed useful – IF you only need the RAW output.
The ASObjC code returns RAW output, and the layout of the PDF is significantly mangled.
It's the same as pdftotext's -raw output.
pdftotext -raw "/path/to/your/File.pdf" -
The primary reason to use pdftotext instead of other tools is its ability to preserve the fidelity of the PDF file's layout. (It's the only tool I personally know of that does this.)
It's not perfect, but it can make parsing a PDF document's text relatively easy instead of difficult to impossible.
pdftotext -layout "/path/to/your/File.pdf" -
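As a rough illustration of why the -layout output is easier to parse, here is a minimal Python sketch. It assumes pdftotext is installed and on the PATH, and that columns in the layout-preserved text are separated by runs of two or more spaces, which will not hold for every document. The function names pdftotext_layout and split_columns are hypothetical, made up for this example.

```python
import re
import subprocess


def pdftotext_layout(pdf_path):
    """Return the text of pdf_path as printed by `pdftotext -layout`.

    Assumes the pdftotext binary is on the PATH; "-" sends output to stdout.
    """
    result = subprocess.run(
        ["pdftotext", "-layout", pdf_path, "-"],
        capture_output=True,
        encoding="utf-8",
        check=True,  # raise CalledProcessError if pdftotext fails
    )
    return result.stdout


def split_columns(line):
    """Split one layout-preserved line into cells.

    Assumption: cells are separated by two or more spaces, so single
    spaces inside a cell (e.g. "Widget A") are kept intact.
    """
    return [cell for cell in re.split(r" {2,}", line.strip()) if cell]
```

Something like `[split_columns(l) for l in pdftotext_layout("/path/to/your/File.pdf").splitlines()]` would then give one list of cells per line. The same split applied to -raw output generally won't recover columns, because the spacing that marks them is gone.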