Re: pdftotext
Re: pdftotext
- Subject: Re: pdftotext
- From: Christopher Stone <email@hidden>
- Date: Fri, 20 Dec 2013 19:35:03 -0600
On Dec 20, 2013, at 16:28, Shane Stanley <email@hidden> wrote:Is there a better way to script the extraction of text from an unlocked pdf?
That depends on what you mean by "better". if it covers without recourse to third-party software, you could use AppleScript:
______________________________________________________________________
Hey Shane,
Thanks. I was hoping there was a way to handle that from a library. :)
Your handler doesn't produce as nice an output as pdftotext's raw switch, but I'm not certain what all the differences are yet. I imagine it can be worked around.
The biggest difference seems to be how whitespace is handled.
Of course I'm using the Satimage.osax's regex engine to do the heavy lifting.
<scratched_record> Or you could use AppleScript...
Yeah, eventually.
But I've been using the SIO for a decade+ and can write extremely useful code in seconds, and that doesn't take into account all the templates and handlers I have for it.
👿
-- Take Care, Chris
|
_______________________________________________
Do not post admin requests to the list. They will be ignored.
AppleScript-Users mailing list (email@hidden)
Help/Unsubscribe/Update your Subscription:
Archives: http://lists.apple.com/archives/applescript-users
This email sent to email@hidden
References: | |
| >pdftotext (From: Christopher Stone <email@hidden>) |
| >Re: pdftotext (From: Shane Stanley <email@hidden>) |