Re: pdftotext
- Subject: Re: pdftotext
- From: Christopher Stone <email@hidden>
- Date: Sat, 21 Dec 2013 16:26:31 -0600
On Dec 21, 2013, at 14:12, Emmanuel LEVY <email@hidden> wrote:
On my machine, readtext does output the PDF's text.
Of course the file's extension has to be ".pdf".
If your file is not too big, you might send it my way and I would test it here.
______________________________________________________________________
Hallo Emmanuel,
Since 'readtext' is also part of the Satimage.osax, I just ran it from Script Debugger - hence the raw PDF code.
So to confirm - 'readtext' extracts text perfectly well when run directly from Smile, although like Shane's library the formatting is not as good as the pdftotext tool.
In fact, 'readtext' and Shane's library produce almost identical output, with only a slight variation in whitespace in a couple of places in the test document.
Ah. The whitespace differences turn out to be object replacement characters, and that creates a few formatting options.
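For anyone wanting to check this on their own output, here is a minimal sketch in Python (not AppleScript) of one way to spot object replacement characters (U+FFFC) and normalize them before comparing two extractions. The function names and sample strings are hypothetical and are not taken from the actual test document.

```python
OBJECT_REPLACEMENT = "\ufffc"  # U+FFFC OBJECT REPLACEMENT CHARACTER


def normalize(text, replacement=" "):
    """Swap each U+FFFC for `replacement`, then collapse whitespace runs on each line."""
    lines = text.replace(OBJECT_REPLACEMENT, replacement).splitlines()
    return "\n".join(" ".join(line.split()) for line in lines)


def same_text(a, b):
    """True when two extractions differ only in whitespace or U+FFFC placement."""
    return normalize(a) == normalize(b)


# Hypothetical samples standing in for the output of two extraction tools.
first_output = "Total:\ufffc42\nEnd"
second_output = "Total: 42\nEnd"

print(first_output.count(OBJECT_REPLACEMENT))  # how many U+FFFC characters are present
print(same_text(first_output, second_output))
```

Passing a different `replacement` (an empty string, a tab, a newline) is one way to try out the formatting options mentioned above.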
-- Take Care, Chris