Abby FineReader and ScanSnap
- Subject: Abby FineReader and ScanSnap
- From: Neil Laubenthal <email@hidden>
- Date: Wed, 04 Apr 2012 10:25:22 -0400
I wonder if anybody knows how to fix scans so that this will work for me . . .
I have a Fujitsu ScanSnap scanner which includes auto OCR using ABBYY FineReader; this works fine for paper that will go through the scanner's auto feeder.
However, I also need to scan things on my Canon LiDE 30 flatbed that won't fit in the Fujitsu or that are too delicate (old documents) for the auto feeder.
The Fujitsu outputs PDF files that are somehow marked as coming from the ScanSnap, and FineReader won't process the ones from the Canon, which lack that mark. So far I've been reduced to either printing and rescanning them with the Fujitsu or going the import/OCR/export/delete route using DevonThink Pro Office. Both of these work but are kind of a pain.
I've looked through the PDF files themselves with TextWrangler and found that if you replace this series of lines in the Canon file
<< /Title 47 0 R /Producer 48 0 R /Creator 49 0 R /CreationDate 50 0 R /ModDate
50 0 R >>
endobj
with the equivalent series of lines found in the Fujitsu file of
<< /Creator (ScanSnap Manager) /Producer (Mac OS X 10.5.8 Quartz PDFContext)
/CreationDate (D:20120331133513Z00'00') /ModDate (D:20120331133513Z00'00')
>>
endobj
then FineReader will process the file correctly.
I realize this is probably against the EULAs, but since I legally own all of the hardware and apps involved, I really don't think I'm violating their spirit. Making this change manually for every file, though, is more of a pain than either of the options above. Is this something I can automate using TextWrangler and AppleScript? I see that TextWrangler can run scripts, but I'm not a good enough scripter to figure out how to do it.
After experimenting a bit more with what to replace, it looks like the only change needed is the value right after /Creator: replacing 49 0 R with (ScanSnap Manager) seems to do the trick. TextWrangler's dictionary includes a replace command, but I'm not enough of a scripter to create a droplet-type app that would just replace the text for me.
Any pointers or prebuilt scripts that I could hack/borrow from to do this?
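For reference, the substitution described above could be sketched in Python roughly as follows. This is only an illustration under several assumptions: the /Creator entry appears as uncompressed text with an indirect reference (as in the Canon file quoted above), only the first match should be changed, and a PDF whose byte offsets shift slightly is still accepted, which is what the manual edit in TextWrangler suggests. The function names and the "-scansnap" output suffix are made up for this sketch.

```python
import re
import sys
from pathlib import Path

# Matches an indirect reference after /Creator, e.g. "/Creator 49 0 R".
# Assumption: the info dictionary is stored as plain, uncompressed text.
CREATOR_REF = re.compile(rb"/Creator\s+\d+\s+\d+\s+R")
REPLACEMENT = b"/Creator (ScanSnap Manager)"


def patch_creator(data: bytes) -> bytes:
    """Return data with the first /Creator indirect reference replaced."""
    # PDFs are binary, so the search works on bytes rather than text.
    return CREATOR_REF.sub(REPLACEMENT, data, count=1)


def patch_file(path: Path) -> Path:
    """Write a patched copy next to the original and return its path."""
    # Hypothetical naming scheme: scan.pdf -> scan-scansnap.pdf
    out = path.with_name(path.stem + "-scansnap" + path.suffix)
    out.write_bytes(patch_creator(path.read_bytes()))
    return out


if __name__ == "__main__":
    # Usage: python patch_creator.py file1.pdf file2.pdf ...
    for name in sys.argv[1:]:
        print(patch_file(Path(name)))
```

The script writes a copy rather than editing in place, so the original Canon scan is untouched if the patched file turns out not to open. Note that the replacement is longer than the text it replaces, so the offsets recorded in the PDF's cross-reference table no longer line up exactly; whether a given reader tolerates that is not guaranteed.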
-----------------------------------------------
There are only three kinds of stress; your basic nuclear stress, cooking stress, and A$$hole stress. The key to their relationship is Jello.
neil
_______________________________________________
AppleScript-Users mailing list (email@hidden)
Archives: http://lists.apple.com/archives/applescript-users