If you haven't already, take a look at tesseract (
http://code.google.com/p/tesseract-ocr/). There's some discussion of using
tesseract and shell scripting to work with tiffs to pdfs to ocr'd text,
which isn't exactly what you're wanting to do, I know, but may prove helpful
(http://www.groklaw.net/articlebasic.php?story=20061210115516438).
Cheers!
Bridger Dyson-Smith
On Fri, Oct 17, 2008 at 8:28 AM, Terry Harrison <[log in to unmask]> wrote:
> You might want to look at ABBYY Fine Reader 9.0 Professional, which can be
> driven from the command line. Fine Reader is used at the Library of
> Congress. Here is a info link to get you started (search "command"):
>
>
> http://www.scanstore.com/Scanning/Document_Imaging/Software/OCR_Software/Nuance/omnipage_review.asp
>
> Regards,
> Terry
>
> ------------------------------------
> Terry Harrison
> Project Manager
> CACI
> 5505 Robin Hood Road, Suite F
> Norfolk, Va. 23508
> Ph: 757.321.9120 x232
> Fax: 757.321.8797
> [log in to unmask]
>
|