If you haven't already, take a look at tesseract ( http://code.google.com/p/tesseract-ocr/). There's some discussion of using tesseract and shell scripting to work with tiffs to pdfs to ocr'd text, which isn't exactly what you're wanting to do, I know, but may prove helpful (http://www.groklaw.net/articlebasic.php?story=20061210115516438). Cheers! Bridger Dyson-Smith On Fri, Oct 17, 2008 at 8:28 AM, Terry Harrison <[log in to unmask]> wrote: > You might want to look at ABBYY Fine Reader 9.0 Professional, which can be > driven from the command line. Fine Reader is used at the Library of > Congress. Here is a info link to get you started (search "command"): > > > http://www.scanstore.com/Scanning/Document_Imaging/Software/OCR_Software/Nuance/omnipage_review.asp > > Regards, > Terry > > ------------------------------------ > Terry Harrison > Project Manager > CACI > 5505 Robin Hood Road, Suite F > Norfolk, Va. 23508 > Ph: 757.321.9120 x232 > Fax: 757.321.8797 > [log in to unmask] >