This is somewhat off-topic, since you asked for something you can use on Linux. In any case... I've been using OmniPage 16, and I'm sorry to say I can't recommend it. You can't run it from the command line, so you can't really integrate it into a script. It does have a batch manager, so you can set it to do whole folders at a time. Just make sure your folder's not too large; it crashes fairly reliably after about 10-40 pages. If you do use OmniPage to make your PDFs, I've found that it works best to convert a single TIFF into a single-page PDF, then use pdftk[1] (along with a [language of your choice] script) to put those PDFs together however you want them. Have a nice day, Jonathan [1] http://www.accesspdf.com/pdftk/ -- Jonathan M. Brinley Metadata & Digital Initiatives Developer Ball State University [log in to unmask] http://xplus3.net/ On Fri, Oct 17, 2008 at 7:56 AM, James Tuttle <[log in to unmask]> wrote: > -----BEGIN PGP SIGNED MESSAGE----- > Hash: SHA1 > > I wonder if any of you might have experience with creating text PDFs > from TIFFs. I've been using tiffcp to stitch TIFFs together into a > single image and then using tiff2pdf to generate PDFs from the single > TIFF. I've had to pass this image-based PDF to someone with Acrobat to > use it's batch processing facility to OCR the text and save a text-based > PDF. I wonder if anyone has suggestions for software I can integrate > into the script (Python on Linux) I'm using. > > Thanks, > James > > - -- > - ------------------------------- > James Tuttle > Digital Repository Librarian > > NCSU Libraries, Box 7111 > North Carolina State University > Raleigh, NC 27695-7111 > [log in to unmask] > > (919)513-0651 Phone > (919)515-3031 Fax > > -----BEGIN PGP SIGNATURE----- > Version: GnuPG v1.4.6 (GNU/Linux) > Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org > > iD8DBQFI+H1zKxpLzx+LOWMRAgxIAJwNXyeMJbk6r6hmHpNAdEvWIQbCVgCgp8JR > nyS3WZ4UuRbU/6DTH7ohe/M= > =mT2T > -----END PGP SIGNATURE----- >