Try PDFBox. It can index PDF documents.
From: Code for Libraries on behalf of Thomas Dowling
Sent: Wed 1/28/2009 2:37 PM
To: [log in to unmask]
Subject: Re: [CODE4LIB] Is there a utility to open a folder of many pdfs and determine if each one will open? (eom)
On 01/28/2009 04:31 PM, Stockwell, Chris wrote:
> Chris Stockwell
> Library Systems Programmer Analyst
> Montana State Library
Your shell of choice should let you run pdfinfo on each one. It will
either give you sensible information about the PDF file (in which case,
you can assume it's good), or give you an error message.
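
For anyone who would rather script it than type a shell loop, here is a rough sketch of the same idea in Python. It assumes pdfinfo (from xpdf or poppler) is on your PATH and that it exits with a nonzero status when it cannot open a file. The function name check_pdfs and its command parameter are just illustrative choices, not part of any tool.

```python
import subprocess
from pathlib import Path


def check_pdfs(folder, command=("pdfinfo",)):
    """Run `command <file>` on each *.pdf in folder.

    Returns a dict mapping file name -> True if the command exited
    with status 0, False otherwise. Assumes a nonzero exit status
    means the file could not be opened.
    """
    results = {}
    for pdf in sorted(Path(folder).glob("*.pdf")):
        proc = subprocess.run(
            [*command, str(pdf)],
            stdout=subprocess.DEVNULL,  # discard the metadata report
            stderr=subprocess.DEVNULL,  # discard the error text
        )
        results[pdf.name] = proc.returncode == 0
    return results
```

Calling check_pdfs("/path/to/folder") and printing the entries whose value is False gives the list of files that need a closer look. The glob is case-sensitive, so files named *.PDF would need a second pattern.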