I am working with PDF files in some South Asian and South East Asian
languages. Each PDF has ActualText added for each tag in the PDF. Each PDF
has ActualText as an alternative forvthe visible text layer in the PDF.
Is anyone aware of tools the will allow me to index and search PDFs based
on the ActualText content rather than the visible text layers in the PDF?
[log in to unmask]