You may use Tesseract OCR: https://github.com/tesseract-ocr/

On Tue, May 14, 2019 at 5:08 PM Sergio Letuche wrote:
> Hello,
>
> Could you please suggest a tool (preferably open source) that can
> extract the font and the size (height and width on a per-letter basis)
> from a scanned PDF file?
>
> Thank you in advance for any hints.

--
Regards,
Vinit Kumar, Ph.D.
Assistant Professor, Department of Library and Information Science
Babasaheb Bhimrao Ambedkar University, Rae Bareilly Road, Lucknow 226025
+919454120174
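
As a rough illustration of the Tesseract suggestion above, here is a minimal Python sketch for the per-letter size part of the question. It assumes the pytesseract wrapper and Pillow are installed and that each page of the scanned PDF has already been rendered to an image file; none of that is stated in the thread. The names CharBox, parse_boxes, and char_boxes_from_image are hypothetical helpers made up for this example. The sketch reports only character bounding boxes in pixels (origin at the bottom-left corner of the image) and does not identify fonts.

```python
from typing import List, NamedTuple


class CharBox(NamedTuple):
    """One character and its bounding box, in image pixels."""
    char: str
    left: int
    bottom: int
    right: int
    top: int

    @property
    def width(self) -> int:
        return self.right - self.left

    @property
    def height(self) -> int:
        return self.top - self.bottom


def parse_boxes(box_text: str) -> List[CharBox]:
    """Parse Tesseract box-format text.

    Each line has the form: symbol left bottom right top page
    """
    boxes = []
    for line in box_text.splitlines():
        # Split from the right so the symbol field is kept whole.
        parts = line.rsplit(" ", 5)
        if len(parts) != 6:
            continue  # skip blank or malformed lines
        char, left, bottom, right, top, _page = parts
        boxes.append(CharBox(char, int(left), int(bottom), int(right), int(top)))
    return boxes


def char_boxes_from_image(path: str) -> List[CharBox]:
    """Run Tesseract on one page image (assumes pytesseract and Pillow)."""
    # Imported lazily so parse_boxes works without the OCR stack installed.
    import pytesseract
    from PIL import Image

    return parse_boxes(pytesseract.image_to_boxes(Image.open(path)))
```

Calling char_boxes_from_image("page1.png") on a page image (the file name is only an example) would give one CharBox per recognised character. Dividing width and height by the scan resolution in DPI converts the pixel values to inches.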