Publisher's description
PDF parser and analyzer written entirely in Python. PDFMiner is a suite of programs for extracting and analyzing text data from PDF documents. Unlike other PDF-related tools, it can report the exact location of text on a page, as well as other layout information such as font size and font name, which can be useful for analyzing the document. PDFMiner can also be used as a basis for a full-fledged PDF interpreter.