Tag: parsing

Python – Parsing PDFs with Tika

Reading from a pdf is actually quite an easy task with Python. If the PDF is of course “readable”, e.g. made from a word processor. The first thing to do is to install Tika and Java:

Having this, the

Tagged with: , , , ,
Top