tabula-py: Read tables in a PDF into DataFrame

tabula-py is a simple Python wrapper of tabula-java, which can read tables in a PDF. You can read tables from a PDF and convert them into pandas DataFrames. tabula-py also lets you convert a PDF file into a CSV, TSV, or JSON file.
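As a rough illustration of the two use cases above, the sketch below wraps `tabula.read_pdf` and `tabula.convert_into`. The helper names (`extract_tables`, `convert_pdf`, `output_path_for`) and the file name `report.pdf` are hypothetical and not part of tabula-py. The sketch assumes a recent tabula-py release, in which `read_pdf` returns a list of DataFrames by default, and a working Java runtime.

```python
from pathlib import Path

# Output formats named in the description above.
SUPPORTED_FORMATS = {"csv", "tsv", "json"}


def output_path_for(pdf_path, output_format):
    """Hypothetical helper: derive 'report.csv' from 'report.pdf'."""
    return Path(pdf_path).with_suffix("." + output_format.lower())


def extract_tables(pdf_path, pages="all"):
    """Read tables from a PDF as pandas DataFrames.

    Assumes a recent tabula-py, where read_pdf returns a list of
    DataFrames (one per detected table) by default.
    """
    import tabula  # lazy import: needs tabula-py and a Java runtime

    return tabula.read_pdf(pdf_path, pages=pages)


def convert_pdf(pdf_path, output_format="csv", pages="all"):
    """Convert the tables in a PDF into a CSV, TSV, or JSON file."""
    fmt = output_format.lower()
    if fmt not in SUPPORTED_FORMATS:
        raise ValueError(f"unsupported output format: {output_format!r}")

    import tabula  # lazy import: needs tabula-py and a Java runtime

    out_path = output_path_for(pdf_path, fmt)
    tabula.convert_into(pdf_path, str(out_path), output_format=fmt, pages=pages)
    return out_path
```

With a PDF at hand, `extract_tables("report.pdf")` would give the tables as DataFrames, and `convert_pdf("report.pdf", "json")` would write `report.json` next to the input. The `pages="all"` default is a choice made for this sketch.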

We highly recommend looking at the example notebook and trying it on Google Colab.

For the high-level API reference, see High level interfaces.
