tabula-py: Read tables in a PDF into DataFrame

tabula-py is a simple Python wrapper for tabula-java that reads tables from PDF files. It returns the extracted tables as pandas DataFrames, and it can also convert a PDF file into a CSV, TSV, or JSON file.
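As a rough illustration of the two uses described above, the sketch below wraps `tabula.read_pdf` and `tabula.convert_into`. The helper names (`output_path`, `read_tables`, `convert`) and the file name `example.pdf` are hypothetical and exist only for this example. It assumes tabula-py is installed and a Java runtime is available.

```python
from pathlib import Path

# File suffix for each output format named above (CSV, TSV, JSON).
SUFFIXES = {"csv": ".csv", "tsv": ".tsv", "json": ".json"}


def output_path(pdf_path, output_format="csv"):
    """Hypothetical helper: derive the output file name, e.g. report.pdf -> report.csv."""
    return Path(pdf_path).with_suffix(SUFFIXES[output_format])


def read_tables(pdf_path, pages="all"):
    """Return a list of pandas DataFrames, one per table tabula detects."""
    import tabula  # imported lazily so output_path() works without Java installed

    return tabula.read_pdf(str(pdf_path), pages=pages)


def convert(pdf_path, output_format="csv", pages="all"):
    """Write the tables in pdf_path to a CSV/TSV/JSON file next to it."""
    import tabula

    target = output_path(pdf_path, output_format)
    tabula.convert_into(
        str(pdf_path), str(target), output_format=output_format, pages=pages
    )
    return target


# Example usage (assumes a local file named example.pdf):
#   tables = read_tables("example.pdf")
#   print(len(tables), "table(s) found")
#   convert("example.pdf", output_format="json")
```

Passing `pages="all"` asks tabula to scan every page. Extraction quality depends on how the PDF was produced, so the resulting DataFrames may need cleanup.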

We highly recommend looking at the example notebook and trying it on Google Colab.

For high-level API reference, see High level interfaces.

Contents

API Reference

Indices and tables
