tabula-py: Read tables in a PDF into DataFrame¶

tabula-py is a simple Python wrapper of tabula-java, which can read table of PDF. You can read tables from PDF and convert into pandas’s DataFrame. tabula-py also enables you to convert a PDF file into CSV/TSV/JSON file.

We highly recommend to look at the example notebook and try it on Google Colab.

For high level API reference, see High level interfaces.

Contents

API Reference

Indices and tables¶

Read the Docs v: v2.6.0

Versions: latest; stable; v2.6.0; v2.5.1

Downloads

On Read the Docs: Project Home; Builds

Free document hosting provided by Read the Docs.