Skip to content

Latest commit

 

History

History
28 lines (24 loc) · 946 Bytes

README.md

File metadata and controls

28 lines (24 loc) · 946 Bytes

Data-Transformation-IC

The code available on this repository is intended to extract tables from a series of .pdf pages, export each of them as individual .csv files, and finally zip all those files.

Requirements

It is recommendable that you have Python 3.6 or greater installed.

The necessary Python modules can be successfully installed by typing the following in your bash terminal:

$   pip3 install -r requirements.txt

INFO: In case of getting a ModuleNotFound error related with tabula-py, please reinstall tabula manually, typing in the following:

$   pip3 install tabula-py

Running the program

After setting up all the requirements mentioned above, to run the Table Extractor you may access the root folder of this repository on a bash terminal and type in the following command:

$   python3 table-extractor.py