Skip Navigation

Scout Archives

Home Projects Publications Archives About Sign Up or Log In

Tabula

Screenshot
Tabula is a tool to extract data from PDFs. It is often used to extract data from government reports for aggregation and analysis. It has been used in this way by journalists at ProPublica, The Times of London, Foreign Policy, and others. To use Tabula, users draw a box around the region in a text-based PDF (not a scanned document) that they wish to extract data from. Tabula then produces a preview of the data. If anything is missing, users may revise their selection. If not, the data may be exported to an Excel file, a CSV, or in JSON format. Tabula is a free software, distributed under the MIT license, with source code available on github. Tabula works on Windows, macOS, and Linux.
?  Cumulative Rating:
Archived Scout Publication URL
Scout Publication
Publisher
Language
Date of Scout Publication
2018-01-05
Date Of Record Creation
2018-01-04 13:21:27
Resource URL Clicks
408

Resource Comments

(no comments available yet for this resource)