Skip Navigation

Scout Archives

Home Projects Publications Archives About Sign Up or Log In

Petl

Petl is a Python library and companion command-line tool to support Extract, Transform, and Load (ETL) workflows on tabular data. Petl can read and write several dozen formats including Excel, XML, JSON, Delimited text, HTML, and many others. Numerous transformations are supported, including transformations of individual values, splitting compound values from one cell into multiple cells, SQL-like joins across several tables, deduplication, and validation against a set of constraints. Petl also includes a number of utility functions to perform statistical analysis and interactive inspection of the data. In the ETL Pipelines section of the Introduction page, users can locate a brief illustration of a simple ETL workflow. As a Python package, Petl can be installed via `pip`. On many Linux and BSD systems, Petl will also be available via the system's package manager.
Archived Scout Publication URL
Scout Publication
Publisher
GEM Subject
Date Issued
2014
Data Type
Language
Date of Scout Publication
March 19th, 2021
Date Of Record Creation
March 9th, 2021 at 3:43pm
Date Of Record Release
March 11th, 2021 at 8:57am
Resource URL Clicks
75
Add Comment

Comments

(no comments available yet)