ETL Pipeline developed with Python. This pipeline was created to Extract, Transform, and Load data from an API source into MongoDB.

LianaHarris360/ETL-Pipeline

ETL-Pipeline

Components of an ETL Pipeline

Extract - extracting data from source systems

Transform - transforming the data according to business/data model requirements

Load - loading the data into a destination table

image

In this ETL Pipeline, the data from the API source is stored in the .json file.

After the data is extracted from this file, it is transformed (cleaned and formatted). Lastly, the data is loaded into MongoDB.
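The three stages described above can be sketched roughly as follows. This is a minimal illustration, not the repository's actual code: the field names (`city`, `parameter`, `value`) and the cleaning rules are assumptions made for the example, and `load` accepts any object with an `insert_many` method so the sketch can run without a database.

```python
import json


def extract(path: str) -> list[dict]:
    """Extract: read raw records from a JSON file."""
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)
    # Assumption: the file holds either a list of records or a single record.
    return data if isinstance(data, list) else [data]


def transform(records: list[dict]) -> list[dict]:
    """Transform: drop incomplete records and normalise the remaining ones."""
    cleaned = []
    for rec in records:
        # Hypothetical rule: skip measurements that have no value.
        if rec.get("value") is None:
            continue
        cleaned.append(
            {
                "city": str(rec.get("city", "")).strip().title(),
                "parameter": rec.get("parameter"),
                "value": float(rec["value"]),
            }
        )
    return cleaned


def load(records: list[dict], collection) -> int:
    """Load: insert the records and return how many were written."""
    # Guard the empty case; pymongo's insert_many rejects an empty list.
    if not records:
        return 0
    collection.insert_many(records)
    return len(records)
```

With pymongo installed, the `collection` argument could be something like `MongoClient("mongodb://localhost:27017")["air_quality"]["measurements"]`, where the connection string, database name, and collection name are placeholders chosen for this example.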

The data comes from an API that provides air pollution measurements from various cities. The desired output is shown below.

Screen Shot 2021-08-04 at 6 50 37 PM

Screen Shot 2021-08-04 at 6 58 22 PM
