Components of an ETL Pipeline
Extract - extracting data from source systems
Transform - transforming the data according to business/data model requirements
Load - loading the data into a destination table
In this ETL Pipeline, the API data source is located in the .json file.
After extracting these files, the data will then be transformed (cleaned and formatted). Lastly the data is loaded into MongoDB.
The data source used is from an API containing Air Pollution measurements from various cities. Desired output is shown below.