Skip to content

to create the dimensions to be used as reference data. It is a set of Hive tables that are normally cold and updated not frequently. the program run ETL for system information and station information automatically as Drop table Transform JSON files to CSV Enrich stations information data with system information. Note that the system information …

Notifications You must be signed in to change notification settings

adrestrada/AE_etl-spring-2

Repository files navigation

ETL-2

to create the dimensions to be used as reference data. It is a set of Hive tables that are normally cold and updated not frequently. The program run ETL for system information and station information automatically as Drop table Transform JSON files to CSV Enrich stations information data with system information. Note that the system information has only one record. You can simply use cross join to accomplish this task Skillset HDFS Hive JSON CSV Parquet Scala SQL REST API

About

to create the dimensions to be used as reference data. It is a set of Hive tables that are normally cold and updated not frequently. the program run ETL for system information and station information automatically as Drop table Transform JSON files to CSV Enrich stations information data with system information. Note that the system information …

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages