This project creates the dimensions used as reference data: a set of Hive tables that are normally cold and infrequently updated. The program runs the ETL for system information and station information automatically, in these steps:

- Drop table
- Transform JSON files to CSV
- Enrich station information data with system information

Note that the system information has only one record, so a cross join is enough to accomplish the enrichment.

Skillset: HDFS, Hive, JSON, CSV, Parquet, Scala, SQL, REST API
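
The enrichment step can be illustrated with a minimal sketch in plain Scala, with no Hive or Spark dependency. It is not the repository's implementation: the case classes, field names, and the `EnrichStations` object are hypothetical and chosen only for illustration. It shows why a cross join is sufficient: with a single system information record, pairing every station with every system record yields exactly one enriched row per station.

```scala
// Hypothetical record shapes; the field names are assumptions,
// not taken from the repository or from any specific feed.
final case class SystemInfo(systemId: String, timezone: String)
final case class Station(stationId: String, name: String, capacity: Int)
final case class EnrichedStation(
  stationId: String,
  name: String,
  capacity: Int,
  systemId: String,
  timezone: String
)

object EnrichStations {
  // Cross join: pair every station with every system record.
  // With one system record, the result has one row per station.
  def enrich(stations: Seq[Station], systems: Seq[SystemInfo]): Seq[EnrichedStation] =
    for {
      st  <- stations
      sys <- systems
    } yield EnrichedStation(st.stationId, st.name, st.capacity, sys.systemId, sys.timezone)

  // Quote a CSV field when it contains a comma, a double quote, or a line break.
  def csvField(value: String): String =
    if (value.exists(c => c == ',' || c == '"' || c == '\n' || c == '\r'))
      "\"" + value.replace("\"", "\"\"") + "\""
    else value

  // Render one enriched station as a single CSV line.
  def toCsvLine(e: EnrichedStation): String =
    Seq(e.stationId, e.name, e.capacity.toString, e.systemId, e.timezone)
      .map(csvField)
      .mkString(",")
}
```

In Hive SQL the same idea would be expressed with a `CROSS JOIN` between the station table and the single-row system table. If the system table ever held more than one record, the cross join would multiply the station rows, so the single-record assumption is worth checking before the join runs.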
adrestrada/AE_etl-spring-2