DEZoomcamp is an educational initiative by DataTalks.Club that teaches data engineering. The program covers ETL, containerization, resource provisioning, workflow orchestration, data warehousing, and analytics engineering, among other topics. It combines lectures, projects, and real-world applications so that each data engineering concept is paired with a practical implementation.
Project Link | Area | Project Description | Tools |
---|---|---|---|
ETL with PostgreSQL, Docker, and Terraform | ETL, containers and orchestration, resource provisioning | Demonstrates essential data engineering techniques with Docker. The ETL pipeline is developed in a Jupyter notebook, using Pandas for data manipulation and SQLAlchemy for database creation and exploration. Containers are orchestrated with Docker Compose, and resources are provisioned on GCP with Terraform. | Google Cloud Platform, Pandas, SQLAlchemy, Jupyter, Docker, PostgreSQL, pgAdmin, Docker Compose, Terraform |
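
The ETL flow described in the table can be sketched in a few lines. This is a minimal illustration, not the project's actual code: it swaps in Python's built-in `csv` and `sqlite3` modules for Pandas, SQLAlchemy, and PostgreSQL so that it runs without external services. The sample data, the `trips` table, its column names, and the function names are all hypothetical.

```python
import csv
import io
import sqlite3
from itertools import islice

# Hypothetical sample data standing in for a real source file.
RAW_CSV = """pickup_datetime,passenger_count,fare_amount
2021-01-01 00:15:00,1,12.5
2021-01-01 00:40:00,0,7.0
2021-01-01 01:05:00,2,23.75
2021-01-01 01:30:00,3,9.25
"""


def extract(source, chunk_size):
    """Yield lists of at most chunk_size rows (as dicts) from a CSV source."""
    reader = csv.DictReader(source)
    while True:
        chunk = list(islice(reader, chunk_size))
        if not chunk:
            return
        yield chunk


def transform(rows):
    """Cast string fields to numeric types and drop rows with no passengers."""
    cleaned = []
    for row in rows:
        passengers = int(row["passenger_count"])
        if passengers > 0:
            cleaned.append(
                (row["pickup_datetime"], passengers, float(row["fare_amount"]))
            )
    return cleaned


def load(conn, rows):
    """Insert the cleaned rows into the (hypothetical) trips table."""
    conn.executemany("INSERT INTO trips VALUES (?, ?, ?)", rows)
    conn.commit()


def run_etl(source, conn, chunk_size=2):
    """Run extract, transform, and load chunk by chunk; return rows loaded."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS trips ("
        "pickup_datetime TEXT, passenger_count INTEGER, fare_amount REAL)"
    )
    total = 0
    for chunk in extract(source, chunk_size):
        rows = transform(chunk)
        load(conn, rows)
        total += len(rows)
    return total


# In-memory SQLite stands in for the PostgreSQL container.
conn = sqlite3.connect(":memory:")
loaded = run_etl(io.StringIO(RAW_CSV), conn)
```

Processing the source in chunks keeps memory use bounded for large files. With Pandas and SQLAlchemy, the same shape would typically use `pd.read_csv(..., chunksize=...)` together with `DataFrame.to_sql(..., if_exists="append")` against a PostgreSQL engine.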