I am a German student, who is passionate about Data Engineering and Infrastructure.
Infrastructure / Data Platform / Data Engineering 🏗
- Distributed System on Aws streaming earthquakes using Kafka
- ELT batch processing on Aws and data modeling with DBT
- Sample Data Lakehouse architecture, deployed in containers
- Simple beginners guide to containerization with Docker, with a focus on storage and build time reduction
Open source contributions 💡
Project | Added | Link |
---|---|---|
Apache Airflow | Functionality and respective unit tests to export and import roles including permissions using the Airflow CLI | Merged Pull-Request |
Apache Airflow | Changed the Airflow docker-compose to easily ingest custom config files and added relevant documentation | Merged Pull-Request |
PM4PY | Functionality to filter for a maximum coverage percentage of graph variants | Merged Pull-Request |
Apache Airflow | Added missing documentation for an Operator | Merged Pull-Request |