I have a degree in Business Administration from UFMG (Federal University of Minas Gerais), frequently positioned as the best business school in Brazil according to several rankings. However, somewhere in the past, I realized a deeper understanding of technology was fundamental to being able to generate more value for the teams I would work with, and decided to create my GitHub to develop some personal projects.
In addition to being curious and trying out new technologies, I also enjoy disconnecting from screens going out for all kinds of sports practice, and meeting new people.
I intend to use my repository as a way to organize courses and projects I've already taken to help my future self and others. Please find below an index of the most interesting projects on this GitHub.
Data Engineering:
- Data Streaming simulation consuming data from an external application and a Redis database and being consumed using Kafka and Spark Streaming
- Data Streaming simulation using Chicago's public transportation data to create a near real-time dashboard with Kafka Connect, Kafka REST Proxy, Faust, KSQL, and Postgres
- Data Pipelines with Airflow, S3 and Redshift
- AWS Data Engineering data modeling and ETL project ingesting multiple JSON files from S3 into Redshift staging and analytical tables using Python Boto3 SDK
- AWS Data Engineering ETL project using an 11GB dataset, S3, SQS, Lambda, Firehose, and Glue transforming data from CSV to JSON to Parquet so it can be queried using SQL in Athena
Web Development
I'm currently focused on refining my expertise in: