- πͺ Worked at multiple start-ups ranging from bootstrapped to Series B
- β AWS Certified Solutions Architect - Associate
- β 100% Job Success Score and only 5-star client reviews as contractor on Upwork.com
- π· Domain knowledge in Upstream Oil and Gas Operations
RE Data - Open Source Maintainer & Top Contributor
An open source data reliability framework for the modern data stack. RE Data is a DBT package, Python library and React UI. Adding the RE Data DBT package to a DBT project will run out of the box data observability SQL queries in the background when dbt run
is called. These queries calculate and store metrics like standard deviation, mean, row count, etc. The RE Data Python library can be called from the CLI to read in the stored metrics and create and serve the RE Data UI. RE Data is hosted across two GitHub repos. I am a top contributor to both and manage reviewing/merging PRs and creating releases.
- Contributions re-data/re-data
- Contributions re-data/re-data-dbt
- Releases re-data/re-data
- Releases re-data/re-data-dbt
packages:
- package: re-data/re_data
version: 0.11.0
Zillacode - Open Source LeetCode for PySpark, Spark, Pandas and DBT/Snowflake
Created Zillacode the only platform on the internet where you can practice LeetCode like questions using PySpark, Spark, DBT and Snowflake. Zillacode.com was once a live micro-serviced B2C SAAS platform with SSO and Automated Billing but is now Open Source on GitHub.
The entire platform spins up locally with one command using Docker Compose. When deployed to the Cloud Zillacode utilizes AWS Lambda Functions built from custom Docker images that package Spark and PySpark in a way that allows answers to be returned quickly.
$ git clone https://github.com/davidzajac1/zillacode.git
$ cd zillacode
$ docker-compose up
IAMScan - CLI tool checks code for AWS IAM Privileges
IAMScan is an open source command line tool that reads your code and generates an AWS IAM policy with your needed permissions. Keeping track of AWS IAM permissions is annoying and time consuming. How often have you seen an update deployed to the cloud followed by The provided execution role does not have permissions to call CreateSomething on SomeService
? IAMScan solves this issue by generating a perfectly least privileged AWS IAM Policy for all Python Files, JavaScript Files and Shell Scripts from a single command line command.
IAMScan is hosted on PyPI and is installed using pip
$ pip install iamscan
ZOil - Generate random Oil and Gas Data
ZOil is a python library used to generate random Oil and Gas data. Most Oil and Gas data is either proprietary or costly to acquire. ZOil lets you quickly generate an unlimited amount of production data that can be used to for testing, anonymization and much more. ZOil was inspired by the Faker
library.
ZOil is hosted on PyPI and is installed using pip
$ pip install zoil