Skip to content
View devmithun7's full-sized avatar
⚽
⚽
  • Northeastern University
  • Boston

Block or report devmithun7

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
devmithun7/README.md

Hi, I'm Dev Mithunisvar! πŸ‘‹

Data Engineer | AI & Analytics | MS in Information Systems @ Northeastern University

Welcome to my GitHub! I’m a data engineer passionate about building scalable pipelines, analytics systems, and AI-driven applications. I love working at the intersection of data engineering, ML, and cloud, turning raw information into reliable, actionable, and intelligent systems.


πŸ‘¨β€πŸ’» About Me

  • πŸŽ“ I’m currently pursuing my Master’s in Information Systems at Northeastern University, Boston (Dec 2025)
  • πŸ’Ό Previously worked as:
    • Data Analyst Co-op at Boehringer Ingelheim - Built Tableau dashboards, automated data pipelines, and improved data quality across large-scale healthcare campaigns.
    • Data Engineer at LTIMindtree - Engineered scalable data pipelines and optimized analytics workflows to deliver reliable insights.
  • πŸ€– Currently exploring GenAI, Python agents, RAG systems, and healthcare ML
  • 🀝 Open to collaborating on data engineering, AI/ML, BI, and research projects
  • πŸ’‘ Passionate about building scalable systems and making data accessible, reliable, and impactful

πŸš€ Key Projects

Here are some of my key projects hosted on GitHub:

🧠 GenAI & AI Projects

  • AI Healthcare System
    RAG-powered medication query engine integrating medical data ingestion, vector search, and chatbot interaction
    LLAMA β€’ RAG β€’ LangChain β€’ Streamlit β€’ Pinecone β€’ FastAPI β€’ Snowflake β€’ Python β€’ SQL β€’ Beautifulsoup

  • Mindaid
    LLM-powered mental health assistant with ML models, RAG search, and a Streamlit app for personalized counseling support
    Falcon-7B β€’ RAG β€’ Streamlit β€’ Pinecone β€’ Docker


πŸ“Š Data Engineering & BI Projects

  • Food Inspection Analysis
    End-to-end BI solution with ETL pipelines, dimensional modeling, and Tableau dashboards
    Azure Data Factory β€’ Snowflake β€’ dimensional Modeling β€’ Tableau β€’ Python β€’ SQL β€’ Alteryx

  • Optimizing Returns & Refunds in Supply Chain
    End-to-end OLTP system automating returns, refunds, customer reliability scoring, and exception handling
    OLTP β€’ PL/SQL β€’ ERD/DFD β€’ Oracle Database β€’ Supply Chain Systems β€’ Normalization

  • DBT Commercial Analytics Data Model
    Data quality framework using schema validation, tests, and CI/CD for commercial analytics with DBT, Snowflake, data modeling, and GitHub Actions automation
    DBT β€’ Snowflake β€’ Data Modeling β€’ GitHub Actions

  • Tableau Data Visualization Portfolio
    Collection of interactive dashboards covering retail, public safety, and food compliance analytics, showcasing end-to-end data storytelling and insight generation
    Tableau β€’ Data Visualization β€’ Analytics β€’ Storytelling


πŸ“Š Machine Learning Projects

  • US Accident Prediction
    ML model predicting accident severity using traffic, weather, and road condition features
    EDA β€’ ML Models β€’ Python

  • Sentiment Analysis using LSTM
    Deep learning model classifying Amazon customer reviews using LSTM and distributed training
    NLP β€’ LSTM β€’ Distributed Training (DDP) β€’ Pytorch


πŸ›  Technical Skills

πŸ’» Languages

Python β€’ SQL β€’ PySpark β€’ Scala β€’ Java β€’ Typescript

πŸ— Database/Warehouse

Snowflake β€’ Redshift β€’ BigQuery β€’ DynamoDB β€’ Delta Lake

πŸ— Data Engineering

Snowflake β€’ Databricks β€’ Airflow β€’ Kafka β€’ DBT β€’ Pyspark β€’ Flink β€’ Lambda β€’ Azure Datafactory β€’ Alteryx β€’ Docker

πŸ€– Machine Learning & AI

PyTorch β€’ TensorFlow β€’ Scikit-learn β€’ LangChain β€’ RAG β€’ NLP β€’ Transformers β€’ OpenAI APIs

πŸ“Š BI & Visualization

Tableau β€’ Power BI β€’ Plotly β€’ Streamlit

☁️ Cloud

AWS β€’ Azure β€’ GCP


πŸ“œ Certifications

  • Tableau Certified Desktop Specialist
  • AWS Certified Data Engineer Associate
  • Snowflake SnowPro Core Certification

🀝 Let's Connect!

Feel free to explore my repositories, and let’s connect to collaborate on data engineering, AI, and impactful analytics.

LinkedIn Email GitHub

Profile Views

Popular repositories Loading

  1. Cricket-game-Prediction Cricket-game-Prediction Public

    Indian Cricket game prediction Using Random Fprest

    Jupyter Notebook 1

  2. Breat-Cancer-Detection Breat-Cancer-Detection Public

    Jupyter Notebook 1

  3. Image-Classification-Using-Amazon-Sagemaker Image-Classification-Using-Amazon-Sagemaker Public

    Jupyter Notebook 1

  4. Empowering-E-commerce-Advanced-Analytics-Infrastructure Empowering-E-commerce-Advanced-Analytics-Infrastructure Public

    Jupyter Notebook 1

  5. Contactless_Ventilator Contactless_Ventilator Public

    C++

  6. Python Python Public

    Python