Skip to content
View saurabh48782's full-sized avatar

Block or report saurabh48782

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
saurabh48782/ReadMe.md
Banner

Hey there! I'm Saurabh Gupta

Data Science Engineer | Production ML & GenAI Systems

NLP • Computer Vision • MLOps • AWS


👨‍💻 About Me

I'm a Data Science Engineer with 4.5+ years of experience building ML & GenAI systems that ship to production—not just notebooks, but solutions that create measurable business impact.

Currently at Dodge Construction Network, engineering automated GenAI pipelines that reduce manual workflows and improve data quality at scale.

🎯 Core Expertise:

  • 🤖 GenAI & NLP: RAG architectures, Agentic workflows, LangChain, Ollama
  • ⚙️ Production MLOps: AWS (ECS, Fargate, SageMaker), Docker, FastAPI
  • 👁️ Computer Vision: Siamese Networks, CNNs, Transfer Learning
  • 📊 Data Engineering: SQL/ETL pipelines, PowerBI, Stakeholder Analytics

🏆 Competitive ML Achievements:

  • 🥇 1st / 1600+ — HackerEarth ML Challenge (World Water Day, 2025)
  • 🥈 5th / 1200+ — HackerEarth ML Challenge (World Earth Day, 2025)
  • 🏅 Value Evangelist Award — Get My Parking (2024)
  • 🎯 Go Getter Award — Get My Parking (2023)

💼 Impact Delivered:

  • 📉 Reduced manual workflows by 40%+
  • 📈 Improved model accuracy by 25%+
  • ⚡ Scaled systems to 100K+ daily predictions

💡 Currently Exploring: Advanced Agentic AI, RAG architectures, Scalable MLOps patterns



🛠️ Tech Stack

Languages & Databases

Python R SQL PostgreSQL MySQL

ML/DL & GenAI Frameworks

PyTorch TensorFlow Keras scikit-learn LangChain Hugging Face Ollama

Data Science & Analytics

NumPy Pandas SciPy Matplotlib

MLOps & Deployment

AWS Docker FastAPI Flask DVC

Version Control & Tools

Git GitHub Bitbucket Jupyter VS Code

Cloud & BI Tools

AWS SageMaker AWS ECS Power BI Metabase

Operating Systems

Windows Ubuntu macOS


📊 GitHub Analytics

GitHub Streak


LinkedIn Gmail HackerRank HackerEarth Instagram

💼 Open to: Senior Data Scientist | ML Engineer | GenAI/MLOps roles

Pinned Loading

  1. Customer_Attrition_Prediction Customer_Attrition_Prediction Public

    The goal of the project is to build a predictive model using machine learning concepts to predict customer attrition for a telecom service company.

    HTML 1

  2. Cab_Fare_Prediction Cab_Fare_Prediction Public

    The goal of the project is to build a predictive model using machine learning concepts to predict cab fare between two cities.

    Jupyter Notebook

  3. Learning_Machine_Learning Learning_Machine_Learning Public

    This repository shows my continuous hustle to learn machine learning and deep learning algorithms. I'll be implementing algorithms and gaining hands-on experience by using algorithms to work in a p…

    Jupyter Notebook

  4. Real_time_Face_Recognition Real_time_Face_Recognition Public

    A real-time face recognition system built using OpenCV, Haarcascade attributes and K-Nearest Neighbour algorithm.

    Jupyter Notebook

  5. Traffic_Light_Classification-Using-Deep-Learning Traffic_Light_Classification-Using-Deep-Learning Public

    Using Convolutional Neural Networks to correctly classify the traffic lights.

    Jupyter Notebook

  6. Web-Scraping-with-Scrapy Web-Scraping-with-Scrapy Public

    Scraping data from an e-Commerce website

    Python