Skip to content

This repository guides you through the process of building a GPT-style Large Language Model (LLM) from scratch using PyTorch. The structure and approach are inspired by the book Build a Large Language Model (From Scratch) by Sebastian Raschka.

License

Notifications You must be signed in to change notification settings

Sillians/Building-LLMs-from-scratch

Repository files navigation

Building-LLMs-from-scratch

This repository provides a practical guide to building a GPT-style Large Language Model (LLM) from the ground up using PyTorch, following the structure of Sebastian Raschka's book, Build a Large Language Model (From Scratch).


Prerequisites

Programming & Technical

  • Python 3.8+
  • NumPy
  • PyTorch
  • Software Engineering Basics

Mathematical

  • Calculus & Derivatives
  • Linear Algebra
  • Probability & Statistics
  • Basic Information Theory

Machine Learning

  • Core Classical ML Concepts
  • Deep Learning Fundamentals
  • NLP Basics
  • Hugging Face libraries: transformers, datasets, huggingface_hub
  • gradio for deployment

About

This repository guides you through the process of building a GPT-style Large Language Model (LLM) from scratch using PyTorch. The structure and approach are inspired by the book Build a Large Language Model (From Scratch) by Sebastian Raschka.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published