Skip to content

Building a custom Large Language Model (LLM) using transformer architectures to explore NLP tasks like text generation and summarization. This project showcases advanced AI techniques, serves as a foundation for experimentation, and contributes to open-source innovation.

License

Notifications You must be signed in to change notification settings

hanemma7moud/LLM-GPT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

8 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

LLM-GPT

Building a custom Large Language Model (LLM) using transformer architectures to explore NLP tasks like text generation and summarization. This project showcases advanced AI techniques, serves as a foundation for experimentation, and contributes to open-source innovation.

My LLM Project ๐Ÿš€

Overview

This project is my first step into building a Large Language Model (LLM) and a GPT-like architecture. The goal is to explore the foundational concepts of LLMs, experiment with transformer models, and build something impactful while polishing my profile.

Features

  • Customizable transformer architecture
  • Fine-tuning on domain-specific datasets
  • Example use cases: text generation, summarization, and more

About

Building a custom Large Language Model (LLM) using transformer architectures to explore NLP tasks like text generation and summarization. This project showcases advanced AI techniques, serves as a foundation for experimentation, and contributes to open-source innovation.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published