Repositories list
151 repositories
- An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
- A framework for few-shot evaluation of language models.
- aria (Public)
- tokengrams (Public)
- semantic-memorization (Public)
- w2s (Public)
- nanoGPT-mup (Public)
- pythia (Public)
- steering-llama3 (Public)
- huggingface.js (Public)
- llm-score-behavior (Public)