Repositories
- zero-bubble-pipeline-parallelism (Public, forked from NVIDIA/Megatron-LM)
  Zero Bubble Pipeline Parallelism
- Meta-Unlearning (Public)
- Cheating-LLM-Benchmarks (Public)
  [SafeGenAi @ NeurIPS 2024] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates
- SimLayerKV (Public)
  The official implementation of the paper "SimLayerKV: A Simple Framework for Layer-Level KV Cache Reduction".
- Attention-Sink (Public)
  [ATTRIB @ NeurIPS 2024] When Attention Sink Emerges in Language Models: An Empirical View