Skip to content

Pull requests: swiss-ai/Megatron-LM

Author
Filter by author
Loading
Label
Filter by label
Loading
Use alt + click/return to exclude labels
or + click/return for logical OR
Projects
Filter by project
Loading
Milestones
Filter by milestone
Loading
Reviews
Assignee
Filter by who’s assigned
Assigned to nobody Loading
Sort

Pull requests list

LM eval adapter for megatron
#97 opened Dec 20, 2025 by andresnowak Loading…
Multimodality/sft extension
#91 opened Oct 6, 2025 by RaphaelKreft Loading…
2 tasks
Top-K Logits Distillation
#90 opened Sep 30, 2025 by BlackSamorez Loading…
NGC25.05 + Fix Xielu
#87 opened Jul 31, 2025 by TJ-Solergibert Loading…
Update update upstream
#84 opened Jun 25, 2025 by AleHD Loading…
Update upstream
#82 opened Jun 25, 2025 by AleHD Loading…
Data Mixture Modification Script
#74 opened May 27, 2025 by alexdremov Loading…
swiss
#73 opened May 23, 2025 by xrsrke Loading…
Adding CSCS' XIELU to Megatron-LM
#71 opened May 6, 2025 by rubber-duck-debug Loading…
Minor sbatch fixes
#68 opened Apr 1, 2025 by henrique Loading…
Log all grad norms
#50 opened Feb 27, 2025 by dhia680 Loading…
Process error file - v0
#49 opened Feb 27, 2025 by dhia680 Loading…
Update fork
#34 opened Feb 14, 2025 by AleHD Loading…
Slack bot
#16 opened Feb 10, 2025 by dhia680 Loading…
ProTip! Follow long discussions with comments:>50.