disable fused lm head chunking by default by samsja · Pull Request #1904 · PrimeIntellect-ai/prime-rl

samsja · 2026-02-26T21:24:27Z

Summary

Change fused_lm_head_chunk_size default from "auto" to "disabled"

🤖 Generated with Claude Code

Note

Medium Risk
Changes a model-level default (fused_lm_head_chunk_size) that affects training execution and performance/memory characteristics across runs. CI/integration tests now pin an explicit chunk size, but other consumers may see behavior changes if relying on the previous default.

Overview
Disables fused LM head chunking by default by changing ModelConfig.fused_lm_head_chunk_size from "auto" to "disabled".
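A minimal sketch of what the default change means for callers. This is not the actual `ModelConfig` definition: `ModelConfigSketch`, the `ChunkSize` alias, and `chunking_requested` are hypothetical names used only for illustration, and the accepted values are assumed from the `"auto"`, `"disabled"`, and `8192` values mentioned in this PR.

```python
from dataclasses import dataclass
from typing import Literal, Union

# Assumed value space: an explicit integer chunk size, "auto", or "disabled".
ChunkSize = Union[int, Literal["auto", "disabled"]]


@dataclass
class ModelConfigSketch:
    # Hypothetical stand-in for ModelConfig; only this one field is modeled.
    # Before this PR the default was "auto"; after it, "disabled".
    fused_lm_head_chunk_size: ChunkSize = "disabled"


def chunking_requested(cfg: ModelConfigSketch) -> bool:
    """Return True when the config asks for chunking ("auto" or an explicit size)."""
    return cfg.fused_lm_head_chunk_size != "disabled"


default_cfg = ModelConfigSketch()  # relies on the new default
pinned_cfg = ModelConfigSketch(fused_lm_head_chunk_size=8192)  # explicit opt-in

print(chunking_requested(default_cfg), chunking_requested(pinned_cfg))
```

Under these assumptions, a run that never set the field no longer requests chunking, while a run that pins a size behaves the same before and after the change.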

Bench/CI flows are updated to explicitly opt in when needed: run_single_benchmark.py accepts/forwards --model.fused-lm-head-chunk-size, the RL multi-run integration trainer.toml pins fused_lm_head_chunk_size = 8192, and benchmark regression tests pass --fused-lm-head-chunk-size 8192 to keep results stable.
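The forwarding described above could look roughly like the following. This is a sketch under assumptions, not the contents of `run_single_benchmark.py`: the function name `build_trainer_args` is hypothetical, and only the two flag spellings (`--fused-lm-head-chunk-size` and `--model.fused-lm-head-chunk-size`) come from this PR.

```python
import argparse
from typing import List, Optional, Sequence


def build_trainer_args(argv: Optional[Sequence[str]] = None) -> List[str]:
    """Parse the benchmark-level flag and return the trainer-level arguments to forward."""
    parser = argparse.ArgumentParser()
    # Kept as a string so that values such as "auto" or "disabled" pass through unchanged.
    parser.add_argument("--fused-lm-head-chunk-size", default=None)
    args = parser.parse_args(argv)

    forwarded: List[str] = []
    # Forward only when the caller set the flag, so the trainer default applies otherwise.
    if args.fused_lm_head_chunk_size is not None:
        forwarded += ["--model.fused-lm-head-chunk-size", args.fused_lm_head_chunk_size]
    return forwarded


print(build_trainer_args(["--fused-lm-head-chunk-size", "8192"]))
```

Forwarding the flag only when it is set keeps the benchmark script neutral about the default, which is why the regression tests have to pass `8192` explicitly.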

^{Written by Cursor Bugbot for commit fc7621d.}

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cursor · 2026-02-26T21:25:14Z

src/prime_rl/configs/trainer.py

            ),
        ),
-    ] = "auto"
+    ] = "disabled"


Missing changelog for config default change

Low Severity

The default for model.fused_lm_head_chunk_size changed in src/prime_rl/configs/trainer.py, but there is no corresponding CHANGELOG.md entry for this config behavior change. This makes the default change harder to discover for users relying on documented config migrations.

^{Triggered by project rule: BugBot Instructions}

cursor

Cursor Bugbot has reviewed your changes and found 1 potential issue.

disable fused lm head chunking by default

464bc0c

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cursor bot reviewed Feb 26, 2026

reduce benchmark_regression seq_len to 16384

b586cd2

65536 OOMs on A6000 without fused lm head chunking. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

cursor bot reviewed Feb 26, 2026

tests/integration/test_benchmark_regression.py (review thread outdated and resolved)

samsja and others added 2 commits February 26, 2026 23:51

pin fused lm head chunk size in benchmark regression test

085e1d7

The benchmark baselines were generated with chunking enabled, so the regression test must explicitly enable it regardless of the default. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

pin fused lm head chunk size in rl_multi_run CI config

fc7621d

This test uses seq_len=65536 which needs chunking to avoid OOM on A6000 GPUs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
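A back-of-the-envelope estimate suggests why the sequence length matters here. The sketch below assumes that, without chunking, the LM head materializes a full logits tensor of shape (tokens, vocab), and that chunking bounds it to (chunk size, vocab). The vocabulary size and element width are illustrative assumptions, not values taken from this PR, and `logits_bytes` is a hypothetical helper.

```python
def logits_bytes(num_tokens: int, vocab_size: int, bytes_per_element: int) -> int:
    """Size in bytes of a dense (num_tokens, vocab_size) logits tensor."""
    return num_tokens * vocab_size * bytes_per_element


VOCAB_SIZE = 128_000  # assumption: illustrative vocabulary size
BYTES_FP32 = 4        # assumption: logits held in float32

full = logits_bytes(65536, VOCAB_SIZE, BYTES_FP32)    # seq_len from the CI config
chunked = logits_bytes(8192, VOCAB_SIZE, BYTES_FP32)  # pinned chunk size

GIB = 1024**3
print(f"unchunked: {full / GIB:.2f} GiB, chunked: {chunked / GIB:.2f} GiB")
```

Under these assumptions the unchunked logits alone are 8 times larger than one chunk, before counting gradients, weights, and activations, which is consistent with the OOM reported on A6000 GPUs at seq_len=65536.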

samsja merged commit 753a728 into main Feb 27, 2026
9 checks passed

samsja merged 4 commits into main from sami/disable-chunking-by-default