
Add SFT validation eval with val_data #1850

Open
philippnormann wants to merge 5 commits into PrimeIntellect-ai:main from philippnormann:feature/sft-val-eval
Conversation


@philippnormann philippnormann commented Feb 22, 2026

Summary

  • Add optional val_data and eval config blocks to SFT.
  • Run periodic validation inside SFT training and log val/loss and val/num_batches.
  • Add config validation that requires eval and val_data to be set together.
  • Add unit tests for config validation behavior.
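The "set together" rule from the summary can be sketched as a cross-field config check. This is a minimal illustration, not the actual prime-rl `SFTConfig`; the class and field names here are stand-ins.

```python
# Hypothetical sketch of the "eval and val_data must be set together" rule.
# The real SFTConfig in this PR also enforces CP/packing/seq_len constraints.
from dataclasses import dataclass
from typing import Optional


@dataclass
class SFTConfig:
    val_data: Optional[dict] = None  # stands in for the [val_data] block
    eval: Optional[dict] = None      # stands in for the [eval] block

    def __post_init__(self) -> None:
        # The two blocks only make sense together: eval needs data to run
        # on, and val_data is useless without an eval schedule.
        if (self.eval is None) != (self.val_data is None):
            raise ValueError("`eval` and `val_data` must be set together")
```

Setting neither block keeps the existing behavior; setting exactly one of them fails fast at config-parse time instead of mid-run.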

Why

Training loss alone is not a sufficient signal for checkpoint selection or overfitting detection.

Before

  • No native periodic validation signal in SFT runs.

After

  • SFT can emit validation metrics at configurable intervals during training.

Evidence

  • Reverse-text run showing periodic validation logging behavior (plots of train/loss and val/loss omitted).
  • Config used:

sft_fullft_rtext_split_200.toml

max_steps = 200

[ckpt]
interval = 20

[model]
name = "PrimeIntellect/Qwen3-0.6B"

[data]
name = "willcb/R1-reverse-wikipedia-paragraphs-v1-1000"
splits = ["train[:90%]"]
seq_len = 4096
batch_size = 32
shuffle = true
seed = 42

[val_data]
name = "willcb/R1-reverse-wikipedia-paragraphs-v1-1000"
splits = ["train[90%:]"]
seq_len = 4096
batch_size = 32
shuffle = false
seed = 42

[eval]
interval = 10
num_batches = 4

[optim]
lr = 2e-5
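With the config above, the evaluation pass amounts to averaging the loss over `num_batches` validation batches at each `interval`. The following is a framework-agnostic sketch of that loop; the real implementation in trainer/sft/train.py runs the model's forward pass and all-reduces the loss across ranks, both of which are elided here (`loss_fn` is a hypothetical stand-in for the forward pass).

```python
# Minimal sketch of the periodic validation pass, assuming loss_fn returns
# a scalar loss for one batch. Not the actual trainer/sft/train.py code.
from typing import Callable, Iterable


def run_validation(
    loss_fn: Callable[[object], float],
    val_batches: Iterable,
    num_batches: int,
) -> dict:
    losses = []
    for i, batch in enumerate(val_batches):
        if i >= num_batches:
            break
        losses.append(loss_fn(batch))
    # In distributed runs, the mean is reduced across ranks before logging.
    return {
        "val/loss": sum(losses) / max(len(losses), 1),
        "val/num_batches": len(losses),
    }
```

With `interval = 10` and `num_batches = 4` as configured above, this would emit a `val/loss` and `val/num_batches` pair every 10 training steps.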

Validation

  • uv run pytest tests/unit/train/sft/test_sft_eval_config.py -q
  • Unit tests cover: eval without val_data (invalid), val_data without eval (invalid), and eval + val_data (valid).
  • 200-step reverse-text run emits val/loss every 10 steps as configured.

Scope

  • This PR covers periodic SFT validation evaluation and config validation.

Note

Medium Risk
Touches the core SFT training loop by adding an optional validation pass with distributed reductions; while gated behind new config blocks, it can affect runtime behavior/perf when enabled.

Overview
Adds optional periodic SFT validation driven by new val_data and eval config blocks (with interval, num_batches, and eval_on_start) and logs val/loss + val/num_batches.

Updates SFTConfig validators to require eval and val_data together and to enforce CP/packing/seq_len/micro-batch constraints for validation data, implements the validation loop inside trainer/sft/train.py, and adds unit tests covering the new config validation rules.

Written by Cursor Bugbot for commit 3c19f26.

Apply CP compatibility checks to val_data, align eval scheduling with checkpoint step numbering, and document new SFT eval config fields in the changelog.
Add SFTEvalConfig.eval_on_start to support an explicit pre-training validation pass while keeping interval-based eval semantics unchanged by default.
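The eval_on_start semantics described above can be sketched as a small scheduling predicate. This is an illustrative guess at the intended behavior (interval-based eval unchanged by default, plus an optional pass at step 0), not the PR's actual scheduling code.

```python
# Hypothetical sketch of eval scheduling: run every `interval` steps,
# and optionally once before training when eval_on_start is enabled.
def should_eval(step: int, interval: int, eval_on_start: bool = False) -> bool:
    if step == 0:
        return eval_on_start
    return step % interval == 0
```

By default nothing changes (`should_eval(0, interval)` is false), so existing interval-based configs keep their current behavior.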
cursor bot left a comment

Cursor Bugbot has reviewed your changes and found 1 potential issue.

