@dxqb
Collaborator

dxqb commented Jan 16, 2026

git fetch origin pull/1261/head:pr-1261
git switch pr-1261

Then run update.sh (Linux/macOS) or update.bat (Windows).

  • requires testing
  • requires experimentation with quantization and training parameters; the preset is very preliminary
  • might break Qwen-Image: this PR updates diffusers to a commit that includes Flux2-Klein, but that commit also fixes some bugs in the diffusers Qwen-Image pipeline, so the workarounds for these bugs have to be removed in OneTrainer
  • there is an issue with masked training in this branch

includes #1237
see e50970f for the code changes made only in this PR

@yamatazen

No 4B model?

@dxqb
Collaborator Author

dxqb commented Jan 17, 2026

> No 4B model?

works with both

@yamatazen

Where is the 4B preset?

@dxqb
Collaborator Author

dxqb commented Jan 17, 2026

> Where is the 4B preset?

there is none. just replace the base model name
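
As a rough sketch of that suggestion, the snippet below copies a preset file and changes only the base model entry. It assumes the preset is a JSON file with a top-level `base_model_name` key; the helper name `derive_preset`, the file paths, and the model name in the usage comment are placeholders, not values confirmed in this thread.

```python
import json
from pathlib import Path


def derive_preset(src: Path, dst: Path, new_base_model: str,
                  key: str = "base_model_name") -> dict:
    """Copy a JSON preset to dst, changing only the base model entry.

    `key` is an assumption about how the preset names its base model field.
    """
    preset = json.loads(src.read_text(encoding="utf-8"))
    if key not in preset:
        # Fail loudly instead of silently adding a key the trainer may ignore.
        raise KeyError(f"{key!r} not found in {src}")
    preset[key] = new_base_model
    dst.write_text(json.dumps(preset, indent=4), encoding="utf-8")
    return preset


# Hypothetical usage (paths and model name are placeholders):
# derive_preset(
#     Path("training_presets/<existing Klein preset>.json"),
#     Path("training_presets/<my 4B preset>.json"),
#     "<name or path of the 4B base model>",
# )
```

All other settings are carried over unchanged, so the quantization and training parameters may still need tuning for the smaller model.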

@pyros-projects

Three test runs so far, and everything works fine. I did not test Qwen-Image, though. I will report in detail later, with examples.

Is there a way to run the in-training sampling with a different model than the one defined under "model"? For example, I train on the base model, but sampling happens with the distilled version.
That would also be handy for Z-Image.

If not, is this something OneTrainer should be able to do?

