Hello,
first of all, thanks for the amazing research and for open-sourcing the code & weights.
I ran into issues training TiTok and TA-TiTok with the Single Stage loss. I tried:
- training TiTok with the Single Stage loss
- training TA-TiTok with the Single Stage loss, using a placeholder text-guidance prompt on ImageNet
In both cases I got very bad results: the loss goes down, but the reconstructed image is just pale noise without any real resemblance to the right colors or shapes. I trained with bs=32 on a single A100 GPU for 10 hours, after which I would expect to see the first signs of convergence. I also noticed the grad norms are all in the 1e-7 to 1e-9 range.
Linked below are a minimal repro (~5 lines changed in the config) and the resulting wandb training run with results.
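For anyone checking for the same symptom: the 1e-7 to 1e-9 values above are the global gradient norm, i.e. the L2 norm over all parameter gradients concatenated. A minimal, framework-agnostic sketch of that computation (`global_grad_norm` is an illustrative helper, not part of the repository's code):

```python
import math

def global_grad_norm(grads):
    """Global L2 norm over all parameter gradients.

    grads: list of per-parameter gradients, each a flat list of floats.
    This mirrors what torch.nn.utils.clip_grad_norm_ reports.
    """
    return math.sqrt(sum(g * g for grad in grads for g in grad))

# Two toy "parameters" with gradients in the range observed in this issue;
# a healthy run would typically sit orders of magnitude higher.
tiny_norm = global_grad_norm([[1e-8, -1e-8], [2e-8]])
```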
Thank you for bringing this issue to our attention. We’ve identified that the problem was caused by the Perceptual Loss not being updated to align with the latest configuration. This has now been fixed in the latest update. We’ve verified that with the fix, the model begins reconstructing reasonable images around 25k steps with a total batch size of 256 on 8 A100 GPUs. Please give it a try and let us know if you encounter any further issues.
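For context on why a stale perceptual-loss configuration can stall training this badly: the perceptual term is one of the weighted terms in the total reconstruction objective, so if it is broken its gradient contribution can dominate or collapse the whole sum. A minimal sketch of that weighting (function and argument names here are illustrative, not the repository's actual API):

```python
def total_loss(l2_loss, perceptual_loss, quantizer_loss,
               perceptual_weight=1.0, quantizer_weight=1.0):
    # Weighted sum of the training objective's terms. If the perceptual
    # term is misconfigured (e.g. weights not matching the current
    # config), the combined gradient can collapse and training stalls.
    return (l2_loss
            + perceptual_weight * perceptual_loss
            + quantizer_weight * quantizer_loss)
```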
minimal diff repro
wandb training run report