What parameters should I use to reproduce the pre-training process of ResNet18 #205

smallbox120 · 2020-10-27T08:32:13Z

I use sgd 0.01, and scheduled 30, 60, 90 epoches with lr decay rate 0.1, momentum is set at 0.9 with nesterov enabled, but I can not get the results as good as directly downloaded model, which is 8% less in top-1. Can the owner give me some tips on how to train the pre-trained model?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

What parameters should I use to reproduce the pre-training process of ResNet18 #205

What parameters should I use to reproduce the pre-training process of ResNet18 #205

smallbox120 commented Oct 27, 2020

What parameters should I use to reproduce the pre-training process of ResNet18 #205

What parameters should I use to reproduce the pre-training process of ResNet18 #205

Comments

smallbox120 commented Oct 27, 2020