I'm doing some research into the method for finding an optimal learning rate.
I built the models both from scratch, as in the videos, and in a more torch-friendly way, i.e. using torch modules, dataloaders, an optimizer, etc.
However, something weird happens when running the following, which 'should' be the same as the code from the video. The lr vs. loss graph is shown in im1 below.
im2 uses code very similar to the videos, i.e. manually updating the weights. Why are the results not the same? Is the optimizer doing something different in the backend? Overall the training is about the same; both converge at roughly the same rate.
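For reference, here is a minimal sketch of the kind of sweep I mean, written in plain Python rather than torch so it stands alone. It is not the exact code from the videos: the toy linear model, the data, and the names `lr_sweep`, `lre_min`, and `lre_max` are all assumptions made up for illustration. It steps the learning rate exponentially from 10^-3 to 10^0, applies the manual `p -= lr * grad` update at each step, and records the loss seen at each learning rate.

```python
import random


def lr_sweep(num_steps=200, lre_min=-3.0, lre_max=0.0, seed=0):
    """Return (lrs, losses) for an exponentially spaced learning-rate sweep.

    Hypothetical toy setup: fit y = w*x + b with full-batch MSE.
    """
    rng = random.Random(seed)
    # Made-up data: y = 2x + 1 plus a little Gaussian noise.
    xs = [rng.uniform(-1.0, 1.0) for _ in range(64)]
    ys = [2.0 * x + 1.0 + rng.gauss(0.0, 0.05) for x in xs]
    n = len(xs)

    w, b = 0.0, 0.0
    lrs, losses = [], []
    for i in range(num_steps):
        # The exponent moves linearly from lre_min to lre_max; lr = 10**exponent.
        lre = lre_min + (lre_max - lre_min) * i / (num_steps - 1)
        lr = 10.0 ** lre

        # Forward pass: mean squared error over the whole batch.
        errs = [w * x + b - y for x, y in zip(xs, ys)]
        loss = sum(e * e for e in errs) / n

        # Backward pass: analytic gradients of the MSE.
        grad_w = sum(2.0 * e * x for e, x in zip(errs, xs)) / n
        grad_b = sum(2.0 * e for e in errs) / n

        # Manual update, the same rule as hand-written "p -= lr * grad".
        w -= lr * grad_w
        b -= lr * grad_b

        # Record the loss computed before this step's update.
        lrs.append(lr)
        losses.append(loss)
    return lrs, losses


if __name__ == "__main__":
    lrs, losses = lr_sweep()
    for lr, loss in list(zip(lrs, losses))[::40]:
        print(f"lr={lr:.4f} loss={loss:.4f}")
```

As far as I understand, `torch.optim.SGD` with its default arguments (no momentum, no weight decay) applies this same update rule, which is why I expected the two graphs to match.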