Implementing an optimizer (Nesterov SGD) for training a CNN model on the CIFAR-10 dataset in the following settings:
Directory optimizer-benchmarks
contains benchmarks for various first-order and second-order based GD methods:
- SGD
- Momentum SGD
- Nesterov SGD
- Adagrad
- RMSProp
- ADAM