SR2 is an optimizer that trains deep neural networks with nonsmooth regularization to retrieve a sparse and efficient sub-structure.
The optimizer minimizes a the sum of a finite-sum loss function and a nonsmooth nonconvex regularizer:
F(x) = l(x) + R(x)
with an adaptive proximal quadratic regularization scheme.
Supported regularizers are
- Numpy
- Pytorch
- PyHessian [https://github.com/amirgholami/PyHessian]