Second-order optimisation? #28
Labels
good first issue
Good for newcomers
idea
Something not relevant to current work, but could be useful in the future
low priority
Should be fixed eventually, but isn't urgent
Optimisation
Issue affects the optimisation of the detector
Idea
Since we're dealing with comparatively few parameters (4 layers * 10*10 voxels * 2 parameters per voxel = 800 params), we may want to investigate whether optimisation based on the (approximation) of the Hessian of the loss provides quicker convergence. PyTorch includes L-BFGS, but it only works for single parameter groups, and Newtonian optimisation should be implementable.
Possible problems
The text was updated successfully, but these errors were encountered: