Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Second-order optimisation? #28

Open
GilesStrong opened this issue May 20, 2021 · 0 comments
Open

Second-order optimisation? #28

GilesStrong opened this issue May 20, 2021 · 0 comments
Labels
good first issue Good for newcomers idea Something not relevant to current work, but could be useful in the future low priority Should be fixed eventually, but isn't urgent Optimisation Issue affects the optimisation of the detector

Comments

@GilesStrong
Copy link
Owner

Idea

Since we're dealing with comparatively few parameters (4 layers * 10*10 voxels * 2 parameters per voxel = 800 params), we may want to investigate whether optimisation based on the (approximation) of the Hessian of the loss provides quicker convergence. PyTorch includes L-BFGS, but it only works for single parameter groups, and Newtonian optimisation should be implementable.

Possible problems

  • Memory requirements
    • L-BFGS has a history size which can be reduced
    • An 800x800 Hessian should be fine, though, I'd have thought
  • Learning rate scale
    • Currently we use two different learning rates due to the different scales of the resolution and efficiency. Probably we could keep the parameters at the same scale (N(0,1)) and then rescale them during application in the forward pass.
@GilesStrong GilesStrong added good first issue Good for newcomers low priority Should be fixed eventually, but isn't urgent idea Something not relevant to current work, but could be useful in the future labels May 20, 2021
@GilesStrong GilesStrong added the Optimisation Issue affects the optimisation of the detector label Jun 4, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers idea Something not relevant to current work, but could be useful in the future low priority Should be fixed eventually, but isn't urgent Optimisation Issue affects the optimisation of the detector
Projects
None yet
Development

No branches or pull requests

1 participant