Cholesky decomposition unsuccessful #11

Open
lanyiyun opened this issue Aug 22, 2018 · 5 comments

Comments

@lanyiyun

Hi,

This has been asked before. I have run into the Cholesky issue repeatedly, even with a large batch size. What has your experience been in resolving it? Any tips would help. Thank you in advance!

@kstant0725
Collaborator

kstant0725 commented Sep 3, 2018 via email

@lihenryhfl
Collaborator

Another tip is reducing the number of clusters, if possible.

One requirement of SpectralNet is that the orthonormalization layer has rank equal to the number of clusters you set. Each minibatch must have enough variety / structure to yield a full-rank orthonormalization matrix. Thus, the dual to increasing the minibatch size is decreasing the number of clusters. If your clusters are relatively balanced and the number of clusters is on the order of a dozen or so, you're probably fine as is. But if it's much larger, you might have problems. We have a few ideas in mind for loosening this restriction, but there are no concrete plans yet.
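
To make the rank requirement concrete, here is a minimal NumPy sketch, not the SpectralNet code itself. It assumes the layer factorizes the Gram matrix of the batch outputs with a Cholesky decomposition, and it idealizes the network outputs as one indicator column per cluster (`one_hot_outputs` and `gram_is_factorizable` are hypothetical helpers for illustration). A batch that misses one cluster entirely has a zero column, so the Gram matrix is singular and the factorization fails:

```python
import numpy as np

def gram_is_factorizable(Y):
    """Return True if the Cholesky factorization of Y^T Y succeeds."""
    try:
        np.linalg.cholesky(Y.T @ Y)
        return True
    except np.linalg.LinAlgError:
        return False

def one_hot_outputs(labels, n_clusters):
    """Idealized batch outputs: row i is the indicator vector of labels[i]."""
    return np.eye(n_clusters)[labels]

n_clusters = 4

# Every cluster appears in the batch, so the outputs have full column rank.
balanced_batch = one_hot_outputs(np.array([0, 1, 2, 3, 0, 1, 2, 3]), n_clusters)

# No sample from cluster 3, so column 3 is all zeros and the rank drops to 3.
unbalanced_batch = one_hot_outputs(np.array([0, 0, 0, 1, 1, 2, 0, 0]), n_clusters)

rank_balanced = np.linalg.matrix_rank(balanced_batch)
rank_unbalanced = np.linalg.matrix_rank(unbalanced_batch)
ok_balanced = gram_is_factorizable(balanced_batch)
ok_unbalanced = gram_is_factorizable(unbalanced_batch)

print(rank_balanced, ok_balanced)
print(rank_unbalanced, ok_unbalanced)
```

With many clusters or unbalanced classes, batches like the second one become more likely, which is why a larger batch or fewer clusters helps.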

@lanyiyun
Author

Thank you for your input, that makes a lot of sense. I was trying to get 30+ clusters on a fairly large dataset, and it is most likely not balanced.

@lihenryhfl
Collaborator

I see. Yeah, this could be the reason why you had problems, especially if the classes are not balanced, unfortunately.

@spdj2271

spdj2271 commented Jan 5, 2022

I ran into this problem on some datasets, such as FRGC.
It works when I change epsilon (core/layers.py, line 11) from 1e-7 to 1e-5 and reduce spec_lr from 1e-3 to 1e-5.
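
For anyone wondering why epsilon matters, here is a rough NumPy sketch under the assumption that the layer factorizes `Y^T Y + epsilon * I` (I have not copied the repository code; `orthonorm_weights` is a hypothetical name). The added diagonal term keeps the matrix positive definite even when a batch is rank deficient, and a larger epsilon leaves more margin against rounding error:

```python
import numpy as np

def orthonorm_weights(Y, epsilon=1e-5):
    """Return W such that (Y @ W).T @ (Y @ W) is approximately m * I.

    Assumption: the Gram matrix is regularized with epsilon * I before
    the Cholesky factorization.
    """
    m, k = Y.shape
    gram = Y.T @ Y + epsilon * np.eye(k)
    L = np.linalg.cholesky(gram)          # raises LinAlgError if not positive definite
    return np.linalg.inv(L).T * np.sqrt(m)

# Full-rank batch: the transformed outputs are orthonormal up to the 1/m scaling.
rng = np.random.default_rng(0)
Y_full = rng.normal(size=(64, 4))
Z = Y_full @ orthonorm_weights(Y_full)
is_orthonormal = np.allclose(Z.T @ Z / 64, np.eye(4), atol=1e-4)

# Rank-deficient batch: the last column is all zeros.
Y_deficient = np.eye(4)[[0, 0, 0, 1, 1, 2, 0, 0]]

try:
    orthonorm_weights(Y_deficient, epsilon=0.0)
    raised_without_epsilon = False
except np.linalg.LinAlgError:
    raised_without_epsilon = True

# With a positive epsilon the factorization goes through.
W = orthonorm_weights(Y_deficient, epsilon=1e-5)
weights_are_finite = bool(np.all(np.isfinite(W)))
```

Note that epsilon only prevents the failure. The weights for the empty direction become large (on the order of 1/sqrt(epsilon)), which may be why lowering spec_lr helped at the same time.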
