GPLVM tutorial #122
Conversation
```julia
α ~ MvLogNormal(MvNormal(K, 1.0))
σ ~ MvLogNormal(MvNormal(D, 1.0))
# use filldist for Zygote compatibility
Z ~ filldist(Normal(0., 1.), K, N)
```
The following will fix it:

```julia
Z_vec ~ filldist(Normal(0., 1.), K * N)
Z = reshape(Z_vec, K, N)
```
It's due to the constructed mean-field approximation not handling higher-dimensional arrays properly. I have a PR open with a fix, but it never got any attention; I'll try to get that PR merged so the above workaround won't be necessary.
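For context, here is a minimal plain-Julia sketch (outside of any Turing model) of what the workaround does: sample a flat vector, then relabel it as a `K × N` matrix. The values are identical either way; only the container shape changes.

```julia
using Random

K, N = 2, 5
Random.seed!(42)

# What `Z_vec ~ filldist(Normal(0.0, 1.0), K * N)` draws: a flat K*N vector.
z_vec = randn(K * N)

# Reshaping recovers the K×N latent matrix without copying or reordering values.
Z = reshape(z_vec, K, N)

@assert size(Z) == (K, N)
@assert vec(Z) == z_vec
```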
Great, thanks for the quick response!
I am running into some sort of race condition/infinite loop now (or it's just taking really long). It's affecting both MCMC inference and ADVI. For ADVI I get an error message, whereas MCMC just seems to run forever. This happens when I uncomment these two lines; I added a reproducible example in the latest commit:
```julia
# using Zygote # Tracker supported? check it?
# Turing.setadbackend(:zygote)
```
Should I raise an issue for this in Turing?
I think so -- I've been running into this on another project too, and honestly I'm kind of glad that someone else can replicate this issue outside of my project.
This is probably also what's timing out all the CIs 😕
EDIT: It actually seems like it's the GP in this case, though. If you remove the `Y ~ ...` it works.

Also, you probably don't want to use `filldist(prior, D)` here; `Y ~ prior` should do the trick (Turing just calls `loglikelihood` under the hood, so you can verify that this works by checking that `loglikelihood(prior, Y)` returns a single float).
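To illustrate the check, here is a small sketch using an `MvNormal` as a stand-in for the finite-dimensional GP prior (the actual `prior` in the tutorial comes from `AbstractGPs`; the dimensions `N`, `D` are placeholders):

```julia
using Distributions, LinearAlgebra

N, D = 10, 3
prior = MvNormal(zeros(N), I)   # stand-in for the GP's finite-dimensional prior

# D columns of N observations each, as Turing would see `Y` with `Y ~ prior`.
Y = rand(prior, D)

# `loglikelihood` sums over the columns and returns a single float,
# which is why a plain `Y ~ prior` handles all D columns at once.
ll = loglikelihood(prior, Y)
@assert ll isa Float64
```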
I'll propose a fix for the CI in a bit. It's some issue with the caching.
Could you make the name of the tutorial consistent with the other names? The pattern is
tutorials/02-logistic-regression/02_logistic-regression.jmd
[...]
tutorials/08-multinomial-logistic-regression/08_multinomial-logistic-regression.jmd
[...]
Also, it would be great if you could hide some assertions in the tutorial. That way, it's easier to check whether the tutorial is also built correctly in the future with different Julia or dependency versions.
Finally, maybe it's a good idea to split this PR up into two PRs. That should speed up the review process.
For example, the CI will fail when assertion blocks such as
fail. Note that this block is hidden from the output. This functionality is tested in https://github.com/TuringLang/TuringTutorials/blob/master/test/build.jl.
@leachim The following error will be fixed if you merge master into your branch:
Many thanks for your help! I rebased this on master; hope that's okay now. Let me know if I did something wrong :) I'll add some assertions later on. This is still a bit of a work in progress, so bear with me for a little longer. I will let you know once it's ready for review again.
Great 👍 Thanks. I see that the tests in CI are still broken, but that is my bad and not yours. Let me know when the tutorial works for you, then I'll fix the tests.
Yeah, I'm afraid that something went wrong. PRs which add a new tutorial should only add three files. For example, to add a new tutorial "my-bayesian-tutorial", the PR should only add the files:
and, locally, the build should succeed when doing
I'm aware that this isn't super easy to do and could be easier. I'm working on that, but I also have a 40-hour-a-week job and a book in progress, so it will take a while still 🙂
It's probably easiest to just copy your files into a new and up-to-date branch and open a new PR.
Force-pushed from e98fc74 to 0fdac31
Okay, so I rebased this on master and also squashed some commits, so it should be pretty much equivalent to opening a new pull request, but we can keep the discussion history. I am still working on the tutorial. There are two things that might cause issues at the moment.

I'll try to speed it up more, but I'm not sure how much I can reduce the runtime. I am also still working on details and fine-tuning things.
That's very unfortunate, yes, because it makes maintaining the tutorial very difficult. Looking at the other tutorials, it will probably be outdated or broken within a few months to a year. Can we find a solution for that? Maybe a smaller dataset or a simpler example?
update naming for creation refactor files
add dependencies
Force-pushed from 0fdac31 to 3debcec
From the logs:
The build will work locally because you have probably added
...ials/12-gaussian-process-latent-variable-model/12_gaussian-process-latent-variable-model.jmd
Let's start by loading some dependencies.

```julia
# Load Turing.
```
`# Load Turing.`
```julia
StatsBase.transform!(dt, oil_matrix);
```
> We will start out, by demonstrating the basic similarity between pPCA (see the tutorial on this topic)
Suggested change:

> We will start out by demonstrating the basic similarity between pPCA (see the tutorial on this topic)
```julia
# Priors
α ~ MvLogNormal(MvNormal(K, 1.0))
σ ~ LogNormal(0.0, 1.0)
Z ~ filldist(Normal(0., 1.), K, N)
```
Suggested change:

```julia
Z ~ filldist(Normal(0.0, 1.0), K, N)
```
More explicit/clear and also consistent with other uses in the tutorial.
```julia
prior = gp(ColVecs(Z), noise)

for d in 1:D
    Y[:, d] ~ prior
```
Spacing is inconsistent. Here there is a space (`Y[:, d]`) and above there is not (`x[:,d]`). There is no good reason not to stick to one style and use it everywhere. I would suggest using a space after the comma. See also "Use whitespace to make the code more readable." in BlueStyle.
```julia
chain_ppca = sample(ppca, NUTS(), 1000)
```

```julia
w = permutedims(reshape(mean(group(chain_ppca, :w))[:,2], (n_features,n_features)))
```
Can a simple one-liner function be defined for these two? The name of the one-liner could also shed light on what these lines do exactly.
This also applies to the 4 or 5 occurrences of this kind of line below.
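For instance, a hypothetical one-liner (the name and signature here are my own, not from the tutorial) that names the operation and could be reused at each occurrence:

```julia
# Hypothetical helper: turn a flat vector of posterior means into an
# n×n weight matrix, transposed so rows follow the model's convention.
mean_to_matrix(means, n) = permutedims(reshape(means, (n, n)))

# Dummy values standing in for `mean(group(chain_ppca, :w))[:, 2]`:
means = collect(1.0:9.0)
W = mean_to_matrix(means, 3)

@assert size(W) == (3, 3)
@assert W[1, :] == [1.0, 2.0, 3.0]
```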
```julia
df_gplvm_linear = DataFrame(z_mean', :auto)
```
Could you add a small clarification for this block?
### Speeding up inference

> Gaussian processes tend to be slow, as they naively require.
The end of the sentence is missing.
@leachim, you got the "ERROR: LoadError: LoadError: TypeError: in Type{...} expression, expected UnionAll, got Type{Parsers.Options}" error (more info in JuliaData/CSV.jl#862). To fix it, simply update the CSV dependency to 0.9.
```julia
using AbstractGPs, KernelFunctions, Random, Plots

using Distributions, LinearAlgebra
using VegaLite, DataFrames, StatsPlots, StatsBase
```
Is there a specific reason for using both StatsPlots and VegaLite? Seems a bit complicated to use two different plotting frameworks in one tutorial.
```julia
Z ~ filldist(Normal(0.0, 1.0), K, N)
mu ~ filldist(Normal(0.0, 1.0), N)

kernel = σ * SqExponentialKernel() ∘ ARDTransform(α)
```
Use the function from above?
Yes, thanks. There is an open CompatHelper pull request in RDatasets, so once that is merged, I will update the version of CSV. Currently, RDatasets only supports up to 0.8.
Co-authored-by: David Widmann <[email protected]> Co-authored-by: Rik Huijzer <[email protected]>
This is some initial code for the GPLVM model. @torfjelde I am trying to get ADVI working for this example, but an error pops up to do with `logabsdetjac`. Do you mind having a look, since you seem to have worked on the Bijectors package before?
It works with NUTS sampler inference, but running the same model with ADVI throws this error.