
Neural Kernel Network implementation #92

Conversation

HamletWantToCode (Contributor)

This PR is an attempt to implement the Neural Kernel Network (NKN) for Stheno; a working example can be found here

Interface

primitive_layer = Primitive(EQ(), PerEQ(), Linear())  # <:Primitive, container for the basic kernels; computes each kernel's covariance matrix and makes it a valid input to the neural network
lin1 = LinearLayer(3, 4)                              # <:LinearLayer, linear transformation without bias or activation
lin2 = LinearLayer(2, 1)
nn = Chain(lin1, Product, lin2)                       # use Flux's `Chain` to build the neural network
nkn_kernel = NeuralKernelNetwork(primitive_layer, nn) # <:Kernel, composite kernel built on the neural network
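
For context, here is a hedged sketch of how the resulting kernel could then be used like any other Stheno kernel (this assumes Stheno's 2020-era GP/GPC API; the toy data and variable names are illustrative, not taken from this PR):

using Stheno

x = collect(range(-3.0, 3.0; length=50))  # toy 1-D inputs (illustrative)
y = sin.(x) .+ 0.1 .* randn(50)           # toy observations (illustrative)

f = GP(nkn_kernel, GPC())   # wrap the NKN kernel in a GP
fx = f(x, 0.1)              # finite-dimensional marginal with observation noise 0.1
logpdf(fx, y)               # log marginal likelihood, as for any other Stheno kernel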

Newly implemented types & functions:

  1. Primitive: NKN can be viewed as a composite kernel, and Primitive serves as a container for all the basic kernels. It has ew & pw methods implemented, but it is not a subtype of Stheno's Kernel. Calls like ew(<:Primitive, x) & pw(<:Primitive, x) compute ew and pw for each kernel inside the Primitive, and then prepare the results as inputs to the following neural network.

  2. LinearLayer: This is just a linear transformation z = W*x. The reason I created this type instead of using Flux's Dense is that we don't need a bias or an activation function here (see the sketch after this list).

  3. Product: A function that performs element-wise multiplication of kernel matrices.

  4. NeuralKernelNetwork: A subtype of Stheno's Kernel type with ew and pw methods implemented, so it can be used like any ordinary Stheno kernel.
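
As mentioned in item 2, here is a minimal, hedged sketch of what LinearLayer and Product could look like. The field names, the constructor, and the assumption that the primitive layer stacks each kernel's flattened output as a row of a matrix are mine, not necessarily what this PR does:

using Flux

# Bias-free, activation-free linear map; the weight matrix is the only
# trainable field, so Flux.params can track it.
struct LinearLayer{T<:AbstractMatrix}
    W::T
end
LinearLayer(in_dim::Int, out_dim::Int) = LinearLayer(randn(out_dim, in_dim))
(l::LinearLayer)(x) = l.W * x
Flux.@functor LinearLayer

# Product: element-wise multiplication of adjacent pairs of rows, assuming
# each row holds one (flattened) kernel matrix from the primitive layer.
# This halves the number of rows, e.g. 4 -> 2 as in the interface example.
Product(x) = x[1:2:end, :] .* x[2:2:end, :]

Note that for the composite to remain a valid (positive semi-definite) kernel, the NKN paper [1] requires the linear weights to be non-negative; how the actual implementation enforces this is not shown in the sketch above.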

Supported features

  • Extract all parameters within the NKN with Flux's params method
  • Use Zygote to compute the gradient of the logpdf w.r.t. all the parameters in the NKN (sketched below)
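
A hedged sketch of what this workflow could look like, reusing the nkn_kernel, x, and y names from above (the linked example's actual training loop may differ):

using Flux, Zygote, Stheno

θ = Flux.params(nkn_kernel)   # collect every trainable array inside the NKN
gs = Zygote.gradient(θ) do
    f = GP(nkn_kernel, GPC())
    -logpdf(f(x, 0.1), y)     # negative log marginal likelihood
end
# gs is a Zygote.Grads; gs[p] holds the gradient for each parameter p in θ.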

To be discussed

In order to allow Flux's params to extract all the parameters inside the NKN, I slightly modified the definitions and field types of Scaled, Stretched and RQ in kernels.jl.

  1. Scaled: the original σ² is replaced by logσ², and its type is restricted to AbstractVector. The reason is that σ² must remain positive during optimization, while Flux's params method requires trainable fields to be AbstractArrays (a sketch of this pattern follows the note below).

  2. Stretched: a is replaced by loga, and its type is restricted to AbstractVecOrMat (for the same reason as above).

  3. RQ: α is replaced by logα, and its type is restricted to AbstractVector (for the same reason as above).

  4. PerEQ: I noticed that this kernel hadn't been exported by Stheno yet, so I reimplemented and exported it.

NOTE: I have only done some basic tests of these modifications; they are not guaranteed to be type stable and may break in other situations.
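
As referenced in item 1 above, here is a minimal sketch of the log-reparameterisation pattern as it might look inside Stheno's kernels.jl (where Kernel and ew are in scope); the field names and the ew body are illustrative, not the exact diff:

using Flux

# Store log(σ²) as a 1-element vector so that Flux.params can track it;
# exponentiating on use keeps σ² positive by construction.
struct Scaled{Tlogσ²<:AbstractVector{<:Real}, Tk<:Kernel} <: Kernel
    logσ²::Tlogσ²
    k::Tk
end
Scaled(σ²::Real, k::Kernel) = Scaled([log(σ²)], k)
Flux.@functor Scaled

# e.g. elementwise evaluation recovers σ² = exp(logσ²[1]):
ew(k::Scaled, x::AbstractVector) = exp(k.logσ²[1]) .* ew(k.k, x)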


Reference

[1] Shengyang Sun, Guodong Zhang, Chaoqi Wang, Wenyuan Zeng, Jiaman Li, Roger Grosse. Differentiable Compositional Kernel Learning for Gaussian Processes (2018).

@willtebbutt (Member)

Thanks for this PR. I'm really busy this week, so I'll do a proper review early next week.

@codecov

codecov bot commented Mar 6, 2020

Codecov Report

Merging #92 into wct/flux-nkn-integration will decrease coverage by 13.17%.
The diff coverage is 43.1%.


@@                    Coverage Diff                     @@
##           wct/flux-nkn-integration   #92       +/-   ##
==========================================================
- Coverage                     88.17%   75%   -13.18%     
==========================================================
  Files                            24    27        +3     
  Lines                           685   844      +159     
==========================================================
+ Hits                            604   633       +29     
- Misses                           81   211      +130
Impacted Files                    Coverage Δ
src/composite/compose.jl          61.53% <ø> (ø) ⬆️
src/Stheno.jl                     100% <ø> (ø) ⬆️
src/neural_network/basic.jl       0% <0%> (ø)
src/abstract_model.jl             0% <0%> (ø)
src/gp/neural_kernel_network.jl   0% <0%> (ø)
src/composite/composite_gp.jl     80.64% <0%> (-19.36%) ⬇️
src/gp/gp.jl                      88.23% <0%> (-11.77%) ⬇️
src/util/zygote_rules.jl          97.43% <100%> (+0.37%) ⬆️
src/abstract_gp.jl                86.36% <100%> (+1.74%) ⬆️
src/gp/mean.jl                    60% <33.33%> (-40%) ⬇️
... and 4 more

Legend: Δ = absolute <relative> (impact), ø = not affected, ? = missing data

@willtebbutt (Member)

Would you mind rebasing this on top of master so that it's easier to inspect the diff?

@HamletWantToCode (Contributor, Author)

HamletWantToCode commented Mar 10, 2020 via email

@willtebbutt (Member)

It would be really helpful. It's not really possible to review it as it currently is.

@HamletWantToCode (Contributor, Author)

I have opened a new PR, #95, since rebasing this PR failed (due to the existence of PR #94). Sorry for messing these up, and thank you for your time.

I will close this PR.
