
Deeponet multi-output fix #11

Closed

ayushinav wants to merge 17 commits into main

Conversation

@ayushinav (Contributor) commented Jul 4, 2024

Closes #9

src/display.jl (Outdated)
Comment on lines 1 to 8
# function Base.show(io::IO, model::conv) where {conv <: OperatorConv}
# # print(io, model.name*"() # "*string(Lux.parameterlength(model))*" parameters")
# print(io, model.name)
# end

# function Base.show(io::IO, ::MIME"text/plain", model::conv) where {conv <: OperatorConv}
# show(io, model.name)
# end
Member
Remove these, printing was fixed upstream

src/deeponet.jl Outdated

julia> trunk_net = Chain(Dense(1 => 8), Dense(8 => 8), Dense(8 => 16));

julia> additional = Chain(Dense(1 => 4));
Member

The input size of the additional layer should be the inner embedding size.

Member

That way it does not need a reduction/sum/dropdims before the additional layer. It should be additional = Chain(Dense(16 => 4)); here. Otherwise it creates a bottleneck and we lose information.
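Concretely (a sketch of the shapes involved, using the sizes from the snippet above): with the embedding size p = 16, the branch output is 16 x nb and the trunk output is 16 x N x nb, so their elementwise product keeps the full 16-dimensional embedding, which additional = Chain(Dense(16 => 4)) then maps down to out_dims = 4. Summing over the embedding dimension first would collapse those 16 values to 1 before the learned layer ever sees them.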

Contributor (Author)

Should be fixed now. Using a linear layer as the additional layer in the cases where no additional layer is given did not seem ideal to me, because it would imply a weighted sum whose weights are learnt during training, whereas DeepONets by default take the dot product, i.e. a non-weighted sum, which many users may require.
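To make the distinction concrete, here is a sketch with illustrative names (dot_combine, weighted_combine, and w are not in the package; w stands for the weights a linear additional layer would learn):

# b : p x nb (branch output), t : p x N x nb (trunk output)
# Default DeepONet reduction: plain, non-weighted dot product over the embedding dim p.
dot_combine(b, t) = dropdims(sum(reshape(b, size(b, 1), 1, size(b, 2)) .* t; dims=1); dims=1) # N x nb

# What a linear layer would amount to: a weighted sum over p, with w learnt in training.
weighted_combine(b, t, w) = dropdims(sum(w .* reshape(b, size(b, 1), 1, size(b, 2)) .* t; dims=1); dims=1) # N x nb

dot_combine matches the default projection shown later in this thread; defaulting to weighted_combine would silently change the operator users expect.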

@ayushinav (Contributor, Author)

Other than the doctests, the failing test cases were because the actual return type Tuple{Array{Float32, 4}, ...} does not match the inferred return type Tuple{Union{Array{Float32, 3}, Array{Float32, 4}}, ...}:

return type 
Tuple{Array{Float32, 4}, @NamedTuple{branch::@NamedTuple{layer_1::@NamedTuple{}, layer_2::@NamedTuple{}, layer_3::@NamedTuple{}}, trunk::@NamedTuple{layer_1::@NamedTuple{}, layer_2::@NamedTuple{}, layer_3::@NamedTuple{}}, additional::@NamedTuple{}}} 
does not match inferred return type 
Tuple{Union{Array{Float32, 3}, Array{Float32, 4}}, @NamedTuple{branch::@NamedTuple{layer_1::@NamedTuple{}, layer_2::@NamedTuple{}, layer_3::@NamedTuple{}}, trunk::@NamedTuple{layer_1::@NamedTuple{}, layer_2::@NamedTuple{}, layer_3::@NamedTuple{}}, additional::@NamedTuple{}}}

The tests pass if I comment out the @inferred and @jet cases.

@avik-pal (Member) commented Jul 9, 2024

Tuple{Union{Array{Float32, 3}, Array{Float32, 4}}, @NamedTuple{branch::@NamedTuple{layer_1::@NamedTuple{}, layer_2::@NamedTuple{}, layer_3::@NamedTuple{}}, trunk::@NamedTuple{layer_1::@NamedTuple{}, layer_2::@NamedTuple{}, layer_3::@NamedTuple{}}, additional::@NamedTuple{}}}

This is bad. You are returning a 3D array / 4D array based on the input sizes (which won't be type-inferred). Avoid doing the dropdims.
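As a minimal sketch of the problem (rank_dependent is a hypothetical toy function, not the package code): when the returned rank depends on a runtime size check, Julia can only infer a Union of array types, which is exactly what Test.@inferred rejects.

using Test

function rank_dependent(x::AbstractArray{T, 3}) where {T}
    if size(x, 2) == 1
        return dropdims(x; dims=2) # 2D result on this path
    else
        return x # 3D result on this path
    end
end

x = rand(Float32, 4, 2, 3)
rank_dependent(x)           # fine at runtime
@inferred rank_dependent(x) # errors: inferred Union{Matrix{Float32}, Array{Float32, 3}}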

@ayushinav (Contributor, Author) commented Jul 10, 2024

You are returning a 3D array / 4D array based on the input sizes (which won't be type inferred). Avoid doing the dropdims

Not sure how this might be an issue, because the scalar tests, which call the same dropdims,

@inline function __project(b::AbstractArray{T1, 2}, t::AbstractArray{T2, 3},
        additional::Nothing) where {T1, T2}
    # b : p x nb
    # t : p x N x nb
    b_ = reshape(b, size(b, 1), 1, size(b, 2)) # p x 1 x nb
    return dropdims(sum(b_ .* t; dims=1); dims=1) # N x nb
end

still pass, while the Scalar II and Vector Additional layer tests, which call

@inline function __project(
        b::AbstractArray{T1, 3}, t::AbstractArray{T2, 3}, additional::T) where {T1, T2, T}
    # b : p x u x nb
    # t : p x N x nb

    if size(b, 2) == 1 || size(t, 2) == 1
        return additional(b .* t) # p x N x nb => out_dims x N x nb
    else
        b_ = reshape(b, size(b)[1:2]..., 1, size(b, 3)) # p x u x 1 x nb
        t_ = reshape(t, size(t, 1), 1, size(t)[2:end]...) # p x 1 x N x nb

        return additional(b_ .* t_) # p x u x N x nb => out_size x N x nb
    end
end

fail.
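One way to make the 3D method inferable (a sketch, not the fix adopted in this PR; __project_stable is a hypothetical name, and it assumes additional can always accept the full 4D p x u x N x nb array, including when u == 1): remove the runtime size check so every path produces the same rank.

@inline function __project_stable(
        b::AbstractArray{T1, 3}, t::AbstractArray{T2, 3}, additional::T) where {T1, T2, T}
    # b : p x u x nb
    # t : p x N x nb
    b_ = reshape(b, size(b)[1:2]..., 1, size(b, 3)) # p x u x 1 x nb
    t_ = reshape(t, size(t, 1), 1, size(t)[2:end]...) # p x 1 x N x nb
    return additional(b_ .* t_) # always p x u x N x nb => out_size x N x nb
end

With no value-dependent branch, the inferred return type is whatever additional produces for a 4D input, so @inferred and @jet can pass.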

@ayushinav (Contributor, Author)

Only the doctests fail for now.

ayushinav requested a review from avik-pal on Jul 10, 2024, 05:08
@avik-pal (Member)

Rebase with the latest changes to main.

@avik-pal (Member)

Set your git config to rebase on pull instead of merge; otherwise the commit history gets royally messed up.
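For reference, the relevant setting is git config --global pull.rebase true, which makes git pull rebase onto the fetched branch instead of creating merge commits.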

codecov bot commented Jul 12, 2024

Codecov Report

Attention: Patch coverage is 83.33333% with 5 lines in your changes missing coverage. Please review.

Project coverage is 93.20%. Comparing base (aaf7d45) to head (a8149e2).

Files          Patch %   Lines
src/utils.jl   80.76%    5 Missing ⚠️

❗ There is a different number of reports uploaded between BASE (aaf7d45) and HEAD (a8149e2): HEAD has 1 upload less than BASE (5 uploads vs. 4).
Additional details and impacted files
@@             Coverage Diff             @@
##              main      #11      +/-   ##
===========================================
- Coverage   100.00%   93.20%   -6.80%     
===========================================
  Files            7        7              
  Lines           77      103      +26     
===========================================
+ Hits            77       96      +19     
- Misses           0        7       +7     


@ayushinav closed this Jul 17, 2024
Successfully merging this pull request may close these issues: DeepOnet Multiple output.