
DiffEqFlux Layers don't satisfy Lux API #727

Closed
avik-pal opened this issue Jun 18, 2022 · 15 comments

@avik-pal
Member

avik-pal commented Jun 18, 2022

The DiffEqFlux layers need to satisfy https://lux.csail.mit.edu/dev/api/core/#Lux.AbstractExplicitLayer, else the parameters/states returned from Lux.setup will be incorrect. As pointed out on Slack:

julia> ps, st = Lux.setup(rng, Chain(node,Dense(2=>3)))
((layer_1 = NamedTuple(), layer_2 = (weight = Float32[0.11987843 -0.1679378; 0.36991563 0.41324985; 0.73272866 0.7062624], bias = Float32[0.0; 0.0; 0.0;;])), (layer_1 = NamedTuple(), layer_2 = NamedTuple()))

ps.layer_1 should not be an empty NamedTuple

https://lux.csail.mit.edu/dev/manual/interface/ is the most recent manual for the interface.
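
For reference, a minimal sketch of how the node above might have been constructed (the architecture, tspan, and solver here are illustrative assumptions, not the original code):

using DiffEqFlux, Lux, OrdinaryDiffEq, Random

rng = Random.default_rng()
# A NeuralODE wrapping a small Lux model; this plays the role of `node` in the Chain above
node = NeuralODE(Lux.Dense(2 => 2, tanh), (0.0f0, 1.0f0), Tsit5(); save_everystep = false)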

@YichengDWu
Contributor

YichengDWu commented Jun 19, 2022

Should be easy

import Lux: initialparameters, initialstates, parameterlength, statelength
using Random: AbstractRNG

# Forward the Lux interface to the wrapped model
initialparameters(rng::AbstractRNG, node::NeuralODE) = initialparameters(rng, node.model)
initialstates(rng::AbstractRNG, node::NeuralODE) = initialstates(rng, node.model)
parameterlength(node::NeuralODE) = parameterlength(node.model)
statelength(node::NeuralODE) = statelength(node.model)

To make setup work not only for Chain but also directly on NeuralODE, we need to add

function setup(rng::AbstractRNG, node::NeuralODE)
    return (initialparameters(rng, node), initialstates(rng, node))
end

@ChrisRackauckas
Member

Are you supposed to overload setup? I assume that should just follow from the interface.

@avik-pal
Member Author

You just need to define

initialparameters(rng::AbstractRNG, node::NeuralODE) = initialparameters(rng, node.model) 
initialstates(rng::AbstractRNG, node::NeuralODE) = initialstates(rng, node.model)

@ChrisRackauckas
Member

We should put an abstract type on all of the AbstractNeuralDE types and then overload from there.
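
A rough sketch of that approach (the supertype name, and the assumption that each layer stores its network in a model field, are illustrative):

using Lux
using Random: AbstractRNG

abstract type AbstractNeuralDELayer <: Lux.AbstractExplicitLayer end

# Each concrete neural-DE layer (NeuralODE, NeuralDSDE, ...) subtypes this,
# so the interface only has to be forwarded once.
Lux.initialparameters(rng::AbstractRNG, layer::AbstractNeuralDELayer) =
    Lux.initialparameters(rng, layer.model)
Lux.initialstates(rng::AbstractRNG, layer::AbstractNeuralDELayer) =
    Lux.initialstates(rng, layer.model)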

@YichengDWu
Contributor

You just need to define

initialparameters(rng::AbstractRNG, node::NeuralODE) = initialparameters(rng, node.model) 
initialstates(rng::AbstractRNG, node::NeuralODE) = initialstates(rng, node.model)

For it to work, yes. Would it be nicer if the number of parameters could be printed automatically?

@YichengDWu
Contributor

Are you supposed to overload setup? I assume that should just follow from the interface.

I was assuming NeuralODE was not a subtype of AbstractExplicitLayer. It should be unnecessary if you are going to subtype it.

@avik-pal
Member Author

No, even if you are not subtyping, initialparameters and initialstates are the only functions that must be implemented; parameterlength and statelength are optional. setup should never be extended.
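
Once those two methods are defined, the usual entry point just works; a quick sketch using the layers from the original example:

ps, st = Lux.setup(rng, node)                        # works directly on the NeuralODE
ps, st = Lux.setup(rng, Chain(node, Dense(2 => 3)))  # and inside containers, with ps.layer_1 now populated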

@YichengDWu
Contributor

YichengDWu commented Jun 20, 2022

I would appreciate it if you could help me understand two questions:

  1. Is it still mandatory to implement initialstates if I just have one layer and just need to return NamedTuple()? I have implemented some layers without it. It looks like the source code just falls back to initialstates(::AbstractRNG, ::Any) = NamedTuple().
  2. What are the bad consequences of extending setup?

@avik-pal
Member Author

It is meant to satisfy an interface.

  1. You are right, the default for initialstates is NamedTuple(), but this is undocumented, so it can be changed without being considered breaking.
  2. Extending setup is not going to solve problems for most people and sets false expectations. For example, if you extend setup for a layer that is contained inside another layer, calling Lux.setup on the outer layer will still leave the internal custom layer with empty parameters and states (see the sketch below).

@YichengDWu
Contributor

I highly appreciate the clarification.

@ChrisRackauckas
Member

Flux doesn't care about the subtyping but Lux does, so we should subtype for Lux and then also make it a functor and we're 👍.
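
A sketch of the Flux side of that (assuming the wrapped network lives in a model field):

using Functors: @functor

# Lets Flux/Functors traverse the wrapped network, while the Lux behaviour comes from the subtyping
@functor NeuralODE (model,)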

@ChrisRackauckas
Member

Copying over from #735. All should be an AbstractExplicitLayer, which means they should do things exactly like Dense. They should have one state, take in a state, and return a state. They should take in a neural network definition and give you back a state from setup. Basically, it should act exactly like Dense does, and be able to perfectly swap in without any other code changes, and if not it's wrong. The only thing that should be different is the constructor for the layer.
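
In other words, the expected call convention is the one Dense already follows; a sketch:

using Lux, Random

rng = Random.default_rng()
x = rand(rng, Float32, 2, 1)

layer = Dense(2 => 3)            # a NeuralODE should be usable in exactly the same way
ps, st = Lux.setup(rng, layer)   # parameters and state both come from setup
y, st = layer(x, ps, st)         # every call takes a state and returns a state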

@Abhishek-1Bhatt let me know if you need me to do the first one.

@avik-pal
Member Author

Once it gets built, http://lux.csail.mit.edu/previews/PR70/manual/interface should describe the recommended Lux interface. For DiffEqFlux, everything should really be a subtype of http://lux.csail.mit.edu/stable/api/core/#Lux.AbstractExplicitContainerLayer, and there would be no need to define initialparameters and initialstates. (Just a heads up: there will be a small breaking change for the container layers in v0.5, which is still far out.)
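
A sketch of what that would look like for NeuralODE (the model field name is assumed; other fields are elided):

using Lux

# Declaring the wrapped field lets Lux derive initialparameters/initialstates
# (and the parameter counts in printing) from the inner network automatically.
struct NeuralODE{M} <: Lux.AbstractExplicitContainerLayer{(:model,)}
    model::M
    # tspan, solver, kwargs, ... elided
end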

@ba2tro
Contributor

ba2tro commented Jun 23, 2022

Ah yes, I had thought about it some time ago (https://julialang.slack.com/archives/C7T968HRU/p1655536943724979?thread_ts=1655535510.205359&cid=C7T968HRU), but we didn't discuss it, so I ended up subtyping AbstractExplicitLayer.

@ChrisRackauckas
Member

Done in #750
