
conv gradient is not implemented in EnzymeJAX #214

Closed
yolhan83 opened this issue Nov 1, 2024 · 9 comments

yolhan83 commented Nov 1, 2024

Hello, I wonder if Reactant works with Conv layers. It seems to work in the forward pass but not in the gradient pass, neither on CPU nor GPU.

Version:

Julia Version 1.10.6
Commit 67dffc4a8ae (2024-10-28 12:23 UTC)
Build Info:
  Official https://julialang.org/ release
Platform Info:
  OS: Linux (x86_64-linux-gnu)
  CPU: 20 × 12th Gen Intel(R) Core(TM) i7-12700H
  WORD_SIZE: 64
  LIBM: libopenlibm
  LLVM: libLLVM-15.0.7 (ORCJIT, alderlake)
Threads: 1 default, 0 interactive, 1 GC (on 20 virtual cores)

Code:

import Enzyme
using Lux,Reactant,Statistics,Random

const rng = Random.default_rng(123)
const dev = xla_device()
model_conv = Chain(
    Conv((3,3),1=>8,pad=SamePad(),relu), 
    Lux.FlattenLayer(),
    Dense(32*32*8,10),
    softmax
)
model_dense = Chain(
    Lux.FlattenLayer(),
    Dense(32*32*1,10),
    softmax
)

ps_conv,st_conv = Lux.setup(rng, model_conv) |> dev
ps_dense,st_dense = Lux.setup(rng, model_dense) |> dev

loss(model,x,ps,st,y) = Lux.MSELoss()(first(model(x,ps,st)),y)
x = randn(rng, Float32, 32,32,1,100) |> dev
y = randn(rng, Float32, 10,100) |> dev

function get_grad(loss,model,x,ps,st,y)
    dps = Enzyme.make_zero(ps)
    Enzyme.autodiff(Enzyme.Reverse,loss,Enzyme.Const(model),Enzyme.Const(x),Enzyme.Duplicated(ps,dps),Enzyme.Const(st),Enzyme.Const(y))
    return dps
end

loss_compile_conv = @compile loss(model_conv,x,ps_conv,st_conv,y) # works
loss_compile_dense = @compile loss(model_dense,x,ps_dense,st_dense,y) # works

grad_compile_conv = @compile get_grad(loss,model_conv,x,ps_conv,st_conv,y) #doesn't work
grad_compile_dense = @compile get_grad(loss,model_dense,x,ps_dense,st_dense,y) # works
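
(For reference, once @compile succeeds, the thunk it returns is called with the same arguments that were passed at compile time; a minimal usage sketch, assuming the definitions above:)

l_conv = loss_compile_conv(model_conv, x, ps_conv, st_conv, y)               # forward pass runs
dps_dense = grad_compile_dense(loss, model_dense, x, ps_dense, st_dense, y)  # dense gradient runs
# grad_compile_conv is never produced: compilation aborts with the error below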

Error:

error: expects input feature dimension (8) / feature_group_count = kernel input feature dimension (1). Got feature_group_count = 1.
ERROR: "failed to run pass manager on module"
Stacktrace:
  [1] run!
    @ ~/.julia/packages/Reactant/rRa4g/src/mlir/IR/Pass.jl:70 [inlined]
  [2] run_pass_pipeline!(mod::Reactant.MLIR.IR.Module, pass_pipeline::String)
    @ Reactant.Compiler ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:241
  [3] compile_mlir!(mod::Reactant.MLIR.IR.Module, f::Function, args::Tuple{…}; optimize::Bool)
    @ Reactant.Compiler ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:272
  [4] compile_mlir!
    @ ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:256 [inlined]
  [5] (::Reactant.Compiler.var"#30#32"{typeof(get_grad), Tuple{…}})()
    @ Reactant.Compiler ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:584
  [6] context!(f::Reactant.Compiler.var"#30#32"{typeof(get_grad), Tuple{…}}, ctx::Reactant.MLIR.IR.Context)
    @ Reactant.MLIR.IR ~/.julia/packages/Reactant/rRa4g/src/mlir/IR/Context.jl:71
  [7] compile_xla(f::Function, args::Tuple{…}; client::Nothing)
    @ Reactant.Compiler ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:581
  [8] compile_xla
    @ ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:575 [inlined]
  [9] compile(f::Function, args::Tuple{…}; client::Nothing)
    @ Reactant.Compiler ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:608
 [10] compile(f::Function, args::Tuple{…})
    @ Reactant.Compiler ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:607
 [11] top-level scope
    @ ~/.julia/packages/Reactant/rRa4g/src/Compiler.jl:368
avik-pal (Collaborator) commented Nov 1, 2024

The feature_group_count is probably missing on the EnzymeJAX end?

cc @Pangoraw
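
(For context on what the verifier is checking, not part of the original comment: StableHLO's convolution op requires the input feature dimension to equal the kernel input feature dimension times feature_group_count. In the failing reverse-pass convolution the input feature dimension is 8, presumably the Conv layer's 8 output channels flowing back through the gradient, while the kernel input feature dimension and feature_group_count are both 1. A toy restatement of the failing check:)

# StableHLO convolution verifier constraint (sketch of the check reported above)
input_feature_dim = 8           # features entering the gradient convolution
kernel_input_feature_dim = 1
feature_group_count = 1
input_feature_dim == kernel_input_feature_dim * feature_group_count  # false: 8 != 1 * 1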

Pangoraw (Collaborator) commented Nov 1, 2024

The reverse is not implemented for convolution. It should be fixed in Enzyme-JAX
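
(In the meantime, a possible workaround for the reproducer above is to take the conv gradient outside Reactant on plain CPU arrays, assuming Enzyme's NNlib rules cover that path; a minimal untested sketch reusing the model and loss defined above:)

# Workaround sketch (hypothetical): differentiate on ordinary CPU arrays,
# bypassing Reactant/EnzymeJAX entirely.
ps_cpu, st_cpu = Lux.setup(rng, model_conv)     # keep parameters on the CPU
x_cpu = randn(rng, Float32, 32, 32, 1, 100)
y_cpu = randn(rng, Float32, 10, 100)
dps_cpu = Enzyme.make_zero(ps_cpu)
Enzyme.autodiff(Enzyme.Reverse, loss, Enzyme.Const(model_conv), Enzyme.Const(x_cpu),
                Enzyme.Duplicated(ps_cpu, dps_cpu), Enzyme.Const(st_cpu), Enzyme.Const(y_cpu))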

yolhan83 (Author) commented Nov 1, 2024

Oh ok, I will wait before doing my MNIST benchmark then, have a nice day.

wsmoses (Member) commented Nov 2, 2024

@Pangoraw do you want to bump the commits of Enzyme-JAX and Reactant and bump the JLL?

Setup needs to bump:
https://github.com/EnzymeAD/Enzyme-JAX/blob/6109edcd3f1d04c9f19f7a30d0e493ffae5ba417/workspace.bzl#L4

then

ENZYMEXLA_COMMIT = "2f1a70349297a21ce67f41cc94ff305dd0aef5d4"

then
https://github.com/JuliaPackaging/Yggdrasil/blob/ff9587a0df76e20171efd10076a59395c4fad5dd/R/Reactant/build_tarballs.jl#L12

mofeing (Collaborator) commented Nov 2, 2024

@wsmoses @Pangoraw can we do a Reactant release after the new JLL lands?

avik-pal changed the title from "How to use Reactant on Conv layers" to "conv gradient is not implemented in EnzymeJAX" on Nov 8, 2024
wsmoses (Member) commented Nov 22, 2024

@Pangoraw

Pangoraw (Collaborator) commented

I meant that there is a rule in https://github.com/EnzymeAD/Enzyme-JAX/blob/724be0666e0f9fbba66f6b2fddbdb222b0db5fb6/src/enzyme_ad/jax/Implementations/HLODerivatives.td#L588, but it generates invalid code instead of returning an unimplemented error.

wsmoses (Member) commented Nov 23, 2024

Ah, I read your comment of "it should be fixed in the other repo" as "it is fixed but needs a JLL bump" rather than your intended meaning: we should fix it, but the bug is in another repo.

avik-pal (Collaborator) commented Dec 6, 2024

This is fixed in the latest release

avik-pal closed this as completed on Dec 6, 2024.