
add smoke test for ffjord #621

Merged: 4 commits merged into SciML:master from the smoke-test branch on Sep 23, 2021

Conversation

@prbzrg (Member) commented Sep 17, 2021

I just added some smoke tests for FFJORD to reveal errors related to some adtypes.
They reproduce the errors reported in #588, #610, #615, and #624.
This PR can be merged once those errors are fixed.

@ChrisRackauckas (Member)

Instead, just set @test_broken for now.
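
For reference, a minimal sketch of how @test_broken from Julia's Test stdlib behaves next to a plain @test (the expressions here are illustrative, not the PR's actual tests):

using Test

@testset "illustrative smoke tests" begin
    @test 2 + 2 == 4          # passes and is counted as Pass
    # A known failure: recorded as Broken, so the test suite stays green.
    # If the expression ever starts passing, @test_broken records the
    # unexpected pass as an Error, forcing an upgrade back to @test.
    @test_broken 2 + 2 == 5
end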

@prbzrg (Member Author) commented Sep 17, 2021

Should I expand the tests and unroll the loop (so that both @test and @test_broken can be used)?

@ChrisRackauckas (Member)

Are all of the tests broken?

@prbzrg (Member Author) commented Sep 20, 2021

Regardless of adtype, using

regularize=true & monte_carlo=false

causes errors, and using

GalacticOptim.AutoForwardDiff()
GalacticOptim.AutoReverseDiff()
GalacticOptim.AutoTracker()

as the adtype is also broken.
Overall, 14 of the 20 tests are broken.
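
To make that count concrete, a small self-contained sketch of the combination grid (the symbols stand in for the real adtypes; the broken set follows the description above):

# 5 adtypes × 2 regularize × 2 monte_carlo = 20 combinations
adtypes = [:AutoZygote, :AutoFiniteDiff, :AutoForwardDiff, :AutoReverseDiff, :AutoTracker]
broken_adtypes = (:AutoForwardDiff, :AutoReverseDiff, :AutoTracker)

combos = [(ad, reg, mc) for ad in adtypes, reg in (true, false), mc in (true, false)]
is_broken((ad, reg, mc)) = ad in broken_adtypes || (reg && !mc)

count(is_broken, combos)  # 12 (three fully broken adtypes) + 2 (regularize=true & monte_carlo=false) = 14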

@ChrisRackauckas (Member)

> as the adtype is also broken.

What do you mean?

@prbzrg (Member Author) commented Sep 21, 2021

In each of #610, #615, and #624, I tested the same code with a different adtype and reported the errors.
AutoZygote and AutoFiniteDiff work fine, but the other adtypes raise different errors.

@ChrisRackauckas (Member)

Oh, with FFJORD, yes.

@ChrisRackauckas (Member)

It looks like this still isn't catching some of the errors?

@prbzrg (Member Author) commented Sep 22, 2021

If we only use @test, it reproduces all four issues, but it also makes the test suite fail. You can see the errors in the CI run of the first commit:
https://github.com/SciML/DiffEqFlux.jl/runs/3628543892?check_suite_focus=true#step:6:312
https://github.com/SciML/DiffEqFlux.jl/runs/3628543892?check_suite_focus=true#step:6:636
https://github.com/SciML/DiffEqFlux.jl/runs/3628543892?check_suite_focus=true#step:6:969
https://github.com/SciML/DiffEqFlux.jl/runs/3628543892?check_suite_focus=true#step:6:1350

I think we have two choices:

  1. Only use @test and wait until all four issues are fixed before merging this.
  2. Use both @test and @test_broken by unrolling the loop. We can merge this sooner, but the errors won't be logged in CI, and it increases the line count (more code to read and review).

I don't know whether there are other choices. With the second one, whoever wants to fix the errors can just change @test_broken back to @test and investigate the problems, but I prefer shorter code.
Which way should we go?

@ChrisRackauckas (Member)

(2) is the better way to go, because it's always better to fix things and flip them from broken than to have them documented only in unmerged PRs.
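
For illustration, a hedged sketch of what the unrolled option (2) can look like; run_smoke_test is a hypothetical stand-in for the PR's actual training-and-evaluation body:

using Test

@testset "FFJORD smoke test, AutoZygote" begin
    @test run_smoke_test(GalacticOptim.AutoZygote(); regularize = false, monte_carlo = true)
    @test run_smoke_test(GalacticOptim.AutoZygote(); regularize = true, monte_carlo = true)
    # known failure; flip back to @test when investigating a fix
    @test_broken run_smoke_test(GalacticOptim.AutoZygote(); regularize = true, monte_carlo = false)
end

@testset "FFJORD smoke test, AutoForwardDiff" begin
    # every combination is currently broken for this adtype
    @test_broken run_smoke_test(GalacticOptim.AutoForwardDiff(); regularize = false, monte_carlo = true)
end

Once an issue is fixed, flipping the corresponding line from @test_broken to @test is a one-line change.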

@prbzrg (Member Author) commented Sep 23, 2021

OK, I unrolled the loop and also converted the section comments to testsets.
Furthermore, I noticed something: there is a test for FFJORD with regularizers

###
# test for default multivariate distribution and FFJORD with regularizers
###
nn = Chain(Dense(1, 1, tanh))
data_dist = Beta(7, 7)
data_train = Float32.(rand(data_dist, 1, 100))
tspan = (0.0f0, 1.0f0)
ffjord_test = FFJORD(nn, tspan, Tsit5())

# loss includes the regularization terms λ₁ and λ₂
function loss_adjoint(θ)
    logpx, λ₁, λ₂ = ffjord_test(data_train, θ, true)
    return mean(@. -logpx + 0.1 * λ₁ + λ₂)
end

optfunc = GalacticOptim.OptimizationFunction((x, p) -> loss_adjoint(x), GalacticOptim.AutoZygote())
optprob = GalacticOptim.OptimizationProblem(optfunc, 0.01f0 .* ffjord_test.p)
res = GalacticOptim.solve(optprob, ADAM(0.1), cb=cb, maxiters=300)
θopt = res.minimizer

# compare the learned density against the true density on held-out samples
data_validate = Float32.(rand(data_dist, 1, 100))
actual_pdf = pdf.(data_dist, data_validate)
learned_pdf = exp.(ffjord_test(data_validate, θopt; monte_carlo=false)[1])
@test totalvariation(learned_pdf, actual_pdf) / size(data_validate, 2) < 0.40

but it doesn't use regularize=true!

@prbzrg (Member Author) commented Sep 23, 2021

I want to update the other tests to look more like the tests in this PR (a sketch follows below):

  1. Use a Float32 distribution instead of converting the samples.
  2. Use DiffEqFlux.sciml_train instead of the OptimizationFunction -> OptimizationProblem -> solve pipeline.

Can I update the other tests in this PR?
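
A hedged sketch of the two updates applied to the regularizer test quoted above; it assumes Distributions.jl samples in Float32 when the Beta parameters are Float32, and the DiffEqFlux.sciml_train positional signature (loss, θ, optimizer, adtype) of that era:

# 1. Float32 distribution: samples come out as Float32, no conversion needed
data_dist = Beta(7.0f0, 7.0f0)
data_train = rand(data_dist, 1, 100)

# 2. sciml_train replaces OptimizationFunction -> OptimizationProblem -> solve
res = DiffEqFlux.sciml_train(loss_adjoint, 0.01f0 .* ffjord_test.p,
                             ADAM(0.1), GalacticOptim.AutoZygote();
                             cb = cb, maxiters = 300)
θopt = res.minimizer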

@prbzrg (Member Author) commented Sep 23, 2021

I made the updates. If you disagree, I will move them to a new PR.

@prbzrg (Member Author) commented Sep 23, 2021

Tests are failing. I just undid the last two commits and will change the other tests in a new PR.

@ChrisRackauckas merged commit d21eb9d into SciML:master on Sep 23, 2021
@ChrisRackauckas (Member)

Thanks!

@prbzrg deleted the smoke-test branch on September 26, 2021 at 15:00