
Use "differentiable optimization trick" for backpropagation through tangent vector field calculation #74

Open
mfschubert opened this issue Jan 10, 2024 · 5 comments

Comments

@mfschubert
Collaborator

Currently, we directly backpropagate through the tangent vector field calculation, which involves a Newton solve to find the minimum of a convex quadratic objective. It may be more efficient to define a custom gradient for this operation, in a manner similar to what is done for differentiable optimization.
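
For reference, a minimal sketch of the idea (illustrative only, not the project's actual code): for a convex quadratic objective, the minimizer satisfies a stationarity condition, and the implicit function theorem gives the gradient without differentiating through the Newton iterations. The `objective`, `minimize`, and the toy `A(p)` below are assumptions made for the example.

```python
import jax
import jax.numpy as jnp

# Illustrative convex quadratic f(t, p) = 0.5 * t^T A(p) t - b(p)^T t.
# A(p) and b(p) are placeholders for the actual problem-specific quantities.
def objective(t, p):
    A = jnp.eye(t.size) + jnp.outer(p, p)  # positive definite by construction
    b = p
    return 0.5 * t @ A @ t - b @ t

@jax.custom_vjp
def minimize(p, t0):
    # Forward pass: plain Newton iterations (one step is exact for a quadratic).
    def newton_step(t, _):
        g = jax.grad(objective)(t, p)
        H = jax.hessian(objective)(t, p)
        return t - jnp.linalg.solve(H, g), None

    t, _ = jax.lax.scan(newton_step, t0, None, length=3)
    return t

def minimize_fwd(p, t0):
    t = minimize(p, t0)
    return t, (p, t)

def minimize_bwd(res, t_bar):
    # Implicit function theorem: at the optimum grad_t objective(t*, p) = 0, so
    # dt*/dp = -H^{-1} d(grad_t objective)/dp, and the vector-Jacobian product
    # is assembled from one Hessian solve plus one VJP of the gradient in p.
    p, t = res
    H = jax.hessian(objective)(t, p)
    w = jnp.linalg.solve(H, t_bar)
    _, vjp_p = jax.vjp(lambda p_: jax.grad(objective)(t, p_), p)
    (p_bar,) = vjp_p(-w)
    return p_bar, jnp.zeros_like(t)  # no gradient with respect to the initial guess

minimize.defvjp(minimize_fwd, minimize_bwd)

# Gradients of any loss built on `minimize` now bypass the Newton iterations.
loss = lambda p: jnp.sum(minimize(p, jnp.zeros(3)) ** 2)
grad_p = jax.grad(loss)(jnp.array([0.1, 0.2, 0.3]))
```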

@mfschubert
Collaborator Author

I am seeing some issues with very long compile times in the optimization context, which are eliminated when we use a stop_gradient before the vector field calculation. I am thinking we should just add this stop_gradient for now, and then restore the ability to backpropagate through vector field generation via the method mentioned above. This might be fairly involved and would take time. FYI @smartalecH
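
For reference, the workaround is simply jax.lax.stop_gradient applied to the inputs of the vector field calculation, so the Newton solve never enters the backward trace. A minimal sketch, with `compute_tangent_field` and `loss` as stand-in names (not the actual functions):

```python
import jax
import jax.numpy as jnp

def compute_tangent_field(density):
    # Stand-in for the actual tangent vector field calculation, which
    # internally involves a Newton solve.
    return jnp.stack(jnp.gradient(density))

def loss(density):
    # Block gradients flowing into the field calculation; the field is then
    # treated as a constant with respect to the design variables, so the
    # Newton solve is never traced during backpropagation.
    field = compute_tangent_field(jax.lax.stop_gradient(density))
    return jnp.sum(field ** 2) + jnp.sum(density ** 2)

grad = jax.grad(loss)(jnp.ones((8, 8)))  # only the second term contributes
```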

@smartalecH
Contributor

Yep, this sounds like a good plan to me. How hard do we anticipate the manual adjoint will be?

@mfschubert
Collaborator Author

I am looking at it a bit, and it might actually be relatively straightforward. Here's a reference that seems nice; it even includes JAX code: https://implicit-layers-tutorial.org/implicit_functions/

@mfschubert
Collaborator Author

@smartalecH @Luochenghuang I have things working here; all it needed was a bit of regularization.

https://github.com/mfschubert/mewtax

@mfschubert
Collaborator Author

I think we may want to put this on hold for now: the potential accuracy improvement is small, and there is a speed penalty.

  • I added a test in Test AD gradient against finite difference gradient #94, which checks the finite-difference (FD) gradient against the automatic-differentiation (AD) gradient. They are very close as-is, i.e. even with the stop_gradient in the vector field calculation (a minimal sketch of such a check follows this list).
  • I tested using mewtax to solve for the vector fields, but this seems to make the tests much slower (roughly 2x the time to complete all tests). I suspect there is a significant compile-time penalty.
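
For illustration, a minimal version of such an FD-versus-AD check (the loss below is a stand-in, not the project's figure of merit):

```python
import jax
import jax.numpy as jnp

jax.config.update("jax_enable_x64", True)  # double precision for a tight FD check

def loss(x):
    # Stand-in scalar loss; in the project this would be the full figure of merit.
    return jnp.sum(jnp.sin(x) * x ** 2)

x = jnp.linspace(0.1, 1.0, 5)
ad_grad = jax.grad(loss)(x)

# Central finite differences, one element at a time.
eps = 1e-6
fd_grad = jnp.array([
    (loss(x.at[i].add(eps)) - loss(x.at[i].add(-eps))) / (2 * eps)
    for i in range(x.size)
])

assert jnp.allclose(ad_grad, fd_grad, rtol=1e-4)
```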
