Rule for gradient accumulation #137
Conversation
test/rules.jl (Outdated)
```julia
x0 = rand(5)
x = copy(x0)
lr = 0.01
tree = Optimisers.setup(AccumGrad(Descent(lr), 3), x)
```
A torture test would use something with a large momentum, so that it fails if the rule is applied 3 times, rather than once for the total gradient. But I'm not sure that's necessary.
Co-authored-by: Michael Abbott <32575566+mcabbott@users.noreply.github.com>
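Not part of the thread, but a minimal sketch of the torture test suggested above, assuming the `AccumGrad(inner, n)` constructor as it appears in this (outdated) diff, the existing `Momentum` rule, and that the rule averages the accumulated gradient over `n` steps:

```julia
using Optimisers, Test

function accum_steps(x0, grads)
    x = copy(x0)
    # AccumGrad(inner, n): constructor as used in this outdated diff.
    tree = Optimisers.setup(AccumGrad(Momentum(0.1, 0.99), 3), x)
    for g in grads
        tree, x = Optimisers.update(tree, x, g)
    end
    return x
end

x0 = rand(5)
grads = [rand(5) for _ in 1:3]

# Reference: a single Momentum step on the averaged gradient.
xref = copy(x0)
tref = Optimisers.setup(Momentum(0.1, 0.99), xref)
_, xref = Optimisers.update(tref, xref, sum(grads) ./ 3)

# With ρ = 0.99 the velocity state matters, so this test fails if
# Momentum is instead applied once per partial gradient.
@test accum_steps(x0, grads) ≈ xref
```

Because `Momentum` carries velocity state, three separate steps leave different internal state than one step on the mean gradient, which is what makes this version a torture test where plain `Descent` is not.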
""" | ||
AccumGrad(n::Int) |
Minor typo for rendering
```diff
-AccumGrad(n::Int)
+    AccumGrad(n::Int)
```
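For context, the single-argument form documented above is meant to be composed with another rule; a minimal usage sketch, assuming `AccumGrad(3)` averages gradients over its window and composes via `OptimiserChain`:

```julia
using Optimisers

x = rand(5)
# The inner rule only sees the averaged gradient every 3rd step.
tree = Optimisers.setup(OptimiserChain(AccumGrad(3), Adam(1e-3)), x)

for step in 1:9
    g = rand(5)  # stand-in for a real gradient
    global tree, x = Optimisers.update(tree, x, g)  # global: script-level loop
end
```

This mirrors the usual gradient-accumulation pattern: several small batches contribute gradients, and the wrapped optimiser takes one step on their average, as if a single larger batch had been used.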
Fix #130
PR Checklist