backward_pass now clears nodes which will not be used #221
Attempting to help with the issues mentioned in #219.
This change removes references to Array nodes that will not be revisited, since they have already passed all of their gradient information to their parents.
In the special case of the start_node, the gradient is not lost: setting outgrads[node] to None only deletes the dictionary's reference, not the underlying value. So as long as cur_outgrad holds a reference to the gradient, all is fine.
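To illustrate the idea, here is a minimal, simplified sketch of a reverse-mode backward pass that clears each node's entry in `outgrads` once the node has been processed. This is not autograd's actual implementation; the `Node` class, the `(parent, vjp)` pair representation, and the `topo_order` argument are simplifications for demonstration. The key point is the line that sets `outgrads[node] = None`: the dict drops its reference so the node's accumulated gradient can be garbage-collected once no one else needs it, while `cur_outgrad` keeps the value alive for the current step.

```python
class Node:
    """A graph node. `parents` is a list of (parent_node, vjp) pairs,
    where vjp maps this node's output gradient to the parent's
    gradient contribution (a vector-Jacobian product)."""
    def __init__(self, parents=()):
        self.parents = list(parents)

def backward_pass(g, topo_order):
    """Accumulate gradients over `topo_order`, a list of nodes from the
    output node to the inputs (reverse topological order), starting
    from output gradient `g`. Returns the gradient at the last node."""
    outgrads = {topo_order[0]: g}
    for node in topo_order:
        cur_outgrad = outgrads[node]
        # Clear the dict's reference so the gradient can be freed once
        # cur_outgrad goes out of scope; the value itself is not deleted.
        outgrads[node] = None
        for parent, vjp in node.parents:
            contrib = vjp(cur_outgrad)
            prev = outgrads.get(parent)
            outgrads[parent] = contrib if prev is None else prev + contrib
    return cur_outgrad

# Example: y = 3*x + 2*x, so dy/dx = 5.
x = Node()
a = Node([(x, lambda g: 3 * g)])   # a = 3*x
b = Node([(x, lambda g: 2 * g)])   # b = 2*x
y = Node([(a, lambda g: g), (b, lambda g: g)])  # y = a + b
print(backward_pass(1.0, [y, a, b, x]))  # → 5.0
```

In a long graph this keeps only the gradients of nodes still waiting to be visited, rather than every gradient ever computed, which is where the memory savings come from.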
On a test dataset, using a proprietary algorithm that can be roughly described as two RNNs feeding into each other, I obtained the following gains in performance:
https://cloud.githubusercontent.com/assets/6620250/25814415/6a8f085a-33eb-11e7-9237-ce58fa09300a.png
In addition, I compared the numerical results of my change against the previous version, and they were identical.