Is your feature request related to a problem? Please describe.
Currently the Hamilton Executor holds onto everything until the end. This is problematic if one is processing immutable, large objects.
Describe the solution you'd like
Hamilton should prune everything that's not needed. Two levels of improvement:
(1) At every step of the DAG, do a GC pass. That is, determine every node whose dependent nodes (a) are on the path being executed and (b) have all been computed, and delete those results from the results dictionary. We should be able to do this when we compute a node -- loop through its dependencies and check whether each is still needed (see hamilton/hamilton/execution/graph_functions.py, line 146 at commit 28c955e). Note we have a PR that does this for parallelism -- it's quite a bit simpler there, as it's just a matter of pruning the nodes that are not needed by the tasks: Garbage collection/memory optimization #374.
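To make (1) concrete, here is a minimal sketch of per-step garbage collection via dependency counting. This is not Hamilton's actual executor -- `execute_with_gc`, the graph shape, and the node callables are all illustrative assumptions:

```python
# Hypothetical sketch: free each node's result as soon as every
# downstream consumer has been computed, instead of holding the whole
# results dict until the end. Not Hamilton's real executor API.
from typing import Any, Callable, Dict, List


def execute_with_gc(
    topo_order: List[str],
    dependencies: Dict[str, List[str]],
    compute: Dict[str, Callable[..., Any]],
    final_vars: List[str],
) -> Dict[str, Any]:
    # Count how many not-yet-computed nodes still need each node's result.
    remaining_uses: Dict[str, int] = {n: 0 for n in topo_order}
    for deps in dependencies.values():
        for d in deps:
            remaining_uses[d] += 1

    results: Dict[str, Any] = {}
    for node in topo_order:
        inputs = [results[d] for d in dependencies.get(node, [])]
        results[node] = compute[node](*inputs)
        # After consuming each dependency, decrement its use count and
        # delete it once nothing downstream needs it (unless the user
        # explicitly requested it as an output).
        for d in dependencies.get(node, []):
            remaining_uses[d] -= 1
            if remaining_uses[d] == 0 and d not in final_vars:
                del results[d]
    return {v: results[v] for v in final_vars}
```

For a chain `a -> b -> c` requesting only `c`, the results for `a` and `b` are deleted as soon as their single consumer runs, so at most one large intermediate is alive at a time.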
(2) We could actually compute any chains as one -- e.g. in a loop. A "closed" chain could be fused, so as not to hold onto any intermediate memory. This is effectively the same as handling it in the results dict, although the dict makes garbage collection a little gnarlier...
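A minimal sketch of what fusing a closed linear chain could look like -- `fuse_chain` is a hypothetical helper, not an existing Hamilton function:

```python
# Hypothetical sketch: collapse a linear chain of single-input nodes
# into one callable that runs in a loop, keeping only the running
# value so each intermediate becomes garbage immediately.
from typing import Any, Callable, List


def fuse_chain(fns: List[Callable[[Any], Any]]) -> Callable[[Any], Any]:
    """Fuse a chain of single-input node functions into one callable."""
    def fused(value: Any) -> Any:
        for fn in fns:
            # Rebinding `value` drops the reference to the previous
            # intermediate, so it can be collected right away.
            value = fn(value)
        return value
    return fused
```

With this, a fused chain never touches the results dictionary at all; only its final output is stored.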
So, let's start with (1) and see how that helps us.