
Reuse GPU E-vecs #1648

Merged — 2 commits merged into main from jeremy/cuda-reuse-out on Aug 29, 2024
Conversation

jeremylt (Member) commented Aug 22, 2024

This PR reuses active input E-vector buffers for output E-vectors where possible. These are among the largest memory allocations that the user doesn't see.

Adding this for AtPoints is a bit trickier; it is probably best left as a follow-up.

Note: manually tested with Ratel as well.
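To illustrate the idea, here is a minimal sketch in C of reusing an input E-vector allocation for an output E-vector. This is not the actual backend code; the names (EVec, get_output_evec, in_use) are hypothetical, and the sketch only shows the reuse decision: once an input's element data has been consumed, an output of the same length can write into that allocation instead of triggering a new one.

```c
/* Hypothetical sketch of E-vector buffer reuse; not libCEED API. */
#include <stdlib.h>

typedef struct {
  double *data;
  size_t  len;     /* number of entries in the E-vector */
  int     in_use;  /* nonzero while the input field still needs this buffer */
} EVec;

/* Return a buffer for an output E-vector, preferring to reuse an input
 * buffer of the same length whose data has already been consumed. */
static double *get_output_evec(EVec *inputs, int num_inputs, size_t len) {
  for (int i = 0; i < num_inputs; i++) {
    if (!inputs[i].in_use && inputs[i].len == len) {
      /* Reuse: the output basis application writes into the same allocation. */
      return inputs[i].data;
    }
  }
  /* No compatible buffer is free; fall back to a fresh allocation. */
  return malloc(len * sizeof(double));
}
```

On the GPU this avoids holding separate device allocations for input and output element data at the same time, which is where the memory savings described above come from.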

jeremylt self-assigned this Aug 22, 2024
jeremylt force-pushed the jeremy/cuda-reuse-out branch 3 times, most recently from 95c081e to bcaca9e on August 26, 2024
jeremylt (Member, Author) commented:
I did not add this change on the CPU side: a 1- or 8-element block for the Opt backends is negligible memory, so the extra code complexity doesn't seem worth it compared to the memory savings on the GPU side.

jeremylt merged commit 71ed691 into main on Aug 29, 2024
28 checks passed
jeremylt deleted the jeremy/cuda-reuse-out branch on August 29, 2024