GPU support #4

adayton1 · 2024-11-13T17:45:38Z

Are there any plans for adding GPU support to verdict? My GPU accelerated code is hitting a big slowdown when I have to switch to host only execution to call some verdict functions. It looks to me as if porting verdict would mostly involve adding __host__ __device__ specifiers to functions and switching from std:: math functions to the corresponding c versions.

The text was updated successfully, but these errors were encountered:

clintonstimpson · 2024-11-15T15:39:00Z

Hi @adayton1, we do not currently have plans to add GPU support to Verdict. Though there is a chance we could soon use Verdict on the GPU ourselves. Do you have specific changes you want to see?

adayton1 · 2024-11-19T19:15:21Z

I use the hex and quad functions. But it's trivial enough to port the whole library. I put up pull request #5

dtaller · 2024-12-16T22:18:58Z

Hi. Do you know if the verdict pull request with GPU support, #5 will get merged in?

clintonstimpson · 2024-12-16T22:22:42Z

Yes, it will be merged, and was merged last week internally. We'll be pushing a new update to the verdict source to this github repo soon.

dtaller · 2024-12-16T22:26:50Z

Thanks!

liu15 · 2024-12-19T17:29:08Z

I've been testing Alan's implementation with ROCM on mi300a (rzadams) and found that the stack memory was getting exhausted for the hex_distortion (and quad_distortion) calls unless I specified 128K(!) gpu stack memory. This was due to the arrays that are sized maxTotalNumberGaussPointsmaxNumberNodes and maxNumberNodesmaxNumberNodes.

For our use, we only have linear elements, so setting maxTotalNumberGaussPoints and maxNumberNodes to 8 fixed the issue for us, but obviously that is not a general solution.

I'm not sure what is the preferred "correct" fix: compile-time switch, multiple implementations, or a templated API.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

GPU support #4

GPU support #4

adayton1 commented Nov 13, 2024 •

edited

Loading

clintonstimpson commented Nov 15, 2024

adayton1 commented Nov 19, 2024

dtaller commented Dec 16, 2024

clintonstimpson commented Dec 16, 2024

dtaller commented Dec 16, 2024

liu15 commented Dec 19, 2024

GPU support #4

GPU support #4

Comments

adayton1 commented Nov 13, 2024 • edited Loading

clintonstimpson commented Nov 15, 2024

adayton1 commented Nov 19, 2024

dtaller commented Dec 16, 2024

clintonstimpson commented Dec 16, 2024

dtaller commented Dec 16, 2024

liu15 commented Dec 19, 2024

adayton1 commented Nov 13, 2024 •

edited

Loading