Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Memory re-use for nested reductions #210

Open
Bastacyclop opened this issue Sep 6, 2021 · 0 comments
Open

Memory re-use for nested reductions #210

Bastacyclop opened this issue Sep 6, 2021 · 0 comments
Labels
enhancement New feature or request prio:high

Comments

@Bastacyclop
Copy link
Member

Bastacyclop commented Sep 6, 2021

In the CGO17 benchmarks MM NVIDIA and NBody NVIDIA, there are nested reductions where memory can be re-used. Currently DPIA cannot model this, meaning that we use twice the required memory and introduce unnecessary copies. This has a significant performance impact (observed roughly 30% to 75% performance loss for these benchmarks depending on the target hardware).

Any comments on that issue @bastian-koepcke ?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request prio:high
Projects
None yet
Development

No branches or pull requests

1 participant