You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the CGO17 benchmarks MM NVIDIA and NBody NVIDIA, there are nested reductions where memory can be re-used. Currently DPIA cannot model this, meaning that we use twice the required memory and introduce unnecessary copies. This has a significant performance impact (observed roughly 30% to 75% performance loss for these benchmarks depending on the target hardware).
In the CGO17 benchmarks MM NVIDIA and NBody NVIDIA, there are nested reductions where memory can be re-used. Currently DPIA cannot model this, meaning that we use twice the required memory and introduce unnecessary copies. This has a significant performance impact (observed roughly 30% to 75% performance loss for these benchmarks depending on the target hardware).
Any comments on that issue @bastian-koepcke ?
The text was updated successfully, but these errors were encountered: