Use cublas<t>matinvBatched() for N <= 32 #739

tbensonatl · 2024-08-27T15:50:08Z

Use the cublas<t>matinvBatched() family of functions to invert linear systems of size N <= 32. This has two advantages over the more general pair of getrfBatched() and getriBatched() functions:

1. Higher performance with the single kernel than with split kernels.
2. The matinv functions support in-place transforms and do not modify the input in the case of out-of-place transforms, so we do not need a temporary input work buffer if the input is a tensor view.
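
The dispatch described above can be sketched on the host side as follows. This is a minimal illustration, not the code in this PR: `InversePath`, `InversePlan`, `kMatinvMaxN`, and `plan_inverse` are hypothetical names, and the rule that the getrf/getri path copies the input only when it is a tensor view is an assumption inferred from the description.

```cpp
#include <cassert>

// Hypothetical names throughout; this is a sketch, not the PR's implementation.
enum class InversePath {
  MatinvBatched,  // single-kernel cublas<t>matinvBatched()
  GetrfGetri      // cublas<t>getrfBatched() followed by cublas<t>getriBatched()
};

// Size limit for the matinv family, as stated in the PR description.
constexpr int kMatinvMaxN = 32;

struct InversePlan {
  InversePath path;
  bool needs_input_copy;  // true if a temporary copy of the input is required
};

// Choose the path for a batched n x n inverse. `input_is_view` marks an input
// that must not be overwritten (for example, a tensor view into other data).
inline InversePlan plan_inverse(int n, bool input_is_view) {
  assert(n > 0);
  if (n <= kMatinvMaxN) {
    // Per the description, matinv does not modify the input for
    // out-of-place transforms, so no temporary buffer is needed.
    return {InversePath::MatinvBatched, false};
  }
  // getrfBatched() overwrites its input with the LU factors, so this sketch
  // assumes a view is copied first to preserve the caller's data.
  return {InversePath::GetrfGetri, input_is_view};
}
```

With this sketch, `plan_inverse(16, true)` selects the matinv path with no input copy, while `plan_inverse(64, true)` falls back to the getrf/getri pair and requests a temporary copy.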

Use the cublas<t>matinvBatched() family of functions to invert linear systems of size N <= 32. This has two advantages over the more general pair of getrfBatched() and getriBatched() functions: 1. Higher performance with the single kernel than with split kernels. 2. The matinv functions support in-place transforms and do not modify the input in the case of out-of-place transforms, so we do not need a temporary input work buffer if the input is a tensor view.

tbensonatl · 2024-08-27T15:50:16Z

/build

coveralls · 2024-08-27T17:02:23Z

coverage: 93.386% (-0.02%) from 93.406%
when pulling c6ae9fa on optimize-inv-operator-for-small-systems
into 77f2901 on main.

cliffburdick · 2024-08-27T18:29:35Z

/build

tbensonatl requested review from luitjens and cliffburdick August 27, 2024 15:50

tbensonatl self-assigned this Aug 27, 2024

cliffburdick approved these changes Aug 27, 2024

cliffburdick merged commit d9053d6 into main Aug 27, 2024
1 check passed

cliffburdick deleted the optimize-inv-operator-for-small-systems branch August 27, 2024 18:29
