You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
This is primarily for record-keeping, so that we don't forget about CUDA kernels that should be revisited someday. To be in this list, a kernel must be implemented correctly (in main or an impending PR), but have some reason to be rewritten. The list is to help us stick to the policy that existence is the first priority and optimization is second, without the temptation to go down a rabbit-hole of optimizing every kernel before moving on to the next one.
awkward_ListArray_min_range is a reduce-to-scalar algorithm, but it's implemented with atomicMin instead of tree-reduction.
This is primarily for record-keeping, so that we don't forget about CUDA kernels that should be revisited someday. To be in this list, a kernel must be implemented correctly (in
main
or an impending PR), but have some reason to be rewritten. The list is to help us stick to the policy that existence is the first priority and optimization is second, without the temptation to go down a rabbit-hole of optimizing every kernel before moving on to the next one.atomicMin
instead of tree-reduction.Variable-length inner loop:
The text was updated successfully, but these errors were encountered: