You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
It should be quite straightforward: the kernel just looks something like C[i] = A[B[i], 2]. Writing a very general kernel is harder though and I guess it'd have to extend GPUArray's current indexing kernels.
Yeah, I agree, I wrote my custom kernel which is not general. I found that pytorch have a general implementation however, so I think we should have one as well.
I guess this can be faster with GPU? But any idea on how to implement this?
The text was updated successfully, but these errors were encountered: