Add vectorized cached loads #1993

Merged

maleadt merged 1 commit into JuliaGPU:master from Zentrik:vectorized-loads

Jul 10, 2023

Contributor

Zentrik commented Jul 8, 2023 (edited)

Adds the ability to perform explicit vectorized cached loads using the ldg intrinsic.
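For readers unfamiliar with the underlying hardware feature, here is a minimal sketch in CUDA C++ rather than Julia. The Julia-side interface added by this PR is not shown in this thread, so nothing below should be read as its API. The sketch assumes a device of compute capability 3.5 or newer, where `__ldg` accepts a `const float4*` and issues a single 128-bit load through the read-only data cache. The kernel name `copy_vec4` and the problem size are hypothetical, chosen only for illustration.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernel: each thread copies one float4 (four floats) using a
// single vectorized load through the read-only data cache.
__global__ void copy_vec4(const float4* __restrict__ in,
                          float4* __restrict__ out,
                          int n4)
{
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n4) {
        float4 v = __ldg(&in[i]);  // 128-bit cached load (requires sm_35+)
        out[i] = v;
    }
}

int main()
{
    const int n  = 1024;   // element count; assumed to be a multiple of 4
    const int n4 = n / 4;  // number of float4 vectors

    float *in = nullptr, *out = nullptr;
    // Allocations from the CUDA runtime are aligned well beyond the
    // 16 bytes that float4 accesses require.
    cudaMallocManaged(&in,  n * sizeof(float));
    cudaMallocManaged(&out, n * sizeof(float));
    for (int i = 0; i < n; ++i) {
        in[i]  = static_cast<float>(i);
        out[i] = -1.0f;
    }

    const int threads = 256;
    const int blocks  = (n4 + threads - 1) / threads;
    copy_vec4<<<blocks, threads>>>(reinterpret_cast<const float4*>(in),
                                   reinterpret_cast<float4*>(out), n4);
    cudaDeviceSynchronize();

    // Verify on the host that every element was copied.
    int mismatches = 0;
    for (int i = 0; i < n; ++i) {
        if (out[i] != in[i]) ++mismatches;
    }
    printf("mismatches: %d\n", mismatches);

    cudaFree(in);
    cudaFree(out);
    return mismatches != 0;
}
```

The vector form only applies when the pointer is aligned to the vector width (16 bytes for `float4`) and the data is not written while the kernel runs, since the read-only cache is not kept coherent with writes.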


Add vectorized cached loads

aa6f145

maleadt added enhancement cuda kernels labels

Member

maleadt commented Jul 10, 2023

LGTM, thanks!

maleadt merged commit cd90c74 into JuliaGPU:master

maleadt mentioned this pull request

Use cached loads JuliaGPU/GemmKernels.jl#140

Open

Labels

cuda kernels enhancement