You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Now that Cuda has a workspace method, it's possible to use cuDNN to speed up convolutions. The work for this should be:
Add a cudnn feature to dfdx cudnn = ["cuda", "cudarc/cudnn"]
In conv kernel forward & backward, if cudnn feature is active, invoke the cudnn API calls, otherwise keep the existing methods.
Note that currently cudarc doesn't have a safe cudnn API, so this may have to use straight sys/result calls for now. Part of this issue can be contributing back to cudarc as well.
The text was updated successfully, but these errors were encountered:
Now that Cuda has a workspace method, it's possible to use cuDNN to speed up convolutions. The work for this should be:
cudnn
feature to dfdxcudnn = ["cuda", "cudarc/cudnn"]
Note that currently cudarc doesn't have a safe cudnn API, so this may have to use straight sys/result calls for now. Part of this issue can be contributing back to cudarc as well.
The text was updated successfully, but these errors were encountered: