-
Notifications
You must be signed in to change notification settings - Fork 747
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
cuLaunchCooperativeKernelMultiDevice #478
Comments
|
Thanks for the feedback. I got it to work! |
@saudet , one more question. I have the array of CUDA_LAUNCH_PARAMS working. I can also set all grid and block variables and the kernels executes correctly on the gpus. Next up is setting the kernel parameters. But each time when I set a kernel parameter like: launchParams.kernelParams(0, new LongPointer(new long[1]) I instantly get a SIGSEGV crash. |
That doesn't look right. You're going to need to follow the doc that NVIDIA
provides about that...
|
NVIDIA doc says you have to make this struct: typedef struct CUDA_LAUNCH_PARAMS_st {
} CUDA_LAUNCH_PARAMS;` So I want to set the "void **kernelParams;" pointer. However the cuda. java code only provides these options: 'public native Pointer kernelParams(int i); So how should I proceed? |
You'll need to allocate your own PointerPointer and pass that... |
Like this? All variants give a SIGSEGV I would also think that kernelParams(0, pointer) would suggest a normal pointer and when returning the entire array with kernelParams() then I would get a PointerPointer back? |
That is indeed an issue. We'll have to fix this. |
In the meantime, we can work around that by using |
Thanks for the feedback and yes it works! |
… member setters (issue bytedeco/javacpp-presets#478)
The fix is included in version 1.4, providing wrappers for CUDA 9.1 now though: |
I have been working with CUDA9 / JavaCPP for a few days and got everything up and running very fast. Thank you!
However I cannot seems to get cuda.cuLaunchCooperativeKernelMultiDevice() working. It takes CUDA_LAUNCH_PARAMS as the first argument and second argument the array size, but what I need is an array of CUDA_LAUNCH_PARAMS. I tried via PointerPointer but that dit not fix things.
Does anyone have a solution on how to call cuLaunchCooperativeKernelMultiDevice for multiple devices?
The text was updated successfully, but these errors were encountered: