You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Can anyone tell me how to set a proper value of ssids options so that I can improve my programs using GPU? I try to change the parameters, but I didn't see much improvement.
These days, I was reading reference from page http://www.numerical.rl.ac.uk/spral/doc/latest/C/. And I also try to read the source code to find some explaintation.
However, there seems few guides about how to set the value of min_gpu_work, gpu_perf_coeff.
So, how do you use ssids? Do you find an apparent improvement in calculation by setting different min_gpu_work and gpu_perf_coeff?
The text was updated successfully, but these errors were encountered:
There are two options that control the GPU usage: gpu_perf_coeff and min_gpu_work. gpu_perf_coeff is an architecture dependent value corresponding to how much faster is the GPU compared to a CPU (including all the cores!). Obviously this quantity depends on the kernels as well as the sizes of the workload being executed but it is not necessary to have a precise value. By default this value is set to 1.0 but if the GPU is twice as fast as a CPU (on average) then this value should be set to 2.0.
Using this gpu_perf_coeff the workload partitioned between the CPUs and the GPU during the analyse phase. However, partitions that are attributed to the GPU are run on the GPU only if the associated amount of flop for a partition exceed the value given by min_gpu_work. By default it is set to 5 GFlop. This means that if the workload is very small, which seems to be your case, none of the workload is every going to be put on the GPU.
Can anyone tell me how to set a proper value of ssids options so that I can improve my programs using GPU? I try to change the parameters, but I didn't see much improvement.
These days, I was reading reference from page http://www.numerical.rl.ac.uk/spral/doc/latest/C/. And I also try to read the source code to find some explaintation.
However, there seems few guides about how to set the value of min_gpu_work, gpu_perf_coeff.
So, how do you use ssids? Do you find an apparent improvement in calculation by setting different min_gpu_work and gpu_perf_coeff?
The text was updated successfully, but these errors were encountered: