Cuda-Example: parallel sum reduction, parallel addition of a scalar across an array, and a GPU hardware query. No header file is needed; just compile with a minimum architecture specification of 30. Example: nvcc example.cu -o example -arch=sm_30
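
For orientation, here is a minimal sketch of what the three pieces could look like. It is not the code in example.cu. All names (THREADS, block_sum, add_scalar, add_then_sum, print_device_info) are hypothetical, and the reduction is assumed to be a shared-memory tree reduction with the per-block partial sums added up on the host; the actual example may use a different scheme.

```cuda
#include <cstdio>
#include <vector>
#include <cuda_runtime.h>

#define THREADS 256  // hypothetical block size; must be a power of two

// Each block reduces its slice of `in` to one partial sum in out[blockIdx.x].
__global__ void block_sum(const float* in, float* out, int n) {
    __shared__ float cache[THREADS];
    int tid = threadIdx.x;
    int i = blockIdx.x * blockDim.x + tid;
    cache[tid] = (i < n) ? in[i] : 0.0f;  // pad the last block with zeros
    __syncthreads();
    // Tree reduction: halve the number of active threads each step.
    for (int stride = blockDim.x / 2; stride > 0; stride >>= 1) {
        if (tid < stride) cache[tid] += cache[tid + stride];
        __syncthreads();
    }
    if (tid == 0) out[blockIdx.x] = cache[0];
}

// Adds `value` to every element of `data`, one thread per element.
__global__ void add_scalar(float* data, float value, int n) {
    int i = blockIdx.x * blockDim.x + threadIdx.x;
    if (i < n) data[i] += value;
}

// Host helper: adds `value` to each element on the GPU, then returns the sum.
// Error checking is omitted to keep the sketch short.
float add_then_sum(const std::vector<float>& host, float value) {
    int n = (int)host.size();
    if (n == 0) return 0.0f;
    int blocks = (n + THREADS - 1) / THREADS;
    float* d_in = NULL;
    float* d_partial = NULL;
    cudaMalloc((void**)&d_in, n * sizeof(float));
    cudaMalloc((void**)&d_partial, blocks * sizeof(float));
    cudaMemcpy(d_in, host.data(), n * sizeof(float), cudaMemcpyHostToDevice);
    add_scalar<<<blocks, THREADS>>>(d_in, value, n);
    block_sum<<<blocks, THREADS>>>(d_in, d_partial, n);
    std::vector<float> partial(blocks);
    // This blocking copy also waits for both kernels to finish.
    cudaMemcpy(partial.data(), d_partial, blocks * sizeof(float),
               cudaMemcpyDeviceToHost);
    cudaFree(d_in);
    cudaFree(d_partial);
    float total = 0.0f;
    for (int b = 0; b < blocks; ++b) total += partial[b];  // final pass on host
    return total;
}

// Prints the name and compute capability of every visible CUDA device.
void print_device_info() {
    int count = 0;
    cudaGetDeviceCount(&count);
    for (int d = 0; d < count; ++d) {
        cudaDeviceProp prop;
        cudaGetDeviceProperties(&prop, d);
        std::printf("Device %d: %s, compute capability %d.%d\n",
                    d, prop.name, prop.major, prop.minor);
    }
}
```

Calling print_device_info() and then add_then_sum(values, 1.0f) from a main function exercises all three pieces. Nothing in this sketch is specific to a particular architecture, so the -arch value from the compile line above should work unchanged, provided the installed toolkit still supports it.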