Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Reduce code duplication in CUDA memcpy and memset implementations
Since they are now using the same underlying CUDA API calls, they can reuse the implementation and add the final wait depending on the queue type.
- Loading branch information