Skip to content
This repository has been archived by the owner on Nov 25, 2024. It is now read-only.

Commit

Permalink
Sync stream for scatter_op
Browse files Browse the repository at this point in the history
  • Loading branch information
chang-l committed Nov 18, 2024
1 parent 6821f7a commit 0b6f889
Showing 1 changed file with 2 additions and 0 deletions.
2 changes: 2 additions & 0 deletions cpp/src/wholememory_ops/scatter_op_impl_mapped.cu
Original file line number Diff line number Diff line change
Expand Up @@ -18,6 +18,7 @@
#include <wholememory/env_func_ptrs.h>
#include <wholememory/wholememory.h>

#include "cuda_macros.hpp"
#include "wholememory_ops/functions/gather_scatter_func.h"

namespace wholememory_ops {
Expand All @@ -41,6 +42,7 @@ wholememory_error_code_t wholememory_scatter_mapped(
wholememory_desc,
stream,
scatter_sms);
WM_CUDA_CHECK(cudaStreamSynchronize(stream));
}

} // namespace wholememory_ops

0 comments on commit 0b6f889

Please sign in to comment.