Stas/best-practice-ntt #517
Conversation
Very good example @svpolonsky
mkdir -p build/icicle

# Configure and build Icicle
cmake -S ../../../icicle/ -B build/icicle -DCMAKE_BUILD_TYPE=Release -DCURVE=bn254 -DG2=OFF
You can also add -DMSM=OFF to exclude MSM compilation, which speeds up the build significantly.
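With that suggestion applied, the configure step from the diff would look like the sketch below (this assumes the MSM option is exposed as a CMake cache variable toggled with -D, like the CURVE and G2 options already shown):

```shell
# Sketch: same configure command as in the diff, with MSM compilation
# disabled to shorten the build (flag name per the reviewer's suggestion).
cmake -S ../../../icicle/ -B build/icicle \
      -DCMAKE_BUILD_TYPE=Release \
      -DCURVE=bn254 \
      -DG2=OFF \
      -DMSM=OFF
```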
done
cudaMemcpyAsync(&h_out[vec_transfer][i*block_size], &d_vec[vec_transfer][i*block_size], sizeof(E)*block_size, cudaMemcpyDeviceToHost, stream_d2h);
}
if (i>0) {
cudaMemcpyAsync(&d_vec[vec_transfer][(i-1)*block_size], &h_inp[vec_transfer][(i-1)*block_size], sizeof(E)*block_size, cudaMemcpyHostToDevice, stream_h2d);
(1) How do you guarantee that the copy from d_vec -> h_out fully executes before h_inp -> d_vec, when both are async and on different streams? This looks like a race.
(2) I think it would be better to compare it to a naive implementation, both in terms of performance and correctness.
(3) Since it's an example, I think additional comments would be helpful.
Regarding (1): I sync the streams on lines 118 and 119:
cudaStreamSynchronize(stream_d2h);
cudaStreamSynchronize(stream_h2d);
Regarding (2): I referenced the existing NTT example as a baseline in the "Running the example" section:
To compare with the ICICLE baseline (i.e., non-concurrent) NTT, you can run [this example](../ntt/README.md)
Regarding (3): I added comments to explain why we won't have races
The sync blocks until all tasks in that stream's queue are completed, but it doesn't by itself enforce ordering between operations on different streams.
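If the intent is to order individual operations across the two streams without blocking the host on a full stream sync, CUDA events can express the dependency directly. The sketch below is illustrative, not the PR's code: it reuses the example's names (d_vec, h_out, h_inp, block_size, stream_d2h, stream_h2d) and omits error checking.

```cpp
// Sketch: make the H2D copy wait for the D2H copy that reads the same
// device buffer, using a CUDA event instead of cudaStreamSynchronize.
cudaEvent_t d2h_done;
cudaEventCreateWithFlags(&d2h_done, cudaEventDisableTiming);

// Device-to-host copy on stream_d2h, then record completion.
cudaMemcpyAsync(h_out, d_vec, sizeof(E) * block_size,
                cudaMemcpyDeviceToHost, stream_d2h);
cudaEventRecord(d2h_done, stream_d2h);

// All work submitted to stream_h2d after this call waits for the event
// on the device; the host thread is not blocked.
cudaStreamWaitEvent(stream_h2d, d2h_done, 0);
cudaMemcpyAsync(d_vec, h_inp, sizeof(E) * block_size,
                cudaMemcpyHostToDevice, stream_h2d);

cudaEventDestroy(d2h_done);
```

This keeps both copy engines busy across loop iterations while still guaranteeing that d_vec is not overwritten before its contents have been read out.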
Looks good
Describe the changes
Icicle examples: Concurrent Data Transfer and NTT Computation
This PR introduces a Best Practice series of examples in C++. Specifically, the example shows how to concurrently transfer data to/from the device and execute NTT.
Linked Issues
Resolves #