Performance tuning the sampling primitive for multi-node multi-GPU systems. #3169

seunghwak · 2023-01-23T17:55:58Z

Update the groupby code in multi-GPU communication to use atomics-based partitioning instead of sort-based partitioning. With the atomics performance improvements in recent NVIDIA GPUs, the atomics-based approach is now significantly faster than the sort-based approach, provided the number of groups is not excessive.
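The atomics-based partitioning described above can be pictured as a count, scan, and scatter. What follows is a minimal host-side C++ sketch of that general idea, not the code from this PR: `partition_by_group` and its parameters are hypothetical names, and `std::atomic` counters stand in for the `atomicAdd` calls a GPU kernel would issue.

```cpp
#include <atomic>
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical helper (not a cuGraph API): reorders `values` so that elements
// with the same group id are contiguous, without sorting. On return, group g
// occupies out[offsets[g]] .. out[offsets[g + 1] - 1].
std::vector<int> partition_by_group(const std::vector<int>& values,
                                    const std::vector<int>& group_ids,
                                    std::size_t num_groups,
                                    std::vector<std::size_t>& offsets)
{
  assert(values.size() == group_ids.size());

  // Pass 1: count the elements in each group. In a GPU kernel every thread
  // would do an atomicAdd on its element's counter; this loop is the serial
  // stand-in for those threads.
  std::vector<std::atomic<std::size_t>> counters(num_groups);
  for (auto& c : counters) { c.store(0); }
  for (std::size_t i = 0; i < values.size(); ++i) {
    counters[group_ids[i]].fetch_add(1, std::memory_order_relaxed);
  }

  // Exclusive scan of the counts gives the start offset of each group.
  offsets.assign(num_groups + 1, 0);
  for (std::size_t g = 0; g < num_groups; ++g) {
    offsets[g + 1] = offsets[g] + counters[g].load();
  }

  // Pass 2: scatter. Each element claims the next free slot in its group's
  // range by atomically incrementing that group's cursor.
  for (std::size_t g = 0; g < num_groups; ++g) { counters[g].store(offsets[g]); }
  std::vector<int> out(values.size());
  for (std::size_t i = 0; i < values.size(); ++i) {
    std::size_t slot = counters[group_ids[i]].fetch_add(1, std::memory_order_relaxed);
    out[slot] = values[i];
  }
  return out;
}
```

Both passes are linear in the number of elements, while a sort-based groupby pays for a full sort. The cost is one counter per group, which is presumably part of why the approach is only attractive when the number of groups is not excessive. When the loops run in parallel, the order of elements within a group is not deterministic.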
In random index generation, add a code path to handle high-degree vertices when with_replacement = false.
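The description does not say how the high-degree case is handled. As one illustration of why it needs special care, the sketch below uses Robert Floyd's sampling algorithm, a well-known technique that draws k distinct indices from [0, degree) with work and memory proportional to k rather than to the degree. This is a swapped-in example under that assumption, not the method used in this PR; `sample_without_replacement` is a hypothetical name.

```cpp
#include <cassert>
#include <cstdint>
#include <random>
#include <unordered_set>
#include <vector>

// Hypothetical helper (not a cuGraph API): returns k distinct indices drawn
// uniformly (as a set) from [0, degree) using Floyd's algorithm. It never
// materializes or shuffles the full index range, so the cost does not grow
// with the vertex degree.
std::vector<std::uint64_t> sample_without_replacement(std::uint64_t degree,
                                                      std::uint64_t k,
                                                      std::mt19937_64& rng)
{
  assert(k <= degree);
  std::unordered_set<std::uint64_t> chosen;
  std::vector<std::uint64_t> result;
  result.reserve(k);

  for (std::uint64_t j = degree - k; j < degree; ++j) {
    // Draw a candidate from [0, j] (both ends inclusive).
    std::uniform_int_distribution<std::uint64_t> dist(0, j);
    std::uint64_t pick = dist(rng);
    // If the candidate was already taken, use j instead. j cannot have been
    // taken yet, because every earlier draw came from a range ending below j.
    if (!chosen.insert(pick).second) {
      pick = j;
      chosen.insert(j);
    }
    result.push_back(pick);
  }
  return result;
}
```

The returned set is uniformly distributed over all k-subsets, but the order of the indices inside `result` is not a uniformly random permutation; shuffle the result if order matters.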

codecov-commenter · 2023-01-23T21:15:29Z

Codecov Report

❗ No coverage uploaded for pull request base (branch-23.04@a0d964d).
Patch has no changes to coverable lines.

Additional details and impacted files

@@               Coverage Diff               @@
##             branch-23.04    #3169   +/-   ##
===============================================
  Coverage                ?   56.26%           
===============================================
  Files                   ?      153           
  Lines                   ?     9658           
  Branches                ?        0           
===============================================
  Hits                    ?     5434           
  Misses                  ?     4224           
  Partials                ?        0


ajschmidt8 · 2023-02-06T18:46:41Z

Removing ops-codeowners from the required reviews since it doesn't seem there are any file changes that we're responsible for. Feel free to add us back if necessary.

rlratzel

(approving for python-codeowners to unblock merging, but didn't review any code here - assuming python-codeowners added from a file change no longer in this PR)

seunghwak · 2023-02-07T23:13:10Z

> (approving for python-codeowners to unblock merging, but didn't review any code here - assuming python-codeowners added from a file change no longer in this PR)

Yes, this is due to branch re-targeting.

ChuckHastings · 2023-02-08T15:45:55Z

/merge

seunghwak added 4 commits January 17, 2023 10:10

update groupby in shuffle comm to use atomics instead of sorting

ab5cb10

bug fix in MG C++ random sampling tests

ce4994e

performance tune per_v_random_select_transform_outgoing_e

6667f7a

resolve merge conflicts

532537e

seunghwak requested a review from a team as a code owner January 23, 2023 17:55

seunghwak added the 3 - Ready for Review, improvement, and non-breaking labels Jan 23, 2023

seunghwak added 2 commits January 23, 2023 10:04

fix copyright year

dc8d5f5

clang-format

4ffac10

seunghwak requested review from naimnv, jnke2016 and ChuckHastings January 23, 2023 18:07

seunghwak self-assigned this Jan 23, 2023

seunghwak added this to the 23.02 milestone Jan 23, 2023

seunghwak added 2 commits January 23, 2023 10:09

fix a mistake in resolving merge conflicts

3e2eb26

clang-format

a24aa00

BradReesWork modified the milestones: 23.02, 23.04 Jan 23, 2023

seunghwak added 5 commits January 25, 2023 14:07

Merge branch 'branch-23.02' of github.com:rapidsai/cugraph into enh_sample_prim_perf_draco

64427b3

performance tuning

a44fd43

remove temporary debug code

f268549

Merge branch 'branch-23.02' of github.com:rapidsai/cugraph into enh_sample_prim_perf_draco

cc20dbb

Merge remote-tracking branch 'upstream/branch-23.04' into enh_sample_prim_perf_draco

7a727c7

seunghwak requested review from a team as code owners February 6, 2023 18:06

seunghwak changed the base branch from branch-23.02 to branch-23.04 February 6, 2023 18:07

seunghwak mentioned this pull request Feb 6, 2023

Uniform sampling code cleanup and minor performance tuning #3238

Merged

ajschmidt8 removed the request for review from a team February 6, 2023 18:46

ChuckHastings approved these changes Feb 7, 2023

Merge branch 'branch-23.04' of github.com:rapidsai/cugraph into enh_sample_prim_perf_draco

0781c74

rlratzel approved these changes Feb 7, 2023

rapids-bot bot merged commit c39fc02 into rapidsai:branch-23.04 Feb 8, 2023

seunghwak deleted the enh_sample_prim_perf_draco branch May 5, 2023 23:48
