Skip to content
This repository has been archived by the owner on May 17, 2022. It is now read-only.

Implemented Torch-UCC CI (WIP) #33

Open
wants to merge 9 commits into
base: ucc
Choose a base branch
from

Conversation

artemry-nv
Copy link

@artemry-nv artemry-nv commented Feb 17, 2021

Changes:

  • Prepared base docker environment
  • Added building scenarios

Signed-off-by: artemry-nv <artemry@nvidia.com>
@artemry-nv artemry-nv changed the title Implemented Torch-UCC CI Implemented Torch-UCC CI (WIP) Feb 17, 2021
@artemry-nv
Copy link
Author

@mike-dubman @Sergei-Lebedev please grant me permissions to add reviewers.

Signed-off-by: artemry-nv <artemry@nvidia.com>
.ci/job_matrix.yaml Outdated Show resolved Hide resolved
.ci/job_matrix.yaml Outdated Show resolved Hide resolved
.ci/job_matrix.yaml Outdated Show resolved Hide resolved
@mike-dubman
Copy link
Contributor

@mike-dubman @Sergei-Lebedev please grant me permissions to add reviewers.

done. plz check your inbox and accept invite

@artemry-nv
Copy link
Author

@mike-dubman @Sergei-Lebedev please grant me permissions to add reviewers.

done. plz check your inbox and accept invite

Don't see any invitations - could you please re-send it?

Signed-off-by: artemry-nv <artemry@nvidia.com>
@artemry-nv artemry-nv force-pushed the artemry-nv/torch-ucc-ci branch from 32591b2 to e5553df Compare March 15, 2021 06:40
.ci/proj_jjb.yaml Show resolved Hide resolved
.ci/scripts/build_ucx.sh Show resolved Hide resolved
.ci/Jenkinsfile.shlib Outdated Show resolved Hide resolved
.ci/scripts/env.sh Outdated Show resolved Hide resolved
.ci/scripts/run_fb_dlrm.sh Outdated Show resolved Hide resolved
@artemry-nv artemry-nv changed the base branch from master to ucc March 29, 2021 19:32
TORCH_UCC_SRC_DIR_WITH_XCCL_SUPPORT="${TORCH_UCC_SRC_DIR}_xccl"

echo "UCC barrier"
/bin/bash ${TORCH_UCC_SRC_DIR_WITH_XCCL_SUPPORT}/test/start_test.sh ${TORCH_UCC_SRC_DIR_WITH_XCCL_SUPPORT}/test/torch_barrier_test.py --backend=gloo
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

why backend=gloo?

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@Sergei-Lebedev could you please comment?

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

backend option sets backend to compare with, i.e. ucc collective will be compared with gloo collective for correctness check

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants