Change CI cuda versions to 10.2 #3869

datumbox · 2021-05-21T08:46:04Z

Updates the CI to use cuda 10.2 instead of 10.1 so that we can start using fresh PyTorch core nightlies.

NicolasHug · 2021-05-21T08:59:01Z

Thanks for the PR!

Let's see what the CI says. We might also need to change things like image_name: "pytorch/manylinux-cuda101" in the yaml file, as well as the other references to 101 in regenerate.py, in particular:

                if device_type == 'gpu':
                    if python_version != "3.8":
                        job['filters'] = gen_filter_branch_tree('master', 'nightly')
                    job['cu_version'] = 'cu101'
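
One way to avoid hunting for every hard-coded 101 is to derive the CUDA-dependent values from a single constant. The following is only a sketch under stated assumptions: `CU_VERSION`, `build_unittest_job`, and `image_name_for` are hypothetical names, and `gen_filter_branch_tree` here is a stand-in, not the real helper from regenerate.py.

```python
# Hypothetical sketch; not the actual contents of regenerate.py.
CU_VERSION = "cu102"  # assumed single source of truth for the CI CUDA version


def gen_filter_branch_tree(*branches):
    """Stand-in for the real helper: restrict a job to the given branches."""
    return {"branches": {"only": list(branches)}}


def image_name_for(cu_version):
    """Derive the docker image name from the cu_version string, e.g. cu102 -> cuda102."""
    return "pytorch/manylinux-cuda" + cu_version[len("cu"):]


def build_unittest_job(device_type, python_version, cu_version=CU_VERSION):
    """Build one job dict, mirroring the GPU branch quoted above."""
    job = {"python_version": python_version}
    if device_type == "gpu":
        if python_version != "3.8":
            job["filters"] = gen_filter_branch_tree("master", "nightly")
        job["cu_version"] = cu_version
        job["image_name"] = image_name_for(cu_version)
    return job
```

With this shape, a later CUDA bump would only need to touch the constant, assuming every consumer reads from it.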

datumbox · 2021-05-21T09:01:16Z

Thanks, I'm still looking for other places I need to change. If you see any, please let me know.

datumbox · 2021-05-21T09:41:14Z

It seems that the latest PyTorch core on Linux has a new restriction on indexing across devices:

RuntimeError: Expected all tensors to be on the same device, but found at least two devices, cuda:0 and cpu! (when checking arugment for argument indices in method wrapper_Tensor_index_Tensor)

The problem appears only on Linux GPU and not on Windows:
https://app.circleci.com/pipelines/github/pytorch/vision/8241/workflows/6af4ce74-e157-4ca6-9eca-64ed7b9989ee/jobs/590609/tests#failed-test-0

I propose merging this now and fixing the issues on master ASAP in a separate PR.

fmassa

Accepting to unblock

ngimel · 2021-05-22T01:17:42Z

To close the loop on this: indexing from Python is not affected, because python_variable_indexing copies all the indices to self's device before dispatching to index. However, having indices on a different device in C++ has been broken by pytorch/pytorch#56872. cc @wenleix, @ezyang
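
To illustrate the step described above, here is a minimal sketch of moving the indices to the indexed tensor's device before indexing. It is written in Python for readability only; the helper name `index_on_self_device` is hypothetical, and it is not the code in python_variable_indexing or in the affected C++ call sites.

```python
import torch


def index_on_self_device(data: torch.Tensor, indices: torch.Tensor) -> torch.Tensor:
    """Index `data` with `indices`, first moving the indices to data's device."""
    if indices.device != data.device:
        # Explicit copy so both operands live on the same device.
        indices = indices.to(data.device)
    return data[indices]


# Falls back to CPU when no GPU is present, so the sketch runs anywhere.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
data = torch.arange(10, device=device) * 10
indices = torch.tensor([0, 3, 7])  # deliberately created on the CPU
picked = index_on_self_device(data, indices)
```

A C++ caller hitting the error would need the equivalent explicit move of its index tensors, assuming it cannot rely on the Python-side copy.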

wenleix · 2021-05-22T03:49:30Z

@ngimel Thanks for reporting this. I will figure out a fix for this.

Update: index leverages TensorIterator and should opt out of the automatic device check:

https://github.com/pytorch/pytorch/blob/master/aten/src/ATen/native/TensorAdvancedIndexing.cpp#L296

make_index_iterator:

https://github.com/pytorch/pytorch/blob/dc8bc6ba4bbd61d21de4ffbccf5b79d22ff31a23/aten/src/ATen/native/TensorAdvancedIndexing.cpp#L266-L277

Summary:
* Change cuda versions.
* changing cu_version
* patching regenerate.py
* more changes.

Reviewed By: vincentqb, cpuhrsch

Differential Revision: D28677174

fbshipit-source-id: a32861bd62e3f5a3d5b19106e4f1773128ba1006

Change cuda versions.

d61ebd8

datumbox added the module: ci label May 21, 2021

datumbox requested a review from NicolasHug May 21, 2021 08:46

facebook-github-bot added the cla signed label May 21, 2021

datumbox and others added 3 commits May 21, 2021 09:47

Merge branch 'master' into cuda10.1_to_cuda10.2

2c7d4a9

changing cu_version

f8ec9e6

Merge remote-tracking branch 'origin/cuda10.1_to_cuda10.2' into cuda1…

72569b1

…0.1_to_cuda10.2

patching regenerate.py

af664c7

datumbox force-pushed the cuda10.1_to_cuda10.2 branch from 062afef to 6a22a73 Compare May 21, 2021 09:03

more changes.

0020b68

datumbox force-pushed the cuda10.1_to_cuda10.2 branch from 6a22a73 to 0020b68 Compare May 21, 2021 09:04

datumbox requested a review from fmassa May 21, 2021 09:08

datumbox changed the title ~~Change cuda versions to 10.2~~ Change CI cuda versions to 10.2 May 21, 2021

fmassa approved these changes May 21, 2021

datumbox merged commit 91d9797 into pytorch:master May 21, 2021

datumbox deleted the cuda10.1_to_cuda10.2 branch May 21, 2021 09:47

datumbox mentioned this pull request May 21, 2021

Moving tensors to the right device #3870

Merged

wenleix mentioned this pull request May 22, 2021

[PyTorch] Remove device check from a few indexing methods pytorch/pytorch#58800

Closed
