This repository has been archived by the owner on Nov 17, 2023. It is now read-only.
Windows CI CUDA Intermittent error C2993 #17935
Comments
@mxnet-label-bot add [ci, windows]
ChaiBapchya changed the title from "Windows CI CUDA Intermitted error C2993" to "Windows CI CUDA Intermittent error C2993" on Apr 2, 2020
Created an upstream issue: NVIDIA/thrust#1090
@vexilligera did you test whether the error also occurs with more recent versions of thrust? I suggest we try installing thrust 1.9.8 on Windows CI, which is the version that will ship with CUDA 11. We already do that on Ubuntu CI.
There is another suggested fix at pytorch/pytorch#25393 (comment). cc @vexilligera
Seems to be an nvcc bug: NVIDIA/thrust#1090 (comment)
This is indeed an nvcc bug. There is no known workaround at the moment, but the next release of the CUDA toolkit will contain a fix. Ref NVIDIA/thrust#1090.
Description
Intermittent failure seen in the Windows GPU compilation phase (WIN_GPU / WIN_GPU_MKLDNN).
Discovered in PR #17808.
Related to pytorch/pytorch#25393
Error Message
The build intermittently fails with the following error:
Errors:
Entire stack trace:
http://jenkins.mxnet-ci.amazon-ml.com/blue/rest/organizations/jenkins/pipelines/mxnet-validation/pipelines/windows-gpu/branches/PR-17808/runs/15/nodes/39/log/?start=0
To Reproduce
Using the Windows AMI, clone the repo and run:
py -3 ci/build_windows.py -f WIN_GPU
What have you tried to solve it?
Currently, what is found to work:
Introducing retries, with max retries = 5.
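For illustration, here is a minimal sketch of what a retry wrapper with a cap of 5 attempts could look like. The issue does not say where the retries were added, so this is not the actual CI change: the function name `run_with_retries` and its parameters are hypothetical, and the only things taken from the issue are the build command and the limit of 5.

```python
import subprocess
import sys
import time


def run_with_retries(cmd, max_retries=5, delay_seconds=0.0, run=subprocess.run):
    """Run cmd until it exits with code 0 or max_retries attempts are used up.

    Hypothetical helper, not part of the MXNet CI scripts.
    Returns the 1-based number of the attempt that succeeded.
    Raises RuntimeError if every attempt fails.
    """
    for attempt in range(1, max_retries + 1):
        result = run(cmd)
        if result.returncode == 0:
            return attempt
        print(
            f"attempt {attempt}/{max_retries} failed "
            f"with exit code {result.returncode}",
            file=sys.stderr,
        )
        # Optional pause before the next attempt; skipped after the last one.
        if attempt < max_retries and delay_seconds > 0:
            time.sleep(delay_seconds)
    raise RuntimeError(f"command failed after {max_retries} attempts: {cmd}")


# Example call, using the command from "To Reproduce":
# run_with_retries(["py", "-3", "ci/build_windows.py", "-f", "WIN_GPU"])
```

A wrapper like this only masks the intermittent failure. It does not address the underlying nvcc bug tracked in NVIDIA/thrust#1090.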