You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Trying to move a nn.Module from CUDA to XLA device causes a segmentation fault. This is probably related to #3466. Models that hit #6011 will also hit this issue.
Fatal Python error: Segmentation fault
Thread 0x00007ff283043700 (most recent call first):
File "/usr/local/lib/python3.8/selectors.py", line 415 in select
File "/usr/local/lib/python3.8/multiprocessing/connection.py", line 931 in wait
File "/usr/local/lib/python3.8/concurrent/futures/process.py", line 362 in _queue_management_worker
File "/usr/local/lib/python3.8/threading.py", line 870 in run
File "/usr/local/lib/python3.8/threading.py", line 932 in _bootstrap_inner
File "/usr/local/lib/python3.8/threading.py", line 890 in _bootstrap
Current thread 0x00007ff4fb086740 (most recent call first):
File "torch/nn/modules/module.py", line 1150 in convert
File "torch/nn/modules/module.py", line 825 in _apply
File "torch/nn/modules/module.py", line 1152 in to
File "example.py", line 9 in <module>
Segmentation fault (core dumped)
In an internal discussion, I mentioned I thought we could just modify the benchmark code for fixing the error for DALLE2_pytorch. However, it's implemented with an external library. Given that, I think we should try to fix the actual error: the segmentation fault.
🐛 Bug
Trying to move a
nn.Module
from CUDA to XLA device causes a segmentation fault. This is probably related to #3466. Models that hit #6011 will also hit this issue.Affected benchmarks
Environment
cc @JackCaoG @miladm
The text was updated successfully, but these errors were encountered: