-
Notifications
You must be signed in to change notification settings - Fork 9.5k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
train lvis1.0 datasets have a error:RuntimeError: transform: failed to synchronize: cudaErrorAssert: device-side assert triggered #5912
Comments
It looks like an out-of-range problem. Did you check the num_classes is correct? And did you modify the default config? |
No,I don't modify the defalut. The LVISV1 num classes is 1203. model = dict( |
I want to know if my GPU is rtx3090, so I can't use mmdet2.3.0. |
Can you try the latest version to see if you still encounter this problem? |
I have no problem using the new version, but now there is a new problem that has not been solved.I used the latest version of mmdet = 2.15 to test the lvisv1.0 dataset, and used the officially downloaded lvis1.0(mask_rcnn_r50_fpn_sample1e-3_mstrain_1x_lvis_v1.pth) model and default configuration for evaluation and testing, but there will still be insufficient memory leading to the killing of the process. So I want to know how much memory is needed to test LVIs 1.0 model, and why I still don't have enough memory in 30g.For example: [>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>] 19809/19809, 8.4 task/s, elapsed: 6364s, ETA: 0s2021-08-18 08:48:11,286 - mmdet - INFO - Evaluating bbox... |
LVIS is a very large dataset, it requires a lot of memory during training and evaluation especially in instance segmentation tasks. 30g may not be enough. Upgrade to 64G or higher would be better. |
The memory problem already has an issue in #5909. So I close this issue. |
TorchVision: 0.9.1+cu111
OpenCV: 4.5.1
MMCV: 1.0.5
MMDetection: 2.3.0+
MMDetection Compiler: GCC 9.3
MMDetection CUDA Compiler: 11.2
PyTorch: 1.8.1+cu111
PyTorch compiling details: PyTorch built with:
run command:
command:use default config:python tools/train.py configs/lvis/mask_rcnn_r50_fpn_sample1e-3_mstrain_1x_lvis_v1.py
Error info:
/pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [96,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [97,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [98,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [99,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [100,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [101,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [102,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [103,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [104,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [105,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [106,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [108,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [109,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [111,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [112,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [114,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [115,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [116,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [117,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [118,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [119,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [120,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [32,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [44,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [45,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [46,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [47,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [48,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [49,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [50,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [51,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [52,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [53,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [54,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [55,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [56,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [57,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [58,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [59,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [60,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [61,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [62,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [63,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [64,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [65,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [66,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [67,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [68,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [69,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [70,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [71,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [72,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [73,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [74,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [75,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [76,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [77,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [78,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [86,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [87,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [88,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [89,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [90,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [91,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [92,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [93,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [94,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [95,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [2,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [3,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [4,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [5,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [6,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [7,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [8,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [9,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [10,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [11,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [12,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [13,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [14,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [15,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [16,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [17,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [18,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [19,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [20,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [21,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [22,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [23,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [24,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [25,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [26,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [27,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [28,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [29,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [30,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed./pytorch/aten/src/ATen/native/cuda/IndexKernel.cu:142: operator(): block: [0,0,0], thread: [31,0,0] Assertion
index >= -sizes[i] && index < sizes[i] && "index out of bounds"
failed.Traceback (most recent call last):
File "tools/train.py", line 154, in
main()
File "tools/train.py", line 143, in main
train_detector(
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/apis/train.py", line 143, in train_detector
runner.run(data_loaders, cfg.workflow, cfg.total_epochs)
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmcv/runner/epoch_based_runner.py", line 122, in run
epoch_runner(data_loaders[i], **kwargs)
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmcv/runner/epoch_based_runner.py", line 31, in train
outputs = self.model.train_step(data_batch, self.optimizer,
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmcv/parallel/data_parallel.py", line 31, in train_step
return self.module.train_step(*inputs[0], **kwargs[0])
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/models/detectors/base.py", line 237, in train_step
losses = self(**data)
File "/home/ubuntu/.local/lib/python3.8/site-packages/torch/nn/modules/module.py", line 889, in _call_impl
result = self.forward(*input, **kwargs)
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/core/fp16/decorators.py", line 51, in new_func
return old_func(*args, **kwargs)
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/models/detectors/base.py", line 171, in forward
return self.forward_train(img, img_metas, **kwargs)
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/models/detectors/two_stage.py", line 150, in forward_train
rpn_losses, proposal_list = self.rpn_head.forward_train(
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/models/dense_heads/base_dense_head.py", line 54, in forward_train
losses = self.loss(*loss_inputs, gt_bboxes_ignore=gt_bboxes_ignore)
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/models/dense_heads/rpn_head.py", line 69, in loss
losses = super(RPNHead, self).loss(
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/core/fp16/decorators.py", line 131, in new_func
return old_func(*args, **kwargs)
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/models/dense_heads/anchor_head.py", line 463, in loss
cls_reg_targets = self.get_targets(
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/models/dense_heads/anchor_head.py", line 346, in get_targets
results = multi_apply(
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/core/utils/misc.py", line 54, in multi_apply
return tuple(map(list, zip(*map_results)))
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/models/dense_heads/anchor_head.py", line 230, in _get_targets_single
sampling_result = self.sampler.sample(assign_result, anchors,
File "/home/ubuntu/.conda/envs/mmdet/lib/python3.8/site-packages/mmdet-2.3.0-py3.8.egg/mmdet/core/bbox/samplers/base_sampler.py", line 97, in sample
neg_inds = neg_inds.unique()
File "/home/ubuntu/.local/lib/python3.8/site-packages/torch/tensor.py", line 511, in unique
return torch.unique(self, sorted=sorted, return_inverse=return_inverse, return_counts=return_counts, dim=dim)
File "/home/ubuntu/.local/lib/python3.8/site-packages/torch/_jit_internal.py", line 365, in fn
return if_false(*args, **kwargs)
File "/home/ubuntu/.local/lib/python3.8/site-packages/torch/_jit_internal.py", line 365, in fn
return if_false(args, kwargs)
File "/home/ubuntu/.local/lib/python3.8/site-packages/torch/functional.py", line 831, in _return_output
output, _, _ = _unique_impl(input, sorted, return_inverse, return_counts, dim)
File "/home/ubuntu/.local/lib/python3.8/site-packages/torch/functional.py", line 745, in _unique_impl
output, inverse_indices, counts = torch._unique2(
RuntimeError: transform: failed to synchronize: cudaErrorAssert: device-side assert triggered
terminate called after throwing an instance of 'c10::Error'
what(): CUDA error: device-side assert triggered
Exception raised from create_event_internal at /pytorch/c10/cuda/CUDACachingAllocator.cpp:733 (most recent call first):
frame #0: c10::Error::Error(c10::SourceLocation, std::string) + 0x42 (0x7fa0b468c2f2 in /home/ubuntu/.local/lib/python3.8/site-packages/torch/lib/libc10.so)
frame #1: c10::detail::torchCheckFail(char const, char const, unsigned int, std::string const&) + 0x5b (0x7fa0b468967b in /home/ubuntu/.local/lib/python3.8/site-packages/torch/lib/libc10.so)
frame #2: c10::cuda::CUDACachingAllocator::raw_delete(void) + 0x809 (0x7fa0bcb6a1f9 in /home/ubuntu/.local/lib/python3.8/site-packages/torch/lib/libc10_cuda.so)
frame #3: c10::TensorImpl::release_resources() + 0x54 (0x7fa0b46743a4 in /home/ubuntu/.local/lib/python3.8/site-packages/torch/lib/libc10.so)
frame #4: + 0x6e47ba (0x7fa0bd8727ba in /home/ubuntu/.local/lib/python3.8/site-packages/torch/lib/libtorch_python.so)
frame #5: + 0x6e4851 (0x7fa0bd872851 in /home/ubuntu/.local/lib/python3.8/site-packages/torch/lib/libtorch_python.so)
frame #23: __libc_start_main + 0xf3 (0x7fa0c2c670b3 in /lib/x86_64-linux-gnu/libc.so.6)
The text was updated successfully, but these errors were encountered: