cupy_backends.cuda.api.driver.CUDADriverError: CUDA_ERROR_INVALID_SOURCE: device kernel image is invalid #437

SuperFCR · 2023-10-19T11:27:46Z

Issue type

Bug Report
Feature Request
Help wanted
Other

SpikingJelly version

0.0.0.0.12

Description

Traceback (most recent call last):
File "train.py", line 502, in <module>
main(args)
File "train.py", line 446, in main
train_loss, train_acc1, train_acc5 = train_one_epoch(
File "train.py", line 216, in train_one_epoch
output = model(image)
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, **kwargs)
File "/root/falcary/cifar10dvs/model.py", line 245, in forward
x = self.forward_features(x)
File "/root/falcary/cifar10dvs/model.py", line 238, in forward_features
x = patch_embed(x)
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, **kwargs)
File "/root/falcary/cifar10dvs/model.py", line 153, in forward
x = self.proj_lif(x).flatten(0,1).contiguous()
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/torch/nn/modules/module.py", line 1501, in _call_impl
return forward_call(*args, **kwargs)
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/spikingjelly/clock_driven/neuron.py", line 855, in forward
spike_seq, self.v_seq = neuron_kernel.MultiStepLIFNodePTT.apply(
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/torch/autograd/function.py", line 506, in apply
return super().apply(*args, **kwargs) # type: ignore[misc]
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/spikingjelly/clock_driven/neuron_kernel.py", line 732, in forward
cp_numel = cupy.asarray(numel)
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/cupy/_creation/from_data.py", line 76, in asarray
return _core.array(a, dtype, False, order)
File "cupy/_core/core.pyx", line 2266, in cupy._core.core.array
File "cupy/_core/core.pyx", line 2290, in cupy._core.core.array
File "cupy/_core/core.pyx", line 2424, in cupy._core.core._array_default
File "cupy/_core/core.pyx", line 699, in cupy._core.core.ndarray.fill
File "cupy/_core/_kernel.pyx", line 900, in cupy._core._kernel.ElementwiseKernel.call
File "cupy/_core/_kernel.pyx", line 925, in cupy._core._kernel.ElementwiseKernel._get_elementwise_kernel
File "cupy/_util.pyx", line 67, in cupy._util.memoize.decorator.ret
File "cupy/_core/_kernel.pyx", line 712, in cupy._core._kernel._get_elementwise_kernel
File "cupy/_core/_kernel.pyx", line 72, in cupy._core._kernel._get_simple_elementwise_kernel
File "cupy/_core/core.pyx", line 2141, in cupy._core.core.compile_with_cache
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/cupy/cuda/compiler.py", line 492, in _compile_module_with_cache
return _compile_with_cache_cuda(
File "/opt/conda/envs/spiking/lib/python3.8/site-packages/cupy/cuda/compiler.py", line 561, in _compile_with_cache_cuda
mod.load(cubin)
File "cupy/cuda/function.pyx", line 264, in cupy.cuda.function.Module.load
File "cupy/cuda/function.pyx", line 266, in cupy.cuda.function.Module.load
File "cupy_backends/cuda/api/driver.pyx", line 210, in cupy_backends.cuda.api.driver.moduleLoadData
File "cupy_backends/cuda/api/driver.pyx", line 60, in cupy_backends.cuda.api.driver.check_status
cupy_backends.cuda.api.driver.CUDADriverError: CUDA_ERROR_INVALID_SOURCE: device kernel image is invalid

Minimal code to reproduce the error/bug

env
cuda==11.7
cupy-cuda117
spikingjelly==0.0.0.0.12

Thanks for your response!
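The environment listed above can be checked in more detail with a short diagnostic script. This is a minimal sketch, not part of the original report: the helper names `decode_cuda_version` and `collect_env` are hypothetical, and the idea that the error comes from a mismatch between the CuPy wheel, the CUDA runtime, the driver, and the GPU architecture is an unconfirmed assumption. Every lookup is guarded so the script still runs when a package or a GPU is missing.

```python
import importlib


def decode_cuda_version(v):
    """Turn CUDA's integer version (e.g. 11070) into a 'major.minor' string."""
    return f"{v // 1000}.{(v % 1000) // 10}"


def collect_env():
    """Collect version info (hypothetical helper); all lookups are guarded."""
    info = {}
    # Package versions, or a note if the import fails.
    for name in ("torch", "cupy", "spikingjelly"):
        try:
            mod = importlib.import_module(name)
            info[name] = getattr(mod, "__version__", "unknown")
        except Exception as exc:
            info[name] = f"unavailable ({type(exc).__name__})"
    # CUDA version PyTorch was built against.
    try:
        import torch
        info["torch_cuda"] = torch.version.cuda
    except Exception as exc:
        info["torch_cuda"] = f"unavailable ({type(exc).__name__})"
    # CUDA runtime/driver versions and GPU compute capability as seen by CuPy.
    try:
        import cupy
        info["cupy_cuda_runtime"] = decode_cuda_version(
            cupy.cuda.runtime.runtimeGetVersion())
        info["cuda_driver"] = decode_cuda_version(
            cupy.cuda.runtime.driverGetVersion())
        info["compute_capability"] = cupy.cuda.Device(0).compute_capability
    except Exception as exc:
        info["cupy_cuda"] = f"unavailable ({type(exc).__name__})"
    return info


if __name__ == "__main__":
    for key, value in collect_env().items():
        print(f"{key}: {value}")
```

Posting this output alongside the traceback makes it easier to see whether the installed `cupy-cuda117` wheel matches the CUDA version and GPU actually in use.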


sduzzx857 · 2023-10-26T03:59:48Z

I ran into this problem too. Have you solved it?

fangwei123456 · 2023-10-26T12:33:23Z

I am not sure whether this error is caused by the following known bug:

https://github.com/fangwei123456/spikingjelly/blob/master/bugs.md

Bug: When using CuPy with version >= 10, CuPy will change torch.cuda.current_device() to 0, cupy/cupy#6569. This bug will break training when using Distributed Data Parallel (DDP).

Please use spikingjelly==0.0.0.0.14 and try again.
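To see whether the device-switching bug described above affects a given setup, the current PyTorch device can be compared before and after a CuPy call. This is a rough sketch under stated assumptions: the function name `current_device_after_cupy` is hypothetical, it needs at least two GPUs to be meaningful, and whether `cupy.asarray` (the call seen in the traceback) is enough to trigger the switch is an assumption, not something confirmed in this thread.

```python
def current_device_after_cupy(target_device=1):
    """Return (before, after) torch device indices around a CuPy call.

    Returns None when torch, cupy, or a GPU with index `target_device`
    is not available. Hypothetical helper for illustration only.
    """
    try:
        import torch
        import cupy
    except Exception:
        return None
    if not torch.cuda.is_available():
        return None
    if torch.cuda.device_count() <= target_device:
        return None

    torch.cuda.set_device(target_device)
    before = torch.cuda.current_device()
    cupy.asarray(1)  # same CuPy entry point that appears in the traceback
    after = torch.cuda.current_device()
    return before, after


if __name__ == "__main__":
    result = current_device_after_cupy()
    if result is None:
        print("Check skipped: torch, cupy, or a second GPU is unavailable.")
    else:
        before, after = result
        print(f"torch device before CuPy call: {before}, after: {after}")
```

If the two indices differ, the installed CuPy and SpikingJelly combination is likely affected by the bug, and upgrading as suggested is worth trying.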

fangwei123456 added the "help wanted" label Oct 26, 2023

MyNiuuu mentioned this issue Jun 25, 2024

Hybird gradio error MyNiuuu/MOFA-Video#15

Open
