Error when training model #12

Open
jm02058 opened this issue Jul 7, 2024 · 2 comments

Comments

@jm02058

jm02058 commented Jul 7, 2024

I am trying to train the model (currently using the provided example 'python bin/train_model.py -c configs/train_tsdiff/train_uber_tlc.yaml'), but I am struggling to fix the following error:

File "/.../src/uncond_ts_diff/arch/s4.py", line 967, in forward
k = torch.fft.irfft(k_f, n=discrete_L) # (B+1, C, H, L)
RuntimeError: cuFFT error: CUFFT_INTERNAL_ERROR

Any guidance on how I might have caused this would be much appreciated; I am not very experienced in this area. Thank you.
I have also attached a screenshot of the last output (I am running the model on WSL).

Screenshot 2024-07-07 114209

@abdulfatir
Contributor

@jm02058 Is this relevant? pytorch/pytorch#88038

@jm02058
Author

jm02058 commented Jul 30, 2024

> @jm02058 Is this relevant? pytorch/pytorch#88038

Yes, I'll have another look.
Thank you.
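
For readers hitting the same error: one way to narrow it down is to check whether `torch.fft.irfft` fails on the GPU in isolation, outside the model code. The script below is only a sketch and is not part of this repository; `check_irfft` and the tensor shape are made up for illustration. It runs a small rfft/irfft round trip on a given device and reports the PyTorch version, the CUDA version the build was compiled against, and the GPU name.

```python
# Hypothetical diagnostic script; not part of the repository.
try:
    import torch
except ImportError:  # report a missing install instead of crashing
    torch = None


def check_irfft(device="cuda", length=64):
    """Run a small rfft/irfft round trip on `device` and report the outcome."""
    report = {"torch": None, "cuda_build": None, "device": device,
              "gpu": None, "irfft_ok": False, "error": None}
    if torch is None:
        report["error"] = "torch is not installed"
        return report
    report["torch"] = torch.__version__
    report["cuda_build"] = torch.version.cuda  # None for CPU-only builds
    if device.startswith("cuda"):
        if not torch.cuda.is_available():
            report["error"] = "CUDA is not available"
            return report
        report["gpu"] = torch.cuda.get_device_name(torch.device(device))
    try:
        # Arbitrary small tensor; the shape is illustrative only.
        x = torch.randn(2, 3, length, device=device)
        k_f = torch.fft.rfft(x, n=length)
        k = torch.fft.irfft(k_f, n=length)  # same call that fails in s4.py
        report["irfft_ok"] = bool(torch.allclose(k, x, atol=1e-4))
    except RuntimeError as exc:
        # A cuFFT failure surfaces here as a RuntimeError.
        report["error"] = str(exc)
    return report


if __name__ == "__main__":
    for dev in ("cpu", "cuda"):
        print(check_irfft(dev))
```

If the `cpu` run succeeds and the `cuda` run reports a cuFFT error, the problem likely lies in the PyTorch/CUDA/GPU combination rather than in the training configuration, and the reported versions are the details worth comparing against the linked PyTorch issue.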
