-
Notifications
You must be signed in to change notification settings - Fork 7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Question about torchvision.io.decode_image #4325
Comments
Hi, Thanks for the report. Does the code work as expected with Also, note that we have fixed some bugs in image reading recently, see #3948 #4101 and #4268 |
yes, only when set num_workers=0, it works.
|
Oh, the issue happens when doing GPU decoding only? I believe this might be expected, as doing GPU computations in multiprocessing doesn't work well, and this is irrespective of if it's image decoding or not. For decoding on the GPU, we might need a different set of tooling in the dataset level to get this working, which we are starting to explore but won't be available soon. |
when we use torchvision.io.decode_image(img,device = local_rank) to train with ddp,we find num_workers>0 can't work.
RuntimeError: DataLoader worker (pid 58353) exited unexpectedly with exit code 1. Details are lost due to multiprocessing. Rerunning with num_workers=0 may give better error trace.
The text was updated successfully, but these errors were encountered: