-
Notifications
You must be signed in to change notification settings - Fork 88
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Running the t code shows loss=nan when calculating the coco dataset. #68
Comments
I encountered the same problem with the coco_stuff dataset. 2023-08-07 16:07:13,007 INFO [trainer.py, 229] Train Epoch: 0 Train Iteration: 30 Time 4.248s / 10iters, (0.425) Forward Time 2.557s / 10iters, (0.256) Backward Time 1.583s / 10iters, (0.158) Loss Time 0.075s / 10iters, (0.007) Data load 0.033s / 10iters, (0.003317) Is it because there aren't enough training epochs? |
I encountered the same error, have you solved this problem? |
I encountered the same problem when I changed the architecture to my own model. In my case, I found some elements of After making a small change from ContrastiveSeg/lib/loss/loss_contrast.py Line 121 in 287e5d3
to log_prob = logits - torch.log(exp_logits + neg_logits + 1e-10) Everything was going well. |
That makes sense for me, thanks for your help!! |
Running the t code shows loss=nan when calculating the coco dataset.
The text was updated successfully, but these errors were encountered: