NaN is encountered when training imagenet_rnn_gate_rl_50 #5

Open
Willy0919 opened this issue Oct 26, 2018 · 1 comment

Comments

@Willy0919

I ran into a problem when training imagenet_rnn_gate_rl_50 from the provided pretrained sp model:

10-26-18 02:30:Epoch: [4][4010/5004] Time 1.793 (1.839) Data 0.000 (0.003) Loss nan (nan) Total rewards nan (nan) Prec@1 0.391 (75.964) Prec@5 1.953 (91.181)
10-26-18 02:30:total gate rewards = 2.560
10-26-18 02:30:*** Computation Percentage: 97.532 %
10-26-18 02:30:Epoch: [4][4020/5004] Time 1.754 (1.838) Data 0.000 (0.003) Loss nan (nan) Total rewards nan (nan) Prec@1 0.000 (75.775) Prec@5 0.000 (90.954)

I didn't change any of the default configuration, yet the loss and the rewards became NaN. Is there anything else I need to pay attention to? Please help.

Thanks,
Willy
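
In the excerpt above the running averages (the values in parentheses) are already `nan`, so the first bad iteration happened before these lines. A minimal sketch for locating it in a saved log follows, assuming the log keeps the `Epoch: [e][i/n] ... Loss x (avg)` layout shown above. The names `LINE_RE` and `first_nan_step` are hypothetical helpers, not part of the repository.

```python
import re

# Assumed layout, copied from the excerpt in this issue:
#   "... Epoch: [4][4010/5004] Time ... Loss nan (nan) ..."
LINE_RE = re.compile(
    r"Epoch: \[(?P<epoch>\d+)\]\[(?P<step>\d+)/(?P<total>\d+)\].*?"
    r"Loss (?P<loss>\S+) \((?P<avg>\S+)\)"
)


def first_nan_step(lines):
    """Return (epoch, step) of the first logged line whose current loss is NaN.

    Lines that do not match the assumed layout are skipped.
    Returns None if no NaN loss is found.
    """
    for line in lines:
        match = LINE_RE.search(line)
        if match and match.group("loss").lower() == "nan":
            return int(match.group("epoch")), int(match.group("step"))
    return None


# The two "Epoch" lines and one auxiliary line from the excerpt above.
log = [
    "10-26-18 02:30:Epoch: [4][4010/5004] Time 1.793 (1.839) Data 0.000 (0.003) "
    "Loss nan (nan) Total rewards nan (nan) Prec@1 0.391 (75.964) Prec@5 1.953 (91.181)",
    "10-26-18 02:30:total gate rewards = 2.560",
    "10-26-18 02:30:Epoch: [4][4020/5004] Time 1.754 (1.838) Data 0.000 (0.003) "
    "Loss nan (nan) Total rewards nan (nan) Prec@1 0.000 (75.775) Prec@5 0.000 (90.954)",
]

print(first_nan_step(log))  # (4, 4010) for this excerpt only
```

Running it over the full log file (for example `first_nan_step(open(path))`, with `path` pointing at your own log) gives the iteration to inspect; the lines just before it usually show whether the loss was growing steadily or jumped at once.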

@xinw1012
Collaborator

xinw1012 commented Dec 29, 2018

Really sorry about the late reply. Not sure if you have already solved the issue... Did you try reducing the learning rate? If this is still bothering you, I can take a look at the code and re-run the experiments to check.

Thanks,
Xin
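
One way to try the learning-rate suggestion above without restarting by hand is to skip any update whose loss is not finite and lower the learning rate at that point. The sketch below is framework-agnostic plain Python and is not the repository's training code. `guard_step`, the halving factor, and the starting rate of `0.01` are all assumptions chosen for illustration.

```python
import math


def guard_step(loss, lr, factor=0.5, min_lr=1e-8):
    """Decide whether to apply an update and which learning rate to use next.

    Returns (apply_update, new_lr):
      - finite loss: apply the update and keep the learning rate;
      - NaN or inf loss: skip the update and shrink the learning rate
        by `factor`, never going below `min_lr`.
    """
    if math.isfinite(loss):
        return True, lr
    return False, max(lr * factor, min_lr)


# Hypothetical loss sequence; in a real loop each value would be the
# batch loss converted to a Python float.
losses = [2.3, 2.1, float("nan"), 2.0]
lr = 0.01  # hypothetical starting value, not the project's default
applied = []

for loss in losses:
    apply_update, lr = guard_step(loss, lr)
    applied.append(apply_update)
    # if apply_update: run the backward pass and optimizer step here

print(applied)  # [True, True, False, True]
print(lr)       # 0.005
```

Skipping the step only helps if the model weights are still finite. Once a NaN has reached the weights, every later loss is NaN as well, and training has to resume from the last good checkpoint with the lower rate.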
