Unmatching size and error #3
Comments
Thanks for your question, but I won't be available for a few days.
@transedward Thanks for your reply. I tested with PyTorch 0.2.0.post1 (0.2.0.1) and Python 3.5.3 under Anaconda, on Ubuntu 16.04.
Hi, thanks for sharing your wonderful code.
I ran into some errors when running it, though.

In lines 197~205 of dqn_learn.py, the sizes of target_Q_values and current_Q_values do not match. To make the sizes agree, I added next_max_q = next_max_q.unsqueeze(-1). I also changed line 203 to use rew_batch[0].

(IMO) After records are stacked in the replay buffer, queuing the action does not work properly. I changed line 158 to
action = select_epilson_greedy_action(Q, recent_observations, t)
however, a different action value still gets queued.

I am still working on these but am having trouble. Could you help me get them right?
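
For anyone hitting the same size mismatch, here is a minimal sketch of the shape arithmetic that may be behind it. It is pure Python and only models the NumPy-style broadcasting rule that PyTorch also follows; it is not the code from dqn_learn.py. The batch size of 32, the helper name broadcast_shape, and the assumed shapes (current Q-values as a column of shape (batch, 1), next_max_q as a flat vector of shape (batch,)) are all hypothetical and chosen only to illustrate why unsqueeze(-1) would make the sizes agree.

```python
def broadcast_shape(a, b):
    """Return the broadcast shape of two shape tuples (NumPy-style rule).

    Hypothetical helper for illustration; raises ValueError if the
    shapes cannot be broadcast together.
    """
    result = []
    # Compare dimensions from the trailing end, padding the shorter shape with 1s.
    for i in range(1, max(len(a), len(b)) + 1):
        da = a[-i] if i <= len(a) else 1
        db = b[-i] if i <= len(b) else 1
        if da != db and da != 1 and db != 1:
            raise ValueError("shapes %r and %r are not broadcastable" % (a, b))
        # A dimension of size 1 stretches to match the other one.
        result.append(db if da == 1 else da)
    return tuple(reversed(result))


batch_size = 32  # assumed value, not taken from the repository

# Assumption: one Q-value per sample, kept as a column.
current_q_shape = (batch_size, 1)
# Assumption: max over the action dimension drops that dimension.
next_max_q_shape = (batch_size,)

# Combining the two directly silently expands to a square matrix.
mismatched = broadcast_shape(current_q_shape, next_max_q_shape)

# Appending a trailing dimension of size 1 (what unsqueeze(-1) does) fixes it.
fixed = broadcast_shape(current_q_shape, next_max_q_shape + (1,))

print(mismatched)  # (32, 32)
print(fixed)       # (32, 1)
```

If the shapes in the actual code differ from these assumptions, printing the size of each tensor just before the loss computation is a quick way to confirm which side needs the extra dimension.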