[rllib] Auto clip actions to Box space range; deprecate squash_to_range #3426
Test FAILed.
jenkins retest this please
@@ -61,6 +61,8 @@
# Whether to clip rewards prior to experience postprocessing. Setting to
# None means clip for Atari only.
"clip_rewards": None,
# Whether to np.clip() actions to the action space low/high range spec.
"clip_actions": True,
when would you want this to be false?
For backwards compatibility I think it's a reasonable option to have.
Co-Authored-By: ericl <ekhliang@gmail.com>
What do these changes do?
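Per the discussion at https://groups.google.com/forum/#!topic/ray-dev/loJOOU5LzDk, squash_to_range incorrectly zeros out gradients and can cause divergence. This PR deprecates squash_to_range and instead clips actions to the Box space low/high range automatically, controlled by the new clip_actions option (default True).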
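
For illustration, here is a minimal sketch of what clipping to the Box range amounts to. It is not the code in this PR: `clip_action` and the bounds below are hypothetical, and the only library call assumed is `np.clip`, which the config comment names. The point is that only the copy of the action handed to the environment is clipped, while the value sampled by the policy is left as-is.

```python
import numpy as np


def clip_action(action, low, high):
    """Clip an action elementwise into [low, high].

    Hypothetical helper for illustration; not RLlib's implementation.
    """
    return np.clip(np.asarray(action, dtype=np.float64), low, high)


# Hypothetical Box bounds for a 3-dimensional continuous action space.
low = np.array([-1.0, -1.0, 0.0])
high = np.array([1.0, 1.0, 2.0])

# An unbounded sample, e.g. drawn from a Gaussian policy.
sampled = np.array([1.7, -0.3, -0.5])

# Clip only the copy that is sent to the environment.
env_action = clip_action(sampled, low, high)
print(env_action)
```

With `"clip_actions": False` the sampled action would be passed through unchanged, which is the backwards-compatible behavior discussed in the review thread.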