feature(lxy): modify ppof rewardclip and add atari config #589
Codecov Report
@@            Coverage Diff             @@
##             main     #589      +/-   ##
==========================================
- Coverage   83.53%   83.50%   -0.04%
==========================================
  Files         564      564
  Lines       46201    46241      +40
==========================================
+ Hits        38596    38613      +17
- Misses       7605     7628      +23
ding/bonus/config.py
Outdated
@@ -77,6 +77,39 @@ def get_instance_config(env: str) -> EasyDict:
critic_head_hidden_size=256,
actor_head_hidden_size=256,
)
elif env == 'qbert':
qbert can use the same config as kangaroo and bowling
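One way to act on this suggestion is to let a single branch serve all three games. The following is a minimal sketch, not the PR's code: the tuple `SHARED_ATARI_ENVS`, the `model` nesting, and the use of a plain `dict` in place of `EasyDict` are assumptions made for illustration, and the hidden sizes are placeholder values.

```python
# Hypothetical sketch: one branch shared by several Atari games.
# SHARED_ATARI_ENVS and the `model` nesting are assumptions, not taken from the PR.
SHARED_ATARI_ENVS = ('kangaroo', 'bowling', 'qbert')


def get_instance_config(env: str) -> dict:
    """Return a config for `env`; a plain dict stands in for EasyDict here."""
    if env in SHARED_ATARI_ENVS:
        # Placeholder values; the real config would carry the full set of fields.
        cfg = dict(
            model=dict(
                critic_head_hidden_size=256,
                actor_head_hidden_size=256,
            ),
        )
    else:
        raise KeyError("not supported env type: {}".format(env))
    return cfg
```

With a membership test, supporting another game that shares the config only requires adding its name to the tuple rather than copying a whole `elif` branch.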
ding/bonus/config.py
Outdated
@@ -152,6 +185,36 @@ def get_instance_env(env: str) -> BaseEnv:
},
seed_api=False,
)
elif env == 'qbert':
Unify this code and remove the unused code.
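A possible shape for the unified version is a lookup table plus one shared builder, so each game no longer needs its own `elif` branch. This is only a sketch under stated assumptions: `ATARI_ENV_IDS`, `build_atari_env_cfg`, and `get_instance_env_cfg` are hypothetical names, the env id strings are guesses at the usual Gym Atari ids, and the builder returns a plain dict rather than the `BaseEnv` that the real `get_instance_env` returns.

```python
from typing import Dict

# Hypothetical mapping from short names to Gym env ids (assumed, not from the PR).
ATARI_ENV_IDS: Dict[str, str] = {
    'qbert': 'QbertNoFrameskip-v4',
    'kangaroo': 'KangarooNoFrameskip-v4',
    'bowling': 'BowlingNoFrameskip-v4',
}


def build_atari_env_cfg(env_id: str) -> dict:
    """Shared builder; stands in for whatever each per-game branch constructs."""
    return dict(env_id=env_id)


def get_instance_env_cfg(env: str) -> dict:
    """Resolve `env` through the table instead of one elif branch per game."""
    try:
        env_id = ATARI_ENV_IDS[env]
    except KeyError:
        raise KeyError("not supported env type: {}".format(env))
    return build_atari_env_cfg(env_id)
```

Settings that differ between games can live in the table values, which keeps the per-game differences visible in one place.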
ding/bonus/config.py
Outdated
@@ -152,6 +185,36 @@ def get_instance_env(env: str) -> BaseEnv:
},
seed_api=False,
)
elif env == 'qbert':
rename to atari_qbert
ding/bonus/ppof.py
Outdated
@@ -86,7 +91,7 @@ def train(
logging.debug(self.policy._model)
# define env and policy
collector_env = self._setup_env_manager(collector_env_num, context, debug)
add collector here