feature(lxy): modify ppof rewardclip and add atari config #589

Merged (6 commits) on Mar 6, 2023

karroyan
Collaborator

Description

Related Issue

TODO

Check List

  • merge the latest version source branch/repo, and resolve all the conflicts
  • pass style check
  • pass all the tests

@karroyan added the labels algo (Add new algorithm or improve old one) and config (Update config) on Feb 21, 2023
codecov bot commented Feb 21, 2023

Codecov Report

Merging #589 (2ca2234) into main (1e6f503) will decrease coverage by 0.04%.
The diff coverage is 16.98%.

❗ Current head 2ca2234 differs from the pull request's most recent head 47bb5a4. Consider uploading reports for commit 47bb5a4 to get more accurate results.

@@            Coverage Diff             @@
##             main     #589      +/-   ##
==========================================
- Coverage   83.53%   83.50%   -0.04%     
==========================================
  Files         564      564              
  Lines       46201    46241      +40     
==========================================
+ Hits        38596    38613      +17     
- Misses       7605     7628      +23     
| Flag | Coverage Δ |
|---|---|
| unittests | 83.50% <16.98%> (-0.04%) ⬇️ |

| Impacted Files | Coverage Δ |
|---|---|
| ding/bonus/config.py | 0.00% <0.00%> (ø) |
| ding/bonus/ppof.py | 0.00% <0.00%> (ø) |
| ding/envs/env/default_wrapper.py | 42.85% <50.00%> (+0.54%) ⬆️ |
| ding/envs/env/ding_env_wrapper.py | 84.51% <87.50%> (+0.10%) ⬆️ |
| ding/framework/message_queue/tests/test_nng.py | 42.30% <0.00%> (-57.70%) ⬇️ |
| ding/utils/data/dataloader.py | 47.15% <0.00%> (+1.03%) ⬆️ |
| ding/envs/env_manager/subprocess_env_manager.py | 75.85% <0.00%> (+1.06%) ⬆️ |
| ding/framework/message_queue/redis.py | 84.90% <0.00%> (+5.66%) ⬆️ |
| ding/framework/message_queue/tests/test_redis.py | 100.00% <0.00%> (+38.46%) ⬆️ |

@@ -77,6 +77,39 @@ def get_instance_config(env: str) -> EasyDict:
critic_head_hidden_size=256,
actor_head_hidden_size=256,
)
elif env == 'qbert':
Member

qbert can reuse the same config as kangaroo and bowling.
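
The suggestion above could be realized by letting the three games share a single branch. The following is only a minimal sketch under stated assumptions: a plain `dict` stands in for `EasyDict`, the field values are placeholders rather than the PR's actual hyperparameters, and `SHARED_ATARI_ENVS` and `get_instance_config_sketch` are hypothetical names that do not exist in DI-engine.

```python
# Hypothetical sketch, not DI-engine code: names and values are placeholders.
SHARED_ATARI_ENVS = ('kangaroo', 'bowling', 'qbert')


def get_instance_config_sketch(env: str) -> dict:
    """Return one shared config for every game listed in SHARED_ATARI_ENVS."""
    if env in SHARED_ATARI_ENVS:
        # A single branch serves all three games, so no per-game copy is needed.
        return dict(
            env_id=env,
            critic_head_hidden_size=256,  # placeholder value
            actor_head_hidden_size=256,  # placeholder value
        )
    raise KeyError("not supported env type: {}".format(env))
```

With this shape, supporting another game that uses the same hyperparameters means adding one entry to the tuple instead of another `elif` branch.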

@@ -152,6 +185,36 @@ def get_instance_env(env: str) -> BaseEnv:
},
seed_api=False,
)
elif env == 'qbert':
Member

Unify these branches and remove the unused code.

@@ -152,6 +185,36 @@ def get_instance_env(env: str) -> BaseEnv:
},
seed_api=False,
)
elif env == 'qbert':
Member

Rename this to atari_qbert.
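
One possible way to apply the rename while keeping the env branches unified is a lookup table keyed by prefixed names. This is a sketch under assumptions, not the merged code: `ATARI_ENV_IDS` and `atari_env_kwargs` are hypothetical names, applying the `atari_` prefix to the other games is a guess, and the Gym IDs shown are the common `NoFrameskip-v4` variants, which may differ from what the PR uses.

```python
# Hypothetical sketch: the table and helper below are illustrative only.
ATARI_ENV_IDS = {
    'atari_qbert': 'QbertNoFrameskip-v4',
    'atari_kangaroo': 'KangarooNoFrameskip-v4',
    'atari_bowling': 'BowlingNoFrameskip-v4',
}


def atari_env_kwargs(env: str) -> dict:
    """Build keyword arguments for an Atari env from its prefixed name."""
    if env not in ATARI_ENV_IDS:
        raise KeyError("not supported env type: {}".format(env))
    # Every Atari game goes through the same construction path.
    return {'env_id': ATARI_ENV_IDS[env], 'seed_api': False}
```

The prefix makes the environment family explicit at the call site, and an unprefixed name such as `'qbert'` fails early with a `KeyError`.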

@@ -86,7 +91,7 @@ def train(
logging.debug(self.policy._model)
# define env and policy
collector_env = self._setup_env_manager(collector_env_num, context, debug)
Member

Add the collector here.

ding/bonus/ppof.py (conversation resolved)
@PaParaZz1 PaParaZz1 merged commit b7ce258 into opendilab:main Mar 6, 2023
Labels: algo (Add new algorithm or improve old one), config (Update config)
2 participants