polish(pu): polish the nstep_return_ngu and null_padding action in NGU #116
Commits on Jul 8, 2021
- 90b2797 style(nyz): update badges with actions and issues (opendilab authored Jul 8, 2021)
- e95b01a
Commits on Jul 10, 2021
- 3b88f70
- 1567939
- 539e8ac
- cc9f682
- 0ee3636
- f15d444
- 43fab4e
- 5adf800 hotfix(nyz): fix subprocess env manager state transition bug and exec usage in setup.py
- 98b938a
- 240d013
- 814ab70
- ce54633
Commits on Jul 12, 2021
- d5fe8a6
- 742d513
- 7f9b8e3
- 25e43c2 Merge pull request opendilab#4 from PaParaZz1/doc/separate
  separate doc from main repo to doc repo
- 71a7ec5
- 33e5088 adapt unittest to github actions (opendilab#6)
  * test(nyz): comment subprocess env manager and parallel entry unittest
  * fix(nyz): try to fix test_ppg and flask_fs_collector test close
  * test(nyz): modify unittest worker number
  * test(nyz) fix unittest worker and ignore 1v1 collector test
  * test(nyz): test different range for unittest(env, rl_utils, entry, interaction)
  * test(nyz): test different range for unittest(env, rl_utils, entry, interaction, league, model) and add execution timeout
  * test(nyz): test different range for unittest(env, rl_utils, entry, interaction, league, model)
  * test(nyz): test different range for unittest(env, rl_utils, entry, interaction, league, model, torch_utils)
  * test(nyz): fix test DingEnvWrapper unittest env bug
  * test(nyz): add utils unittest and disable dataloader unittest
  * test(nyz): simplify reward model unittest
  * test(nyz): enable all the unittest except dataloader
  * test(nyz): enable parallel entry and dataloader unittest
  * test(nyz): fix test ppg rerun bug
  * test(nyz): enable windows test
  * test(nyz): disable subprocess env manager unittest
  * test(nyz): fix test auto checkpoint bug
  * test(nyz): disable test dataloader
  * test(nyz): enable subprocess env manager unittest
  * test(nyz): update coveragerc
  * test(nyz): add coverage upload workflow
  * test(nyz): disable test_block in subprocess env manager
  * test(nyz): enable rerun in test demo buffer
- e41864d
- e322083
Commits on Jul 13, 2021
- df81bee
- f92680a badge(hansbug): add LoC and Documentation Percentage badge to README.md && update code coverage badge (opendilab#8)
- 966a6d7
- 7accb4e
Commits on Jul 14, 2021
- 557a44c
- 582f707
Commits on Jul 16, 2021
- e02f328 polish(nyz): codestyle optimization by lgtm (opendilab#7)
  * refactor(nyz): refactor read_config to 3 different function interface
  * feature(nyz): enable env_setting param in entry
  * polish(nyz): remove redundant code and global declaration
  * polish(nyz): remove flag in import_helper
  * polish(nyz): remove unused import
  * style(nyz): correct format
- 1849da7
- 4379a8c
- f3d3c31
- fad58a3 Merge pull request opendilab#10 from opendilab/dev-output-structure
  refactor output structure (opendilab#10)
- f6c7c01
- a59907a
Commits on Jul 19, 2021
- 58a25cf
- 436f5ac
Commits on Jul 21, 2021
- 0af6686
- b0aa5dc
- 5b1aae8
Commits on Jul 22, 2021
- 1bd8dd2
- f55ec45 Merge pull request opendilab#13 from opendilab/dev-minigrid
  minigrid env (opendilab#13)
Commits on Jul 23, 2021
- 1e00390
- 494b397 Merge branch 'main' of https://github.com/opendilab/DI-engine into dev-on-policy
  # Conflicts:
  #   ding/policy/common_utils.py
  #   ding/policy/ppo.py
Commits on Jul 29, 2021
- 55a3dd1 Merge pull request opendilab#9 from YinminZhang/dev-on-policy
  on policy ppo (opendilab#9)
- a7b4126
- c231322 add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config. (opendilab#11)
  * add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config.
  * modified as reviewer mentions
  * add qacd network
  * fix bugs
  * update dizoo readme
  * add README.md about max reward result
  * update dqn config and update README
  Co-authored-by: shenziju <simonshen2011@foxmail.com>
  Co-authored-by: simon shen <simon@simondeMacBook-Air.local>
  Co-authored-by: Swain <niuyazhe314@outlook.com>
- bba994d
Commits on Aug 1, 2021
- 0796249 add pybullet env (opendilab#16)
  * add pybullet envs.
  * add td3/ddpg/sac/ppo configs for pybullet.
  * update td3/ddpg/sac/ppo configs for pybullet.
  * update td3 configs; remove td3 model.
- cc4accd add ACER algorithm(szj) (opendilab#14)
  * add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config.
  * modified as reviewer mentions
  * add qacd network
  * fix bugs
  * fix bugs
  * update acer algorithm
  * update ACER code
  * update acer config
  * fix bug
  * update pong acer's config
  * edit commit
  * update code as mention
  * fix the comment table and trust region
  * fix format
  * fix typing lint
  * fix format,flake8
  * fix format
  * fix whitespace problem
  * test(nyz): add acer unittest and algotest
  * style(nyz): correct flake8 style
  Co-authored-by: shenziju <simonshen2011@foxmail.com>
  Co-authored-by: Swain <niuyazhe314@outlook.com>
Commits on Aug 2, 2021
- c24e7a5 serial training league demo (opendilab#12)
  * feature(nyz): add naive 1v1 two player demo
  * feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation
  * feature(nyz): modify game env and adjust hyper-param
  * feature(nyz): add naive league training multi player demo
  * feature(nyz): enable force snapshot to support init historical league player; finish league demo basic code
  * feature(nyz): modify selfplay demo and add two type game env
  * style(nyz): correct format style
  * polish(nyz): correct format style and adapt league demo main
  * feature(nyz): add league payoff viz and enable payoff update in league demo
  * feature(nyz): modify win rate calculation with draws
  * test(nyz): fix one vs one league test compatibility bug
  * test(nyz): add selfplay and league demo into unittest and algotest
  * style(nyz): correct format
  * hotfix(nyz): fix ppo continuous comatibility bug
- e67ad91
- a8bfc44
Commits on Aug 3, 2021
- cd71492
- e44f858 feature(lj): add new smac benchmark add qtran algo, polish env readme,fix double_q bug
- 24e9a57
- b9e3b7f hotfix(nyz): fix return bug when adv_norm=True and remove unused normalize_advantage field
Commits on Aug 6, 2021
- 82380f9
Commits on Aug 10, 2021
- 2df687f add overcooked environment (opendilab#20)
  * init runable ppo
  * init overcooked env
  * overcooked ppo in place
  * runable ppo with shaped rewards
  * modified config
  * feature(nyz): modify win rate calculation with draws
  * remove redundant code, modified baseline model
  * Update __init__.py
  * Update config.py
  * modify temp_config_file.close() position in config.py to work in windows os
  * remove redundant comments and rename files
  * fix name bug and use namedlist
  * add simple readme and remove redundant comments from copies
  * resolve threads
  * remove debug comments
  Co-authored-by: niuyazhe <niuyazhe314@outlook.com>
Commits on Aug 11, 2021
- 243edf1
- 3a2a637
Commits on Aug 13, 2021
- a69617e fix ACER's bug. update Qbert and space invader's config and result (opendilab#21)
  * fix weight bug
  * update acer qbert result
  * fix flake8 format problem
  * update space qbert config
  * update as review
  Co-authored-by: shenziju <simonshen2011@foxmail.com>
Commits on Aug 18, 2021
- 5e47a9a
Commits on Aug 19, 2021
- dc7b778
- 1c18bcd
  * add sqil
  * conceal all the personal info
  * revise according to the comments
  * correct_format
  * add_comment to hardcodes part
  * pass flake8
  * add force_reproducibility = True; device, ex_model
  * check format
Commits on Aug 23, 2021
- 528e466 Dev modified predator prey (opendilab#30)
  * add modifiled predator_prey env
  * add collision_ratio
  * add readme and cfg for modified_predator_prey env
  * add readme imgs for modified_predator_prey
  * check format
  * fix format
- a8cb80c
- c52b75d
Commits on Aug 24, 2021
- 7d0865c
- 6678605
- 2a80096 test(nyz): add sqil unittest and algotest, remove adder comment in policy, polish sqil config
Commits on Aug 25, 2021
- 98f1ad1
- 8cb22d6
Commits on Aug 26, 2021
- 7dea20f
Commits on Aug 27, 2021
- 0165498
- 731e80a
- adc6e6e hotfix(nyz): fix random policy typo in serial entry and base policy model device problem
- bf3f04f
Commits on Aug 31, 2021
- 92ca261
Commits on Sep 2, 2021
- 8eed6a9
- 2ea7212
  * feature(nyz): add trueskill as league metric, naive elo calculator, fix game_env info bug
  * fix(nyz): fix league player mutate bug
  * fix(nyz): fix league unittest bug
  * feature(nyz): add elo ranking in league metric env
  * polish(nyz): modify fixed eval policy and trueskill init
  * feature(nyz): add init main player in evaluation and fix stop_value bug
  * style(nyz): rename test_league_metric to avoid pyc cache bug
- bb7f651
- e5ba933
Commits on Sep 3, 2021
- ee6812d
- 1174cd7 feature(zlx): add tb in naive buffer; modify tb in advanced buffer (opendilab#39)
  * feature(zlx): Add tb in naive buffer; modify tb in advanced buffer
  * feature(zlx): naive_buffer tb, fix bug in valid_count update
- 4543b33
Commits on Sep 6, 2021
- babc6bd fix(pu): fix r2d2 bug (opendilab#36)
  * test rnd
  * fix mz config
  * fix config
  * fix(pu): fix r2d2
  * feature(puyuan): add minigrid r2d2 config
  * polish minigrid config
  * modified as review
  * fix(pu): fix bugffor compatibility
  * polish(pu): add annotations and polish slice operation
  * style(pu): run format.sh
  * style(pu): correct yapf format
- 19164ac enable user to use any expert model for sqil(opendilab#44)
  * enable user to use any model generated here
  * delete irelevant package
  * add test
  * bash format.sh to reformat style
- 9614088
- ef39a9c
- 2b52cc4 feature(zym): add offlineRL algo CQL; add offlineRL env D4RL (opendilab#37)
  * feature(zym): add pybullet env info; add entropy type in sac.
  * feature(zym): add cql; add serial entry for offlineRL.
  * feature/polish(zym): add generation entry in mujoco env for offlineRL; polish cql/serial entry for offlineRL.
  * feature(lj): add d4rl env for offlineRL.
  * polish(zym): polish cql.
  * feature/polish(zym): add dataset registry; polish offlineRL pipeline.
  * fix(zym): fix bug in d4rl/mujoco config; fix bug in dataset for offlineRL.
  * style(zym): add pybulletgym and d4rl requirements in setup.
  * fix/polish(zym): support str in NaiveRLDataset; polish cql.
  * polish(zym): polish command policy.
  * feature(zym): add cql in pendulum env; add unittest/algotest for cql.
  * fix(zym): fix cql bug in unittest/algotest for cql.
Commits on Sep 7, 2021
- f4c920b
- 469307f
Commits on Sep 8, 2021
- df65cb3 style(wyh): add env information in readme (opendilab#46)
  * env-list
  * env-list-fix-grammmer
  * env-only-test
  * modify-gif
  * modify-gif-pendulum
  * modify-gif-delect-maze
- b79e456
- f23141f
- a70d639 feature(nyz): add supervised learning image classification training demo (opendilab#27)
  * feature(nyz): add resnet for cv sl task
  * feature(nyz): add imagenet classification dataset and adapt compile config for sl
  * feature(nyz): add naive image training entry demo
  * style(nyz): polish image cls train log
  * polish(nyz): polish multi gpu training setting
  * feature(nyz): add nn training bp and update async execution
  * feature(nyz): add distributed sampler for different dist backend
  * fix(nyz): fix compile config collector and buffer compatibility problem
  * style(nyz): correct yapf format
  * fix(nyz): fix env manager compile config compatibility bug
  * refactor(nyz): abstarct ISerialEvaluator and rename serial evaluation implementation
  * refactor(nyz): refactor collector name
  * feature(nyz): add metric evaluator and image cls acc metric eval demo
  * fix(nyz): fix cuda and multi gpu bug in image cls demo
Commits on Sep 9, 2021
- 9645549
- 1248c70 feature(lqp): add k8s launcher and di-orchestrator launcher (opendilab#45)
  * feat: add k8s launcher
  * feat: install kubectl when install k3d
  * feat: add orchestrator launcher and a test case
  * ci: install kubernetes related package and cli
  * style: format code
  * style: flake check code
  * test k8s launcher
  * ci: change back to unit test
  * feat: delete cert manager when delete orchestrator
  * style: flake8 check
  * feat: merge k8s-launcher with k8s-helper
    1. merge k8s-launcher with k8s-helper
    2. move kubernetes package import to where it will be used
    3. hack/install-k8s-tools.sh -> ding/scripts/install-k8s-tools.sh
Commits on Sep 11, 2021
- b34e0c6
- 2bf67b3
Commits on Sep 13, 2021
- 8515323 feature(lqp): add cartpole dqn algorithm test running in k8s cluster (opendilab#49)
  * test dijob
  * test: wait for dijob Succeeded phase, and read coordinator logs
  * test: update wait condition
  * ci: update algo_test.yaml and flake check
  * test: move kubernetes package to where it will be used
- f0a1522 fix(wyh): mappo nan bug and dict obs cannot unsqueeze bug (opendilab#54)
  * fix_mappo_bug_masknan_and_dict_cannot_unsqueeze
  * squeeze_bug
- f458623 feature(lk): add hyper-parameter scheduler module (opendilab#38)
  * feature(nyz): add trueskill as league metric, naive elo calculator, fix game_env info bug
  * fix(nyz): fix league player mutate bug
  * fix(nyz): fix league unittest bug
  * feature(nyz): add elo ranking in league metric env
  * polish(nyz): modify fixed eval policy and trueskill init
  * add_scheduler_module
  * fix_change_range_and_factor
  * cooldown_counter_bug_fix
  * add_div_mode
  * code_format_fixed
  * fix_pr_bug
  * add_unnitest_module
  * add_patience_test
  * polish(nyz): polish scheduler design and fix league mode scheduler bug
  * fix(nyz): fix merge test_metric.py bug
  Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
  Co-authored-by: 李可 <like2@CN0014008466M.local>
- 219f78d fix(wyh):formatted config no eval bug (opendilab#53)
  * fix_formatted_config_bug_eval
  * fix(wyh):add config pytest
Commits on Sep 14, 2021
- 429479e
Commits on Sep 17, 2021
- cabd936
- 272ba12 fix(pu): fix r2d2 done slice bug and LSTM hidden state reset bug (opendilab#52)
  * test rnd
  * fix mz config
  * fix config
  * fix(pu): fix r2d2
  * fix(puyuan): fix r2d2
  * feature(puyuan): add minigrid r2d2 config
  * polish minigrid config
  * modified as review
  * fix(pu): fix bugffor compatibility
  * polish(pu): add annotations and polish slice operation
  * style(pu): run format.sh
  * style(pu): correct yapf format
  * fix(pu): fix config
  * fix(pu): fix done slice bug and lstm reset bug
  * style(pu): format config
  * polish(pu): polish config params for cartpole, lunarlander and minigrid
  * polish(pu): polish minigrid config params
  * Update r2d2.py
  * polish(pu): polish rnn reset problem
  * fix(pu): fix merge error
  * polish(pu): polish cartpole config
  * polish(nyz): polish cartpole r2d2 config for faster convergence
  * test(nyz): enable r2d2 algotest
  Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
- 71d6738 feature(crb): update multi discrete policy(dqn, ppo, rainbow) (opendilab#51)
  * update md_dqn
  * update offpolicy ppo
  * add rainbow md policy
  * format code
  * del ppo; leave to future updates
  * add doc string; fix rainbow returns
- 438c622 feature(davide): add BSuite environment wrapper (opendilab#58)
  * start implementing bsuite env
  * add bsuite env
  * Implemented
  * removed unused file
  * added cartpole_swing environment
  * Update test_bsuite_env.py
  * added env in readme and in setup.py
  * Create bsuite.png
Commits on Sep 22, 2021
- 7c88b7e
- 7a94c0d fix(wyh): add plot function (opendilab#59)
  * fix(wyh): plot function
  * fix(wyh): plot function pytest
  * fix(wyh):plot function modify comments
  * feature(wyh):plot style
  Co-authored-by: weiyuhong <weiyuhong@sensetime.com>
- 11124de feature(wyh):mappo and ippo win rate and time (opendilab#62)
  * feature(wyh):mappo and ippo win rate and time
  * feature(wyh):mappo and ippo epymarl win rate and time
  * feature(wyh):smac epymarl commit id
  Co-authored-by: 卫昱宏 <SENSETIME\weiyuhong@cn0214000504l.domain.sensetime.com>
Commits on Sep 23, 2021
- 092a9e9
- 7c51151
Commits on Sep 24, 2021
- b5a9a6e
- 2ae40db
Commits on Sep 26, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 9edc924 - Browse repository at this point
Copy the full SHA 9edc924View commit details -
fix(hansbug): fix spawn context problem in interaction unittest (opendilab#69)
* fix(hansbug): try support spawn backend * fix(hansbug): try fix the xxxxing problem in interaction spawn support && reformat the code style * fix(nyz): disable silence decorator for spawn context interaction test Co-authored-by: HansBug <killog@126.com>
Commit 9918b6d
-
style(nyz): add docker deploy in github workflow (opendilab#70)
* style(nyz): add docker deploy workflow(enable docker) * style(nyz): fix docker push info(enable docker) * style(nyz): modify org name and image default name rule(enable docker) * style(nyz): change default version to date(enable docker)
Commit 132670e
-
fix(xjx): fix the catch statments that will never succeed in test networks; fix silence method (opendilab#71)
Commit 7ea4005
-
feature(crb): add multi-discrete ppo and off policy ppo (opendilab#72)
* add md ppo * add doc string
Commit f560c2c
Commits on Sep 28, 2021
-
feature(nyz): move atari_py to ale-py; split base and env docker build (opendilab#77)
* feature(nyz): move atari_py to ale-py and polish standard docker build(enable docker) * fix(nyz): fix atari env import bug(enable docker) * feature(nyz): add autorom install in docker(enable docker) * feature(nyz): split base and env docker build(enable docker) * fix(nyz): fix docker env source image bug(enable docker)
Commit d731276
-
feature(pu): add WQMIX algorithm (opendilab#24)
* add wqmix * update annotation * reformate * update annotation * update config * fix annotation * update as review * fix as review * add 5m6m MMM MMM2 config * reformate * fix(pu): fix rnn reset bug and add unittest * fix(pu): fix rnn reset bug in centrally-weighted wqmix * style(pu): yapf format and let WQMIXPolicy extend QMIXPolicy * fix(pu): fix wqmix policy extend bug * test(pu): add unittest test_wqmix * fix(pu): fix mixer key bug in particle config * feature(pu): add cooperative_navigation_wqmix_config * style(pu): yapf format * test(pu): change nn.Identity() to nn.Sequential() * fix(pu): fix unittest bug in test_wqmix
Commit fd8f7f2
Commits on Sep 29, 2021
-
feature(nyz): add mujoco docker (opendilab#78)
* feature(nyz): add docker_mujoco build and upgrada numpy version to 1.20.0(enable docker) * fix(nyz): fix numpy version compatibility bug and add -y option in apt-get(enable docker) * fix(nyz): add libosmesa6-dev in Dockerfile.env(enable docker) * fix(nyz): add permanent env variable about mujoco(enable docker) * fix(nyz): change sh source to .(enable docker) * fix(nyz): set env variable in bashrc(enable docker) * fix(nyz): fix pip typo(enable docker) * fix(nyz): add env in dockerfile(enable docker)
Commit a60cce5
-
Commit 9651471
-
feature(nyz): add smac docker (opendilab#80)
* style(nyz): add ctools.pysc2 import in smac env * feature(nyz): add smac docker build(enable docker, smac docker) * fix(nyz): fix if condition syntax in deploy(enable docker, smac docker) * fix(nyz): fix if condition syntax in deploy(enable docker, smac docker) * fix(nyz): remove cache layer in smac docker(enable docker, smac docker) * feature(nyz): use self-hosted runner in docker smac deploy(enable docker, smac docker) * feature(nyz): build smac docker manually(enable docker, smac docker) * feature(nyz): use docker buildx as default tool in smac and add SC2Map in setup(enable docker, smac docker) * feature(nyz): add __init__.py in smac env maps(enable docker, smac docker)
Commit 4d024b8
-
fix(nyz): fix ppg atari config bug, and ppg atari entry, and update default eval_freq
Commit d7c2f88
Commits on Sep 30, 2021
-
Commit a0ebd94
-
Commit 673188f
-
Commit f305978
-
Commit 5fb97e5
-
feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for offlineRL. (opendilab#68)
* feature(zym): add offlineRL algo Discrete CQL. * feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for offlineRL.
Commit 76240d6
-
feature(davide): Implementation of D4PG (opendilab#76)
* added experience replay and n-step * implementing distributional q value * added distributional q-value * added overview in qac_dist and d4pg * derived D4PG from DDPG * fixed a bug when action shape >1 * benchmark D4PG mujoco + minor fixs -entry for DDPG mujoco -entry for D4PG mujoco -config for D4PG mujoco -fixed style D4PG code -unittests for QAC distributional * formatted code * minor updates (read description) -added d4pg seria_entry test -updated comments in QACDIST -added d4pg in commander register -added q_value in d4pg return dict -added priority update in d4pg entry -added assertion in QACDIST
Commit b9edb7c
-
Commit 0779418
Commits on Oct 1, 2021
-
style(nyz): fix typo and release multi python version bug(enable docker, smac docker)
Commit 0040577
-
Commit 43f6319
-
Commit 20b1fd6
Commits on Oct 2, 2021
-
Commit 2950d49
-
Commit 4ee8e7e
Commits on Oct 8, 2021
-
feature(zlx): add vs bot training and self-play training with slime volley env (opendilab#23)
* slime volley env in dizoo, first commit * fix bug in slime volley env * modify volley env to satisfy ding 1v1 requirements; add naive self-play and league training pipeline(evaluator is not finished, now use a very naive one) * adopt volley builtin ai as default eval opponent * polish(nyz): polish slime_volley_env and its test * feature(nyz): add slime_volley vs bot ppo demo * feature(nyz): add battle_sample_serial_collector and adapt abnormal check in subprocess env manager * feature(nyz): add slime volley self-play demo * style(nyz): add slime_volleyball env gif and split MARL and selfplay label * feature(nyz): add save replay function in slime volleyball env Co-authored-by: zlx-sensetime <zhaoliangxuan@sensetime.com> Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
Commit 770ca0a
Commits on Oct 9, 2021
-
Commit e10ad49
Commits on Oct 12, 2021
-
feature(nyz): add gym-hybrid hybrid action space env (opendilab#86)
* feature(nyz): add gym-hybrid hybrid action space env * style(nyz): update readme for gym_hybrid env
Commit 58326ab
-
Commit 0a8d288
Commits on Oct 15, 2021
-
Commit 6c5670c
-
Commit b1c7300
-
fix(wyh): add test for rl_utils ppo and td (opendilab#89)
* fix(wyh):test rl_utils code * fix(wyh):modify rl utils bug ppo adv batch B,A * fix(wyh):style * fix(wyh):fix bug
Commit 6d09f79
Commits on Oct 16, 2021
-
Commit afe509c
-
feature(nyp): add DQfD algorithm (opendilab#48)
* add_dqfd * Is_expert to is_expert * modify according to the last commnets * value_gamma; done; marginloss; sqil compatibility * finally shorten the code, revise config * revise config, style * add_readme/two_more_config * correct format Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
Commit f6aad4c
-
fix(wyh): add model test and policy/entry test and remove unused qacd(opendilab#92)
* fix(wyh):model test and policy/entry test * fix(wyh):delect qacd * fix(wyh):test serial entry onpolicy
Commit e2c91ba
Commits on Oct 17, 2021
-
Commit 5b95bb2
Commits on Oct 19, 2021
-
Commit 842f9c2
Commits on Oct 20, 2021
-
Commit 5999dfc
Commits on Oct 21, 2021
-
Commit f4cfeae
-
feature(lk): add gym-soccer (HFO) env (opendilab#94)
* add_soccer_env * add_info * close * format * test_gym_soccer * rm_torch * replay_log * format_style * add_gym_soccer_to_readme * separate render_func * add_gif_file * scale_action * flake_style_format * resolve_review_comments * add branch info for gym hybrid
Commit 094ede9
-
feature(xjx): test in pure docker environment (opendilab#103)
* Test in docker * Add docker test entry * Trap exit * Test in docker
Commit e29627b
Commits on Oct 22, 2021
-
Commit a6c8662
-
Commit b48582c
-
polish(nyz): fix ppo bugs and update atari ppo offpolicy config (opendilab#108)
* fix(nyz): fix ppo cuda bug and random collect bug * config(nyz): add pong ppo off policy better config * fix(nyz): fix ppo device bug in get_train_sample and update ppo offpolicy config * style(nyz): correct yapf format
Commit 9e623c1
-
feature(zym): add offlineRL algo td3_bc and polish policy comments(opendilab#88)
* feature(zym): add offlineRL algo td3_bc. * feature(zym): add offlineRL algo td3_bc. * feature(zym): add offlineRL algo td3_bc. * polish(zym): polish some annotations in td3/ddpg/sac/ppo; polish `_forward_collect` and `_foward_eval`. * fix(lj): fix dimension bug in cql for continuous env. * fix(zym): fix dimension bug in cql for continuous env. * fix(zym): fix dimension bug in cql for continuous env. * polish(zym): update README.md.
Commit 30b8f79
Commits on Oct 25, 2021
-
test(wyh): add more unittest for ppo and sac policy (opendilab#104)
* fix(wyh):reward model test * fix(wyh):sac ppo test * fix(wyh):ppo_continuous test * fix(wyh):style * fix(wyh):ppo test Co-authored-by: Swain <niuyazhe314@outlook.com>
Commit 3c96dce
Commits on Oct 26, 2021
-
test(yzj): add unittest for dataset, metric_serial_evaluator and learner (opendilab#107)
* add 4 pytest dataset.py learner_aggregator.py learner_hook.py metric_serial_evaluator.py * fix yapf and flake8 And remove invalid self._env * fix fake_cls_config.py flake8
Commit 05fd79e
-
Commit ae11f79
Commits on Oct 28, 2021
-
feature(nyz): add gobigger baseline (opendilab#95)
* feature(nyz): add gobigger baseline * style(nyz): add gobigger env infor * feature(nyz): add ignore prefix in default collate * feautre(nyz): add vsbot training baseline * fix(nyz): fix to_tensor empty list bug and polish gobigger baseline * style(nyz): split gobigger baseline code
Commit 33ee048
Commits on Oct 29, 2021
-
feature(nyz): add PADDPG for hybrid action space as baseline (opendilab#109)
* fix(nyz): fix gym_hybrid env not scale action bug * feature(nyz): add PADDPG basic implementation for hybrid action space * fix(nyz): fix td3/d4pg comatibility bug with new modifications * fix(nyz): fix hybrid ddpg action type grad bug and update config * feature(nyz): add eps greedy + multinomial wrapper and gym_hybrid ddpg convergence config * style(nyz): update PADDPG in README * test_model_hybrid_qac * fix_typo_in_README * test_policy_hybrid_qac * polish(nyz): polish hybrid action space to dict structure and polish unittest * fix(nyz): fix td3bc compatibility bug Co-authored-by: 李可 <like2@CN0014008466M.local>
Commit 82a35d2
-
style(nyz): modify doc and deploy trigger and update mujoco license download link(smac docker)
Commit 7a0a3cc
-
Commit 1be01c7
-
feature(lcm): add MBPO algorithm (opendilab#113)
* feature(lcm): add MBPO algorithm (opendilab#87) * add model-based rl * fix yazhe's comments * format * pass flake8 test * polish(nyz): polish mbpo import, name and test Co-authored-by: lichuming <lichuming@lichumingdeMacBook-Pro.local>
Commit 121d4eb
Commits on Oct 31, 2021
-
Commit e42fb78
Commits on Nov 1, 2021
-
feature(pu): add NGU algorithm (opendilab#40)
* test rnd * fix mz config * fix config * feature(pu): fix r2d2, add beta to actor * feature(pu): add ngu-dev * fix(pu): fix r2d2 * fix(puyuan): fix r2d2 * feature(puyuan): add minigrid r2d2 config * polish minigrid config * dev-ngu * feature(pu): add action and reward as inputs of q network * feature(pu): add episodic reward model * feature(pu): add episodic reward model, modify r2d2 and collector for ngu * fix(pu): recover files that were changed by mistake * fix(pu): fix tblogger cnt bug * add_dqfd * Is_expert to is_expert * fix(pu): fix r2d2 bug * fix(pu): fix beta index to gamma bug * fix(pu): fix numerical stability problem * style(pu): flake8 format * fix(pu): fix rnd reward model train times * polish(pu): polish r2d2 reset problem * fix(pu): fix episodic reward normalize bug * polish(pu): polish config params and episodic_reward init value * modify according to the last commnets * value_gamma;done;marginloss;sqil compatibility * feature(pu): add r2d3 algorithm and config of lunarlander and pong * fix(pu): fix demo path bug * fix(pu): fix cuda bug at function get_gae in adder.py * feature(pu): add pong r2d2 config * polish(pu): r2d2 uses the mixture priority, episodic_reward transforms to mean 0 std1 * polish(pu): polish r2d2 config * test(pu): test cuda compatiality of dqfd_nstep_td_error in r2d3 * polish(pu): polish config * polish(pu): polish config and annotation * fix(pu): fix r2d2 target net update bug and done bug * polish(pu): polish pong r2d2 config and add montezuma r2d2 config * polish(pu): add some logs for debugging in r2d2 * polish(pu): recover config deleted by mistake * fix(pu): fix r2d3 config of lunarlander and pong * fix(pu): fix the r2d2 bug in r2d3 * fix(pu): fix r2d3 cpu device bug in fun dqfd_nstep_td_error of td.py * fix(pu): fix n_sample bug in serial_entry_r2d3 * polish(pu): polish minigrid r2d2 config * fix(pu): add info dict of fourrooms doorkey in minigrid_env * polish(pu): polish r2d2 config * fix(pu): fix expert policy collect traj bug, now we use the argmax_sample wrapper * fix(pu): fix r2d2 done and target update bug, polish config * fix(pu): fix null_padding transition obs to zeros * fix(pu): episodic_reward transform to [0,1] * fix(pu): fix the value_gamma bug * fix(pu): fix device bug in ngu_reward_model.py * fix(pu): fix null_padding problem in rnd and episodic reward model * polish(pu): polish config * fix(pu): use the deepcopy train_data to add bonus reward * polish(pu): add the operation of enlarging seq_length times to the last reward of the whole episode * fix(pu): fix the episode length 1 bug and weight intrinsic reward bug * feature(pu): add montezuma ngu config * fix(pu): fix lunarlander ngu unroll_len to 998 so that the sequence length is equal to the max step 1000 * test(pu): episodic reward transforms to [0,1] * fix(pu): fix r2d3 one-step rnn init bug and add r2d2_collect_traj * fix(pu): fix r2d2_collect_traj.py * feature(pu): add pong_r2d3_r2d2expert_config * polish(pu): yapf format * polish(pu): fix td.py conflict * polish(pu): flake8 format * polish(pu): add lambda_one_step_td key in dqfd error * test(pu): set key lambda_one_step_td and lambda_supervised_loss as 0 * style(pu): yapf format * style(pu): format * polish(nyz): fix ngu detailed compatibility error * fix(nyz): fix dqfd one_step td lambda bug * fix(pu): fix test_acer and test_rnd compatibility error Co-authored-by: Swain <niuyazhe314@outlook.com> Co-authored-by: Will_Nie <nieyunpengwill@hotmail.com>
Commit 99cba89
-
Commit e611ad2
-
Commit e574608
-
Commit 51643a5
-
Commit 89c5401
-
Commit 7040af3
-
Commit 3477b2c
Commits on Nov 2, 2021
-
Commit b217264
-
Commit ad0c8d0
Commits on Nov 3, 2021
-
fix(davide): small fix on bsuite environment (opendilab#117)
* small fix * added bsuite env version * modified test
Commit 79a8041
-
Commit 6b8a6da
-
Commit e993546
Commits on Nov 4, 2021
-
Commit 556a790
-
Commit d20ac3c
-
Commit 6b1d5f4
-
Commit 6254816
-
Commit 0908af2
-
Commit 73fdaf2
Commits on Nov 5, 2021
-
Commit bcc92c0
-
Commit e6aec26
Commits on Nov 7, 2021
-
feature(nyz): enable arbitrary policy num in serial sample collector and evaluator, add git in docker(smac docker)
Commit 3eb1840
Commits on Nov 8, 2021
-
Commit 751f7df
-
Commit 4cdac39
Commits on Nov 9, 2021
-
Commit 7bfa8d0
-
Commit 9ff01ca
-
Commit 6d53a61
-
Commit 59f4161
-
fix(pu): fix montezuma and pitfall action shape from 6 to 18, and add the max_step in atari_env.py
Commit 0836d66
-
fix(pu): fix montezuma and pitfall action shape from 6 to 18, and add the max_step in atari_env.py
Commit ac2ab00
-
Commit ba12ccb
-
Commit a9ebb72
Commits on Nov 10, 2021
-
Commit 29f8ee1
-
Commit d824002
-
Commit a9f2d4a
-
Commit 0ed9602
-
Commit c9fc015
-
Commit c39d8f1
Commits on Nov 11, 2021
-
Commit 9796669
-
Commit 0fa6090
Commits on Nov 12, 2021
-
Commit bb78b74
-
Commit 2d1743f
Commits on Nov 15, 2021
-
Commit 77284e3
-
Commit c9fd4b4
-
Commit ad94dde
-
Commit b9d0a61
Commits on Nov 16, 2021
-
Commit ac5aceb
-
Commit 040d2c6
-
Commit 03590ab
-
Commit 15901c2
Commits on Nov 17, 2021
-
Merge pull request opendilab#122 from opendilab/dev-torch1.1.0
feature(nyz): extend torch1.1.0 support
Commit de953d7
-
Merge branch 'main' of https://github.com/opendilab/DI-engine into dev-polish-ngu
Conflicts: dizoo/box2d/bipedalwalker/config/bipedalwalker_ppo_config.py
Commit 0268ac0
-
Merge branch 'main' of https://github.com/opendilab/DI-engine into dev-polish-ngu
Conflicts: dizoo/box2d/bipedalwalker/config/bipedalwalker_ppo_config.py
Commit e6425a8
Commits on Nov 18, 2021
-
Commit fe8b9c4
-
Commit 063700b
-
Commit a137483
-
Commit c0eadcf
-
Commit a8be84c
Commits on Nov 19, 2021
-
Commit c4f4999
-
Commit 47b849c
-
feature(lk): add PDQN algorithm for hybrid action spaces (opendilab#118)
* add_pdqn_model * modify_model_structure * initial_version_PDQN * bug_free_PDQN_no_test_convergence * update_pdqn_config * add_noise_to_continuous_args * polish(nyz): polish code style and add noise in pdqn * seperate_dis_and_cont_model * fix_bug_for_separation * fix(pu): current q value use the data action, fix cont loss detach bug, 1 encoder, dist and cont learning rate * polish(pu): actor delay update * fix(pu): fix disc cont update frequency * polish(pu): polish pdqn config * polish(lk): add comments and typelint for pdqn and dqn * feature(lk): add test file for pdqn model and policy * polish(lk): code style * polish(lk): rm the modify of unrelated files * polish(lk): rm useless commentes code in pdqn Co-authored-by: niuyazhe <niuyazhe@sensetime.com> Co-authored-by: puyuan1996 <2402552459@qq.com>
Configuration menu - View commit details
-
Copy full SHA for 71df0d8 - Browse repository at this point
Copy the full SHA 71df0d8View commit details -
polish(davide) add example of GAIL entry + config for Mujoco and Cart…
…pole (opendilab#114) * added gail entry * added lunarlander and cartpole config * added gail mujoco config * added mujoco exp * update22-10 * added third exp * added metric to evaluate policies * added GAIL entry and config for Cartpole and Walker2d * checked style and unittest * restored lunarlander env * style problems * bug correction * Delete expert_data_train.pkl * changed loss of GAIL * Update walker2d_ddpg_gail_config.py * changed gail reward from -D(s, a) to -log(D(s, a)) * added small constant to reward function * added comment to clarify config * Update walker2d_ddpg_gail_config.py * added lunarlander entry + config * Added Atari discriminator + Pong entry config * Update gail_irl_model.py * Update gail_irl_model.py * added gail serial pipeline and onehot actions for gail atari * related to previous commit * removed main files * removed old comment
Configuration menu - View commit details
-
Copy full SHA for bc54d24 - Browse repository at this point
Copy the full SHA bc54d24View commit details -
Configuration menu - View commit details
-
Copy full SHA for f9f92a8 - Browse repository at this point
Copy the full SHA f9f92a8View commit details
Commits on Nov 21, 2021
- Merge branch 'main' of https://github.com/opendilab/DI-engine into dev-polish-ngu
  Conflicts: dizoo/gym_hybrid/config/gym_hybrid_ddpg_config.py
  Commit babf050
- Merge branch 'main' of https://github.com/opendilab/DI-engine into dev-polish-ngu
  Conflicts: dizoo/gym_hybrid/config/gym_hybrid_ddpg_config.py
  Commit a87f4bb
- Commit a97a772
- Commit 883ddcf
Commits on Nov 22, 2021
- Commit a1b0f89
- Commit 4f9cca2
- Commit 0bd47fd
- Commit cb3aa6b
- Commit fe05389
- Commit a63494d
Commits on Mar 17, 2022
- Commit 7de0ce0
- Commit 7141eb2
Commits on Mar 20, 2022
- Commit ba4d45a
- Commit 363003b
- Commit d552a69
- Commit 2bdd053
- Commit 5df07c2
Commits on Mar 21, 2022
- Commit 1ca8b10