Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

polish(pu): polish the nstep_return_ngu and null_padding action in NGU #116

Closed
wants to merge 249 commits into from

Commits on Jul 8, 2021

  1. style(nyz): update badges with actions and issues

    opendilab authored Jul 8, 2021
    Configuration menu
    Copy the full SHA
    90b2797 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e95b01a View commit details
    Browse the repository at this point in the history

Commits on Jul 10, 2021

  1. Configuration menu
    Copy the full SHA
    3b88f70 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    1567939 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    539e8ac View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    cc9f682 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    0ee3636 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    f15d444 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    43fab4e View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    5adf800 View commit details
    Browse the repository at this point in the history
  9. Configuration menu
    Copy the full SHA
    98b938a View commit details
    Browse the repository at this point in the history
  10. create badge.json

    PaParaZz1 authored Jul 10, 2021
    Configuration menu
    Copy the full SHA
    240d013 View commit details
    Browse the repository at this point in the history
  11. delete badges.json

    PaParaZz1 authored Jul 10, 2021
    Configuration menu
    Copy the full SHA
    814ab70 View commit details
    Browse the repository at this point in the history
  12. Configuration menu
    Copy the full SHA
    ce54633 View commit details
    Browse the repository at this point in the history

Commits on Jul 12, 2021

  1. Configuration menu
    Copy the full SHA
    d5fe8a6 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    742d513 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    7f9b8e3 View commit details
    Browse the repository at this point in the history
  4. Merge pull request opendilab#4 from PaParaZz1/doc/separate

    separate doc from main repo to doc repo
    PaParaZz1 authored Jul 12, 2021
    Configuration menu
    Copy the full SHA
    25e43c2 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    71a7ec5 View commit details
    Browse the repository at this point in the history
  6. adapt unittest to github actions (opendilab#6)

    * test(nyz): comment subprocess env manager and parallel entry unittest
    
    * fix(nyz): try to fix test_ppg and flask_fs_collector test close
    
    * test(nyz): modify unittest worker number
    
    * test(nyz) fix unittest worker and ignore 1v1 collector test
    
    * test(nyz): test different range for unittest(env, rl_utils, entry, interaction)
    
    * test(nyz): test different range for unittest(env, rl_utils, entry, interaction, league, model) and add execution timeout
    
    * test(nyz): test different range for unittest(env, rl_utils, entry, interaction, league, model)
    
    * test(nyz): test different range for unittest(env, rl_utils, entry, interaction, league, model, torch_utils)
    
    * test(nyz): fix test DingEnvWrapper unittest env bug
    
    * test(nyz): add utils unittest and disable dataloader unittest
    
    * test(nyz): simplify reward model unittest
    
    * test(nyz): enable all the unittest except dataloader
    
    * test(nyz): enable parallel entry and dataloader unittest
    
    * test(nyz): fix test ppg rerun bug
    
    * test(nyz): enable windows test
    
    * test(nyz): disable subprocess env manager unittest
    
    * test(nyz): fix test auto checkpoint bug
    
    * test(nyz): disable test dataloader
    
    * test(nyz): enable subprocess env manager unittest
    
    * test(nyz): update coveragerc
    
    * test(nyz): add coverage upload workflow
    
    * test(nyz): disable test_block in subprocess env manager
    
    * test(nyz): enable rerun in test demo buffer
    PaParaZz1 authored Jul 12, 2021
    Configuration menu
    Copy the full SHA
    33e5088 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    e41864d View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    e322083 View commit details
    Browse the repository at this point in the history

Commits on Jul 13, 2021

  1. Configuration menu
    Copy the full SHA
    df81bee View commit details
    Browse the repository at this point in the history
  2. badge(hansbug): add LoC and Documentation Percentage badge to README.…

    …md && update code coverage badge (opendilab#8)
    HansBug authored Jul 13, 2021
    Configuration menu
    Copy the full SHA
    f92680a View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    966a6d7 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    7accb4e View commit details
    Browse the repository at this point in the history

Commits on Jul 14, 2021

  1. Configuration menu
    Copy the full SHA
    557a44c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    582f707 View commit details
    Browse the repository at this point in the history

Commits on Jul 16, 2021

  1. polish(nyz): codestyle optimization by lgtm (opendilab#7)

    * refactor(nyz): refactor read_config to 3 different function interface
    
    * feature(nyz): enable env_setting param in entry
    
    * polish(nyz): remove redundant code and global declaration
    
    * polish(nyz): remove flag in import_helper
    
    * polish(nyz): remove unused import
    
    * style(nyz): correct format
    PaParaZz1 authored Jul 16, 2021
    Configuration menu
    Copy the full SHA
    e02f328 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    1849da7 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    4379a8c View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    f3d3c31 View commit details
    Browse the repository at this point in the history
  5. Merge pull request opendilab#10 from opendilab/dev-output-structure

    refactor output structure (opendilab#10)
    PaParaZz1 authored Jul 16, 2021
    Configuration menu
    Copy the full SHA
    fad58a3 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    f6c7c01 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    a59907a View commit details
    Browse the repository at this point in the history

Commits on Jul 19, 2021

  1. Configuration menu
    Copy the full SHA
    58a25cf View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    436f5ac View commit details
    Browse the repository at this point in the history

Commits on Jul 21, 2021

  1. Configuration menu
    Copy the full SHA
    0af6686 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b0aa5dc View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    5b1aae8 View commit details
    Browse the repository at this point in the history

Commits on Jul 22, 2021

  1. Configuration menu
    Copy the full SHA
    1bd8dd2 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    f55ec45 View commit details
    Browse the repository at this point in the history

Commits on Jul 23, 2021

  1. Configuration menu
    Copy the full SHA
    1e00390 View commit details
    Browse the repository at this point in the history
  2. Merge branch 'main' of https://github.com/opendilab/DI-engine into de…

    …v-on-policy
    
    # Conflicts:
    #	ding/policy/common_utils.py
    #	ding/policy/ppo.py
    YinminZhang committed Jul 23, 2021
    Configuration menu
    Copy the full SHA
    494b397 View commit details
    Browse the repository at this point in the history

Commits on Jul 29, 2021

  1. Configuration menu
    Copy the full SHA
    55a3dd1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a7b4126 View commit details
    Browse the repository at this point in the history
  3. add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala conf…

    …ig. (opendilab#11)
    
    * add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config.
    
    * modified as reviewer mentions
    
    * add qacd network
    
    * fix bugs
    
    * update dizoo readme
    
    * add README.md about max reward result
    
    * update dqn config and update README
    
    Co-authored-by: shenziju <simonshen2011@foxmail.com>
    Co-authored-by: simon shen <simon@simondeMacBook-Air.local>
    Co-authored-by: Swain <niuyazhe314@outlook.com>
    4 people authored Jul 29, 2021
    Configuration menu
    Copy the full SHA
    c231322 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    bba994d View commit details
    Browse the repository at this point in the history

Commits on Aug 1, 2021

  1. add pybullet env (opendilab#16)

    * add pybullet envs.
    
    * add td3/ddpg/sac/ppo configs for pybullet.
    
    * update td3/ddpg/sac/ppo configs for pybullet.
    
    * update td3 configs; remove td3 model.
    YinminZhang authored Aug 1, 2021
    Configuration menu
    Copy the full SHA
    0796249 View commit details
    Browse the repository at this point in the history
  2. add ACER algorithm(szj) (opendilab#14)

    * add endoro env config. add enduro's ppo,dqn,drdqn,rainbow,impala config.
    
    * modified as reviewer mentions
    
    * add qacd network
    
    * fix bugs
    
    * fix bugs
    
    * update acer algorithm
    
    * update ACER code
    
    * update acer config
    
    * fix bug
    
    * update pong acer's config
    
    * edit commit
    
    * update code as mention
    
    * fix the comment table and trust region
    
    * fix format
    
    * fix typing lint
    
    * fix format,flake8
    
    * fix format
    
    * fix whitespace problem
    
    * test(nyz): add acer unittest and algotest
    
    * style(nyz): correct flake8 style
    
    Co-authored-by: shenziju <simonshen2011@foxmail.com>
    Co-authored-by: Swain <niuyazhe314@outlook.com>
    3 people authored Aug 1, 2021
    Configuration menu
    Copy the full SHA
    cc4accd View commit details
    Browse the repository at this point in the history

Commits on Aug 2, 2021

  1. serial training league demo (opendilab#12)

    * feature(nyz): add naive 1v1 two player demo
    
    * feature(nyz): add 1v1 evaluator and 2 rule-based policy for evaluation
    
    * feature(nyz): modify game env and adjust hyper-param
    
    * feature(nyz): add naive league training multi player demo
    
    * feature(nyz): enable force snapshot to support init historical league player; finish league demo basic code
    
    * feature(nyz): modify selfplay demo and add two type game env
    
    * style(nyz): correct format style
    
    * polish(nyz): correct format style and adapt league demo main
    
    * feature(nyz): add league payoff viz and enable payoff update in league demo
    
    * feature(nyz): modify win rate calculation with draws
    
    * test(nyz): fix one vs one league test compatibility bug
    
    * test(nyz): add selfplay and league demo into unittest and algotest
    
    * style(nyz): correct format
    
    * hotfix(nyz): fix ppo continuous comatibility bug
    PaParaZz1 authored Aug 2, 2021
    Configuration menu
    Copy the full SHA
    c24e7a5 View commit details
    Browse the repository at this point in the history
  2. v0.1.1

    PaParaZz1 committed Aug 2, 2021
    Configuration menu
    Copy the full SHA
    e67ad91 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    a8bfc44 View commit details
    Browse the repository at this point in the history

Commits on Aug 3, 2021

  1. Configuration menu
    Copy the full SHA
    cd71492 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e44f858 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    24e9a57 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    b9e3b7f View commit details
    Browse the repository at this point in the history

Commits on Aug 6, 2021

  1. Configuration menu
    Copy the full SHA
    82380f9 View commit details
    Browse the repository at this point in the history

Commits on Aug 10, 2021

  1. add overcooked environment (opendilab#20)

    * init runable ppo
    
    * init overcooked env
    
    * overcooked ppo in place
    
    * runable ppo with shaped rewards
    
    * modified config
    
    * feature(nyz): modify win rate calculation with draws
    
    * remove redundant code, modified baseline model
    
    * Update __init__.py
    
    * Update config.py
    
    * modify temp_config_file.close() position in config.py to work in windows os
    
    * remove redundant comments and rename files
    
    * fix name bug and use namedlist
    
    * add simple readme and remove redundant comments from copies
    
    * resolve threads
    
    * remove debug comments
    
    Co-authored-by: niuyazhe <niuyazhe314@outlook.com>
    garyzhang99 and PaParaZz1 authored Aug 10, 2021
    Configuration menu
    Copy the full SHA
    2df687f View commit details
    Browse the repository at this point in the history

Commits on Aug 11, 2021

  1. Configuration menu
    Copy the full SHA
    243edf1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    3a2a637 View commit details
    Browse the repository at this point in the history

Commits on Aug 13, 2021

  1. fix ACER's bug. update Qbert and space invader's config and result (o…

    …pendilab#21)
    
    * fix weight bug
    
    * update acer qbert result
    
    * fix flake8 format problem
    
    * update space qbert config
    
    * update as review
    
    Co-authored-by: shenziju <simonshen2011@foxmail.com>
    simonat2011 and shenziju authored Aug 13, 2021
    Configuration menu
    Copy the full SHA
    a69617e View commit details
    Browse the repository at this point in the history

Commits on Aug 18, 2021

  1. Configuration menu
    Copy the full SHA
    5e47a9a View commit details
    Browse the repository at this point in the history

Commits on Aug 19, 2021

  1. Configuration menu
    Copy the full SHA
    dc7b778 View commit details
    Browse the repository at this point in the history
  2. SQIL (opendilab#25)

    * add sqil
    
    * conceal all the personal info
    
    * revise according to the comments
    
    * correct_format
    
    * add_comment to hardcodes part
    
    * pass flake8
    
    * add force_reproducibility = True; device, ex_model
    
    * check format
    Will-Nie authored Aug 19, 2021
    Configuration menu
    Copy the full SHA
    1c18bcd View commit details
    Browse the repository at this point in the history

Commits on Aug 23, 2021

  1. Dev modified predator prey (opendilab#30)

    * add modifiled predator_prey env
    
    * add collision_ratio
    
    * add readme and cfg for modified_predator_prey env
    
    * add readme imgs for modified_predator_prey
    
    * check format
    
    * fix format
    yifan123 authored Aug 23, 2021
    Configuration menu
    Copy the full SHA
    528e466 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    a8cb80c View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    c52b75d View commit details
    Browse the repository at this point in the history

Commits on Aug 24, 2021

  1. feature(ljw): add/delete/restart replicas via cli for k8s

    lijianwen authored and PaParaZz1 committed Aug 24, 2021
    Configuration menu
    Copy the full SHA
    7d0865c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6678605 View commit details
    Browse the repository at this point in the history
  3. test(nyz): add sqil unittest and algotest, remove adder comment in po…

    …licy, polish sqil config
    PaParaZz1 committed Aug 24, 2021
    Configuration menu
    Copy the full SHA
    2a80096 View commit details
    Browse the repository at this point in the history

Commits on Aug 25, 2021

  1. Configuration menu
    Copy the full SHA
    98f1ad1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    8cb22d6 View commit details
    Browse the repository at this point in the history

Commits on Aug 26, 2021

  1. Configuration menu
    Copy the full SHA
    7dea20f View commit details
    Browse the repository at this point in the history

Commits on Aug 27, 2021

  1. Configuration menu
    Copy the full SHA
    0165498 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    731e80a View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    adc6e6e View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    bf3f04f View commit details
    Browse the repository at this point in the history

Commits on Aug 31, 2021

  1. Configuration menu
    Copy the full SHA
    92ca261 View commit details
    Browse the repository at this point in the history

Commits on Sep 2, 2021

  1. Configuration menu
    Copy the full SHA
    8eed6a9 View commit details
    Browse the repository at this point in the history
  2. League Metric (opendilab#22)

    * feature(nyz): add trueskill as league metric, naive elo calculator, fix game_env info bug
    
    * fix(nyz): fix league player mutate bug
    
    * fix(nyz): fix league unittest bug
    
    * feature(nyz): add elo ranking in league metric env
    
    * polish(nyz): modify fixed eval policy and trueskill init
    
    * feature(nyz): add init main player in evaluation and fix stop_value bug
    
    * style(nyz): rename test_league_metric to avoid pyc cache bug
    PaParaZz1 authored Sep 2, 2021
    Configuration menu
    Copy the full SHA
    2ea7212 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    bb7f651 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    e5ba933 View commit details
    Browse the repository at this point in the history

Commits on Sep 3, 2021

  1. Configuration menu
    Copy the full SHA
    ee6812d View commit details
    Browse the repository at this point in the history
  2. feature(zlx): add tb in naive buffer; modify tb in advanced buffer (o…

    …pendilab#39)
    
    * feature(zlx): Add tb in naive buffer; modify tb in advanced buffer
    
    * feature(zlx): naive_buffer tb, fix bug in valid_count update
    LuciusMos authored Sep 3, 2021
    Configuration menu
    Copy the full SHA
    1174cd7 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    4543b33 View commit details
    Browse the repository at this point in the history

Commits on Sep 6, 2021

  1. fix(pu): fix r2d2 bug (opendilab#36)

    * test rnd
    
    * fix mz config
    
    * fix config
    
    * fix(pu): fix r2d2
    
    * feature(puyuan): add minigrid r2d2 config
    
    * polish minigrid config
    
    * modified as review
    
    * fix(pu): fix bugffor compatibility
    
    * polish(pu): add annotations and polish slice operation
    
    * style(pu): run format.sh
    
    * style(pu): correct yapf format
    puyuan1996 authored Sep 6, 2021
    Configuration menu
    Copy the full SHA
    babc6bd View commit details
    Browse the repository at this point in the history
  2. enable user to use any expert model for sqil(opendilab#44)

    * enable user to use any model generated here
    
    * delete irelevant package
    
    * add test
    
    * bash format.sh to reformat style
    Will-Nie authored Sep 6, 2021
    Configuration menu
    Copy the full SHA
    19164ac View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    9614088 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    ef39a9c View commit details
    Browse the repository at this point in the history
  5. feature(zym): add offlineRL algo CQL; add offlineRL env D4RL (opendil…

    …ab#37)
    
    * feature(zym): add pybullet env info; add entropy type in sac.
    
    * feature(zym): add cql; add serial entry for offlineRL.
    
    * feature/polish(zym): add generation entry in mujoco env for offlineRL; polish cql/serial entry for offlineRL.
    
    * feature(lj): add d4rl env for offlineRL.
    
    * polish(zym): polish cql.
    
    * feature/polish(zym): add dataset registry; polish offlineRL pipeline.
    
    * fix(zym): fix bug in d4rl/mujoco config; fix bug in dataset for offlineRL.
    
    * style(zym): add pybulletgym and d4rl requirements in setup.
    
    * fix/polish(zym): support str in NaiveRLDataset; polish cql.
    
    * polish(zym): polish command policy.
    
    * feature(zym): add cql in pendulum env; add unittest/algotest for cql.
    
    * fix(zym): fix cql bug in unittest/algotest for cql.
    YinminZhang authored Sep 6, 2021
    Configuration menu
    Copy the full SHA
    2b52cc4 View commit details
    Browse the repository at this point in the history

Commits on Sep 7, 2021

  1. Configuration menu
    Copy the full SHA
    f4c920b View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    469307f View commit details
    Browse the repository at this point in the history

Commits on Sep 8, 2021

  1. style(wyh): add env information in readme (opendilab#46)

    * env-list
    
    * env-list-fix-grammmer
    
    * env-only-test
    
    * modify-gif
    
    * modify-gif-pendulum
    
    * modify-gif-delect-maze
    Weiyuhong-1998 authored Sep 8, 2021
    Configuration menu
    Copy the full SHA
    df65cb3 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b79e456 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    f23141f View commit details
    Browse the repository at this point in the history
  4. feature(nyz): add supervised learning image classification training d…

    …emo (opendilab#27)
    
    * feature(nyz): add resnet for cv sl task
    
    * feature(nyz): add imagenet classification dataset and adapt compile config for sl
    
    * feature(nyz): add naive image training entry demo
    
    * style(nyz): polish image cls train log
    
    * polish(nyz): polish multi gpu training setting
    
    * feature(nyz): add nn training bp and update async execution
    
    * feature(nyz): add distributed sampler for different dist backend
    
    * fix(nyz): fix compile config collector and buffer compatibility problem
    
    * style(nyz): correct yapf format
    
    * fix(nyz): fix env manager compile config compatibility bug
    
    * refactor(nyz): abstarct ISerialEvaluator and rename serial evaluation implementation
    
    * refactor(nyz): refactor collector name
    
    * feature(nyz): add metric evaluator and image cls acc metric eval demo
    
    * fix(nyz): fix cuda and multi gpu bug in image cls demo
    PaParaZz1 authored Sep 8, 2021
    Configuration menu
    Copy the full SHA
    a70d639 View commit details
    Browse the repository at this point in the history

Commits on Sep 9, 2021

  1. Configuration menu
    Copy the full SHA
    9645549 View commit details
    Browse the repository at this point in the history
  2. feature(lqp): add k8s launcher and di-orchestrator launcher (opendila…

    …b#45)
    
    * feat: add k8s launcher
    
    * feat: install kubectl when install k3d
    
    * feat: add orchestrator launcher and a test case
    
    * ci: install kubernetes related package and cli
    
    * style: format code
    
    * style: flake check code
    
    * test k8s launcher
    
    * ci: change back to unit test
    
    * feat: delete cert manager when delete orchestrator
    
    * style: flake8 check
    
    * feat: merge k8s-launcher with k8s-helper
    
    1. merge k8s-launcher with k8s-helper
    2. move kubernetes package import to where it will be used
    3. hack/install-k8s-tools.sh -> ding/scripts/install-k8s-tools.sh
    konnase authored Sep 9, 2021
    Configuration menu
    Copy the full SHA
    1248c70 View commit details
    Browse the repository at this point in the history

Commits on Sep 11, 2021

  1. Configuration menu
    Copy the full SHA
    b34e0c6 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2bf67b3 View commit details
    Browse the repository at this point in the history

Commits on Sep 13, 2021

  1. feature(lqp): add cartpole dqn algorithm test running in k8s cluster (o…

    …pendilab#49)
    
    * test dijob
    
    * test: wait for dijob Succeeded phase, and read coordinator logs
    
    * test: update wait condition
    
    * ci: update algo_test.yaml and flake check
    
    * test: move kubernetes package to where it will be used
    konnase authored Sep 13, 2021
    Configuration menu
    Copy the full SHA
    8515323 View commit details
    Browse the repository at this point in the history
  2. fix(wyh): mappo nan bug and dict obs cannot unsqueeze bug (opendilab#54)

    * fix_mappo_bug_masknan_and_dict_cannot_unsqueeze
    
    * squeeze_bug
    Weiyuhong-1998 authored Sep 13, 2021
    Configuration menu
    Copy the full SHA
    f0a1522 View commit details
    Browse the repository at this point in the history
  3. feature(lk): add hyper-parameter scheduler module (opendilab#38)

    * feature(nyz): add trueskill as league metric, naive elo calculator, fix game_env info bug
    
    * fix(nyz): fix league player mutate bug
    
    * fix(nyz): fix league unittest bug
    
    * feature(nyz): add elo ranking in league metric env
    
    * polish(nyz): modify fixed eval policy and trueskill init
    
    * add_scheduler_module
    
    * fix_change_range_and_factor
    
    * cooldown_counter_bug_fix
    
    * add_div_mode
    
    * code_format_fixed
    
    * fix_pr_bug
    
    * add_unnitest_module
    
    * add_patience_test
    
    * polish(nyz): polish scheduler design and fix league mode scheduler bug
    
    * fix(nyz): fix merge test_metric.py bug
    
    Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
    Co-authored-by: 李可 <like2@CN0014008466M.local>
    3 people authored Sep 13, 2021
    Configuration menu
    Copy the full SHA
    f458623 View commit details
    Browse the repository at this point in the history
  4. fix(wyh):formatted config no eval bug (opendilab#53)

    * fix_formatted_config_bug_eval
    
    * fix(wyh):add config pytest
    Weiyuhong-1998 authored Sep 13, 2021
    Configuration menu
    Copy the full SHA
    219f78d View commit details
    Browse the repository at this point in the history

Commits on Sep 14, 2021

  1. Configuration menu
    Copy the full SHA
    429479e View commit details
    Browse the repository at this point in the history

Commits on Sep 17, 2021

  1. Configuration menu
    Copy the full SHA
    cabd936 View commit details
    Browse the repository at this point in the history
  2. fix(pu): fix r2d2 done slice bug and LSTM hidden state reset bug (ope…

    …ndilab#52)
    
    * test rnd
    
    * fix mz config
    
    * fix config
    
    * fix(pu): fix r2d2
    
    * fix(puyuan): fix r2d2
    
    * feature(puyuan): add minigrid r2d2 config
    
    * polish minigrid config
    
    * modified as review
    
    * fix(pu): fix bugffor compatibility
    
    * polish(pu): add annotations and polish slice operation
    
    * style(pu): run format.sh
    
    * style(pu): correct yapf format
    
    * fix(pu): fix config
    
    * fix(pu): fix done slice bug and lstm reset bug
    
    * style(pu): format config
    
    * polish(pu): polish config params for cartpole, lunarlander and minigrid
    
    * polish(pu): polish minigrid config params
    
    * Update r2d2.py
    
    * polish(pu): polish rnn reset problem
    
    * fix(pu): fix merge error
    
    * polish(pu): polish cartpole config
    
    * polish(nyz): polish cartpole r2d2 config for faster convergence
    
    * test(nyz): enable r2d2 algotest
    
    Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
    puyuan1996 and PaParaZz1 authored Sep 17, 2021
    Configuration menu
    Copy the full SHA
    272ba12 View commit details
    Browse the repository at this point in the history
  3. feature(crb): update multi discrete policy(dqn, ppo, rainbow) (opendi…

    …lab#51)
    
    * update md_dqn
    
    * update offpolicy ppo
    
    * add rainbow md policy
    
    * format code
    
    * del ppo; leave to future updates
    
    * add doc string; fix rainbow returns
    RobinC94 authored Sep 17, 2021
    Configuration menu
    Copy the full SHA
    71d6738 View commit details
    Browse the repository at this point in the history
  4. feature(davide): add BSuite environment wrapper (opendilab#58)

    * start implementing bsuite env
    
    * add bsuite env
    
    * Implemented
    
    * removed unused file
    
    * added cartpole_swing environment
    
    * Update test_bsuite_env.py
    
    * added env in readme and in setup.py
    
    * Create bsuite.png
    davide97l authored Sep 17, 2021
    Configuration menu
    Copy the full SHA
    438c622 View commit details
    Browse the repository at this point in the history

Commits on Sep 22, 2021

  1. Configuration menu
    Copy the full SHA
    7c88b7e View commit details
    Browse the repository at this point in the history
  2. fix(wyh): add plot function (opendilab#59)

    * fix(wyh): plot function
    
    * fix(wyh): plot function pytest
    
    * fix(wyh):plot function modify comments
    
    * feature(wyh):plot style
    Co-authored-by: weiyuhong <weiyuhong@sensetime.com>
    Weiyuhong-1998 authored Sep 22, 2021
    Configuration menu
    Copy the full SHA
    7a94c0d View commit details
    Browse the repository at this point in the history
  3. feature(wyh):mappo and ippo win rate and time (opendilab#62)

    * feature(wyh):mappo and ippo win rate and time
    
    * feature(wyh):mappo and ippo epymarl win rate and time
    
    * feature(wyh):smac epymarl commit id
    
    Co-authored-by: 卫昱宏 <SENSETIME\weiyuhong@cn0214000504l.domain.sensetime.com>
    Weiyuhong-1998 and 卫昱宏 authored Sep 22, 2021
    Configuration menu
    Copy the full SHA
    11124de View commit details
    Browse the repository at this point in the history

Commits on Sep 23, 2021

  1. Configuration menu
    Copy the full SHA
    092a9e9 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    7c51151 View commit details
    Browse the repository at this point in the history

Commits on Sep 24, 2021

  1. Configuration menu
    Copy the full SHA
    b5a9a6e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2ae40db View commit details
    Browse the repository at this point in the history

Commits on Sep 26, 2021

  1. Configuration menu
    Copy the full SHA
    9edc924 View commit details
    Browse the repository at this point in the history
  2. fix(hansbug): fix spawn context problem in interaction unittest (open…

    …dilab#69)
    
    * fix(hansbug): try support spawn backend
    
    * fix(hansbug): try fix the xxxxing problem in interaction spawn support && reformat the code style
    
    * fix(nyz): disable silence decorator for spawn context interaction test
    
    Co-authored-by: HansBug <killog@126.com>
    PaParaZz1 and HansBug authored Sep 26, 2021
    Configuration menu
    Copy the full SHA
    9918b6d View commit details
    Browse the repository at this point in the history
  3. style(nyz): add docker deploy in github workflow (opendilab#70)

    * style(nyz): add docker deploy workflow(enable docker)
    
    * style(nyz): fix docker push info(enable docker)
    
    * style(nyz): modify org name and image default name rule(enable docker)
    
    * style(nyz): change default version to date(enable docker)
    PaParaZz1 authored Sep 26, 2021
    Configuration menu
    Copy the full SHA
    132670e View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    7ea4005 View commit details
    Browse the repository at this point in the history
  5. feature(crb): add multi-discrete ppo and off policy ppo (opendilab#72)

    * add md ppo
    
    * add doc string
    RobinC94 authored Sep 26, 2021
    Configuration menu
    Copy the full SHA
    f560c2c View commit details
    Browse the repository at this point in the history

Commits on Sep 28, 2021

  1. feature(nyz): move atari_py to ale-py; split base and env docker build (

    opendilab#77)
    
    * feature(nyz): move atari_py to ale-py and polish standard docker build(enable docker)
    
    * fix(nyz): fix atari env import bug(enable docker)
    
    * feature(nyz): add autorom install in docker(enable docker)
    
    * feature(nyz): split base and env docker build(enable docker)
    
    * fix(nyz): fix docker env source image bug(enable docker)
    PaParaZz1 authored Sep 28, 2021
    Configuration menu
    Copy the full SHA
    d731276 View commit details
    Browse the repository at this point in the history
  2. feature(pu): add WQMIX algorithm (opendilab#24)

    * add wqmix
    
    * update annotation
    
    * reformate
    
    * update annotation
    
    * update config
    
    * fix annotation
    
    * update as review
    
    * fix as review
    
    * add 5m6m MMM MMM2 config
    
    * reformate
    
    * fix(pu): fix rnn reset bug and add unittest
    
    * fix(pu): fix rnn reset bug in centrally-weighted wqmix
    
    * style(pu): yapf format and let WQMIXPolicy extend QMIXPolicy
    
    * fix(pu): fix wqmix policy extend bug
    
    * test(pu): add unittest test_wqmix
    
    * fix(pu): fix mixer key bug in particle config
    
    * feature(pu): add cooperative_navigation_wqmix_config
    
    * style(pu): yapf format
    
    * test(pu): change nn.Identity() to nn.Sequential()
    
    * fix(pu): fix unittest bug in test_wqmix
    puyuan1996 authored Sep 28, 2021
    Configuration menu
    Copy the full SHA
    fd8f7f2 View commit details
    Browse the repository at this point in the history

Commits on Sep 29, 2021

  1. feature(nyz): add mujoco docker (opendilab#78)

    * feature(nyz): add docker_mujoco build and upgrada numpy version to 1.20.0(enable docker)
    
    * fix(nyz): fix numpy version compatibility bug and add -y option in apt-get(enable docker)
    
    * fix(nyz): add libosmesa6-dev in Dockerfile.env(enable docker)
    
    * fix(nyz): add permanent env variable about mujoco(enable docker)
    
    * fix(nyz): change sh source to .(enable docker)
    
    * fix(nyz): set env variable in bashrc(enable docker)
    
    * fix(nyz): fix pip typo(enable docker)
    
    * fix(nyz): add env in dockerfile(enable docker)
    PaParaZz1 authored Sep 29, 2021
    Configuration menu
    Copy the full SHA
    a60cce5 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9651471 View commit details
    Browse the repository at this point in the history
  3. feature(nyz): add smac docker (opendilab#80)

    * style(nyz): add ctools.pysc2 import in smac env
    
    * feature(nyz): add smac docker build(enable docker, smac docker)
    
    * fix(nyz): fix if condition syntax in deploy(enable docker, smac docker)
    
    * fix(nyz): fix if condition syntax in deploy(enable docker, smac docker)
    
    * fix(nyz): remove cache layer in smac docker(enable docker, smac docker)
    
    * feature(nyz): use self-hosted runner in docker smac deploy(enable docker, smac docker)
    
    * feature(nyz): build smac docker manually(enable docker, smac docker)
    
    * feature(nyz): use docker buildx as default tool in smac and add SC2Map in setup(enable docker, smac docker)
    
    * feature(nyz): add __init__.py in smac env maps(enable docker, smac docker)
    PaParaZz1 authored Sep 29, 2021
    Configuration menu
    Copy the full SHA
    4d024b8 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    d7c2f88 View commit details
    Browse the repository at this point in the history

Commits on Sep 30, 2021

  1. Configuration menu
    Copy the full SHA
    a0ebd94 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    673188f View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    f305978 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    5fb97e5 View commit details
    Browse the repository at this point in the history
  5. feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for o…

    …fflineRL. (opendilab#68)
    
    * feature(zym): add offlineRL algo Discrete CQL.
    
    * feature(zym): add offlineRL algo Discrete CQL; add hdf5 dataset for offlineRL.
    YinminZhang authored Sep 30, 2021
    Configuration menu
    Copy the full SHA
    76240d6 View commit details
    Browse the repository at this point in the history
  6. feature(davide): Implementation of D4PG (opendilab#76)

    * added experience replay and n-step
    
    * implementing distributional q value
    
    * added distributional q-value
    
    * added overview in qac_dist and d4pg
    
    * derived D4PG from DDPG
    
    * fixed a bug when action shape >1
    
    * benchmark D4PG mujoco + minor fixs
    
    -entry for DDPG mujoco
    -entry for D4PG mujoco
    -config for D4PG mujoco
    -fixed style D4PG code
    -unittests for QAC distributional
    
    * formatted code
    
    * minor updates (read description)
    
    -added d4pg seria_entry test
    -updated comments in QACDIST
    -added d4pg in commander register
    -added q_value in d4pg return dict
    -added priority update in d4pg entry
    -added assertion in QACDIST
    davide97l authored Sep 30, 2021
    Configuration menu
    Copy the full SHA
    b9edb7c View commit details
    Browse the repository at this point in the history
  7. v0.2.0

    PaParaZz1 committed Sep 30, 2021
    Configuration menu
    Copy the full SHA
    0779418 View commit details
    Browse the repository at this point in the history

Commits on Oct 1, 2021

  1. Configuration menu
    Copy the full SHA
    0040577 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    43f6319 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    20b1fd6 View commit details
    Browse the repository at this point in the history

Commits on Oct 2, 2021

  1. Configuration menu
    Copy the full SHA
    2950d49 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4ee8e7e View commit details
    Browse the repository at this point in the history

Commits on Oct 8, 2021

  1. feature(zlx): add vs bot training and self-play training with slime v…

    …olley env (opendilab#23)
    
    * slime volley env in dizoo, first commit
    
    * fix bug in slime volley env
    
    * modify volley env to satisfy ding 1v1 requirements; add naive self-play and league training pipeline(evaluator is not finished, now use a very naive one)
    
    * adopt volley builtin ai as default eval opponent
    
    * polish(nyz): polish slime_volley_env and its test
    
    * feature(nyz): add slime_volley vs bot ppo demo
    
    * feature(nyz): add battle_sample_serial_collector and adapt abnormal check in subprocess env manager
    
    * feature(nyz): add slime volley self-play demo
    
    * style(nyz): add slime_volleyball env gif and split MARL and selfplay label
    
    * feature(nyz): add save replay function in slime volleyball env
    
    Co-authored-by: zlx-sensetime <zhaoliangxuan@sensetime.com>
    Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
    3 people authored Oct 8, 2021
    Configuration menu
    Copy the full SHA
    770ca0a View commit details
    Browse the repository at this point in the history

Commits on Oct 9, 2021

  1. Configuration menu
    Copy the full SHA
    e10ad49 View commit details
    Browse the repository at this point in the history

Commits on Oct 12, 2021

  1. feature(nyz): add gym-hybrid hybrid action space env (opendilab#86)

    * feature(nyz): add gym-hybrid hybrid action space env
    
    * style(nyz): update readme for gym_hybrid env
    PaParaZz1 authored Oct 12, 2021
    Configuration menu
    Copy the full SHA
    58326ab View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0a8d288 View commit details
    Browse the repository at this point in the history

Commits on Oct 15, 2021

  1. Configuration menu
    Copy the full SHA
    6c5670c View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b1c7300 View commit details
    Browse the repository at this point in the history
  3. fix(wyh): add test for rl_utils ppo and td (opendilab#89)

    * fix(wyh):test rl_utils code
    
    * fix(wyh):modify rl utils bug ppo adv batch B,A
    
    * fix(wyh):style
    
    * fix(wyh):fix bug
    Weiyuhong-1998 authored Oct 15, 2021
    Configuration menu
    Copy the full SHA
    6d09f79 View commit details
    Browse the repository at this point in the history

Commits on Oct 16, 2021

  1. Configuration menu
    Copy the full SHA
    afe509c View commit details
    Browse the repository at this point in the history
  2. feature(nyp): add DQfD algorithm (opendilab#48)

    * add_dqfd
    
    * Is_expert to is_expert
    
    * modify according to the last commnets
    
    * value_gamma; done; marginloss; sqil compatibility
    
    * finally shorten the code, revise config
    
    * revise config, style
    
    * add_readme/two_more_config
    
    * correct format
    
    Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
    Will-Nie and PaParaZz1 authored Oct 16, 2021
    Configuration menu
    Copy the full SHA
    f6aad4c View commit details
    Browse the repository at this point in the history
  3. fix(wyh): add model test and policy/entry test and remove unused qacd(o…

    …pendilab#92)
    
    * fix(wyh):model test and policy/entry test
    
    * fix(wyh):delect qacd
    
    * fix(wyh):test serial entry onpolicy
    Weiyuhong-1998 authored Oct 16, 2021
    Configuration menu
    Copy the full SHA
    e2c91ba View commit details
    Browse the repository at this point in the history

Commits on Oct 17, 2021

  1. Configuration menu
    Copy the full SHA
    5b95bb2 View commit details
    Browse the repository at this point in the history

Commits on Oct 19, 2021

  1. Configuration menu
    Copy the full SHA
    842f9c2 View commit details
    Browse the repository at this point in the history

Commits on Oct 20, 2021

  1. Configuration menu
    Copy the full SHA
    5999dfc View commit details
    Browse the repository at this point in the history

Commits on Oct 21, 2021

  1. Configuration menu
    Copy the full SHA
    f4cfeae View commit details
    Browse the repository at this point in the history
  2. feature(lk): add gym-soccer (HFO) env (opendilab#94)

    * add_soccer_env
    
    * add_info
    
    * close
    
    * format
    
    * test_gym_soccer
    
    * rm_torch
    
    * replay_log
    
    * format_style
    
    * add_gym_soccer_to_readme
    
    * separate render_func
    
    * add_gif_file
    
    * scale_action
    
    * flake_style_format
    
    * resolve_review_comments
    
    * add branch info for gym hybrid
    KeLiChloe authored Oct 21, 2021
    Configuration menu
    Copy the full SHA
    094ede9 View commit details
    Browse the repository at this point in the history
  3. feature(xjx): test in pure docker environment (opendilab#103)

    * Test in docker
    
    * Add docker test entry
    
    * Trap exit
    
    * Test in docker
    sailxjx authored Oct 21, 2021
    Configuration menu
    Copy the full SHA
    e29627b View commit details
    Browse the repository at this point in the history

Commits on Oct 22, 2021

  1. Configuration menu
    Copy the full SHA
    a6c8662 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    b48582c View commit details
    Browse the repository at this point in the history
  3. polish(nyz): fix ppo bugs and update atari ppo offpolicy config (open…

    …dilab#108)
    
    * fix(nyz): fix ppo cuda bug and random collect bug
    
    * config(nyz): add pong ppo off policy better config
    
    * fix(nyz): fix ppo device bug in get_train_sample and update ppo offpolicy config
    
    * style(nyz): correct yapf format
    PaParaZz1 authored Oct 22, 2021
    Configuration menu
    Copy the full SHA
    9e623c1 View commit details
    Browse the repository at this point in the history
  4. feature(zym): add offlineRL algo td3_bc and polish policy comments(op…

    …endilab#88)
    
    * feature(zym): add offlineRL algo td3_bc.
    
    * feature(zym): add offlineRL algo td3_bc.
    
    * feature(zym): add offlineRL algo td3_bc.
    
    * polish(zym): polish some annotations in td3/ddpg/sac/ppo; polish `_forward_collect` and `_foward_eval`.
    
    * fix(lj): fix dimension bug in cql for continuous env.
    
    * fix(zym): fix dimension bug in cql for continuous env.
    
    * fix(zym): fix dimension bug in cql for continuous env.
    
    * polish(zym): update README.md.
    YinminZhang authored Oct 22, 2021
    Configuration menu
    Copy the full SHA
    30b8f79 View commit details
    Browse the repository at this point in the history

Commits on Oct 25, 2021

  1. test(wyh): add more unittest for ppo and sac policy (opendilab#104)

    * fix(wyh):reward model test
    
    * fix(wyh):sac ppo test
    
    * fix(wyh):ppo_continuous test
    
    * fix(wyh):style
    
    * fix(wyh):ppo test
    
    Co-authored-by: Swain <niuyazhe314@outlook.com>
    Weiyuhong-1998 and PaParaZz1 authored Oct 25, 2021
    Configuration menu
    Copy the full SHA
    3c96dce View commit details
    Browse the repository at this point in the history

Commits on Oct 26, 2021

  1. test(yzj): add unittest for dataset, metric_serial_evaluator and lear…

    …ner (opendilab#107)
    
    * add 4 pytest dataset.py learner_aggregator.py learner_hook.py metric_serial_evaluator.py
    
    * fix yapf and flake8 And remove invalid self._env
    
    * fix fake_cls_config.py flake8
    jayyoung0802 authored Oct 26, 2021
    Configuration menu
    Copy the full SHA
    05fd79e View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ae11f79 View commit details
    Browse the repository at this point in the history

Commits on Oct 28, 2021

  1. feature(nyz): add gobigger baseline (opendilab#95)

    * feature(nyz): add gobigger baseline
    
    * style(nyz): add gobigger env infor
    
    * feature(nyz): add ignore prefix in default collate
    
    * feautre(nyz): add vsbot training baseline
    
    * fix(nyz): fix to_tensor empty list bug and polish gobigger baseline
    
    * style(nyz): split gobigger baseline code
    PaParaZz1 authored Oct 28, 2021
    Configuration menu
    Copy the full SHA
    33ee048 View commit details
    Browse the repository at this point in the history

Commits on Oct 29, 2021

  1. feature(nyz): add PADDPG for hybrid action space as baseline (opendil…

    …ab#109)
    
    * fix(nyz): fix gym_hybrid env not scale action bug
    
    * feature(nyz): add PADDPG basic implementation for hybrid action space
    
    * fix(nyz): fix td3/d4pg comatibility bug with new modifications
    
    * fix(nyz): fix hybrid ddpg action type grad bug and update config
    
    * feature(nyz): add eps greedy + multinomial wrapper and gym_hybrid ddpg convergence config
    
    * style(nyz): update PADDPG in README
    
    * test_model_hybrid_qac
    
    * fix_typo_in_README
    
    * test_policy_hybrid_qac
    
    * polish(nyz): polish hybrid action space to dict structure and polish unittest
    
    * fix(nyz): fix td3bc compatibility bug
    
    Co-authored-by: 李可 <like2@CN0014008466M.local>
    PaParaZz1 and 李可 authored Oct 29, 2021
    Configuration menu
    Copy the full SHA
    82a35d2 View commit details
    Browse the repository at this point in the history
  2. style(nyz): modify doc and deploy trigger and update mujoco license d…

    …ownload link(smac docker)
    PaParaZz1 committed Oct 29, 2021
    Configuration menu
    Copy the full SHA
    7a0a3cc View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    1be01c7 View commit details
    Browse the repository at this point in the history
  4. feature(lcm): add MBPO algorithm (opendilab#113)

    * feature(lcm): add MBPO algorithm (opendilab#87)
    
    * add model-based rl
    
    * fix yazhe's comments
    
    * format
    
    * pass flake8 test
    
    * polish(nyz): polish mbpo import, name and test
    
    Co-authored-by: lichuming <lichuming@lichumingdeMacBook-Pro.local>
    PaParaZz1 and lichuming authored Oct 29, 2021
    Configuration menu
    Copy the full SHA
    121d4eb View commit details
    Browse the repository at this point in the history

Commits on Oct 31, 2021

  1. Configuration menu
    Copy the full SHA
    e42fb78 View commit details
    Browse the repository at this point in the history

Commits on Nov 1, 2021

  1. feature(pu): add NGU algorithm (opendilab#40)

    * test rnd
    
    * fix mz config
    
    * fix config
    
    * feature(pu): fix r2d2, add beta to actor
    
    * feature(pu): add ngu-dev
    
    * fix(pu): fix r2d2
    
    * fix(puyuan): fix r2d2
    
    * feature(puyuan): add minigrid r2d2 config
    
    * polish minigrid config
    
    * dev-ngu
    
    * feature(pu): add action and reward as inputs of q network
    
    * feature(pu): add episodic reward model
    
    * feature(pu): add episodic reward model, modify r2d2 and collector for ngu
    
    * fix(pu): recover files that were changed by mistake
    
    * fix(pu): fix tblogger cnt bug
    
    * add_dqfd
    
    * Is_expert to is_expert
    
    * fix(pu): fix r2d2 bug
    
    * fix(pu): fix beta index to gamma bug
    
    * fix(pu): fix numerical stability problem
    
    * style(pu): flake8 format
    
    * fix(pu): fix rnd reward model train times
    
    * polish(pu): polish r2d2 reset problem
    
    * fix(pu): fix episodic reward normalize bug
    
    * polish(pu): polish config params and episodic_reward init value
    
    * modify according to the last commnets
    
    * value_gamma;done;marginloss;sqil适配
    
    * feature(pu): add r2d3 algorithm and config of lunarlander and pong
    
    * fix(pu): fix demo path bug
    
    * fix(pu): fix cuda bug at function get_gae in adder.py
    
    * feature(pu): add pong r2d2 config
    
    * polish(pu): r2d2 uses the mixture priority, episodic_reward transforms to mean 0 std1
    
    * polish(pu): polish r2d2 config
    
    * test(pu): test cuda compatiality of dqfd_nstep_td_error in r2d3
    
    * polish(pu): polish config
    
    * polish(pu): polish config and annotation
    
    * fix(pu): fix r2d2 target net update bug and done bug
    
    * polish(pu): polish pong r2d2 config and add montezuma r2d2 config
    
    * polish(pu): add some logs for debugging in r2d2
    
    * polish(pu): recover config deleted by mistake
    
    * fix(pu): fix r2d3 config of lunarlander and pong
    
    * fix(pu): fix the r2d2 bug in r2d3
    
    * fix(pu): fix r2d3 cpu device bug in fun dqfd_nstep_td_error of td.py
    
    * fix(pu): fix n_sample bug in serial_entry_r2d3
    
    * polish(pu): polish minigrid r2d2 config
    
    * fix(pu): add info dict of fourrooms doorkey in minigrid_env
    
    * polish(pu): polish r2d2 config
    
    * fix(pu): fix expert policy collect traj bug, now we use the argmax_sample wrapper
    
    * fix(pu): fix r2d2 done and target update bug, polish config
    
    * fix(pu): fix null_padding transition obs to zeros
    
    * fix(pu): episodic_reward transform to [0,1]
    
    * fix(pu): fix the value_gamma bug
    
    * fix(pu): fix device bug in ngu_reward_model.py
    
    * fix(pu): fix null_padding problem in rnd and episodic reward model
    
    * polish(pu): polish config
    
    * fix(pu): use the deepcopy train_data to add bonus reward
    
    * polish(pu): add the operation of enlarging seq_length times to the last reward of the whole episode
    
    * fix(pu): fix the episode length 1 bug and weight intrinsic reward bug
    
    * feature(pu): add montezuma ngu config
    
    * fix(pu): fix lunarlander ngu unroll_len to 998 so that the sequence length is equal to the max step 1000
    
    * test(pu): episodic reward transforms to [0,1]
    
    * fix(pu): fix r2d3 one-step rnn init bug and add r2d2_collect_traj
    
    * fix(pu): fix r2d2_collect_traj.py
    
    * feature(pu): add pong_r2d3_r2d2expert_config
    
    * polish(pu): yapf format
    
    * polish(pu): fix td.py conflict
    
    * polish(pu): flake8 format
    
    * polish(pu): add lambda_one_step_td key in dqfd error
    
    * test(pu): set key lambda_one_step_td and lambda_supervised_loss as 0
    
    * style(pu): yapf format
    
    * style(pu): format
    
    * polish(nyz): fix ngu detailed compatibility error
    
    * fix(nyz): fix dqfd one_step td lambda bug
    
    * fix(pu): fix test_acer and test_rnd compatibility error
    
    Co-authored-by: Swain <niuyazhe314@outlook.com>
    Co-authored-by: Will_Nie <nieyunpengwill@hotmail.com>
    3 people authored Nov 1, 2021
    Configuration menu
    Copy the full SHA
    99cba89 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e611ad2 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e574608 View commit details
    Browse the repository at this point in the history
  4. polish pong ngu config

    puyuan1996 committed Nov 1, 2021
    Configuration menu
    Copy the full SHA
    51643a5 View commit details
    Browse the repository at this point in the history
  5. polish pong ngu config

    puyuan1996 committed Nov 1, 2021
    Configuration menu
    Copy the full SHA
    89c5401 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    7040af3 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    3477b2c View commit details
    Browse the repository at this point in the history

Commits on Nov 2, 2021

  1. Configuration menu
    Copy the full SHA
    b217264 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    ad0c8d0 View commit details
    Browse the repository at this point in the history

Commits on Nov 3, 2021

  1. fix(davide): small fix on bsuite environment (opendilab#117)

    * small fix
    
    * added bsuite env version
    
    * modified test
    davide97l authored Nov 3, 2021
    Configuration menu
    Copy the full SHA
    79a8041 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    6b8a6da View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    e993546 View commit details
    Browse the repository at this point in the history

Commits on Nov 4, 2021

  1. Configuration menu
    Copy the full SHA
    556a790 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d20ac3c View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    6b1d5f4 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    6254816 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    0908af2 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    73fdaf2 View commit details
    Browse the repository at this point in the history

Commits on Nov 5, 2021

  1. Configuration menu
    Copy the full SHA
    bcc92c0 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    e6aec26 View commit details
    Browse the repository at this point in the history

Commits on Nov 7, 2021

  1. feature(nyz): enable arbitrary policy num in serial sample collector …

    …and evaluator, add git in docker(smac docker)
    PaParaZz1 committed Nov 7, 2021
    Configuration menu
    Copy the full SHA
    3eb1840 View commit details
    Browse the repository at this point in the history

Commits on Nov 8, 2021

  1. Configuration menu
    Copy the full SHA
    751f7df View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4cdac39 View commit details
    Browse the repository at this point in the history

Commits on Nov 9, 2021

  1. Configuration menu
    Copy the full SHA
    7bfa8d0 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    9ff01ca View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    6d53a61 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    59f4161 View commit details
    Browse the repository at this point in the history
  5. fix(pu): fix montezuma and pitfall action shape from 6 to 18, and add…

    … the max_step in atari_env.py
    puyuan1996 committed Nov 9, 2021
    Configuration menu
    Copy the full SHA
    0836d66 View commit details
    Browse the repository at this point in the history
  6. fix(pu): fix montezuma and pitfall action shape from 6 to 18, and add…

    … the max_step in atari_env.py
    puyuan1996 committed Nov 9, 2021
    Configuration menu
    Copy the full SHA
    ac2ab00 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    ba12ccb View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    a9ebb72 View commit details
    Browse the repository at this point in the history

Commits on Nov 10, 2021

  1. Configuration menu
    Copy the full SHA
    29f8ee1 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    d824002 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    a9f2d4a View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    0ed9602 View commit details
    Browse the repository at this point in the history
  5. polish(pu):polish config

    puyuan1996 committed Nov 10, 2021
    Configuration menu
    Copy the full SHA
    c9fc015 View commit details
    Browse the repository at this point in the history
  6. polish(pu):polish config

    puyuan1996 committed Nov 10, 2021
    Configuration menu
    Copy the full SHA
    c39d8f1 View commit details
    Browse the repository at this point in the history

Commits on Nov 11, 2021

  1. Configuration menu
    Copy the full SHA
    9796669 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    0fa6090 View commit details
    Browse the repository at this point in the history

Commits on Nov 12, 2021

  1. Configuration menu
    Copy the full SHA
    bb78b74 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    2d1743f View commit details
    Browse the repository at this point in the history

Commits on Nov 15, 2021

  1. polish(pu): polish ngu

    puyuan1996 committed Nov 15, 2021
    Configuration menu
    Copy the full SHA
    77284e3 View commit details
    Browse the repository at this point in the history
  2. polish(pu): polish ngu

    puyuan1996 committed Nov 15, 2021
    Configuration menu
    Copy the full SHA
    c9fd4b4 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    ad94dde View commit details
    Browse the repository at this point in the history
  4. feature(jrn): add the bipedalwalker config of sac and ppo (opendilab#121

    )
    
    * commit bipedalwalkere_ppo_config
    
    * commit bipedalwalker_sac_config
    jiaruonan authored Nov 15, 2021
    Configuration menu
    Copy the full SHA
    b9d0a61 View commit details
    Browse the repository at this point in the history

Commits on Nov 16, 2021

  1. Configuration menu
    Copy the full SHA
    ac5aceb View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    040d2c6 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    03590ab View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    15901c2 View commit details
    Browse the repository at this point in the history

Commits on Nov 17, 2021

  1. Merge pull request opendilab#122 from opendilab/dev-torch1.1.0

    feature(nyz): extend torch1.1.0 support
    sailxjx authored Nov 17, 2021
    Configuration menu
    Copy the full SHA
    de953d7 View commit details
    Browse the repository at this point in the history
  2. Merge branch 'main' of https://github.com/opendilab/DI-engine into de…

    …v-polish-ngu
    
    Conflicts:
    	dizoo/box2d/bipedalwalker/config/bipedalwalker_ppo_config.py
    puyuan1996 committed Nov 17, 2021
    Configuration menu
    Copy the full SHA
    0268ac0 View commit details
    Browse the repository at this point in the history
  3. Merge branch 'main' of https://github.com/opendilab/DI-engine into de…

    …v-polish-ngu
    
    Conflicts:
    	dizoo/box2d/bipedalwalker/config/bipedalwalker_ppo_config.py
    puyuan1996 committed Nov 17, 2021
    Configuration menu
    Copy the full SHA
    e6425a8 View commit details
    Browse the repository at this point in the history

Commits on Nov 18, 2021

  1. Configuration menu
    Copy the full SHA
    fe8b9c4 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    063700b View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    a137483 View commit details
    Browse the repository at this point in the history
  4. polish(yzj): add DataParallel and DataDistributedParallel (opendilab#123

    )
    
    * add spaceinvaders multi gpu
    
    * add dp and ddp
    
    * Update __init__.py
    
    * recover init
    jayyoung0802 authored Nov 18, 2021
    Configuration menu
    Copy the full SHA
    c0eadcf View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    a8be84c View commit details
    Browse the repository at this point in the history

Commits on Nov 19, 2021

  1. Configuration menu
    Copy the full SHA
    c4f4999 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    47b849c View commit details
    Browse the repository at this point in the history
  3. feature(lk): add PDQN algorithm for hybrid action spaces (opendilab#118)

    * add_pdqn_model
    
    * modify_model_structure
    
    * initial_version_PDQN
    
    * bug_free_PDQN_no_test_convergence
    
    * update_pdqn_config
    
    * add_noise_to_continuous_args
    
    * polish(nyz): polish code style and add noise in pdqn
    
    * seperate_dis_and_cont_model
    
    * fix_bug_for_separation
    
    * fix(pu): current q value use the data action, fix cont loss detach bug, 1 encoder, dist and cont learning rate
    
    * polish(pu): actor delay update
    
    * fix(pu): fix disc cont update frequency
    
    * polish(pu): polish pdqn config
    
    * polish(lk): add comments and typelint for pdqn and dqn
    
    * feature(lk): add test file for pdqn model and policy
    
    * polish(lk): code style
    
    * polish(lk): rm the modify of unrelated files
    
    * polish(lk): rm useless commentes code in pdqn
    
    Co-authored-by: niuyazhe <niuyazhe@sensetime.com>
    Co-authored-by: puyuan1996 <2402552459@qq.com>
    3 people authored Nov 19, 2021
    Configuration menu
    Copy the full SHA
    71df0d8 View commit details
    Browse the repository at this point in the history
  4. polish(davide) add example of GAIL entry + config for Mujoco and Cart…

    …pole (opendilab#114)
    
    * added gail entry
    
    * added lunarlander and cartpole config
    
    * added gail mujoco config
    
    * added mujoco exp
    
    * update22-10
    
    * added third exp
    
    * added metric to evaluate policies
    
    * added GAIL entry and config for Cartpole and Walker2d
    
    * checked style and unittest
    
    * restored lunarlander env
    
    * style problems
    
    * bug correction
    
    * Delete expert_data_train.pkl
    
    * changed loss of GAIL
    
    * Update walker2d_ddpg_gail_config.py
    
    * changed gail reward from -D(s, a) to -log(D(s, a))
    
    * added small constant to reward function
    
    * added comment to clarify config
    
    * Update walker2d_ddpg_gail_config.py
    
    * added lunarlander entry + config
    
    * Added Atari discriminator + Pong entry config
    
    * Update gail_irl_model.py
    
    * Update gail_irl_model.py
    
    * added gail serial pipeline and onehot actions for gail atari
    
    * related to previous commit
    
    * removed main files
    
    * removed old comment
    davide97l authored Nov 19, 2021
    Configuration menu
    Copy the full SHA
    bc54d24 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    f9f92a8 View commit details
    Browse the repository at this point in the history

Commits on Nov 21, 2021

  1. Merge branch 'main' of https://github.com/opendilab/DI-engine into de…

    …v-polish-ngu
    
    Conflicts:
    	dizoo/gym_hybrid/config/gym_hybrid_ddpg_config.py
    puyuan1996 committed Nov 21, 2021
    Configuration menu
    Copy the full SHA
    babf050 View commit details
    Browse the repository at this point in the history
  2. Merge branch 'main' of https://github.com/opendilab/DI-engine into de…

    …v-polish-ngu
    
    Conflicts:
    	dizoo/gym_hybrid/config/gym_hybrid_ddpg_config.py
    puyuan1996 committed Nov 21, 2021
    Configuration menu
    Copy the full SHA
    a87f4bb View commit details
    Browse the repository at this point in the history
  3. style(pu): yapf format

    puyuan1996 committed Nov 21, 2021
    Configuration menu
    Copy the full SHA
    a97a772 View commit details
    Browse the repository at this point in the history
  4. style(pu): yapf format

    puyuan1996 committed Nov 21, 2021
    Configuration menu
    Copy the full SHA
    883ddcf View commit details
    Browse the repository at this point in the history

Commits on Nov 22, 2021

  1. Configuration menu
    Copy the full SHA
    a1b0f89 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    4f9cca2 View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    0bd47fd View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    cb3aa6b View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    fe05389 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    a63494d View commit details
    Browse the repository at this point in the history

Commits on Mar 17, 2022

  1. Configuration menu
    Copy the full SHA
    7de0ce0 View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    7141eb2 View commit details
    Browse the repository at this point in the history

Commits on Mar 20, 2022

  1. style(pu): yapf format

    puyuan1996 committed Mar 20, 2022
    Configuration menu
    Copy the full SHA
    ba4d45a View commit details
    Browse the repository at this point in the history
  2. Configuration menu
    Copy the full SHA
    363003b View commit details
    Browse the repository at this point in the history
  3. fix(pu): fix import error

    puyuan1996 committed Mar 20, 2022
    Configuration menu
    Copy the full SHA
    d552a69 View commit details
    Browse the repository at this point in the history
  4. fix(pu): fix unittest

    puyuan1996 committed Mar 20, 2022
    Configuration menu
    Copy the full SHA
    2bdd053 View commit details
    Browse the repository at this point in the history
  5. Configuration menu
    Copy the full SHA
    5df07c2 View commit details
    Browse the repository at this point in the history

Commits on Mar 21, 2022

  1. Configuration menu
    Copy the full SHA
    1ca8b10 View commit details
    Browse the repository at this point in the history