merge code #1

karroyan · 2022-07-07T02:10:12Z

Description

Related Issue

TODO

Check List

merge the latest version of the source branch/repo and resolve all conflicts
pass the style check
pass all the tests

* add infoNCE
* refine infonce test
* add dim
* fix style and pytest
* fix style and pytest
* add dqn_dim test
* fix style
* refine descriptions
* polish infoNCE & ST-DIM
* polish stdim & infonce loss
* fix style
* fix codecov
* fix import
* add readme
* update quick colab link
* add buffer description
* polish buffer api
* polish buffer api

* fix bugs in dmc2gym env.
* fix bugs of libOpenGL.so.0 cannot open shared object file.
* fix bugs in base env manager in subprocess mode when resetting and rendering dmc envs.
* fix dockerfile
* backward compatibility for tb video logging

Co-authored-by: ZHZisZZ <zhanhui@umich.edu>

* feature(pu): add board_games env including tictactoe, gomoku, chess, go, atari
* style(pu): yapf format
* polish(pu): refactor
* polish(pu): polish as review
* polish(pu): move atari_muzero_env to dizoo/atari
* polish(pu):polish as review
* fix(pu): fix tictactoe expert action
* polish(pu): polish board_games env
* style(pu): yapf format

* draft steve
* remove breakpoint
* fix numerical instability
* polish
* polish STEVE
* change flatten_batch -> fold_batch
* change from VE to MVE in README.md
* speed up mbrl test
* Polish docstrings, create unsqueeze_repeat helper function
* fix assertion bug

* polish(lisong): add sqil cosine similarity of expert and agent grad
* polish(lisong): create SQILSACPolicy and add mujoco config
* feature(lisong): add sqil_sac example
* polish(lisong): use pendulum as demo env
* polish(lisong):polish sqil_continuous example
* polish(lisong): remove get config

karroyan and others added 29 commits June 14, 2022 16:20

fix(lxy): fix import path error in lunarlander (#362)

c901a00

fix(wzl): add dt entry in entry/__init__ (#367)

85ce729

style(nyz): update readme and enable dmc docker(dmc2gym docker)

e00c5bf

feature(zlx): support async reset for envpool env manager (#250)

0246f05

* feature(zlx): Support async reset for envpool env manager
* fixbug(zlx): Add final_eval_reward in returned dict

fix(nyz): fix gail unittest ci bug

f0210eb

polish(zjow): impala cnn encoder refactor. (#378)

ce286cd

* impala cnn encoder refactor.
* Minor change.

fix(zjow): fix for dmc env replay and opengl settings

5178676

test(wyh):add plot test code (#370)

bec0d8d

* test(wyh):add plot test code
* test(wyh):add plot test code
* test(wyh):add compare two different PICS codes
* test(wyh):add compare two different PICS
* test(wyh):compare two different PICS

fix(nyz): fix normed nn unittest bug(dmc2gym docker)

268d77d

feature(nyz): add pure ppo policy gradient policy (#382)

8fd08a8

* feature(nyz): add pure ppo policy gradient policy
* fix(nyz): fix flake8 style problem and remove redundant codes of continuous bc

fix(nyz): fix world model unittest repeat name bug

8e8e53c

fix(nyz): fix bc policy unittest

412bc26

style(nyz): update mujoco docker download path (#386)

549f2eb

v0.4.0

47940ef

fix(xjx): remove pace controller (#400)

8c817b6

feature(whl): add trex new pipeline example (#380)

c302382

* add trex example
* polish import
* merge main

feature(rjy): add discrete pendulum env (#395)

b89d477

* test(rjy):add discrete pendulum env
* test(rjy):fix discrete pendulum env
* test(rjy): change the parameter
* test(rjy): revise the pend_dqn config
* test(rjy): correct spelling mistakes
* test(rjy): modify the format

demo(lwq): add new pipeline continuous examples: ddpg, td3 and d4pg (#384)

7575a7c

* dev(lwq) continuous examples: ddpg & td3
* dev(lwq) continuous examples: ddpg, d4pg & td3

fix(nyz): fix random action policy randomness

5e2265e

fix(nyz): fix new pipeline ddpg/td3/d4pg act_scale bug

43d4ea9

polish(lwq): polish VAE implementation (#404)

83b94ec

* polish(lwq): polish VAE
* remove base VAE class

fix(nyz): fix action_space seed compatibility bug

dc0e2e6

fix(xjx): discard message sent by self in redis mq (#354)

0bbd6a5

* Discard message sent by self in redis mq
* Set running flag of redis mq
* Rename nng finished to running
* Fix self._mq is none
* Turn deque into list

feature(zp): add c51/qrdqn/iqn new pipeline example (#407)

3a65fd8

* add c51/qrdqn/iqn newpipeline example
* fix style

karroyan merged commit 6ae5797 into karroyan:main Jul 7, 2022
