GitHub - CAI23sbP/GRU_AC: GRU-PPO for stable-baselines3.

GRU-PPO for stable-baselines3.

How to train

python3 train.py

After training finishes, the LSTM PPO, GRU PPO, and PPO results are saved as videos.

results in BipedalWalker-v3 env

[Note]

All parameters are identical across the three runs; the only difference is whether the policy uses a recurrent neural network.

The LSTM and GRU both use the same hidden_state shape.
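To make the note above concrete, here is a minimal sketch in plain PyTorch (not the code from gru_package) comparing the recurrent state of an LSTM and a GRU of the same size. The hidden size of 256, the single layer, and the batch and sequence lengths are illustrative assumptions; 24 is the observation size of BipedalWalker-v3.

```python
import torch
from torch import nn

OBS_DIM = 24    # BipedalWalker-v3 observation size
HIDDEN = 256    # assumed hidden size, for illustration only
N_LAYERS = 1    # assumed number of recurrent layers
SEQ_LEN, BATCH = 16, 8  # arbitrary rollout chunk

lstm = nn.LSTM(OBS_DIM, HIDDEN, num_layers=N_LAYERS)
gru = nn.GRU(OBS_DIM, HIDDEN, num_layers=N_LAYERS)

# Default layout is (sequence, batch, features).
obs = torch.zeros(SEQ_LEN, BATCH, OBS_DIM)

# The LSTM returns a hidden state and a cell state; the GRU returns only a hidden state.
_, (h_lstm, c_lstm) = lstm(obs)
_, h_gru = gru(obs)


def n_params(module: nn.Module) -> int:
    """Count the trainable parameters of a module."""
    return sum(p.numel() for p in module.parameters() if p.requires_grad)


print("LSTM hidden:", tuple(h_lstm.shape), "cell:", tuple(c_lstm.shape))
print("GRU hidden: ", tuple(h_gru.shape))
print("LSTM params:", n_params(lstm), "GRU params:", n_params(gru))
```

With equal sizes, the hidden state tensors have the same shape, but the LSTM additionally carries a cell state, and the GRU has three gate blocks where the LSTM has four, so it has fewer parameters.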

LSTM PPO

LSTM-episode-0.mp4

GRU PPO

GRU-episode-0.mp4

PPO

PPO-episode-0.mp4

library compatibility

torch: 1.13.1+cu116

stable_baselines3: 2.3.0

sb3_contrib: 2.3.0
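As a convenience, the following sketch compares the installed package versions against the ones listed above. It uses only the standard library. The helper name `installed_versions` is hypothetical, and the keys are the PyPI distribution names (`stable-baselines3`, `sb3-contrib`) rather than the import names.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions listed in this README, keyed by PyPI distribution name.
TESTED = {
    "torch": "1.13.1+cu116",
    "stable-baselines3": "2.3.0",
    "sb3-contrib": "2.3.0",
}


def installed_versions(packages=TESTED):
    """Return {distribution name: installed version, or None if missing}."""
    found = {}
    for name in packages:
        try:
            found[name] = version(name)
        except PackageNotFoundError:
            found[name] = None
    return found


if __name__ == "__main__":
    for name, have in installed_versions().items():
        want = TESTED[name]
        status = "ok" if have == want else "differs"
        print(f"{name}: installed={have} tested={want} ({status})")
```

A differing version does not necessarily mean the code will fail; the list above only records what the project was run with.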

My sister project (related to GRU)

Preference-based RL with a GRU reward model, built for the imitation library:

https://github.com/CAI23sbP/RecurrentRLHF

