Twin Delayed Deep Deterministic Policy Gradient for Gait Learning (Walker2d-v2) by Pytorch
- To train a new TD3 agent:
python train.py
- To test trained TD3 agent:
python test.py
python 3.7
mujoco-py 2.0.2.13
gym 0.15.4
Twin Delayed Deep Deterministic Policy Gradient for Gait Learning (Walker2d-v2) by Pytorch
python train.py
python test.py
python 3.7
mujoco-py 2.0.2.13
gym 0.15.4