TD3_Pytorch_for_Walker

Twin Delayed Deep Deterministic Policy Gradient for Gait Learning (Walker2d-v2) by Pytorch

Usage

python train.py

python test.py

python 3.7
mujoco-py 2.0.2.13
gym 0.15.4