BipedalWalker-v3-TD3_RL

Teaching an bipedal bot how to walk using a TD3 algorithm (variant of Reinforcement Learning - Actor & Critic method)

Paper for the TD3 algorithm: https://arxiv.org/pdf/1802.09477.pdf
/https://share.vidyard.com/watch/tDm31KXAXSTrkoDunmM2od? (data recordings)

The experiment focusses on training the BipedalWalker using reinforcement learning algorithm (TD3) using 3 FC layers for both actor and critic and then explore how different variations in the state and action space effect the walking styles and learning patterns of the model.
The experiments have been conducted with the following configurations of the environment using custom gym wrapper functions:

Reduced state space (17/24 states available for training)
Reduced action space (3/4 actions available for the model)
Limited action space (action range limited to half of its potential [-0.5,0.5])
Limited action and reduced state space (19/24 states and config-3)

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

BipedalWalker-v3-TD3_RL

Files

README.md

Latest commit

History

README.md

File metadata and controls

BipedalWalker-v3-TD3_RL