BipedalWalker-v3-TD3_RL

Teaching a bipedal bot to walk using the TD3 algorithm (an actor-critic reinforcement learning method)

Paper for the TD3 algorithm: https://arxiv.org/pdf/1802.09477.pdf
Data recordings: https://share.vidyard.com/watch/tDm31KXAXSTrkoDunmM2od

The experiment trains the BipedalWalker with the TD3 reinforcement learning algorithm, using 3 fully connected (FC) layers for both the actor and the critic, and then explores how different variations in the state and action space affect the walking styles and learning patterns of the model.
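To make the TD3 update more concrete, here is a minimal sketch of how the critic target is formed in the TD3 paper: Gaussian noise is clipped and added to the target actor's action (target policy smoothing), and the bootstrap value is the smaller of the two target critics' estimates (clipped double-Q). It is plain Python on scalars and lists rather than the networks in this repository. The function names and the default values (`noise_std=0.2`, `noise_clip=0.5`, `gamma=0.99`) are assumptions for illustration and are not taken from `TD3.py`.

```python
import random


def smoothed_target_action(mu_next, max_action=1.0, noise_std=0.2,
                           noise_clip=0.5, rng=random):
    """Target policy smoothing: perturb the target actor's action.

    mu_next is the action proposed by the target actor for the next state.
    Each component gets Gaussian noise clipped to [-noise_clip, noise_clip],
    and the result is clipped back to the valid action range.
    """
    smoothed = []
    for a in mu_next:
        noise = max(-noise_clip, min(noise_clip, rng.gauss(0.0, noise_std)))
        smoothed.append(max(-max_action, min(max_action, a + noise)))
    return smoothed


def td3_target(reward, done, q1_next, q2_next, gamma=0.99):
    """Clipped double-Q target.

    q1_next and q2_next are the two target critics' values for the next
    state and the smoothed target action. Taking the minimum counteracts
    overestimation; terminal transitions do not bootstrap.
    """
    not_done = 0.0 if done else 1.0
    return reward + gamma * not_done * min(q1_next, q2_next)


# Usage: a non-terminal step bootstraps from the smaller critic value.
example_target = td3_target(reward=1.0, done=False, q1_next=10.0,
                            q2_next=8.0, gamma=0.5)
```

In a full implementation both critics are regressed towards this shared target, while the actor and the target networks are updated less often than the critics.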
The experiments have been conducted with the following configurations of the environment using custom gym wrapper functions:

1. Reduced state space (17/24 states available for training)
2. Reduced action space (3/4 actions available for the model)
3. Limited action space (action range limited to half of its potential, [-0.5, 0.5])
4. Limited action and reduced state space (19/24 states, combined with the action limit of configuration 3)
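The state and action variations listed above could be sketched as three small transformations. This is an illustrative sketch and not the repository's wrapper code: the function names are hypothetical, the README does not say which observation entries are kept or which action is disabled, and clipping (rather than rescaling) is assumed for the limited action range. In a gym setup, helpers like these would typically be called from the `observation()` method of a `gym.ObservationWrapper` subclass or the `action()` method of a `gym.ActionWrapper` subclass.

```python
def reduce_observation(obs, keep_indices):
    """Return only the observation entries the agent is allowed to see."""
    return [obs[i] for i in keep_indices]


def limit_action(action, limit=0.5):
    """Clip every action component to [-limit, limit] (assumed: clipping)."""
    return [max(-limit, min(limit, a)) for a in action]


def expand_action(reduced_action, dropped_index, fill_value=0.0):
    """Rebuild a full action from a reduced one.

    The agent outputs one component fewer than the environment expects;
    the disabled component is re-inserted with a fixed value.
    """
    full = list(reduced_action)
    full.insert(dropped_index, fill_value)
    return full


# Usage with placeholder data (the indices are arbitrary examples).
full_obs = [float(i) for i in range(24)]            # 24-entry observation
reduced_obs = reduce_observation(full_obs, range(17))
limited = limit_action([1.0, -0.8, 0.2, 0.0])
expanded = expand_action([0.1, 0.2, 0.3], dropped_index=3)
```

Keeping the transformations as plain functions makes them easy to test in isolation before wiring them into environment wrappers.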

Folders and files:

Baseline
Deep_All_Reduced
Deeper_Baseline
Limited_Action
Limited_Action_Reduced_Observation
Reduced_Action_3
Reduced_Observations
README.md
TD3.py
export_graph.py
model.py
test.py
train.py
train_3_actions.py
train_reduced_action.py
train_reduced_obs.py
utils.py


imsrinin/BipedalWalker-v3-TD3_RL
