This repository contains the implementation for the project "Deep Reinforcement Learning for Dynamic Robot Locomotion" by Lisa Coiffard (2022). It provides the code used to train the controllers presented in the main report, as well as scripts to visualise the trained controllers.
To clone this repository, run the following command in a terminal:
git clone --recurse-submodules https://gitlab.doc.ic.ac.uk/AIRL/students_projects/2021-2022/lisa_coiffard/qd_pmtg
To visualise the trained controllers in simulation, first install the following dependencies:
pip install pybullet gym absl-py numpy opensimplex==0.3 matplotlib seaborn scikit-learn
Links to the supplementary videos can be found in Appendix A of the report.
To visualise the generated archive of trajectory generators (TGs), run python plot_map_elites/plot_2d_map.py centroids_500.dat archive_400000.dat
To visualise the trained controller for the specialist agent on flat terrain, first assign the index 178 on line 93 of the pmtg_wrapped.py script to select the TG it was trained on, then run python visualise_flat_terrain.py --archive=archive_400000.dat --filename=policies/specialist_flat.npz
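As an illustration only, the edit might look something like the line below; the variable name is a guess, so check the actual contents of line 93 in pmtg_wrapped.py:

# pmtg_wrapped.py, line 93 (hypothetical variable name, shown for illustration)
tg_index = 178  # index of the TG the specialist policy was trained on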
To visualise the trained controller for the generalist agent on flat terrain, run python visualise_flat_terrain.py --archive=archive_400000.dat --filename=policies/generalist_flat.npz --tg_select=1
To visualise the trained controller for the specialist agent on randomised terrains, first assign the index 178 on line 93 of the pmtg_wrapped.py script to select the TG it was trained on, then run python visualise_domain_randomisation.py --archive=archive_400000.dat --filename=policies/specialist_terrains.npz
To visualise the trained controller for the generalist agent on randomised terrains, first assign the index 178 on line 93 of the pmtg_wrapped.py script, then run python visualise_domain_randomisation.py --archive=archive_400000.dat --filename=policies/generalist_terrains.npz --tg_select=1
To train the controllers according to the methods presented in the main report, run the chosen experiment on the HPC. Uncomment the appropriate training command in the singularity.def file and push the changes. Then change directory to the singularity folder and run:
./build_final_image
Copy the final image to the HPC with the command:
scp final_qd_pmtg_XXX.sif user@login.hpc.ic.ac.uk:
(replace user with your HPC username and select the appropriate image name). You can now create a .job script and submit your job to train your agent.
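As a rough sketch only (the walltime, resource requests, and reliance on the container's runscript are assumptions to adapt to your experiment, and you may also need to load a Singularity/Apptainer module depending on the cluster setup), a minimal PBS job script might look like:

#!/bin/bash
#PBS -l walltime=24:00:00
#PBS -l select=1:ncpus=8:mem=16gb

# Run from the directory the job was submitted from
cd $PBS_O_WORKDIR

# Execute the training command defined in the container image
singularity run final_qd_pmtg_XXX.sif

Submit the script with qsub and monitor the job with qstat.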