- ENV_NAME = 'FetchSlide-v0'/'FetchPush-v0'/'FetchPickAndPlace-v0'/'FetchReach-v0' (if using FetchReach-v0, change ddpg.py line 388 and line 393 as per the instructions in that function)
- limit (replay memory size)
- critic_lr = 5e-4 (default)
- actor_lr = 5e-4 (default)
- batch_size = 32 (default)
- nb_steps = 1500000 (number of training steps)
- file_interval = 10000 (number of steps between checkpoint saves)
- network_architecture (no single best choice known)
- target_model_update = 1e-3 (soft-update rate for the target networks; see the soft-update sketch after this list)
- delta_clip = np.inf (Huber loss hyperparameter) [ignore for now]
- HER = True/False
- K = any integer (default: 4 works best)
- HER_strategy = 'future' OR 'episode' (see the HER relabeling sketch after this list)
- PER = True/False
- alpha = float between 0 and 1
- beta = float between 0 and 1
- epsilon = very small constant (avoids the zero-priority edge case when sampling; see the PER sampling sketch after this list)
- actor batch size
- actor memory size
- actor memory prioritisation (alpha)
- actor bias annealing (beta)
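
For reference, target_model_update is the Polyak/soft-update rate tau. A minimal sketch of what such an update does (illustrative only, not the keras-rl implementation; the function name and NumPy weight lists are assumptions):

```python
import numpy as np

def soft_update(target_weights, online_weights, tau=1e-3):
    # Polyak averaging: target <- tau * online + (1 - tau) * target
    return [tau * w + (1.0 - tau) * wt
            for w, wt in zip(online_weights, target_weights)]

# Dummy weight tensors standing in for actor/critic layer weights.
online = [np.ones((4, 4)), np.zeros(4)]
target = [np.zeros((4, 4)), np.ones(4)]
target = soft_update(target, online, tau=1e-3)
```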
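
The HER/K/HER_strategy entries control hindsight goal relabeling: each stored transition is replayed K extra times with its goal replaced by a goal actually achieved later in the same episode ('future') or anywhere in the episode ('episode'). A minimal sketch under assumed transition keys and a sparse 0/-1 reward (the function name, keys, and threshold are illustrative, not the ones used in ddpg.py):

```python
import numpy as np

def her_relabel(episode, k=4, strategy='future', threshold=0.05):
    """Return k relabeled copies of every transition in an episode.

    episode: list of dicts with (assumed) keys
        'obs', 'action', 'goal', 'achieved_goal', 'next_achieved_goal'
    """
    extra = []
    T = len(episode)
    for t, tr in enumerate(episode):
        for _ in range(k):
            if strategy == 'future':
                src = np.random.randint(t, T)   # goal achieved at or after step t
            else:                               # 'episode'
                src = np.random.randint(0, T)   # goal achieved anywhere in the episode
            new_goal = np.asarray(episode[src]['achieved_goal'])
            achieved = np.asarray(tr['next_achieved_goal'])
            # Sparse reward: 0 if close enough to the substituted goal, else -1.
            reward = 0.0 if np.linalg.norm(achieved - new_goal) < threshold else -1.0
            extra.append({**tr, 'goal': new_goal, 'reward': reward})
    return extra
```

The relabeled transitions are stored in the replay memory alongside the original ones.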
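
The PER/alpha/beta/epsilon entries (and the per-actor alpha/beta above) follow proportional prioritized replay: transitions are sampled with probability proportional to |TD error|^alpha, and the resulting bias is corrected with importance-sampling weights annealed by beta. A minimal NumPy sketch of the sampling step (illustrative; the actual code uses a prioritized memory class, and the default alpha/beta values here are just examples):

```python
import numpy as np

def per_sample(td_errors, batch_size=32, alpha=0.6, beta=0.4, epsilon=1e-6):
    # Priorities: p_i = (|delta_i| + epsilon)^alpha; epsilon avoids zero priority.
    priorities = (np.abs(td_errors) + epsilon) ** alpha
    probs = priorities / priorities.sum()            # P(i) = p_i / sum_k p_k
    idx = np.random.choice(len(td_errors), size=batch_size, p=probs)
    # Importance-sampling weights: w_i = (N * P(i))^(-beta), normalized by the max.
    weights = (len(td_errors) * probs[idx]) ** (-beta)
    weights /= weights.max()
    return idx, weights

# Example: draw a batch of 32 indices from 1000 stored TD errors.
idx, w = per_sample(np.random.rand(1000), batch_size=32, alpha=0.6, beta=0.4)
```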
Instructions:
General:
- Make the .sh file executable: chmod u+x train.sh
- Set the hyperparameters in the train.sh file accordingly
- After setting the hyperparameters, run ./train.sh
- To stop training, press Ctrl+C at any point (ideally after at least 10000 steps, so that a checkpoint has been saved)
- The code then plots the results and tests the learned model for 5 episodes (default)
- To resume training from pretrained weights, pass the --pretrained argument
Extra:
- Keep check_json.py in the same folder as ddpg.py (in general, keep the entire folder structure unchanged)
- Create HER/ and PHER/ subfolders in the examples directory
Note: The code is based on the keras-rl repository (https://github.com/keras-rl/keras-rl)
References:
- keras-rl, Matthias Plappert, 2016, https://github.com/keras-rl/keras-rl
- Hindsight Experience Replay (M. Andrychowicz et al., 2017)
- Distributed Prioritized Experience Replay (D. Horgan et al., 2018)
- Universal Value Function Approximators (T. Schaul et al., 2015)
- https://github.com/minsangkim142/hindsight-experience-replay/blob/master/HER.py