Skip to content

An implementation of Proximal Policy Optimization (PPO)

Notifications You must be signed in to change notification settings

SalmanMohammadi/ppo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

PPO

An implementation of Proximal Policy Optimization (PPO) using Generalized Advantage Estimation (GAE) and multi-processing.

Installing

python3 -m venv venv to set up a virtual environment

cd pip install . to install, or pip install -e . for development.

python src/run_ppo.py config/pendulum.yaml trains PPO for a given config file. Examples for different environments with hyperparameters I've found that work well can be found in config/.

Paper Notes

See https://salmanmohammadi.github.io/content/ppo/ for an explanation of the method.

About

An implementation of Proximal Policy Optimization (PPO)

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages