This repository contains a re-implementation of the Proximal Policy Optimization (PPO) algorithm, originally sourced from Stable-Baselines3.
The purpose of this re-implementation is to provide insight into the inner workings of the PPO algorithm in these environments:
- LunarLander-v2
- CartPole-v1
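For orientation, the short snippet below (illustrative only, not part of this repository) creates both environments with Gymnasium and prints their observation and action spaces:

```python
import gymnasium as gym

# LunarLander-v2: 8-dimensional observations, 4 discrete actions
# CartPole-v1:    4-dimensional observations, 2 discrete actions
for env_id in ("LunarLander-v2", "CartPole-v1"):
    env = gym.make(env_id)
    print(env_id, env.observation_space, env.action_space)
    env.close()
```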
- Install Python 3.9.x
- Install Visual C++ 14.0 or greater from https://visualstudio.microsoft.com/visual-cpp-build-tools/
- Run `pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu118`
- Run `pip install "stable-baselines3[extra]==2.2.1"`
- Run `pip install swig`
- Run `pip install gymnasium`
- Run `pip install "gymnasium[box2d]"` (the quotes keep shells such as zsh from expanding the brackets)
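After installing, a quick sanity check (again illustrative, not part of the repository) confirms that PyTorch sees the GPU and that the Box2D build succeeded:

```python
import torch
import gymnasium as gym

print("CUDA available:", torch.cuda.is_available())

# This raises if swig/Box2D did not install correctly
env = gym.make("LunarLander-v2")
obs, info = env.reset(seed=0)
print("Observation shape:", obs.shape)
env.close()
```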
- Change the game in `main.py` as you wish (`LunarLander-v2` / `CartPole-v1`); a sketch of what this edit might look like follows this list.
- Simply run `python main.py`.
- Simply run `python test.py` (as of now, the test script loads my best model for both LunarLander-v2 and CartPole-v1).
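The game switch referenced above is presumably a single environment-id string in `main.py`; a hypothetical sketch (the variable name is assumed, not taken from the repository):

```python
# Hypothetical excerpt of main.py: pick one of the two supported environments
GAME = "LunarLander-v2"  # or "CartPole-v1"
```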
The re-implementation covers the following components:
- Rollout Buffer
- Model
- Training phase
- Testing phase
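To make these components concrete, here is a minimal, self-contained sketch of the two calculations they revolve around: Generalized Advantage Estimation in the rollout buffer, and the clipped surrogate loss used in the training phase. All names are illustrative assumptions, not the repository's actual API:

```python
import torch

def compute_gae(rewards, values, dones, last_value, gamma=0.99, gae_lambda=0.95):
    """GAE as a rollout buffer typically computes it (one common formulation)."""
    advantages = torch.zeros_like(rewards)
    gae = 0.0
    for t in reversed(range(len(rewards))):
        next_value = last_value if t == len(rewards) - 1 else values[t + 1]
        next_non_terminal = 1.0 - dones[t]          # zero out the bootstrap at episode ends
        delta = rewards[t] + gamma * next_value * next_non_terminal - values[t]
        gae = delta + gamma * gae_lambda * next_non_terminal * gae
        advantages[t] = gae
    return advantages

def ppo_policy_loss(new_log_probs, old_log_probs, advantages, clip_range=0.2):
    """Clipped surrogate objective from the PPO paper."""
    ratio = torch.exp(new_log_probs - old_log_probs)   # pi_theta / pi_theta_old
    unclipped = ratio * advantages
    clipped = torch.clamp(ratio, 1.0 - clip_range, 1.0 + clip_range) * advantages
    return -torch.min(unclipped, clipped).mean()       # maximize objective = minimize its negative
```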
Planned features:
- Run the game from the terminal (example: `python main.py --game 'LunarLander-v2'`)
- Load a model from the terminal (example: `python main.py --game 'LunarLander-v2' --model 'model.pt'`)
- Support the `CarRacing-v2` environment
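If the terminal flags above get implemented, Python's standard argparse is a natural fit; this is only a suggested wiring, with the `--game` and `--model` flag names taken from the examples above:

```python
import argparse

parser = argparse.ArgumentParser(description="PPO re-implementation entry point")
parser.add_argument("--game", default="LunarLander-v2",
                    choices=["LunarLander-v2", "CartPole-v1"],
                    help="Gymnasium environment id")
parser.add_argument("--model", default=None,
                    help="Optional checkpoint to load, e.g. model.pt")
args = parser.parse_args()
```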
This repository includes code that has been adapted from the Stable-Baselines3 library (https://github.com/DLR-RM/stable-baselines3) for educational purposes only. The original code is the property of its respective owners and is subject to their licensing terms.
I do not claim any ownership, copyright, or proprietary rights over the code obtained from Stable-Baselines3. The use of that code in this repository is solely for educational and learning purposes, and any commercial use or distribution is subject to the original licensing terms provided by Stable-Baselines3.
The original Stable-Baselines3 code is licensed under the MIT License, and any use of it in this repository is likewise subject to the terms of the MIT License.