RL Lab

This repository contains the code for various reinforcement learning algorithms implemented in PyTorch. The code is mostly my personal notes and experiments with these algorithms for education purpose. The code is not optimized, nor intended, for production use.

The algorithms are trained on various Farama Gymansium environments. The code is written in a modular way so that it can be easily extended to other environments. Some trained weights are also provided in the models directory for reference. I did not compare against openai baselines but feel free to optimize the code and compare the results.

The repository is being updated regularly as I explore more algorithms.

Learn more about gymnasium here: Farama Gymnasium

Algorithms

There are only model-free DRL algorithms for now. The list is not exhaustive, and I will keep adding more algorithms as I learn about them.

Q-Learning Algorithms	Policy Gradient Algorithms
- [x] Deep Q-Network (DQN)	- [x] Deep Deterministic Policy Gradient (DDPG)
- [x] Double Deep Q-Network (DDQN)	- [x] Proximal Policy Optimization (PPO)
	- [x] REINFORCE (vanilla policy gradient)

Environments

Classic Control	Box2D	Atari
CartPole-v1	LunarLander-v2	PongNoFrameskip-v4
MountainCar-v0	BipedalWalker-v3	BreakoutNoFrameskip-v4
Acrobot-v1
Pendulum-v0

Requirements

Using a requirements.txt will likely not work so I will not provide one. You should just install these core libraries used using your own preferred method. I use conda to install (most of) the libraries.

torch (used for almost all models)
tensorflow (only used in 1-2 models)
gymnasium (for Gym classical control)
ale-py (for Atari environments)
stable-baselines3 (wrapper for Gym environments)
tensorboard (for logging)
wandb (for logging)
scikit-learn, numpy, matplotlib, pandas, scipy
yfinance (for financial data API)

Former Quantitative Trading Project

The gdqn and deep_learning folder contains the code for a former quantitative trading project. The project was an attempt to use reinforcement learning to train a trading agent to trade on the stock market. The agent and environment and data have already been built. The project is still in progress but I amm not sure when will it progress further.

Financial Optimization Notes

The optimization folder contains some notes on financial optimization. Many people do experiments with using reinforcement learning in financial markets, so I believe it is a good idea to build up some basic knowledge regarding numerical optimization in finance.

Again, the algorithms are not optimized and are for educational purposes only.

Resources

I would like to give thanks to:

Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto
OpenAI Spinning Up
DeepMind X UCL
Various online textbooks such as this one
Various blog articles and YouTube videos

Research Papers

The following research papers were used as references for the implementation of the algorithms. This is not an exhaustive list, but it is a good starting point for anyone interested in learning more about reinforcement learning. I would keep adding to them from time to time.

Name		Name	Last commit message	Last commit date
Latest commit History 63 Commits
ddpg		ddpg
deep_learning		deep_learning
gdqn		gdqn
models		models
optimization		optimization
pg		pg
ppo		ppo
utils		utils
.gitattributes		.gitattributes
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
setenv.sh		setenv.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RL Lab

Algorithms

Environments

Environments

Requirements

Former Quantitative Trading Project

Financial Optimization Notes

Resources

Research Papers

About

Releases

Packages

Languages

License

supreme-gg-gg/rl-lab

Folders and files

Latest commit

History

Repository files navigation

RL Lab

Algorithms

Environments

Environments

Requirements

Former Quantitative Trading Project

Financial Optimization Notes

Resources

Research Papers

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages