Skip to content

A comprehensive collection of Reinforcement Learning Algorithms implemented in Jupyter Notebook aimed to provide clear, practical, and well-documented code for foundational algorithms.

License

Notifications You must be signed in to change notification settings

Sarmadzandi/Reinforcement-Learning-Algorithms

Repository files navigation

Reinforcement Learning Algorithms

Reinforcement Learning Python

Introduction

This repository includes implementations of the following algorithms:

  • Deep Q-Learning: Utilizing experience replay and target networks.
  • Multi-Armed Bandits: Including strategies like epsilon-greedy and Upper Confidence Bound (UCB).
  • N-step Tree Backup: Implementation for n-step bootstrapping.
  • Off-Policy Learning: Algorithms such as Q-learning.
  • On-Policy Learning: Methods like SARSA.
  • Thompson Sampling: Bayesian approach for balancing exploration and exploitation.
  • Expected SARSA: An enhancement over SARSA with expected rewards.
  • Gradient Preference-Based Methods: Various policy gradient algorithms.
  • Policy Iteration: Classical dynamic programming algorithm for solving MDPs.

About

A comprehensive collection of Reinforcement Learning Algorithms implemented in Jupyter Notebook aimed to provide clear, practical, and well-documented code for foundational algorithms.

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published