Reinforcement Learning RL Zoo - Implementation of RL algorithms. Dependencies Pytorch OpenAI Gym RL Algorithms Classic Algorithms Value Iteration Policy Evaluation - Monte Carlo Estimation Policy Improvement - Monte Carlo ES SARSA, On-Policy TD Control Q-learning, Off-Policy TD Control Q-learning Deep Q-learning Double DQN Prioritized Experience Replay Policy Gradient REINFORCE Proximal Policy Optimization (PPO) Actor-Critic Adavantage Actor-Critic (A2C) Deep Deterministic Policy Gradient (DDPG) Twin-Delayed Deep Deterministic Policy Gradient (TD3) References Reinforcement Learning - An Introduction, Sutton and Barto Deep Reinforcement Learning - Course by Lerrel Pinto