This repository contains all of the Reinforcement Learning-related projects I've worked on. The projects are part of the graduate course at the University of Tehran.
monte-carlo epsilon-greedy policy-gradient sarsa dynamic-programming policy-iteration model-based-rl n-armed-bandit-problem on-policy off-policy double-q-learning model-free-rl n-step-bootstrapping n-step-expected-sarsa n-step-tree-backup ucb-algorithm
-
Updated
Oct 2, 2021 - HTML