Q-learning and Q-value iteration algorithms for the Block-World environment.
-
Updated
Jan 31, 2021 - Jupyter Notebook
Q-learning and Q-value iteration algorithms for the Block-World environment.
Build an RL (Reinforcement Learning) agent that learns to play Numerical Tic-Tac-Toe. The agent learns the game by Q-Learning.
Add a description, image, and links to the q-value-iteration topic page so that developers can more easily learn about it.
To associate your repository with the q-value-iteration topic, visit your repo's landing page and select "manage topics."