Reinforcement learning with tabular methods: TD learning (Q-learning and SARSA) and a MENACE-like approach, applied to a Rubik's cube with the move set restricted to 180-degree turns.
Updated Aug 1, 2021 - C
K-armed bandit problem approached with a variety of action-selection learning algorithms.
🎮 Pokémon pathfinding game (team project for Advanced C Programming, School of Computer and Information Engineering, Kwangwoon University)
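For readers new to the topic these repositories share, here is a minimal sketch of epsilon-greedy action selection on a K-armed bandit, paired with an incremental sample-average update of the value estimates. It is not taken from any of the repositories listed above. The function names (`greedy_action`, `egreedy_select`, `update_estimate`) and the choice of sample averaging are assumptions made for illustration only.

```c
#include <assert.h>
#include <stdlib.h>

/* Hypothetical helper: index of the largest estimate.
 * Ties go to the lowest index. */
static int greedy_action(const double *q, int k)
{
    int best = 0;
    for (int a = 1; a < k; a++) {
        if (q[a] > q[best]) {
            best = a;
        }
    }
    return best;
}

/* Epsilon-greedy selection over k arms: with probability epsilon pick
 * an arm uniformly at random (explore), otherwise pick the arm with
 * the highest current estimate (exploit). */
int egreedy_select(const double *q, int k, double epsilon)
{
    assert(k > 0);
    double u = rand() / (RAND_MAX + 1.0);  /* uniform in [0, 1) */
    if (u < epsilon) {
        /* rand() % k has a slight modulo bias; acceptable for a sketch. */
        return rand() % k;
    }
    return greedy_action(q, k);
}

/* Incremental sample-average update for arm a:
 * Q <- Q + (reward - Q) / N, where N counts pulls of arm a. */
void update_estimate(double *q, int *n, int a, double reward)
{
    n[a] += 1;
    q[a] += (reward - q[a]) / n[a];
}
```

A caller would typically seed the generator once with `srand`, then loop: choose an arm with `egreedy_select`, observe a reward from the environment, and pass it to `update_estimate`. Setting `epsilon` to 0 gives purely greedy behaviour, while 1 gives uniform random exploration.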