epsilon-greedy

Here are 3 public repositories matching this topic...

RFLeijenaar / RL-Tabular-Rubikscube

Reinforcement Learning with tabular methods: TD-learning (Q-learning and SARSA) and MENACE-like approach applied to a Rubik's cube with a move set restricted to 180-degree turns.

reinforcement-learning q-learning epsilon-greedy sarsa simulated-annealing td-learning softmax menace-matchboxes

Updated Aug 1, 2021
C

RFLeijenaar / RL-KArmed-Bandit

K-armed bandit problem approached with a variety of action-selection learning algorithms.

reinforcement-learning epsilon-greedy k-armed-bandit pursuit-algorithms reinforcement-comparison stochastic-gradient-ascent

Updated Dec 9, 2020
C
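
For readers unfamiliar with the topic tag, the following is a minimal sketch of epsilon-greedy action selection on a K-armed bandit, written in C to match the listed repositories. It is not taken from any of them: the function names `argmax`, `epsilon_greedy`, and `update_estimate` are hypothetical, and the sample-average update is one common choice among several.

```c
#include <stdlib.h>

/* Hypothetical helper: index of the largest value estimate.
   Ties go to the lowest index. */
int argmax(const double *q, int k)
{
    int best = 0;
    for (int a = 1; a < k; a++) {
        if (q[a] > q[best]) {
            best = a;
        }
    }
    return best;
}

/* With probability epsilon pick a uniformly random arm (explore),
   otherwise pick the arm with the highest estimate (exploit).
   rand() % k has a slight modulo bias, acceptable for a sketch. */
int epsilon_greedy(const double *q, int k, double epsilon)
{
    double u = (double)rand() / ((double)RAND_MAX + 1.0); /* u in [0, 1) */
    if (u < epsilon) {
        return rand() % k;
    }
    return argmax(q, k);
}

/* Sample-average update: move the estimate for arm a toward the
   observed reward by 1/n[a], where n[a] counts pulls of that arm. */
void update_estimate(double *q, int *n, int a, double reward)
{
    n[a] += 1;
    q[a] += (reward - q[a]) / n[a];
}
```

A typical loop would call `epsilon_greedy` to choose an arm, observe a reward from the environment, and pass both to `update_estimate`. Setting `epsilon` to 0 gives purely greedy behaviour, while 1 gives uniform random exploration.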

Hyeon9mak / HCP_2020

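🎮 Pokémon pathfinding game (team project for the Advanced C Programming course, School of Computer and Information Engineering, Kwangwoon University)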

q-learning epsilon-greedy q-learning-algorithm frozen-lake-game

Updated Dec 6, 2020
C
