qLearning/README.md at main · zefea/qLearning · GitHub

Implementation of Q-learning algorithm

The aim is to train the mouse as to be able to find the cheese with the best/closest path possible.
Comparision of a random selection and softmax selection can be seen from the pull-request report. Also the same results is avaiable in output files seperately.
There is also a .c version of the program without plots.
- Mouse and Cheese