Stockcheese

Chess AI fully based on artificial neural networks. Intended to be trained against itself.

Stockcheese is pure artifical neural network following the actor-critic paradigm and doesn't rely on explicit/brute-force/"heuristic" monte carlo tree search. Since the critic not only learns the value function of the current state, but also the value function for a sequence of states: aka the monte carlo tree search function, thanks to game sequence processing layers within the architecture among other details.

Creative implementation details

Contains both dense and sparse rewards.
Unlike the standard actor-critic, the critic sents its ouput to the actor as a feedback in both training and inference, as the actor doesn't learn the the monte carlo tree search function, only to match state to action.
Draw is punished proportionally to how high is win rate, if wins are scarce, draw is seen as progress. With high win rates, the opposite.
Exploration is rewarding for wins, allowing SC to learn to explore better trees.
Faster wins are more rewarding. However, slow losses and draws are not more punishing.
Loss function transitions from squared to absolute error as lower error is achieved.

Install for dev

git clone
pip install -r requirements.txt

Use trained model

(Not yet) See save_path = "source/Stockcheese_weights.hd5" for trained weights.

Why pick tensorflow over pytorch

Tensorflow/Keras is very easy to customize and extend, there are examples covering almost every case and the documentation is clearer and wider.

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
source		source
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stockcheese

Creative implementation details

Install for dev

Use trained model

Why pick tensorflow over pytorch

About

Releases

Packages

Languages

License

jm-willy/Stockcheese

Folders and files

Latest commit

History

Repository files navigation

Stockcheese

Creative implementation details

Install for dev

Use trained model

Why pick tensorflow over pytorch

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages