RL for Tic-Tac-Toe 10x10

Code for an extended version of Tic-Tac-Toe: the board is 10x10, and a player wins by placing five "x" or "o" marks in a line along a row, a column, or either diagonal.
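
To make the winning rule concrete, here is a minimal sketch of a win check on a plain list-of-lists board. The names `find_winner`, `empty_board`, `BOARD_SIZE`, and `WIN_LENGTH` are illustrative assumptions and are not taken from the `neuroxo` library, which may represent the board differently.

```python
from typing import List, Optional

BOARD_SIZE = 10  # board side length, from the game description
WIN_LENGTH = 5   # marks in a line needed to win

# Scan directions: right, down, down-right diagonal, down-left diagonal.
DIRECTIONS = [(0, 1), (1, 0), (1, 1), (1, -1)]


def empty_board() -> List[List[Optional[str]]]:
    """Hypothetical helper: a BOARD_SIZE x BOARD_SIZE board of empty cells."""
    return [[None] * BOARD_SIZE for _ in range(BOARD_SIZE)]


def find_winner(board: List[List[Optional[str]]]) -> Optional[str]:
    """Return "x" or "o" if that mark fills WIN_LENGTH cells in a line, else None."""
    n = len(board)
    for r in range(n):
        for c in range(n):
            mark = board[r][c]
            if mark is None:
                continue
            for dr, dc in DIRECTIONS:
                # Skip lines that would run off the board.
                end_r = r + (WIN_LENGTH - 1) * dr
                end_c = c + (WIN_LENGTH - 1) * dc
                if not (0 <= end_r < n and 0 <= end_c < n):
                    continue
                if all(board[r + k * dr][c + k * dc] == mark
                       for k in range(WIN_LENGTH)):
                    return mark
    return None
```

Each occupied cell is treated as the possible start of a line in four directions, so every row, column, and diagonal segment of length five is checked exactly once.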

We implement an RL agent trained with the AlphaZero algorithm [1].
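
For readers unfamiliar with AlphaZero-style search, the sketch below shows the PUCT rule that the tree search in [1] uses to pick which move to explore next: it maximizes Q(a) + c_puct * P(a) * sqrt(sum of visit counts) / (1 + N(a)). This is an illustration of the published rule, not this repository's implementation; the function name `select_action`, the dictionary-based statistics, and the default `c_puct=1.0` are assumptions.

```python
import math
from typing import Dict, Hashable, Optional


def select_action(
    priors: Dict[Hashable, float],        # P(a): policy-network prior per move
    visit_counts: Dict[Hashable, int],    # N(a): visits per move so far
    value_sums: Dict[Hashable, float],    # W(a): summed backed-up values per move
    c_puct: float = 1.0,                  # exploration constant (assumed value)
) -> Optional[Hashable]:
    """Return the move with the highest PUCT score, or None if there are no moves."""
    total_visits = sum(visit_counts.get(a, 0) for a in priors)
    best_action, best_score = None, -math.inf
    for action, prior in priors.items():
        n = visit_counts.get(action, 0)
        # Mean value Q(a); unvisited moves default to 0.
        q = value_sums.get(action, 0.0) / n if n > 0 else 0.0
        # Exploration bonus: large for high-prior, rarely visited moves.
        u = c_puct * prior * math.sqrt(total_visits) / (1 + n)
        if q + u > best_score:
            best_action, best_score = action, q + u
    return best_action
```

The bonus term shrinks as a move accumulates visits, so the search gradually shifts from following the network's prior to trusting the measured values.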

Repository Structure

├── agents                      # pre-trained agents to play with
│   └── ...
│
├── assets                      # images used in gui
│   └── ...
│
├── gui                         # classes to draw the game board
│   └── ...
│
├── neuroxo                     # project's library
│   └── ...
│
├── notebooks                   # some experimental visualization
│   └── ...
│
├── scripts
│   ├── run_zero_data_gen.py    # 1st part of training: continuously generates new data using the best model
│   └── run_zero_train_val.py   # 2nd part of training: trains the current model on the generated data and validates it against the best model
│
└── play.py                     # main entry point: play against the RL agent or use multiplayer mode
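
The two training scripts listed above imply a gating step: the current model is validated against the best model, and presumably replaces it when it performs well enough. The sketch below shows one common way to make that decision. It is an assumption about the general approach, not the logic in `run_zero_train_val.py`: the function name `should_promote`, counting a draw as half a win, and the 0.55 threshold (the win rate used for this purpose in [1]) are illustrative choices.

```python
def should_promote(wins: int, losses: int, draws: int,
                   threshold: float = 0.55) -> bool:
    """Decide whether the current model should replace the best model.

    wins/losses/draws are counted from the current model's point of view
    over a set of validation games. A draw is scored as half a win
    (an assumption made for this sketch).
    """
    games = wins + losses + draws
    if games == 0:
        # No validation games played: keep the existing best model.
        return False
    score = (wins + 0.5 * draws) / games
    return score >= threshold
```

For example, 60 wins and 40 losses give a score of 0.6 and would promote the current model, while an even 50/50 split would not.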

Installation

Run the following commands from the directory where you want the repository to be installed:

git clone git@github.com:BorisShirokikh/neuro-xo.git
cd neuro-xo
pip install -e .

References

[1] Silver, David, et al. "Mastering the game of Go without human knowledge." Nature 550.7676 (2017): 354-359.
