TD-Othello README

CS 701 Fall 2016 Seminar Project, Middlebury College

How to play (in othello/playOthello.py):

1. Load a neural network. These networks are loaded with the following inputs:
    - filename
    - training player
    - opponent training player
    - lambda value, given as an integer code (9 for 0.9, 1 for 1.0)
2. Choose a mode to play with:
    - play0: Plays a game with two computer players in the command line
    - play1: Plays a game with one computer player and one human player in the command line
    - play2: Plays a game with two human players in the command line
    - playGui: Plays a game with one computer player and one human player in a Tkinter GUI

TD Lambda algorithm:

TD Lambda, usually written TD(λ), is a temporal-difference learning algorithm introduced by Richard Sutton. More information on the algorithm can be found in the paper included in this repository or at this link: https://webdocs.cs.ualberta.ca/~sutton/papers/sutton-89.pdf
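
As a rough illustration of the idea, here is a minimal sketch of one TD(λ) update pass over a finished game. It is not the code used in this project: it assumes a linear value function over feature vectors rather than a neural network, a single reward given only at the end of the game, and the hypothetical names `td_lambda_episode`, `alpha`, `lam`, and `gamma`.

```python
def dot(a, b):
    """Dot product of two equal-length sequences."""
    return sum(x * y for x, y in zip(a, b))


def td_lambda_episode(weights, states, final_reward,
                      alpha=0.1, lam=0.9, gamma=1.0):
    """Apply TD(lambda) updates for one episode and return the weights.

    Assumptions (not taken from this repository):
      - the value of a state is dot(weights, features), i.e. linear
      - `states` is the list of feature vectors visited, in order
      - the only reward is `final_reward`, received after the last state
    """
    weights = list(weights)
    traces = [0.0] * len(weights)  # eligibility trace per weight
    for t, x in enumerate(states):
        value = dot(weights, x)
        if t + 1 < len(states):
            # Bootstrap from the current estimate of the next state.
            target = gamma * dot(weights, states[t + 1])
        else:
            # Terminal step: the target is the game outcome.
            target = final_reward
        delta = target - value  # temporal-difference error
        # Decay old traces, then add the gradient of the value (x itself).
        traces = [gamma * lam * e + xi for e, xi in zip(traces, x)]
        weights = [w + alpha * delta * e for w, e in zip(weights, traces)]
    return weights


# Three one-hot states, win (reward 1) at the end of the game.
episode = [[1, 0, 0], [0, 1, 0], [0, 0, 1]]
print(td_lambda_episode([0.0, 0.0, 0.0], episode, 1.0, alpha=0.5, lam=0.5))
```

With lambda = 0 only the last state's weight is credited for the outcome, while with lambda = 1 every visited state is credited equally. Values in between decay the credit geometrically with distance from the end of the game.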

Known bugs:

With lambda < 1, the output values of the neural network approach 1 for all board states at some point during training. Performance decreases when this occurs.
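
One way to notice this during training is to evaluate the network on a fixed sample of board states every so often and check whether all outputs have collapsed toward 1. The helper below is a hypothetical sketch, not part of this repository. The name `is_saturated` and the 0.99 threshold are assumptions, and `outputs` stands for whatever list of network values you collect.

```python
def is_saturated(outputs, threshold=0.99):
    """Return True if every value in `outputs` is at or above `threshold`.

    `outputs` is assumed to be a list of network output values, one per
    sampled board state. An empty list is never reported as saturated.
    """
    return len(outputs) > 0 and all(v >= threshold for v in outputs)


# Mixed values are fine; uniformly high values suggest the bug has occurred.
print(is_saturated([0.42, 0.97, 0.61]))
print(is_saturated([0.995, 0.999, 0.998]))
```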
