Skip to content

Deep reinforcement learning algorithm to solve Connect 4, based on AlphaZero

License

Notifications You must be signed in to change notification settings

zjeffer/connect4-deep-rl

Repository files navigation

AlphaZero on a Connect 4 environment

A deep reinforcement learning algorithm that plays Connect 4, based on AlphaZero. I'm creating this because my chess algorithm learns too slowly, and I wanted to know if the problem is the amount of data needed, or my implementation of the algorithm itself.

See https://zjeffer.github.io/connect4-deep-rl/ for Doxygen documentation.

TODO

  • Connect 4 environment
  • MCTS algorithm
  • Neural network
  • AlphaZero self-play
  • Argument parsing
  • Load settings from file
  • Unit tests:
    • Horizontal win
    • Vertical win
    • Diagonal win
    • Easy puzzle
    • Harder puzzle
    • ...?
  • Save played moves to memory, and memory to file
  • AlphaZero training
  • AlphaZero evaluation
  • Automatic pipeline for selfplay, training and evaluation
  • Play against computer
  • GUI?

Hits

About

Deep reinforcement learning algorithm to solve Connect 4, based on AlphaZero

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published