[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
board-game reinforcement-learning pytorch gym mcts gomoku tictactoe atari alpha-beta-pruning monte-carlo-tree-search continuous-control board-games alphazero self-play mcts-algorithm muzero stochastic-muzero efficientzero sampled-muzero gumbel-muzero
Updated Dec 20, 2024 - Python