This project is the successor to Phillip. While the original Phillip used pure deep RL, this one starts with behavioral cloning on Slippi replays, which makes it play much more like a human.
I occasionally make the bot available to play via netplay on my Twitch channel.
I am hesitant to release any trained agents as I don't want people using them on ranked/unranked, so at the moment the bot isn't available to play against locally.
My YouTube channel has some recordings, mainly of the original Phillip. There is also a video of Aklo playing the new bot.
The main entry points are scripts/train.py for imitation learning, scripts/eval_two.py for playing a trained agent, and slippi_ai/rl/run.py for self-play RL. Dataset creation is handled by slippi_db/parse_local.py.