Actions: vladfi1/slippi-ai

Showing runs from all workflows
132 workflow runs

[mixture] Measure entropy, teacher KL and reward stats.
slippi-ai test cases #82: Commit d613989 pushed by vladfi1
October 27, 2024 08:11 6m 24s mixture-rl
[mixture] Measure entropy, teacher KL and reward stats.
slippi-ai test cases #81: Commit d3edc40 pushed by vladfi1
October 26, 2024 23:48 2m 22s mixture-rl
Disable jit-compilation in RL as it leads to learning instability.
slippi-ai test cases #80: Commit 92e254b pushed by vladfi1
October 26, 2024 23:48 5m 8s rl-dev
Logging tweaks.
slippi-ai test cases #79: Commit 9a8c10e pushed by vladfi1
October 26, 2024 07:34 5m 53s rl-dev
[train_two] Toggle agent jit.
slippi-ai test cases #78: Commit cd2e250 pushed by vladfi1
October 25, 2024 21:53 5m 16s rl-dev
train two jit
slippi-ai test cases #77: Commit 64186f5 pushed by vladfi1
October 25, 2024 21:52 5m 10s rl-dev
Test RL with delay.
slippi-ai test cases #76: Commit 6a9ecd9 pushed by vladfi1
October 25, 2024 13:50 6m 12s rl-dev
ReplayBatchedEnvironment alternative to purely fake env.
slippi-ai test cases #75: Commit 649d2f7 pushed by vladfi1
October 25, 2024 13:49 4m 38s rl-dev
Do value function burnin on each new exploiter training phase.
slippi-ai test cases #74: Commit 1254468 pushed by vladfi1
October 24, 2024 15:16 7m 31s mixture-rl
Fix networks_test
slippi-ai test cases #73: Commit 66c2eb5 pushed by vladfi1
October 23, 2024 13:17 4m 50s dev
[train_two] Log learner metrics for player 2.
slippi-ai test cases #72: Commit d866bac pushed by vladfi1
October 23, 2024 12:51 1m 43s rl-dev
Attempt to handle run_env dying more gracefully.
slippi-ai test cases #71: Commit 276832e pushed by vladfi1
October 23, 2024 12:36 1m 42s rl-dev
Cap mean_actor_kl to avoid policy collapse during RL.
slippi-ai test cases #70: Commit 0095fb6 pushed by vladfi1
October 23, 2024 12:28 1m 44s rl-dev
Correct fps/mps estimate for imitation/q-learning.
slippi-ai test cases #69: Commit 2f43684 pushed by vladfi1
October 23, 2024 12:09 1m 44s dev
Correct fps/mps estimate for imitation/q-learning.
slippi-ai test cases #68: Commit 2a51c7b pushed by vladfi1
October 22, 2024 12:16 1m 54s nash
Attempt to fix env errors on KeyboardInterrupt.
slippi-ai test cases #67: Commit 4fc1510 pushed by vladfi1
October 20, 2024 13:11 4m 46s dev
[nash] Attempt to optimize primal-dual solver with cholesky decomposi…
slippi-ai test cases #66: Commit 4dc76e2 pushed by vladfi1
October 20, 2024 12:17 4m 52s nash
Configure jit-compilation for q-learning.
slippi-ai test cases #65: Commit 8711fb3 pushed by vladfi1
October 20, 2024 12:16 5m 13s q-learning
Log rl-eval stats to separate path.
slippi-ai test cases #64: Commit c3fb7e3 pushed by vladfi1
October 20, 2024 12:10 5m 5s q-learning
Periodically reset rl_evaluator's env.
slippi-ai test cases #63: Commit 24d906c pushed by vladfi1
October 20, 2024 09:45 4m 37s q-learning
Periodically reset rl_evaluator's env.
slippi-ai test cases #62: Commit 5cc7b2e pushed by vladfi1
October 20, 2024 09:45 4m 39s q-learning
Fix missing name code logging in q-learning.
slippi-ai test cases #61: Commit e2ec94b pushed by vladfi1
October 20, 2024 07:56 10m 51s q-learning
Fix missing name code logging.
slippi-ai test cases #60: Commit 1fa233c pushed by vladfi1
October 19, 2024 16:09 5m 16s dev
Log imitation loss by character.
slippi-ai test cases #59: Commit c0d4323 pushed by vladfi1
October 19, 2024 15:57 4m 53s multichar
Enable delay in q-learning.
slippi-ai test cases #58: Commit 0f9d2df pushed by vladfi1
October 19, 2024 14:39 4m 50s q-learning