Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
-
Updated
Dec 7, 2024 - Go
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
Add a description, image, and links to the thompson topic page so that developers can more easily learn about it.
To associate your repository with the thompson topic, visit your repo's landing page and select "manage topics."