Online Deep Learning: Learning Deep Neural Networks on the Fly / Non-linear Contextual Bandit Algorithm (ONN_THS)
Updated Dec 11, 2019 - Python
👤 Multi-Armed Bandit Algorithms Library (MAB) 👮
This repository contains the source code for “Thompson sampling efficient multiobjective optimization” (TSEMO).
🌾 OAT: Online AlignmenT for LLMs
Library for multi-armed bandit selection strategies, including efficient deterministic implementations of Thompson sampling and epsilon-greedy.
Thompson Sampling Tutorial
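As a generic illustration of the technique these tutorials cover (not taken from any listed repository), Bernoulli Thompson sampling keeps a Beta posterior per arm, samples from each posterior, and pulls the arm with the largest sample. A minimal sketch, with hypothetical reward probabilities:

```python
import random

def thompson_sampling(successes, failures, rng=random):
    """Sample from each arm's Beta(1 + successes, 1 + failures)
    posterior and return the index of the largest sample."""
    samples = [
        rng.betavariate(1 + s, 1 + f)
        for s, f in zip(successes, failures)
    ]
    return max(range(len(samples)), key=samples.__getitem__)

# Simulated Bernoulli bandit; reward probabilities are illustrative only.
probs = [0.2, 0.5, 0.7]
successes = [0] * len(probs)
failures = [0] * len(probs)
random.seed(0)
for _ in range(2000):
    arm = thompson_sampling(successes, failures)
    if random.random() < probs[arm]:
        successes[arm] += 1
    else:
        failures[arm] += 1
```

After a couple of thousand rounds the posterior over the best arm concentrates, so most pulls go to the arm with the highest true probability.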
All code from the SuperDataScience course, both as originally created and as optimized for best results.
This repository contains machine learning methods ranging from simple to complex, built as template-style code.
Bandit algorithms
pyrff: Python implementation of random Fourier feature approximations for Gaussian processes
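The core idea behind random Fourier features can be sketched independently of pyrff's actual API (which is not shown here): draw random frequencies so that the inner product of the resulting cosine features approximates the RBF kernel. A minimal NumPy sketch under those assumptions:

```python
import numpy as np

def rff_features(X, n_features, lengthscale, rng):
    """Random Fourier features approximating the RBF kernel
    k(x, y) = exp(-||x - y||^2 / (2 * lengthscale^2))."""
    d = X.shape[1]
    # Frequencies drawn from the kernel's spectral density, phases uniform.
    W = rng.normal(0.0, 1.0 / lengthscale, size=(d, n_features))
    b = rng.uniform(0.0, 2 * np.pi, size=n_features)
    return np.sqrt(2.0 / n_features) * np.cos(X @ W + b)

rng = np.random.default_rng(0)
X = rng.normal(size=(5, 3))
Z = rff_features(X, n_features=2000, lengthscale=1.0, rng=rng)

# Feature inner products approximate the exact RBF Gram matrix.
K_approx = Z @ Z.T
sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(-1)
K_exact = np.exp(-sq_dists / 2.0)
```

The approximation error shrinks as O(1/sqrt(n_features)), which is what makes the trick useful for scaling Gaussian-process-style models.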
Offline evaluation of multi-armed bandit algorithms
Bayesian Optimization for Categorical and Continuous Inputs
Study of the paper 'Neural Thompson Sampling' published in October 2020
A Julia package for running multi-armed bandit experiments
Implementations of basic concepts under the reinforcement learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay
A curated list of papers on combinatorial multi-armed bandit problems.
Thompson Sampling based Monte Carlo Tree Search for MDPs and POMDPs
Author's implementation of the paper "Correlated Age-of-Information Bandits."
Selecting the best ads using reinforcement learning algorithms such as Thompson Sampling and Upper Confidence Bound.
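For comparison with the Thompson sampling approach, the Upper Confidence Bound (UCB1) strategy mentioned above picks the ad maximizing empirical mean reward plus an exploration bonus. A generic sketch with hypothetical click-through rates (not code from the listed repository):

```python
import math
import random

def ucb1(counts, rewards, t):
    """UCB1: pull each arm once, then choose the arm maximizing
    mean reward + sqrt(2 * ln(t) / pulls)."""
    for i, n in enumerate(counts):
        if n == 0:
            return i  # initialize: try every arm at least once
    scores = [
        rewards[i] / counts[i] + math.sqrt(2 * math.log(t) / counts[i])
        for i in range(len(counts))
    ]
    return max(range(len(scores)), key=scores.__getitem__)

# Hypothetical click-through rates for three ads (illustrative only).
ctr = [0.05, 0.12, 0.20]
counts = [0] * len(ctr)
rewards = [0.0] * len(ctr)
random.seed(1)
for t in range(1, 5001):
    ad = ucb1(counts, rewards, t)
    counts[ad] += 1
    rewards[ad] += 1.0 if random.random() < ctr[ad] else 0.0
```

Unlike Thompson sampling, UCB1 is deterministic given the observed rewards; both concentrate their pulls on the highest-CTR ad as evidence accumulates.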