GitHub - sharyalZ/D-CMAB-TS: It is a repository containing the implementation of combinatorial multi-armed bandit algorithm with Thompson Sampling to solve combinatorial optimization problems involving single agent as well as multiple agents in a decentralized environment.

Decentralized-Combinatorial Multi-Armed Bandit-Thompson Sampling (D-CMAB-TS)

It is a repository containing the implementation of combinatorial multi-armed bandit algorithm with Thompson Sampling to solve combinatorial optimization problems involving single agent as well as multiple agents in a decentralized environment. Full algorithm can be found here.

Architecture

The repository includes following files:

src: Contains the implementation of the learning agent class.
Jupiter Notebook Examples: Contains Jupyter Notebook examples utilizing the implemented learning agent to solve single agent and multi-agent combinatorial optimization problems in a decentralized manner.

Contact

For any further information, you can contact me at sharyal.zafar@ens-rennes.fr.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
Jupiter Notebook Example		Jupiter Notebook Example
src		src
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Decentralized-Combinatorial Multi-Armed Bandit-Thompson Sampling (D-CMAB-TS)

Architecture

Contact

About

Releases

Packages

Languages

License

sharyalZ/D-CMAB-TS

Folders and files

Latest commit

History

Repository files navigation

Decentralized-Combinatorial Multi-Armed Bandit-Thompson Sampling (D-CMAB-TS)

Architecture

Contact

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages