This repo contains the implementation of the benchmark described in our paper Towards Few-shot Coordination: Revisiting Ad-hoc Teamplay Challenge In the Game of Hanabi.
The codebase is largely based on the off-belief-learning repo.
The included hanabi-learning-environment is a modified version of the original HLE from DeepMind.
Please refer to the setup instructions in the off-belief-learning repo.
To pre-train Hanabi agents, run:
cd pyhanabi
sh scripts/iql.sh
To fine-tune an agent with pre-trained cooperative partners, use the following script:
cd pyhanabi
sh scripts/adaptation.sh
Note that, before running the script, --load_model
and --coop_agents
should be specified; they point to the learner checkpoint and to the directory of the cooperative partners' checkpoints, respectively.
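For illustration, here is a hedged sketch of how the two flags might be supplied. The checkpoint paths below are placeholders (not files shipped with the repo), and if adaptation.sh does not forward command-line arguments, set the same flags inside the script instead:

cd pyhanabi
sh scripts/adaptation.sh \
    --load_model path/to/learner_checkpoint.pthw \
    --coop_agents path/to/partner_checkpoints/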
To download the trained models used in the paper, go to the models folder and run:
sh download_pool.sh
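Once downloaded, these checkpoints can be used as the pre-trained partners for the adaptation step above, for example by pointing --coop_agents at the models folder; the exact layout produced by download_pool.sh may vary, so inspect the folder after the download finishes.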