Diverse PSRO

Diverse PSRO is a variation of the Policy Space Response Oracle algorithm which promotes training a behaviourally diverse set of policies by using the theory of determinantal point processes (DPPs). This approach allows to train less exploitable more diverse strategies as well as bringing a new geometrically interpretable way of measuring population diversity.

How to run Diverse PSRO

The code on this repository can be run by cloning the repository

git clone https://github.com/diversepsro/diverse_psro

Creating a new Anaconda environment

conda env create -f environment.yml
conda activate diverse_psro

You can now run Random Games of Skill by executing

python3 random_games_skill.py

You can now run Real World Meta-Games by executing

python3 spinning_tops_dpp.py

You can now run Non-transitive Mixture Model by executing

python3 non_mixture_model.py

Performance of Diverse PSRO

Diverse PSRO is evaluated in three different settings, each of them using a different version of diverse oracle.

Game	Oracle
Random Games of Skill	Diverse BR
Real World Meta-Games	Diverse BR
Non-transitive mixture model	Diverse gradient ascent

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
payoffs_data		payoffs_data
results		results
README.md		README.md
environment.yml		environment.yml
non_mixture_model.py		non_mixture_model.py
random_games_skill.py		random_games_skill.py
spinning_tops_dpp.py		spinning_tops_dpp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Diverse PSRO

Diverse PSRO

How to run Diverse PSRO

Performance of Diverse PSRO

Random Games of Skill

Real World Meta-Games

Non-transitive mixture model

About

Releases

Packages

Contributors 2

Languages

diversepsro/diverse_psro

Folders and files

Latest commit

History

Repository files navigation

Diverse PSRO

Diverse PSRO

How to run Diverse PSRO

Performance of Diverse PSRO

Random Games of Skill

Real World Meta-Games

Non-transitive mixture model

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages