GitHub - StanfordAI4HI/poela: POELA: Policy Optimization with ELigible Actions

Instructions for running the code

To run POELA:

python src/run_pg.py --action_mask_type=nn_action_dist --threshold=0.6 --var_coeff=0.1

To run PO-mu/PO-CRM:

python src/run_pg.py --action_mask_type=step --threshold=0.01 --var_coeff=10.0

To run BCQ:

python src/run_ql.py --state_clipping=0 --threshold=0.01

To run PQL:

python src/run_ql.py --state_clipping=1 --threshold=0.01

Acknowledgements

The code is an adaptation of the BCQ official implementation.

This is code for the paper Offline Policy Optimization with Eligible Actions https://arxiv.org/abs/2207.00632 Yao Liu, Yannis Flet-Berliac, and Emma Brunskill Conference on Uncertainty in AI (UAI) 2022

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
src		src
tabular_example		tabular_example
.gitignore		.gitignore
README.md		README.md
cancer_ope.ipynb		cancer_ope.ipynb
mimic_ope.ipynb		mimic_ope.ipynb
run.sh		run.sh
run_batch.sh		run_batch.sh
run_pg_cancer.sh		run_pg_cancer.sh
run_pg_cartpole.sh		run_pg_cartpole.sh
run_ql_cancer.sh		run_ql_cancer.sh
run_ql_cartpole.sh		run_ql_cartpole.sh
run_ql_mimic.sh		run_ql_mimic.sh
run_ql_mimic_batch.sh		run_ql_mimic_batch.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Instructions for running the code

Acknowledgements

About

Releases

Packages

Contributors 2

Languages

StanfordAI4HI/poela

Folders and files

Latest commit

History

Repository files navigation

Instructions for running the code

Acknowledgements

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages