APO

Author's implementation of "Average-Reward Reinforcement Learning with Trust Region Methods".

Installation

This code is based my forked version of rlpyt. To reproduce the results in paper, please run python run_exp.py. Note that this python file contains all the hyperparameters I have tried on 8 GPUs. Please set your hyperparameters manually before your experiments.

Bibtex

@inproceedings{ma2021average-reward,
    title={Average-Reward Reinforcement Learning with Trust Region Methods},
    author={Ma, Xiaoteng and Tang, Xiaohang and Xia, Li and Yang, Jun and Zhao, Qianchuan},
    journal={International Joint Conferences on Artificial Intelligence},
    pages={2797--2803},
    year={2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 12 Commits
apo		apo
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
exp.py		exp.py
main.py		main.py
run_exp.py		run_exp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

APO

Installation

Bibtex

About

Releases

Packages

Languages

License

xtma/apo

Folders and files

Latest commit

History

Repository files navigation

APO

Installation

Bibtex

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages