Reinforcement Learning Project

Deep Reinforcement Learning from Human Preferences

Students: Yanis Lalou, Agathe Minaro, Luka Trailovic

Implementation of the paper: Deep Reinforcement Learning from Human Preferences by Paul F Christiano, Jan Leike, Tom B Brown, Miljan Martic, Shane Legg, Dario Amodei (2017).

To use it, run the notebook_demo.ipynb.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
.gitignore		.gitignore
README.md		README.md
a2c_agent.py		a2c_agent.py
actor_critic_cnn.py		actor_critic_cnn.py
d_space.py		d_space.py
get_human_choice.py		get_human_choice.py
notebook_demo.ipynb		notebook_demo.ipynb
requirements.txt		requirements.txt
reward_function.py		reward_function.py
rl_from_human_preferences.py		rl_from_human_preferences.py
train_human_rewarder.py		train_human_rewarder.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Reinforcement Learning Project

Deep Reinforcement Learning from Human Preferences

About

Releases

Packages

Contributors 2

Languages

agatheminaro/rl-project-human-preferences

Folders and files

Latest commit

History

Repository files navigation

Reinforcement Learning Project

Deep Reinforcement Learning from Human Preferences

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages