Students: Yanis Lalou, Agathe Minaro, Luka Trailovic
Implementation of the paper "Deep Reinforcement Learning from Human Preferences" by Paul F. Christiano, Jan Leike, Tom B. Brown, Miljan Martic, Shane Legg, and Dario Amodei (2017).
To use it, open and run notebook_demo.ipynb.
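
For orientation, the paper's central idea is to fit a reward function to human comparisons between pairs of trajectory segments. The sketch below illustrates that preference model and its cross-entropy loss in plain Python. It is not taken from this repository: the function names `preference_probability` and `preference_loss` and the list-of-floats representation of per-step predicted rewards are assumptions made for illustration, and the paper's additional adjustment for random human responses is left out.

```python
import math


def preference_probability(rewards_a, rewards_b):
    """Probability that segment A is preferred over segment B.

    Each argument is a list of predicted per-step rewards for one segment
    (hypothetical representation). The value returned is
    exp(sum_a) / (exp(sum_a) + exp(sum_b)), computed as a sigmoid of the
    difference of the summed rewards to avoid overflow.
    """
    diff = sum(rewards_a) - sum(rewards_b)
    if diff >= 0:
        return 1.0 / (1.0 + math.exp(-diff))
    e = math.exp(diff)
    return e / (1.0 + e)


def preference_loss(rewards_a, rewards_b, mu_a):
    """Cross-entropy between the predicted preference and a human label.

    mu_a is the label mass placed on segment A: 1.0 if A was preferred,
    0.0 if B was preferred, 0.5 if the two were judged equally good.
    """
    p_a = preference_probability(rewards_a, rewards_b)
    eps = 1e-12  # guards against log(0) when the prediction saturates
    return -(mu_a * math.log(p_a + eps) + (1.0 - mu_a) * math.log(1.0 - p_a + eps))
```

In a full training loop this loss would be averaged over a batch of labelled comparisons and minimized with respect to the parameters of the reward model, which here is abstracted away as the lists of predicted rewards.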