Yuhao-Wan

Diane Wan Yuhao-Wan

Achievements

time-varying-discount time-varying-discount Public

A practical method to reduce discounting-induced bias during training in deeep Q-networks.

Python
deep-reinforcement-learning deep-reinforcement-learning Public

Implementations of deep reinforcement learning algorithms in Tensorflow

Python 2 1
Pairwise-combinatorial-learner Pairwise-combinatorial-learner Public

Python implementation of algorithm in Learning Combinatorial Functions from Pairwise Comparisons

Python 1 1
Gaussian-processes Gaussian-processes Public

Python implementation of "A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning"

Jupyter Notebook 1