Pinned Loading
-
time-varying-discount
time-varying-discount PublicA practical method to reduce discounting-induced bias during training in deeep Q-networks.
Python
-
deep-reinforcement-learning
deep-reinforcement-learning PublicImplementations of deep reinforcement learning algorithms in Tensorflow
-
Pairwise-combinatorial-learner
Pairwise-combinatorial-learner PublicPython implementation of algorithm in Learning Combinatorial Functions from Pairwise Comparisons
-
Gaussian-processes
Gaussian-processes PublicPython implementation of "A Tutorial on Bayesian Optimization of Expensive Cost Functions, with Application to Active User Modeling and Hierarchical Reinforcement Learning"
Jupyter Notebook 1
Something went wrong, please refresh the page to try again.
If the problem persists, check the GitHub status page or contact support.
If the problem persists, check the GitHub status page or contact support.