Skip to content

Latest commit

 

History

History
34 lines (20 loc) · 1.86 KB

README.md

File metadata and controls

34 lines (20 loc) · 1.86 KB

Training agents in OpenAI-Gym
with Policy-Gradient methods

cartpole_render A trained agent balancing an inverted pole on a moving cart.
lunar_lander_render
A trained agent controlling boosters to land a spaceship.

Training plots

plots Training Policy Gradient on the CartPoleV1 environment.

lunar_lander_plots Training Policy Gradient on the LunarLander-v2 environment.

actor_critic_lunarlander_v2 Actor Critic plots for the LunarLander-v2 environment.

Architectures (Click to expand)

Acknowledgements

  • Sutton, R. S., Barto, A. G. (2018). Reinforcement Learning: An Introduction. The MIT Press.
  • Graesser, L., Keng, W. L. (2019). Foundations of Deep Reinforcement Learning: Theory and Practice in Python. Addison-Wesley Professional.
  • Chris Yoon, Dec 30, 2018, Deriving Policy Gradients and Implementing REINFORCE
  • Silver, D. (2015, December 21). RL Course by David Silver - Lecture 7: Policy Gradient Methods.