Skip to content

Training agents in OpenAI-Gym with Policy-Gradient methods.

Notifications You must be signed in to change notification settings

till2/policy-gradient-methods

Repository files navigation

Training agents in OpenAI-Gym
with Policy-Gradient methods

cartpole_render A trained agent balancing an inverted pole on a moving cart.
lunar_lander_render
A trained agent controlling boosters to land a spaceship.

Training plots

plots Training Policy Gradient on the CartPoleV1 environment.

lunar_lander_plots Training Policy Gradient on the LunarLander-v2 environment.

actor_critic_lunarlander_v2 Actor Critic plots for the LunarLander-v2 environment.

Architectures (Click to expand)

Acknowledgements

  • Sutton, R. S., Barto, A. G. (2018). Reinforcement Learning: An Introduction. The MIT Press.
  • Graesser, L., Keng, W. L. (2019). Foundations of Deep Reinforcement Learning: Theory and Practice in Python. Addison-Wesley Professional.
  • Chris Yoon, Dec 30, 2018, Deriving Policy Gradients and Implementing REINFORCE
  • Silver, D. (2015, December 21). RL Course by David Silver - Lecture 7: Policy Gradient Methods.

About

Training agents in OpenAI-Gym with Policy-Gradient methods.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published