AnuraagRath / DeepQLearning-A.I-learns-to-balance-a-pole Public

Notifications You must be signed in to change notification settings
Fork 0
Star 0

Using Deep Q Learning, The Reinforcement 'Q' learning model is used along with a Neural Network to provide optimal 'q' function values i.e the optimal 'Actions' for the 'Agent' to undergo at a given time to balance a pole. The Deep-Q-Network is created using Pytorch. This is a base model which is to be improved upon.

0 stars 0 forks Branches Tags Activity

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 7 Commits
pic		pic
DeepQNetwork_A.I_initial.ipynb		DeepQNetwork_A.I_initial.ipynb
README.md		README.md

Repository files navigation

DeepQLearning A.I learns to balance a pole:

Using Deep Q Learning, The Reinforcement 'Q' learning model is used along with a Neural Network to provide optimal 'q' function values i.e the optimal 'Actions' for the 'Agent' to undergo at a given time to balance a pole. The Deep-Q-Network is created using Pytorch. This is a base model which is to be improved upon. The DQN model is implemented using DeepMind's paper.

Balancing Pole:

The Neural Network:

We feed a lesser Resolution of a number of successive snapshots of the states into the Neural Network

Discounted Rate of Return:

The Algorithm:

DeepMind's paper:
Algorithm:
Code Implementation:

ε-greedy strategy and Exponential decay:

Calculating Loss:

About

Using Deep Q Learning, The Reinforcement 'Q' learning model is used along with a Neural Network to provide optimal 'q' function values i.e the optimal 'Actions' for the 'Agent' to undergo at a given time to balance a pole. The Deep-Q-Network is created using Pytorch. This is a base model which is to be improved upon.

Report repository

Releases

No releases published

Packages

No packages published

Languages

Jupyter Notebook 100.0%