Solve the Gridworld Problem with Reinforcement Learning

Based on the book --- Reinforcement Learning: An Introduction (2nd ed, 2018) by Sutton and Barto

For the Reinforcement Learning course Assignment 2 (see 'Gridworld Problem 1.pdf') at Memorial University of Newfoundland, Jul. 18, 2024

For Gridworld Problem 2, see https://github.com/zyxsjdy/Solve-the-Gridworld-Problem-with-Reinforcement-Learning-Cont

Environment

python 3.11.9

Packages: numpy, cvxpy, copy

How to run

Use Jupyter Notebook to open the '.ipynb' file, and click Run All to run the file.

'P1.ipynb' for part 1, 'P2.ipynb' for part 2, and 'environment.py' for the gridworld environment.

See explanation for results in 'Report.pdf'.

Included topics

Reinforcement Learning

Gridworld problem

Part 1:

Bellman equation

Bellman optimality equation

Iterative policy evaluation

Value iteration

Policy iteration

Policy improvement

Part 2:

Monte Carlo method

On-policy Monte Carlo ES (exploring starts)

On-policy first-visit Monte Carlo control for ϵ-soft policy

Off-policy Monte Carlo control

Off-policy Monte Carlo prediction (policy evaluation)

References

https://marcinbogdanski.github.io/rl-sketchpad

https://github.com/BY571/Medium_Code_Examples

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Solve the Gridworld Problem with Reinforcement Learning

Environment

How to run

Included topics

References

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
Gridworld Problem 1.pdf		Gridworld Problem 1.pdf
P1.ipynb		P1.ipynb
P2.ipynb		P2.ipynb
README.md		README.md
Report.pdf		Report.pdf
environment.py		environment.py

zyxsjdy/Solve-the-Gridworld-Problem-with-Reinforcement-Learning

Folders and files

Latest commit

History

Repository files navigation

Solve the Gridworld Problem with Reinforcement Learning

Environment

How to run

Included topics

References

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages