Non-conventional Value Function Approximation methods in Reinforcement Learning

This project evaluates and compares different value function approximation methods in Reinforcement Learning using a range of parametric and non-parametric function approximation models. The parametric models (Neural Network and Linear Model) were implemented under the Deep Q-Network architecture [1] and trained using the PyTorch framework [2]. The non-parametric models (Decision Tree, Random Forest, Support Vector Regression, k-Nearest Neighbours, Gaussian Process) were implemented under the Fitted-Q Iteration architecture [3] using models from the Scikit-learn library [4]. Finally, the Online Gaussian Process model was implemented from scratch following the work of [5].
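
As an illustration of the non-parametric setup, the sketch below shows a minimal Fitted-Q Iteration loop in the spirit of [3], with a scikit-learn regressor standing in for the Q-function. This is not the repository's own implementation: the function name, feature encoding and hyperparameters are assumptions chosen only to make the idea concrete.

import numpy as np
from sklearn.ensemble import RandomForestRegressor

def fitted_q_iteration(transitions, n_actions, n_iterations=50, gamma=0.99):
    """Illustrative FQI sketch; `transitions` is a list of
    (state, action, reward, next_state, done) tuples collected offline."""
    states = np.array([t[0] for t in transitions])
    actions = np.array([t[1] for t in transitions])
    rewards = np.array([t[2] for t in transitions])
    next_states = np.array([t[3] for t in transitions])
    dones = np.array([t[4] for t in transitions], dtype=float)

    # Q(s, a) is modelled as a single regressor over (state, one-hot action) inputs.
    def features(s, a):
        return np.concatenate([s, np.eye(n_actions)[a]], axis=-1)

    X = features(states, actions)
    model = RandomForestRegressor(n_estimators=50)
    targets = rewards.copy()  # first iteration: Q_1(s, a) is fitted to the immediate reward

    for _ in range(n_iterations):
        model.fit(X, targets)
        # Bootstrapped targets for the next iteration: r + gamma * max_a' Q(s', a')
        q_next = np.stack(
            [model.predict(features(next_states, np.full(len(next_states), a, dtype=int)))
             for a in range(n_actions)],
            axis=1,
        )
        targets = rewards + gamma * (1.0 - dones) * q_next.max(axis=1)

    return model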

Function approximation models evaluated

  1. Neural Network
  2. Linear Model
  3. Decision Tree
  4. Random Forest
  5. Support Vector Regression
  6. K-Nearest Neighbours Regression
  7. Gaussian Processes
  8. Online Gaussian Processes

Environments

  1. SimpleGridworld
  2. WindyGridworld
  3. CartPole
  4. LunarLander

Evaluation Criteria

  1. Performance
  2. Reliability
  3. Sample efficiency
  4. Training time
  5. Interpretability

Getting started

Create and activate a virtual environment:

python3 -m venv [name_of_venv]
source [name_of_venv]/bin/activate

Clone repository:

git clone https://github.com/atsiakkas/non_conventional_value_function_approximation.git

Install the package and its requirements:

cd non_conventional_value_function_approximation
pip install -e .

Project

https://github.com/uoe-agents/non_conventional_value_function_approximation

Contents

agents: Defines the classes of the RL agents:

  • DQNAgent
  • LinearAgent
  • FQIAgent
  • OnlineGaussianProcessAgent

custom_envs: Defines the classes of the custom environments:

  • SimpleGridworld
  • Windygridworld

function_approximators: Defines the classes of the function approximation models:

  • ParametricModel
  • NeuralNetwork
  • LinearModel
  • NonParametricModel
  • DecisionTree
  • RandomForest
  • ExtraTrees
  • GradientBoostingTrees
  • SupportVectorRegressor
  • KNeighboursRegressor
  • GaussianProcess
  • eGaussianProcess
  • OnlineGaussianProcess

plots: Jupyter notebooks for producing the plots used in the report, together with the saved plots.

results: Saved outputs of the runs (CSV files).

train: Jupyter notebooks and Python scripts for training and evaluation.

utils: Defines the training and plotting utility functions.
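
For orientation, the hypothetical snippet below sketches how the pieces listed above might be wired together. The import paths and constructor arguments are assumptions based only on the directory and class names in this section; the actual API is defined by the notebooks and scripts under train.

import gym

from agents import DQNAgent          # assumed import path, based on the listing above
# from agents import FQIAgent        # the non-parametric agents would be used analogously

# Hypothetical usage only: the real constructor signature may differ.
env = gym.make("CartPole-v1")
agent = DQNAgent(env=env, gamma=0.99)

# Training and evaluation are driven by the scripts in train, which save
# their outputs to results for later plotting via the notebooks in plots.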

References

[1] Mnih, V., Kavukcuoglu, K., Silver, D., Rusu, A.A., Veness, J., Bellemare, M.G., Graves, A., Riedmiller, M., Fidjeland, A.K., Ostrovski, G. and Petersen, S., 2015. Human-level control through deep reinforcement learning. Nature, 518(7540), pp. 529-533.

[2] Paszke, A., Gross, S., Massa, F., Lerer, A., Bradbury, J., Chanan, G., Killeen, T., Lin, Z., Gimelshein, N., Antiga, L. and Desmaison, A., 2019. PyTorch: An imperative style, high-performance deep learning library. Advances in Neural Information Processing Systems, 32.

[3] Ernst, D., Geurts, P. and Wehenkel, L., 2005. Tree-based batch mode reinforcement learning. Journal of Machine Learning Research, 6, pp. 503-556.

[4] Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V. and Vanderplas, J., 2011. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12, pp. 2825-2830.

[5] Csató, L. and Opper, M., 2002. Sparse on-line Gaussian processes. Neural Computation, 14(3), pp. 641-668.