SnakeRL

Getting a computer to play snake using reinforcement learning, written from scratch in Python.

Usage

The program has 3 modes: (1) Standard learning mode (2) Watch a pre-trained neural network play (saved in pretrained.npy - you can press s to save the current best neural network and replace this file) (3) Human mode - try playing the game for yourself

Design

Neural Network

The same neural network is evaluated 3 times for each possible direction (left, right or straight ahead) the snake can go. The direction with the highest confidence is selected. This design keeps the network very small, allowing it to train much faster.

Inputs (4 neurons):

Distance differential with food if snake proceeds with this direction (normalised between -1 and 1)
Object ahead of snake, to its left and to its right relative to the direction being evaluated (-1 for tail or wall, 1 for food, 0 for nothing)

Single hidden layer of 3 neurons
Single output neuron representing confidence for this direction

Fitness evaluation

Fitness is given by: fitness=lifetime * (score+1)^2

Where lifetime is the number of moves (including moving straight ahead) the snake has made in total (note that snakes are limited to 150 moves between food acquisitions to prevent the strategy of indefinitely moving in a circle).

Genetic Algorithm

After every snake in a generation has died (the number of neural networks in a generation can be changed with the population_size parameter in snakerl.py) the next generation is generated.

The top 50% best performing neural networks automatically move onto the next generation.
Remaining neural networks are a crossover generated from a pair of neural networks selected using roulette selection.
- Some of their weights can also be "mutated" (the rate at which this occurs can be changed with the mutation_rate parameter in snakerl.py). When a weight is mutated, it's value is offset by a random number generated from a normal pdf

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
.gitignore		.gitignore
README.md		README.md
display.py		display.py
geneticalgorithm.py		geneticalgorithm.py
inputs.py		inputs.py
neuralnetwork.py		neuralnetwork.py
pretrained.npy		pretrained.npy
snake.py		snake.py
snakerl.py		snakerl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SnakeRL

Usage

Design

Neural Network

Fitness evaluation

Genetic Algorithm

About

Releases

Packages

Languages

brrm/snakerl

Folders and files

Latest commit

History

Repository files navigation

SnakeRL

Usage

Design

Neural Network

Fitness evaluation

Genetic Algorithm

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages