Netflix movie prediction

Introduction

This project was created for my course "Assistance Systems" in cooperation with Florian Eder. The dataset was released by Netflix as part of a competition for the best algorithm to predict new movies for users. You can find the dataset here.

Our approach simplifies the idea by predicting only a single outcome movie, which you choose manually.

If you want to try it out right away, you can visit our interactive demo with the small data set or the high-accuracy data set.

Installation

Install the shiny package from the R console

install.packages("shiny")

Download and run this GitHub repository

shiny::runGitHub(repo = "THDMoritzEnderle/netflix_prediction", ref="main")
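The two steps above can also be combined into one small helper, so that shiny is only installed when it is missing. This is a minimal sketch, not part of the repository: the function name run_netflix_demo is hypothetical, and it assumes a CRAN mirror is already configured for install.packages.

```r
# Hypothetical convenience wrapper (not part of the repository):
# installs shiny only if it is not available yet, then launches the app.
run_netflix_demo <- function(repo = "THDMoritzEnderle/netflix_prediction",
                             ref = "main") {
  # requireNamespace() returns FALSE when the package cannot be loaded.
  if (!requireNamespace("shiny", quietly = TRUE)) {
    install.packages("shiny")
  }
  # Download the repository from GitHub and run it as a Shiny app.
  shiny::runGitHub(repo = repo, ref = ref)
}
```

Calling run_netflix_demo() from the R console then behaves like the two commands above.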

Usage

Choosing the right dataset

When using the local install, you will be prompted to select a data set. The following overview may help you choose the one that best fits your needs:

Data set name	Size	Information
large data set	232 MB	By far the largest data set, containing 900 movies and 95k customers. Use this if you have a high-end CPU or plenty of time.
small data set	6 MB	Smallest data set, containing 100 movies and 20k customers. Use this for testing, without expecting exact results.
normal data set	80 MB	Best for the average user. Contains 530 movies and 47k customers. A balance between accuracy and loading times.
few movies	61 MB	Contains only 90 movies but 231k customers. This results in very high accuracy at the cost of a small movie selection.

What can you do?

To start, select the movies you have watched and would like to rate in the input field on the top left. The movie posters will appear right next to it. Rate each movie according to how much you liked it.

Below this input field, you can select your goal movie. This is the movie for which you want to get a prediction.

When you're done selecting and rating all movies, press the submit button and the model will start training. This may take a while, depending on the data set, your hardware and the number of movies you've selected.

When the training is done, the prediction appears on the movie poster in the bottom left corner. The graph shows the influence of each movie on the prediction.
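To illustrate the general idea of predicting a single goal movie from the movies you rated, here is a minimal sketch. It is an assumption for illustration only and not the model used in app.R: it fits a plain linear regression on synthetic ratings, and the column names movie_a, movie_b, movie_c and goal are made up.

```r
# Illustrative sketch only: NOT the model used in app.R.
# All data below is synthetic and all column names are hypothetical.
set.seed(42)

n_customers <- 200

# Synthetic 1-5 star ratings for three movies the user has rated.
ratings <- data.frame(
  movie_a = sample(1:5, n_customers, replace = TRUE),
  movie_b = sample(1:5, n_customers, replace = TRUE),
  movie_c = sample(1:5, n_customers, replace = TRUE)
)

# Synthetic goal-movie ratings that depend mostly on movie_a.
ratings$goal <- pmin(5, pmax(1, round(
  0.8 * ratings$movie_a + 0.2 * ratings$movie_b + rnorm(n_customers, sd = 0.5)
)))

# Train: explain the goal rating by the ratings of the selected movies.
model <- lm(goal ~ movie_a + movie_b + movie_c, data = ratings)

# The user's own ratings for the selected movies.
user <- data.frame(movie_a = 5, movie_b = 2, movie_c = 3)

# Predicted rating for the goal movie, clamped to the 1-5 star range.
prediction <- min(5, max(1, unname(predict(model, newdata = user))))

# Coefficients without the intercept, as a rough "influence" per movie.
influence <- coef(model)[-1]
```

In this sketch, prediction corresponds to the value shown on the goal movie poster, and influence to the per-movie values a graph could display. Training only on the movies the user selected is one reason why the training time would grow with the number of selected movies.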

Further plans

For now, this project will not be continued. Possible further development could include predicting all movies and thus creating a ranking among them.
