An analysis of Yelp data from Kaggle
This is an analysis of the data provided on Kaggle at: https://www.kaggle.com/yelp-dataset/yelp-dataset
Included in this repo is:
- A docker file for running the code.
- A presentation summarizing the findings.
- Data files. This repo does not contain ALL of the provided data, just that which is used.
- A python module models.py which builds and trains the models.
To run the code that builds the models:
- Run 'docker build -t yelp_data .'
- Run 'docker run -it yelp_data /bin/bash' to start and enter the container.
- The model can be run using 'python models.py'
The file yelp_presentation.pdf includes a summary of the findings from this analysis.