Bigdata-Analytics-Apache-Spark

Bigdata Analytics using Yelp API

Dataset Description:

The dataset comprises three CSV files: user.csv, business.csv and review.csv. Some of the content, such as the id fields, is encoded. The fields within each file are separated by the "^" character rather than by commas.
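Because the fields are separated by "^", a reader has to be told the delimiter explicitly. The following is a minimal sketch in plain Python (not the repository's Spark code). It assumes the files have no header row, which the description above does not state; the sample rows and the helper name `read_caret_separated` are hypothetical.

```python
import csv
import io

# Hypothetical sample rows standing in for review.csv; the real file is larger.
SAMPLE_REVIEWS = "r1^u1^b1^4\nr2^u2^b1^5\n"


def read_caret_separated(stream, columns):
    """Yield one dict per row of a '^'-separated stream, keyed by column name.

    Assumes there is no header row (an assumption, not confirmed by the README).
    """
    reader = csv.reader(stream, delimiter="^")
    for row in reader:
        if row:  # skip blank lines
            yield dict(zip(columns, row))


reviews = list(
    read_caret_separated(
        io.StringIO(SAMPLE_REVIEWS),
        ["review_id", "user_id", "business_id", "stars"],
    )
)
```

To read an actual file, the same helper can be given an open file object instead of the in-memory sample.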

business.csv contains basic information about local businesses. It has three columns: business_id (a unique identifier for the business), full_address (the localized address), and categories (a list of localized category names).
review.csv contains the star rating given by a user to a business. Use user_id to associate a review with others by the same user, and business_id to associate it with others for the same business. It has four columns: review_id (a unique identifier for the review), user_id (the identifier of the authoring user), business_id (the identifier of the reviewed business), and stars (the rating the user gave the business, an integer from 1 to 5).
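Grouping reviews by business_id, as described above, can be sketched as an average star rating per business. This is an illustrative plain-Python version, not the repository's Scala program; the sample rows and the function name `average_stars_by_business` are hypothetical, and the column order is the one listed for review.csv.

```python
from collections import defaultdict

# Hypothetical rows in column order: review_id, user_id, business_id, stars.
review_lines = ["r1^u1^b1^4", "r2^u2^b1^5", "r3^u1^b2^2"]


def average_stars_by_business(lines):
    """Return a dict mapping each business_id to its mean star rating."""
    totals = defaultdict(lambda: [0, 0])  # business_id -> [sum of stars, count]
    for line in lines:
        _review_id, _user_id, business_id, stars = line.split("^")
        totals[business_id][0] += int(stars)
        totals[business_id][1] += 1
    return {business: total / count for business, (total, count) in totals.items()}


averages = average_stars_by_business(review_lines)
```

Keeping a running sum and count per key means each review is visited only once, which is the same shape a per-key reduction would take in a distributed setting.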
user.csv contains aggregate information about a single user across all of Yelp. It has three columns: user_id (a unique user identifier), name (first name and last initial, like 'Matt J.'; this column has been anonymized to preserve privacy), and url (the URL of the user's profile on Yelp).
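The shared user_id column is what links user.csv to review.csv. As a rough sketch under the column orders listed above, the following plain-Python lookup finds the names of the users who reviewed a given business. All sample rows, the placeholder URLs, and the function name `reviewer_names` are hypothetical.

```python
# Hypothetical rows in column order: user_id, name, url.
user_lines = [
    "u1^Matt J.^http://example.invalid/u1",
    "u2^Ana P.^http://example.invalid/u2",
]
# Hypothetical rows in column order: review_id, user_id, business_id, stars.
review_lines = ["r1^u1^b1^4", "r2^u2^b1^5", "r3^u1^b2^2"]


def reviewer_names(user_lines, review_lines, business_id):
    """Return the sorted names of users who reviewed the given business."""
    # Build a user_id -> name lookup from the user rows.
    names = {}
    for line in user_lines:
        user_id, name, _url = line.split("^")
        names[user_id] = name

    # Collect the names of reviewers whose review targets business_id.
    found = set()
    for line in review_lines:
        _review_id, user_id, reviewed_business, _stars = line.split("^")
        if reviewed_business == business_id and user_id in names:
            found.add(names[user_id])
    return sorted(found)


names_for_b1 = reviewer_names(user_lines, review_lines, "b1")
```

Reviews whose user_id has no matching row in the user data are skipped here; whether that is the desired behavior depends on the question being asked.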

alwinjohns/Bigdata-Apache-Spark
