Skip to content

praveen-gopal-reddy/Data_Analysis_with_Pyspark

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Data Analysis with Pyspark.

  • The listenings_genre_pyspark.ipynb jupityer notebook contains data analysis carried out on genre and songs dataset using Pyspark. Operations such as cleansing, filters, aggregation and visualization is applied to explore the datasets.
  • The call_detail_record_pyspark.ipynb contains data analysis using spark sql and aggregate functions, visualization using matplotlib on a dataset which contains Hourly phone calls, SMS and Internet communication. The dataset can be downloaded from https://www.kaggle.com/marcodena/mobile-phone-activity

About

Data analysis using Pyspark sql and aggregate functions.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published