Skip to content

Dataset analysis as a final exercise to test the various concepts learned throughout the semester in the subject 'Statistics for DataSc', and its application using python

Notifications You must be signed in to change notification settings

SowmeshSharma0411/DatathonSol

Repository files navigation

DatathonSol

Dataset analysis as a final exercise to test the various concepts learned throughout the semester in the subject 'Statistics for DataSc', and its application using python

  1. Classification of varibales : categorical/nominal, etc.

2.Summary statistic for each attribute in the dataset.

  1. Removal of NaNs using KNN Imputation and removal of outliers.

4.Bar graphs for each categorical variable, and insights based on them.

5.Histograms for each numerical variable and insights based on them like skewness.

6.Boxplot analysis and insights based on them like IQR.

7.ScatterMatrix to explore the realtionship b/w each and every variable; Pearson Coefficients.

8.Simple Regression Model for the variables specified in the question; R^2 score.

About

Dataset analysis as a final exercise to test the various concepts learned throughout the semester in the subject 'Statistics for DataSc', and its application using python

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published