Skip to content

ankitk2109/Data_Mining

Repository files navigation

Data_Mining

  1. Association: This folder contains program(run.py) and dataset(inside specs folder) to apply Apriori and FPgrowth algorithm. The steps that have been performed are based on tasks that are written in report.docx file.
  2. Data_Transformation_Reduction_Discretisation: This folder contains program(run.py) and dataset(inside specs folder) to do MinMax Normalization, Reducing attributes using Principal Component Analysis(PCA) and Discretising through equal width and equal frequency on data. The steps that have been performed are based on tasks that are written in report.docx file.
  3. Regression & classification: This folder contains program(run.ipynb) and dataset(inside specs folder) to do Linear regression and Decision tree classfication using sklearn on the dataset. The steps that have been performed on data are based on tasks that are written in report.docx file. kMeans and Clustering: Program(run.py) and dataset(inside specs). Undersatnding how kMeans and DBSCAN algorithm works.Changing max_iter and n_init values in kmeans to see how algorithm performs. Also checking the performance of DBSCAN while changing eps and min_samples value.