Simona7-code/Distributed_Data_Analysis_and_Mining-course_project__2022

Distributed_Data_Analysis_and_Mining-course_project__2022

The main goal of this course project was to choose a high-dimensional dataset and apply machine learning tools, libraries, and algorithms designed for big data analysis (such as Spark and map-reduce approaches) to extract insights from the data.
Both supervised (classification) and unsupervised (clustering) learning approaches were attempted.
Further details and results are presented in the report (PDF file).
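
As an illustration only (this is not code from the project or its report), the map-reduce pattern mentioned above can be sketched in plain Python without Spark. The function names (`map_row`, `reduce_pairs`, `column_means`) and the toy data are hypothetical and chosen just for this example: each record is mapped to a partial result, and the partial results are merged pairwise, which is the shape of computation that lets a framework distribute the work across machines.

```python
from functools import reduce

def map_row(row):
    # Map step: turn one record into a partial result (column values, count of 1).
    return (list(row), 1)

def reduce_pairs(a, b):
    # Reduce step: merge two partial results by adding column sums and counts.
    sums_a, n_a = a
    sums_b, n_b = b
    return ([x + y for x, y in zip(sums_a, sums_b)], n_a + n_b)

def column_means(rows):
    # Combine all mapped records, then divide each column sum by the row count.
    sums, n = reduce(reduce_pairs, map(map_row, rows))
    return [s / n for s in sums]

# Hypothetical toy dataset: three records with two numeric features each.
toy_rows = [(1.0, 10.0), (2.0, 20.0), (3.0, 30.0)]
means = column_means(toy_rows)
print(means)
```

Because `reduce_pairs` is associative, the partial results can be merged in any grouping, which is the property a distributed engine relies on when it combines results computed on separate partitions.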

The project was carried out in collaboration with Emanuele Sabatini, Federico Volpi, Marco Mannarà and Pierluigi Brasile.
