Skip to content

A project on Topic Modelling and Text Summarization of NIPS research papers

Notifications You must be signed in to change notification settings

RPorcedda/Text-Mining

 
 

Repository files navigation

alt text

NIPS Papers: Topic Modelling and Text Summarization

Text Mining & Search Exam Project
MSc Data Science @ University of Milan Bicocca

Authors:

Gaetano Chiriaco

Riccardo Porcedda

Gianmarco Russo



How to navigate the repo

WARNING:The original work was conducted in a different folder. Before executing the notebooks, please, modify all paths in the import.

  • Papers_Preproc.ipynb: contains all papers preprocessing and feature engineering described in the report

  • LDA.ipynb: Topic modeling with Latent Dirichlet Allocation

  • PLSA.ipynb: Topic modeling with probabilistic Latent Semantic Analysis

  • DocSignal.ipynb: Our 1D U-net Text Summarizer as described in the report

  • Sumy.ipynb: Text Summarization with Sumy TextRank for comparison

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 100.0%