Skip to content

Web Logs Data Unsupervised, Supervised Learning, Association Rule Mining & ARIMA Prediction. Web Crawling of citation information from Google Scholar

Notifications You must be signed in to change notification settings

ahujaya/Web-Logs-Unsupervised-and-Supervised-Machine-Learning-Association-Rule-Mining-ARIMA-Prediction

Repository files navigation

Web-Logs-Unsupervised-and-Supervised-Learning-Association-Rule-Mining-ARIMA-Prediction

Web Logs Data Unsupervised, Supervised Learning, Association Rule Mining & ARIMA Prediction. Web Crawling of citation information from Google Scholar.

Part I - Data Analytics — Web Log Data

  1. Data ETL
  • Load Data
  • Feature Selection
  1. Unsupervised learning
  2. Supervised learning
  • Data Preparation
  • Logistic Regression
  • K-fold Cross Validation
  1. Association Rule Mining

Part II - Web Crawling

  1. Crawl the professor Gang Li citation information from 2003 to 2021
  2. Train Arima to predict the 2018 to 2020 citation
  • Train Arima Model
  • Predicting the citation and Calculate the RMSE
  • Draw the visualization to compare
  1. Conduct the Grid Search with parameter selection and then predict the 2021 and 2022
  • Grid Search
  • Select the best parameter values and Predict for 2021 and 2022

About

Web Logs Data Unsupervised, Supervised Learning, Association Rule Mining & ARIMA Prediction. Web Crawling of citation information from Google Scholar

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published