Skip to content

Compute the top N viewed Wikipedia pages for a given date

Notifications You must be signed in to change notification settings

raphaelauv/Spark-wikipedia-topRank

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commit
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Spark wikipedia topRank

Overview

TL;DR

cd src/exoETL
gradle shadowJar
cd build/libs
java -jar exo-all-0.0.1.jar --dateHour 20190815 10

Development

Core pre-requisites

If using local IDE:

  • Java 1.8
  • Scala 2.11
  • Spark 2.3.0

(IntelliJ IDEA 2019.2 (Community Edition))

About

Compute the top N viewed Wikipedia pages for a given date

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages