Gallery of Apache Zeppelin notebooks using Enth-Spark-AI.
Updated Jun 15, 2017
Apache Spark is an open-source, general-purpose distributed cluster-computing framework. It provides an interface for programming entire clusters with implicit data parallelism and fault tolerance.
An image for running Scala Jupyter notebooks and Apache Spark in the cloud on OpenShift
Big Data Management related Zeppelin notebooks
A repository for IPython notebook backups
Python RDD notebook in Apache Spark
PySpark notebook with the MovieLens dataset
IPython notebooks for text analysis
Sample notebooks on Azure Databricks for ETL
Heart disease classification with data mining (Zeppelin notebook)
Notebooks for Python and Spark for Big Data
Apache Spark cluster connected to a Jupyter Notebook instance
Jupyter notebooks to demonstrate the Lambda Architecture with Kafka Streams and Apache Spark
2019 Canadian Federal Election: Calculating the results using Apache Spark (Databricks notebook in Scala)
Deploy Apache Spark in client mode on a Kubernetes cluster, integrated with Jupyter notebooks through a JupyterHub server.
Jupyter notebook on the analysis of the annual databases of road traffic injury accidents
PySpark and Spark [my notes and all practice notebooks]
This repository simulates a consultancy project for Repsol, containing both the code notebook and the analysis.
Big data analysis of a Bundesliga football league dataset using PySpark, Spark SQL, and NumPy, done in a Jupyter notebook.
Performing tasks on a 5-million-record CSV dataset using PySpark and Elasticsearch (no Logstash!). NOTE: The tasks have been performed both in a Jupyter notebook and in a Python script, and the two may differ slightly.
Created by Matei Zaharia
Released May 26, 2014