This repo gives idea to deploye an real time big data enviroment which contains Apahce Spark-Apache Kafka and Cassandra database
If you want to manage huge big data(structered or not) and you want it's in realtime ,you should use a big data enviroment and my suggestion is below and inside of repo.
Please install firstly
*Zookeeper
*Apache Kafka
*Apache Spark (1.6.0 used)
*Cassandra
Requirements
*Cassandra-Spark connector
*Kafka-Spark connector