Skip to content

Code, notes and Hands-on for the CCA175 Cloudera Spark and Hadoop Certification

Notifications You must be signed in to change notification settings

IagoFeliubadalo/CCA175-Exam-Preparation

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 

Repository files navigation

CCA175-Exam-Preparation

Code, notes and Hands-on for the CCA175 Cloudera Spark and Hadoop Certification Feel free to add or request anything that hasnt been covered.

https://www.cloudera.com/more/training/certification/cca-spark.html

Data Ingest

The skills to transfer data between external systems and your cluster. This includes the following:

  • Import data from a MySQL database into HDFS using Sqoop
  • Export data to a MySQL database from HDFS using Sqoop
  • Change the delimiter and file format of data during import using Sqoop
  • Ingest real-time and near-real-time streaming data into HDFS
  • Process streaming data as it is loaded onto the cluster
  • Load data into and out of HDFS using the Hadoop File System commands

Transform, Stage, and Store

Convert a set of data values in a given format stored in HDFS into new data values or a new data format and write them into HDFS.

  • Load RDD data from HDFS for use in Spark applications
  • Write the results from an RDD back into HDFS using Spark
  • Read and write files in a variety of file formats
  • Perform standard extract, transform, load (ETL) processes on data

Data Analysis

Use Spark SQL to interact with the metastore programmatically in your applications. Generate reports by using queries against loaded data.

  • Use metastore tables as an input source or an output sink for Spark applications
  • Understand the fundamentals of querying datasets in Spark
  • Filter data using Spark
  • Write queries that calculate aggregate statistics
  • Join disparate datasets using Spark
  • Produce ranked or sorted data

Getting Started Tutorial

https://www.cloudera.com/developers/get-started-with-hadoop-tutorial.html

Some websites about the CCA175 Certification

https://www.itexams.com/exam/CCA175

This website is pretty cool! Also there are some youtube videos, check them out!

Cool website -> http://arun-teaches-u-tech.blogspot.com/

Youtube videos -> https://www.youtube.com/playlist?list=PLRLUm7no962j8cf-mpXjrQqusWvw-gIJx

About

Code, notes and Hands-on for the CCA175 Cloudera Spark and Hadoop Certification

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published