Blog Don’t Get Left Behind in the AI Race: Your Easy Starting Point is Here Read now

CCA Spark and Hadoop Developer Exam (CCA175)

  • Number of Questions: 8–12 performance-based (hands-on) tasks on Cloudera Enterprise cluster. See below for full cluster configuration
  • Time Limit: 120 minutes
  • Passing Score: 70%
  • Language: English

 

Exam Question Format

Each CCA question requires you to solve a particular scenario. In some cases, a tool such as Impala or Hive may be used. In most cases, coding is required.

Evaluation, Score Reporting, and Certificate

Your exam is graded immediately upon submission and you are e-mailed a score report within three days of your exam. Your score report displays the problem number for each problem you attempted and a grade on that problem. If you fail a problem, the score report includes the criteria you failed (e.g., “Records contain incorrect data” or “Incorrect file format”). We do not report more information in order to protect the exam content. Read more about reviewing exam content on the FAQ.

If you pass the exam, you receive a second e-mail within a week of your exam with your digital certificate as a PDF and your license number.

Audience and Prerequisites

There are no prerequisites required to take any Cloudera certification exam. The CCA Spark and Hadoop Developer exam (CCA175) follows the same objectives as DENG-254: Preparing with Cloudera Data Engineering and the training course is an excellent preparation for the exam. 

Watch a free OnDemand course to help prepare for your certification

Have questions? Read our Certification FAQ

Contact us at certification@cloudera.com

Required Skills


Transform, Stage, and Store

Convert a set of data values in a given format stored in HDFS into new data values or a new data format and write them into HDFS.

  • Load data from HDFS for use in Spark applications

  • Write the results back into HDFS using Spark

  • Read and write files in a variety of file formats

  • Perform standard extract, transform, load (ETL) processes on data using the Spark API

Data Analysis

Use Spark SQL to interact with the metastore programmatically in your applications. Generate reports by using queries against loaded data.

  • Use metastore tables as an input source or an output sink for Spark applications

  • Understand the fundamentals of querying datasets in Spark

  • Filter data using Spark

  • Write queries that calculate aggregate statistics

  • Join disparate datasets using Spark

  • Produce ranked or sorted data

Configuration

This is a practical exam and the candidate should be familiar with all aspects of generating a result, not just writing code.

  • Supply command-line options to change your application configuration, such as increasing available memory

 

Exam delivery and cluster information

CCA175 is a remote-proctored exam available anywhere, anytime. See the FAQ for more information and system requirements.

CCA175 is a hands-on, practical exam using Cloudera technologies. Each user is given their own CDH6 (currently 6.1.1) cluster pre-loaded with Spark 2.4.

All websites, including Google/search functionality and access to Spark external packages is disabled. You may not use notes or other exam aids.

Certification FAQ

Your form submission has failed.

This may have been caused by one of the following:

  • Your request timed out
  • A plugin/browser extension blocked the submission. If you have an ad blocking plugin please disable it and close this message to reload the page.