Professional Documents
Culture Documents
Cloudera Developer Training For Apache Hadoop
Cloudera Developer Training For Apache Hadoop
Take your knowledge to the next level with Cloudera’s Apache Hadoop
Training and Certification
Cloudera’s four-day developer training course delivers the key concepts and
expertise necessary to create robust data processing applications using Apache
Hadoop.
Through lecture and interactive, hands-on exercises, attendees will navigate the
Hadoop ecosystem, learning topics such as
“Cloudera has true expertise in their
• MapReduce and the Hadoop Distributed File System (HDFS) and how to write
ranks, offering intimate insight and
MapReduce code
experience with the Apache
• Best practices and considerations for Hadoop development, debugging Hadoop ecosystem.”
techniques and implementation of workflows and common algorithms
Justin Hancock,
• How to leverage Hive, Pig, Sqoop, Flume, Oozie and other projects from the Director
Apache Hadoop ecosystem
Upon completion of the course, attendees are able to attempt the Cloudera
Certified Developer for Apache Hadoop (CCDH) exam. Certification is a great
differentiator; it helps establish individuals as leaders in their field, providing
customers with tangible evidence of their skills.
Audience
This course is intended for experienced developers who wish to write, maintain,
and/or optimize Apache Hadoop jobs. A background in Java is preferred, but
experience with other programming language such as PHP, Python or C# is
sufficient.
Cloudera, Inc. 210 Portage Avenue, Palo Alto, CA 94306 USA | 1-888-789-1488 or 1-650-362-0488 | cloudera.com
©2011 Cloudera, Inc. All rights reserved. Cloudera and the Cloudera logo are trademarks or registered trademarks of Cloudera Inc. in the USA and other countries. All other trademarks are the property of their
respective companies. Information is subject to change without notice.
TRAINING SHEET
Cloudera, Inc. 210 Portage Avenue, Palo Alto, CA 94306 USA | 1-888-789-1488 or 1-650-362-0488 | cloudera.com
©2011 Cloudera, Inc. All rights reserved. Cloudera and the Cloudera logo are trademarks or registered trademarks of Cloudera Inc. in the USA and other countries. All other trademarks are the property of their
respective companies. Information is subject to change without notice.
TRAINING SHEET
Computing Environment The current mix of computing resources and demands that motivates use of a technology like Apache
Hadoop
Hadoop Distributed File System How files are stored and managed in HDFS; the infrastructure that supports HDFS
MapReduce The phases of execution and framework for running a MapReduce job. Expected properties of job runs
based on number of mappers, number of reducers and distribution of data
Hadoop API The Java classes that make up the API for developers who wish to write Apache Hadoop MapReduce
jobs
Hadoop Platform The basic purpose, design and operation of tools that augment the Apache Hadoop core to make a
comprehensive platform, including Hadoop Streaming, fuse-dfs, Apache Hive, Apache Pig, Apache
Flume, Apache Sqoop, Apache HBase, Apache Oozie and HUE
Cloudera, Inc. 210 Portage Avenue, Palo Alto, CA 94306 USA | 1-888-789-1488 or 1-650-362-0488 | cloudera.com
©2011 Cloudera, Inc. All rights reserved. Cloudera and the Cloudera logo are trademarks or registered trademarks of Cloudera Inc. in the USA and other countries. All other trademarks are the property of their
respective companies. Information is subject to change without notice.