Scala
Understand Scala and its implementation
Apply control structures, loops, collections, and more
Master the concepts of traits and OOP in Scala
Understand functional programming in Scala
Get an insight into the big data challenges
Learn how Spark acts as a solution to these challenges
Install Spark and implement Spark operations on Spark Shell
Understand the role of RDDs in Spark
Implement Spark applications on YARN (Hadoop)
Stream data using Spark Streaming API
Implement machine learning algorithms in Spark using MLlib API
Analyze Hive and Spark SQL architecture
Implement Spark SQL queries to perform several computations
Spark
• What is Spark?
• Review: From Hadoop MapReduce to Spark
• Spark Overview
Spark Basics
• Using the Spark Shell
• RDDs (Resilient Distributed Datasets)
• Functional Programming in Spark
Working with RDDs
• Creating RDDs
• RDD Operations
Aggregating Data with Pair RDDs
• Key-Value Pair RDDs
• Map-Reduce
• RDD Operations
Writing and Deploying Spark Applications
• Spark Applications vs. Spark Shell
• Spark Context
• Spark Application (Scala)
• Running a Spark Application
• Spark Application Web UI
• Hands-On Exercise: Write and Run a Spark Application
• Executing Parallel Operations
• Configuring Spark Properties
• Logging
• Parallel Processing
• Advanced Spark Streaming
• Common Spark Use Cases