Professional Documents
Culture Documents
MASTER’S PROGRAM
In collaboration with IBM
www.simplilearn.com
1 | www.simplilearn.com
Contents
Program Outcomes 07
Who Should Enroll 08
Courses 09
Step 1: Big Data for Data Engineering 09
Step 2: Data Engineering with Hadoop 10
Step 3: Data Engineering with Scala 11
Step 4: Big Data Hadoop and Spark Developer 12
Step 5: Python for Data Science 14
Step 6: PySpark Training 15
Step 7: Apache Kafka 17
Step 8: MongoDB Developer and Administrator 19
Step 9: AWS Technical Essentials 21
Step 10: Big Data on AWS 22
Step 11: Big Data Capstone 23
Electives 24
Certificates 26
3 | www.simplilearn.com
Key
Features
4 | www.simplilearn.com
About IBM and Simplilearn
collaboration
About Simplilearn
Simplilearn is a leader in digital skills by the industry’s highest completion
training, focused on the emerging rates. Partnering with professionals and
technologies that are transforming our companies, we identify their unique needs
world. Our Blended Learning approach and provide outcome-centric solutions to
drives learner engagement and is backed help them achieve their professional goals.
5 | www.simplilearn.com
Learning Path - Big Data Engineer
PySpark
Apache Kafka
Training
Electives
• Scala for Data Science
• Spark for Scala Analytics
• Industry Master Class -
Data Engineering
• Python for Data Science
• AWS Technical Essentials
• Core Java Certification Training
6 | www.simplilearn.com
Big Data Engineer Master’s Program
Outcomes
Master tools and skills such as data Learn how Kafka is used in the real
model creation, database interfaces, world, including its architecture
advanced architecture, Spark, Sala, and components, get hands-on
RDD, SparkSQL, Spark Streaming, experience connecting Kafka to
Spark ML, GraphX, Sqoop, Flume, Pig, Spark, and work with Kafka Connect
Hive, Impala, and Kafka architecture
7 | www.simplilearn.com
Who Should Enroll in this Program?
8 | www.simplilearn.com
S
T
E
Big Data for Data Engineering P
1
This introductory course from IBM will teach you the basic concepts and 2
terminologies of big data and its real-life applications across multiple
industries. You will gain insights on how to improve business productivity 3
by processing large volumes of data and extracting valuable information
from them.
4
5
Key Learning Objectives 6
Understand what big data is, sources of big data, and real-life 7
examples
8
Learn the key difference between big data and data science
Master the usage of big data for operational analysis and better
9
customer service 10
Gain knowledge of the ecosystem of big data and the Hadoop
framework
11
Course curriculum
Lesson 1 - What is Big Data?
9 | www.simplilearn.com
S
T
E
Data Engineering with Hadoop P
1
Apache Hadoop is one of the most in-demand technologies for analyzing 2
big data. This introductory Hadoop course by IBM will give you an
overview of what Hadoop is and its components, such as MapReduce 3
and Hadoop Distributed File System (HDFS). Additionally, this course will
teach you to explore with large data sets and use Hadoop’s method of
4
distributed processing. 5
6
Key Learning Objectives
7
Understand Hadoop’s architecture and primary components, such as
MapReduce and (HDFS) 8
Add and remove nodes from Hadoop clusters, check the available disk 9
space on each node, and modify configuration parameters
10
Learn about Apache projects that are part of the Hadoop ecosystem,
including Pig, Hive, HBase, ZooKeeper, Oozie, Sqoop, and Flume. 11
Course curriculum
Lesson 1 - Introduction to Hadoop
10 | www.simplilearn.com
S
T
E
Data Engineering with Scala P
1
Kickstart your learning of Scala with this introductory course and 2
familiarize yourself with Scala programming. Upon completion of this
course, carefully crafted by IBM, you will be able to write Scala codes, 3
perform big data analysis using Scala, and create your own Scala projects.
4
5
Key Learning Objectives
6
Create your own Scala Project
Lesson 4 - Collections
11 | www.simplilearn.com
S
T
E
Big Data Hadoop and Spark Developer P
1
Simplilearn’s Big Data Hadoop Training Course helps you master big 2
data and Hadoop ecosystem tools such as HDFS, YARN, MapReduce,
Hive, Impala, Pig, HBase, Spark, Flume, Sqoop, and Hadoop Frameworks, 3
including additional concepts of the big data processing life cycle.
Throughout this online, instructor-led Hadoop training, you will work
4
on real-time projects in retail, tourism, finance, and other domains. 5
This big data course also prepares you for Cloudera’s CCA175 Big Data
certification. 6
7
Key Learning Objectives 8
Learn how to navigate the Hadoop Ecosystem and understand how to
optimize its use
9
Ingest data using Sqoop, Flume, and Kafka
10
Implement partitioning, bucketing, and indexing in Hive 11
Work with RDD in Apache Spark
12 | www.simplilearn.com
Course curriculum
Lesson 1 - Introduction to Bigdata and Hadoop
13 | www.simplilearn.com
S
T
E
Python for Data Science P
1
Kickstart your learning of Python for data science with this introductory 2
course and familiarize yourself with programming. Upon completion of
this course, carefully crafted by IBM, you will be able to write your Python 3
scripts, perform fundamental hands-on data analysis using the Jupyter-
based lab environment, and create your own data science projects using
4
IBM Watson. 5
6
Key Learning Objectives
7
Write your first Python program by implementing concepts of
variables, strings, functions, loops, and conditions 8
Understand the nuances of lists, sets, dictionaries, conditions and 9
branching, objects, and classes
10
Work with data in Python, including reading and writing files, loading,
working, and saving data with Pandas 11
Course curriculum
Lesson 1 - Python Basics
14 | www.simplilearn.com
S
T
E
Pyspark Training P
1
Pyspark Training will provide an in-depth overview of Apache Spark, the 2
open-source query engine for processing large datasets, and how to
integrate it with Python using the PySpark interface. This course will show 3
you how to build and implement data-intensive applications as you dive
into the world of high-performance machine learning. You’ll learn how to
4
leverage Spark RDD, Spark SQL, Spark MLlib, Spark Streaming, HDFS, 5
Sqoop, Flume, Spark GraphX, and Kafka.
6
Key Learning Objectives 7
Understand how to leverage the functionality of Python as you deploy 8
it in the Spark ecosystem
9
Master Apache Spark architecture and how to set up a Python
environment for Spark 10
Learn about various techniques for collecting data, understand RDDs 11
and how to contrast them with DataFrames, how to read data from
files and HDFS, and how to work with schemas
15 | www.simplilearn.com
Course curriculum
Lesson 01 - A Brief Primer on Pyspark
16 | www.simplilearn.com
S
T
E
Apache Kafka P
1
In this Apache Kafka certification course, you will master the architecture, 2
installation, configuration, and interfaces of Kafka open-source
messaging. With this Kafka training, you will learn the basics of Apache 3
ZooKeeper as a centralized service and develop the skills to deploy Kafka
for real-time messaging. This course is part of the Big Data Hadoop
4
Architect Master’s Program and is recommended for developers and 5
analytics professionals who wish to advance their expertise.
6
Key Learning Objectives 7
Describe the importance of big data
8
Describe the fundamental concepts of Kafka 9
Describe the architecture of Kafka 10
Explain how to install and configure Kafka 11
Explain how to use Kafka for real-time messaging
17 | www.simplilearn.com
Course Curriculum
Lesson 1 - Getting Started with Big Data and Apache Kafka
18 | www.simplilearn.com
S
T
E
MongoDB Developer and Administrator P
1
Become an expert MongoDB developer and administrator by gaining an 2
in-depth knowledge of NoSQL and master the skills of data modeling,
ingestion, query, sharding, and data replication. This course includes 3
industry-based projects in the elearning and telecom domains. It is
best suited for database administrators, software developers, system 4
administrators, and analytics professionals.
5
6
Key Learning Objectives
7
Develop expertise in writing Java and NodeJS applications using
MongoDB 8
Master the skills of replication and sharding of data in MongoDB to
optimize read/write performance
9
Perform installation, configuration, and maintenance of the MongoDB
10
environment
11
Get hands-on experience creating and managing different types of
indexes in MongoDB for query execution
19 | www.simplilearn.com
Course curriculum
Lesson 1 - Introduction to NoSQL Databases
20 | www.simplilearn.com
S
T
E
AWS Technical Essentials P
1
This AWS Technical Essentials course teaches you how to navigate the 2
AWS management console; understand AWS security measures, storage,
and database options; and gain expertise in web services like RDS and 3
EBS. This course, prepared in line with the latest AWS syllabus, will help
you become proficient in identifying and efficiently using AWS services. 4
5
Key Learning Objectives 6
Understand the fundamental concepts of AWS platform and cloud
computing
7
Identify AWS concepts, terminologies, benefits, and deployment
8
options to meet business requirements
9
Identify deployment and network options in AWS
10
11
Course curriculum
Lesson 01 - Introduction to Cloud Computing
21 | www.simplilearn.com
S
T
E
Big Data on AWS P
1
In this AWS Big Data certification course, you will become familiar with 2
concepts cloud computing and its deployment models; the Amazon
web services cloud platform; Kinesis Analytics; AWS big data storage, 3
processing, analysis, visualization, and security services; EMR; AWS
Lambda and Glue; machine learning algorithms; and much more. 4
5
Key Learning Objectives 6
Understand how to use Amazon EMR for processing the data using
Hadoop ecosystem tools
7
Understand how to use Amazon Kinesis for big data processing in
8
real-time
9
Analyze and transform big data using Kinesis Streams
10
Visualize data and perform queries using Amazon QuickSight
11
Course curriculum
Lesson 1 - AWS in Big Data Introduction
Lesson 2 - Collection
Lesson 3 - Storage
Lesson 4 - Processing I
Lesson 5 - Processing II
Lesson 6 - Analysis I
Lesson 7 - Analysis II
Lesson 8 - Visualisation
Lesson 9 - Security
22 | www.simplilearn.com
S
T
E
Big Data Capstone P
1
This Big Data Capstone project will give you an opportunity to implement
the skills you learned throughout this program. Through dedicated
2
mentoring sessions, you’ll learn how to solve a real-world, industry-aligned 3
big data problem. This project is the final step in the learning path and will
enable you to showcase your expertise in big data to future employers. 4
5
6
7
8
9
10
11
23 | www.simplilearn.com
Elective Course
24 | www.simplilearn.com
AWS Technical Essentials
This AWS Technical Essentials course teaches you how
to navigate the AWS management console; understand
AWS security measures, storage, and database options;
and gain expertise in web services like RDS and EBS. This
course, prepared in line with the latest AWS syllabus, will
help you become proficient in identifying and efficiently
using AWS services.
25 | www.simplilearn.com
Certificates
Upon completion of this Master’s Program, you will receive certificates from
IBM and Simplilearn in the Big Data Engineer courses in the learning path.
These certificates will testify to your skills as an expert in data engineering.
Upon program completion, you will also receive an industry recognized Master’s
Certificate from Simplilearn.
26 | www.simplilearn.com
Advisory board member
27 | www.simplilearn.com
USA
INDIA
www.simplilearn.com