
Post Graduate Program in Data Science
Co-Developed with IBM

1 | www.simplilearn.com
Table of Contents

About the Program
The Key Features of the Post Graduate Program in Data Science by Purdue University
About Post Graduate Program in Data Science in Partnership with Purdue University
About Simplilearn
Program Eligibility Criteria and Application Process
Learning Path Visualization
Program Outcomes
Who Should Enroll in this Program
Courses
Step 1 - Programming Refresher
Step 2 - Statistics Essential for Data Science
Step 3 - R Programming for Data Science
Step 4 - Data Science with R
Step 5 - Python for Data Science
Step 6 - Data Science with Python
Step 7 - Machine Learning
Step 8 - Tableau
Step 9 - Natural Language Processing
Step 10 - Capstone Project
Electives
Certificates and Badges
Advisory Board Members

About the Program
Accelerate your career with this acclaimed Post Graduate Program in Data Science, offered in partnership with Purdue University and co-developed with IBM. It features a mix of theory, case studies, and extensive hands-on practice. The program provides a comprehensive Data Science education, leveraging Purdue’s academic excellence in Data Science and Simplilearn’s partnership with IBM.

Designed to give recent graduates and experienced professionals an extensive Data Science education, this Post Graduate Program blends online self-paced videos, live virtual classes, hands-on projects, and labs with mentorship sessions. Together they provide a high-engagement learning experience and real-world applications that help you master essential Data Science skills. This program offers in-depth exposure to technologies including R, Python, Machine Learning, Tableau, and Natural Language Processing, and prepares you for an exciting career in Data Science.

The Key Features of the Post
Graduate Program in Data Science
by Purdue University

Purdue Post Graduate Program certificate
Purdue University alumni status
International recognition by Purdue University
Leading industry-recognized analytics program
Co-developed with IBM
Industry-recognized IBM course certificates
25+ hands-on projects and one capstone
Limited class size for an optimal experience
Enrollment in Simplilearn’s JobAssist

About the Post Graduate Program
in Data Science in partnership with
Purdue University
Purdue University, a top public research institution, offers higher education at its highest proven value. Committed to affordability, the University has frozen tuition and most fees at 2012-13 levels. Committed to student success, Purdue is changing the student experience with a greater focus on faculty-student interaction and creative use of technology. Committed to pursuing scientific discoveries and engineered solutions, Purdue has streamlined pathways for faculty and student innovators who have a vision for moving the world forward.

This Data Science Post Graduate Program in partnership with Purdue University will open pathways for your career in virtually every realm of business, from healthcare to education to manufacturing.

Upon successfully completing this program, you will:
Receive a joint Purdue-Simplilearn certificate of completion
Be eligible to join the Purdue alumni association and participate in its various networking opportunities and career events

About Simplilearn
Simplilearn is a leader in digital skills training, focused on the emerging technologies
that are transforming our world. Our unique blended learning approach drives learner
engagement and is backed by the industry’s highest course completion rates. Partnering
with professionals and companies, we identify their unique needs and provide
outcome-centric solutions to help them achieve their professional goals.

Program Eligibility Criteria and
Application Process
Those wishing to enroll in the Post Graduate Program in Data Science in
partnership with Purdue University will be required to apply for admission to the
program.

Eligibility Criteria
For admission to this Post Graduate Program in Data Science, candidates
should have:
A bachelor’s degree with an average of 50% or higher marks
Proficiency in a programming language and collegiate mathematics
Current university students in their final year with an average of 50%
or higher marks can also apply

Application Process
The application process consists of three simple steps. An offer of
admission will be made to the selected candidates, who accept it by
paying the admission fee.

STEP 1: SUBMIT AN APPLICATION
Complete the application and include a brief statement of purpose telling our admissions counselors why you’re interested and qualified to be part of the Post Graduate Program in Data Science.

STEP 2: APPLICATION REVIEW
After you submit your application, a panel of admissions counselors will review your application and statement of purpose to determine your qualifications and interest in the program.

STEP 3: ADMISSION
An offer of admission will be made to qualified candidates, and you can accept this offer by paying the program fee.

Talk to an Admissions Counselor
We have a team of dedicated admissions counselors who are here to help
guide you in applying to the program. They are available to:

Address questions related to the application

Assist with financial aid (if required)

Help you resolve your questions and understand the program

Learning Path
Step 1 - Programming Refresher
Step 2 - Statistics Essential for Data Science
Step 3 - R Programming for Data Science
Step 4 - Data Science with R
Step 5 - Python for Data Science
Step 6 - Data Science with Python
Step 7 - Machine Learning
Step 8 - Tableau
Step 9 - Natural Language Processing
Step 10 - Capstone Project

Electives
IBM Watson for Chatbots
Machine Learning with R
Core Java
Big Data Hadoop and Spark Developer
Introduction to Artificial Intelligence
Data Science with SAS

Program Outcomes

Gain an in-depth understanding of data structure and data manipulation

Understand and use linear and non-linear regression models and classification techniques for data analysis

Obtain a comprehensive knowledge of supervised and unsupervised learning models such as linear regression, logistic regression, clustering, dimensionality reduction, K-NN, and pipeline

Perform scientific and technical computing using the SciPy package and its sub-packages such as Integrate, Optimize, Statistics, IO, and Weave

Gain expertise in mathematical computing using the NumPy and Scikit-Learn packages

Master the concepts of recommendation engines and time series modeling, and gain practical mastery over the principles, algorithms, and applications of Machine Learning

Learn to analyze data using Tableau and become proficient in building interactive dashboards

Understand deep reinforcement learning techniques applied in Natural Language Processing

Understand the different components of the Hadoop ecosystem; learn to work with HBase, its architecture, and its data storage; learn the difference between HBase and RDBMS; and use Hive and Impala for partitioning

Understand MapReduce and its characteristics, and learn how to ingest data using Sqoop and Flume

Who Should Enroll in this Program?
This program caters to working professionals from a variety of industries and backgrounds; the diversity of our students adds richness to class discussions and interactions.

The Data Science role requires an amalgam of experience, Data Science knowledge, and the correct tools and technologies. It is a solid career choice for both new and experienced professionals. Aspiring professionals of any educational background with an analytical frame of mind are most suited to pursue this Post Graduate Program in Data Science, including:

IT professionals
Analytics managers
Business analysts
Software developers
Beginners or recent graduates with a bachelor’s or master’s degree

Step 1 - Programming Refresher

Programming is an increasingly important skill; this course will establish your proficiency in handling basic programming concepts. The course will cover the basics of Java, Python, and C++. By the end of this course, you will understand what object-oriented programming is, basic programming concepts like data types, variables, strings, loops, and functions, and software engineering concepts like multithreading and multitasking.

Key Learning Objectives
Obtain fundamental knowledge of the basics of Java, Python, and C++

Gain expertise in object-oriented programming and understand basic programming concepts like data types, variables, strings, loops, and functions

Comprehend software engineering concepts like multithreading and multitasking

Course curriculum
Lesson 1 - Course Introduction

Lesson 2 - Basics of Java, Python, and C++

Step 2 - Statistics Essential for Data Science

Statistics is the science of assigning a probability through the collection, classification, and analysis of data. A foundational part of Data Science, this course will enable you to define statistics and essential terms related to it, explain measures of central tendency and dispersion, and comprehend skewness, correlation, regression, and distribution. You will be able to make data-driven predictions through statistical inference.

Key Learning Objectives
Understand the fundamentals of statistics

Work with different types of data

Plot different types of data

Calculate the measures of central tendency, asymmetry, and variability

Calculate correlation and covariance

Distinguish and work with different types of distribution

Estimate confidence intervals

Perform hypothesis testing

Make data-driven decisions

Understand the mechanics of regression analysis

Carry out regression analysis

Use and understand dummy variables

Understand the concepts needed for Data Science with Python and R

Course curriculum
Lesson 1 - Introduction

Lesson 2 - Sample or Population Data?

Lesson 3 - The Fundamentals of Descriptive Statistics

Lesson 4 - Measures of Central Tendency, Asymmetry, and Variability

Lesson 5 - Practical Example: Descriptive Statistics

Lesson 6 - Distributions

Lesson 7 - Estimators and Estimates

Lesson 8 - Confidence Intervals: Advanced Topics

Lesson 9 - Practical Example: Inferential Statistics

Lesson 10 - Hypothesis Testing: Introduction

Lesson 11 - Hypothesis Testing: Let’s Start Testing!

Lesson 12 - Practical Example: Hypothesis Testing

Lesson 13 - The Fundamentals of Regression Analysis

Lesson 14 - Subtleties of Regression Analysis

Lesson 15 - Assumptions for Linear Regression Analysis

Lesson 16 - Dealing with Categorical Data

Lesson 17 - Practical Example: Regression Analysis
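
To make the descriptive and inferential ideas in Lessons 4 through 8 concrete, here is a minimal Python sketch using only the standard library. It is an illustration, not course material: the helper names and the `hours` and `scores` sample data are invented for this example. The confidence interval assumes a normal approximation; with a sample this small, a t-distribution would usually be preferred.

```python
import math
import statistics


def describe(values):
    """Return the mean, median, and sample standard deviation of a list."""
    return {
        "mean": statistics.mean(values),
        "median": statistics.median(values),
        "stdev": statistics.stdev(values),  # sample (n - 1) standard deviation
    }


def pearson_correlation(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    mean_x, mean_y = statistics.mean(xs), statistics.mean(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def mean_confidence_interval(values, confidence=0.95):
    """Normal-approximation confidence interval for the mean (an assumption)."""
    z = statistics.NormalDist().inv_cdf(0.5 + confidence / 2)
    margin = z * statistics.stdev(values) / math.sqrt(len(values))
    center = statistics.mean(values)
    return center - margin, center + margin


# Hypothetical sample: hours studied and the matching test scores.
hours = [2, 4, 6, 8, 10]
scores = [52, 58, 71, 80, 89]

summary = describe(hours)
r = pearson_correlation(hours, scores)
low, high = mean_confidence_interval(scores)
print(summary, round(r, 3), (round(low, 1), round(high, 1)))
```

A correlation close to 1 here only says the two invented columns move together; it says nothing about cause, which is the kind of distinction the hypothesis testing lessons address.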

Step 3 - R Programming for Data Science

Gain insight into the R programming language with this introductory course. An essential programming language for data analysis, R is fundamental to becoming a successful Data Science professional. In this course, you will learn how to write R code, learn about R’s data structures, and create your own functions. After completing this course, you will be ready to begin your first data analysis.

Key Learning Objectives
Learn about math, variables, strings, vectors, factors, and vector operations

Gain fundamental knowledge of arrays and matrices, lists, and data frames

Gain an understanding of conditions and loops, functions in R, objects, classes, and debugging

Learn how to accurately read text, CSV, and Excel files, plus how to write and save data objects in R to a file

Understand and work with strings and dates in R

Course curriculum
Lesson 1 - R Basics

Lesson 2 - Data Structures in R

Lesson 3 - R Programming Fundamentals

Lesson 4 - Working with Data in R

Lesson 5 - Strings and Dates in R

Step 4 - Data Science with R

The next step to becoming a data scientist is learning R, the most in-demand open source technology. R is a powerful Data Science and analytics language with a steep learning curve and a very vibrant community. This is why it is quickly becoming the technology of choice for organizations that are adopting the power of analytics for competitive advantage.

Key Learning Objectives
Gain a foundational understanding of business analytics

Install R and RStudio, set up your workspace, and learn about the various R packages

Master R programming and understand how various statements are executed in R

Gain an in-depth understanding of the data structures used in R and learn to import and export data in R

Define, understand, and use the various apply functions and dplyr functions

Understand and use the various graphics in R for data visualization

Gain a basic understanding of various statistical concepts

Understand and use hypothesis testing methods to drive business decisions

Understand and use linear and non-linear regression models and classification techniques for data analysis

Learn and use the various association rules and the Apriori algorithm

Learn and use clustering methods including K-means, DBSCAN, and hierarchical clustering

Course curriculum
Lesson 1 - Introduction to Business Analytics

Lesson 2 - Introduction to R Programming

Lesson 3 - Data Structures

Lesson 4 - Data Visualization

Lesson 5 - Statistics for Data Science I

Lesson 6 - Statistics for Data Science II

Lesson 7 - Regression Analysis

Lesson 8 - Classification

Lesson 9 - Clustering

Lesson 10 - Association

Step 5 - Python for Data Science

Kickstart your learning of Python for Data Science with this introductory course and familiarize yourself with programming. This course is carefully crafted by IBM. Upon completion, you will be able to write your own Python scripts, perform fundamental hands-on data analysis using the Jupyter-based lab environment, and create your own Data Science projects using IBM Watson.

Key Learning Objectives
Write your first Python program by implementing concepts of variables, strings, functions, loops, and conditions

Understand the nuances of lists, sets, dictionaries, conditions and branching, and objects and classes

Work with data in Python, such as reading and writing files and loading, working with, and saving data with Pandas

Course curriculum
Lesson 1 - Python Basics

Lesson 2 - Python Data Structures

Lesson 3 - Python Programming Fundamentals

Lesson 4 - Working with Data in Python

Lesson 5 - Working with NumPy arrays

Step 6 - Data Science with Python

This Data Science with Python course will establish your mastery of Data Science and analytics techniques using Python. In this course, you’ll learn the essential concepts of Python programming and gain in-depth knowledge in data analytics, Machine Learning, data visualization, web scraping, and natural language processing. Python is a required skill for many Data Science positions, so jump-start your career with this interactive, hands-on course.

Key Learning Objectives
Gain an in-depth understanding of Data Science processes, data wrangling, data exploration, data visualization, hypothesis building, and testing

Install the required Python environment and other auxiliary tools and libraries

Understand the essential concepts of Python programming such as data types, tuples, lists, dicts, basic operators, and functions

Perform high-level mathematical computing using the NumPy package and its vast library of mathematical functions

Perform scientific and technical computing using the SciPy package and its sub-packages such as Integrate, Optimize, Statistics, IO, and Weave

Perform data analysis and manipulation using data structures and tools provided in the Pandas package

Gain expertise in Machine Learning using the Scikit-Learn package

Gain an in-depth understanding of supervised learning and unsupervised learning models such as linear regression, logistic regression, clustering, dimensionality reduction, K-NN, and pipeline

Use the Scikit-Learn package for natural language processing

Use the matplotlib library of Python for data visualization

Extract useful data from websites by performing web scraping using Python

Integrate Python with Hadoop, Spark, and MapReduce

Course curriculum
Lesson 1 - Data Science Overview

Lesson 2 - Data Analytics Overview

Lesson 3 - Statistical Analysis and Business Applications

Lesson 4 - Python Environment Setup and Essentials

Lesson 5 - Mathematical Computing with Python (NumPy)

Lesson 6 - Scientific Computing with Python (SciPy)

Lesson 7 - Data Manipulation with Pandas

Lesson 8 - Machine Learning with Scikit-Learn

Lesson 9 - Natural Language Processing with Scikit-Learn

Lesson 10 - Data Visualization in Python using Matplotlib

Lesson 11 - Web Scraping with BeautifulSoup

Lesson 12 - Python Integration with Hadoop MapReduce and Spark

Step 7 - Machine Learning

Simplilearn’s Machine Learning course will make you an expert in Machine Learning, a form of Artificial Intelligence that automates data analysis to enable computers to learn and adapt through experience to do specific tasks without explicit programming. You will master Machine Learning concepts and techniques, including supervised and unsupervised learning, mathematical and heuristic aspects, and hands-on modeling to develop algorithms, preparing you for your role with advanced Machine Learning knowledge.

Key Learning Objectives
Master the concepts of supervised and unsupervised learning, recommendation engines, and time series modeling

Gain practical mastery over principles, algorithms, and applications of Machine Learning through a hands-on approach that includes working on four major end-to-end projects and 25+ hands-on exercises

Acquire thorough knowledge of the statistical and heuristic aspects of Machine Learning

Implement models such as support vector machines, kernel SVM, naive Bayes, decision tree classifier, random forest classifier, logistic regression, K-means clustering, and more in Python

Validate Machine Learning models and decode various accuracy metrics. Improve the final models using another set of optimization algorithms, which include Boosting and Bagging techniques

Comprehend the theoretical concepts and how they relate to the practical aspects of Machine Learning
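
As an illustration of what "decoding accuracy metrics" can involve, the sketch below computes accuracy, precision, recall, and F1 for a binary classifier in plain Python. It is a minimal example under stated assumptions, not the course's own code: the function name and the `y_true` and `y_pred` labels are hypothetical, and in practice a library such as Scikit-Learn would normally be used for this.

```python
def classification_metrics(y_true, y_pred, positive=1):
    """Compute accuracy, precision, recall, and F1 for binary labels."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == positive and p == positive)
    fp = sum(1 for t, p in pairs if t != positive and p == positive)
    fn = sum(1 for t, p in pairs if t == positive and p != positive)
    tn = sum(1 for t, p in pairs if t != positive and p != positive)

    accuracy = (tp + tn) / len(pairs)
    # Guard against division by zero when a class is never predicted or present.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    f1 = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical labels: 1 = positive class, 0 = negative class.
y_true = [1, 0, 1, 1, 0, 0, 1, 0]
y_pred = [1, 0, 0, 1, 0, 1, 1, 0]

metrics = classification_metrics(y_true, y_pred)
print(metrics)
```

Accuracy alone can mislead when one class is rare, which is why precision and recall are reported alongside it.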

Course curriculum
Lesson 1 - Introduction to Artificial Intelligence and Machine Learning

Lesson 2 - Data Wrangling and Manipulation

Lesson 3 - Supervised Learning

Lesson 4 - Feature Engineering

Lesson 5 - Supervised Learning Classification

Lesson 6 - Unsupervised Learning

Lesson 7 - Time Series Modelling

Lesson 8 - Ensemble Learning

Lesson 9 - Recommender Systems

Lesson 10 - Text Mining

Step 8 - Tableau

This Tableau Desktop 10 training will help you master the various aspects of the program and gain skills such as building visualizations, organizing data, and designing dashboards. You will also learn concepts of statistics, mapping, and data connection. It is an essential asset to those wishing to succeed in Data Science.

Key Learning Objectives
Grasp the concepts of Tableau Desktop 10, become proficient with statistics, and build interactive dashboards

Master data sources and data blending, create data extracts, and organize and format data

Master arithmetic, logical, table, and LOD calculations and ad-hoc analytics

Become an expert on visualization techniques such as heat map, tree map, waterfall, Pareto, Gantt chart, and market basket analysis

Learn to analyze data using Tableau Desktop as well as clustering and forecasting techniques

Gain command of mapping concepts such as custom geocoding and radial selections

Master Special Field Types and Tableau Generated Fields and the process of creating and using parameters

Learn how to build interactive dashboards and story interfaces, and how to share your work

Course curriculum
Lesson 1 - Getting Started with Tableau

Lesson 2 - Working with Tableau

Lesson 3 - Deep Diving with Data and Connections

Lesson 4 - Creating Charts

Lesson 5 - Adding Calculations to your Workbook

Lesson 6 - Mapping Data in Tableau

Lesson 7 - Dashboards and Stories

Lesson 8 - Visualizations for an Audience

Step 9 - Natural Language Processing

This Natural Language Processing course will give you a detailed look at the science behind applying Machine Learning algorithms to process large amounts of natural language data. You will learn the concepts of statistical machine translation and neural models, deep semantic similarity model (DSSM), neural knowledge base embedding, deep reinforcement learning techniques, neural models applied in image captioning, and visual question answering using Python’s Natural Language Toolkit (NLTK).

Key Learning Objectives
Apply Deep Learning models to solve machine translation and conversation problems

Implement deep structured semantic models (DSSM) to retrieve information

Understand deep reinforcement learning techniques applied in Natural Language Processing

Use neural models applied in image captioning and visual question answering

Course curriculum
Lesson 1 - Introduction to Natural Language Processing

Lesson 2 - Feature Engineering on Text Data

Lesson 3 - Natural Language Understanding Techniques

Lesson 4 - Natural Language Generation

Lesson 5 - Natural Language Processing Libraries

Lesson 6 - Natural Language Processing with Machine Learning and Deep Learning

Lesson 7 - Speech Recognition Technique

Step 10 - Data Science Capstone Project

This Data Science Capstone project will give you an opportunity to implement the skills you learned throughout this Program. Through dedicated mentoring sessions, you’ll learn how to solve a real-world, industry-aligned Data Science problem, from data processing and model building to reporting your business results and insights. The project is the final step in the learning path and will enable you to showcase your expertise in Data Science to future employers.

Key Learning Objectives
Simplilearn’s online Data Science Capstone course will bring you through the Data Science decision cycle, including data processing, building a model, and representing results. The project milestones are:

Data Processing - In this step, you will apply various data processing techniques to make raw data meaningful.

Model Building - You will leverage techniques such as regression and decision trees to build Machine Learning models that enable accurate and intelligent predictions. You may explore Python, R, or SAS to develop your model. You will follow the complete model-building exercise, from splitting the data to testing and validating it using the k-fold cross-validation process.

Model Fine-tuning - You will apply various techniques to improve the accuracy of your model and select the champion model that provides the best accuracy.

Dashboarding and Representing Results - As the final step, you will be required to export your results into a dashboard with meaningful insights using Tableau.
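
The k-fold cross-validation process named in the Model Building milestone can be sketched in a few lines of Python. This is a minimal illustration using only the standard library, not the project's prescribed method: the function names, the one-variable least-squares model, and the toy data are all assumptions made for the example, and a real project would more likely use a library such as Scikit-Learn.

```python
import random
import statistics


def k_fold_indices(n_samples, k, seed=0):
    """Shuffle indices 0..n_samples-1 and split them into k nearly equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]


def fit_line(xs, ys):
    """Ordinary least-squares fit of y = slope * x + intercept."""
    mean_x, mean_y = statistics.mean(xs), statistics.mean(ys)
    numerator = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    denominator = sum((x - mean_x) ** 2 for x in xs)
    slope = numerator / denominator
    return slope, mean_y - slope * mean_x


def cross_validate(xs, ys, k=5):
    """Train on k-1 folds, test on the held-out fold, and return each fold's MSE."""
    errors = []
    for held_out in k_fold_indices(len(xs), k):
        held = set(held_out)
        train = [i for i in range(len(xs)) if i not in held]
        slope, intercept = fit_line([xs[i] for i in train], [ys[i] for i in train])
        squared_errors = [(ys[i] - (slope * xs[i] + intercept)) ** 2 for i in held_out]
        errors.append(statistics.mean(squared_errors))
    return errors


# Hypothetical toy data: y is exactly 3x + 2, so each fold's error should be near zero.
xs = list(range(20))
ys = [3 * x + 2 for x in xs]

fold_errors = cross_validate(xs, ys, k=5)
print("mean cross-validated MSE:", statistics.mean(fold_errors))
```

Every sample is held out exactly once, so the averaged error reflects performance on data the model did not see during fitting. That averaged score is the kind of figure you would compare when selecting a champion model in the fine-tuning milestone.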

Elective Courses

IBM Watson for Chatbots
This course provides a practical introduction to building a chatbot with Watson Assistant without writing any code and then deploying your chatbot to a real website in less than five minutes. It will teach you to plan, build, test, analyze, and deploy your first chatbot.

Machine Learning with R
In this course, you will learn how to write R code, learn about R’s data structures, and create your own functions. With the knowledge gained, you will be ready to undertake your very first data analysis. You’ll further learn about Supervised vs Unsupervised Learning, look into how Statistical Modeling relates to Machine Learning, and do a comparison of each using R.

Core Java Certification Training
If you’re looking to master functions of Big Data and Hadoop, a core fundamental of your training will be to understand Core Java. Java, by Oracle, is used on a variety of platforms, from gaming consoles to laptops and mobile technology. Java is considered a central platform due to having its own runtime environment. After this course, you will understand the methods related to Big Data and Java and have a basic understanding of Java 8 and its appropriate use cases. Gain expertise in basic concepts of Core Java and acquire a complete understanding of JDBC architecture and the JUnit Framework.

Big Data Hadoop and Spark Developer
Learn how to work with Big Data and its components. Deep-dive into Hadoop and its ecosystem, including MapReduce, HDFS, YARN, HBase, Impala, Sqoop, and Flume. This course provides an introduction to Apache Spark, which is the next step after Hadoop. After completing this course, you will be prepared to pass the Cloudera CCA175 certification and to embrace this technology as part of your role as a Data Scientist.

Introduction to Artificial Intelligence
The Introduction to Artificial Intelligence course is designed to help learners decode the mystery of Artificial Intelligence and its business applications. The course provides an overview of Artificial Intelligence concepts and workflows, Machine Learning and Deep Learning, and performance metrics. You’ll learn the difference between supervised, unsupervised, and reinforcement learning, be exposed to use cases, and see how clustering and classification algorithms help identify Artificial Intelligence business applications.

Data Science with SAS
The Data Science with SAS training course is designed to enable learners to become adept in analytics techniques using SAS Data Science tools. This online course covers a holistic overview of analytics and the graphical user interface (GUI). You will learn how to combine dataset methods, understand select statements and joins in SQL, and comprehend the need for macro variables. This online training course will also teach you how to apply data manipulation and optimization techniques; advanced statistical concepts like clustering, linear regression, and decision trees; data analysis methods to solve real-world business problems; and predictive modeling techniques.

Certificates

Upon completion of this Post Graduate Program in Data Science by Purdue University, you will receive the Post Graduate certificate from Purdue University and IBM. You will also receive certificates from Simplilearn for the Data Science courses in the learning path. These certificates will testify to your skills as an expert in Data Science.

Advisory Board Members

Gerry McCartney
Executive Vice President for Purdue Online
and Oesterle Professor of Information
Technology at Purdue University

Gerry McCartney is executive vice president for Purdue Online and Oesterle Professor of Information Technology at Purdue University. McCartney spearheads the online education initiative, Purdue Online, adopted in June 2018 by Purdue University President Mitch Daniels and the Board of Trustees. Under McCartney, Purdue’s online offerings are rooted in market research and online analytics to provide the best experience and most value for students seeking the same kind of world-class education traditionally available from Purdue. McCartney previously held executive management positions at Purdue and the Wharton School of the University of Pennsylvania.

Ronald Van Loon
Big Data Expert, Director - Advertisement

Named by Onalytica as one of the three most influential people in Big Data, Ronald van Loon is an author for a number of leading Big Data and Data Science websites, including Datafloq, Data Science Central, and The Guardian. He is also a renowned speaker at industry events.

USA
Simplilearn Americas, Inc.
201 Spear Street, Suite 1100, San Francisco, CA 94105
United States
Phone No: +1-844-532-7688

INDIA
Simplilearn Solutions Pvt Ltd.
# 53/1 C, Manoj Arcade, 24th Main, Harlkunte
2nd Sector, HSR Layout
Bangalore - 560102
Call us at: 1800-212-7688

www.simplilearn.com
