
Master of Science in

DATA SCIENCE
Get the Full Picture


Appearances Are Often Deceptive


DELVE INTO DATA

Table of Contents

About upGrad
Why upGrad?
Program Highlights
Faculty and Industry Experts
upGrad Learning Experience
Industry Projects
Learning Path
Master’s Curriculum
Meet the Class
Career Support
Experience upGrad Offline
Hear from Our Learners
Program Details and Admission Process



About upGrad

Online education is a fundamental disruption that will have a far-reaching
impact. upGrad was founded taking this into consideration. upGrad is an online
education platform to help individuals develop their professional potential in
the most engaging learning environment.

Since its inception, upGrad has delivered over 20 million hours of learning,
delivering programs by collaborating with universities across the world,
including Liverpool John Moores University (LJMU), IIT Madras, IIIT Bangalore
and Deakin Business School among others. And it doesn’t end there.

upGrad, in collaboration with IIIT Bangalore, a renowned university offering
programs specialising in Data Science, Machine Learning and Artificial
Intelligence, is excited to offer a one-of-its-kind, academically rigorous and
industrially relevant Master of Science in Data Science.

The faculty brings an average of 15+ years of experience and covers the
conceptual depths of topics such as Data Science, Machine Learning and AI, and
Big Data Analytics. These will be complemented by industry-relevant case
studies from major industry verticals, delivered by industry leaders with 8+
years of experience from upGrad’s industry network.

Furthermore, our strong placement network, industry mentorship and the
credibility of a Master’s Degree will provide you with just the right push to
accelerate your career in Data Science!

Why upGrad?

INR 1.23 CR
Highest Salary

433%
Highest Hike

300+
Hiring Partners

50%
Avg Salary Hike

700+
Industry Experts

2+ Million
Learners

Program Highlights
Dual Accreditation and Alumni Status
Get certified by IIITB and LJMU, UK and
gain dual alumni status on successful
completion of the program along
with access to LJMU’s digital library.

Programming Language & Tools
Learn 5+ Programming Languages and
Tools like Python, Tableau, MySQL and
more. Optional modules for further
upskilling.

building, career fairs, industry
mentors and much more.
For the Industry, by the Industry
Learn from 60+ case studies and
industry experts who mentor
you throughout the program.

5 Specialisations
Choose from 5 specialisations on the
basis of your background and career
aspirations and get the learning you want.

Live Classroom Session


Live Classroom hour with Dr Manoj
Jayabalan, Post-Doctoral Fellow at
LJMU, to solve queries related to
dissertation.

Global Access to Jobs


With 360-degree career support
and dual alumni status, gain global
access to jobs.

Faculty and Industry Experts

Dr. Debabrata Das
Director, IIITB
Dr. Debabrata Das is Director of IIITB. He received his PhD from IIT-KGP. His main areas of research are IoT and Wireless Access Network.

Chandrashekar Ramanathan
Dean Academics, IIITB
Prof. Chandrashekar has a PhD from Mississippi State University and experience of over 10 years in several multinational organisations.

S. Anand
CEO, Gramener
A gold medallist from IIM Bangalore, an alumnus of IIT Madras and London Business School, Anand is among the top 10 data scientists in India with 20 years of experience.

Tricha Anjali
Ex-Associate Dean, IIIT-B
Prof Tricha has a Ph.D from Georgia Tech as well as an integrated M.Tech. from IIT Bombay. Her research interests include computer networks.

Behzad Ahmadi
Data Scientist, Walmart Labs
An M. Tech graduate and PhD from Jersey Institute of Technology, Behzad has extensive experience in Data Science and ML.

Anshuman Gupta
Director - Data Science, Pitney Bowes
He has a PhD (Dual) from Penn State University as well as a BTech Degree from IIT Bombay.

Prof. G. Srinivasaraghavan
Professor, IIITB
Prof. Srinivasaraghavan has a PhD in Computer Science from IIT-K and 18 years of experience with Infosys and several other MNCs.

Mirza Rahim Baig
Ex-Lead Analyst, Flipkart
Mirza is a veteran professional with 10+ years of experience in applications of data science and machine learning in e-commerce and healthcare.

Sajan Kedia
Ex-Data Science Lead, Myntra
Sajan graduated from IIT, BHU and has tons of experience in Data Science, Big Data, Spark, Machine Learning and Natural Language Processing.

Rajesh Sabapathy
Sr Director, Data Science, UHG Group
Rajesh has 10+ years of experience leading Data Science teams in various domains solving complex problems using Deep Learning & ML techniques.

Prof. Dhiya Al-Jumeily
Professor - AI, LJMU
A Senior Member of the IEEE and a Chartered IT Professional. He is a fellow of the UK Higher Education Academy.

Bijoy Kumar Khandelwal
COO, Actify Data Labs
Bijoy comes with a deep understanding of the private and cloud architectures and has helped numerous companies make the transition.

Ujjyaini Mitra
Head of Analytics, Zee5
An alumnus of McKinsey and Co, Flipkart and Bharati Airtel with over 11 years of experience.

Ankit Jain
ML Engineering Manager, Meta
An alumnus of IIT Bombay, UCB, and HBS with over 9 years of experience. Ankit has been recognised as 40 Under40 Data Scientist for 2022.

Dr. Atif Waraich
Faculty - Computer Science, LJMU
A Senior Faculty of Engineering and Technology at LJMU who has multiple publications in the healthcare domain.

Prof. Paulo Lisboa
Head of Dept - Applied Mathematics, LJMU
Studied Mathematical Physics at LU and was the chairman of Industrial Mathematics at LJMU in 1996 and Head of Graduate School in 2002.

Dr Gabriela Czanner
Faculty - Engineering and Technology, LJMU
A Senior Lecturer in Statistics and Data Science at the Department of Applied Mathematics at LJMU. Her research focus is Advanced Statistics for Decision Support.

upGrad Learning Experience

Student Support Team
• We have a dedicated Student Support Team
for handling your queries via email or call-back requests
• This support team is available 7 days a week,
24 hours a day

Industry Mentors
• Receive unparalleled guidance from industry
mentors, teaching assistants and graders
• Receive one-on-one feedback on submissions
and personalised feedback on improvement

Industry Networking
• Live sessions by experts on various
industry topics
• One-on-one discussion and feedback
sessions with industry mentors

upGrad BaseCamp (PRE-COVID)
• Fun-packed, informative and career-building
workshop sessions by industry
professionals and professors
• Group activities with your peers and
alumni

Expert Feedback
• Personalised expert feedback on
assignments and projects
• Regular live sessions by experts to
clarify concept-related doubts
Q&A Forum
• Timely doubt resolution by industry
experts and peers
• 100% expert-verified responses to
ensure quality learning

New Additions

Career Essential Soft-skills Program
• Excel in your personal & professional life with
upGrad’s Soft Skills Program
• Study three fundamental skills: Interview
& Job Search, Corporate & Business
Communication, and Problem Solving
• Get access to 40+ learner hours of soft
skills content delivered by the best faculty
& industry experts

30-Hour Programming Bootcamp for Non-tech Learners
• Non-tech background? No need to fear
programming anymore
• A 30-hour Python Programming bootcamp,
focusing on developing basic and intermediate
Python programming concepts to assist
non-tech learners
• A blended learning experience delivered via
interactive live sessions and assessments

Industry Projects

• IMDb Movie Analysis
• Uber Supply-Demand Gap
• Lead Scoring
• Fraud Detection
• Creditworthiness of Customers
• Speech Recognition
• Image Captioning
• Social Media Listening
• Telecom Churn
• Interactive Market Campaign Analysis
• Retail Giant Sales Forecasting
• And many more!

Learning Path

Preparatory Course: 0 week
Data Toolkit: 12 weeks
Machine Learning: 10 weeks
Choose any of the 5 Specialisations: 22 weeks (with 4 weeks of Capstone)

• Natural Language Processing
Tools: Python, Excel
Executive PG Programme in Data Science (Natural Language Processing)

• Deep Learning
Tools: Python, Excel, TensorFlow
Executive PG Programme in Data Science (Deep Learning)

• Business Analytics
Tools: Python, MySQL, Excel
Executive PG Programme in Data Science (Business Analytics)

• Business Intelligence/Data Analytics
Tools: Python, Power BI, Excel, MySQL, MongoDB, Shiny, Tableau
Executive PG Programme in Data Science (Business Intelligence/Data Analytics)

• Data Engineering
Tools: Hadoop, HBase, Sqoop, Hive, Flume, PySpark, Spark, Airflow
Executive PG Programme in Data Science (Data Engineering)

Research Methodology
Dissertation

MSc - LJMU (Natural Language Processing)
MSc - LJMU (Deep Learning)
MSc - LJMU (Business Analytics)
MSc - LJMU (Business Intelligence/Data Analytics)
MSc - LJMU (Data Engineering)

Master of Science
in Data Science
COMMON CONTENT
PRE-PROGRAMME PREPARATORY CONTENT
DATA ANALYSIS IN EXCEL

Taught by one of the most renowned data scientists in the country (S. Anand, CEO, Gramener), this module takes you from a beginner-level Excel user to an almost professional user.

1. INTRODUCTION TO EXCEL
2. DATA ANALYSIS IN EXCEL - I: FUNCTIONS, FORMULAE, AND CHARTS
3. DATA ANALYSIS IN EXCEL - II: PIVOTS AND LOOKUPS

ANALYTICS PROBLEM SOLVING

This module covers concepts of the CRISP-DM framework for business problem-solving.

1. THE CRISP-DM FRAMEWORK - BUSINESS AND DATA UNDERSTANDING
2. CRISP-DM FRAMEWORK - DATA PREPARATION, MODELLING, EVALUATION AND DEPLOYMENT

COURSE 1: DATA TOOLKIT


INTRODUCTION TO PYTHON (2 WEEKS)

Build a foundation for the most in-demand programming language of the 21st century.

1. UNDERSTANDING THE UPGRAD CODING CONSOLE
2. BASICS OF PYTHON
3. DATA STRUCTURES IN PYTHON
4. CONTROL STRUCTURE AND FUNCTIONS IN PYTHON
5. OOP IN PYTHON

*The Curriculum is subject to change as per the inputs from university or industry experts

PROGRAMMING IN PYTHON (1 WEEK)

Learn how to approach and solve logical problems using programming.

1. LOGIC AND SYNTAX BUILDING
2. DATA STRUCTURES: LISTS, STRINGS, DICTIONARIES, AND STACKS
3. TIME COMPLEXITY
4. SEARCHING AND SORTING
5. TWO POINTERS
6. RECURSION

PYTHON FOR DATA SCIENCE (1 WEEK)

Learn how to manipulate datasets in Python using Pandas, which is the most powerful library for data preparation and analysis.

1. INTRODUCTION TO NUMPY
2. INTRODUCTION TO MATPLOTLIB
3. INTRODUCTION TO PANDAS
4. GETTING AND CLEANING DATA

DATA VISUALIZATION IN PYTHON (1 WEEK)

Humans are visual learners and hence no task related to data is complete without visualisation. Learn to plot and interpret various graphs in Python and observe how they make data analysis and drawing insights easier.

1. INTRODUCTION TO DATA VISUALIZATION
2. DATA VISUALISATION USING SEABORN

EXPLORATORY DATA ANALYSIS (1 WEEK)

Learn how to find and analyse the patterns in the data to draw actionable insights.

1. DATA SOURCING
2. DATA CLEANING
3. UNIVARIATE ANALYSIS
4. BIVARIATE ANALYSIS AND MULTIVARIATE ANALYSIS


CREDIT EDA CASE STUDY (1 WEEK)

Solve a real industry problem through the concepts learnt in exploratory data analysis.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

INFERENTIAL STATISTICS (1 WEEK)

Build a strong statistical foundation and learn how to ‘infer’ insights from a huge population using a small sample.

1. BASICS OF PROBABILITY
2. DISCRETE PROBABILITY DISTRIBUTIONS
3. CONTINUOUS PROBABILITY DISTRIBUTIONS
4. CENTRAL LIMIT THEOREM

HYPOTHESIS TESTING (1 WEEK)

Understand how to formulate and validate hypotheses for a population to solve real-life business problems.

1. CONCEPTS OF HYPOTHESIS TESTING - I: NULL AND ALTERNATE HYPOTHESIS, MAKING A DECISION, AND CRITICAL VALUE METHOD
2. CONCEPTS OF HYPOTHESIS TESTING - II: P-VALUE METHOD AND TYPES OF ERRORS
3. INDUSTRY DEMONSTRATION OF HYPOTHESIS TESTING: TWO-SAMPLE MEAN AND PROPORTION TEST, A/B TESTING

DATA ANALYSIS USING SQL (1 WEEK)

Data in companies is definitely not stored in Excel sheets! Learn the fundamentals of databases and extract information from an RDBMS using the structured query language.

1. DATABASE DESIGN
2. DATABASE CREATION IN MYSQL WORKBENCH
3. QUERYING IN MYSQL
4. JOINS AND SET OPERATIONS


ADVANCED SQL & BEST PRACTICES (1 WEEK)

Apply advanced SQL concepts like windowing and procedures to derive insights from data and answer pertinent business questions.

1. WINDOW FUNCTIONS
2. CASE STATEMENTS, STORED ROUTINES AND CURSORS
3. QUERY OPTIMISATION AND BEST PRACTICES
4. PROBLEM-SOLVING USING SQL

SQL ASSIGNMENT: RSVP MOVIES (1 WEEK)

In this assignment, you will work on a movies dataset using SQL to extract exciting insights.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 2 - MACHINE LEARNING I


LINEAR REGRESSION (2 WEEKS)

Venture into the machine learning community by learning how one variable can be predicted using several other variables through a housing dataset where you will predict the prices of houses based on various factors.

1. SIMPLE LINEAR REGRESSION
2. SIMPLE LINEAR REGRESSION IN PYTHON
3. MULTIPLE LINEAR REGRESSION
4. MULTIPLE LINEAR REGRESSION IN PYTHON
5. INDUSTRY RELEVANCE OF LINEAR REGRESSION

LINEAR REGRESSION ASSIGNMENT (1 WEEK)

Build a model to understand the factors on which the demand for bike-sharing systems depends and help a company optimise its revenue.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION


LOGISTIC REGRESSION (2 WEEKS)

Learn your first binary classification technique by determining which customers of a telecom operator are likely to churn and which are not, to help the business retain customers.

1. UNIVARIATE LOGISTIC REGRESSION
2. MULTIVARIATE LOGISTIC REGRESSION: MODEL BUILDING AND EVALUATION
3. LOGISTIC REGRESSION: INDUSTRY APPLICATIONS

CLASSIFICATION USING DECISION TREES (1 WEEK)

Learn how the human decision-making process can be replicated using a decision tree and tune it to suit your needs.

1. INTRODUCTION TO DECISION TREES
2. ALGORITHMS FOR DECISION TREE CONSTRUCTION
3. HYPERPARAMETER TUNING IN DECISION TREES

UNSUPERVISED LEARNING: CLUSTERING (1 WEEK)

Learn how to group elements into different clusters when you don’t have any pre-defined labels to segregate them, through K-means clustering, hierarchical clustering, and more.

1. INTRODUCTION TO CLUSTERING
2. K-MEANS CLUSTERING
3. HIERARCHICAL CLUSTERING
4. OTHER FORMS OF CLUSTERING: K-MODE, K-PROTOTYPE, DBSCAN

BASICS OF NLP AND TEXT MINING (1 WEEK)

Do you get annoyed by the constant spam in your mailbox? Wouldn’t it be nice if we had a program to check your spelling? In this module, learn how to build a spell checker & spam detector using techniques like phonetic hashing, bag-of-words, TF-IDF, etc.

1. REGEX AND INTRODUCTION TO NLP
2. BASIC LEXICAL PROCESSING
3. ADVANCED LEXICAL PROCESSING


BUSINESS PROBLEM SOLVING (1 WEEK)

Learn how to approach open-ended real-world problems using data as a lever to draw actionable insights.

1. INTRODUCTION TO BUSINESS PROBLEM SOLVING
2. BUSINESS PROBLEM SOLVING: CASE STUDY DEMONSTRATIONS

CASE STUDY: LEAD SCORING (1 WEEK)

Help the Sales team of your company identify which leads are worth pursuing through this classification case study.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

SPECIALISATION: DEEP LEARNING


COURSE 3 - MACHINE LEARNING II
BAGGING & RANDOM FOREST (1 WEEK)

Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.

1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

BOOSTING (1 WEEK)

Learn about ensemble modelling through bagging and boosting and understand how weak algorithms can be transformed into stronger ones.

1. INTRODUCTION TO BOOSTING AND ADABOOST
2. GRADIENT BOOSTING


MODEL SELECTION & GENERAL ML TECHNIQUES (1 WEEK)

Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.

1. PRINCIPLES OF MODEL SELECTION
2. MODEL EVALUATION
3. MODEL SELECTION: BEST PRACTICES

PRINCIPAL COMPONENT ANALYSIS (1 WEEK)

Understand important concepts related to dimensionality reduction, the basic idea and the learning algorithm of PCA, and its practical applications on supervised and unsupervised problems.

1. PRINCIPAL COMPONENT ANALYSIS AND SINGULAR VALUE DECOMPOSITION
2. PRINCIPAL COMPONENT ANALYSIS IN PYTHON

ADVANCED REGRESSION (1 WEEK)

In this module, take a more advanced look at regression models and learn the concepts related to regularization.

1. GENERALIZED LINEAR REGRESSION
2. REGULARIZED REGRESSION

ADVANCED ML CASE STUDY (1 WEEK)

Build a regularized regression model to understand the most important variables to predict the house prices in Australia.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION


COURSE 4 - ADVANCED MACHINE LEARNING AND DEEP LEARNING


TIME SERIES ANALYSIS (2 WEEKS)

In this module, you will learn how to analyse and forecast a series that varies with time.

1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. WORKING WITH STATIONARY TIME SERIES
3. END-TO-END ANALYSIS OF TIME SERIES

INTRODUCTION TO NEURAL NETWORKS AND ANN (3 WEEKS)

Learn the most sophisticated and cutting-edge technique in machine learning - Artificial Neural Networks or ANNs.

1. STRUCTURE OF NEURAL NETWORKS
2. FEED FORWARD IN NEURAL NETWORKS
3. BACKPROPAGATION IN NEURAL NETWORKS
4. MODIFICATIONS TO NEURAL NETWORKS
5. HYPERPARAMETER TUNING IN NEURAL NETWORKS

NEURAL NETWORK ASSIGNMENT (1 WEEK)

Build a neural network from scratch in TensorFlow to identify the type of skin cancer from images.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION


COURSE 5 - ADVANCED DEEP LEARNING AND COMPUTER VISION


CONVOLUTIONAL NEURAL NETWORKS (2 WEEKS)

Learn the basics of CNN and OpenCV and how to classify image data using various architectures which you will then implement using Python and Keras.

1. INTRODUCTION TO CONVOLUTIONAL NEURAL NETWORKS
2. BUILDING CNNS WITH PYTHON AND KERAS
3. CNN ARCHITECTURES AND TRANSFER LEARNING
4. STYLE TRANSFER AND OBJECT DETECTION

CONVOLUTIONAL NEURAL NETWORKS - INDUSTRY APPLICATIONS (1 WEEK)

Apply CNNs to Computer Vision tasks like detecting anomalies in chest X-Ray scans.

1. INDUSTRY DEMONSTRATION: USING CNNS WITH FLOWERS IMAGES
2. INDUSTRY DEMONSTRATION: USING CNNS WITH X-RAY IMAGES

OBJECT DETECTION & IMAGE SEGMENTATION (OPTIONAL) (0 WEEK)

Learn the applications of DL in computer vision through industry-relevant detection algorithms such as RCNNs, YOLO and SSD.

1. FUNDAMENTALS OF OBJECT DETECTION
2. REGION-BASED DETECTORS
3. ONE-SHOT DETECTORS
4. CUSTOM OBJECT DETECTION
5. SEMANTIC SEGMENTATION


RECURRENT NEURAL NETWORKS (1 WEEK)

Ever wondered what goes on behind machine translation, sentiment analysis and speech recognition? Learn how RNNs help in these areas that involve sequential data like text, speech, videos, and a lot more.

1. WHAT MAKES A NEURAL NETWORK RECURRENT
2. VARIANTS OF RNNS: BIDIRECTIONAL RNNS AND LSTMS
3. BUILDING RNNS IN PYTHON

GESTURE RECOGNITION (2 WEEKS)

Make a Smart TV system which can control the TV with the user’s hand gestures as the remote control.

1. TWO ARCHITECTURES: 3D CONVS AND CNN-RNN STACK
2. UNDERSTANDING GENERATORS
3. STARTER CODE WALKTHROUGH
4. PROBLEM STATEMENT AND FINAL SUBMISSION

COURSE 6 - CAPSTONE PROJECT


CAPSTONE PROJECT (4 WEEKS)

Choose from a range of real-world, industry-woven projects on advanced topics like Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media Listening and Speech Recognition, among many others.

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION


SPECIALISATION: NATURAL LANGUAGE PROCESSING
COURSE 3 - MACHINE LEARNING II
BAGGING & RANDOM FOREST (1 WEEK)

Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.

1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

BOOSTING (1 WEEK)

Learn about ensemble modelling through bagging and boosting and understand how weak algorithms can be transformed into stronger ones.

1. INTRODUCTION TO BOOSTING AND ADABOOST
2. GRADIENT BOOSTING

MODEL SELECTION & GENERAL ML TECHNIQUES (1 WEEK)

Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.

1. PRINCIPLES OF MODEL SELECTION
2. MODEL EVALUATION
3. MODEL SELECTION: BEST PRACTICES

PRINCIPAL COMPONENT ANALYSIS (1 WEEK)

Understand important concepts related to dimensionality reduction, the basic idea and the learning algorithm of PCA, and its practical applications on supervised and unsupervised problems.

1. PRINCIPAL COMPONENT ANALYSIS AND SINGULAR VALUE DECOMPOSITION
2. PRINCIPAL COMPONENT ANALYSIS IN PYTHON


ADVANCED REGRESSION (1 WEEK)

In this module, take a more advanced look at regression models and learn the concepts related to regularization.

1. GENERALIZED LINEAR REGRESSION
2. REGULARIZED REGRESSION

ADVANCED ML CASE STUDY (1 WEEK)

Build a regularized regression model to understand the most important variables to predict the house prices in Australia.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4 - ADVANCED MACHINE LEARNING AND NATURAL LANGUAGE PROCESSING
TIME SERIES FORECASTING (2 WEEKS)

In this module, you will learn how to analyse and forecast a series that varies with time.

1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. WORKING WITH STATIONARY TIME SERIES
3. END-TO-END ANALYSIS OF TIME SERIES

NEURAL NETS FOR NLP (1 WEEK)

Learn the most sophisticated and cutting-edge technique in machine learning - Artificial Neural Networks or ANNs.

1. UNDERSTANDING NEURAL NETWORKS
2. LOSS FUNCTIONS AND BACKPROPAGATION
3. UNDERSTANDING TENSORFLOW
4. CASE STUDY: IMDB MOVIE REVIEW CLASSIFICATION


SYNTACTIC PROCESSING (2 WEEKS)

Learn how to analyse the syntax or the grammatical structure of sentences using POS tagging and Dependency parsing.

1. INTRODUCTION TO SYNTACTIC PROCESSING
2. PARSING
3. INFORMATION EXTRACTION
4. CONDITIONAL RANDOM FIELDS

SYNTACTIC PROCESSING ASSIGNMENT (1 WEEK)

Use techniques such as POS tagging and Dependency parsing to extract information from unstructured text data.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5 - ADVANCED NATURAL LANGUAGE PROCESSING


SEMANTIC PROCESSING (2 WEEKS)

Learn the most interesting area in the field of NLP and understand different techniques like word embeddings and topic modelling to build an application that extracts opinions about socially relevant issues.

1. INTRODUCTION TO SEMANTIC PROCESSING
2. DISTRIBUTIONAL SEMANTICS
3. INDUSTRY APPLICATIONS OF DISTRIBUTIONAL SEMANTICS
4. TOPIC MODELLING

APPLIED DL IN NLP (2 WEEKS)

Apply the concepts of DL in natural language processing problems through encoder-decoder architecture, NMTs, and implement them in TensorFlow.

1. INTRODUCTION TO MACHINE TRANSLATION
2. ATTENTION-BASED NMT MODEL
3. CUSTOM MODEL BUILDING IN TENSORFLOW


CASE STUDY: AUTOMATIC TICKET CLASSIFICATION (2 WEEKS)

Categorise support tickets with the help of Unsupervised learning and Topic modelling.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 6 - CAPSTONE PROJECT


CAPSTONE PROJECT (4 WEEKS)

Choose from a range of real-world, industry-woven projects on advanced topics like Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media Listening and Speech Recognition, among many others.

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION

SPECIALISATION: BUSINESS ANALYTICS


COURSE 3 - ADVANCED MACHINE LEARNING
BAGGING & RANDOM FOREST (1 WEEK)

Learn how powerful ensemble algorithms can improve your classification models by building random forests from decision trees.

1. POPULAR ENSEMBLES
2. INTRODUCTION TO RANDOM FORESTS
3. FEATURE IMPORTANCE IN RANDOM FORESTS
4. RANDOM FORESTS IN PYTHON

MODEL SELECTION & GENERAL ML TECHNIQUES (2 WEEKS)

Learn the pros and cons of simple and complex models and the different methods for quantifying model complexity, along with general machine learning techniques like feature engineering, model evaluation, and many more.

1. PRINCIPLES OF MODEL SELECTION
2. MODEL BUILDING AND EVALUATION
3. FEATURE ENGINEERING
4. CLASS IMBALANCE


TIME SERIES FORECASTING (2 WEEKS)

In this module, you will learn how to analyse and forecast a series that varies with time.

1. INTRODUCTION TO TIME SERIES AND ITS COMPONENTS
2. SMOOTHING TECHNIQUES
3. INTRODUCTION TO AR MODELS
4. BUILDING AR MODELS

MODEL SELECTION CASE STUDY (1 WEEK)

Apply your business acumen to the newly learnt machine learning techniques, and select the model most appropriate for a given business scenario.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 4 - DATA VISUALISATION AND STORYTELLING


VISUALISATION USING TABLEAU (1 WEEK)

Learn basic visualisation techniques using the most in-demand visualisation tool in the industry.

1. DATA EXPLORATION IN TABLEAU
2. VISUALISING AND ANALYSING DATA IN TABLEAU WITH BASIC PLOTS

ADVANCED EXCEL (1 WEEK)

Learn the advanced concepts in Excel and start to perform data analysis like a pro!

1. EXCEL FUNCTIONS
2. DATA ANALYSIS IN EXCEL
3. ADVANCED TOOLS AND VISUALISATIONS


VISUALISATION USING POWERBI (1 WEEK)

Take your visualisation game a step forward by understanding how to operate PowerBI.

1. POWERBI: INTRODUCTION AND SETUP
2. VISUALISING AND ANALYSING DATA IN POWERBI
3. DATA TRANSFORMATIONS USING POWERBI

STRUCTURED PROBLEM SOLVING USING FRAMEWORKS (1 WEEK)

Learn how to attack a business problem using various structured frameworks like 5W, 5WHYs, and SPIN.

1. INTRODUCTION TO STRUCTURED PROBLEM SOLVING
2. INTERVIEWING AND FRAMEWORKS - I: 5W AND 5WHYS
3. INTERVIEWING AND FRAMEWORKS - II: SPIN
4. INDUSTRY DEMONSTRATIONS ON FRAMEWORKS
5. UNDERSTANDING BUSINESS MODEL CANVAS AND ISSUE TREE FRAMEWORK
6. INDUSTRY DEMONSTRATIONS ON ISSUE TREE FRAMEWORK
7. SPECIALIZED FRAMEWORKS FOR BUSINESS PROBLEMS: 7PS, 5CS, ETC.

DATA STORYTELLING (1 WEEK)

Learn how to effectively strategise, communicate, and fine-tune your data analysis projects, understand how to optimally present your findings to technical and non-technical stakeholders, and upgrade your storytelling skills.

1. INTRODUCTION TO DATA STORYTELLING
2. COMPONENTS OF A GOOD STORY WITH DATA - UNDERSTANDING YOUR STAKEHOLDER AND STAKEHOLDER EMPATHY, LEVELS OF DETAILS FOR DIFFERENT STAKEHOLDERS - CXO/LEADERSHIP VS TEAM PRESENTATIONS, VISUALS, ETC.
3. GOLDEN RULES FOR DATA STORYTELLING


AIRBNB CASE STUDY (1 WEEK)

Use your newly learnt UI tools skills to analyse an AirBnB dataset to make important business decisions. But the analysis is just a small part; can you also effectively present it using Data Storytelling to the right stakeholders?

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 5: SOLVING BUSINESS REQUIREMENTS


OPERATIONS RESEARCH IN EXCEL (1 WEEK)

Learn about the world of operations research through linear and integer optimisations.

1. INTRODUCTION & CONCEPTS OF OPTIMISATION
2. OPTIMISATION USING EXCEL
3. OPTIMISATION USING PYTHON
4. OR IN INDUSTRY - WAREHOUSE PROBLEM, ASSIGNMENT PROBLEM, JOB-SHOP SCHEDULING, ETC.

DATA ARCHITECTURE (1 WEEK)

Given a broad business challenge, describe how you would approach the development of a Machine Learning Architecture strategy using the Structured Problem Solving Method.

1. COMPONENTS OF EFFECTIVE DATA ARCHITECTURE
2. TECHNOLOGY AND INFRASTRUCTURE
3. TOOLS TO BUILD AN EFFECTIVE DATA ARCHITECTURE

DATA STRATEGY (2 WEEKS)

Understand how to identify the right business problems (Revenue/Cost Perspective, Value Chain) using the DS project assessment framework. You will also learn how to manage a product from production to deployment and understand the overall lifecycle management of an Analytics/DS project.

1. BACKGROUND OF DATA STRATEGY
2. CORE OF DATA STRATEGY-I
3. CORE OF DATA STRATEGY-II
4. CASE STUDIES FOR DATA STRATEGY


BUSINESS CASE STUDY (2 WEEKS)

Understand how a project in the industry is taken up and solved through a comprehensive business case study.

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

COURSE 6 - CAPSTONE PROJECT


CAPSTONE PROJECT (4 WEEKS)

Solve an end-to-end real-life industry problem from a wide variety of domains.

1. POWER BI - OPTIONAL
2. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
3. PROBLEM STATEMENT
4. EVALUATION RUBRIC
5. MID SUBMISSION
6. FINAL SUBMISSION
7. SOLUTION

SPECIALISATION: BUSINESS INTELLIGENCE/DATA ANALYTICS
COURSE 3: ADVANCED DBS AND BIG DATA ANALYTICS
DATA MODELLING (1 WEEK)

In this module, you will learn and use data modelling on a dataset to solve a business problem.

1. DATABASE DESIGN RECAP
2. BUILDING BLOCKS OF DATA MODELLING
3. PROBLEM SOLVING USING DATA MODELLING
4. DATA MODELLING: OPTIONAL ASSIGNMENT


ADVANCED SQL AND BEST PRACTICES (1 WEEK)

Apply advanced SQL concepts like windowing and procedures to derive insights from data and answer pertinent business questions.

1. WINDOW FUNCTIONS
2. CASE STATEMENTS, STORED ROUTINES, AND CURSORS
3. QUERY OPTIMISATION AND BEST PRACTICES
4. PROBLEM SOLVING USING SQL

INTRODUCTION TO BIG DATA AND CLOUD

1. BIG DATA AND CLOUD COMPUTING
2. AMAZON WEB SERVICES
3. BIG DATA STORAGE AND PROCESSING - HADOOP
4. EMR CLUSTER IN AWS

Understand the basics of big data and cloud and learn to work with an EMR cluster on a cloud-based service.

Duration: 1 WEEK

ANALYTICS USING SPARK

1. EXPLORATORY DATA ANALYSIS WITH PYSPARK
2. PREDICTIVE ANALYSIS WITH SPARK MLLIB

Use PySpark to do EDA and Predictive Analysis using Spark’s ML library.

Duration: 2 WEEKS

BIG DATA CASE STUDY

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

Use your analytics skills to work on a large dataset in the cloud to solve an industry problem.

Duration: 1 WEEK

COURSE 4 - DATA VISUALISATION AND STORYTELLING


VISUALISATION USING TABLEAU

1. DATA EXPLORATION IN TABLEAU
2. VISUALISING AND ANALYSING DATA IN TABLEAU WITH BASIC PLOTS

Learn basic visualisation techniques using the most in-demand visualisation tool in the industry.

Duration: 1 WEEK


ADVANCED EXCEL

1. EXCEL FUNCTIONS
2. DATA ANALYSIS IN EXCEL
3. ADVANCED TOOLS AND VISUALISATIONS

Learn the advanced concepts in Excel and start to perform data analysis like a pro!

Duration: 1 WEEK

VISUALISATION USING POWERBI

1. POWERBI: INTRODUCTION AND SETUP
2. VISUALISING AND ANALYSING DATA IN POWERBI
3. DATA TRANSFORMATIONS USING POWERBI

Take your visualisation game a step forward by understanding how to operate PowerBI.

Duration: 1 WEEK

STRUCTURED PROBLEM SOLVING USING FRAMEWORKS

1. INTRODUCTION TO STRUCTURED PROBLEM SOLVING
2. INTERVIEWING AND FRAMEWORKS - I: 5W AND 5WHYS
3. INTERVIEWING AND FRAMEWORKS - II: SPIN
4. INDUSTRY DEMONSTRATIONS ON FRAMEWORKS
5. UNDERSTANDING BUSINESS MODEL CANVAS AND ISSUE TREE FRAMEWORK
6. INDUSTRY DEMONSTRATIONS ON ISSUE TREE FRAMEWORK
7. SPECIALIZED FRAMEWORKS FOR BUSINESS PROBLEMS: 7PS, 5CS, ETC.

Learn how to attack a business problem using various structured frameworks like 5W, 5WHYs, and SPIN.

Duration: 1 WEEK


DATA STORYTELLING

1. INTRODUCTION TO DATA STORYTELLING
2. COMPONENTS OF A GOOD STORY WITH DATA - UNDERSTANDING YOUR STAKEHOLDER AND STAKEHOLDER EMPATHY, LEVELS OF DETAILS FOR DIFFERENT STAKEHOLDERS - CXO/LEADERSHIP VS TEAM PRESENTATIONS, VISUALS, ETC.
3. GOLDEN RULES FOR DATA STORYTELLING

Learn how to effectively strategise, communicate, and refine your data analysis projects, understand how to present your findings optimally to technical and non-technical stakeholders, and upgrade your storytelling skills.

Duration: 1 WEEK

AIRBNB CASE STUDY

1. PROBLEM STATEMENT
2. EVALUATION RUBRIC
3. FINAL SUBMISSION
4. SOLUTION

Use your newly learnt UI tool skills to analyse an AirBnB dataset to make important business decisions. But the analysis is just a small part; can you also effectively present it using Data Storytelling to the right stakeholders?

Duration: 1 WEEK

COURSE 5: ADVANCED PROBLEM SOLVING AND PROGRAMMING


DATA STRUCTURES - SETS, DICTIONARIES, STACKS, QUEUES

1. IN-BUILT DATA STRUCTURES
2. STACK
3. QUEUE
4. TREES

Learn user-defined data structures (Stack, Queue, Trees) in Python that help in advanced data manipulation.

Duration: 1 WEEK

SEARCHING AND SORTING

1. SEARCHING
2. SORTING
3. TWO POINTERS

Learn the most fundamental searching and sorting algorithms and design techniques.

Duration: 1 WEEK


ALGORITHM ANALYSIS + RECURSION

1. ALGORITHM ANALYSIS
2. TIME AND SPACE COMPLEXITY
3. RECURSION

Learn how to assess the efficiency of your code using algorithm analysis techniques and learn to write recursive algorithms.

Duration: 1 WEEK

ADVANCED DATABASE PROGRAMMING USING PANDAS

1. ADVANCED DATA WRANGLING WITH PANDAS - I
2. ADVANCED DATA WRANGLING WITH PANDAS - II

Learn and implement advanced wrangling functions and techniques in Pandas related to date-time, multi-column aggregation, hierarchical indexing, and more.

Duration: 1 WEEK

PYTHON & SQL LAB

1. SQL: TIMED TEST + ASSIGNMENT
2. PYTHON: TIMED TESTS I & II
3. VIDEO SUBMISSION

In this competitive assignment, you will solve a variety of programming questions in both SQL and Python in a timed environment. You will also demonstrate one of the questions through a video submission to help improve your interviewing skills.

Duration: 2 WEEKS

COURSE 6 - CAPSTONE PROJECT


CAPSTONE PROJECT

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION

Solve an end-to-end real-life industry problem from a wide variety of domains.

Duration: 4 WEEKS


SPECIALISATION: DATA ENGINEERING


COURSE 3 - DATA ENGINEERING - I
DATA MANAGEMENT AND RELATIONAL DATABASE MODELLING

1. ENTERPRISE DATA MANAGEMENT
2. RELATIONAL DATABASE MODELLING
3. NORMAL FORMS AND ER DIAGRAMS

Understand the concepts of Data Management and learn to model data from a Relational Database.

Duration: 1 WEEK

INTRODUCTION TO BIG DATA (OPTIONAL)

1. 4VS OF BIG DATA
2. BIG DATA: INDUSTRY CASE STUDIES

In this module, you will learn what big data is, its various characteristics, and its determining factors. You will also get an idea of the various sources of big data and the wide range of big data applications in different industries such as retail, healthcare, and finance.

Duration: 0 WEEK

INTRODUCTION TO CLOUD AND AWS SETUP

1. INTRODUCTION TO CLOUD
2. AWS SETUP

Understand what the cloud is and set up your AWS account, which will be required during the program.

Duration: 1 WEEK

INTRODUCTION TO HADOOP AND MAPREDUCE PROGRAMMING

1. CONCEPTS RELATED TO DISTRIBUTED COMPUTING
2. HADOOP DISTRIBUTED FILE SYSTEM
3. MAPREDUCE PROGRAMMING IN PYTHON

Understand the world of distributed data processing and storage with Hadoop. Learn to write MapReduce jobs in Python.

Duration: 1 WEEK


ASSIGNMENT (OPTIONAL)

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS

Solve an assignment to brush up on the skills learnt so far.

Duration: 0 WEEK

NOSQL DATABASES AND APACHE HBASE


NOSQL DATABASES AND MONGODB (OPTIONAL)

1. CONCEPTS OF NOSQL DATABASES
2. INTRODUCTION TO APACHE HBASE
3. HBASE PYTHON API
4. COMPARISON OF NOSQL DATABASES

Learn the concepts of NoSQL databases. Understand the working of Apache HBase.

Duration: 1 WEEK

DATA WAREHOUSING (OPTIONAL)

1. INTRODUCTION TO DATA WAREHOUSE AND DATA LAKES
2. DESIGNING DATA WAREHOUSING FOR AN ETL DATA PIPELINE
3. DESIGNING DATA LAKE FOR AN ETL DATA PIPELINE

Understand the intricacies behind designing a data warehouse and a data lake for one or more use cases.

Duration: 0 WEEK

DATA INGESTION WITH APACHE SQOOP AND APACHE FLUME

1. INTRODUCTION TO DATA INGESTION
2. STRUCTURED DATA INGESTION WITH SQOOP
3. UNSTRUCTURED DATA INGESTION WITH FLUME

Get familiar with the challenges involved in data ingestion. Use Sqoop and Flume to ingest structured and unstructured data into Hadoop.

Duration: 1 WEEK


MAP REDUCE PROGRAMMING ASSIGNMENT

1. PROBLEM STATEMENT AND SAMPLE DATASET
2. SOLUTION

Practise MapReduce Programming on a Big Dataset.

Duration: 1 WEEK

COURSE 4 - DATA ENGINEERING - II


HIVE & QUERYING

1. FUNDAMENTALS OF APACHE HIVE
2. WRITING HQL FOR DATA ANALYSIS
3. PARTITIONING AND BUCKETING WITH HIVE

Manage and query a data warehouse with Apache Hive. Learn to write optimised HQL for large-scale data analysis.

Duration: 2 WEEKS

ASSIGNMENT (OPTIONAL)

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS

Solve an assignment to brush up on the skills learnt so far.

Duration: 0 WEEK

AMAZON REDSHIFT

1. DATA WAREHOUSING WITH REDSHIFT
2. ANALYZE DATA WITH REDSHIFT

Learn to deploy a Redshift cluster and use it for querying data.

Duration: 1 WEEK

INTRODUCTION TO APACHE SPARK

1. SPARK ARCHITECTURE
2. RDD, DATAFRAME API, SPARKSQL

Get introduced to Apache Spark, a lightning-fast big data processing engine.

Duration: 1 WEEK


PROJECT: ETL DATA PIPELINE

1. INTRODUCTION AND PROBLEM STATEMENT
2. GRADING RUBRICS AND SUBMISSION

Make use of Sqoop, Redshift & Spark to design an ETL data pipeline.

Duration: 2 WEEKS

AWS CLOUD INFRASTRUCTURE (OPTIONAL)

1. THE AWS CLOUD PLATFORM
2. BUILDING AND DEPLOYING VIRTUAL MACHINES
3. AWS CLOUD STORAGE SOLUTIONS
4. APPLICATION DEPLOYMENT
5. CLOUD ADMINISTRATION AND SECURITY
6. LOAD BALANCING AND BACKUP STRATEGIES
7. CLOUD AUTOMATION

Do a deep dive into AWS Cloud.

Duration: 0 WEEK

COURSE 5 - DATA ENGINEERING - III


OPTIMISING SPARK FOR LARGE SCALE DATA PROCESSING

1. RUNNING SPARK ON MULTINODE CLUSTER
2. SPARK MEMORY & DISK OPTIMISATION
3. OPTIMISING SPARK CLUSTER ENVIRONMENT

Use PySpark to create large-scale data processing applications.

Duration: 1 WEEK


APACHE FLINK (OPTIONAL)

1. INTRODUCTION TO APACHE FLINK
2. BATCH DATA PROCESSING WITH FLINK
3. STREAM PROCESSING WITH APACHE FLINK
4. SQL API

Get introduced to Apache Flink and learn to query batch data. Use the DataStream API to create a stream processing application.

Duration: 0 WEEK

REAL-TIME DATA STREAMING WITH APACHE KAFKA

1. INTRO TO REAL-TIME DATA PROCESSING ARCHITECTURES
2. FUNDAMENTALS OF APACHE KAFKA
3. SETTING UP KAFKA PRODUCER AND CONSUMER
4. KAFKA CONNECT API & KAFKA STREAMS

Understand the producer-consumer architecture of Apache Kafka. Learn to set up a Kafka cluster for managing real-time data.

Duration: 1 WEEK

REAL-TIME DATA PROCESSING USING SPARK STREAMING

1. SPARK STREAMING ARCHITECTURE
2. SPARK STREAMING APIS
3. BUILDING STREAM PROCESSING APPLICATION WITH SPARK
4. COMPARISON BETWEEN SPARK STREAMING AND FLINK

Learn about the real-time data processing architecture of Apache Spark. Build Spark Streaming applications to process data in real time.

Duration: 1 WEEK


ASSIGNMENT (OPTIONAL)

1. INTRODUCTION, PROBLEM STATEMENT AND GRADING RUBRICS

Solve an assignment to brush up on the skills learnt so far.

Duration: 0 WEEK

BUILDING AUTOMATED DATA PIPELINES WITH AIRFLOW

1. FUNDAMENTALS OF AIRFLOW
2. WORKFLOW MANAGEMENT WITH AIRFLOW
3. AUTOMATING AN ENTIRE DATA PIPELINE WITH AIRFLOW

Automate Data Pipelines with Airflow.

Duration: 1 WEEK

ANALYTICS USING PYSPARK

1. EXPLORATORY DATA ANALYSIS WITH PYSPARK
2. PREDICTIVE ANALYSIS WITH SPARK MLLIB

Use PySpark to do EDA and Predictive Analysis using Spark’s ML library.

Duration: 1 WEEK

PROJECT: REAL TIME DATA PROCESSING

1. INTRODUCTION AND PROBLEM STATEMENT
2. GRADING RUBRICS AND SUBMISSION

Build an end-to-end real-time data processing application using Spark Streaming and Kafka.

Duration: 1 WEEK

COURSE 6 - CAPSTONE PROJECT


CAPSTONE PROJECT

1. AN OVERVIEW OF THE DOMAIN AND ASSOCIATED CONCEPTS
2. PROBLEM STATEMENT
3. EVALUATION RUBRIC
4. MID SUBMISSION
5. FINAL SUBMISSION
6. SOLUTION

The capstone project will stitch all the components of data engineering together.

Duration: 4 WEEKS


COURSE - RESEARCH METHODOLOGIES (8 WEEKS)


INTRODUCTION TO RESEARCH AND RESEARCH PROCESS

FAMILIARISE WITH DIFFERENT ASPECTS OF RESEARCH AND FORMULATE A RESEARCH QUESTION

• What is research, importance of research, what is data, what is information, what is knowledge?
• Importance of research, types of originality, characteristics of research, research process
• Criticism in research and its importance, Peer reviews in research and its importance
• Types of research: Scientific vs Rest, Objectives of research
• Structure of a research proposal: Components of the Research proposal covered over the course
• Identify a research problem, formulate a research question, characteristics of a good research question

RESEARCH DESIGN

DEVELOP AN UNDERSTANDING OF VARIOUS RESEARCH DESIGNS

• Types of research methods and pyramid of evidence
  1. Study of existing research and links between them
  2. Applied and incremental
  3. Discover
• Applied vs Fundamental, Quantitative vs Qualitative, Bayesian vs Frequentist, Hypothesis-driven research vs Exploratory research
• Sample Size and Power, Precision vs accuracy trade-off, p-value vs confidence intervals using a case study

LITERATURE REVIEWING

LEARN HOW TO READ AND CRITIQUE A PAPER, AND HOW TO CITE A PAPER

• Intro to lit review process, what is a lit review, benefits of lit review, literature review process (read, analyse and cite)
• How to read and critique a paper
• Types of sources that could be cited during research, the importance of citations and how to cite
• What makes a good reference, How to use reference management software, Related scientific ethics


RESEARCH PROJECT MANAGEMENT

LEARN HOW TO PLAN THE PROJECT AND HOW TO ARRANGE FOR DATA

• Project management in research: research question, planning of the project, initiation, monitoring, closure.
• Project requirements on data: data collection, data access, data sources, availability, credibility and usability of data from different sources.
• Project requirements on analysis software: Analytical methods in Data Science, software requirement (R, Minitab, Matlab…), and data cleaning skills.
• Project requirements on time: planning, breaking the work down to tasks, Gantt Charts, Milestones identification and Deliverables, Re-planning.

REPORT WRITING AND PRESENTATION SKILLS

MASTER GOOD SCIENTIFIC WRITING AND PROPER PRESENTATION SKILLS

• Art of writing a paper
• Parts of a paper
• Tools to write papers
• Publishing papers: Journals + Seminars
• Citation Methods and Rules
• Defending your thesis

SCIENTIFIC ETHICS

DEVELOP AN UNDERSTANDING OF THE ETHICAL DIMENSION IN RESEARCH

• Honor Code, Definition of Plagiarism, Type of Plagiarism, Code of good practice
• Research Claims, Professional Standard, IP, Conflict of Interest
• Legal aspects of data: Ethical Approvals for studies involving humans such as questionnaire-based research, Storing Primary Data

COURSE - DISSERTATION (16 WEEKS)


SUBMITTING THE IN-DEPTH RESEARCH WORK IN A FINAL THESIS REPORT AND PRESENTING IT.

Representative Theses to Select From:
• Investigate dietary patterns and metabolite fingerprints of takeaway (fast) food consumers using PCA and clustering methods
• Investigate a diagnosis of eye diseases using imaging ophthalmic data
• Structure medical images with information geometry
• Using Social media feed to place tweets regarding natural disasters on a map
• Preventing credit card fraud through pattern recognition
• Developing a recommender system for a Media giant
• Risk modelling for Financial activities and Investment Banking

Disclaimer: Program curriculum is subject to change based on inputs from the institute and experts. Please refer to the website for updated details, or speak to our Admission Counsellors.

Meet the
Class
INDUSTRIES OUR STUDENTS COME FROM
5% Healthcare
5% E-Commerce
1% Telecom
57% IT
1% Finance
15% Other
1% Consulting
1% Education
3% Retail
1% Manufacturing
10% BFSI

WORK EXPERIENCE
33% 0-3 years
21% 3.1-6 years
15% 6.1-9 years
11% 9.1-12 years
20% 12.1+ years



Elements of
Career Services
Jobs on Career Centre
Career Centre offers upGrad jobs across experience levels and CTC ranges.
• Easy apply feature for upGrad hiring partner vacancies
• Create a resume at profile builder with one click to apply for various jobs.

Just-In-Time Interview Prep (JIT)
For upcoming job interviews, JITs are conducted within 48 hours for eligible programs.
• Tailored to the job role and target domain
• Real-time feedback and tips for improvement

upGrad Elevate
• Recruitment Drive to connect you with the best talent admirers in the industry
• Get access to a wide range of opportunities and find the perfect job
• Apply your learnings to real industry problems

High-Performance Coaching
Dedicated coaches working with you to identify best-suited career opportunities.
• Help you define your value proposition
• Lay out a Career Path and help you adhere to your timelines and goals
• Help you with interview preparations, finding jobs in the market, salary negotiations and other preparation as required

Interview Preparation
Pre-recorded content on topics such as:
• Profile building, communications, etc.
• Problem-solving approach
• Approaching guesstimates
• Domain-specific interview question bank and much more

Personalised Industry Session
90-minute sessions over the weekend by leading industry experts.
• Session categories: Career, Technical and Communications
• Doubt resolution
• Develop proof of concepts and apply theoretical concepts in the real world
• Assess skill levels
• Peer Networking
• Classroom element
• Business communication sessions and much more

Profile Builder (AI-Powered)
An easy-to-use Resume, LinkedIn and Cover letter preparation tool.
• Resume Score: AI-Driven Resume Score
• Real-time recommendations to improve.
• Match your resume to the JD and check fitment.
• LinkedIn Profile Review.
• Cover Letter creation.

Career Mentorship Sessions
Get personalised career advice through 1-1 sessions with industry experts.
• Goal setting for better employment results

Disclaimer: Career services are subject to change. Please refer to the website or speak to our Admission Counsellor for updated details.

Experience upGrad
Offline
UPGRAD BASECAMPS
Held across all major cities in India, upGrad basecamps
bring together learners, faculty and industry experts
for a power-packed day of activities, career-building
sessions and live group projects. Get to know your
peers and faculty and hone your networking skills
in an exciting environment.

CAREER FAIRS
Attend regular hiring drives in major cities across
India, giving you the opportunity to interview with
upGrad’s 300+ hiring partners and ensuring you get every
opportunity you deserve.

HACKATHONS
Team up and put your learning to use with our offline
Hackathons, designed to help you apply concepts,
meet, network, and grow!

Hear from
Our Learners
Sachin Aggarwal, Experience: 18+ Years
“Learning with IIITB and upGrad has been an experience like no other. Being enrolled
on an online program, you have your worries about how the program and teaching
methods will be. My favourite part about the learning experience has been the
well-designed and thoughtful content shared by IIITB professors and industry experts
on upGrad platforms. Kudos to upGrad!”

Shravani Shahapure, Experience: 16 Years


“For someone who really wants to pursue a career in the field of Data Science, it
is worth opting for the complete course by IIITB and upGrad. IIITB and upGrad’s
online course on Data Science gives many opportunities and develops students
for their future as they provide the best professors, thought-provoking assignments
and case studies.”

Savita Upadhyay, Experience: 4 Years


“It has been an amazing journey with upGrad till now. Starting with their course material
to live sessions to mentor support, each helps you to always be on track and
progress efficiently with the Data Science course. My sincere thanks to the entire
team of upGrad and Professors of IIITB for showing me the path and direction for
my dream to become a Data Analyst.”

Tuhin Pal, Experience: 5 Years


“I appreciate the platform upGrad has provided and the way they have arranged
modules and assignments. Modules are locked until you complete the previous one,
so it feels like clearing a semester and going to the next one.”

Program Details and


Admission Process
PROGRAM DURATION AND FORMAT
18 Months | Online

PROGRAM FEE
Without Immersion: INR 4,99,000 (Including taxes)
With Immersion: INR 6,99,000 (Including taxes)

PROGRAM START DATES
Please refer to the website for program start dates.
upgrad.com/data-science-masters-degree-iiitb/

ELIGIBILITY
Bachelor’s Degree with 50% or equivalent passing marks. No coding experience is required.

WEEKLY COMMITMENT (15 hours/week)

6-7 HOURS: Asynchronous learning time.
6-7 HOURS: Assignments and projects.
1 LIVE SESSION: Every two weeks.

SELECTION PROCESS

STEP 1: Selection Test
Fill out an application and take a short 17-minute online test with 11 questions.

STEP 2: Review and Shortlisting of Suitable Candidates
Our faculty will review all applications, considering the educational and professional background of an applicant and review the test scores where applicable. Following this, Offer Letters will be rolled out so you are assured of a great peer group to learn and network with.

STEP 3: Enrollment for Access to Prep Content
Make a quick block payment with assistance from our loan partners where required, receive immediate access to the prepped content and begin your upGrad journey.

FOR FURTHER INFORMATION, CONTACT
PRIYANKA PRAJAPATI
Program Marketing Manager, Data Science
admissions@upgrad.com
1800 210 2020
We are available 24*7

Disclaimer: Program fee and payment options are subject to change. Please refer to the website for updated details or speak to our admission counsellor.

COMPANY INFORMATION
upGrad Education Private Limited,
Nishuvi, 75, Dr. Annie Besant Road,
Worli, Mumbai - 400018.
