DATA SCIENCE
TABLE OF
CONTENTS
02. About IU
Online education is a fundamental disruption that will have a far-reaching impact. upGrad was
founded taking this into consideration. upGrad is an online education platform to help individuals
develop their professional potential in the most engaging learning environment.
Since its inception, upGrad has delivered over 20 million hours of learning, offering programs in
collaboration with universities across the world, including LJMU, IIT Madras, IIIT Bangalore and
Deakin Business School, among others. And it doesn’t end there.
upGrad, in collaboration with IIIT Bangalore, a renowned university, and International University of
Applied Sciences, Germany, offering programs specialising in Data Science, Machine Learning and
Artificial Intelligence, is excited to offer a one-of-its-kind, academically rigorous and industrially
relevant Master’s Degree in Data Science.
The faculty members have an average of 15+ years of experience. They cover the conceptual
depths of topics such as Data Science, Machine Learning and AI, and Big Data Analytics. These
will be complemented by industry-relevant case studies from major industry verticals by industry
leaders with 8+ years of experience from upGrad’s industry network.
Ronnie Screwvala
Co-founder & Executive Chairman
ABOUT
IU
>6,000
Business Cooperations
In today’s globalised world a simple degree is not enough. IU recognised this fact long ago, and
therefore, we have always striven to offer our students much more than a simple degree. With our
innovative, international, English-based Bachelor’s and Master’s programmes, our goal is to redefine
the standards of what it takes to be a manager. Through the in-depth, subject-specific knowledge
taught by IU’s highly experienced professors, and the practical experience they are exposed to
during their studies, our students gain a cutting-edge advantage over public university students.
With a network of over 6,000 reputable partner companies and organisations around the world, IU
can make your career in a competitive, global environment possible.
About us
ELIGIBILITY CRITERIA
FOR TRANSFER TO IU, GERMANY
• The applicant must complete the Executive PG Programme in Data Science from IIIT Bangalore
with a minimum of 50% marks.
• Minimum one-year work experience in relevant fields after graduation and before the learner
reaches Germany. The relevant professional experience should be in any of the following areas:
PROGRAM
HIGHLIGHTS
Dual Accreditation and Alumni Status
Get certified by IIITB and International University of Applied Sciences (IU), Germany
and gain dual alumni status on successful completion of the program.
Customisable Curriculum
Choose from 6 specialisations at IIITB and 10 electives at IU on the basis of your
background and career aspirations and get the learning you want.
FACULTY AND
INDUSTRY EXPERTS
Chandrashekar Ramanathan
Dean Academics, IIITB
Tricha Anjali
Ex-Associate Dean, IIITB
Prof. S. Sadagopan
Director, IIITB
Anshuman Gupta
Director - Data Science, Pitney Bowes
Ujjyaini Mitra
Head of Analytics, Zee5
An alumna of McKinsey and Co., Flipkart, and Bharti Airtel with over 11
years of experience.
Ankit Jain
Sr. Research Scientist, Uber AI Labs
Sajan Kedia
Lead Data Scientist (Pricing), Myntra
Prof. G. Srinivasaraghavan
Professor, IIITB
S. Anand
CEO, Gramener
UPGRAD
LEARNING EXPERIENCE
Dedicated Student Support
Weekly real-time doubt clearing sessions
Live Discussion forum for peer-to-peer doubt resolution monitored by technical experts
Coaching Format
Online format with weekly live sessions from industry experts to help with topic walk-throughs,
doubt resolution and personalised project feedback. Offline sessions such as Basecamps and
Hackathons.
Mentorship
60+ live interactive sessions with
industry experts, fortnightly
personalised group (1:8) mentorship
sessions and a dedicated student
support mentor for proactive
mentoring.
NEW
ADDITIONS
INDUSTRY
PROJECTS
LEARNING
PATH
6
Choose any of the 6 Specialisations
25 weeks (with 6 weeks of Capstone)
Data Science Generalist
Tools: Python, Tableau, SQL
Natural Language Processing
Tools: Python, Excel
Deep Learning
Tools: Python, Excel, TensorFlow
Business Analytics
Tools: Python, MySQL, Excel
Business Intelligence/Data Analytics
Tools: Python, Power BI, Excel, MySQL, MongoDB, Shiny, Tableau
Data Engineering
Tools: Hadoop, HBase, Sqoop, Hive, Flume, PySpark, Spark, Airflow
Journey in
Description
Taught by one of the most renowned data scientists in the country (S.Anand, CEO, Gramener), this
module takes you from a beginner level Excel user to an almost professional user.
Description
This module covers concepts of the CRISP-DM framework for business problem-solving.
Module
• INTRODUCTION TO PYTHON - I
Description
Build a foundation for the most in-demand programming language of the 21st century.
• INTRODUCTION TO PYTHON - II
Description
Learn to apply some of the commonly used paradigms of functional programming in Python.
• PROGRAMMING IN PYTHON
Description
Learn how to approach and solve logical problems using programming.
Description
Data in companies is definitely not stored in Excel sheets! Learn the fundamentals of databases and
extract information from an RDBMS using the structured query language.
• PYTHON FOR DATA SCIENCE
Description
Learn how to manipulate datasets in Python using Pandas which is the most powerful library for
data preparation and analysis.
• VISUALISATION IN PYTHON
Description
Humans are visual learners and hence no task related to data is complete without visualisation.
Learn to plot and interpret various graphs in Python and observe how they make data analysis
and drawing insights easier.
Description
Learn how to find and analyse the patterns in the data to draw actionable insights.
Description
Reinforce the concepts learnt in data science through this rigorous assignment involving the past.
Description
Build the mathematical foundation required for understanding the Machine Learning Algorithms.
• INFERENTIAL STATISTICS
Description
Build a strong statistical foundation and learn how to ‘infer’ insights from a huge population using a
small sample.
• HYPOTHESIS TESTING
Description
Understand how to formulate and validate hypotheses for a population to solve real-life business
problems.
• ADVANCED SQL
Description
Apply advanced SQL concepts like windowing and procedures to derive insights from data and
answer pertinent business questions.
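As an illustration of the windowing mentioned above, here is a minimal sketch that computes a running revenue total per region. The `sales` table, its columns and its values are invented for this example and are not part of the course material. It uses Python's built-in `sqlite3` module and assumes the bundled SQLite is version 3.25 or later, which is when window functions were introduced.

```python
import sqlite3

# Hypothetical in-memory table, invented purely for illustration.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, month INTEGER, revenue INTEGER)")
conn.executemany(
    "INSERT INTO sales VALUES (?, ?, ?)",
    [("North", 1, 100), ("North", 2, 150), ("South", 1, 80), ("South", 2, 120)],
)

# SUM(...) OVER (...) keeps every row and adds a running total per region,
# unlike GROUP BY, which would collapse each region into a single row.
query = """
SELECT region, month, revenue,
       SUM(revenue) OVER (PARTITION BY region ORDER BY month) AS running_total
FROM sales
ORDER BY region, month
"""
rows = conn.execute(query).fetchall()
for row in rows:
    print(row)
```

The window version answers questions such as "how much has each region earned so far this year?" while keeping the individual monthly rows visible.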
Description
Solve a real industry problem through the concepts learnt in exploratory data analysis.
MACHINE LEARNING 3 ASSIGNMENTS
Module
Description
Venture into the machine learning community by learning how one variable can be predicted using
several other variables through a housing dataset where you will predict the prices of houses based
on various factors.
Description
Build a model to understand the factors on which the demand for bike-sharing systems depends and
help a company optimise its revenue.
• LOGISTIC REGRESSION
Description
Learn your first binary classification technique by determining which customers of a telecom
operator are likely to churn and which are not, to help the business retain customers.
Description
Learn how to group elements into different clusters when you don’t have any pre-defined labels to
segregate them through K-means clustering, hierarchical clustering, and more.
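To give a feel for clustering without labels, the sketch below implements the basic K-means loop (often called Lloyd's algorithm) in plain Python. The sample points and the choice of starting centroids are made up for illustration. Production work would normally use a library implementation with better initialisation.

```python
def kmeans(points, centroids, iterations=10):
    """Basic K-means on 2-D points, starting from the given centroids."""
    for _ in range(iterations):
        # Assignment step: attach each point to its nearest centroid.
        clusters = [[] for _ in centroids]
        for x, y in points:
            distances = [(x - cx) ** 2 + (y - cy) ** 2 for cx, cy in centroids]
            clusters[distances.index(min(distances))].append((x, y))
        # Update step: move each centroid to the mean of its cluster.
        # An empty cluster keeps its previous centroid.
        centroids = [
            (sum(p[0] for p in c) / len(c), sum(p[1] for p in c) / len(c)) if c else old
            for c, old in zip(clusters, centroids)
        ]
    return centroids, clusters


# Illustrative data: two visually separate groups of points.
points = [(1.0, 1.0), (1.5, 2.0), (1.0, 1.5), (8.0, 8.0), (9.0, 9.5), (8.5, 9.0)]
centroids, clusters = kmeans(points, centroids=[points[0], points[3]])
print(centroids)
print(clusters)
```

No labels are supplied anywhere: the groups emerge only from the distances between points.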
Description
Learn how to approach open ended real world problems using data as a lever to draw actionable
insights.
Description
Apply the machine learning concepts learnt to help an international NGO cluster countries to
determine their overall development and plan for lagging countries.
Description
Help the Sales team of your company identify which leads are worth pursuing through this
classification case study.
SPECIALISATION 1: DATA SCIENCE GENERALIST
Module
Description
Learn how the human decision making process can be replicated using a decision tree and other
powerful ensemble algorithms.
Description
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, along with general machine learning techniques like feature engineering,
model evaluation, and many more.
• ML LAB I: CLASSIFICATION
Description
Understand important concepts related to dimensionality reduction, the basic idea and the
learning algorithm of PCA, and its practical applications on supervised and unsupervised
problems.
• ADVANCED REGRESSION - I
Description
In this module, take a more advanced look at regression models and learn the concepts related
to regularisation.
Description
An introduction to the world of NLP and basic text processing skills. Learn how to build a
classification engine that works on (unstructured) textual data.
Description
Learn advanced visualisation techniques using the most in-demand visualisation tool in the industry.
• DATA STORYTELLING
Description
Learn how to effectively strategise, communicate, and fine grain your data analysis projects and
understand how to optimally present your findings to technical and non-technical stakeholders
and upgrade your storytelling skills.
Module
• DATA MODELLING
Description
In this module, you will learn and use data modelling on a dataset to solve a business problem.
Description
Apply advanced SQL concepts like windowing and procedures to derive insights from data and
answer pertinent business questions.
Description
Learn how to assess the efficiency of your code using algorithm analysis techniques and learn to write
recursive algorithms.
Description
Learn the most fundamental searching and sorting algorithms and design techniques.
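As one example of a fundamental searching algorithm, here is a short sketch of binary search on a sorted list. The function name and sample data are chosen for illustration only. The module's own exercises may differ.

```python
def binary_search(sorted_items, target):
    """Return the index of target in sorted_items, or -1 if it is absent."""
    low, high = 0, len(sorted_items) - 1
    while low <= high:
        mid = (low + high) // 2
        if sorted_items[mid] == target:
            return mid
        if sorted_items[mid] < target:
            low = mid + 1   # target can only be in the right half
        else:
            high = mid - 1  # target can only be in the left half
    return -1


values = [2, 5, 8, 12, 16, 23]
print(binary_search(values, 12))
print(binary_search(values, 7))
```

Because each step halves the remaining range, the number of comparisons grows logarithmically with the length of the list, which is why the input must already be sorted.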
Description
Learn user-defined data structures (Stack, Queue, Trees) in Python that help in advanced data
manipulation.
• PYTHON - OOPS
Description
Learn OOP concepts such as Class, Object, Method, Inheritance, Polymorphism, Data Abstraction
and Encapsulation.
• PYTHON WEEKLONG LAB
Module 3 ASSIGNMENTS
• CAPSTONE PROJECT
Description
Solve an end-to-end real-life industry problem from a wide variety of domains. Make a video
presentation of your working demo to showcase in your portfolio.
Module
• TREE MODELS
Description
Learn how the human decision making process can be replicated using a decision tree and other
powerful ensemble algorithms.
Description
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, along with general machine learning techniques like feature engineering,
model evaluation, and many more.
Description
Understand important concepts related to dimensionality reduction, the basic idea and the
learning algorithm of PCA, and its practical applications on supervised and unsupervised
problems.
• ADVANCED REGRESSION
Description
In this module, take a more advanced look at regression models and learn the concepts related to
regularisation.
Description
Build a regularised regression model to understand the most important variables to predict the
house prices in Australia.
• BAGGING AND BOOSTING
Description
Learn about ensemble modelling through bagging and boosting and understand how weak
algorithms can be transformed into stronger ones.
Description
In this module, you will learn how to analyse and forecast a series that varies with time.
Description
Solve the most crucial business problem for a leading telecom operator in India and southeast
Asia - predicting customer churn.
• TEXT PROCESSING
Description
Do you get annoyed by the constant spam in your mailbox? Wouldn’t it be nice if we had a
program to check your spelling? In this module, learn how to build a spell checker and spam
detector using techniques like phonetic hashing, bag-of-words, TF-IDF, etc.
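To make bag-of-words and TF-IDF concrete, the following sketch scores the words of three made-up messages in plain Python. It assumes the simple unsmoothed variant, term frequency multiplied by log(N / document frequency). Libraries often apply smoothing or normalisation, so their numbers will differ.

```python
import math
from collections import Counter


def tf_idf(documents):
    """Return one {term: score} dictionary per document."""
    tokenised = [doc.lower().split() for doc in documents]
    n_docs = len(tokenised)
    # Document frequency: the number of documents each term appears in.
    doc_freq = Counter(term for tokens in tokenised for term in set(tokens))
    scores = []
    for tokens in tokenised:
        counts = Counter(tokens)  # the bag-of-words for this document
        scores.append({
            term: (count / len(tokens)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return scores


# Illustrative messages, invented for this example.
docs = ["win a free prize now", "meeting agenda for monday", "free prize for you"]
scores = tf_idf(docs)
for doc, doc_scores in zip(docs, scores):
    print(doc, max(doc_scores, key=doc_scores.get))
```

Words that appear in only one message score higher than words shared across messages, which is what lets a classifier pick out distinctive vocabulary.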
Description
This module will help you understand how to extract meaningful features from the processed
text data. Using these features, you will be able to extract entities, classify POS tags, and generate
similarity scores between two question strings.
Description
Perform sentiment analysis on product reviews from Amazon using NLP and machine learning.
This assignment focuses on giving you a business understanding of how to do product
optimisation using NLP.
• INTRO TO DL
Description
Learn the most sophisticated and cutting-edge technique in machine learning - Artificial Neural
Networks & how to apply Deep learning for NLP.
• NLP INDUSTRIAL APPLICATIONS
Description
Learn how to use NLP with Neural networks for different industrial applications like text
classification, Question pair similarity, Text generation, Topic modelling.
Description
Imagine if you could make a restaurant booking without opening Zomato. Build your own restaurant
search chatbot with the help of RASA - an open source framework and deploy it on Slack.
CAPSTONE
Module
• DEPLOYMENT
2 ASSIGNMENTS
Description
Learn how to productionise your model and deploy it on the server.
3 ASSIGNMENTS
• CAPSTONE
Description
Choose from a range of real-world industry woven projects on advanced topics like
Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media
Listening, Speech Recognition among many others.
Module
• TREE MODELS
Description
Learn how the human decision making process can be replicated using a decision tree and other
powerful ensemble algorithms.
Description
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, along with general machine learning techniques like feature engineering,
model evaluation, and many more.
• PRINCIPAL COMPONENT ANALYSIS
Description
Understand important concepts related to dimensionality reduction, the basic idea and the learning
algorithm of PCA, and its practical applications on supervised and unsupervised problems.
• ADVANCED REGRESSION
Description
In this module, take a more advanced look at regression models and learn the concepts related to
regularisation.
Description
Build a regularised regression model to understand the most important variables to predict the
house prices in Australia.
2 ASSIGNMENTS
• BAGGING AND BOOSTING
3 ASSIGNMENTS
Description
Learn about ensemble modelling through bagging and boosting and understand how weak
algorithms can be transformed into stronger ones.
Description
In this module, you will learn how to analyse and forecast a series that varies with time.
Description
Solve the most crucial business problem for a leading telecom operator in India and southeast
Asia - predicting customer churn.
Module
Description
Learn the most sophisticated and cutting-edge technique in machine learning - Artificial
Neural Networks or ANNs.
• NEURAL NETWORKS ASSIGNMENT
Description
Build a neural network from scratch in Numpy to identify handwritten digits.
Description
Learn the basics of CNN and OpenCV and apply them to Computer Vision tasks such as detecting
anomalies in chest X-ray scans and detecting vehicles to count and categorise them, helping the
government ascertain the width and strength of the road.
Description
Ever wondered what goes on behind machine translation, sentiment analysis and speech recognition?
Learn how RNNs help in these areas, which involve sequential data like text, speech, videos, and a
lot more.
• GESTURE RECOGNITION
Description
Build a Smart TV system that can be controlled with the user’s hand gestures instead of a remote control.
CAPSTONE
Module
• DEPLOYMENT
Description
Learn how to productionise your model and deploy it on the server.
• CAPSTONE
Description
Choose from a range of real-world industry woven projects on advanced topics like
Recommendation Systems, Fraud Detection, Emotion Detection from faces, Social Media
Listening, Speech Recognition among many others.
2 ASSIGNMENTS
SPECIALISATION 4: BUSINESS ANALYTICS
Module
• TREE MODELS
Description
Learn how the human decision making process can be replicated using a decision tree and other
powerful ensemble algorithms.
Description
In this module, you will learn how to analyse and forecast a series that varies with time.
Description
Apply the concepts learnt in time series to solve a forecasting problem for a retail giant.
Description
Learn the pros and cons of simple and complex models and the different methods for quantifying
model complexity, along with general machine learning techniques like feature engineering,
model evaluation, and many more.
Description
Learn how to write optimised SQL queries that require less memory and execute in less
time.
• ADVANCED EXCEL
Description
Learn the advanced concepts in Excel and start to perform data analysis like a pro!
Description
Solve the most crucial business problem for a leading telecom operator in India and southeast
Asia - predicting customer churn.
BUSINESS REQUIREMENTS 2 ASSIGNMENTS
Module
Description
Learn how to attack a business problem using various structured frameworks like 5W, 5WHYs, and
SPIN.
Description
Apply your learnings from the course to solve a real-life business problem.
• OPERATIONS RESEARCH
Description
Learn about the world of operations research through linear and integer optimisations.
• DATA STORYTELLING
Description
Learn how to effectively strategise, communicate, and fine grain your data analysis projects and
understand how to optimally present your findings to technical and non-technical stakeholders
and upgrade your storytelling skills.
Description
Understand how a project in the industry is taken up and solved through a comprehensive
business case study.
Module
• CAPSTONE PROJECT
Description
Solve an end-to-end real-life industry problem from a wide variety of domains. Available capstone
project choices -
(i) Stock Analysis & Portfolio Management
(ii) E-Commerce & Marketing
(iii) Healthcare
(iv) Supply Chain Optimisation
(v) Credit Card Fraud Detection
SPECIALISATION 5: BUSINESS INTELLIGENCE / DATA ANALYTICS
Module
• DATA MODELLING
Description
In this module, you will learn and use data modelling on a dataset to solve a business problem.
Description
Learn how to write optimised SQL queries that require less memory and execute in less
time.
Description
In this assignment, you will work on a movies dataset using SQL to extract exciting insights.
• ADVANCED EXCEL
Description
Learn the advanced concepts in Excel and start to perform data analysis like a pro!
Description
Take your knowledge of query languages a step further by learning about MongoDB - a NoSQL
database which is becoming more and more popular in the industry.
Description
Understand the basics of big data and cloud and learn to work with an EMR cluster on a
cloud-based service.
Description
In this module, you learn about the architecture and features of the Hive Query Language.
Description
Understand how a project in the industry is taken up and solved through a comprehensive
business case study.
STORYTELLING WITH ADVANCED VISUALISATIONS 2 ASSIGNMENTS
Module
Description
Learn advanced visualisation techniques using the most in-demand visualisation tool in the industry.
Description
Apply the new found Excel and Tableau skills to solve an exciting business assignment.
Description
Take your visualisation game a step forward by understanding how to operate PowerBI.
Description
Get a brief introduction to another popular open-source visualisation library in Python and
learn to code and create powerful, pretty, and interactive visualisations.
• DATA STORYTELLING
Description
Learn how to effectively strategise, communicate, and fine grain your data analysis projects and
understand how to optimally present your findings to technical and non-technical stakeholders
and upgrade your storytelling skills.
Description
Understand how a project in the industry is taken up and solved through a comprehensive
business case study.
Module
• CAPSTONE PROJECT
Description
Solve an end-to-end real-life industry problem from a wide variety of domains. Available capstone
project choices -
(i) Web & Social Media Analytics
(ii) Finance and Risk Analytics
(iii) Marketing and Retail Analytics
(iv) Supply Chain Analytics
(v) Fraud Analytics
Module
Description
In this module, you will learn what big data is, its various characteristics, and its determining factors.
You will also get an idea of the various sources of big data and the wide range of big data
applications in different industries such as retail, healthcare, and finance.
Description
Understand what the cloud is and set up your AWS account, which will be required during the program.
Description
Understand the world of distributed data processing and storage with Hadoop. Learn to write
MapReduce jobs in Python.
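The MapReduce pattern can be sketched locally in plain Python before running anything on a cluster. The word-count job below is a simulation under that assumption: the `mapper`, `reducer` and `run_job` names are hypothetical, and on Hadoop the framework itself would perform the shuffle-and-sort step between the two phases.

```python
from itertools import groupby
from operator import itemgetter


def mapper(line):
    """Map phase: emit a (word, 1) pair for every word in the line."""
    for word in line.lower().split():
        yield word, 1


def reducer(word, counts):
    """Reduce phase: add up all the counts emitted for one word."""
    return word, sum(counts)


def run_job(lines):
    """Local stand-in for a cluster run: map, sort by key, then reduce."""
    mapped = [pair for line in lines for pair in mapper(line)]
    mapped.sort(key=itemgetter(0))  # stands in for shuffle-and-sort
    return dict(
        reducer(word, (count for _, count in group))
        for word, group in groupby(mapped, key=itemgetter(0))
    )


# Illustrative input lines, invented for this example.
result = run_job(["big data big ideas", "data pipelines"])
print(result)
```

Because the mapper looks at one line at a time and the reducer at one key at a time, both phases can be spread across many machines without changing the logic.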
Description
Practise MapReduce Programming on a Big Dataset.
Description
Understand the concepts of Data Management and learn to model data from a Relational
Database.
• NOSQL DATABASES AND APACHE HBASE
• NOSQL DATABASES AND MONGODB (OPTIONAL)
Description
Learn the concepts of NoSQL databases. Understand the working of Apache HBase.
Description
Understand the intricacies behind designing a data warehouse and a data lake for specific use cases.
Description
Get familiar with the challenges involved in data ingestion. Use Sqoop and Flume to ingest
structured and unstructured data into Hadoop.
Description
Manage and query a data warehouse with Apache Hive. Learn to write optimised HQL for large
scale data analysis.
Description
Use HQL to analyse a Big Dataset.
• AMAZON REDSHIFT
Description
Learn to deploy a Redshift cluster and use it for querying data.
Description
Get introduced to Apache Spark, a lightning fast big data processing engine.
Description
Practise Apache Spark and its core libraries on the NYC Parking Ticket dataset.
Description
Make use of Sqoop, Redshift & Spark to design an ETL data pipeline.
DATA ENGINEERING - II 3 ASSIGNMENTS (1 MANDATORY, 2 OPTIONAL)
Module
Description
Use PySpark to create large scale data processing applications.
• APACHE FLINK (OPTIONAL)
Description
Get introduced to Apache Flink and learn to query batch data. Use the DataStream API to create a stream
processing application.
Description
Understand the producer-consumer architecture of Apache Kafka. Learn to set up a Kafka cluster
for managing real-time data.
Description
Learn about the real-time data processing architecture of Apache Spark. Build Spark Streaming
applications to process data in real-time.
Description
This assignment revolves around building a Spark Structured Streaming application to process
stock data in real time.
Description
Automate Data Pipelines with Airflow.
Description
Use PySpark to do EDA and Predictive Analysis using Spark’s ML library.
Description
An assignment related to a classification based problem statement.
• PROJECT: REAL TIME DATA PROCESSING
Description
Build an end-to-end real-time data processing application using Spark Streaming and Kafka.
Module
• CAPSTONE PROJECT
Description
The capstone project will stitch all the components of data engineering together.
ADDITIONAL STUDY ABROAD
IU, GERMANY MODULES
Advanced Mathematics
The course reviews differentiation and integration and then discusses partial differentiation,
vector algebra and vector calculus. Matrix calculation and vector spaces are
fundamental to many modern data processing algorithms and are discussed in detail.
Module Contents
• Calculus
• Integral transformations
• Vector algebra
• Vector calculus
• Matrices and vector spaces
• Information theory
In this module, students will reflect on current societal and political implications of the application
of data science models. To this end, pertinent topics will be introduced via articles that are then
critically evaluated by the students in the form of a written essay.
The evaluation and definition of use cases is the fundamental groundwork from which the projects
can be defined. This does not only include the scope and technical requirements of a project but
also how value can be derived from the project. A crucial aspect is the definition of what makes a
project successful, both in terms of a technical evaluation as well as a business centric perspective
and how the status quo can be monitored effectively during the progress of a project. The course
also discusses how to avoid common fallacies and understand the implications of introducing
data-driven decisions into traditional management structures.
Module Contents
• Use case evaluation
• Model-centric evaluation
• Business-centric evaluation
• Monitoring
• Avoiding common fallacies
• Change management
Project: Data Science Use Case
In this course, students choose a project task in accord with their tutor from a variety of options.
The goal is to prototypically implement a data science model or system in a suitable development
environment. The choice of approach, the system or software implemented, and the resulting
performance on the task are to be reasoned about, explained, and documented in a project report.
To this end, students make practical use of the methodological knowledge acquired in previous
courses by applying them to relevant real-world problems.
Advanced Statistics
After defining and introducing the fundamental concepts of statistics, the course will cover
important probability distributions and their prevalence in application scenarios; discuss
descriptive techniques to summarize and visualize data effectively; and discuss the Bayesian
approach to statistics. Estimating parameters is a key ingredient in optimizing data models, and the
course will give a thorough overview of the most important techniques.
Module Contents
• Introduction to statistics
• Important probability distributions and their applications
• Bayesian statistics
• Descriptive statistics
• Data visualization
• Parameter estimation
• Hypothesis tests
2. MASTER OF SCIENCE (M.SC.) DATA SCIENCE FROM IU, GERMANY
Module ECTS
• Electives B 10
Module ECTS
ELECTIVE B OPTIONS
Module
• Management
• Corporate Finance
• Cognitive Computing
MEET
THE CLASS
5% Healthcare
5% E-Commerce
1% Manufacturing
1% Telecom
1% Education
15% Other
3% Retail
1% Consulting
WORK EXPERIENCE
HEAR FROM
OUR LEARNERS
ADMISSION
PROCESS
PROGRAM DURATION AND FORMAT
12 months Online | 12 months On-Campus in Germany
ELIGIBILITY CRITERIA
Bachelor’s Degree with minimum 50% in any of the following fields - Business, Engineering, IT,
Transport & Logistics, Mathematics, and Medical/Nursing/Pharmacy.
SELECTION PROCESS
STEP 1: Selection Test
Fill out an application and take a short 17-minute online test with 11 questions.
STEP 2: Review and Shortlisting of Suitable Candidates
Our faculty will review all applications, considering the educational and professional background of
an applicant and reviewing the test scores where applicable. Following this, the [...] assured a great
peer group to learn and network with.
STEP 3: Enrollment for Access to Prep Content
Make a quick block payment with assistance from our loan partners where required, receive
immediate access to prep content and begin your upGrad journey.
COMPANY INFORMATION
upGrad Education Private Limited
Nishuvi, 75, Dr. Annie Besant Road, Worli, Mumbai - 400018
info@upgrad.com | 18002102020