You are on page 1of 6

DATA SCIENCE

PRODEGREEThe only program in India that is delivered


truly with the industry

Knowledge Partner

Collaboration with KPMG in India, a global leader in Analytics & AI

Become a skilled Data Scientist/Analyst with this project-based course

Real-business projects and case studies by KPMG in India, project mentoring by KPMG in India experts

Covers multiple Analytics tools such as Python, R, SQL, and Tableau

Upskill without leaving your job in just 4 months – Classroom & Online format

TOP 4 BIG DATA


2016

10
1 0110

Leading Institute in 0 011


1 01
1 00

SAS & Python


1

ANALYTICS TRAINING TRAINING INSTITUTE


Analytics Vidhya
INSTITUTE IN INDIA
INDUSTRY LANDSCAPE

CUMULATIVE ANALYTICS
MARKET IN INDIA

= $30 BILLION
DOMESTIC ANALYTICS
MARKET

= $3.03 BILLION
TO GROW 2X 2025
EXPECTED BY

1,50,000 NEW OPENINGS


IN DATA SCIENCE 62% GROWTH
ON 2019

AVERAGE SALARY ACROSS SECTORS

BFSI MANUFACTURING IT E-COMMERCE

13.5 11.8 11.8 10


LPA LPA LPA LPA

ANALYTICS EMPLOYERS

OVERVIEW OF PROGRAM

DATA JOB
SCIENCE PYTHON R SQL TABLEAU READINESS

180-HOUR PROGRAM AVAILABLE IN CLASSROOM & ONLINE DELIVERY FORMAT


CURRICULUM

STATISTICS FUNDAMENTALS & R


R Installation & Walk-Through of R Studio | Understanding Data Structures in
R - lists, matrices, vectors | Intro to R Programming | R Base Software |
Understanding CRAN | RStudio the IDE | Basic Building Blocks in R |
BASICS OF R FOR Understanding Vectors in R | Basic Operations Operators and Types |
DATA SCIENCE Handling Missing Values in R | Subsetting Vectors in R | Matrices and
Data Frames in R | Logical Statements in R | Lapply, sapply, vapply and
tapply Functions

DATA VISUALIZATION Grammar of Graphics | Bar Charts | Histograms | Pie Charts | Scatter
USING R Plots | Line Plots and Regression | Word Clouds | Box Plots | GGPLOT2
Measures of Central Tendency in Data | Measures of Dispersion |
STATISTICAL Understanding Skewness in Data | Probability Theory | Bayes Theorem |
FUNDAMENTALS – I Probability Distributions | Hypothesis Testing
Analysis of Variance and Covariance | One-way Analysis of Variance |
STATISTICAL Assumption of ANOVA | Statistics Associated with One-way Analysis of
FUNDAMENTALS – II Variance | Interpreting the ANOVA Results | Two-way Analysis of Variance |
Interpreting the ANOVA Results | Analysis of Covariance

DATA SCIENCE WITH R


Merge, Rollup, Transpose and Append | Missing Analysis and Treatment |
EXPLORATORY DATA Outlier Analysis and Treatment | Summarizing and Visualizing the Important
ANALYSIS WITH R Characteristics of Data | Univariate, Bivariate Analysis | Crosstabs,
Correlation
What is Regression Analysis? | Limitations of Regression | Covariance and
Correlation | Multivariate Analysis | Assumptions of Linearity Hypothesis
Testing | Limitations of Regression | Implementing Simple & Multiple Linear
LINEAR REGRESSION Regression | Making Sense of Result Parameters | Model Validation |
Handling other issues/assumptions in Linear Regression | Handling Outliers,
Categorical Variables, Autocorrelation, Multicollinearity, Heteroskedasticity
Prediction and Confidence Intervals
Implementing Logistic Regression | Making Sense of Result Parameters: Wald
LOGISTIC Test, Likelihood Ratio Test Statistic, Chi-Square Test Goodness of Fit Measures |
REGRESSION
Model Validation: Cross Validation, ROC Curve, Confusion Matrix
Introduction to Predictive Modeling with Decision Trees | Entropy &
Information Gain | Standard Deviation Reduction (SDR) | Overfitting Problem
DECISION TREES Cross Validation for Overfitting Problem | Running as a Solution for
Overfitting

LINEAR LDA Objective | Why Discriminant Analysis? | Discriminant Function |


DISCRIMINANT Assumption of LDA | Advantages & Disadvantages of LDA | Applications of
ANALYSIS LDA | LDA for Classification

DATA SCIENCE WITH PYTHON


Anaconda Installation | Walk-through of Jupyter | Python Basics | Data
BASICS OF PYTHON Structures in Python | Control & Loop Statements in Python | Functions &
FOR DATA SCIENCE Classes in Python | Working with Data
Data Acquisition (Import & Export) | Indexing | Selection and Filtering Sorting
DATA FRAME & Summarizing | Descriptive Statistics | Combining and Merging Data Frames
MANIPULATION WITH Removing Duplicates | Discretization and Binning | String Manipulation |
PANDAS
Matplotlib | Numpy
EXPLORATION What is EDA? | Processes in EDA | Handling Data Types | Univariate and
DATA ANALYSIS WITH Bivariate Analysis | Hypothesis Testing
PYTHON
Understand Time Series Data | Visualizing Time Series Components |
TIME SERIES Exponential Smoothing | Holt's Model | Holt-Winter's Model | ARIMA
Time Series Analysis | White Noise | Python Implementation | Feature
ARCH & GARCH Engineering for Time Series Data
What is Clustering? | K-means Algorithm | Types of Clustering | Evaluating
CLUSTERING
K-means Clusters
DIMENSIONALITY Principal Component Analysis (PCA) | Scree Plot | One-eigenvalue Criterion |
REDUCTION Factor Analysis
Machine Learning Modelling Flow | How to treat Data in ML | Parametric &
INTRODUCTION TO Non-parametric ML Algorithm | Types of Machine Learning | Scikit-Learn
MACHINE LEARNING Library
Introduction to Linear Regression | Linear Regression Using Gradient Descent
LINEAR REGRESSION Linear Regression Using OLS | Linear Regression Using Stochastic Gradient
Descent
CURRICULUM

LOGISTIC Introduction to Logistic Regression | Logistic Regression Using Stochastic


REGRESSION Gradient Descent
Performance Measures | Bias-Variance Trade-Off | Overfitting & Underfitting
MODEL TUNING Optimization Techniques
K Nearest Neighbour | Understanding KNN | Voronoi Tessellation |
KNN Choosing K | Distance Metrics - Euclidean, Manhattan, Chebyshev

DECISION TREE & Decision Tree | Fundamental Concepts of Ensemble | Hyper-Parameters |


RANDOM FOREST Bagging - Extra Trees, Random Forest | Boosting - AdaBoost, Gradient Boosting
Vector | Support Vector Machines (SVM) | Understanding Hyperplane |
SVM Perceptron Algorithm | SVM Kernels | SVM Optimization | Applications of
SVM

SQL PROGRAMMING
Introduction to SQL | DDL & DML Statement | Select Statement |
BASIC SQL Aggregate Functions | Where, Order By, Distinct, Group By, Like, And &
Or Clause | Update & Delete Query
Joins | Union, Union all, Intersect | Using Views & Indexes |
ADVANCE SQL
Sub Queries | Null Values & Date Function

DATA VISUALIZATION WITH TABLEAU


Introduction to Visualization | Working with Tableau | Visualization in Depth
TABLEAU BASICS Data Organisation | Advanced Visualization | Mapping | Enterprise
Dashboards Data Presentation
BEST PRACTICES FOR Have a Methodology | Know Your Audience | Define Resulting Actions |
DASHBOARDING AND Classify Your Dashboard | Profile Your Data | Use Visual Features Properly |
REPORTING Design Iteratively

JOB READINESS

Resume Building
1:1 Career Capstone Project
and Interview Prep Mock Interviews
Mentorship Presentation
workshop

PROJECTS

Property Price Prediction Real Estate Price Prediction


using Linear Regression in R using Linear Regression

Bank Credit Card Default Prediction Identifying Good and Bad


using Logistic Regression in R Customers for Granting Credit

Predict Wine Quality with Decision Breast Cancer Prediction - KNN


Tree (Regression Trees * Classifier & How to Choose the
Classification Trees) K Value

Multi-Class Classification with Bank Marketing Analytics - Decison


Linear Discriminant Analysis Tree & Random Forest Classifier

Forecasting and Predicting the Default Prediction of Credit Card


Furniture Sales using ARIMA Clients - SVM Classifier using
Different Kernels

Reduce Data Dimensionality for a


Data query with SQL
House Attribute Dataset using PCA

Use K-means Clustering to Group


Teen Students into Segments for Building Tableau Dashboard
Targeted Marketing Campaigns
KEY HIGHLIGHTS

KPMG IN INDIA-ENDORSED CURRICULUM


Cutting-edge and industry-relevant curriculum covering Data
Science with Python, R, SQL & Tableau, developed in consultation
with data science experts from different industries.

PROJECT-BASED LEARNING: HANDS-ON PROJECTS &


REAL-BUSINESS CASE STUDIES
The Prodegree provides an edge through our unique project-
based methodology, focusing on 14 hands-on projects, as well as
real-business case studies and capstone projects from KPMG
in India, where you learn by implementing data science
concepts on real business problems.

PROJECT MENTORSHIP BY KPMG IN INDIA EXPERTS


KPMG in India mentors will guide you to work on and deliver
your capstone project – a real-business project shared by
KPMG in India.

DEDICATED CAREER MENTORSHIP


A dedicated industry mentor with over a decade of experience
to guide you on the most suitable career path based on your
skills and interests and resolve your career-related queries.

PROJECT PORTFOLIO
Build a demonstrable project portfolio on Github and showcase
it to potential employers.

CAREER ASSISTANCE
The Imarticus Career Assistance Services team prepares you to be
job-ready through extensive interview prep, resume building &
mock interviews. Interested candidates also get job leads from our
placement partner group.

CERTIFICATION
On completion of the Data Science Prodegree, aspirants will receive an industry endorsed
Certificate of Completion , which is co-branded by KPMG in India and Imarticus Learning.

Knowledge Partner:
Knowledge Partner:

Certification of Completion
Awarded to

Tejas Patil
upon successful completion of the curriculum
as prescribed by the Institute for

Data Science Prodegree

Nikhil Barshikar Harish Thakkar


Director Head of Faculty

Imarticus Learning Private Limited February, 2019 www. imarticus.org


FACULTY

VINAY BORHADE DR D. PRADEEP KUMAR ARUNKUMAR NAIR


Vinay’s tech expertise includes Dr D. Pradeep Kumar holds Arunkumar has over 19 years
AI – Machine Learning, Python, over seven years of research of experience in IT, big data
PL-SQL, and Big Data – experience in machine learning, analytics, data visualization,
Netezza, Java/J2EE. Having data mining, soft computing, data warehouse, 24X7 DBA,
served more than 10 years time series forecasting and Cloud and application projects.
with Bank of America (Merrill related topics. He holds a UGC- He has worked on multiple
Lynch), he has worked on NET lectureship and GATE CS. projects for clients like Rocky
projects like Finance, Liquidity His specialities include research Mountain, Navteq (Nokia),
and Capital Risk (Regulatory and development of various M&T Bank, WeightWatchers,
Reporting) and has won repeat soft computing hybrid models Hollywood Media, SHRM USA.
business from clients for BOA of time series forecasting and Arunkumar is passionate
using technologies like applying them in banking and about solving complex
Machine Learning, Capitalize: finance and related domains. business problems with data
Data Analytics, Quartz, Python, Pradeep has been nominated science for multiple industries.
IBM Netezza, Oracle by Analytics India Magazine as He has trained and helped
(Hexadata). one of the top ten most multiple batches to transition
prominent data science into data science industry.
academicians in India.

Indicative Faculty**

CAREER ASSISTANCE

The Career Assistance team at Imarticus provides support throughout the program to
guide and help navigate ample career options.

1 RESUME BUILDING 2 INTERVIEW PREP 3 PLACEMENT PORTAL

We help you refine and We prepare you to ace the We give you unlimited
polish your resume with technical interview rounds access to our private and
tips to help you land with model interview Q&A public leads and
your coveted job and extensive mock references on our
interviews placement portal

COLLABORATION WITH KPMG IN INDIA


KPMG in India is the knowledge partner for the Data Science Prodegree course.
KPMG in India contributes to this partnership by providing real business case
studies, capstone projects and mentoring learners to deliver their projects. The
course is industry-aligned and teaches you what employers need.

Certificate Co-branded Capstone Projects


by KPMG in India

Project Mentorship Real Business Case


by KPMG in India Experts Studies

www.imarticus.org
SCAN THE QR
FOR MORE INFO.

You might also like