You are on page 1of 15

What is this document about?

• Anatomy of Statistical Modeling


• It is a step by step approach towards building analytical projects as it reflects
what is to be done next in the journey.

• Stage by Stage breakdown of activities

• This tool will always help you in the process and bring you out of any confusion
ever.

• It is designed to know how a project is done in the analytics and statistical


areas, as to what comes after what and which technique needs to be used.
How to clean data, prepare data, and model data.
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

5
Model Evaluation
4
Model Selection
3 and Building
Find out how
Accurate are the
Data Preparation
2 Identify Best-Fit
Predictions

Data Discovery ML Algorithm


1 and Collection
Preparing the
Business Data for Model
Problem
Gathering Data

Understanding
Business
Problem
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building

 Understand Project
Objectives

 Define the Problem

 Investigate Question and


gather Requirements

 Convert into a Statistical


Problem
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building

 Understand Data

 Define Variables and


Create Data Dictionary

 Validate for Correctness


Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building

 Univariate Analysis

 Data Cleaning

 Feature Engineering

 Bivariate Analysis and


Hypothesis Testing

 Data Split
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building

Box Plot
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building

 Regression

 Classification

 Clustering

 Association Rule Mining,


etc.
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building

 Score And Predict using


Test Sample

 Check the Robustness


and Stability of the Model

 Check Model
Performance, Accuracy,
ROC, AUC, KS, etc.
Anatomy of Statistical Model
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building
A step-by-step approach to solve a Business Problem

Data Model
Business Data Model
Discovery and Selection and
Problem Preparation Evaluation
Collection Building

 Understand Project  Understand Data  Univariate Analysis  Regression  Score And Predict
Objectives architecture  Data Cleaning using Test Sample
• Outlier Treatment  Classification
 Define the Problem  Data List Preparation • Missing Value  Check the
and Identification of Treatment  Clustering Robustness and
 Investigate Question Data Sources  Feature Engineering Stability of the
and gather • Variable Creation  Association Rule Model
Requirements  Collect Initial Data • Data Mining, etc.
Transformation  Check Model
 Convert into a  Define Variables and • Dimension Performance,
Statistical Problem Create Data Reduction Accuracy, ROC,
Dictionary  Bivariate Analysis AUC, KS, etc.
and Hypothesis
 Validate for Testing
Correctness  Data Split
• Training Set
• Testing Set

Business Impact : Return on Investment (ROI)

You might also like