Welcome
Hi, my name is Rahmat
Analytics blogger, Telco & Banking Data Analyst, Quant Researcher
Currently, I’m working as a Data Scientist at XL Axiata
LinkedIn: linkedin.com/in/rahmathidayatullah
Copyright © 2024, RevoU. Do Not Distribute.
Objectives
You can understand & implement:
1. Understand the process of data analytics
2. Understand problem, issue tree, and hypothesis
3. Able to use Excel, e.g. simple formulas and data viz
4. Understand how to make a good presentation
My career journey 💻
Milestones, 2010 to current:
● While finishing final thesis, got offered in research company
● Got Bachelor of Statistics
● Join Bank Negara Indonesia as credit analyst
● Promoted as manager of PMO in BNI 46
● Join XL Axiata as demand forecast analyst
● Promoted as data analytics specialist in network analytics
● Move to Data Management Division
● Join RevoU as lecturer
● Promoted as principal data scientist at XL Axiata
Quick Poll: How Analytical Are You?
Or join slido.com with code #3728 187
Data Analytics
Introduction
Do we need data analytics in modern banking?
Data analytics in banking is no longer a luxury; it is a necessity for survival and growth. Banking
is literally a data-driven business: every transaction, customer interaction, and market
fluctuation generates data. To leverage this vast amount of information, analytics becomes
indispensable.
Data analytics provides valuable insights derived
from data, enabling decision-makers to make
informed and evidence-based decisions. This
reduces reliance on intuition and guesswork.
Importance of Data Analytics
Risk mitigation
Banking is inherently risky, as it involves managing funds between depositors and borrowers. Banks aim to profit by balancing risk and return, while adhering to strict regulations. Effective risk ...
Enhance customer experience
By understanding customer behavior, preferences, and needs through data analysis, banks can offer personalized products, services, and interactions. This leads to increased customer ...
Increase operational efficiency
By analyzing vast amounts of data, banks can identify bottlenecks, optimize processes, and allocate resources effectively. For instance, predictive analytics can forecast ...
Data
“Data is raw, unorganized information that can be collected and
processed to reveal patterns and trends. It can be in various
forms, such as numbers, text, images, or audio. Essentially, it's
the foundation for generating insights and making informed
decisions.”
What is Data Analytics
Data is collected in raw form, processed according to the requirements of a
company, and then used for decision making.
Data analysis is the process of gathering, cleaning, analyzing, and
mining data, interpreting results, and reporting the findings.
With data analysis, we find patterns within data and
correlations between different data points to gain insights from
the data and derive conclusions for a better decision-making
process.
To make it simple.. Think LEGO!
Imagine you have a massive box of LEGO bricks representing a vast amount of data. Each LEGO
brick is like a data point, and the colors, sizes, and shapes of the bricks represent different attributes
and values within your dataset.
First, you collect a diverse set of LEGO bricks from
various sources. These bricks may have different
colors, sizes, and shapes, reflecting the variety of
data in your dataset.
Before you can build something meaningful, you need to
clean and organize your LEGO bricks. You remove any
damaged or duplicate bricks, ensuring that each brick is
in good condition.
Now that you have your cleaned LEGO bricks, you start
building simple structures to explore their possibilities.
These structures represent exploratory data analysis,
where you visually and statistically examine the basic
characteristics and relationships in your dataset.
4 Types of Data Analytics
Your basic skill set
Data Analytics Process
Analysis is an interactive process of a person tackling a problem, finding the data required to get an answer, analyzing that data, and interpreting the results in order to provide a recommendation for action.
Analytics Process stages:
1. Business Understanding: Determine the business objective, assess the situation, determine data mining goals and then produce the project plan as per the requirement
2. Data Exploration: To gather initial data, describe and explore data and verify data quality to ensure it contains the data we require
3. Data Preparation: Select data as per the need, clean it, construct it to get useful information and then integrate it all
4. Data Modeling: Select a modeling technique, generate test design, build a model and assess the model built
5. Data Evaluation: Evaluate the results of the test cases and review the scope of errors in this phase.
6. Deployment: Plan the deployment, monitoring and maintenance and produce a final report and review the project
Different Role
in Data Team
Data Team Roles
Data Analyst (DA)
Summary: Data analysts are generally more entry level with some programming skills, decent knowledge of mathematics, and good business acumen
Job Responsibility:
・Collecting and interpreting data
・Identify trends & patterns in data
・Analysing results
・Data visualisation and reporting the results
Skill Sets:
・Statistical tools like Excel, SPSS, SAS
・SQL
・Python
・Data Manipulation & Visualisation
・Probability & Statistics
Data Engineer (DE)
Summary: Data engineers are professionals with advanced software development skills and expert knowledge in databases
Job Responsibility:
・Providing technical skills to maintain & process large datasets
・Managing the entire data lifecycle: ingestion, processing, surfacing, & storage
Skill Sets:
・Big Data
・Apache Hadoop
・NoSQL
・Spark
Data Scientist (DS)
Summary: Data scientists are professionals with an expertise in programming, mathematics, and business analytics
Job Responsibility:
・Undertaking data collection, pre-processing and analysis
・Building models to address business problems
・Presenting information using data visualization techniques
Skill Sets:
・Data Structures
・Strong Programming skills
・Machine Learning
・Big Data
・Cloud Computing System Design
Machine Learning Engineer (MLE)
Summary: A person in IT who focuses on researching, building and designing self-running artificial intelligence (AI) systems to automate predictive models
Job Responsibility:
・Designing & developing machine learning and deep learning systems
・Running machine learning tests and experiments
・Implementing appropriate ML algorithms
Skill Sets:
・Statistical tools like Excel, SPSS, SAS
・SQL, Hive
・Machine Learning
・Feature Engineering
・Data Visualization
・Probability & Statistics
Skills needed: relationship for each role
[Venn diagram of three skill areas: Maths, Stats, Algorithms; Software Engineering; Data Communication & Domain Expertise. The Data Engineer, Data Analyst, and Data Scientist roles are placed among them. Annotations: "Now you are learning this" and "Already have domain expert". Source: Udacity]
What is Data Journalism?
Uncovering hidden insights through
analysis and visualization. Elevating
storytelling with the power of numbers,
providing context, and deepening
understanding beyond traditional
reporting
All of these things come together to
create a comprehensive view of an
evidence-based story to help people
make better, actionable decisions.
Analytics Workflow
Business Understanding → Data Understanding → Data Preparation → Exploratory Analysis / Modeling → Validation → Visualization
● Business Understanding / Data Understanding: PRD document. Consists of Problem Statement, Objective, in-scope/out-of-scope metrics
● Data Preparation: Data has been selected, cleaned, and integrated into the format
● Exploratory Analysis / Modeling: An insight/analysis, or build a model and assess the model built
● Validation: Assess your output model / findings. Triple check it
● Visualization: Chart, diagram, dashboard, tracker, etc. Communicating your findings
Life as Data
Analyst
● ICT: Information and Communication Technology
Read more about life as
data analyst
5 min
Q and A
Problem
Understanding
Understanding Root Causes
Root Cause vs Symptoms
Symptoms
● A symptom is a sign or indication of a root cause, but it is not, by itself, a cause.
● Most often, symptoms lack specificity and are difficult to categorize.
● It is very difficult to develop solutions to directly address a symptom.
Causes
● A root cause is the real reason for a business or performance problem.
● Root causes are specific and can be easily categorized.
● Solutions can be implemented that either remove or mitigate the effects of a root cause.
SMART Problem Statement
SMART is an acronym used when creating objectives to define a set of criteria that are
easy to understand and to know when they have been fulfilled.
A good problem statement should be:
● Specific – target a specific area for improvement
● Measurable – quantify or show an indicator of progress
● Attainable/Achievable – objectives need to be agreed,
attainable, and able to be implemented
● Realistic – state what results can realistically be achieved,
given available resources
● Time-bound – there need to be deadlines, but are they
reasonable?
How to Structure your Problem
● Without structure, your ideas won’t stand up.
● Use structure to strengthen your thinking.
● Feel free to be MECE
● Don’t reinvent the wheel
● Every problem is unique
Breaking the problem before them into its component elements
MECE Principle
MECE stands for mutually exclusive, collectively exhaustive
Not MECE vs MECE
NOT MECE MECE
How to MECE
MECE stands for mutually exclusive, collectively exhaustive
It is a framework for solving complicated
problems.
When you apply it to a problem, you break that
problem into subproblems that are:
mutually exclusive (they don’t overlap) and
collectively exhaustive (they cover all
possibilities)
When you have a complicated problem, that’s
when MECE will be most helpful.
Partitioning the problem into smaller problems
makes it easier to solve.
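When the subproblems can be written as sets, the two MECE conditions can be checked mechanically. The following is a minimal Python sketch; the helper name `is_mece` and the age-band segments are made up for illustration and are not part of the course material.

```python
def is_mece(universe, parts):
    """Return (mutually_exclusive, collectively_exhaustive) for a list of sets."""
    seen = set()
    exclusive = True
    for part in parts:
        if seen & part:  # an item already appeared in an earlier part: overlap
            exclusive = False
        seen |= part
    exhaustive = seen == set(universe)  # every item is covered, nothing extra
    return exclusive, exhaustive


# Hypothetical example: splitting customers by age band.
customers = {"under_25", "25_to_40", "41_to_60", "over_60"}
good_split = [{"under_25"}, {"25_to_40", "41_to_60"}, {"over_60"}]
bad_split = [{"under_25", "25_to_40"}, {"25_to_40", "41_to_60"}]  # overlaps and misses over_60

print(is_mece(customers, good_split))  # (True, True)
print(is_mece(customers, bad_split))   # (False, False)
```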
Methods to Find Root Cause
● Pareto Principle
● 5 Whys
● Issue Tree
You can use the Pareto principle to prioritize which root cause is the most important.
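One common way to apply the Pareto principle is to rank causes by frequency and keep the largest ones until they explain most of the total. The sketch below assumes the conventional 80% cut-off; the function name `pareto_vital_few` and the complaint counts are hypothetical.

```python
def pareto_vital_few(counts, threshold=0.8):
    """Return causes, largest first, until their cumulative share reaches threshold."""
    total = sum(counts.values())
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    selected, running = [], 0
    for cause, count in ranked:
        selected.append(cause)
        running += count
        if running / total >= threshold:
            break
    return selected


# Hypothetical complaint counts for a bank branch (illustrative numbers only).
complaints = {
    "long queue": 45,
    "app login error": 30,
    "card delivery delay": 10,
    "fee dispute": 8,
    "other": 7,
}

vital_few = pareto_vital_few(complaints)
print(vital_few)  # ['long queue', 'app login error', 'card delivery delay']
```

Here three of the five causes account for 85 of the 100 complaints, so they are the ones to investigate first.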
5 Whys
The 5 Whys method is part of the Toyota Production System. Developed by Sakichi Toyoda, a Japanese inventor
and industrialist, the technique became an integral part of
the Lean philosophy.
When applying the 5 Whys technique, you
want to get to the problem's essence
and then fix it. Actually, the 5 Whys may
show you that the source of the problem
is quite unexpected.
Keep in mind that “5” is just a number.
Ask “Why” as many times as you
need to complete the process and take
appropriate actions.
Another example
5 Whys
How to conduct the 5 Why
technique:
1. Write down the specific problem.
2. Ask Why the problem happens
and write the answer down below
the problem.
3. If the answer doesn’t identify the
root cause of the problem in Step
1, ask Why again and write that
answer down.
4. Loop back to step 3 until the
team is in agreement that the
problem’s root cause is identified.
When to Use a 5 Whys Analysis
You can use 5 Whys for troubleshooting, quality improvement, and
problem solving, but it is most effective when used to resolve
simple or moderately difficult problems.
It may not be suitable if you need to tackle a complex or critical
problem.
Issue Tree
● Issue trees break down complex problems into distinct, smaller components. Issue trees typically take on the following structure
● An issue tree is the series of questions or issues that must be addressed to prove or disprove a hypothesis.
● An issue tree is simply the laying out of issues and sub-issues into a MECE visual progression
● See MECE material on previous topic
Let’s discuss! I need your expertise to help me construct this issue tree.
Profit in branch Surabaya last 3 months?
  Revenue decrease
    Decline in interest income
    Decrease in fee-based income
    Reduction in other income
  Cost increase
    Rise in operating expenses
    Increase in personnel costs
    Higher bad debt provisions
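The Surabaya issue tree can also be held as a nested structure, which makes it easy to list the leaf issues that still need to be checked against data. This is only an illustrative Python sketch; the variable and function names are assumptions, and the node labels are taken from the slide.

```python
issue_tree = {
    "Profit in branch Surabaya last 3 months?": {
        "Revenue decrease": [
            "Decline in interest income",
            "Decrease in fee-based income",
            "Reduction in other income",
        ],
        "Cost increase": [
            "Rise in operating expenses",
            "Increase in personnel costs",
            "Higher bad debt provisions",
        ],
    }
}


def leaves(node):
    """Yield every leaf issue, i.e. the items that can be tested with data."""
    if isinstance(node, dict):
        for child in node.values():
            yield from leaves(child)
    else:
        yield from node  # a list of leaf labels


hypotheses = list(leaves(issue_tree))
print(len(hypotheses))  # 6
```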
Exploratory Data
Analysis with
Excel
Data Analysis Process
1. Data Requirement Gathering
2. Data Collection
3. Data Cleaning
4. Data Analysis
5. Data Interpretation
6. Data Visualization
Dataset
Functions for Data Manipulation in Excel
● Basic function (SUM, AVERAGE, MEDIAN, COUNT, COUNTA)
● Logical function (IF, IFS, AND, OR)
● Combine logical and basic function (SUMIF, COUNTIF)
● Multiple condition (SUMIFS, COUNTIFS)
SUM
Returns the sum of a series of numbers and/or cells
https://support.google.com/docs/answer/3093583?hl=en&ref_topic=3105474
SUM
https://support.google.com/docs/answer/3093669?hl=en
SUMIF
Returns a conditional sum across a range.
Sample Usage
SUMIF(A1:A10,">20")
SUMIF(A1:A10,"Paid",B1:B10)
Syntax
SUMIF(range, criterion, [sum_range])
range - The range which is tested against criterion.
criterion - The pattern or test to apply to range.
sum_range - [ OPTIONAL ] - The range to be summed, if different from range.
https://support.google.com/docs/answer/3093583?hl=en&ref_topic=3105474
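For readers who think in code, the logic of SUMIF can be sketched in Python. This is an illustration of the idea, not how the spreadsheet implements it: the helper `sumif` is hypothetical, and the criterion is passed as a Python function instead of a string such as ">20".

```python
def sumif(test_range, predicate, sum_range=None):
    """Sum the cells of sum_range whose matching cell in test_range passes predicate."""
    if sum_range is None:
        sum_range = test_range  # same default as SUMIF with two arguments
    return sum(value for cell, value in zip(test_range, sum_range) if predicate(cell))


# Made-up sample columns.
amounts = [10, 25, 30, 5]
status = ["Paid", "Unpaid", "Paid", "Paid"]

over_20 = sumif(amounts, lambda x: x > 20)                # like SUMIF(A1:A4,">20")
paid_total = sumif(status, lambda s: s == "Paid", amounts)  # like SUMIF(B1:B4,"Paid",A1:A4)
print(over_20, paid_total)  # 55 45
```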
SUMIF
https://support.google.com/docs/answer/3093583?hl=en&ref_topic=3105474
COUNT
Returns the number of numeric values in a dataset.
Sample Usage
COUNT(A2:A100,B2:B100,4,26)
COUNT(1,2,3,4,5,C6:C20)
Syntax
COUNT(value1, [value2, ...])
value1 - The first value or range to consider when counting.
value2, ... - [ OPTIONAL ] - Additional values or ranges to consider when counting.
https://support.google.com/docs/answer/3093620?hl=en
COUNT
Count only numeric values,
Including those which appear more than once.
Text values are ignored.
https://support.google.com/docs/answer/3093620?hl=en
What’s the difference between COUNT and COUNTA?
COUNTA: Returns the number of records that aren’t blank in dataset.
COUNT: Returns the number of records that contain numeric values in dataset.
https://support.google.com/docs/answer/3093620?hl=en
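The difference is easiest to see on a column that mixes numbers, text, and blanks. In this Python sketch, `None` stands in for an empty cell and the two helper functions are hypothetical stand-ins for the spreadsheet functions.

```python
def count(cells):
    """Like COUNT: only numeric cells are counted; text is ignored."""
    return sum(isinstance(c, (int, float)) and not isinstance(c, bool) for c in cells)


def counta(cells):
    """Like COUNTA: every cell that is not blank is counted."""
    return sum(c is not None for c in cells)


# Made-up column: two numbers, text, a blank, a number stored as text, a repeated number.
column = [12, "N/A", None, 7.5, "12", 12]

print(count(column), counta(column))  # 3 5
```

Note that the repeated value 12 is counted twice, and the text "12" is counted by `counta` only.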
COUNTIF
https://support.google.com/docs/answer/3093480?hl=en
Logical Function
IF
○ =IF(A1>10, "Yes", "No")
○ Checks whether a condition is met and returns one value if true and another if false.
AND, OR
○ =AND(A1>10, B1<5)
○ =OR(A1>10, B1<5)
○ Checks multiple conditions and returns TRUE or FALSE.
SUMIF and SUMIFS
● =SUMIF(A1:A10, ">10", B1:B10)
● =SUMIFS(B1:B10, A1:A10, ">10", C1:C10, "Yes")
● Adds up values based on one or multiple criteria.
COUNTIF and COUNTIFS
● =COUNTIF(A1:A10, ">10")
● =COUNTIFS(A1:A10, ">10", B1:B10, "Yes")
● Counts the number of cells that meet one or multiple criteria.
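With several criteria, SUMIFS and COUNTIFS keep only the rows where every condition holds (an AND across criteria). A small Python sketch of that behaviour, using made-up rows and hypothetical column names:

```python
# Each dict is one spreadsheet row; the column names are illustrative only.
rows = [
    {"score": 12, "amount": 100, "approved": "Yes"},
    {"score": 8, "amount": 50, "approved": "Yes"},
    {"score": 15, "amount": 70, "approved": "No"},
    {"score": 20, "amount": 30, "approved": "Yes"},
]

# Both criteria must be true for a row to qualify.
matches = [r for r in rows if r["score"] > 10 and r["approved"] == "Yes"]

countifs_result = len(matches)                     # rows meeting all criteria
sumifs_result = sum(r["amount"] for r in matches)  # amounts of those rows
print(countifs_result, sumifs_result)  # 2 130
```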
IFS Function
Syntax
IFS(condition1, value1, [condition2, value2, …])
condition1 - The first condition to be evaluated. This can be a boolean, a number, an array, or a
reference to any of those.
value1 - The returned value if condition1 is TRUE.
condition2, value2, …- Additional conditions and values if the first one is evaluated to be false.
IF Function vs IFS Function
Syntax
● IF: =IF(condition1, value_if_true1, IF(condition2, value_if_true2, IF(condition3, value_if_true3, ...)))
● IFS: =IFS(condition1, value_if_true1, condition2, value_if_true2, condition3, value_if_true3, ...)
Example use case: Categorizing scores
Example code
● IF: =IF(A1>=90, "A", IF(A1>=80, "B", IF(A1>=70, "C", IF(A1>=60, "D", "F"))))
● IFS: =IFS(A1>=90, "A", A1>=80, "B", A1>=70, "C", A1>=60, "D", A1<60, "F")
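The key point in both formulas is that conditions are checked in order and the first true one wins, which is why the score bands must go from highest to lowest. A Python sketch of the same rule list (the function name `grade` is hypothetical):

```python
def grade(score):
    """Return the label of the first rule whose condition is true, as IFS does."""
    rules = [
        (lambda s: s >= 90, "A"),
        (lambda s: s >= 80, "B"),
        (lambda s: s >= 70, "C"),
        (lambda s: s >= 60, "D"),
        (lambda s: s < 60, "F"),
    ]
    for condition, label in rules:
        if condition(score):
            return label
    # IFS returns an #N/A error when no condition is true; raise here instead.
    raise ValueError("no condition matched")


print(grade(95), grade(85), grade(59))  # A B F
```

If the `s >= 60` rule were listed first, a score of 95 would be labelled "D", because it also satisfies that condition.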
More functions to
clean up the data
More Functions for Data Manipulation in Excel
● Text function (left, right, mid,
concatenate, Len, Trim)
● Date Time Function (year,
month, day, week)
Text Function
CONCATENATE
● =CONCATENATE(A1, " ", B1)
● Joins two or more text strings into one string.
LEFT, RIGHT, MID
● =LEFT(A1, 5)
● =RIGHT(A1, 5)
● =MID(A1, 2, 3)
● Extracts a specified number of characters from a text string.
LEN
● =LEN(A1)
● Returns the length of a text string.
TRIM
● =TRIM(A1)
● Removes extra spaces from a text string.
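The text functions map closely onto string slicing. The Python sketch below mirrors their behaviour on a made-up value; the helper names are hypothetical, and the main thing to notice is that MID counts positions from 1, while Python slices count from 0.

```python
def left(text, n=1):
    """LEFT: first n characters (n=0 gives an empty string)."""
    return text[:n]


def right(text, n=1):
    """RIGHT: last n characters."""
    return text[len(text) - n:] if n < len(text) else text


def mid(text, start, n):
    """MID: n characters beginning at position start, counted from 1."""
    return text[start - 1:start - 1 + n]


def trim(text):
    """TRIM: drop leading/trailing spaces and collapse repeated spaces to one."""
    return " ".join(word for word in text.split(" ") if word)


raw = "  Jakarta   Selatan "  # made-up cell value with messy spacing
clean = trim(raw)

print(clean)             # Jakarta Selatan
print(left(clean, 5))    # Jakar
print(right(clean, 5))   # latan
print(mid(clean, 2, 3))  # aka
print(len(clean))        # 15
```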
Lookup and Reference Function
Text Split Function
The TEXTSPLIT function in Excel is used to split a text string into multiple cells based on a delimiter. This function is particularly
useful when you need to break down a string into its constituent parts. Here’s a guide on how to use the TEXTSPLIT function:
Syntax
=TEXTSPLIT(text, column_delimiter, [row_delimiter], [ignore_empty], [match_mode], [pad_with])
Parameters
● text: The text string to be split.
● column_delimiter: The delimiter that separates columns (e.g., a comma, space, etc.).
● row_delimiter (optional): The delimiter that separates rows.
● ignore_empty (optional): A Boolean value; TRUE ignores consecutive delimiters instead of creating empty cells (default FALSE).
● match_mode (optional): 0 for a case-sensitive delimiter match (default), 1 for case-insensitive.
● pad_with (optional): The value used to pad rows that have fewer items than the widest row (default #N/A).
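To make the roles of the row delimiter, empty-value handling, and padding concrete, here is a rough Python sketch of the same idea. It is a simplified illustration, not Excel's implementation: the function `text_split` is hypothetical, case-insensitive matching is not modeled, and the padding value defaults to `None` here.

```python
def text_split(text, col_delimiter, row_delimiter=None, ignore_empty=False, pad_with=None):
    """Split text into a grid (list of rows) in the spirit of TEXTSPLIT."""
    rows = text.split(row_delimiter) if row_delimiter else [text]
    grid = [row.split(col_delimiter) for row in rows]
    if ignore_empty:
        # Drop empty cells produced by consecutive delimiters.
        grid = [[cell for cell in row if cell != ""] for row in grid]
    width = max(len(row) for row in grid)
    # Pad short rows so that the result is rectangular.
    return [row + [pad_with] * (width - len(row)) for row in grid]


print(text_split("a,b,c;d,e", ",", ";", pad_with="-"))
# [['a', 'b', 'c'], ['d', 'e', '-']]
print(text_split("a,,b", ","))
# [['a', '', 'b']]
print(text_split("a,,b", ",", ignore_empty=True))
# [['a', 'b']]
```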
Proper Function
Syntax
PROPER(text_to_capitalize)
text_to_capitalize - The text which will be returned with the first letter of each word in
uppercase and all other letters in lowercase.
Good for title-case headlines
Related functions: LOWER() and UPPER()
Concatenate () Function
Syntax
CONCATENATE(string1, [string2, ...])
string1 - The initial string.
string2 ... - [ OPTIONAL ] - Additional strings to append in sequence.
Notes
When a range with both width and height greater than 1 is specified, cell values are appended
across rows rather than down columns. That is, CONCATENATE(A2:B7) is equivalent to
CONCATENATE(A2,B2,A3,B3, ... , A7,B7) .
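The row-by-row order described in the note can be reproduced with a nested loop. In this Python sketch the cell addresses are used as the cell contents so that the order is visible; the grid itself is made up.

```python
# A 2-column, 3-row range; each cell holds its own address for illustration.
grid = [
    ["A2", "B2"],
    ["A3", "B3"],
    ["A4", "B4"],
]

# Across rows first (the order the note describes for CONCATENATE on a range).
across_rows = "".join(cell for row in grid for cell in row)

# Down columns first, shown only for contrast.
down_columns = "".join(cell for column in zip(*grid) for cell in column)

print(across_rows)   # A2B2A3B3A4B4
print(down_columns)  # A2A3A4B2B3B4
```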
Left () Function
Syntax
LEFT(string, [number_of_characters])
string - The string from which the left portion will be returned.
number_of_characters - [ OPTIONAL - 1 by default ] - The number of characters to
return from the left side of string.
Notes
0 is a valid input for number_of_characters and will cause LEFT to return the empty
string.
Demo Time!
More Function to Clean Up
Data :)
Pivot Table and Filter
Pivot Tables
Add or edit pivot tables
1. On your computer, open a spreadsheet.
2. Select the cells with source data you want to use. Important: Each column needs a header.
3. In the menu at the top, click Data > Pivot table. Click the pivot table sheet, if it’s not already open.
4. In the side panel, next to "Rows" or "Columns," click Add, then choose a value.
Note: Sometimes, you'll see recommended pivot tables based on the data you choose. To add a
pivot table, under "Suggested," choose a pivot table.
5. In the side panel, next to "Values," click Add, then choose the value you want to see over your rows or
columns.
6. You can change how your data is listed, sorted, summarized, or filtered. Next to what you want to
change, click the Down Arrow.
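Conceptually, a pivot table groups the rows by the fields chosen for "Rows" and "Columns" and aggregates the field chosen for "Values". A minimal Python sketch of that grouping, assuming a SUM aggregation and made-up sales rows with hypothetical field names:

```python
from collections import defaultdict

# Made-up source table: each dict is one row, each key is a column header.
sales = [
    {"region": "East", "brand": "X", "value": 100},
    {"region": "East", "brand": "Y", "value": 50},
    {"region": "West", "brand": "X", "value": 70},
    {"region": "East", "brand": "X", "value": 30},
]

# Rows = region, Columns = brand, Values = SUM of value.
pivot = defaultdict(lambda: defaultdict(int))
for row in sales:
    pivot[row["region"]][row["brand"]] += row["value"]

for region, brands in pivot.items():
    print(region, dict(brands))
# East {'X': 130, 'Y': 50}
# West {'X': 70}
```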
Table source for pivot
Dataset
Pivot Tables Result
Pivot Result
What is Exploratory Data Analysis (EDA)?
Exploratory data analysis is an approach of analyzing
data sets to summarize their main characteristics, often
using statistical graphics and other data visualization
methods. A statistical model can be used or not, but
primarily EDA is for seeing what the data can tell us
beyond the formal modeling or hypothesis testing task.
The main characteristics of the data can be uncovered
starting with descriptive statistics.
EDA in practice
Exploratory Data Analysis (EDA) is the process of analyzing and visualizing a
dataset to understand its characteristics, identify patterns, spot anomalies, and
formulate hypotheses. EDA involves several steps such as data cleaning, summary
statistics, visualization, and identifying relationships between variables.
Step-by-Step EDA
1. Data Loading and Initial Inspection
○ Inspect the data to understand its structure, data types, and completeness.
2. Data Cleaning
○ Check for missing values and handle them appropriately.
○ Convert data types if necessary (e.g., dates, numerical values).
○ Handle any inconsistencies or errors in the data.
3. Summary Statistics
○ Generate summary statistics for numerical columns (e.g., mean, median, standard deviation).
○ Analyze categorical columns (e.g., count, unique values).
4. Data Visualization
○ Create visualizations to understand the distribution of variables (e.g., histograms, box plots).
○ Visualize relationships between variables (e.g., scatter plots, correlation matrices).
5. Identify Patterns and Insights
○ Look for patterns, trends, and anomalies in the data.
○ Analyze relationships and correlations between variables.
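Steps 1 to 3 can be sketched with nothing more than Python's standard library. The records below are made up, and dropping rows with missing values is only one possible way to handle them; the right choice depends on the dataset.

```python
import statistics
from collections import Counter

# Step 1: load and inspect (a tiny made-up dataset with hypothetical fields).
records = [
    {"region": "East", "sales_value": 120.0},
    {"region": "West", "sales_value": None},  # missing value
    {"region": "East", "sales_value": 80.0},
    {"region": "North", "sales_value": 100.0},
]

# Step 2: check for missing values and handle them (here: drop them).
values = [r["sales_value"] for r in records if r["sales_value"] is not None]
missing = len(records) - len(values)

# Step 3: summary statistics for a numerical column and a categorical column.
summary = {
    "mean": statistics.mean(values),
    "median": statistics.median(values),
    "stdev": statistics.stdev(values),  # sample standard deviation
}
region_counts = Counter(r["region"] for r in records)

print(missing)
print(summary)
print(region_counts.most_common())
```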
Types of Data
Dimensionality of Data Sets
● Univariate: Measurement made on one variable per
subject
● Bivariate: Measurement made on two variables per
subject
● Multivariate: Measurement made on many variables per
subject
Univariate Analysis
Scatter Plot
There seems to be no correlation,
judging visually from the scatter plot
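A visual impression from a scatter plot can be backed up with a correlation coefficient. The sketch below computes Pearson's r by hand on made-up numbers; note that Pearson's r only measures linear association, so a value near 0 does not rule out a non-linear relationship.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equally long lists of numbers."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    covariance = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    spread_x = sqrt(sum((x - mean_x) ** 2 for x in xs))
    spread_y = sqrt(sum((y - mean_y) ** 2 for y in ys))
    return covariance / (spread_x * spread_y)


xs = [1, 2, 3, 4, 5]
linear = [2, 4, 6, 8, 10]   # rises in step with xs
scattered = [3, 1, 4, 1, 3]  # no linear pattern

r_linear = pearson(xs, linear)        # very close to 1
r_scattered = pearson(xs, scattered)  # very close to 0
print(round(r_linear, 3), round(r_scattered, 3))
```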
Dashboarding
with Excel
Business Use Case
As a data analyst, you want to explore a dataset that contains sales data for
multiple brands and industries. There are no business questions yet, but you want to get familiar with the data and
build a dashboard for generating business insights. The dataset consists of a few columns:
- Date: Exact date of each sales transaction.
- Year & Month: Year and month of the sales transaction.
- Region: Region where the sales transaction happened.
- Sales Rep: Name of the sales representative who supported the sales transaction.
- Brand: Brand of the products sold in the transaction.
- Customer Industry: Industry of the customer in the transaction.
- Sales Channel: Sales channel of the transaction.
- Sales Value: Sales amount in dollars.
Business Use Case
Let’s explore the dataset first and think about the metrics that we want to use in the dashboard.
Please refer to the earlier Step-by-Step EDA slide to define the metrics here.
Questions to answer based on the raw data:
● Which numerical variables do we want to check/analyse individually?
○ Sales Value, # Transaction
● Which categorical variables do we want to check/analyse individually?
○ Date, Year & Month, Region, Sales Rep, Brand, Customer Industry, Sales Channel
● Between which variables do we want to check the relationship?
○ Sales Rep x Sales Value
○ Region x Sales Value
○ Brand x Sales Value
○ Industry x Sales Value
Dashboard-ing
IMPORTANCE OF DATAVIZ
Why Dataviz is so important?
“The human brain processes images
60,000 times faster than text, and
80 percent of information
transmitted to the brain is visual.”
Source: t-sciences.com
Want to make it faster?
Just add a little sort
We represent it better
Another Example
Which category has the lowest average sales from 2013-2016?
Importance of Dataviz
Data viz allows decision makers and executives to weigh the
alternatives of different outcomes of their decisions.
● Helps decision makers make better business decisions by
strategizing the best business outcomes
● Provides answers to key business questions
● Helps executives comprehend information quickly and identify relationships,
patterns, and trends to support their decisions
● Helps communicate the story to others through simplicity, clarity,
intuitiveness, and insightfulness
COMMON USES OF DATAVIZ
Presentation/Explaining
Visuals aim to lead the viewers to:
● describe situations
● answer questions
● support decisions
● communicate information, or
● solve specific problems
Exploratory Analysis
Visuals are designed to offer numerous subsets of data in order to:
● data exploration
● familiarizing the current situations
● identifying an area of interest
● raise questions
● answer questions
● discover opportunities
Confirmatory Analysis
Visuals aim to help viewers understand the traditional statistics tools, for example:
● significance
● inference, and
● confidence interval.
EXAMPLES
Presentation/Explaining
This graphic below could clearly explain the country with the greatest demand for a certain product compared globally, in a concrete month.
Exploratory Analysis
This graph invites us to explore how the linguistic standard of U.S. presidential addresses has declined over time. The visual is interactive and explanatory, in addition to indicating the readability score of various presidents’ speeches.
Confirmatory Analysis
Based on your Exploratory Data Analysis, you now build a new predictive model. You can visualize your elevation (>73 m) and price per square foot (>$19,116.7) observations as the boundaries of regions in your scatterplot. Homes plotted in the green and blue regions would be in San Francisco and New York, respectively.
Data Design & Relationship
Preattentive attributes
These are things that our
brain processes in
milliseconds, before we pay
attention to everything else.
How many 9s are here?
Examples: size, color hue, color value, color value + length
Categorical color scheme showing states in the USA.
This color is unnecessary.
Wexler, S & Shaffer, J & Cotgrave, A. The Big Book of Dashboards
The Use of Color in
Data Visualization
Color should be used in data visualization in
three primary ways: sequential, diverging, and
categorical.
In addition, there is often the need to highlight
data or alert the reader of something important.
Only use color when it’s really necessary. It
needs to have a purpose (other than decorative
purposes)
Color Vision
Deficiency
One common solution among data
visualization practitioners is to use blue
and orange.
Using blue instead of green for good and
orange instead of red for bad works well
because almost everyone (with very rare
exceptions) can distinguish blue and
orange from each other.
This blue-orange palette is often referred
to as being “color-blind friendly.”
Programs to check CVD:
Adobe Illustrator CC, Chromatic Vision
Simulator, NoCoffee vision Simulator
Data Relationships
Practical Approach to Choose
Visualization (Few, 2009)
● Choose a graphic that will capture the viewer’s attention
● Represent the information in a simple, clear, and precise way
● Make it easy to compare data; highlight trends and differences.
● Give the viewer a clear way to explore the graphic and understand its goals
● Establish an order for the elements based on the quantity that they represent; that is, detect maximums and minimums.
Some Examples of Good Charts
Concise, clear segmentation, ordered from highest to
lowest
Use simple color schemes and compare apples to apples
Some Examples of Bad Charts
Get back to
this principle!
Why are we bothering with a
communication training?
Communication is the lifeblood of an
organization. If we can’t do it effectively,
at best we are less effective, at worst we
fail.
1. Everyone is super busy (especially data analysts’ stakeholders) and only has limited time.
2. We need to constantly shift our way of communicating (e.g., offline-to-online, email-to-chat, etc).
3. 70% of communication is non-verbal - this is lost when we are doing everything online
4. Communication relies heavily on context, which varies in international/multicultural companies
Data Analyst skill sets and role in a nutshell
You as Data Analyst
Communication: inspire, influence, suggest, and recommend to C-Levels and Leads for Decision Making
Other skills:
● Problem solving
● Teamwork
● Business Acumen
● Data Analysis
● Technical skills
● Reporting Concepts
What is Good Quality Business Communication?
1. Concise
2. Easy to Understand (by the
audience(s))
3. Clear conclusion/next steps
PS. It’s The Communicator’s Responsibility!
Our goal in this lecture: to create best-in-class business/reporting deck
https://www.oliverwyman.com/content/dam/oliver-wyman/ME/publications/COVID-19-Special-Primer.pdf
Characteristics of great business deck
● Cohesive theme in all slides
○ Templates, layout, theme color are simple and look professional
○ Graphs / charts are created from the same tool
○ Every title in the slide is an insight, followed by details and supported charts
● Concise and insightful information
○ Use a proper communication framework
○ Easy to follow
○ Audience can grasp the main idea / hypothesis from the reports
● Use professional tools
○ Microsoft PowerPoint/Google slides (not Canva or other simplified slides)
○ Charts are attached properly in the slides (not screenshots)
○ Use these powerful tools to enhance the presentation (consulting presentations
from elite firms like McKinsey, Bain, and BCG use these)
■ Think-cell
■ Office Timeline
Communication Structure: Why?
A lot of stuff to communicate
● Data
● Analysis
● Funnels
● Arguments
● Facts
● Findings
● Conclusions
Help audience understand
● Clarify what you are going to say
● Make it easy to follow
Simple Structures
● Bullet points
● Chronological
● By Process Flow
More Complex Structures
● Situation, Complication, Recommendation
● Hypothesis, backed by Supporting Ideas, each backed by Evidence
Let’s begin “Writing” the Presentation
Context:
You’re making a presentation to convince your
partner to purchase a house instead of renting
in a consulting way!
(not in bubbly canva way)
Disclaimer #1: the numbers presented are fictional and used for educational purposes only.
Disclaimer #2: in real life, some analysts may not have an opportunity to create a
business deck, but it’s good to learn how to make one.
Planning Step by Step
1. List: Jot down all important facts related to communication
2. Sort: Group into logical buckets of related items
3. Outline/Structure: Organize into logical storyline
4. Sharpen: Remove stuff not critical to communication
This is the typical outline for a business/strategy report in presentation form
Outline/Structure: Organize into logical storyline
Let’s apply that into our presentation
Horizontal Relationships or Vertical Relationships
Section
● Introduction / Exec. Summary
● Main Story
● Next Steps
● Appendices