You are on page 1of 1

Histogram and frequency polygon

Bar chart

Tree map

Word cloud Visualizing data

Line chart

Scatter Plot

Heat Map

parameter vs sample statistic Definition of population and sample

Discrete data
Trimmed mean Numerical 
affected by outliers Arithmetic mean data 
Based on  (Quantitative  Continuous 
Winsorized mean
statistical  data) data
perspective
Weighted mean Mean
Nominal 
Measures of central  Categorical  data
Geometric mean tendency data 
A>G>H (Qualitative 
data) Ordinal data
Harmonic mean

Time series data


Median
Based on how 
Data 
they are  Cross sectional data
Mode classification
collected

Panel data
Quantile

Based on their  Structure data


Quartile
organized 
Organizing data form Unstructured data
Quintile Measure of location
Types
Decile • One dimensional array
Process • Two dimensional array
R2: Organizing, visualizing 
Fomula Percentile and describing data
Raw data Formatted data Quantitative analysis

Range How data are 


organized

Mean absolute deviation (MAD)

Variance Absolute dispersion One dimensional 


Describing data
Measures of dispersion
array
Upside risk
Standard deviation
Target downside deviation Downside risk

Coefficient of variation (CV) Relative dispersion


CV = Std / Mean
mean = mode = median
Two dimensional 
array (table)
skewness = 0 Normal distribution

kurtosis = 3

many outliers in upper tail Measure of skewness


Absolute frequency Cumulative absolute frequency
Positively skewed (skewness > 0)
Types of frequency
long upper tail
Relative frequency Cumulative relative frequency
Types of skewed
many outliers in lower tail
Frequency  Constructing a 
Negatively skewed (skewness < 0)
Summarizing  distribution frequency 
long lower tail distribution
data

less peaked, thinner tail Platykurtic (kurtosis < 3)


a tabular format that displays the frequency distributions 
p l normal distribution Mesokurtic (kurtosis = 3) Measure of kurtosis Contingency  of two or more categorical variables simultaneously
table

Application: Confusion matrix for evaluating the performance 


more peaked, fatter tail Leptokurtic (kurtosis > 3) of a classification of model

no linear relationship Correlation = 0

perfect positive linear relationship Correlation = 1 Correlation coefficient

perfect negative linear relationship Correlation = -1

You might also like