Interdisciplinary Statistical
Statistical Structures in Data, PGDBA Programme, ISI, 2022 October 12, 2022
Example: Insect Data 3
Central Tendency 5
Mode: The most Frequent Observation 6
Mode from Grouped Data (contd.) 8
• Let
• $f_M$ denote the frequency of the class interval with the highest frequency
(the modal class interval)
• $L$, $H$ denote the lower and upper boundaries of the modal class interval
• $f_L$ and $f_H$ denote the frequencies of the class
intervals just before and just after the modal class interval
• Then the estimated mode is
$$M = L + (H - L)\,\frac{f_M - f_L}{2 f_M - f_L - f_H}$$
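A minimal Python sketch of the grouped-data mode estimate, assuming the usual interpolation M = L + (H − L)(f_M − f_L) / (2 f_M − f_L − f_H). The function name `grouped_mode` and the frequency values are hypothetical and are not taken from the slides.

```python
def grouped_mode(lower, upper, f_modal, f_before, f_after):
    """Estimate the mode from the modal class of a frequency table.

    lower, upper : boundaries L and H of the modal class interval
    f_modal      : frequency f_M of the modal class
    f_before     : frequency f_L of the class just before it
    f_after      : frequency f_H of the class just after it
    """
    denominator = 2 * f_modal - f_before - f_after
    if denominator <= 0:
        # Happens only when the modal class does not exceed its neighbours.
        raise ValueError("modal class must exceed at least one neighbouring frequency")
    return lower + (upper - lower) * (f_modal - f_before) / denominator


# Hypothetical table: modal class 20-30 with frequency 12,
# neighbouring classes with frequencies 8 (before) and 6 (after).
mode_estimate = grouped_mode(lower=20, upper=30, f_modal=12, f_before=8, f_after=6)
print(mode_estimate)
```

The estimate is pulled toward the neighbouring class with the larger frequency, which is why it lands below the midpoint of the modal class in this hypothetical table.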
Mode: Illustration 9
MODE
Multimodal data 10
Median: The Middle-most Observation 11
[Figure: observations ordered from Smallest to Largest, with the MEDIAN marked at the middle]
Median: Example with an Odd Number of Observations 12
Example: Insect Data 14
MEDIAN=40 MEDIAN=16.5
• Let
• $f$ denote the frequency of the class interval that contains the $n/2$-th
ordered observation (the median class interval)
• $L$, $H$ denote the lower and upper boundaries of the median class interval
• $f_L$ denote the cumulative frequency up to the lower boundary of the median class interval
• Then the estimated median is
$$M_e = L + (H - L)\,\frac{\frac{n}{2} - f_L}{f}$$
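The grouped-data median estimate can be sketched in Python as below, assuming the interpolation M_e = L + (H − L)(n/2 − f_L) / f, where f_L counts the observations below the median class. The function name `grouped_median` and the numbers in the example are hypothetical, not from the slides.

```python
def grouped_median(lower, upper, n, cum_below, f_median):
    """Estimate the median from the median class of a frequency table.

    lower, upper : boundaries L and H of the median class interval
    n            : total number of observations
    cum_below    : cumulative frequency f_L of all classes below the median class
    f_median     : frequency f of the median class itself
    """
    if f_median <= 0:
        raise ValueError("median class must contain at least one observation")
    return lower + (upper - lower) * (n / 2 - cum_below) / f_median


# Hypothetical table: n = 50 observations, median class 30-40 with
# frequency 20, and 18 observations in the classes below 30.
median_estimate = grouped_median(lower=30, upper=40, n=50, cum_below=18, f_median=20)
print(median_estimate)
```

The formula assumes observations are spread evenly within the median class, so the estimate moves linearly from L to H as n/2 moves through that class.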
Arithmetic Mean (or, simply, Mean) 18
[Figure: observations ordered from Smallest to Largest, divided into four parts by the quartiles Q1, Q2, Q3]
Quartiles 23
Q1 Q2 Q3
Example: Insect Data 24
With Insecticide A: Q1 = 39, Q2 = 40, Q3 = 42
Generalization: Quantiles 25
Computation of Quantiles: Ungrouped Data 29
Computation of Quantiles (contd.) 31
Measures of Dispersion or Spread 32
Dispersion 33
Range = Largest − Smallest = 48 − 35 = 13
Interquartile Range 36
Mean Absolute Deviation (MAD) 37
Variance and Standard Deviation 39
• Variance ($s^2$): Average of the SQUARED deviations from the arithmetic mean

   x     x − x̄    (x − x̄)²
   5      −8        64
   9      −4        16
  16       3         9
  17       4        16
  18       5        25
 Sum: 65           130

$\bar{x} = \frac{\sum x}{n} = \frac{65}{5} = 13$
$s^2 = \frac{\sum (x - \bar{x})^2}{n} = \frac{130}{5} = 26$
$s = \sqrt{26} \approx 5.1$
• Standard deviation ($s$): positive square root of the variance
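The worked example on this slide can be reproduced with a few lines of Python. This is a sketch that follows the slide's convention of dividing by n; the variable names are illustrative.

```python
import math

# Data from the slide's worked example.
data = [5, 9, 16, 17, 18]
n = len(data)

mean = sum(data) / n                                  # 65 / 5
squared_deviations = [(x - mean) ** 2 for x in data]  # 64, 16, 9, 16, 25
variance = sum(squared_deviations) / n                # 130 / 5
std_dev = math.sqrt(variance)                         # positive square root

print(mean, variance, round(std_dev, 1))  # prints: 13.0 26.0 5.1
```

Dividing by n − 1 instead of n, as is common for sample variance, would give 130 / 4 = 32.5 for the same data.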
Coefficient of Variation 40
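A small Python sketch, assuming the usual definition of the coefficient of variation as the standard deviation expressed as a percentage of the mean (CV = s / x̄ × 100). The data are the five values from the variance example in this deck; the function name is hypothetical.

```python
import math


def coefficient_of_variation(values):
    """Return the standard deviation as a percentage of the mean (divisor n)."""
    n = len(values)
    mean = sum(values) / n
    if mean == 0:
        raise ValueError("coefficient of variation is undefined when the mean is zero")
    variance = sum((x - mean) ** 2 for x in values) / n
    return 100 * math.sqrt(variance) / mean


cv = coefficient_of_variation([5, 9, 16, 17, 18])
print(round(cv, 1))  # about 39.2 percent
```

Because it is unit-free, this ratio allows the spread of datasets measured on different scales to be compared.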
Measures of Shape 41
• Skewness
• Absence of symmetry
• Majority of extreme values to one side of a
distribution
• Kurtosis
• Peakedness/flatness of a distribution
Shape Descriptors 42
Skewness 43
Skewness 44
Measures of Skewness based on Quartiles 45
Pearson’s Skewness Coefficients 46
• Pearson’s skewness coefficient (of the first type): $S_1 = \frac{\text{Mean} - \text{Mode}}{\text{standard deviation}}$
• Pearson’s skewness coefficient (of the second type): $S_2 = \frac{3(\text{Mean} - \text{Median})}{\text{standard deviation}}$
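Both Pearson coefficients can be sketched in Python as follows, taking S1 = (Mean − Mode) / standard deviation and S2 = 3 (Mean − Median) / standard deviation. The function names are hypothetical. The data reuse the five values from the variance example in this deck; since no value repeats there, only the second coefficient is evaluated.

```python
import statistics


def pearson_first(mean, mode, std_dev):
    """Pearson's first skewness coefficient: (mean - mode) / standard deviation."""
    return (mean - mode) / std_dev


def pearson_second(mean, median, std_dev):
    """Pearson's second skewness coefficient: 3 * (mean - median) / standard deviation."""
    return 3 * (mean - median) / std_dev


data = [5, 9, 16, 17, 18]
mean = statistics.fmean(data)      # 13.0
median = statistics.median(data)   # 16
std_dev = statistics.pstdev(data)  # population SD (divisor n), sqrt(26)

s2 = pearson_second(mean, median, std_dev)
print(round(s2, 3))
```

A negative value, as here, indicates that the mean lies below the median, which is the pattern of a left-skewed dataset.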
Kurtosis 47
Peakedness of a distribution
• Leptokurtic
• high and thin
• Mesokurtic
• normal in shape
• Platykurtic
• flat and spread out
Box and Whisker Plots (Box Plots) 48
Box Plots 49
Box Plots (contd.) 50
• Outer Fences
• Lower outer fence = Q1 - 3.0 IQR
• Upper outer fence = Q3 + 3.0 IQR
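The outer fences can be computed directly from the quartiles. A minimal Python sketch, using the Insecticide A quartiles quoted earlier in this deck (Q1 = 39, Q3 = 42); the function names are hypothetical.

```python
def outer_fences(q1, q3):
    """Return (lower, upper) outer fences: Q1 - 3.0 * IQR and Q3 + 3.0 * IQR."""
    iqr = q3 - q1
    return q1 - 3.0 * iqr, q3 + 3.0 * iqr


def beyond_outer_fences(values, q1, q3):
    """List the values lying strictly outside the outer fences."""
    lower, upper = outer_fences(q1, q3)
    return [x for x in values if x < lower or x > upper]


lower, upper = outer_fences(q1=39, q3=42)
print(lower, upper)  # prints: 30.0 51.0
```

With IQR = 42 − 39 = 3, any observation below 30 or above 51 would fall outside the outer fences for this dataset.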
Box and Whisker Plot 51
Minimum Q1 Q2 Q3 Maximum
Skewness and Box Plots 52
S<0 S=0 S>0
[Figure: anatomy of a box plot. The box spans the lower quartile to the upper quartile (the inter-quartile range, IQR) with the median marked inside; a whisker extends from each end of the box.]
Comparing Different Datasets with Boxplots 54
[Figure: box plots of three datasets (1, 2, 3) drawn side by side on a common vertical axis]
Whiskers: Variations 55
Detection of Outliers with Fences 57
Motivation for the fences 58
Variations of the Box Plot 59
Variations of the Box Plot 60
Example: Insect data 62
With Insecticide A
With Insecticide B