Descriptive Statistics

NANI KURNIATI, PhD

DEPARTMENT OF INDUSTRIAL AND SYSTEMS ENGINEERING


INSTITUT TEKNOLOGI SEPULUH NOPEMBER (ITS)

Industrial and Systems Engineering ITS


TOPICS

• Graphical methods for qualitative data
• Graphical methods for quantitative data
• Numerical methods for qualitative data
• Numerical methods for quantitative data, both ungrouped and grouped
• Measures of dispersion / variability
• Measures of relative position


OBJECTIVES

1. How to present data in more useful ways
2. How to identify patterns in the data
3. How to summarize the basic shape of the data

How a data set is described depends heavily on whether the data are quantitative or qualitative.


DESCRIPTIVE STATISTICS METHODS

Data can be described statistically in two ways:
- Graphical methods
- Numerical methods




Graphical Methods

Data Presentation – Graphical Methods

Data Presentation
- Qualitative Data
  - Summary Table
  - Bar Chart, Pie Chart, Dot Chart
- Numerical Data
  - Stem-&-Leaf Display
  - Frequency Distribution
  - Histogram


Qualitative Data - Summary Table

1. Lists categories and the number of elements in each category
2. Obtained by tallying responses in each category
3. May show frequencies (counts), percentages, or both

Each row is a category; the count is the result of the tally.

Major        Count
Accounting   130
Economics    20
Management   50
Total        200
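
As a small sketch of point 3, the counts in the summary table can be turned into percentages by dividing each by the total. The function name `relative_frequencies` is illustrative and not part of the slides.

```python
# Counts taken from the summary table of majors.
counts = {"Accounting": 130, "Economics": 20, "Management": 50}

def relative_frequencies(counts):
    """Return each category's share of the total, in percent."""
    total = sum(counts.values())
    return {category: 100 * n / total for category, n in counts.items()}

percents = relative_frequencies(counts)
for category, n in counts.items():
    # One row per category: name, count, percent.
    print(f"{category:<12}{n:>5}{percents[category]:>7.1f}%")
```

The resulting shares (65%, 10%, 25%) are the same percentages used in the pie chart of majors.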
Qualitative Data - Bar Chart

- Horizontal bars for categorical variables (here the majors Acct., Econ., Mgmt.)
- Bar length shows frequency or %
- Equal bar widths
- Gap between bars of 1/2 to 1 bar width
- Axis starts at a zero point (ticks at 0, 50, 100, 150)
- The axis shows frequency; percent is also used

Qualitative Data - Pie Chart

1. Shows breakdown of total quantity into categories
2. Useful for showing relative differences
3. Angle size = (360°)(Percent)

Majors: Acct. 65%, Mgmt. 25%, Econ. 10%
Example (Econ.): (360°)(10%) = 36°
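
The angle rule, angle = (360°)(percent), can be checked for all three majors with a few lines of Python. This is only an illustrative sketch; `pie_angles` is a made-up helper name.

```python
def pie_angles(percents):
    """Map each category's percentage to a central angle in degrees."""
    return {category: 360 * p / 100 for category, p in percents.items()}

# Percentages as shown on the pie chart of majors.
majors = {"Acct.": 65, "Mgmt.": 25, "Econ.": 10}
angles = pie_angles(majors)
print(angles)
```

Because the percentages add up to 100, the angles add up to a full circle of 360°.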


Qualitative Data - Dot Chart

- Like a horizontal bar chart
- Horizontal lines for categorical variables (here the majors Acct., Econ., Mgmt.)
- Line length shows frequency or %
- Equal spacing between lines
- Axis starts at a zero point (ticks at 0, 50, 100, 150)
- The axis shows frequency; percent is also used

Example
You're an analyst for IRI. You want to show the market shares held by Windows program manufacturers in 1992. Construct a bar chart, pie chart, and dot chart to describe the data.

Mfg.          Mkt. Share (%)
Lotus         15
Microsoft     60
WordPerfect   10
Others        15


Chart Solution*

[Figure: bar chart, pie chart, and dot chart of market share (%) by manufacturer: Lotus 15%, Microsoft 60%, WordPerfect 10%, Others 15%. The bar and dot charts use a horizontal axis from 0% to 60%.]

Quantitative Data - Stem-and-Leaf Display

1. Divide each observation into a stem value and a leaf value
   – Stem value defines the class
   – Leaf value defines the frequency (count)
   Example: the observation 26 has stem 2 and leaf 6.
2. Data: 21, 24, 24, 26, 27, 27, 30, 32, 38, 41

Stem | Leaf
2    | 144677
3    | 028
4    | 1
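
A minimal sketch of how the stem-and-leaf display for the ten observations could be built, assuming two-digit values so that the tens digit is the stem and the units digit is the leaf. The helper `stem_and_leaf` is illustrative only.

```python
from collections import defaultdict

def stem_and_leaf(values):
    """Group values by tens digit (stem); the units digits are the leaves."""
    stems = defaultdict(list)
    for v in sorted(values):
        stem, leaf = divmod(v, 10)
        stems[stem].append(leaf)
    # Join the leaves of each stem into one string, e.g. 2 -> "144677".
    return {stem: "".join(str(leaf) for leaf in leaves)
            for stem, leaves in stems.items()}

data = [21, 24, 24, 26, 27, 27, 30, 32, 38, 41]
display = stem_and_leaf(data)
for stem, leaves in display.items():
    print(f"{stem} | {leaves}")
```

The number of leaves on each stem is the frequency of that class.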
Quantitative Data - Frequency Distributions

• What is a frequency distribution?
  A frequency distribution is a list or table containing the values of a variable (or a set of ranges within which the data fall) and the corresponding frequencies with which each value occurs (or with which the data fall within each range).
• A frequency distribution is a way to summarize data.
• The distribution condenses the raw data into a more useful form and allows for a quick visual interpretation of the data.


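Class

Rule of thumb for the number of classes:
Fewer than 25 observations: 5 – 6 classes
25 – 50 observations: 7 – 14 classes
More than 50 observations: 15 – 20 classes

Cramer's rule (this formula is commonly known as Sturges' rule): $k = 1 + 3.3 \log_{10} N$

Class          Frequency
15 but < 25    3
25 but < 35    5
35 but < 45    2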
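
The sketch below applies the formula k = 1 + 3.3 log N and rebuilds the class frequencies. It assumes the logarithm is base 10 and that the table was built from the ten stem-and-leaf observations (21, 24, ..., 41), which do reproduce the counts 3, 5, 2. Function names are illustrative.

```python
import math

def suggested_class_count(n):
    """Class count from the rule k = 1 + 3.3 log10(N)."""
    return 1 + 3.3 * math.log10(n)

def frequency_distribution(values, lower, width, n_classes):
    """Count values in classes [lower + j*width, lower + (j+1)*width)."""
    counts = [0] * n_classes
    for v in values:
        j = int((v - lower) // width)
        if 0 <= j < n_classes:  # values outside all classes are ignored
            counts[j] += 1
    return counts

# Assumption: the same ten observations as in the stem-and-leaf display.
data = [21, 24, 24, 26, 27, 27, 30, 32, 38, 41]
k = suggested_class_count(len(data))
freq = frequency_distribution(data, lower=15, width=10, n_classes=3)
print(f"suggested k = {k:.1f}, class frequencies = {freq}")
```

The rule only suggests a class count; for so few observations the slide's three classes of width 10 are a convenient choice rather than a computed one.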
Quantitative Data - Histogram

Class          Freq.
15 but < 25    3
25 but < 35    5
35 but < 45    2

[Figure: histogram of the classes above. Vertical axis: count (frequency, relative frequency, or percent), 0 to 5. Horizontal axis: lower class boundaries 0, 15, 25, 35, 45, 55. Bars touch.]
Numerical Methods

Numerical Data Properties for Ungrouped Data


Numerical Data Properties

Central Tendency
(Location)

Variation
(Dispersion)

Shape


Numerical Data Properties & Measures

- Central Tendency: Mean, Median, Mode
- Variation: Range, Interquartile Range, Variance, Standard Deviation
- Shape: Skew


Mode Example

• No Mode
  Raw Data: 10.3 4.9 8.9 11.7 6.3 7.7
• One Mode
  Raw Data: 6.3 4.9 8.9 6.3 4.9 4.9
• More Than 1 Mode
  Raw Data: 21 28 28 41 43 43


Thinking Challenge
• You're a financial analyst for Prudential-Bache Securities. You have collected the following closing stock prices of new stock issues: 17, 16, 21, 18, 13, 16, 12, 11.
• Describe the stock prices in terms of central tendency.
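
One way to work this challenge is with Python's standard `statistics` module. This is only a sketch of the three central tendency measures, not the slide's official solution.

```python
import statistics

# Closing stock prices from the Thinking Challenge.
prices = [17, 16, 21, 18, 13, 16, 12, 11]

mean = statistics.mean(prices)        # balance point: sum / n
median = statistics.median(prices)    # middle value of the ordered data
modes = statistics.multimode(prices)  # all most-frequent values

print(f"mean = {mean}, median = {median}, mode(s) = {modes}")
```

With n = 8 (even), the median is the average of the 4th and 5th ordered values, which are both 16.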
Summary of Central Tendency Measures

Measure   Equation                    Description
Mean      $\sum X_i / n$              Balance point
Median    $(n+1)/2$ position          Middle value when ordered
Mode      none                        Most frequent
Range

1. Measure of dispersion
2. Difference between largest and smallest observations

$$\text{Range} = X_{\text{largest}} - X_{\text{smallest}}$$


Disadvantages of the Range

• Ignores the way in which data are distributed

[Figure: two dot plots on the axis 7 to 12 with differently distributed points; in both, Range = 12 - 7 = 5]

• Sensitive to outliers

1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,5
Range = 5 - 1 = 4

1,1,1,1,1,1,1,1,1,1,1,2,2,2,2,2,2,2,2,3,3,3,3,4,120
Range = 120 - 1 = 119
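
The sensitivity to outliers can be reproduced directly. In the sketch below the two lists differ only in their last value; `data_range` is an illustrative helper name.

```python
def data_range(values):
    """Range = largest observation minus smallest observation."""
    return max(values) - min(values)

# Eleven 1s, eight 2s, four 3s and one 4, as on the slide.
base = [1] * 11 + [2] * 8 + [3] * 4 + [4]
without_outlier = base + [5]
with_outlier = base + [120]

print(data_range(without_outlier))  # last value is 5
print(data_range(with_outlier))     # last value is 120
```

Changing one observation out of 25 moves the range from 4 to 119, while the other 24 values are identical.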




Variance & Standard Deviation

1. Measures of dispersion
2. Most common measures
3. Consider how data are distributed
4. Show variation about the mean ($\bar{X}$ or $\mu$)

[Figure: dot plot on the axis 4 to 12 with $\bar{X} = 8.3$ marked]
Sample Variance Formula

$$S^2 = \frac{\sum_{i=1}^{n} (X_i - \bar{X})^2}{n - 1} = \frac{(X_1 - \bar{X})^2 + (X_2 - \bar{X})^2 + \cdots + (X_n - \bar{X})^2}{n - 1}$$

Note: n - 1 in the denominator! (Use N for the population variance.)



Sample Standard Deviation Formula

$$S = \sqrt{S^2} = \sqrt{\frac{\sum_{i=1}^{n} (X_i - \bar{X})^2}{n - 1}} = \sqrt{\frac{(X_1 - \bar{X})^2 + (X_2 - \bar{X})^2 + \cdots + (X_n - \bar{X})^2}{n - 1}}$$
Variance Example

• Raw Data: 10.3 4.9 8.9 11.7 6.3 7.7

$$S^2 = \frac{\sum_{i=1}^{n} (X_i - \bar{X})^2}{n - 1} \quad \text{where} \quad \bar{X} = \frac{\sum_{i=1}^{n} X_i}{n} = 8.3$$

$$S^2 = \frac{(10.3 - 8.3)^2 + (4.9 - 8.3)^2 + \cdots + (7.7 - 8.3)^2}{6 - 1} = 6.368$$
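
The variance example can be verified numerically. The sketch below implements the sample variance formula directly (n - 1 in the denominator) and takes the square root for the sample standard deviation; the function name is illustrative.

```python
import math

def sample_variance(values):
    """Sum of squared deviations from the mean, divided by n - 1."""
    n = len(values)
    mean = sum(values) / n
    return sum((x - mean) ** 2 for x in values) / (n - 1)

data = [10.3, 4.9, 8.9, 11.7, 6.3, 7.7]
mean = sum(data) / len(data)
s2 = sample_variance(data)
s = math.sqrt(s2)  # sample standard deviation

print(f"mean = {mean:.1f}, S^2 = {s2:.3f}, S = {s:.3f}")
```

The standard library functions `statistics.variance` and `statistics.stdev` use the same n - 1 denominator and should agree with these values.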
Comparing Standard Deviations

[Figure: three dot plots on the axis 11 to 21, each with mean = 15.5 but a different spread]

Data A: Mean = 15.5, s = 3.338
Data B: Mean = 15.5, s = 0.9258
Data C: Mean = 15.5, s = 4.57


Summary of Variation Measures

Measure                           Equation                                         Description
Range                             $X_{\text{largest}} - X_{\text{smallest}}$       Total spread
Interquartile Range               $Q_3 - Q_1$                                      Spread of middle 50%
Standard Deviation (Sample)       $\sqrt{\sum (X_i - \bar{X})^2 / (n - 1)}$        Dispersion about sample mean
Standard Deviation (Population)   $\sqrt{\sum (X_i - \mu)^2 / N}$                  Dispersion about population mean
Variance (Sample)                 $\sum (X_i - \bar{X})^2 / (n - 1)$               Squared dispersion about sample mean

The Empirical Rule

If the data distribution is bell-shaped, then the interval:
• μ ± 1σ contains about 68% of the values in the population or the sample
• μ ± 2σ contains about 95% of the values in the population or the sample
• μ ± 3σ contains about 99.7% of the values in the population or the sample
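
The 68%, 95%, and 99.7% figures are rounded coverages of a normal distribution. As a sketch, assuming the data are exactly normal (real bell-shaped data will only match approximately), the coverage of μ ± kσ equals erf(k/√2):

```python
import math

def normal_coverage(k):
    """P(|Z| <= k) for a standard normal variable Z."""
    return math.erf(k / math.sqrt(2))

coverage = {k: normal_coverage(k) for k in (1, 2, 3)}
for k, p in coverage.items():
    print(f"mu ± {k} sigma: {100 * p:.1f}%")
```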


Shape

1. Describes how data are distributed
2. Measures of shape
   – Skew = Symmetry

Left-Skewed: Mean < Median < Mode
Symmetric: Mean = Median = Mode
Right-Skewed: Mode < Median < Mean

Quartiles

1. Measure of noncentral tendency
2. Split ordered data into 4 quarters (25% each), separated by Q1, Q2, Q3
3. Position of the i-th quartile:

$$\text{Positioning point of } Q_i = \frac{i\,(n + 1)}{4}$$
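
A short sketch of the positioning-point formula i(n + 1)/4. The sample sizes 11 and 8 are chosen here only for illustration; the slides do not say how to handle a fractional position, so the code just reports it.

```python
def quartile_position(i, n):
    """Position of the i-th quartile in an ordered array of n values."""
    if i not in (1, 2, 3):
        raise ValueError("i must be 1, 2, or 3")
    return i * (n + 1) / 4

# Illustrative sample sizes (assumptions, not from the slides).
positions = [quartile_position(i, 11) for i in (1, 2, 3)]
print(positions)                                        # whole-number positions
print([quartile_position(i, 8) for i in (1, 2, 3)])     # fractional positions
```

When the position is fractional, a rounding or interpolation convention is needed; Python's `statistics.quantiles(..., method="exclusive")` is one option that uses the same (n + 1) positioning with linear interpolation.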
Percentiles

• The p-th percentile in an ordered array of n values is the value in the i-th position, where

$$i = \frac{p}{100}(n + 1)$$

• Example: The 60th percentile in an ordered array of 19 values is the value in the 12th position:

$$i = \frac{p}{100}(n + 1) = \frac{60}{100}(19 + 1) = 12$$
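
The percentile example can be sketched as follows. The 19 values are hypothetical placeholders (101 to 119), and `percentile_value` deliberately handles only whole-number positions, since the slide does not cover fractional ones.

```python
def percentile_position(p, n):
    """Position i = p (n + 1) / 100 of the p-th percentile among n ordered values."""
    return p * (n + 1) / 100

def percentile_value(ordered, p):
    """Value at the p-th percentile; only whole-number positions are supported."""
    i = percentile_position(p, len(ordered))
    if i != int(i) or not 1 <= i <= len(ordered):
        raise ValueError("position is fractional or out of range")
    return ordered[int(i) - 1]  # positions are 1-based, list indices 0-based

# Hypothetical ordered array of 19 values.
ordered = list(range(101, 120))
print(percentile_position(60, len(ordered)))
print(percentile_value(ordered, 60))
```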
Interquartile Range

1. Measure of dispersion
2. Also called midspread
3. Difference between third and first quartiles:
   Interquartile Range = Q3 − Q1
4. Spread in the middle 50%
5. Not affected by extreme values


Interquartile Range

Example:

X minimum   Q1   Median (Q2)   Q3   X maximum
12          30   45            57   70

Each of the four intervals contains 25% of the data.

Interquartile range = 57 – 30 = 27


Box Plot

1. Graphical display of data using the 5-number summary:

Xsmallest   Q1   Median   Q3   Xlargest
4           6    8        10   12


Shape of Box and Whisker Plots

• The box and central line are centered between the endpoints if the data are symmetric around the median
• A box and whisker plot can be shown in either vertical or horizontal format


Distribution Shape and Box and Whisker Plot

[Figure: three box and whisker plots, each marked Q1, Q2, Q3, for a left-skewed, a symmetric, and a right-skewed distribution]


Box-and-Whisker Plot Example

• Below is a box-and-whisker plot for the following data:

0  2  2  2  3  3  4  5  5  10  27

Min = 0, Q1 = 2, Q2 = 3, Q3 = 5, Max = 27

[Figure: box-and-whisker plot with marks at 0, 2, 3, 5, and 27]

• These data are very right-skewed, as the plot depicts
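
A sketch of the 5-number summary for this example. The data list is read from the slide as 0, 2, 2, 2, 3, 3, 4, 5, 5, 10, 27, which is an assumption because the extracted text is garbled. `statistics.quantiles` with `method="exclusive"` places quartiles at position i(n + 1)/4, matching the positioning rule used in these slides.

```python
import statistics

# Assumed reading of the slide's data (11 ordered values).
data = [0, 2, 2, 2, 3, 3, 4, 5, 5, 10, 27]

q1, q2, q3 = statistics.quantiles(data, n=4, method="exclusive")
five_number = (min(data), q1, q2, q3, max(data))
print(five_number)

# Whisker lengths give a rough picture of the skew.
lower_whisker = q1 - min(data)
upper_whisker = max(data) - q3
print(lower_whisker, upper_whisker)
```

The upper whisker is far longer than the lower one, which is what a right-skewed box plot looks like.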


Methods for detecting outliers

Outlier: an observation y that is unusually large or small relative to the other values in a data set.
Outliers typically are attributable to one of the following causes:
• The measurement is observed, recorded, or entered into the computer incorrectly
• The measurement comes from a different population
• The measurement is correct, but represents a rare event


Rule of thumb for detecting outliers

• Z-scores
  Observations with z-scores greater than 3 in absolute value are considered outliers, where

  $$z = \frac{y - \bar{y}}{s}$$

• Box plot
  Observations falling between the inner and outer fences are deemed suspect outliers.
  Observations falling beyond the outer fences are deemed highly suspect outliers.
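
Both rules of thumb can be sketched in Python. The slides do not define the fences, so the code assumes the common convention of inner fences at 1.5 × IQR and outer fences at 3 × IQR beyond the quartiles. The data set is an illustrative one with a single large value, and the function names are made up for this example.

```python
import statistics

def z_scores(values):
    """z = (y - mean) / s, using the sample standard deviation."""
    mean = statistics.mean(values)
    s = statistics.stdev(values)
    return [(y - mean) / s for y in values]

def fence_classification(values, inner=1.5, outer=3.0):
    """Split values into suspect / highly suspect outliers using box plot fences.

    Assumption: fences lie inner*IQR and outer*IQR beyond Q1 and Q3.
    """
    q1, _, q3 = statistics.quantiles(values, n=4, method="exclusive")
    iqr = q3 - q1
    suspect, highly_suspect = [], []
    for y in values:
        if y < q1 - outer * iqr or y > q3 + outer * iqr:
            highly_suspect.append(y)
        elif y < q1 - inner * iqr or y > q3 + inner * iqr:
            suspect.append(y)
    return suspect, highly_suspect

# Illustrative data with one very large observation.
data = [0, 2, 2, 2, 3, 3, 4, 5, 5, 10, 27]
z_flagged = [y for y, z in zip(data, z_scores(data)) if abs(z) > 3]
suspect, highly_suspect = fence_classification(data)

print("z-score rule:", z_flagged)
print("suspect:", suspect, "highly suspect:", highly_suspect)
```

On this data the z-score rule flags nothing, because the extreme value inflates the standard deviation it is measured against, while the box plot rule marks 10 as suspect and 27 as highly suspect.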
Errors in Presenting Data
1. Using ‘Chart Junk’
2. No Relative Basis in Comparing Data Batches
3. Compressing the Vertical Axis
4. No Zero Point on the Vertical Axis
‘Chart Junk’

Bad Presentation: a decorated picture titled "Minimum Wage" listing 1960: $1.00, 1970: $1.60, 1980: $3.10, 1990: $3.80
Good Presentation: a plain chart titled "Minimum Wage" with a $ axis (0, 2, 4) against the years 1960, 1970, 1980, 1990


No Relative Basis

Bad Presentation: "A's by Class" plotted as frequencies (axis 0 to 300) for FR, SO, JR, SR
Good Presentation: "A's by Class" plotted as percentages (axis 0% to 30%) for FR, SO, JR, SR


Compressing Vertical Axis

Bad Presentation: "Quarterly Sales" with a $ axis running from 0 to 200 for Q1 to Q4
Good Presentation: "Quarterly Sales" with a $ axis running from 0 to 50 for Q1 to Q4

No Zero Point on Vertical Axis

Bad Presentation: "Monthly Sales" with a $ axis running from 36 to 45 (months J, M, M, J, S, N)
Good Presentation: "Monthly Sales" with a $ axis running from 0 to 60 (months J, M, M, J, S, N)

How To Lie With Statistics

Find what is funny in these jokes.


Case 1: Phantasmo Stock Price

Looks can be misleading

Watch Out with Scales

Case 2

Case 3

Check this advertising (Case 4)


What do they mean?

Always ask for the definition of the measures behind any statistics somebody gives you.

Check the claim "wrinkle reduction up to 61%". What does that mean?
- Always ask for the source of the population
- A maximum doesn't say much


Case 5: Tungu & Bulugu Island

Misleading Averages

Waldner is an outlier

PRECISION PROBLEM

Case 6: Contradictory