Professional Documents
Culture Documents
PROPRIETARY
Descriptive Statistics
Using
Usingscientific
scientificmethods
methods
to
tocollect,
collect,organize,
organize,
summarize,
summarize,andandpresent
present
data.
data.
Examples:
• All of the Ford seatbelts ever made on A-line
• All purchase orders dated 2003
• All parts made on machine D since the last
improvement was implemented
• The height of every person in the US
Examples:
• 100 of the Ford seatbelts from A-line
• 20 purchase orders from every month of 2003
• The first 60 parts from machine D since the last
improvement was implemented
• The heights of 3,000 people from all over the US
Central tendency
• Where is the middle?
Dispersion
• How spread out is the data ?
Mean
• The sum of all the data
divided by the number of
data
Median
• The numeric middle
Mode
• The most frequently
occurring value
average = Summation of
Case 1 Case 2
Worker 1 $30,000 Worker 1 $30,000
Worker 2 $35,000 Worker 2 $35,000
Worker 3 $40,000 Worker 3 $40,000
Worker 4 $45,000 Worker 4 $45,000
Worker 5 $50,000 US President $400,000
Find
Findthe
themean
meanand
andthe
themedian
medianfor forboth
bothcases.
cases.
Sometimes
Sometimesthe
themedian
medianisisaabetter
betterindicator
indicatorofofcentral
central
tendency
tendency(e.g.
(e.g.when
whenoutliers
outliersexist
existininthe
thedata).
data).
Range
• The largest number minus
the smallest
Variance
• A measure of the difference
between the points and the
mean
Standard Deviation
• The square root of the
variance
• A measure of the
difference between the 22
ss
X
X
X
X
ii 22
n = _8__
X = _10_
Sum of
X = 10 Squares= _28_
2
s = _4__
s = _2__
Instructions
Total
Divide
Take
Count
Square
Subtract
Plot
Divide
Addthe each
the
upeach
the
square
all
the
total
number
point
sum
of
term
mean
of
the
root
on
of
the
of
in
Xi 8 13 7 10 12 11 10 9 80
of
squared
points
samples
the
from
the
squares
the
second
variance
by
each
graph
terms
and
thebypoint
row
number
add
with
n-1
toand
find
to
the
Xi-X -2 3 -3 0 2 1 0 -1 of
write
(degrees
the
respect
samples
find
samples
sum
the
that
it’s
standard
to
of
of
together
value
deviation
the
to
squares
freedom)
find
mean
in the
to
(Xi-X)2 4 9 9 0 4 1 0 1 28 to from
find (X
{find
deviation
third
mean
the
the
ai-X)
row 2
total
variance
mean }
Mean,
Mean, median,
median, range,
range, standard
standard
deviation,
deviation, and
and variance
variance apply
apply
regardless
regardless ofof how
how the
the data
data are
are
distributed.
distributed.
0.5 0.6 0.7 0.8 0.9 1.0 1.1 1.2 1.3 1.4 1.5
cm
0.5 0.6 0.7 0.8 0.9 1.0 1.1 1.2 1.3 1.4 1.5
cm
Size of marble
0.5 0.6 0.7 0.8 0.9 1.0 1.1 1.2 1.3 1.4 1.5
cm
Size of marble
0.5 0.6 0.7 0.8 0.9 1.0 1.1 1.2 1.3 1.4 1.5
cm
Size of marble
0.5 0.6 0.7 0.8 0.9 1.0 1.1 1.2 1.3 1.4 1.5
cm
Size of marble
PROPRIETARY
Pareto Chart
Pareto Chart of defect
250 100
200 80
150 60
Percent
Count
100 40
50 20
0 0
defect scratch chip dent bubble Other
Count 125 65 32 15 8
Percent 51.0 26.5 13.1 6.1 3.3
Cum % 51.0 77.6 90.6 96.7 100.0
100
Percent
Count
that the category with 100 40
cumulative frequency
A
A pareto
pareto chart
chart is
is used
used to
to
prioritize
prioritize resources
resources to to where
where
they
they will
will have
have the
the most
most impact
impact
60
50
travel time
40
30
20
10
1 10 20 30 40 50 60 70 80 90 100
Index
points in consecutive 60
order 50
travel time
40
context of time 20
10
1 10 20 30 40 50 60 70 80 90 100
50 Limits that
Individual Value
indicate
40
_ unusual
X=34.60
30 outcomes
20
10
LCL=4.85
0
1 1
Mean 1 10 20 30 40 50 60 70 80 90 100
Observation
Individual Value
40
of the data 30
_
X=34.60
limits 10
LCL=4.85
• They are calculated from the 0
1 1
data itself 1 10 20 30 40 50 60
Observation
70 80 90 100
20
15
Frequency
10
0
0 10 20 30 40 50 60 70
travel time
number of occurrences 20
N 100
(frequency) 15
Frequency
• Most of the travel times occur 10
Class Heights
• Collect the heights of all the people in the class
• Determine the Mean, Median, Range, and
Standard Deviation of those heights
• Assume that heights are normally distributed,
and that the class members are typical of all
Autoliv employees. About what percent of
Autoliv employees are over 6 feet (1.8 meters)
tall?
• Question
• What percent of marbles will be rejected if the size specifications are set at 2
standard deviations above and below the mean?
• Solution
• Two standard deviations on both sides will include 95% of the parts.
Therefore, the amount rejected will be: 100%-95% = 5%
• Question
• What percent of marbles will be rejected if the upper specification is set at
the mean and the lower specification is set at 12 standard deviations below
the mean?
• Solution
• Half of the parts will fall below the mean and almost no parts will fall below
12 standard deviations. Therefore, the amount rejected will be: 50%-0% =
50%
• Question
• What percent of marbles will be rejected if the size specifications are set
at 6 standard deviations below the mean and 1 standard deviation above
the mean?
• Solution
• Six standard deviations below the mean will include about 50% of the
parts. One standard deviation above the mean will include about 68%/2
= 34%. Therefore, the amount rejected will be: 100%-(50%+34%) = 16%
• Question
• If nothing disturbs the process, what are the chances of getting a marble
more than 6 standard deviations below the mean?
• Solution
• Almost zero