Professional Documents
Culture Documents
Basic Statistics
Basic Statistics
Course Objectives D M A I C
Confidential Proprietary
1 2
1
07/03/2023
Course Outline D M A I C D M A I C
1. Introduction
Part 1.0
2. Rationale of Statistics
3. Data Collection
4. The Normal Distribution Introduction
1. Descriptive Statistics
Page 3 • Basic Statistics • Jan-15 Confidential Proprietary Page 4 • Basic Statistics • Jan-15 Confidential Proprietary
3 4
2
07/03/2023
D M A I C What is Statistics D M A I C
Statistics
Page 5 • Basic Statistics • Jan-15 Confidential Proprietary Page 6 • Basic Statistics • Jan-15 Confidential Proprietary
5 6
3
07/03/2023
Population
Population Population
Sample
(Parameter) (Parameter)
✓ The entire set of observations ✓ A subset of data taken from a
that are of interest in a population; denoted by the
Sample statistical investigation; denoted Sample small letter n (the sample size)
(Statistic) by the letter N (the population (Statistic)
✓ Statistics (or sample statistics)
size)
are terms used to describe the
✓ Parameters (or population key characteristics of a sample;
parameters) are terms used to usually denoted by a Latin
describe the key characteristics letter, i.e., x, s
of a population; usually denoted
✓ Sample Statistics are usually
by a small Greek letter, i.e., ,
measured in order to learn
something about Population
Parameters
Page 7 • Basic Statistics • Jan-15 Confidential Proprietary Page 8 • Basic Statistics • Jan-15 Confidential Proprietary
7 8
4
07/03/2023
Random Samples
of Size n=3
Population, N
ADVANTAGES
x1 , s1 ✓ Faster and cheaper than 100% data collection
✓ Avoids handling damage during inspection
x2 , s2 ✓ Requires lesser manpower
Sample
Statistics
x3 , s3
DISADVANTAGE
✓ Data may not be as precise or
Population Parameters (, ) x4 , s4 exact as in 100% data collection
Consider a few different samples of 3 drawn from a population of interest to us. Would
we expect to get the same average and range for each sample? Why or why not?
Page 9 • Basic Statistics • Jan-15 Confidential Proprietary Page 10 • Basic Statistics • Jan-15 Confidential Proprietary
9 10
5
07/03/2023
Page 11 • Basic Statistics • Jan-15 Confidential Proprietary Page 12 • Basic Statistics • Jan-15 Confidential Proprietary
11 12
6
07/03/2023
D M A I C What is a Measure D M A I C
Collection
Page 13 • Basic Statistics • Jan-15 Confidential Proprietary Page 14 • Basic Statistics • Jan-15 Confidential Proprietary
13 14
7
07/03/2023
Page 15 • Basic Statistics • Jan-15 Confidential Proprietary Page 16 • Basic Statistics • Jan-15 Confidential Proprietary
15 16
8
07/03/2023
Page 17 • Basic Statistics • Jan-15 Confidential Proprietary Page 18 • Basic Statistics • Jan-15 Confidential Proprietary
17 18
9
07/03/2023
Page 19 • Basic Statistics • Jan-15 Confidential Proprietary Page 20 • Basic Statistics • Jan-15 Confidential Proprietary
19 20
10
07/03/2023
1 2 3 4 5 6 7 8 ▪ Characterizes a product or
process feature in terms of a
parameter such as size, weight or
Theoretically, any value within an interval is possible with a fine enough measuring device
time
0 1000
Page 21 • Basic Statistics • Jan-15 Confidential Proprietary Page 22 • Basic Statistics • Jan-15 Confidential Proprietary
21 22
11
07/03/2023
What is the
balance telling
us?
Page 23 • Basic Statistics • Jan-15 Confidential Proprietary Page 24 • Basic Statistics • Jan-15 Confidential Proprietary
23 24
12
07/03/2023
Increasing Information
Data are classified into two or Mode Male/Female,
data since you can derive more information with less Nominal
more categories;
Unordered;
Chi-Square With/Without,
Pass/Fail
data. The values of the scale have no
'numeric' meaning
Discrete
Data are grouped according to Median 1st/2nd/3rd,
rank or order; Percentile Small/Medium/L
✓ If continuous data is not available, discrete data Ordinal Scale assignment is by the
property of "greater than," "equal
arge
can be analyzed, results found, and decisions made to," or "less than."
Page 25 • Basic Statistics • Jan-15 Confidential Proprietary Page 26 • Basic Statistics • Jan-15 Confidential Proprietary
25 26
13
07/03/2023
Extra caution must 1. Data integrity or validity must Because all sampling 1. Get samples at random
be taken when studies always have
collecting data. The be high (100% as much as errors, uncertainties
- this avoids bias ensuring
enumerated items at possible) and risks associated
right are some of the
most important
with them, an
experimenter needs to
that each sample has an
considerations… do two things to ensure equal chance of being
2. Data traceability must be that the result of the
selected
present study would be close to
the “true” population
characteristic:
Page 27 • Basic Statistics • Jan-15 Confidential Proprietary Page 28 • Basic Statistics • Jan-15 Confidential Proprietary
27 28
14
07/03/2023
3 Analysis of Data
- involves concise numerical measures
The
like central tendency, spread and shape
Normal
4 Interpretation of Data
- conclusions are based on the
charts, graphs
Distribution
Page 29 • Basic Statistics • Jan-15 Confidential Proprietary Page 30 • Basic Statistics • Jan-15 Confidential Proprietary
29 30
15
07/03/2023
▪ It is the most common continuous distribution in the entire field of statistics; it’s
graph called the normal curve is a bell-shaped curve which approximately 1. Bell-shaped and extends indefinitely in both directions
describes many phenomena that occur in nature, industry, and research, i.e.,
weight, IQ, thickness, etc.
2. Asymptotic: the curve comes closer and closer to the
▪ In process control, it is used as a model of a good or stable process behavior horizontal axis without even reaching it
▪ It is completely described by two parameters: mean and standard deviation
3. Symmetric with respect to the mean
99.73 % of product
95.45 % of product Useful probabilities 4. Unimodal (only have one mode or “hump”)
68.27 %
5. Skewness, = 0; Kurtosis, = 3.0
- 3 - 2 - 1 + 1 + 2 + 3
Page 31 • Basic Statistics • Jan-15 Confidential Proprietary Page 32 • Basic Statistics • Jan-15 Confidential Proprietary
31 32
16
07/03/2023
Workshop D M A I C D M A I C
Part 2.0
The Normal Distribution
Descriptive
(Instruction to be given by the facilitator)
Statistics
Page 33 • Basic Statistics • Jan-15 Confidential Proprietary Page 34 • Basic Statistics • Jan-15 Confidential Proprietary
33 34
17
07/03/2023
Page 35 • Basic Statistics • Jan-15 Confidential Proprietary Page 36 • Basic Statistics • Jan-15 Confidential Proprietary
35 36
18
07/03/2023
D M A I C Central Tendency D M A I C
Count
→ Mode: the most frequently
Central Tendency occurring data value
(Location)
Measure
Page 37 • Basic Statistics • Jan-15 Confidential Proprietary Page 38 • Basic Statistics • Jan-15 Confidential Proprietary
37 38
19
07/03/2023
Mean D M A I C Median D M A I C
Example:
Example:
Given: (Even) xi : 62, 73, 78, 78, 78, 86, 86, 89, 90, 95
Given: xi : 62, 73, 78, 78, 78, 86, 86, 89, 90, 95 (Odd) xi : 62, 73, 78, 78, 78, 86, 86, 89, 90
Page 39 • Basic Statistics • Jan-15 Confidential Proprietary Page 40 • Basic Statistics • Jan-15 Confidential Proprietary
39 40
20
07/03/2023
Mode D M A I C Discussion D M A I C
Given: xi : 62, 73, 78, 78, 78, 86, 86, 89, 90, 95
Mode = 78
Page 41 • Basic Statistics • Jan-15 Confidential Proprietary Page 42 • Basic Statistics • Jan-15 Confidential Proprietary
41 42
21
07/03/2023
D M A I C Variability D M A I C
Count
→ Variance
Variability → Standard Deviation
▪ Standard deviation is the most
Page 43 • Basic Statistics • Jan-15 Confidential Proprietary Page 44 • Basic Statistics • Jan-15 Confidential Proprietary
43 44
22
07/03/2023
Workshop D M A I C Range D M A I C
Beads Experiment
(Instruction to be given by the facilitator) Example:
Given: xi : 62, 73, 78, 78, 78, 86, 86, 89, 90, 95
Range = 95 – 62 = 33
Page 45 • Basic Statistics • Jan-15 Confidential Proprietary Page 46 • Basic Statistics • Jan-15 Confidential Proprietary
45 46
23
07/03/2023
Example: Example:
Given: xi : 62, 73, 78, 78, 78, 86, 86, 89, 90, 95 Given: xi : 62, 73, 78, 78, 78, 86, 86, 89, 90, 95
(x − x) 2
(62 − 81.5)2 + (73 − 81.5)2 + ... + (95 − 81.5)2 (x − x) 2
(62 − 81.5)2 + (73 − 81.5)2 + ... + (95 − 81.5)2
s =
2
= = 93.39 s= = = 9.66
n −1 10 − 1 n −1 10 − 1
Page 47 • Basic Statistics • Jan-15 Confidential Proprietary Page 48 • Basic Statistics • Jan-15 Confidential Proprietary
47 48
24
07/03/2023
Discussion D M A I C D M A I C
Measures of
Shape
Page 49 • Basic Statistics • Jan-15 Confidential Proprietary Page 50 • Basic Statistics • Jan-15 Confidential Proprietary
49 50
25
07/03/2023
Skewness D M A I C Skewness D M A I C
Page 51 • Basic Statistics • Jan-15 Confidential Proprietary Page 52 • Basic Statistics • Jan-15 Confidential Proprietary
51 52
26
07/03/2023
Kurtosis D M A I C Exercise D M A I C
n(n +1) xi − x 3(n −1)2 Q2: Is there really a problem in Plating? Should Customer A be nervous about
n 4
Kurtosis = − ON’s process capability? If there is a problem, what will be the focus of
(n −1)(n − 2)(n − 3) i=1 s (n − 2)(n − 3) improvement (process location or spread)?
Page 53 • Basic Statistics • Jan-15 Confidential Proprietary Page 54 • Basic Statistics • Jan-15 Confidential Proprietary
53 54
27
07/03/2023
Summary D M A I C Summary D M A I C
1. Population refers to the entire set of observations that are of interest in 9. Vital information are extracted from the samples by measuring:
a statistical investigation; denoted by the letter N (the population size) a) Central Tendency (Location): Mean, Median, Mode
2. Sample refers to a subset of data taken from a population; denoted by b) Variability (Spread): Range, Variance, Standard Deviation
the small letter n (the sample size) c) Shape: Skewness, Kurtosis
3. Sample Statistics are measured in order to learn something about
Population Parameters
4. Advantages of sampling studies: (a) faster and cheaper than 100%
data collection, (b) avoids handling damage during inspection, (c)
requires lesser manpower
5. Disadvantage of sampling studies: risks are involved, data may not be
as precise or exact as in 100% data collection
6. Data collection to draw sample statistics must be carried out carefully
considering the right sample size and employing randomization
7. There are two general types of data for collection:
a) Qualitative
b) Quantitative: Discrete (attribute) and Continuous (Variables)
8. The Normal Distribution is the most common continuous distribution in
the entire field of statistics
Page 55 • Basic Statistics • Jan-15 Confidential Proprietary Page 56 • Basic Statistics • Jan-15 Confidential Proprietary
55 56
28
07/03/2023
D M A I C
References
1. The Six Sigm a Handbook
By Thomas Pyzdek
6. Wikipedia
57
29