Professional Documents
Culture Documents
Note : These hand outs are used for GB Training of Henry Harwin
Management Academy and for the purpose of GB course reference 2
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 1
5/30/2021
PROCESS MAP
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 2
5/30/2021
Do not forget to
Walk the process 6
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 3
5/30/2021
Inputs Outputs
Making Bread
Flour Good flavor
Yeast Right Texture
Water Color
Energy Correct Weight
Equipment
Personnel
Other Ingredients
Include all process steps, including verification & rework. (1000 meter view)
Making Bread
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 4
5/30/2021
60X,s
10
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 5
5/30/2021
PRIORITIZATION MATRIX
12
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 6
5/30/2021
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 7
5/30/2021
Collection of Data
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 8
5/30/2021
17
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 9
5/30/2021
19
TYPES OF DATA
20
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 10
5/30/2021
TYPES OF DATA
21
TYPES OF DATA
22
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 11
5/30/2021
TYPE OF DATA
23
Exercise - Data
24
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 12
5/30/2021
25
26
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 13
5/30/2021
● You have a large enough sample to find what you’re looking for.
If it’s a rare event or you want to be very precise, you’ll need a
large sample (big enough)
27
Sampling Plan
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 14
5/30/2021
Sampling Strategies
29
BASIC STATISTICS
INFERENTIAL STATISTICS
30
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 15
5/30/2021
When you have a collection of data, there are three things to know
Mean
Median
Mode
32
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 16
5/30/2021
Mean
“Mean” is the statistical term for what most people
call “average”. It is usually the best indicator of
where the center is.
5 X 1
3 X2
We also use
6 this notation X3
2 =4
Mean X4
33
Mean
34
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 17
5/30/2021
Median
Sorted
Set of Data
Data
8 7
8
13 Median is the middle value,
9
7 there are as many samples
10
12 above as there are below.
10 10
This is the physical middle – median
11 Mean is the statistical mean 11
10 12
9 13
35
Mode
Mode is the most common value, the
peak of the distribution.
It is a very weak indicator of where
the center is.
.
36
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 18
5/30/2021
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 19
5/30/2021
When you have a collection of data, there are three things to know
Range
Variance
Standard deviation
40
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 20
5/30/2021
Range
Largest
Set of Data
8 Smallest
13 Definition
7 Range=Largest minus the smallest
10 = (13-7) = 6
12 For groups of 2-10 items, range is about as
11 sophisticated as you need to get.
10 Easy to calculate.
9 Easily distorted by one unusually large or small
datum (outlier). 41
Variance
Set of Data
8
13
7
=
10
12
11
10
Calculate the Mean
9
42
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 21
5/30/2021
12
11
10
9
In English 7
we say X bar
What does (Xi - Xbar) mean?
43
Mathematics
Calculate the deviation (Xi – Xbar)
13
12
11
10
9
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 22
5/30/2021
Mathematics
Square the differences, and fill in the (Xi-Xbar)2 row
13
12
11
10
9
Xi 8 13 7 10 12 11 10 9
Fill the others in Xi- Xbar -2 3 -3 0 2 1 0 -1
(Xi- Xbar )2 4 9
45
Mathematics
Sum the (Xi-Xbar)2 row
13
We write it as
12
11 S(Xi- Xbar )2
10
9
This is the
8
“sum of the squares”,
7 a measure of
total variation
Xi 8 13 7 10 12 11 10 9
Xi- Xbar -2 3 -3 0 2 1 0 -1
(Xi- Xbar )2 4 9 9 0 4 1 0 1 = 28
46
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 23
5/30/2021
Formulae
Standard Deviation
S(Xi- X )2 S(Xi-X)2
n-1 n-1
The standard deviation is just the square root of the variance.
The standard deviation of our set of sample data =
48
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 24
5/30/2021
Standard Deviation
The standard deviation is the most common measure of
spread for collections of data larger than 10 items. It also
works fine for collections as small as n=2.
It is not easily “pulled” by one outlier.
49
Symbols
Roman letters for Greek letters for
estimates based true population
on a sample. parameters
Mean X m
Range R
Standard s
deviation
s
Variance s2 s2 50
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 25
5/30/2021
Summary of Measures
Mean is usually the best measure of where the middle of the data is.
Median is another measure of where the middle is, and it is best when
the data contains outliers, or is known to be non-normal.
Range is the maximum minus the minimum. It is easy to compute, and
gives a good measure of “spread” for small groups of data.
Standard deviation and variance are very sophisticated measures of
“spread”. They are not easily influenced by an outlier.
Mean, median, range, standard deviation, and variance apply
regardless of how your data are distributed. 51
Exercise
For each data set, choose which measure of
“middle” and “spread” is best.
Data Set Middle Spread
2,5,3
3,4,6,1,4,5,7,2,4,1000
,1,5,7,3
3,4,6,1,4,5,7,2,4,
1,5,7,3,3,7,4
2,4,6,651
52
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 26
5/30/2021
When you have a collection of data, there are three things to know
Distributions
A bag of marbles is sorted according to size.
-2 -1 1 +1 +2
mm mm mm mm
cm
54
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 27
5/30/2021
Distributions
A bag of marbles is sorted according to size.
-2 -1 1 +1 +2
mm mm mm mm
cm
55
Dotplot
This gives us a natural graph of the number of cases (frequency) vs.
diameter of the marble. This is a dotplot.
Number
of cases
-2 -1 1 +1 +2 Size of
mm mm mm mm
cm marble
56
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 28
5/30/2021
Histogram
If we make a bar chart, with bars the length of the
stacks of marbles, we have made a histogram.
Number
of cases
-2 -1 1 +1 +2 Size of
mm mm mm mm
cm marble
57
Distribution Curve
If we do an infinite number of measurements, and make our increments of
size infinitesimal, we get a continuous distribution curve.
Number
of cases
-2 -1 1 +1 +2 Size of
mm mm mm mm
cm marble
58
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 29
5/30/2021
Distribution Curve
The number of cases that happen between any two points on the horizontal axis is approximately
the area under the distribution curve, between those two points.
Number Point 1
of cases
Point 2
-2 -1 1 +1 +2 Size of
mm mm mm mm
cm marble
59
60
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 30
5/30/2021
Normal Distribution
The Normal Distribution often occurs in nature.
It is a very useful model.
61
62
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 31
5/30/2021
63
64
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 32
5/30/2021
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 33
5/30/2021
68
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 34
5/30/2021
● We can test whether a given data set can be described as “normal” with a
Normal Probability Plot
69
m2
m3
m1
70
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 35
5/30/2021
68.26%
40%
30% 95.44%
20%
99.73%
10%
0%
m - 3s m - 2s m - s m m + s m + 2s m + 3s
50%
71
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 36
5/30/2021
95.44%
-¥ 99.73%
+¥
Std Dev (s) -3 -2 -1 0 1 2 3
Characteristics Z -
Lower Z
Specification Upper
Limit Specification
6.0s 6.0s Limit
LSL USL
Z - Values 74
-6.0 0 6.0
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 37
5/30/2021
3 4 .1 % 3 4 .1 %
1 3 .6 % 1 3 .6 %
0 .1 3 5 % 2 .2 % 0 .1 3 5 %
2 .2 %
-3 -2 -1 0 +1 +2 +3
S ta n d a r d D e v ia tio n U n its
75
INTRODUCTION TO MINITAB
76
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 38
5/30/2021
Project Manager
Gives you control
over items in
your project
Data Window
This is where data is entered
It can be typed,
pasted from other applications
or generated internally by Minitab 77
NORMALITY TEST
78
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 39
5/30/2021
Box Plot
79
EXAMPLE --
In a Factory, the reactor Cycle time is measured every day for 365 days
In year 2002 2003 and 2004
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 40
5/30/2021
● In Input Constant, give value for which you want The Calculations Done
● If calculations need to be done for more than 1 values, give values in a
Column in worksheet and use Input Column option 81
82
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 41
5/30/2021
● Find out what fraction of data points lie left of mean in Case 1 ?
● Use Calc>Probability Distributions> Normal.
● Choose Cumulative Probability
● Specify Mean =0 and Standard Deviation =1
● Choose Input Constant and Specify value =0.
● Click OK
83
● Repeat the calculation for Case 2. What do the results tell us?
Cumulative Distribution Function
Normal with mean = 0 and standard deviation = 1
x P( X <= x )
0 0.5
84
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 42
5/30/2021
● If we now need to find out what fraction of data points fall between μ-σ and
μ+σ in Case 1 ?
Use Calc>Probability Distributions> Normal.
■ Choose Cumulative Probability
■ Specify Mean =0 and Standard Deviation =1
■ Choose Input Constant and Specify value =-1.
■ Click OK
Use Calc>Probability Distributions> Normal.
■ Choose Cumulative Probability
■ Specify Mean =0 and Standard Deviation =1
■ Choose Input Constant and Specify value =1.
■ Click OK
85
86
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 43
5/30/2021
● Find out fraction of Data points within μ±2σin both the cases
1.
2.
● What about fraction of Data points within μ±3σin both the cases
1.
2.
● Why are the results always same in both the cases ?
● Does this match our knowledge of Normal Distribution ?
● What are the Practical Limits of a Normally Distributed Process ?
● These Practical Limits represent what fraction of Data points ?
87
88
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 44
5/30/2021
Non Normal Distribution This can be due to the data naturally following a specific type of non normal
distribution (for example, bacteria growth naturally follows an exponential
distribution).
89
However, there may be times when your data is supposed to fit a normal
distribution, but doesn’t. If this is a case, it’s time to take a close look at your data.
Outliers can cause your data the become skewed. The mean is especially
sensitive to outliers. Try removing any extreme high or low values and testing your
data again.
Insufficient Data can cause a normal distribution to look completely scattered. For example,
For example..classroom test results are usually normally distributed.
An extreme example: if you choose three random students and plot the results on a graph,
you won’t get a normal distribution. You might get a uniform distribution (i.e. 62 62 63) or you might
get a skewed distribution (80 92 99). If you are in doubt about whether you have a sufficient sample size,
90
collect more data.
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 45
5/30/2021
You may still be able to run these tests if your sample size is large
enough (usually over 20 items) for non-normal distributions
You can also choose to transform the data with a function, forcing it to
fit a normal model.
91
Chebyshev’s Rule
● Applies to any data set regardless of the frequency distribution of the data
So, for any data set, approximately 90% of the data will fall within +/- 3 StDev of the mean 92
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 46
5/30/2021
Propritary/Gopalakrishna/Nirmala
Pharma Consultancy Services 47