Professional Documents
Culture Documents
Editor
A.Z. Keller, Department of Industrial Technology and Management,
University of Bradford, U.K.
Aims and Scope. Fundamentalquestions which are being asked these days of all products,
processes and services with ever increasing frequency are:
What is the risk?
How safe is it?
How reliable is it?
How good is the quality?
How much does it cost?
This is particularly true as the govemment, industry, public, customers and society
become increasingly informed and articulate.
In practice none of the three topics can be considered in isolation as they all interact and
interrelate in very complex and subtle ways and require a range of disciplines for their
description and application; they encompass the social, engineering and physical sciences
and quantitative disciplines including mathematics, probability theory and statistics.
The major objective of the series is to provide a series of authoritative texts suitable for
academic taught courses, reference purposes, post graduate and other research and
practitioners generally working or strongly associated with areas such as:
Safety Assessment and Management
Emergency Planning
Risk Management
Reliability Analysis and Assessment
Quality Assurance and Management
Special emphasis is placed on texts with regard to readability, relevance, clarity, ap-
plicability, rigour and generally sound quantitative content.
The titles published in this series are listed at the end of this volume.
Quality Improvement with
Design of Experiments
A Response Surface Approach
by
IVAN N. VUCHKOV
University of Chemical Technology and Metallurgy,
Sofia, Bulgaria
and
LIDIA N. BOYADJIEVA
University of Chemical Technology and Metallurgy,
Sofia, Bulgaria
....
''
SPRINGER-SCIENCE+BUSINESS MEDIA, B.V.
A C.I.P. Catalogue record for this book is available from the Library of Congress.
and
CONTENTS
Introduction 73
Residualplots 74
Normaland half-normal plots 77
2.3.10. TRANSFORMATIONS OF VARIABLES 80
2.3.11. WEIGHTED LEAST SQUARES 82
2.4. Bibliography 84
Appendix A.2.1. Basic equation ofthe analysis ofvariance 84
Appendix A.2.2. Derivation of the simplified formulae (2.1 0) and (2.11) 85
Appendix A.2.3. Basic properties ofleast squares estimates 86
Appendix A.2.4. Sums ofsquares for tests for lack offit 88
Appendix A.2.5. Properties ofthe residuals 90
CHARACTERISTICS 311
6.5.2.USE OF NUMERICAL OPTIMIZATION PROCEDURES 315
6.5.3.PRACTICALPROBLEMS 318
6.6. Model based tolerance design 325
6. 7. Summary of the model based approach to quality
improvement through reduction of the transmitted error 328
6.8. Friction welding example 328
6.9. Bibliography 338
Appendix A.6.1. Development ofthe algorithm of subsection 6.4.1 338
Appendix A.6.2. Development ofthe algorithm ofsubsection 6.4.2 342
models 408
8.5.1. USING COMBINED ARRAYS AND OPTIMIZING LOSS
ST ATISTICS VIA MODELLING THE UNDERLYING RESPONSE 408
8.5.2. MONTE CARLO EXPERIMENTS 409
8.5.3. USE OF TAGUCHI METROD WITH MECHANISTIC MODELS 409
8.6. Specific problems of quality improvement based on
mechanistic models 409
8.7. Bibliography 412
Appendix A.8.1. Derivation offormulae (8.4) and (8.6) 412
Appendix A.8.2. Development offormulae (8.11) and (8.12) 416
Appendix A.8.3. Derivation ofmean and variance models for third order
polynomials 419
BffiLIOGRAPHY 482
PREFACE
This book is devoted to the problern of quality improvement of products and processes
through robust engineering design. Taguchi was a pioneer in this field and his
methodology became popular among the engineers and statisticians. Many successful
applications were reported. While Taguchi's engineering ideas were widely recognized,
many statisticians expressed criticism with respect to his statistical procedures. This
inspired development ofan alternative to Taguchi's approach based on response surface
methodology.
High quality of a product can be achieved when the mean values of its performance
characteristics are close to given targets, while the variations are as ·small as possible.
The fundamental idea behind this book is to use models to this end. Experiments have to
be carried out in order to collect information for model building. Response surface
methodology provides a variety of economical designs that can be used for quality
improvement. Two models can be obtained on the basis of observations - one for mean
value and another for variance of product's performance characteristic. With these
models optirnization procedures can be used to find product parameters that rninimize
performance characteristic' s variance while keeping the mean value on a target.
The book integrates design of experiments, model building and optimization techniques
for robust product or process design. The first chapter is introductory and gives a
general idea for model-based robust engineering design.
In order to make the book self-contained the basic ideas of the response surface
methodology are given in chapters 2 and 3. Chapter 2 presents statistical methods for
data analysis. Analysis of variance and some important combinatorial experimental
designs are considered. They are used in Chapter 4, where Taguchi's methodology for
quality improvement is presented. In Chapter 2 are also given basic principles of the
regression analysis. This is the main model-building tool used in the response surface
methodology.
In Chapter 3 we consider some important response surface designs. Methods for
interpretation of the models obtained through regression experiments are also discussed.
This is a basis for understanding the material given in Chapters 5 to 10.
Chapter 4 presents the main ideas and tools for quality improvement through design of
experiments proposed by Taguchi. On this basis a model-based approach to this problern
was developed, which is an attempt to combine the engineering ideas of Taguchi with
response surface methods for model building and optimization.
As it was noted, model-based robust engineering design requires two models: one for the
mean value and another for the variance. They take into account two sources of
variation: errors in product parameters and extemal noise factors. In the next chapters is
shown that the models reflecting the errors in product parameters and the extemal noise
factors have different properties. In Chapter 5 models of the mean value and variance
xvi
induced by the errors in product parameters are considered. The mechanism of error
transmission from the product parameters to the response is studied. An analysis of the
accuracy ofthese models is also presented.
Chapter 6 proposes optimization procedures for robust design based on models of
products or processes with errors in parameters. Optimization criteria are discussed.
Analytical and numerical optimization methods are considered, model-based decision
making and tolerance design procedures are proposed.
Chapter 7 deals with methods for design of experiments, model building and
optimization for products and processes, which are subject to both errors in parameters
and external noise factors.
Models of mean value and variance of the performance characteristics induced on the
basis of mechanistic models of products or processes are considered in Chapter 8. If
there are errors only in product parameters these models can be obtained without
experiments or simulations of noises. Methods for incorporating experimental data in the
mean and variance models are given in this chapter.
Models for quality improvement of products and/or processes with qualitative and
quantitative factors are proposed in Chapter 9. Optimization procedures are given as
weil.
Methods for building mean and variance models based on repeated observations in the
design points and their modifications for unreplicated observations are considered in
chapter 10. Graphical tools for studying location and dispersion effects are given.
Parameter estimation in cases with errors in factor Ievels is briefly presented. At the end
of this chapter attention is paid to a model-based approach to robust design of signal-
dependent systems.
The book has been written for engineers and statisticians working in the field of quality
improvement and for senior and graduate students in engineering. Previous versions of
the text have been used as a textbook for students and in industrial short courses.
lt is assumed that the readers possess knowledge of basic statistical methods and matrix
algebra. Through all chapters the main ideas are presented avoiding complicated
mathematical proofs. They are explained through many real and constructed examples.
For readers who want to go in details the proofs are given in appendices to most of the
chapters. Additional reading in the end of each chapter and an extensive bibliography in
the end of the book are provided.
The idea ofwriting the book came to us from a short course organized by Dr. E. Walter
for industrial engineers in SUPELEC, Paris where we presented the main ideas of
model-based quality improvement. We would like to thank Prof. Henry Wynn, Dr. Eric
Walterand Dr. Luc Pronzato for their helpful discussions during the research activities
that Iead to the development of this book. The authors very much appreciate the editorial
assistance ofMrs. Milena Todorova, who greatly improved the readability ofthe book.
We are most grateful to our families for their continuous encouragement of our research
and teaching activities that made possible the appearance oftbis book.
In a production process technology, machines, and workers are the same for each
product. It is expected products to be the same too, but they are not. Some behave
perfectly when used, others need adjustment or repair and some are unusable at all.
Why does this happen? How to find the causes of the variations in product's
performance characteristics and now to eliminate or at least decrease them?
Answering these questions will Iead to high quality production of goods.
Product's quality can be regarded as a totality of specific features and
characteristics which need to be estimated in order to answer the question whether the
product conforms to its purpose.
Each product has specific functional characteristics, also called performance
characteristics. Their desired values are known from specifications or standards.
Deviations from these target values may cause Iosses to the society. These Iosses can be
financial or could involve customer's dissatisfaction, loss due to a company's bad
reputation, losing market share in long term, etc.
Causes of deviations from the desired values can be grouped into following categories:
i) Manufacturing imperfections, for example
Human errors,
Raw material variability,
Poor operation of the machines,
Measurement errors.
ii) Environmental influences. They can occur both in the manufacturing process
and in the usage of the product. Such influences can be the environmental temperature,
dust, humidity, vibrations, improper use ofthe product, etc.
iii) Product deterioration. Product' s performance characteristics often become
worse with time.
Take as an example the truck tyre production. The truck tyre is a complex
product, consisting of several rubber layers (protector, carcass, breaker, etc.) with
different functions and composition. Each of the layers consists of a !arge number of
components (often more than 10). After the preparation of rubber mixtures and their
preliminary treatment the elements of the tyre are stuck tagether in a given manner. Then
the tyre is put in a press where it is shaped and vulcanized under given pressure and
curing temperature.
2 CHAPTER I
The product performance characteristics are subject to random variations. Denote the
performance characteristic by y. It can be regarded as consisting of two parts: a non
random part "1 and a random disturbance denoted by e:
INTRODUCTION TO QUALITY IMPROVEMENT 3
y = 17+ 6.
f(y)
f1+3cr y
Figure 1.1. Probability density function for a normally distributed random variable
It is weil known that 99.7% of all normally distributed observations fall within the
interval 1]± 3 a, where a is the standard deviation ofy.
Having a sample of n observations on y one can compute the estimates of the
mean value
1 n
y=-LYu
n u=I
- 1 ~(y.-Y-)2 .
s 2 --L.J
n u=I
Denote by USL the upper specification Iimit and by LSL the lower specification
Iimit for y. Figure 1.2 shows several dispositions of the probability density function with
respect to the specification Iimits. The target value of the performance characteristic is
denoted by r.
4 CHAPTER 1
Figure 1.2a shows a weil-centred process with small enough vanat1on. All
performance characteristic's values are within the specification Iimits and there is also
some tolerance which makes it possible to expect that the performance characteristic will
remain within the specification Iimits even when small changes in the production
conditions occur.
One can see that the mean value 1J of the characteristic shown on Figure 1.2b is
shifted to the left of the target value r: That means that a part of the production will be
defective because its performance characteristic is below LSL. On Figure 1.2c one can
see a distribution for which the performance characteristic's variance is too !arge and
both LSL and USL are violated.
Even if the product' s pieces with performance characteristic outside the
specification Iimits are removed, the quality shownon Figure 1.2b is still worse than for
Figure 1.2a, because a !arger part of customers will get a product that has performance
characteristic nearer to LSL than to the target value r. In addition, the removal of the
defectives Ieads to financiallosses to the producer.
To provide a high quality product an engineer must take countermeasures against the
sources of variation. This can be done in all stages of the production process and usage
of the product. In the worst case, a fraction of non-conforming products will reach the
customer. Obviously this will darnage company's reputation and its market share in long
term, and will increase enormously after-sales service costs. This can be avoided through
a strict final inspection. A strict final inspection on its own would not be sufficient,
because it can detect the defectives but not the causes for their occurrence. The defective
products will come again and again from the product lines and this will cause loss to the
company.
INTRODUCTION TO QUALITY IMPROVEMENT 5
Far better solution is to detect the causes for variations and to remove them.
Traditional statistical methods for quality control are applied at the manufacturing stage
in an attempt to reduce the manufacturing imperfections in the product. They are called
on-line quality control methods. The most used ofthem are:
• Cause and effect diagrams,
• Control sheets,
• Histograms,
• Pareto diagrams,
• Scatter diagrams,
• Control charts,
• Stratification.
Manufacturing engineers are aware that the cost of detection and correction of
manufacturing imperfections increases rapidly as the product moves · along the
manufacturing line. Most inexpensive is to correct the manufacturing imperfections
immediately after they occur.
The following question arises:
Js it possible to prevent the appearance of manujacturing imperfections,
environmental noises or product deterioration or at least to decrease their injluence on
product performance characteristic?
The answer is yes and this can be achieved in many ways. For example, the
production process can be considerably improved and stabilized by introducing
automatic control systems. This is a good but rather expensive way of quality
improvement.
Another approach would be to make the product robust (insensitive) to
manufacturing imperfections and environmental variables.
We can illustrate this opportunity through the truck tyre example. Suppose we
are interested to obtain tensile strength y of a rubber composition equal to a given target
value r. It depends on certain parameter p used in the formulation (Figure 1.3).
Two values of p can ensure the desired value of the tensile strength: p 1 and p 2 .
Which one is better?
Suppose that the performance characteristic of the natural rubber varies within
some interval araund p 1 or p 2 . This will result in variations in the tensile strength of the
rubber composition y. They can be decreased in two different ways:
i) By tightening the tolerance interval for p. That means to purchase high quality
natural rubber which is expensive.
ii) By shifting the operating point from p 2 to p 1 . One can see that the same
variation of p will cause less variation of y if the operating point was chosen at p 1 rather
than at p 2 . This is a eheaper solution than tightening the tolerance intervals.
In other words high quality products can be produced even if the raw materials
are not perfect.
In a sirnilar way by proper choice of the performance characteristics of a product
or a process it can be made robust against manufacturing imperfections, environmental
noises, etc. The methods for robust product or process design do not differ in all these
cases. That is why for the sake of brevity we will often speak of robust product design,
remembering that all discussions are also applicable to robust process design.
The example of Figure 1.3 is simplified. Usually the performance characteristics
depend on many parameters and to obtain a high quality product we need to consider
several performance characteristics. The idea illustrated by Figure 1.3 can be applied for
any number of parameters and performance characteristics. Forthis purpose systematic
methodology is needed. An attempt to develop such methodology was undertaken by the
Japanese Professor Genichi Taguchi who called this approachoff-line quality control.
Off-line quality control activities are conducted at the product or process design
stage and are aimed at making them robust against manufacturing imperfections,
environmental noises and product deterioration. They can ensure high quality production
at a low cost.
The life of a product starts with the product and process design, the next step is
manufacturing and at the end the customers use the product. To get a new product one
should invest in research which will make the product robust against manufacturing
imperfections and environmental variable variations. If this has not been done the
product performance characteristics variations can be too high and this may lead to many
defectives at the manufacturing stage. The cost for elirninating these effects is higher
because adjustment or repair is needed for any particular fraction of defective product. If
this opportunity is rnissed again and the defective products reach the market, the loss for
the society is much higher because the customers do not posses the necessary specialized
knowledge, skills and equipment to repair the product. They must use special services
and this is expensive.
In this book we consider only off-line quality control methods.
INTRODUCTION TO QUALITY IMPROVEMENT 7
Consider a product or process shown on Figure 1.4. Denote by 'I the perforrnance
characteristic of interest (for example, tensile strength, viscosity, output voltage, etc.).
We consider 'I as a deterministic function of m product/process parameters (for example,
temperature, pressure, concentration, input voltage, etc. ).
PJ -J--..-J
Product y
P2-~J--~
(Process)
Z; = P; +e;,
i = 1,2, ... ,m.
and vibrations during the product's usage. In an antibiotic production, for examp1e, the
environmental temperature during the production process can be considered as an
influential external noise factor.
1t is expected that the external noise factors can be set on given values during an
experiment, but it is not possible to keep them fixed in the mass production or in usage
when they vary randomly within certain intervals.
iii) Random outpul noise &. It incorporates all random influences that are not
taken into account by the errors in factors eP e2 , ... em or by the external noise factors
n1 ,n2 , ... ,nq.
Consequently, the real value of the performance characteristic of the product in
usage is the following:
Looking at Figure 1.3 one can draw the conclusion that the mean value and the variation
of the output characteristics depend on
i) The parameter values (the operating points),
ii) The statistical characteristics of the noises (the errors in product parameters,
the external noise factors and the output noise).
To find the best values of the parameters which minimize the performance
characteristic's variability, while keeping its mean value on a target r one can conduct an
experiment for different combinations of the product parameters and the noises. Then
one can see which combination provides the desired value T of the performance
characteristics and minimizes the output variance.
Very important question is how to organize the experiments. They must be
designed in such a way that will provide the engineer with information necessary for
making decisions. There is no computational method able to extract information that is
not contained (at least implicitly) in the primary data.
A Iist ofthe values offactor combinations (product parameters and noise factors)
to be carried out during the experimentation phase is called design of experiments.
Confusion can arise with the term engineering design which is entirely different thing.
Therefore, one must be careful using these terms.
Having the results of the experiments one can try to analyze them. It is
convenient to use models for this purpose. They are relationships connecting the
performance characteristic (often called also response) with the factors. lt is also
necessary to check how good is the model that fits the data. This procedure is called
goodness-oj -fit lest.
At the end specific optimization procedures must be applied to help. the engineer
in making decisions.
INTRODUCTION TO QUALITY IMPROVEMENT 9
Ideally teamwork will help very much solving an engineering problem. The team
must incorporate people with different skills and knowledge:
a) Person(s) authorized to make decisions about the product (process),
b) Specialist(s) in the technology ofthe product or process under consideration,
c) Specialist(s) in the design and analysis of experiments who has deep statistical
knowledge and is able to use specialized statistical software.
It is possible that the above three specialists are the same person.
In this book we deal with the most important aspects of the experimental design,
model building, testing goodness of fit and optimization procedures aimed at product or
process improvement. Different approaches for solving these problems are possible. We
consider the so-called Taguchi method and a model-based approach using the results of
the response surjace methodology.
The response surface methodology is an approach to product and process
optirnization which is based on representation of the performance characteristic as a
function of some factors. These factors can be product parameters p 1 , p 2 , ... p m and/or
extemal noise factors n1 , n 2 , ... , nq. This model is obtained through an experiment.
Further on we will use the notation x; for factors expressed in natural measuring scale.
In Chapter 2 we introduce coding of factors which is convenient in the design and
analysis of experiments. The vector of coded factors is denoted x = (x1 x 2 . . . xJr .
For example, consider a problern arising in the production of inner tubes for car
tyres. The rnixture formulation consists of eight components, but in the case under
consideration the engineers are interested to see how the elongation y (%) depends on
two factors: the proportion of synthetic butadiene ruhher (SBR), denoted by x;
and
measured in weight parts (w.p.) and the proportion of soot x~ (w.p.). During the
experiments these proportians are varied within the following intervals:
A test for Iack of fit shows that this is an adequate model for the elongation.
Equation ( 1. 1) can predict the value of elongation y for any combination of SBR
( x;) and soot ( x~) within the variation intervals defined above. The relationship between
the response y and the factors x; and x~ can be graphically presented through a response
surjace which is shown on Figure I. 5. The response surface is a locus of points that
satisty equation (I. I).
Elongation
7
60
20 20
Figure 1.5. Response surface: elongationy as function ofthe amount ofSBR, x; and soot, x;
A two dimensional projection of the response surface on the plane defined by the
coordinate axes x; and x~ is called contour plot. Each contour of this plot corresponds
to a constant value of the response. Contour plot for the elongation of the inner tube is
shownon Figure 1.6.Careful exploration ofthe contour plot can reveal many interesting
properties of the performance characteristic. An interesting point on Figurel.6 is so
INTRODUCTION TO QUALITY IMPROVEMENT II
called stationary point which can be obtained by Setting the firstderivatives of y to zero.
The coordinates of the stationary point are x;, = 27.3079 w.p. and x~. = 30.0667 w.p.
When the response surface Iooks as a saddle like in Figure I. S the stationary point is
called saddle point. By moving the operating point to the left or to the right of the saddle
point on Figure I. 6 the elongation is increased, while the movement along the coordinate
axis x~ Ieads to its reduction. Using the contour plots one can make decisions about the
optimal parameter values. For example, suppose that the elongation should not be less
than 660 %. The values of factors can be chosen to fall within the region on the right of
the contour corresponding to y = 660. The final decision should be made on the basis of
studying other performance characteristics and taking into account the prices of mixture
components.
Often the number offactors is more than two. In this case the decision can not be
made merely on the basis of contour plots and more sophisticated optimization
procedures should be employed.
The totality of methods for experimental design, model building, statistical
analysis of models and exploration of response surfaces is called response surjace
methodology (RSM). The models used in RSM are usually regression equations. There
are many different kinds of designs. Some of them are not used in RSM. The response
surjace designs are only a subset ofthe designs ofexperiments.
The response surface methodology provides powernd tools for product or
process improvement. They are useful for choosing optimal parameter values that
provide performance characteristics satisfying customer requirements, while keeping the
product price low. As we already noted, efforts of people with different skills and
knowledge are needed for a successful application ofthe response surface methodology.
12 CHAPTER 1
Variance
17
16
15
14
50
60
40
20 20 --~I
XI
Figure 1. 7. Elongation variance surface
INTRODUCTION TO QUALITY IMPROVEMENT 13
30 40 50 60
XI
~
Figure 1.8. Elongation variance contour plot
14
STATISTICAL METHODS FOR DATA ANALYSIS 15
Analysis oj Variance (ANOVA) is a method for examining the effect of one or more
factors on a given process or product. It can be used to analyze data from experiments
with both quantitative and qualitative factors.
Subsection 2.1.1 explains the idea of the analysis of variance in a simple case
when only one factor is considered to influence the results of experiments. lt is
generalized for multiple factors in the next subsections.
Main results
Consider the following example. Three operators are working in shifts producing the
same product and using the same equipment, raw materials, and technology. There are
differences in the product's performance characteristic and the process manager needs to
know whether the operators (their skills and discipline) cause them. The influence of a
three-level factor (operator) on the performance characteristics has to be therefore
studied.
Let a factor B has k Ievels B(11B(2l ... ,B(k). An experiment is conducted and r
observations for all factor Ievels of the variable y are made. The total number of
observations is N = rk and the results are given in Table 2.1.
The expectations of the i-th factor Ievel observation are denoted E(yi) = f.Ji , i =
1,2, ... ,k.
Consider a statistical hypothesis H 0 :p1 = f-1 2 =... = f-lk. If it is true then factor B
has no significant effect on the observations. H 0 is usually called null hypothesis.
To test the null hypothesis the estimates of the expectations J.ii are first computed
as follows:
1 r
.Y, =-_Lyij, i = 1,2, ... ,k.
r J=I
1 k r
y=-k_L_Lyij.
r i=I J=I
r
It is easy to see that LYi1 = ryi and
j=l
1 k
y=-k_Ly;. (2.1)
j=l
Assurne that the observations are non-random. Then y; = J.ii and y = E(y) = f.1 .
Wehave from (2.1)
(2.2)
Additionally assume that the null hypothesis H 0 is true and p 1 = p 2 =... = J.ik =c. Then
(2.2) is transformed to:
1 k
p=-,Lc=c.
k j=l
This result means that if a factor is not influencing the observations, then their mean
values for all of the factor Ievels and the total mean are equal.
In real life the Observations are random and y and j/1, j/2 , ... , yk are never equal
even if the null hypothesis it true. In spite of this we will compare y with .YP j/2,... ,Yk
and if the differences are small enough the null hypothesis will be accepted.
There are two sources of variation that cause the observations to deviate from the
total mean:
• Variation due to the factor,
• Variation due to other sources. It is called residual variation.
To study the variation Iet' s consider the total sum of squares:
STATISTICAL METHODS FOR DATA ANALYSIS 17
(2.3)
(2.4)
where
(2.5)
and
k r
The sum Q8 takes into account the variation due to the factor, because it shows
the difference between the means y; and the total mean y . If the factor is insignificant
then the deviations y; - y and the sum Q8 are small. Therefore, Q8 characterizes the
variation between the groups of observations, corresponding to the factor Ievels.
The sum QR is formed by the deviations yif - y; caused by the residual variation
which is also called variation within the groups of observations.
Let us compute following variance estimates:
(2.7)
and
(2.8)
with
and
VR = k(r - 1) = kr - k.
v=kr-1.
and therefore,
18 CHAPTER2
(2.9)
Simplified jormulae
Simplified formulae for Q, Q8 and QR can be derived, to reduce the computations. They
are also used for explanation of results in the next chapters. The proofs are given in
Appendix A.2.2.
Denote N = kr and Iet Y. be one ofthe observations, u = 1,2, ... ,N. Denote also
2
1( N
K=- LY. )
N u=l
and
r
Bi= LYij"
J=l
Q= LY~-K, (2.10)
u=l
1 k
QB=- LBi2-K (2.11)
r i=l
and
QR= Q- QB. (2.12)
If the number of the observations for each factor Ievel is different - 'i , r2 , ... , rk,
then Q8 can be computed using the formula:
STATISTICAL METHODS FOR DATA ANALYSIS 19
(2.13)
Computational procedure
The procedure is as follows:
1. Compute
r
and
N k
_Ly;=_LBii.
u=l i=l
2. Compute
and
VR =N -k. (2.17)
2. Equations (2.I4) and (2.I5) are used to calculate B; and Bw For example
I ( ) 2 9.54 2
K=-
I8 2.94+282+378
. . = -I8- = 50562
. .
and
N
v=N-I=I8-I=I7,
VB= k -I = 3 - I = 2,
and
VR =k{r-I)=3(6-I)= I5.
Often the product performance characteristic depends on more than one factor.
Therefore, we need a version of ANOVA which is appropriate for analysis of
experimental data with several factors. Multiple and one way classification do not differ
too much. Suppose there are m factors of interest without interactions between them.
Then the total sum of squares Q can be resolved into m+ 1 sums as follows:
Interactions between two factors can be studied assuming that they are not
aliased with other effects. Effects are aliased if they can not be separately estimated.
More details on aliasins are given in Chapters 3 and 4. In this case the total sum of
squares can be presented as follows
(2.18)
where QA,Q8 , ... ,QM aresums ofsquares due to main effects offactors, QA8 ,QAc•··· are
sums of squares due to interaction effects and QR is the residual sum of squares.
Consider first the case with repeated Observations. Denote n the number of
different experimental trials and Iet r be the number of repetitions which is one and the
same for each trial. The total number of Observations is N = nr.
Total sum of squares can be computed as follows:
(2.19)
i=l j=l i=l j=l
where
_ 1 n r
y=-:L:~:>ij
nr i=I j=I
nr
STATISTICAL METHODS FOR DATA ANALYSIS 23
k. A2 A2 A2
"' {~ -)2 =-+-+
QA=rL....rtavz-Y I - -K,
2 ... +ka (2.20)
1=1 Tlja ~a rrka
where y1 , I= 1, 2, ... , k a is the arithmetic mean of all 11ja Observations for 1-th Ievel of
factor A, while A1 denotes the sum ofthese observations.
The degrees offreedom for QA are vA = ka -1.
If the effect of a given interaction, for instance AB, is not aliased with other
effects, it can be studied by computing the following sum of squares:
where y11 is the arithmetic mean of all Tlj1 observations obtained for combination of Ievels
A = I and B = t, I= 1, 2, ... , ka, t = 1, 2, ... , kb, while A1B1 is the sum of these Observations.
Thedegreesoffreedomfor QAB are vAB =(ka -1Xkb -1).
Formulae (2.20) and (2.21) can be derived similarly as for the single factor case.
For more detai1s see Logothetis and Wynn (1989).
If there are repeated observations the residual sum of squares can be resolved
into two parts:
(2.22)
where QR1 represents the error between different experimental conditions (the error
between experiments), and QR 2 is the sum of square due to the errors within experiments
(the error among replications).
The sum QR 1 can be computed in the following way. Consider the "experimental
trial" as a new factor with n Ievels. The sum of squares due to this factor can be
computed similarly to (2.20):
24 CHAPTER2
where
- 1~ . 12 , ... n.
Y;=-"-'yii' z=,
r i=J
The sum Q17 takes into account the effects of all factors and interactions L Q.1 ,
and the effect ofthe error between the experimental conditions QR 1. Hence:
where
(2.24)
respectively.
By testing s;1 against s; 2 one decides on the significance of the inter-
experimental error. Following ratios are used for this purpose:
or
Denote the critical values of the F-test for significance Ievel a as follows:
F;RI == F(a, v RI, v Rz) and F,;
1 == F(a, v R2 , v RI). Therefore:
•lf F';1 :::; F;RI or F;; :::; F;; 1 then the inter-experimental error is insignificant, it is
pooled together with the replication error and the source influence is tested for
significance against the residual variance s; .
•lf F; 1 > F;RI then the influence of factors and interactions is tested against the
variance ofthe inter-experimental error s; 1 •
The results are usually presented as shown in Table 2.5.
TABLE 2..
5 ANOVA : muI. I compansons
tlple
Source Sumof Degrees of Variance F-ratio
squares freedom
A QA VA szA FA
B QB VB szB FB
... ... ... ... .. .
AB QAB VAB s~e FAB
... ... ... ... .. .
Residual! QRI VRI s;I
2
Residual2 QR2 VR2 SR2
Residual QR VR szR
Total Q V
Consequently, the condition (2.24) is not satisfied and QR can not be resolved
into two parts according to (2.22).
The analysis of variance in the case without repeated observations is carried out
as follows. First the total sum of squares is computed using the formula
i~J i~J
where
)2
LY;
n
(
1
LY; and K = ....o...':.....:·~J_____..e._
n
y =-
n ;~ 1 n
The degrees of freedom for Q are v =n- 1. Making r =1 one can compute the
sums of squares due to the effects of factors and interactions by formulae (2.20) and
(2.2l).The residual sum ofsquares is
where
STATISTICAL METHODS FOR DATA ANALYSIS 27
The effect of factor A is significant if F > F(a, v A, v R), where F(a, v A, vR) is
the critical value of F -distribution for Ievel of significance a and degrees of freedom
VA and VR.
T ABLE 2..
6 Water treatment data
No. A B Y;i Ai
I I I 3.6 3.8 4.3
2 I 2 3.0 3.2 3.4 AI =30.0
3 I 3 2.8 2.6 3.3
4 2 I 4.3 3.9 4.I
5 2 2 3.2 2.8 3.3 ~ =32.4
6 2 3 3.8 3.4 3.6
7 3 I 3.8 3.4 3.6
8 3 2 3.7 3.8 3.6 ~ =31.5
9 3 3 3.I 3.0 3.5
10 4 I 3.6 3.2 3.4
11 4 2 3.3 3.9 3.6 A4 =32.I
I2 4 3 3.4 3.9 3.8
36
LY; =126
i=l
Sums of all observations for a given factor Ievel are shown in the table. For
example
3 3
AI= LLYiju = 3.6 + 3.8 + 4.3 + 3.0 + 3.2 + 3.4 + 2.8 + 2.6 + 3.3 = 30.0,
j;I u;I
4 3
B2 = LLYi2u = 3.0 + 3.2 + 3.4 + 3.2 + 2.8 + 3.3 + 3.7 + 3.8 + 3.6 + 3.3 +
i;I u;I
The values of A1 are given in Table 2.6, while ~ =45.0 and~ =40.2 are
computed in a similar way.
28 CHAPTER2
1 ( LYI ) =-=441.
36 2 2
K=- 126
N 1=1 36
In order to find the total sum of squares one first computes the following sum:
36 12 3
LY!
1=1
= LLY~ = 446.56.
i=1 j=1
V= N- 1 = 36- 1 = 35.
V8 = kb -1 = 3-1 = 2.
QAB can be found using (2.21) where the sums A;Bi correspond to each
combination of the Ievels of A and B. For example,
STATISTICAL METHODS FOR DATA ANALYSIS 29
In Table 2.6 each combination of factor levels appears only once and
1j 1 = r12 = ... = rka.kb = 1. The number of the repeated observations for each combination is
r = 3 . Therefore, the number of experiments for each factor level combination is
rAB = 3 x 1 = 3. Using (2.21) we obtain
while
s~ = QA = 0 · 38 =0.1267,
VA 3
s~ = Q8 = 1· 14 = 0.5700,
V8 2
s~ = QR = 1·54 = 0.0642.
VR 24
30 CHAPTER2
F = s~ = 0.1267 = 1. 97
A s~ 0.0642 '
F
8
=s;s~ =~=8.88
0.0642 '
TABLE 2..
7 ANOVA table fior water treatment exampJe
Source Sum of squares Degreesof Variance F- ratio
freedom
A 0.3800 3 0.1267 1.97
B 1.1400 2 0.5700 8.88
AB 2.5000 6 0.4167 6.49
Residual 1.5733 24 0.0642
Total 5.5600 35
The Fisher distribution critical values are taken from F-distribution tables using
significance Ievel a = 0. 05:
So far we considered analysis of variance based on data obtained from an experiment but
we did not explain how the data were collected. The design of experiments is a very
important problern because:
• If the experiment is not properly prepared the data may not contain sufficient
information for making conclusions,
• The number of experimental runs could be too large and the obtained
information not precise,
• A proper set of experiments would enable the use of simple and standardized
data processing procedures.
There are also other benefits that will be explained later, when discussing various
procedures for design of experiments.
A design of experiments is a table (matrix) which defines the values of factor
Ievels in a set of elementary experiments (runs). It is usually prepared in advance except
the so-called sequential procedures for design of experiments in which the conditions for
each run are selected upon the results of the preceding experiments.
There are many different types of designs of experiments which are appropriate
for different conditions of experimentation and data processing schemes. We start with
some simple designs that are used to collect data for analysis of variance.
If during the experiment all possible combinations of the factor Ievels are fulfilled
this design is called Juli jactorial design. If only a fraction of the full factorial design is
conducted then we have ajractionaljactorial design.
The number of all possible factor Ievel combinations in a full factorial design is
N = k1k 2 ••• km, where k; is the number ofthe i-th factor's Ievels, while m is the number of
factors. With the increase of m the number of runs rapidly increases. For example, if
m=6 and k 1 =k2 =... =km=10, the number of runs is N=10 6 runs. This is an
inconceivably large number of experiments. That is why fractional designs are very often
used instead of full factorial ones. The problern is how to choose the fraction.
Typical designs used with the ANOVA procedure are considered in the next
section.
In these designs all Ievels of a factor are combined with the Ievels of other factors in a
completely random manner. This is herewith demonstrated by the following example.
32 CHAPTER2
Example 2.3.
Let y(%) is the yield of a substance from a chemical reaction. Suppose it depends
on three factors: Type oj catalyst with Ievels A, B, C and D; Temperature with Ievels I
(100°C), II (120°C), III (140°C) and IV (160°C); Pressure with Ievels 1, 2, 3 and 4
which correspond to 1, 1.2, 1.4 and 1.6 atm.
To study the effect of the catalyst on the yield, one can use a completely
randomized design. It can be constructed by casting lots or by use of random numbers.
Suchdesign is shown in Table 2.8.
The values of the yield y are given in the brackets in percentages. One way
ANOVA scheme is used to analyze the data (see subsection 2.1.2). The results are given
in Table 2.9.
The F-ratio is
F = 844.67 = 22.63.
A 37.33
The critical point of the F-distribution for significance Ievel of a =0.05 and
vA = 3 and vR = 12 degrees offreedom is FrA= F(a,v A•vR)= 3.49. The conclusion is
that the type of catalyst has a significant effect on the yield because FA > FrA .
•
2.2.3. RANDOMIZED BLOCK DESIGNS
The design considered in subsection 2.2.2 does not provide homogeneaus conditions for
all types of catalyst. The yield depends al~o on the temperature but the Ievels of this
STATISTICAL METHODS FOR DATA ANALYSIS 33
factor are not uniformly arranged trough all experiments. For instance catalyst C has
been combined with temperature II two times but not a single time with temperatures III
and IV, as weil as catalyst D has been combined twice with II and not with I, etc. To
avoid such disarrangement certain restrictions on the randomization become necessary.
Such restriction can be a rule allowing for each temperature only one experiment to be
conducted with each catalyst. Such design is shown in Table 2.1 0. It is called
randomized block design. The four blocks in Table 2.10 correspond to each of the
temperature Ievels. Experiments are randomized within the blocks.
Example 2.4.
Two-way ANOV A is applied to the data in Table 2.10 in order to study the
effects of two factors: type of catalyst and temperature. Interactions between factors, if
any, are not taken into account. Using the earlier discussed method (subsection 2.1.3)
results shown in Table 2.11 are obtained.
It is worth mentioning that the residual sum of squares, which is equal to 448 in
Table 2.9, is now divided into two parts. One accounts for the variation due to the factor
temperature and is equal to 386 and the other- for all other noise effects and is equal to
62.
Compute two F-ratios:
In the randornized block design the Ievels of the third factor (the pressure) are still non-
uniformly distributed amongst the experiments. For example, in Table 2.10 the catalyst C
is combined twice with each of pressures 1 and 2 and never with 3 and 4. Other sirnilar
examples are also shown in Table 2.1 0. To avoid this another restriction on the
randornization can be introduced imposing the rule that experiments using each catalyst
at each temperature for each Ievel of the pressure will be carried out. Such design is
called Latin square. Table 2.12 illustrates it as being applied to the chernical reaction
example.
. ~uare or stud"
T ABLE 2.12. L atm . 1d
1ymg t he catalJyst effiect on t he Yie
Pressure Temperature
I II III IV
1 B(55) C(72) D(59) A(82)
2 C(65) D(68) A(78) B(45)
3 A(87) B(51) C(56) D(60)
4 D(71) A(85) B(39) C(59)
AB and BA
BA AB.
One of these Latin squares can be chosen randomly using a simple procedure. It
is demonstrated for the Latin square ofTable 2.12 with k = 4.
STATISTICAL METHODS FOR DATA ANALYSIS 35
1. Construct a standard Latin Square. In the first row put the Latin letters in
alphabetical order:
ABCD
BCDA
CDAB
DABC
Each successive row caQ be obtained from its predecessor by shifting the first
Ietter to the last posit1on in the row and all other letters - one position left.
2. Two random sequences of integer numbers from 1 to k are formed for
example, using tables of random numbers or by casting lots. For k=4, the following
random sequences were chosen:
2,3,4,1
1,2,4,3.
3. The order of columns in the standard Latin square is changed according to the
first random sequence. For our example first take the second column, followed by the
third, the fourth and then the first:
BCDA
CDAB
DABC
ABCD
4. The order ofthe rows in the obtained Latin square is changed according to the
second random sequence as shown in Table 2.12.
Examp1e 2.5.
The results of the three-way ANOVA are shown in Table 2.14.
Latin squares are three factor designs. Similar designs for four factors are called Graeko-
Latin squares. They can be obtained by superimposing two orthogonal Latin squares.
Two Latin squares are orthogonal if after their superimposing each pair of letters is
written only once in the new square. Usually Latin letters are used in one of the squares
and Greek letters in the other.
Consider for instance two 3 x 3 orthogonal Latin squares shown below:
ABC aßy
BC A yaß
C AB ßra
In a Graeko-Latin square each Greek or Latin Ietter is written only once in each
row and column. Each combination of Greek and Latin letters appears only once in the
square. Fisher and Yates (1963) give tables oforthogonal Latin squares that can be used
to construct Graeko-Latin squares. In some cases construction of Graeko-Latin squares
is impossible. For example, 6 x 6 orthogonal Latin squares do not exist.
The data obtained through a Graeko-Latin square can be analyzed by multi- way
ANOVA procedure. If the number of factors is greater than four then three or more
Latin squares must be superimposed. As a result a design of experiments called hyper
Graeko-Latin square is obtained. This is shown by an example.
A B c
1 2 3 4 5 1 2 3 4 5 1 2 3 4 5
2 3 4 5 1 4 5 1 2 3 3 4 5 1 2
3 4 5 1 2 2 3 4 5 1 5 1 2 3 4
4 5 12 3 5 1 2 3 4 2 3 4 5 1
5 1 2 3 4 3 4 5 1 2 4 5 1 2 3
. Choosing the Ievels of the factor BaO according to the square A, these of the
factor Mg() - according to B and the Ievels of the temperature T - according to C we
obtain a hyper Graeko-Latin square which is shown in Table 2.17. The Ievels of the
factor FeO are denoted by Roman numbers: I, II, III, IV and V and the Ievels of the
factor MnO- by Arabic numbers: 1, 2, 3, 4, 5. The logarithmic values ofthe viscosity are
given in the brackets. lt was shown that their distribution is approximately normal.
I sIags expenments
TABLE 2.I8. AN0 VA tabie fior earty
Source Surnof Degreesof Variance F-ratio
squares freedom
FeO 4.582 4 1.1456 25.3I5
MnO 2.547 4 0.6367 I4.072
BaO O.I64 4 0.0410 0.906
MgO 0.425 4 O,I062 2.348
T 1.442 4 0:3605 7.967
Residual O.I8I 4 0.04525 -
Total 9.34I 24 - -
•
2.2.6. ÖTHER DESIGNS
Incomp/ete randomized b/ock designs can be obtained ftom a randomized block design
by omitting some of the experiments. They are called balanced if each pair of elements
appears same number of times. Table 2.19 shows a design with two factors where the
first ofthem has Ievels I, II, III, IV and the second one- 1, 2, 3, 4. Combinations ofthe
factor Ievels for which experiments are conducted appear as asterisks (*) and the ones
with no observations- as dashes (-).
x-(x
- I
This equation is usually called regression model. The aim of the regression
analysis is to find the regression model in an explicit form.
x' 2
X' IM x' 1
Factors x;,i= 1,2, ... ,1 are measured in some physical scales which are often
different for each of them. During the experiments the factors are changed within some
intervals
(2.26)
0) 0)
I
----+-~----1-------+-----,>• x'
x'
lffiUI
x'iO x' I
is the basic Ievel of the factor x; . Denote the half-interval of factors' variation by
ST ATISTICAL METHODS FOR DAT A ANALYSIS 41
So called coded jactors x; will be often used. They are dimensionless and vary in
the same intervals:
-1 ~ x, ~ 1, i = 1, 2, ... , I . (2.27)
They can be obtained from (2.26) using the following formula for coding the
factors:
(2.28)
To find optimal parameter values one needs the function 17(x1 ,x2 , ... ,xJ Usually it is
unknown and must be obtained through experiments. We do not know which are the best
values of the parameters and that is why we start with an arbitrarily chosen operating
point x 0 . Taking into account that 17 is a continuous function we can describe it locally
araund the point x 0 by a Taylor series expansion as follows:
I o 217(x)
+~ - xJ 0 )+
1-I I
~-
~ ~20 0 I {x - x 0 'Ii1\.x
X=Xo I I J
i=I J=i+I X; Xi
I 1-1 I I
71(x)= ßo + "f.ß;x; + L "f.ß;JX;X1 + "f.ßiiX; 2 + ... (2.29)
i=l
where /; = ./;(x) are known functions of the factors not comprising unknown
coefficients.
One can see that (2.29) is a special case of (2.30) with /; defined as follows:
and coefficients
(2.31)
i=l
44 CHAPTER2
. analJVSlS
TABLE221 E~xpenmenta!data fior regress1on .
Design matrix Vector of observations
No XI x2 ... XI y
1 Xn x2I ... XII YI
2 XI2 X22 ... xl2 y2
... ... ... ... ... ...
u xlu x2u ... X Iu Yu
... ... ... ... ... ...
N XIN x2N ... XIN YN
ll
Usually equation (2.31) is called regression model. We also introduce an
extended design matrix, called also matrix of regressors:
... f2k
... ftk = [r~:f2
!Nk f~
Each row of this matrix contains values of the functions ft , / 2 , ••• , fk for a given
experimental run. The predicted response vector y = {Y1 y2 ... yN Ycan be written
as follows:
y=FB. (2.32)
E(y)= 77,
2. The factors x;.,i = 1,2, .. . ,I; u = 1,2, ... , N are measured without errors durlog
the experiment.
3. Fis non-random matrix offull rank (rank F = k).
4. There are no constraints on the possible values of the model parameter
estimates.
QR=:L(y.-ß.Y. (2.33)
u=l
Model coefficients must be chosen to minimize QR . For this purpose the first
derivatives of QR are put equal to zero and the following system of linear equations is
obtained:
46 CHAPTER2
(2.34)
N
~ "" 2
~k...J;.
•=I 11=1 u=l •=I
~ N N
(2.35)
where
N
gii = "f..J;;,
u=l
Ku ... glkl
[ ... g2k
G= ~-2·1
gkl gkk
Often another form of (2.38) is used which is based on the matrix F defined in
2.3.3. By direct computations one can see that G = FrF and rp= Fr y. With these
notations the normal equations can be rewritten in the form
(2.39)
(2.40)
Under the assumptions of Section 2.3.4 the estimates (} have the following
properties:
1. They are unbiased: E(B)= (}. That means that the method of estimation does
not introduce systematic error in the estimates.
2. The estimates (} are consistent. That means that they tend to the true values of
parameters (} when the number of observations N tends to infinity.
3. The estimates ~ are efficient which means that their variance is not larger than
the variance of any other estimate Y; :
The proofs ofthese properties are given in Plackett (1960), Rao (1973), Seher (1977).
The variances and covariances of the estimates ~ are elements of so called
I
covariance matrix
o- 2 (8
cov(~,oJ
1)
cov(~P~klj
v(o )= cov(B B o- \B
2, 1)
2
2) cov(02 , (}k
(2.41)
Consequently, the variances and covariances ofthe regression model coefficients are
(2.42)
and
(2.43)
STATISTICAL .METHODS FOR DATA ANALYSIS 49
where c;; and cif are diagonal and off-diagonal elements of C, correspondingly. As the
estimate () in (2.40) is a linear transformation of the normally distributed vector y it has
normal distribution with mean () and covariance matrix ca;.
The variance of the predicted response is
(2.44)
Example 2. 7.
Let us consider an example based on simulated data. Suppose we want to study
the concentration y (%) of a substance in a chemical reactor as a function of two
parameters: reaction time, t (hours) and temperature r eq. During the experiment
these factors vary within the intervals
I h. :::; t:::; 6 h.,
70°C:::; r:::;I45°C.
An optimal composite design (see Chapter 3) is used and the data are shown in Table
2.22.
One can work with the original factors but the normal equations are much simpler
if coded factors are used. The values of the coded factors can be obtained using formula
(2.28). For the first factor x{0 = 10 = 3. 5 h and w 1= 2.5 h and for the second factor
x~ 0 = r0 = I 07. 5 oc and w2 = 3 7. 5 oc. The coded values of the factors are computed by
the formulae
50 CHAPTER2
X1 = (t- 3.5)/2.5
and
x2 = (r -107.5)/37.5.
Let us estimate the coefficients ofthe following second order polynomial model:
1
-1 -1
-1 -1
-1 -1 1
F= 0 0 0 0 0
0 1 0 0 1
1 0 0 0
0 -1 0 0 1
-1 0 0 0
To obtain the transposed matrix we exchange the rows and the columns ofF:
1 1 1
-1 -1 0 0 0 -1
-1 -1 0 0 -1 0
Fr=
-1 -1 0 0 0 0 0
0 0 1 0 1
0 1 0 0
9 0 0 0 6 6
0 6 0 0 0 0
0 0 6 0 0 0
G =FrF =
0 0 0 4 0 0
6 0 0 0 6 4
6 0 0 0 4 6
Multiplying Fr by the vector y which is given in the last column of Table 2.22 we obtain
the right hand side ofthe normal equations:
Compute the inverse of FrF. We need it for both parameter estimation and
statistical analysis. It is easy to calculate
1 1 1
BI =-535--348--364= 59.89
A
1.8 3 3 '
A 1
82 = -16 = 2.67
6
where
-ry; =11.041,
r r
Qe = L(yua- YJ 2
= LY~a
i=l •=I
Ya = ~ LYua = 60.37,
i=l
STATISTICAL METHODS FOR DATA ANALYSIS 53
v. = r -1 = 10- 1 = 9
and
s 2 = Q. = 11. 041 = 1 22
• v. 9 . .
Using (2.42) and (2.43) compute the variances and covariances of the regression
coefficients as follows:
2 (' ) 1 0.677,
S \{)I =-X 1.22 =
1.8
S \{)2 = S 2f')
2f') 1
\{)3 =-X 1.22 = 0.203,
6
2f' ) =-X
S \{)4
1 1.22 = 0.305,
4
\()5 = s 2f')
s 2f') \()6 =-1 x 1.22 = 0.61,
2
As shown in subsection 2.3.5 the estimates B; are normally distributed with mean value
B; and variance c;; 0:. Therefore, the statistics
54 CHAPTER2
where tr = t( a I 2, ve) is the critical point of Student's distribution for significance Ievel
a and vedegrees offreedom.
Some software packages provide significance tests based on individual
confidence intervals. The null hypothesis is H0 :B; = 0. Therefore, if
~ is insignificant.
This test is not quite reliable, because it does not take into account the
covariances among regression coefficients' estimates. A better method for model
structure selection is the method of best possible regression which is also available in
most statistical software packages.
Simultaneaus confidence intervals can also be found. As shown in many books on
regression analysis (Draper and Smith (1981), Seher (1977), Myers (1990)) a joint
confidence region is defined by the inequality
where tr = t(a I 2, vs} is the critical point of Student's distribution with ve degrees of
freedom. An estimate of the standard deviation of the predicted response is obtained
from (2.44) as follows:
STATISTICAL METHODS FOR DATA ANALYSIS 55
where the elements of f are computed for point x of the factor space.
Note that the confidence interval on the mean response depends on the
experimental design through the matrix F.
Example 2.8.
Compute the individual confidence interval on B2 from Example 2.7. We found
s;
that the least squares estimate of B2 is 02 =2.67 and that = 1.22 with vs = 9 degrees
of freedom. The critical point of Student's distribution for a = 0.05 and vs = 9 is
tT = t(0.025,9) = 2.262. The second diagonal element ofC is c22 = 1/6. Therefore
or
Let us also compute the confidence interval of the mean value for time 6 hours
and temperature 107.5 °C. In terms of coded variables x2 =0 and the vector f for this
combination of factors is f =(1 1 0 0 1 0Y. Putting x 1 =1 and x2 = 0 in the
regression model, we obtain y =59.89 + 2.67-4.33 =58.23. The standard deviation for
the predicted response is
For its computation we used {FTFt from Example 2.7. Hence the confidence interval
on the mean value is
"
y,y
X
Figure 2.5. Fitting linear and quadratic models to data
to fit the rnodel to these data then the accuracy is unsatisfactory and the residual surn of
squares QR is too !arge.
U sing a second order polynomial
would be rnuch better in this case and the residuals Yu - y~2 ) would be smaller.
Statistical procedures based on analysis of variance are used for testing whether
the rnodel is well chosen. They are called testsjor Iack ojfit.
STATISTICAL .METHODS FOR DATA ANALYSIS 57
where
I N
y= NLYu·
u::::;J
As shown in Appendix A.2.4 the total sum of squares can be presented as follows:
(2.46)
where
N
QM=L(Y.-.YY (2.47)
u=1
(2.48)
u=l
which shows the variation not explained by the model. lt is called residual sum of
squares.
These formulae can also be presented as follows:
QM = LY~-K (2.49)
u=l
and
58 CHAPTER2
N N
QR =LY= -LY= .
u=l u=l
(2.50)
They are convenient for computations but are more sensitive to computational
errors than (2.47) and (2.48). The reason is that both terms in the right band side of
(2.49) and (2.50) usually differ only in the last digits. We demoostrate this by example
2.10.
The residual sum of squares takes into account the random variation and the
variation due to significant functions .h which are not included in the regression.
Therefore, if the model structure is well chosen then QR will explain only the random
error.
The degrees offreedom for Q, QM and QR are
v=N -1,
VM=k-1,
where k is the number of regression coefficients. It takes into account k linear links
between the observations that appear in QM and QR through .Y•. The predicted response
t
.Y. is a function of k estirnates 0 =(Fr F Fr y which are linear transformations of the
Observations y.
It is easy to check that the following equation is true:
The lack offit test is based on comparison between the following mean squares:
S2 _ QM (2.51)
M-
VM
(2.52)
which takes into account the variation not explained by the regression.
These results are presented in Table 2.24.
STATISTICAL METHODS FOR DATA ANALYSIS 59
=:Ly;- :L.Y;
u-1 u-1
N
(
r )2
I r I r LYua
s&2 =-""' - )2 =-
I L... (y ua - Ya ""'
L...Yua2 - ->..:•:....:.=I_____"_
, (2.53)
r- •=I r - 1 •=I r
60 CHAPTER2
- 1 r
Ya =- LYua·
r u=!
(2.54)
J
2. Find the critical value of F-distribution Fr = F(a, v R, v for a given Ievel of
significance a and vR = N- k, vE = r -l.
3. Compare F and F., and make one of the following conclusions:
• If F ~ Fr the model fits the data weil. In this case s; and s; do not differ too
much and the model structure is correctly chosen.
• If F > Fr the difference between s; and s; can not be explained only by the
random character ofthe observations and the model does not fit the data weil enough. In
this case another model should be used, for instance higher order polynomial.
This procedure is not often used because it needs additional independent
observations. When there are replicated runs in a design of experiments the model
adequacy can be tested without additional independent observations. Suppose that the
design has h different points and there are r; observations at each point, i = l,2, ... ,h.
The residual sum of squares can be resolved into two parts:
(2.55)
where
h
QL = L:ri{Yi- Y;Y
j::::}
is the pure error sum of squares, and Y; is the arithmetic mean of the observations in i-th
design point.
The corresponding degrees of freedom are
STATISTICAL METHODS FOR DATA ANALYSIS 61
h
VR =N-k= ~); -k,
i=l
Vc = ~:>; -h,
i=l
and
Q sz
s2&
=-" and F =___b_2 .
v" s"
(2.56)
If the model fits accurately the data and there are not random errors then QR = 0,
Q = QM and R = 1. If the model has nothing in common with the data then QM = 0,
Q = QR and R = 0. Consequently, the value of R is within the interval 0 :<=:: R :<=:: 1.
In reality R rarely obtains its Iimit values 0 and 1. It is desirable the values of R to
be near to 1 but this is not sufficient to judge if the model fits the data weil. Even high
values ofthe multiple correlation coefficient may be insignificant. To test the significance
of R the following F-ratio has tobe computed
(2.57)
62 CHAPTER2
(2.58)
• If F !5: Fr, then R is insignificant. Therefore, the model can not explain the
variation ofthe observations around the mean value and needs improvements.
• If F > Fr the multiple correlation coefficient is significant and the regression fits
the data well.
Insignificance of R means that the model structure is wrong or important factors
are not included in it.
Box and Wetz (1973) suggest the F-ratio computed by (2.58) to be chosen at
least four times the critical value Fr . Box and Draper ( 1987) propose even more strong
inequality: F> 10F;..
As shown in Chapter 5 the quality of the regression model is vital for quality
improvement problems. That is why the Iack offit test should be conducted carefully.
Example 2.9.
Consider two multiple correlation coefficients:
• R = 0.95 for a model with k=3 coefficients which have been estimated on the
basis of N = 5 Observations.
• R = 0. 5 for a model with k = 9 coefficients and N = 109.
Let us test the significance of these multiple correlation coefficients using the
algorithm given above. For the first one we compute
STATISTICAL METHODS FOR DATA ANALYSIS 63
In this case the critical value ofF-distribution for a = 0.05, vM = 8 and vR = 100
is FT = 2. 03. As F > FT the multiple correlation coefficient R = 0. 5 is significant in spite
ofthe fact that it is much smaller than 0.95.
•
Example 2.10. Statistkaianalysis of a regression model
Consider the statistical analysis ofthe model obtained in Section 2.3.5. We use
the data ofTable 2.22.
Let us start with the analysis of variance. First compute the correction K:
(
LYu
N )2
K= u=I = _!_(67+56+ ... +60+53Y =_!_535 2 =31802.78.
N 9 9
For the computation of QR and QM we have to find the predicted responses for
the design points. Forthis purpose we put the values of xi and x2 from Table 2.21 in the
model which was obtained in Section 2.3.5:
For instance for u=2 we see from the second row ofTable 2.21 that xi = 1 and x 2 = -1.
Putting them into the model we obtain
64 CHAPTER2
N N
QR = LY:- LY: = 32005- 32007.465 =- 2.465,
u=l u=l
LY: -K = 32007.465-31802.78
N
QM = = 204.685.
u=l
It is surprising that QR has a negative value. This can only be a result ofrounding error.
For comparison we compute the same sums using formulae (2.47) and (2.48).
First we calculate
1 N 535
.Y=-LY.
N u=l
=-=59.44.
9
QM = L{Yu-
u=l
YY = 198.744
and
N
One can see that the accumulation of errors has made substantial difference in the
computation of QM and QR by formulae (2.48) and (2.49).
The total sum of squares is
N
Q = L(y·- .YY = 202.19.
u=l
V= N -1 = 9-1 = 8,
VM = k -1 = 6- 1 = 5,
s~ = QM = 198.74 = 39.748,
VM 5
s~ = QR = 3.45 = 1.15.
VR 3
F= s~2 = 39.75 = 34 . 56 .
SR 1.15
hypothesis H 0 : ciR = 0:. Compute the F -ratio putting in the numerator the !arger
estimate, s;
= 1. 22
F= s;
2
= 1. 22 = 1. 06 .
SR 1.15
The critical value ofthe F-ratio for a = 0.05, V 8 = 9, vR = 6 is Fr= 8.81. As F <Fr the
null hypothesis can not be rejected. We conclude that the model fits the data weil
enough.
•
2.3.8. STEPWISE REGRESSION AND ALL POSSffiLE REGRESSIONS
One of the complicated problems with regression analysis is the choice of model. Draper
and Smith (1981) show that a model with nuisance coefficients increases the variance of
the estimates. If significant functions are omitted in the regression it becomes inadequate
and the multiple correlation coefficient is insignificant. If the number of observations is
not very high the deletion of the insignificant effects from the model will increase the
number of the degrees of freedom which is always desirable.
In the beginning of the research program the experimenter usually has a vague
idea ( or has no idea) about the model structure. That is why a set of functions J; which
are supposed to fit the data weil, can be chosen. For example, ifthere are two factors x1
and x2 and the experimenter supposes that the model might contain first, second and
third order effects he/she can use all the functions in a third order polynomial as follows:
(2.59)
Extending this equation with d new terms we obtain the following alternative model:
(2.60)
STATISTICAL METHODS FOR DATA ANALYSIS 67
where k2 = k1 +d.
The residual sums of squares corresponding to these models are
Ql = L(y·- )\.Y
u=l
and
N
Q2 = L(y·- Ji2.Y
u=l
Comparing it with the residual mean square for the second model
s2 _ _ß_
2 - N-k2
Using (2.56) one can express FP via the multiple correlation coefficients R,2 and R; for
models (2.59) and (2.60), correspondingly, as follows:
Radidoes not always increase with the growth of the number of terms in the regression.
Another criterion for the choice of a model is the difference between F and F;. .
Usually the model for which F- F;. is largest is preferred.
Many statistical packages compute also the so-called prediction error sum of
squares (PRESS), proposed by Allen (1971,1974). To calculate PRESS statistic one
observation (say yJ is deleted and a regression model is fitted on the basis of the
remairring N -1 observations. Aso called prediction error (or PRESS residual) for the u-
th point i(u) = Yu - Y(u) is calculated where Y(u) is the predicted value for the u-th
STATISTICAL METHODS FOR DATA ANALYSIS 69
observation computed on the basis of the regression model. The sum of squares of the
prediction errors 8(. l for u = 1, 2, ... , N is called PRESS-statistic:
N
PRESS= LB(:).
u=l
PRESS=L ~
N ( A )2 ,
•=I 1-m••
t
where m•• = r: (FrF r., and &. are the ordinary residuals: &. = Y. - y•. Therefore, the
PRESS-statistic can be computed using a single regression model.
The PRESS-residual shows how influential the observation is. An observation
with great difference between &. and 8(•) has a great influence on the regression. The
prediction is more precise when PRESS-statistic is smaller.
Efficient procedures have also been developed for computing all possible
regressions and they are available in many statistical packages. They fit models consisting
of all possible combinations of regressors. More on the computational procedures is
written by Kleinbaum et al. (1988), Draper and Smith (1981), Seber (1977). The
selection of the final model can be based on the same criteria as in stepwise regression.
It should be noted that there might be more than one regression models from
which to choose. The regressors are often correlated. That is why several models with
different structures may be almost equally good.
Example 2.11.
The heat generation y eq of a breaker formulation for truck tyres .is studied. It
depends on four factors: x 1 - soot, x 2 - oil, x 3 - accelerator and x 4 - sulphur. All amounts
are measured in weight parts (w.p.). The correspondence between coded and natural
values ofthe factors is shown in Table 2.26.
W e assume that a second order polynomial model contains the full set of
necessary functions:
Step 9.
Mean value of the response 25.65
Standarddeviation ofthe response 4.44
Standard deviation of the residua1s 1.5561
Multiple correlation coefficient 0.9671
STATISTICAL METHODS FOR DATA ANALYSIS 71
F-ratio 16.08
Constant term b0 31.19
Standard deviation of b0 1.127
The regressors in the equation are given in Table 2.28, while the regressors out of
the equation are shown in Table 2.29.
TABLE 2.29. Stepwise regression: regressors out ofthe equation for step 9
Table 2.30 shows the analysis ofvariance for the equation obtained at step 9.
This kind of information has been obtained at each of the 12 steps. Finally the
equation at step 9 with k = 10 coefficients was chosen. As we already noticed the
72 CHAPTER2
Table 2.31 shows the values of multiple correlation coefficient R and F-ratio for
all 12 steps of the procedure. Though R increases when new regressors are included in
the model this does not mean that a model with maximal number of regressors is the
best. The difference between the computed and the critical values of the F-ratio is
greatest and the adjusted multiple correlation coefficient is highest for step 9.
So far we considered a model with coded factors. Models with factors presented
in their natural measurement scales can also be estimated. In this case the coded factors
in Table 2.27 are replaced by the corresponding natural values and stepwise regression is
estimated. The following regression model is obtained at step 9:
The multiple correlation coefficient for this model is R = 0.9076 and the
corresponding F -ratio is F = 5.19 . Therefore, the model with coded factors provides
more precise prediction than the model with factors in natural measurement scales. They
differ in structure because the correlation between factors depends on their measurement
scales.
There are several advantages in using models with coded factors rather than those
•
with natural ones. When the factors are written in natural measurement scales their
magnitudes can differ too much and this usually causes numerical problems in parameter
estimation procedures. The model coefficients depend on the measuring scales and they
can not be used for comparing the effects of factors. The rounding errors in the
coefficients can strongly affect the prediction especially if the absolute values of the
factors are !arge.
Introduc/ion
In this section we consider some simple graphical tools for analysis of residuals. They are
useful for testing the assumptions of linear regression analysis listed in subsection 2.3.4
and for model structure selection.
Given N observations
(2.61)
The predicted response Yu is an estimate for 1Ju (x 1", X 2" , ... ,x1J. Putting it in
(2.61) we obtain the residuals
elements m•• ofthe matrix M = F(FrFtFr are called leverages and can be computed
as follows:
T{
m•• =fu \F F
T )-I
r. =a 2 \.Yu
{ :. ) 2
Ia&,
Residualplots
The residuals can be plotted against:
• predicted responses .Y.,
• factors X;,
• time
These plots are helpful in revealing violations of the regression analysis
assumptions, detecting outliers, time trends, and Iack offit.
The so-called unit normal deviates E:. I sR are often used, where s~ is the residual
variance. It is known that any normally distributed random variable & falls within the
intervals:
1\
Eu
SR · outher
3
2
1
o o I o
. ·.
....
.
• I • I
-1 . .
0 0 0 I I 0
1
•• f •
-2
-3
The existence of a linear or curvilinear tendency in the residual plots shows that
some important linear or curvilinear terms are rnissing in the model or are incorrectly
calculated. For example, Figure 2.7a shows that a second order term is missing. lf the
residuals are plotted against factor xj the conclusion from Figure 2. 7a is that the term
()jjxj2 has to be included in the model. The plot on Figure 2. 7b indicates that some of the
linear terms are missing or rniscalculated.
3 3
. ·.: ..··
...
... . . .- .
.: . . -: .
-3 -3
a) b)
~u ~
SR SR
3 3
2
1
.
. . .. . . -1
Y-'iu) -2
Y-'iu)
-3 -3
c) d)
Figure 2. 7. Residualplots (a) second order term is missing. (b) linear term is missing.
(c) response error variance depends either on the predicted response or on the factor xi Yu
(d) violations ofthe assumptions or inadequate model structure not observed.
76 CHAPTER2
The pattern shown in Figure 7c is observed when the variance of & depends on
the response .Y. or the factor X;. In this case observations are heteroscedastic and a
better model can be obtained by using transformations or refitting data by weighted least
squares method (see subsections 2.3.10 and 2.3.11).
If the regression analysis assumptions listed in subsection 2.3.4 arenot violated
the residual plot Iooks as in Figure 2.7d. The points are concentrated around the
horizontal axis and their density decreases with the distance from it.
. . . ..
. . ·:·:·: .
time time
a) b)
Figure 2.8. Residuals arranged by the time ofthe Observations: (a) linear trend
(b) negative correlation
Interesting conclusions can be drawn from residual plots with time on the
horizontal axis. In this case the residuals are ordered according to the time of conducting
experiments. For instance the plot in Figure 2.8a shows that there isalinear time trend,
while Figure 2.8b indicates a negative correlation ofthe Observations.
Often the so called standardized residuals are used instead of the unit normal
deviates &. I sR" They are defined as follows:
Example 2.12.
Figure 2.9 shows a plot ofthe standardized residuals for the breaker formulation
data of example 2.10 obtained with MINITAB software. Only two of the standardized
residuals are out of the interval ( -2, 2) and are very close to its borders. There are not
indications for heterogeneity of variance, outliers or model inadequacies.
2 0
0 0
0
0 0
e'u 0 00 0
0
0 oOO
-1 0 0
0 0
-2
0
15 2i/\
y
•
Normaland half-normal plots
The assumption about normality of a random variable can be tested using the so called
normal or half-normal plots.
Consider a random variable Y. The probability of observing Y smaller or equal to
a giveny is called cumulative density .function (c.d.f):
• F(y) tends to zero for infinitely small values of y and to one for infinitely !arge
values ofy.
• F(y) is non-decreasing function.
Assurne that y is normally distributed random variable with expectation 17 and
variance er. The following transformed variable
y-17
U=--
0'
is called normal score. It has zero expectation and unit variance and its distribution is
called unit normal distribution. Figure 2.10a shows the c.d.f ofunit normal distribution.
F(u) or F(y)
0.9
0.8
0.5
0.2
0.1
0.01
-3 -2 -1 0 2 3 u -3 -2 -1 0 2 3 u
TJ-cr Tl T]+cr
a) b)
Figure 2.10. Cumulative density function ofunit normal distribution. (a) unifonn scale on the axes (b)
nonnal probability paper
The scale on the vertical axis can be chosen so that c.d.f of the unit normal
distribution is a straight line as shown in Figure 2.10b. This way the so called normal
probability paper is obtained. The cumulative distribution function F(y) has the same
form as F(u) but the horizontal axis is rescaled replacing u by y = 17 + ua.
Suppose we have N Observations yPy2, ... ,yN. Fitting a regression we can
compute the residuals e;
=Y; - Y; and order them according to their magnitude, say
Cl~ C2 ~ ... ~ CN.
David (1970, pp. 64-67 and 161-163) shows that if e; has approximately unit
normal distribution, then
STATISTICAL METHODS FOR DATA ANALYSIS 79
Therefore, the value (i- 0.5)/ N can be considered as an estimate of the probability
P(&; <c;).
Two types of normal probabi/ity p/ots are used:
i)Points with coordinates [F(e;1s;] are plottedonnormal probability paper.
u;
ii) Using the unit normaldistributionanormal score is found so that
(
Pus;.u; ) = i-3/8 or P\us;.u;
{ ) i-05
=--·-.
N+1/4 N
In order to obtain a normal probability plot the residuals B; are plotted against u;.
If the residuals are normally distributed, the points roughly lie on a straight line. The
outliers fall far from the straight line in the lower left and the upper right comers of the
plot. For non-normally distributed random variables, great deviations from the straight
line occur not only in the comers.
The points are not exactly on the straight line because of the random fluctuations.
Another reason for departures from straight line is that the normal plots are applicable
for uncorrelated observations, while the residuals are correlated as we mentioned in this
subsection. Nevertheless the normal plots of the residuals are useful for qualitative
conclusions about normality of observations and outliers.
One can see from Figure 2.1 Oa that F(- u) is a mirror image of F(u) and
F(O)= 0.5. Therefore, if c.d.f. is plotted against the absolute values of u then
F~ul)~ 0.5. In case of zero mean distributions, the normal probability paper can be
rescaled so that the values of c.d.f. are larger than 0.5. Arranging the absolute values of
the residuals in order of their magnitude, computing
and putting F~e; I) against I&; I on normal probability paper, we obtain the so called half-
normal probability plots ofthe residuals. They can be interpreted as the normal plots.
Example 2.13.
Using MINITAB two types of normal plots of the standardized residuals c' are
obtained for the breaker formulation data of example 2.10. They are shown in Figure
2.11. As the standardized residuals roughly lie on a straight line we conclude that they
are normally distributed. This confirms our conclusions obtained on the basis of residual
plots in example 2.11.
80 CHAPTER2
Probabilitf---.,....---.---......- -_,....--.....--. I I
I I
I I I I
01 ----- --------
' I I I
001 ----- --------- ~-- ------- ~--- -------:--------- -~ ---
I I I I
-2 -1 0
Stand. residuals
a)
0
2-
0 0
1- 00
oO
~>' 00
u 0 ooo
oOO
-1 oo
0 0
-2
0
-2 -1 0
b)
Figure 2.11. Normalprobability plots for the breaker formulation data (a) standardized residuals against
normal scores (b) probability against standardized residuals
•
2.3.10. TRANSFORMATIONS OF VARIABLES
The initially chosen metrics of variables may not be the most suitable for fitting a
polynomial equation to data. Sometimes non-linear transformation of the response or
factors, or both, can considerably simplify and improve the model. For example, while in
the original metrics a response can be adequately fitted by second order polynomial, a
first order polynomial may be a good model after transforming the response and/or the
factors.
STATISTICAL METHODS FORDATA ANALYSIS 81
Consider first transformations of the responses. Difficulties often arise if for some
runs the recorded response is too !arge, while for others it is too small. In this case a
non-linear transformation of the observed response Y (for example, y = log 10 Y)
contracts the scale in the region of !arge responses. The model for the transformed
response y is often much simpler than for the original one.
The transformations are also useful for stabilizing the variance and for obtaining
distributions close to the normal one. Variance stabilization can be achieved when it is
known that standard deviation of a response is functionally dependent on its mean value.
For example, Iet the differences between the observed responsein various runs are !arge
and the standard deviation of the response is proportional to its mean value. In this case
the efficiency of the ordinary least squares estimates is low because the assumption for
constant variances is not met. A logarithmic transformation of the response contracts the
scale in the region where the standard deviations are great and results in stabilizing the
variance over the whole region of interest.
Sometimes the distribution of the response Y in the original metrics is not normal,
while after some non-linear transformation it becomes approximately normal. It is
known, for example Hahn and Shapiro (1967), that the size ofparticles obtained through
fractionating of some material in a mill follows log normal distribution. Therefore, if we
want to build a model of the particle size as function of some factors we should take the
logarithm of size as a response.
Simple power transformations of the form y = ya are useful in many cases. Most
frequently used are the square root (a = 0.5), the inverse square root (a = -0.5) and the
reciprocal transformation ( a = -1). The logarithmic and the square root transformations
are applicable only if the response is positive.
A farnily of transformations, depending on a parameter A., is proposed by Box
and Cox (1964):
In this relationship Y is the geometric mean of the observations in original metrics which
can be computed from the formula
. J n
lnY =- :LinJ;..
n u=l
The parameter A. is chosen to minimize the residual sum of squares QR(A.) for a
set of A. values. Forthis purpose the model is fitted for each value of A. and In QR (A.) is
plotted against A. The value i that produces minimum of In QR (A.) is the maximum
likelihood estimate of A.
82 CHAPTER2
where a is a given significance Ievel, %2 (a,1) is the corresponding upper critical point of
% 2 -distribution with one degree of freedom, and vR is the number of residual degrees of
freedom.
Non-linear transformations of factors x1 , x2 , ... , x1 are also useful in some cases.
They do not affect the distribution of the response. These transformations can be chosen
empirically or on the basis of some theoretical knowledge about the model.
For more information about transformations see Box and Cox (1964), Box and
Draper (1987), Logothetis and Wynn (1989) and Box and Tydwell (1962).
Due to non-linear transformation of errors (see Fig. 1.3) it is possible to minimize the
performance characteristic's variability. Hence the response variance depends on the
factors and the observations are heteroscedastic.
Two types of experiments are used in quality improvement problems:
• Experiments without errors in factors. Usually in this case the classical
regression analysis assumptions are satisfied and the ordinary least squares can be used
to fit equations.
• Experiments with errors in factors. Responses obtained from such experiments
are heteroscedastic random variables and the first assumption of the classical regression
analysis does not hold. In this case the ordinary least squares give biased and inefficient
estimates ofthe regression coefficients (Vuchkov and Boyadjieva (1981)). Unbiased and
efficient estimates based on heteroscedastic Observations can be provided by weighted
least squares.
Let us introduce some weights w; such that the contributions of Observations in
the residual sum of squares are small when their variance is large:
For heteroscedastic Observations the weights are chosen equal to the inverses of
their variances:
Taking into account (2.31) we rewrite the residual sum of squares in the form
(2.65)
Denote
and
T T
F. F.B= F. y., (2.66)
A
where
I
F. = W 2 F,
(2.67)
84 CHAPTER2
2.4. Bibliography
(2.4)
where
(2.3)
(2.5)
i=l j=l i=l
and
(2.6)
or
k r
D = 2LL(yij- yiXJ;i- y)=
i=l j=l
(A.2.1.2)
i=l j=l i=l
where
k k
B= L(yi- y)=LYi -liy. (A.2.1.3)
i=l i=l
one can see that B = 0. Hence, D is also equal to zero and (A. 2. 1. 1) can be presented in
the form of (2.4).
•
Appendix A.2.2. Derivation ofthe simplified formulae (2.10) and (2.11)
Q= L(y.- YY.
•=I
N N
Q = ~>:- 2YLY. + NY 2 (A.2.2.1)
u=l u=l
1 N
.Y= N~Y.
N N
Q= 1>; -2Ny2 +NY2 = LY; -Ny2. (A.2.2.2)
u=l u=l
(2.10)
where
1. Under the assumptions of Section 2.3.4 the least squares estimates are
unbiased: E(B)
= () .
•
2. The covariance matrix of least squares estimates is
where C = (Fr F t. Denote by c;; and c,1 the diagonal and the off-diagonal elements of
C respectively. The variances and covariances ofleast square estimates are:
STATISTICAL METHODS FOR DATA ANALYSIS 87
(2.42)
(2.43)
or
(A.2.3.2)
As t t
B= (FrF Fr y and () = (FrF Fr 17, we can rewrite equation (A.2.3 .I) as follows:
v(o)=E[(o-BXo-Br] = E{[(FrFtFr(y-ry)iFrFtFr{y-ry)J} =
In the last equation we take into account that Fis non-random and (FrF t is symmetric
matrix. E[(y -17 )(y -qY] is the covariance matrix of observations and according to the
assumptions ofSection 2.3.4 is v(y)=a;I. Substituting a;1 for E[(y-q)(y-qy] in
(A.2.3.3) we obtain (2.41). Equations (2.42) and (2.43) follow immediately from (2.41) .
•
3. The variance oj the predicted response is
(2.44)
Proof By definition
(2.46)
where
ST ATISTICAL METHODS FOR DAT A ANALYSIS 89
(2.45)
u:::l
QM = IcY.- .YY'
u:::J
(2.47)
and
(2.48)
u=l
(A.2.4.2)
u::;::}
we see that the first ofthe normal equations (2.34) can be written as follows:
L(yu -y.)=O,
u=I
because fu 1 = 1. Hence,
-IN IN~~
Yu = NLYu = NLYu =Yu·
u=l u=l
The residual sum of squares can also be written in the following matrix form:
90 CHAPTER2
QR = i:Cv.- .Y.Y
u=l
= ~-For~-FO)=
(A.2.4.5)
(A.2.4.6)
According to the normal equatiöns the last term of(A.2.4.6) is equal to zero and
= YT y- NY 2 =LY: - NY2 .
N
QM + QR (A.2.4.7)
u=l
~ ~
L OJ;= iff'
k
ji =
i=l
STATISTICAL METHODS FOR DATA ANALYSIS 91
obtained on the basis of N observations using the normal equations FrFB= Fr y. Denote
the N - vector of residuals by 8 = y - y. The following properties of the residuals hold
under the assumptions ofthe linear regression analysis listed in Section 2.3.4:
1. E(e) = 0, provided that the postulated model is true.
Proof If the postulated model is true then 71 = FB and under the assumptions of
Section 2.3.4, the model coefficient estimates are unbiased: E(o)= B. Therefore,
v(&)=Ha;, (A.2.5.1)
where
(A.2.5.2)
(A.2.5.3)
According to the assumptions accepted in Section 2.3.4 the Observations are not
)=
correlated and with equal variances. Therefore, E&yr a;I and
(A.2.5.4)
and
H 2 =HH=(I-MXI-M)=I-2M+M 2 =1-M=H.
92 CHAPTER2
Taking into account that (FrFt is a symmetric matrix we can write the off-diagonal
elements ofH as follows:
T(
h.i =f. F F
T )-I fi =fi T( T
F F
)-1 f. =hiu.
Hence, His symmetric and H = Hr. Using in (A.2.5.4) the fact that HHr = HH= H we
obtain the result (A. 2. 5 .1).
•
3. The leverages m•• take values within the interval 0 ::;; m•• ::;; I.
m•• =fr(FrFtr
and they arenon negative because (FrFy 1 isanon negative definite matrix. H = H 2 is
non negative definite matrix and h•• = 1-m•• are also non negative. Hence, 0 ::;; m•• ::;; 1.
•
4. The residuals have a multivariable normal distribution with expectation
E(&) =0 and covariance matrix V(&)= Her;.
e
The normality follows from (A.2.5.3) because = Hy isalinear transformation
of a normally distributed vector y. The expectation and the · covariance matrix of the
residuals are according to properties I and 3.
•
5. The residuals e are correlated with the observed response y and not
correlated with the predicted one y.
Proof.
(i)The covariance matrix of eand y is
The residuals are correlated with the observed responses because H is a non-diagonal
matrix.
STATISTICAL METHODS FOR DATA ANALYSIS 93
(A.2.5.7)
The last equality follows from the symmetry of H and M. As M is idempotent matrix we
can present (A.2.5.7) in the form:
(2.52)
Proof If the postulated model is true E(y) = 17 =F(}. According to (2.52) and
(A.2.5.3) we can write
94 CHAPTER2
(A.2.5.8)
(A.2.5.9)
The expectation of QR is
However, yTH77 = 77THy because the transposition does not change these scalar values.
Taking expectation we obtain
(A.2.5.11)
r
E(yTHy) = E[(y-7] H(y -77) + 77TH11)
(A.2.5.13)
77 rH77 = orFr[I-F(FrFtFr]Fo =
3.1. Introduction
96
DESIGN OF REGRESSION EXPERIMENTS 97
ii) left to vary during the experiment. In this case their effects are considered as
random and this increases the response variance.
During experiments factors must be set to given Ievels and measured accurately.
Sometimes the factors can not be kept precisely on their preliminary given Ievels. Then it
is important to write down the real factor values and to carry out the regression analysis
with them.
• Region of operability and region of interest. Factars can be varied over an
extensive area in the factor space called operability region. It is usually unknown or
poorly known and modelling the performance characteristic in it is impossible or very
difficult. However, experimenteis can locally approximate the response by a polynomial
regression in the neighbourhood of a point x0 which is initially supposed to be the best.
This model is adequate in a much smaller region of interest. Analyzing the regression
model investigators can decide to conduct new experiments within another region of
interest in order to improve the process or product performance characteristics. Applying
this procedure several times researchers can reach operating conditions far from the
initially chosen point in the Operability region.
Often the region of interest is defined by an initial point x0 and by the intervals of
factors araund it. They must be chosen so that the response is measurable for all possible
combinations of the factor Ievels. For example, consider an experiment designed for
studying the eroshing strength of concrete as a function of its components which are
cement, gravel, sand and water. If the variation intervals of all components are chosen
between 0 % and 100 %, combinations of Ievels such as I 00 % water and 0 % of the
other components might be in the design. It is clear that the eroshing strength of the
water can not be measured. In this case realistic factors intervals must be chosen for
which the eroshing strength can be measured (though it may not be optimal or even high
enough). Sometimes the wrang choice offactor Ievels can be dangerous. For example, in
an oxidation process the growth of the oxygen amount can considerably increase the
yield of a chemical reaction. However, too much oxygen could make the reaction so fast
that it could cause an explosion and destroy the installation. Therefore, good engineering
knowledge is crocial for the choice offactors and their Ievels.
• Design of experiments. Same a priori information about the order of polynomial
model could be very useful for a good choice of the experimental design. If such a priori
information is not available, it is recommended to start with a low order polynomial
model. After conducting the experiments and building the model a test for Iack of fit is
carried out. If the model is adequate it can be used for optimization or response surface
exploration. If not - then the design is augmented with new points so that a higher order
model to be fitted.
The most important requirements for the response surface designs are:
i) Designs should ensure that the fitted value y(x) is as close as possible to the
troe value 17(x).
ii) Designs should not require very high number of rons. Designs with too many
rons are irrelevant when the experiments are expensive or time consuming.
98 CHAPTER3
iii) They should provide good detectability of Iack of fit and a possibility to check
the assumptions ofthe regression analysis.
iv) Designs of experiments should not require an impractically high number of
factor Ievels. This could be inconvenient from engineering point ofview.
v) Designs should provide an intemal estimate ofthe output error.
vi) Augmentation with new design points should be possible to allow models of
increasing ordertobe built ifnecessary.
Box and Draper (1987) Iist other requirements to designs. Sometimes they
contradict each other and the weight, which should be put on each of them, depends on
the specific situation. Weights are matter of judgment and therefore, different
experimenters may make different choices in identical situations.
An appealing design property is the so called orthogonality. A design is
orthogonal if its information matrix FrF is diagonal. The orthogonality provides
independent model coefficient estimates and consequently, a possibility to interpret
independently the effects of factors and their interactions.
• Implementation of the experiments. We consider the steady state of the
processes and that is why the measurements of the response must be conducted after the
transient processes are terminated. The dead time must also be taken into account. This
is especially very important for slow processes that are for example typical for chemical
industry.
It is advisable to conduct experiments in a randomly chosen sequence.
Randomization allows to decrease the effect of heterogeneities in the raw materials or in
energy supplies.
• Data processing. Regression analysis is used to build models. A test for Iack of
fit is obligatory and testing the regression analysis assumptions through analysis of the
residuals is recommended. Transformations of the response and/or the factors can be
useful in many cases.
• Making engineering decisions. Models can help engineers to make decisions
about the product or process. Analyzing the effects of factors and their interactions,
using optimization procedures, contour plots and canonical analysis of the model
engineers can make a good decision taking into account the economic aspects of the
problem. The optimal solution must be checked through a confirmatory experiment.
In Chapter 2 we have shown that the regression model can be considered as
Taylor series expansion of a continuous function. For many engineering problems first
and second order models are satisfactory. Further on in this chapter we consider first and
second order designs.
One ofthe requirements listed in Section 3.1 is closeness ofthe predicted response y(x)
to the true value 77(x). As y(x) is an unbiased estimate of 77(x) this requirement means
DESIGN OF REGRESSION EXPERIMENTS 99
that the variance ofthe predicted response should be minimal. From (2.44) we know that
this variance is
The closer the estimates of regression coefficients ~ to their true values the
higher the accuracy. One can judge about the closeness of () to () on the basis of the
confidence ellipsoid which is defined by the relationship
The parameters characterizing this ellipsoid such as length of axes and volume
depend on the information matrix FrF, or equivalently on the covariance matrix of
100 CHAPTER3
tr(FrF t 1
=
k
trC = ~>;;
j:;;:}
.
According to (2.42) the variance of an estimate B; is
(3.1)
with elements
In this formula ~1 is the proportion of the observations at /-th support point. The
function ~(x) is called probability measure. It is non zero only at the support points of
the design.
A design with probability measure ~. which satisfies the condition
(3.2)
Example 3.1
Consider the following model:
102 CHAPTER3
;5. = 0. 0962.
x2
1 2 3
x1
4 5 6
7 8 9
Kiefer and Wolfowitz (1959) showed that the continuous D-optimal and G-
•
optimal designs are equivalent and following equation is true for them:
The closeness of this design to the G-optimal one is measured through so called G-
efficiency:
k
G.ff = J(x).
Both G.JJ and D.ff are zeros for singular information matrices (if IM(~~~ 0 then
frM- 1 (~) ~ oo) and are equal to 1 for the continuous D-optimal designs, so that
O~G.JJ ~1 and O~D.JJ ~1.
An important property of the continuous D-optimal designs is that they are
invariant under non-singular linear transformations of the functions in the regression
model (Kiefer (1961}, Fedorov (1972)). Therefore, the use of coded factors does not
change the properties of the designs.
All criteria considered in this section are model-dependent because the matrix F
depends on the model structure. For example, a second order rotatable or D-optimal
design does not have these properties if it is used to estimate models of lower or higher
order or even if the order of the design is the same, but some terms in the polynomial are
added or deleted. Hence, the choice of design based on initial experimenter' s guess about
the model structure, may be more or less successful. Efficiency of designs depends on the
shape and size of the region of interest which are also subject to experimenter's
judgment.
Using the sequential nature of experimentation, researchers can improve the
initially chosen design. After analysis of a model based on the initial design they can
augment it with new points taking into account the updated model structure or region of
interest.
Example 3.2.
Consider the following model:
104 CHAPTER3
where x is varied within the interval -1 ::; x ::; 1. Figure 3. 4 shows a line denoted "0"
which corresponds to this equation. Suppose that this line is unknown and we want to
estimate ß0 and ß1 using two Observations ( N = 2 ). Consider two experimental designs
as follows:
1. Design D1 : the observations are allocated at the two ends of the interval, i.e. in
the points x}!l = -1 and x~~ = 1.
2. Design D2 : the observations are allocated at arbitrary points x} 2 l and x~2 ).
y
T)
Figure 3. 4. Sensitivity to errors of a straight line estimated through data from D-optimal and arbitrary
design
We suppose that the random errors 6 1 and 6 2 for the observations are one and
the same for both designs. The line "1" on Figure 3.4 shows an estimate of the true line
"0" when design D1 is used and the response errors are 6 1 and 6 2 • For the experiments
conducted by design D2 with the same errors the estimated line is denoted by "2". One
can see that when the points are moved along the interval -1 ::; x ::; 1 the most accurate
estimate is obtained for design D~> i.e. for x1 = x1(!l = -1 and x2 = x~~ = 1. We will show
that they are the support points of a continuous D-optimal design with proportians
; 1~ l =; ~~) = 0. 5 . The matrix of regressors is
4
.77
Figure 3.5. Cantours of normalized information matrix determinant for singlevariable linear model
Figure 3.5 shows the contours ofthe determinant IM- (.;)1. One can see that it is
1
minimal for x1 = x1(!) = -1 and x 2 = x~~ = 1. Therefore, design D 1 is D-optimal one. The
normalized information matrix for this design is
Therefore,
M
-1
(.;.)= (10 0)1
One can see that the best estimate of the straight line 11 0 11 can be obtained through
design D1 which is D-optimal. The deviations of the predicted response 11 111 from the true
one ( 11 0 11 ) are largest for the support points of the D-optimal design (at the ends of the
interval). For these points we can compute the variance of the predicted response
provided that a: = 1. For instance for x 2 = x~0 = 1 we obtain:
106 CHAPTER3
(3.5)
Table 3.1 shows the design matrix X, the extended design matrix F and the
vector of observations y.
This design can also be used to estimate model parameters of other polynomial
models like
DESIGN OF REGRESSION EXPERIMENTS 107
or
or
However, it can not be applied for models containing second or higher power of
the factors. For example, consider the model
(3.6)
It has 4 coefficients like model (3.5). The corresponding design matrix of a full
factorial design for this model is shown in Table 3.2.
One can see that the first and the fourth columns in this table coincide. That is
why Fis not offull rank and the information matrix FrF is singular:
4 0 0 4]
[0 0 4 0
FTF = 0 4 0 0 .
4 0 0 4
The determinant of FTF is zero, the matrix (FrFt doesn't exist and formula
(2.41) can not be used to estimate the model coefficients. The same is true for any even
power offactors (x;2 ,xt, ... ). Ifthe effects ofhigher order even powers offactors arenot
negligible and are not included in the model then they change the intercept b0 because the
functions x0 ,x;2 ,x;4 , ... are not distinguishable for any experimental run. In this case we
say that b0 is an aliased estimate for ßo ,/3;; ,/3;;", .... This is written as follows:
I I
If odd powers of the factors are used in the model, for example, if J.. = then x:,
f4 is equal to Xj and the matrix FrF is singular too. When significant odd powers of the
factors are not in the model, then bi are aliased estimates for the coefficients before the
odd powers ofthe factors:
I I
Often when the region of interest is small enough a linear model can fit the
response and the coefficients Ao ßiii ,/l;iii , ßiiiii may be negligible because they can be
interpreted as high order derivatives in the Taylor series expansion.
TABLE 3..
3 Full f:acton"al destgn
. fior three f:actors
F matrix Vecto
r
No. .ft /2 /3 !4 fs !6 !7 fs y
Xo XI x2 x3 XIX2 XIX3 X2l3 XIX2X3
1 1 -1 -1 -1 1 1 1 -1 YI
2 1 1 -1 -1 -1 -1 1 1 Y2
3 1 -1 1 -1 -1 1 -1 1 Y3
4 1 1 1 -1 1 -1 -1 -1 Y4
5 1 -1 -1 1 1 -1 -1 1 Ys
6 1 1 -1 1 -1 1 -1 -1 Y6
7 1 -1 1 1 -1 -1 1 -1 Y1
8 1 1 1 1 1 1 1 1 Ys
A simple rule for construction of full factorial designs with an arbitrary number of
factors is:
The entries of the design matrix X are always equal to -1 or 1 with signs
changingfor xi ajter 2° =1 row, jor x2 ajter i =2 rows, jor x3 ajter 2 2 =4 rows, etc.,
according to the powers oj 2: 2° ,i ,2 2 ,2 3 ,24 , ...
Table 3.3 shows the extended design matrix F for the following model
The points of a full factorial design are allocated at the vertices of a cube (square
form= 2).They areshownon Figure 3.6 form= 2 and m = 3.
x2
x1
a) b)
Figure 3.6. Full factorial design (a) for two factors (b) for three factors
Form > 3 the points of a full factorial design are allocated at the vertices of a
hyper cube. It has the same properties as a three-dimensional cube.
As one can see from Figure 3.6 the points of the full factorial design are
symmetrically allocated with respect to the origin of the coordinate system. This fact
provides remarkable properties ofthistype designs which we will consider next.
We consider some properties of the two-level full factorial design which can be easi1y
checked through Table 3.3. They are:
1. The sum of the first column elements of matrix F is always equal to the number
ofruns N:
This property follows from the fact that the elements of the first column of F are
always equal to 1.
2. The sum ofthe elements in the rest ofthe columns ofF always equals zero:
This property is due to the symmetry of the design with respect to the center of
the coordinate system.
The next two properties define the elements of the information matrix G = FrF
for a two level full factorial design.
3. The sum of squares of each column of F (which is equal to the corresponding
diagonal element ofG) is always equal to N:
N
Taking into account properties 3 and 4 one can see that G is a diagonal matrix
with diagonal elements equal to N:
N 0 0 0
0 N 0 0
G =FTF = 0 0 N 0 =NI, (3.10)
0 0 0 N
Consequently,
k
frM- 1f =frf =""'f.J/. (3 .11)
i=l
As J; are functions of the type X; or X;Xj and the region of interest is defined by the
inequalities -1::::: X; ::::: l,i = 1,2, ... ,/ the maximum of (3.11) is achieved for X;= ±1 and
x1 = ±1, i.e. at the vertices of the /-dimensional cube which are also points of the full
factorial designs. For these points /; 2 =I and therefore
DESIGN OF REGRESSION EXPERIMENTS 111
According to the equivalence theorem (3.3) the full factorial design is D-optimal.
The optimal proportion of the observations at the design points is ;. = 1I 2 1. Kiefer
(1959) showed that this design is also A- and E-optimal.
Ifthe model is linear in factors:
Lxj
l
frM- 1f = 1+ 2 .
J=l
is constant in a sphere with radius pwhich means that the design is rotatable.
Note that if the model contains both linear and interaction terms, then the full
factorial design is not rotatable. In this case the functions J; are
/ 1 = 1,/2 = xP ... , x x x
fz+ 1 = 1, / 1+2 = x1 2 , ... , fk = x1_ 1 1 . According to (3 .11) one obtains
k l 1-1 I
The last term of this equation is not constant over a sphere and consequently, the full
factorial design is not rotatable in this case.
Figure 3. 7a shows the contours of variance er 2 (Y) for a model linear with respect
to two factors x1 and x2 . They are concentric circles and therefore, in this case the design
is rotatable. The same design is no more rotatble for a model with linear and interaction
terms with two factors:
112 CHAPTER3
This can be seen from 3. 7b where the variance contours are not circles. In both cases it is
assumed that if. = 1.
0.8
0.6
0.4
t x2
0.2
0
-0.2
0.8
0.6
0.4
-1._~~~--~~~--~~L_~
-1 -0.5 0 0.5
X
~
b)
Parameter estimation
Least squares method is used for parameter estimation:
(3.13)
Taking into account that G = FrF is a diagonal matrix its inverse is easily computed as
follows:
1
- 0 0 0
N
I
0 - 0 0
N
C = (FrF)- = 1
1 =_!_I. (3.14)
0 0 - 0 N
N
... ... ...
I
0 0 0
N
(3.15)
~ 1 N
(}; = N 'L/.;Yu• i = 1,2, ... ,k. (3.16)
u=l
(3.17)
1 N
h; = N'Lx.;y., i=1,2, ... ,l. (3.18)
u=l
114 CHAPTER3
In a similar way for the interactions we substitute .f.j = x.ix.1 and obtain:
bij = - L
1 N
N u~1
x.ixujYu' i = 1, 2, ... ' 1- 1, j = i + 1, i + 2, ... ,1. (3.19)
The computations with formulae (3.17) to (3.19) are very simple because in the
full factorial experiment the values of x.i and x.ix.1 are always equal to ± 1.
Suppose r repeated observations are taken at each design point. Then formulae
(3 .17) to (3 .19) can be used with
1 r
Yu =- LYuj·
r J~1
substituted for Y•.
The model coefficients are computed by use offormula (3.16). For example,
1 16 1( 8 16 )
b4- - X
- 16 ~ u4Yu - 16
---
~Y. + ~Y. --
In the last row in Table 3.4 are given so called contrasts which are equal to the
doubled values of regression coefficients: 2bi, 2bi1 , 2bif1·
•
T ABLE 3. 4. Full factorial design for four factors
rJl
I
Vl
-
116 CHAPTER3
I 1-1 I
y(x)= b0 +~);X;+ L Lb;1 x;x1 . (3.20)
i=l i=l j=i+l
Putting x1 = x2 =... = x1 = 0 we see that b0 is equal to the predicted response y(o) at the
center of the coordinate system in coded factors which corresponds to the basic Ievels of
the factors in natural (physical) scale.
Taking into account (3 .17} we also see that b0 is the mean value of the responses
observed during the experiment and
(3.21)
y y
200 200
2h,J~.
150 150
100 100
50~----r----------r~ 50 ~--~~---------r~
-1 -1
~ ~
Figure 3.8. Main effects offactors X1 and X2 (a) b1 > 0 (b) b1 < 0
DESIGN OF REGRESSION EXPERIMENTS II7
I 8 I 8
2b 2 =-LYu(X2 =I)--LYu(x2 =-1)=102.I99-I39.323=-37.I24.
8 u~l 8 u~l
y
200
b -0
14-
F
···::. :. ·7.·::·:::·::::::::::::::··::::i·······•. D
x4 ~-J
E
C x4~ 1 •
50 50 . L . . . . . - - f - - - - - - - - + - -
·1 ·1
~ ~
Figure 3.9. Interaction effects (a) h34 > 0 (b) h14 ~ 0
1 4 1
Yc =-4 LY.(x3 = -1,X4 = 1) = -(69.832+143.473 +44.228+ 106.228)= 90.934,
u;J 4
1 4 1
Yv = 4 ~y.(x3 = 1,X4 = 1) = 4(111.110+202.282+91.637 + 169.819) = 143.712,
1 4 1
YE =- LY.(x3 = -1,x4 = -1) =-(128.652+195.786+94.606+168.393)= 146.859,
4 u;J 4
1 4 1
YF =- LY.(x3 = 1, x4 = -1) = -(99.432+ 164.015+21.002+ 121.704) = 101.538.
4 u;J 4
(3.22)
Consequently, all estimates of the model coefficients are uncorrelated and have
equal variances.
DESIGN OF REGRESSION EXPERIMENTS 119
The regression coefficients can be tested for significance using the t-test given in
subsection 2.3.6. The following value is computed to this end:
()
l sEJS:.
t =--'-
Taking into account (3.22) and substituting the residual variance sR for sE one can write
down
where Ir is the critical value of Student's distribution for significance Ievel a and vR
degrees of freedom.
Some statistical packages compute the probability
The sums Q1, Q2, ... , Q1 , Q12 , ... , QH 1 are due to the effects of the factors and
interactions and QR is the residual sum of squares. As shown in Appendix A.3 .1 using
(3. 16) one can compute the sums due to main effects as follows:
Each of these sums has one degree of freedom (V; = v;1 = 1) and therefore, they are
equal to their mean squares: s;2 = Q I V;,~ = Q1 I Y;1 . The effects of factors and
interactions can be tested for significance using the following relationships:
where
If F; > F(a,1, vR) or F;i > F(a,1, v R) then the corresponding effect is significant
at Ievel a.
The Iack of fit test is the same as for the standard regression analysis procedure
(see Chapter 2).
Table 3.6 presents the significance test for the regression coefficients. The
standard deviation ofthe coefficient estimates is computed using formula (3.32):
Columns 4 and 5 show the significance test based on t-criterion. One can see that p is
less than or equal to 0.05 for b0 A and b34 and is 0.066 for b2 . These coefficients can be
considered as significant at Ievel 0.1. The same results are obtained by comparison oft-
ratio with the critical value of Student's distribution for a = 0. 05:
tr ={1-~,vR )=12.706. For b0 ,b1 and b34 the computed t-ratio is smaller than
12.706. The critical t-ratio value for significance Ievel a=0.1 is tr =6.314.
Aslt2 1 = 9. 65 the estimate b2 is significant at Ievel a = 0.1.
The same conclusions can be drawn through the analysis of variances due to the
regression given in columns 6, 7 and 8 of Table 3.6. The ratios F; = s,2 I s; are shown in
column 8 of Table 3.6. The critical value of the F-distribution for significance Ievel
a = 0. 05 is F;. = F(0.05;1;1)= 161.4. Therefore, b1 and b34 are significant at Ievel 0.05. It
is easy to check that b2 is significant at a = 0. 1.
122 CHAPTER3
The multiple correlation coefficient for this model is R = 0.96 and the corresponding F-
ratio is equal to 95.44. The critical value ofF-distribution for significance level 0.05 and
degrees offreedom vM = 3, vR = 12 is Fr = 3.49. The multiple correlation coefficient is
significant.
80
Effects 60 _
40 -
20 -
0-
0 0 0
-20 -
·2 ·1
Normal scores
a)
DESIGN OF REGRESSION EXPERIMENTS 123
80~----------------------~
x1 o
Effects 60 _
40-
20 -
0-
-20-
o x2
-40 -"""'r---=----,-------.--------r--------r'
-2 -1
Normal scores
b)
Figure 3.1 0. Normal and half normal probability plots of the effects (a) normal plot
(b) half-normal plot
Figure 3. 10 shows normal and half-normal plots of the effects. They confirm the
conclusion that significant effects have x1 , x 2 and x 3 x 4 . All other effects Iie roughly on a
straight line.
•
3.4. Two Ievel fractional factorial designs
W e have already noted that the number of runs of the full factorial design is N = i. It
increases rapidly with the growth of I. For example, for I= 2, 5, 10 and 15 we obtain N
= 4, N = 32, N = 1024 and N = 32768, respectively. When I is !arge N becomes
unacceptably !arge and the full factorial design can not be used in practice.
One advantage of the two-level full factorial experiment is that it is orthogonal.
This provides some attractive properties of this design (see subsection 3.2.2). The
problern is how to decrease the number of runs preserving the orthogonality of the
design. We can use a fraction of the full factorial design. If it is also orthogonal it is
called regular jraction. Consider a linear in factors model for I = 3:
If we want to conduct only 4 experiments, we can use a full factorial design for
two factors and vary the Ievels of the third factor x3 in the same manner as the
interaction ~x2 • That means to put x3 = ~x2 . This equation is called generating relation
or generator. We obtain a design which is shown in Table 3.7.
The elements of the matrix F shown in Table 3. 7 are the same as for a full
factorial design with 2 factors (see Table 3.1). Only the interpretation ofthe last column
is changed: we now consider it as corresponding to the factor X:J instead of the
interaction x1x2 • That is why the design remains orthogonal.
The design of Table 3. 7 consists of half the runs of the full factorial design for
three factors given in Table 3.3. The runs numbered 1, 2, 3 and 4 in Table 3.7 are the
same as runs 5, 2, 3 and 8 in Table 3.3. That is why Table 3.6 is called also a half
jraction of the full factorial experiment.
We could also choose X:J = -x1 ~. Then the other half of the full factorial design
(points 1, 4, 6 and 7 ofTable 3.3) would be selected.
With the fractional factorial design of Table 3. 7 one can not study separately the
effects of the factor x3 and the interaction ~x2 because in all runs their Ievels coincide.
That is why b3 characterizes their joint effect and is called aliased (or confounded)
estimate of ß3 and ß12 :
(3.26)
One can see from Table 3.7 that following equations arealso true: Xi = x2 x3 and
~ = x1x3 •That means that h1 and h2 are also aliased estimates:
(3.27)
(3.28)
It is clear that the price of run number reduction is the aliasing of the main effects
with some interactions. The variance of the estimates increases with the reduction of N
(see (3.22)).
DESIGN OF REGRESSION EXPERIMENTS 125
a quarter-fraction has
(3.29)
This is the defining contrast. Multiplying it by any of the factors or the interactions one
can study the aliasing ofthe factors and the interactions. For example, multiplying (3.29)
by x2 we have
Together with Xj = x2 x3 these equations form the system of aliasing also given by
(3.26), (3.27) and (3.28).
Consider now a design with four factors (I= 4) which has 8 runs. To find such a
fractional design one must take t = 1 because N = 2 4 - 1 = 8. The generating relation can
be chosen as follows: x4 = x1x 2 x 3 . Consequently,
(3.30)
Only one of the aliased functions must be included in the model. Otherwise there
will be coinciding colurnns in the extended design matrix F which will not be of full rank.
Table 3.8 shows a fractional factorial design with generating relation x4 = l1~x3 .
Only first order terms and the following interactions are included in the model: x1x2, x1x3
and x1x4 . The other interactions are aliased according to (3.31).
(3.32)
For the construction of a fractional factorial design one must choose the
resolution of the design. If the intervals of variation of factors are small and the
experimenter feels that all interactions are negligible then resolution III design can be
chosen. A fractional factorial design of resolution IV must be chosen when one wants the
main effects to be not aliased with each other and with second order effects. A design of
resolution V has to be chosen when one needs the main effects and the second order
interactions to be not aliased, while the third and higher order interactions can be
neglected.
One rnight think that equating some of the factors to the highest order products
of other factors he/she obtains a fractional factorial design of highest resolution. This is
true for half fractions when only one generating relation has to be chosen. If the design
has to be highly fractionated and several generators are necessary, this choice may not be
the best. This is shown in Example 3.6. Tables ofrecommended generators for fractional
designs of given resolution with different number of factors are given by Box, Hunter
and Hunter (1978) and Box and Draper (1987).
Choosing the generators, the experimenter also chooses I and consequently, the
number of runs N = 2 1- 1 . The higher the resolution of a fractional factorial design, the
!arger the number of runs N.
The following procedure can be used to build a fractional factorial design:
1. Choose the resolution of the design and 1 taking into account the maximal
number of runs which can be conducted.
2. Build a full factorial design for I- I factors.
3. Choose generators which ensure the desired resolution of the design. It is not
allowed the right hand sides of the generators to be equal to products of factors that
differ only by signs, because in this case linear dependence between the colurnns of F
occurs.
4. Augment the design matrix with colurnns corresponding to the generators
chosen in step 3.
Example 3.6
Let us construct a fractional factorial design of highest possible resolution with I
= 7 factors which has no more than 20 runs. The full factorial design with 7 factors has
N = 2 7 = 128 points.
We choose t = 3 because N = 2 7- 3 = 16, while N = 2 7- 2 = 32. Construct a full
factorial design for I - I = 7 - 3 = 4 factors ( xi, x2 , x 3 and x 4 ) as shown in colurnns 2 to 5
ofTable 3.9.
128 CHAPTER3
Three generators are necessary for the values of x5, x6 and x1. They can be chosen
in different ways:
a) Through equating the factors x5,x6 and x1 to higher order products of
x1 ,x2,x3 and x4. For example, Iet x1 = x1 x2 ~x4 . According to step 3 of the rule given
above we have to choose the next two generators to be products of three factors, for
example, x6 = - X2~X4 and x5 = x1X2X4. Multiplying both sides of these generators by
x1,x6 and x5, correspondingly, we obtain
(3.34)
.. 2 7- 3 fra.
TABLE39 ctionalf:acton"aldes1gn
.
No. x, x2 x3 x4 Xs = x2~x4 x6 = x1x2x4 x1 = -x1x3x4
1 2 3 4 5 6 7 8
1 -1 -1 -1 -1 -1 -1 1
2 1 -1 -1 -1 -1 1 -1
3 -1 1 -1 -1 1 1 1
4 1 1 -1 -1 1 -1 -1
5 -1 -1 1 -1 1 -1 -1
6 1 -1 1 -1 1 1 1
7 -1 I I -I -I I -I
8 1 1 1 -1 -1 -1 1
9 -1 -1 -1 1 I I -1
10 1 -I -I 1 I -I I
11 -1 1 -I 1 -1 -1 -1
I2 1 1 -1 1 -1 I I
13 -1 -1 1 1 -1 1 1
14 1 -1 1 1 -1 -1 -1
I5 -I 1 I 1 I -I I
16 1 I 1 I I 1 -I
(3.35)
As the left-hand sides of (3.34) and (3.35) are all equal to I we obtain the so-called
generalized defining contrast:
DESIGN OF REGRESSION EXPERIMENTS 129
The fractianal factanal design with these generatars is af resalutian III because the
smallest number af factars in the generalized defining cantrast is 3.
b) Using third arder praducts in the right hand side afthe generatars:
and
(3.36)
•
130 CHAPTER3
The simplified formulae for regression coefficient estimation given in Section 3.3.3 and
the corresponding rules for testing their significance can be applied to fractional factorial
designs too. This can be explained with the orthogonality of both full and fractional
factorial designs.
The test for Iack of fit is according to the regression analysis procedures
described in Chapter 2.
TABLE311 Des1gn
. of expenments andbse o rvauons fior amme extracuon oflead
Run Con- Con- Tem- Solid/ Extra
No. or- cen- cen- pe- Liquid c-tion
der tration tration · rature ratio time Degree of Iead extraction
r - of of
DETA H2S04 observed mean predicted
XI x2 x3 x4 Xs y' y" y y
1 1 -1 -1 -1 -1 -1 13.25 14.45 13.85 13.45
2 6 1 -1 -1 1 1 45.75 44.35 45.05 44.79
3 7 -1 1 -1 1 1 42.24 41.72 41.98 42.23
4 2 1 1 -1 -1 -1 36.03 34.47 35.25 35.65
5 3 -1 -1 1 I -1 63.88 62.32 63.10 63.49
6 4 1 -1 1 -1 1 82.98 84.86 83.92 84.19
7 5 -1 I 1 -1 1 82.22 81.54 81.88 81.63
8 8 1 I 1 1 -1 85.62 86.54 86.08 85.69
Other second order interactions are not considered because they are aliased with
main effects, with Xj~ or x2x3 . All interactions higher than second order are neglected
and the following model is obtained:
The regression coefficients' estimates are calculated by formulae (3 .17) - (3 .19). For
example
1
bl = -(-13.85 +35.25- 63.10+ 83.92-81.88 + 45.05-41.98 + 86.08)= 6.19'
8
1
bl3 = -(13.85 -44.79+ 42.23-35.65-63.49 +84.19-81.63 +85.69)= 0.07.
8
Only two interaction effects are included in the model because all other second order
interactions are aliased with main effects or with x1x 3 and x2 x3 .
The model is tested for Iack of fit by use of the procedure given in subsection
2.3.7. The pure error sum ofsquares is
132 CHAPTER3
with
h
vE=Lij-h =2x8-8=8
i=l
The predicted values of the response in the design points are given in the last row
of Table 3.10. The sum of squares due to Iack of fit for this model is
h
QL = Lr;(Y;- yJ2 = 1.78
i=l
F= s~ = 0.89 ,=2.12.
s. 0.42
•
3.5. Blocking
If the number of runs is high it is possible for a relatively long period of time the
experimental conditions to change due to non-random changes of raw material, operator,
measuring device or some other reasons. If their effect is additive and does not interact
with the factors it can be estimated and eliminated. For this purpose the design is split
into blocks for which one can expect that the experimental conditions are approximately
constant. Suppose that the differences between the blocks can be considered as caused
by a factor B which changes in time (for example, raw material). Ifthe experiment is split
into two blocks, one before the change of the raw material and the other - after this
change, then the block effect is 2ß8 , where ß 8 is a regression coefficient before B. In
order to split the experiments into two blocks we put the factor B equal to highest
degree product of the other factors. In the first block we select these runs for which B =
+1 and the second consists of runs with B = -1.
For example, consider a full factorial experiment with 5 factors: xpx2 ,x3 ,x4 and
Xs. The total nurober of runs is N = 2 5 = 32. Suppose that we can keep the experimental
conditions approximately constant only in a half of these runs. Then .we can put
B = XjX2 x3x4 x5 and select for the first block only these runs for which B = + 1, and for the
second block runs with B = -1. This design is shown in Table 3.12. Each block in this
table can be considered as a half-fraction of the full factorial design. The blocking
variable B is not confounded with the other factors or interactions, except with
x1x 2 x3x4 x5 . Therefore, no main or interaction effect, except that of x1x2 x3X 4 X 5 , is affected
by the inter-block difference. Usually the fifth order interaction is negligible and its
aliasing with the block effect is not very important.
If the changes of experimental conditions occur in shorter time intervals, then the
design can be split into smaller blocks. For example, the full factorial design with 5
factors and 32 runs can be split into 4 blocks, each consisting of 8 runs. Forthis purpose
we introduce two blocking variables ~ and B2 . The corresponding blocking generators
have to be chosen to provide as higher as possible resolution of fractional factorials for
each block, taking into account that the blocking variables can interact with each other.
134 CHAPTER3
~pnttmg 25 d.
TABLE312 . Sr. es1gn mto two bl ocks
Block I Block II
Run XI x2 xJ x4 Xs B= Run XI x2 x3 x4 Xs B=
No. xlx2x3x4xs No. xlx2x3x4xs
1 1 -1 -1 -1 -1 I 17 -1 -1 -1 -1 -1 -1
2 -1 1 -1 -1 -1 1 18 1 1 -1 -1 -1 -1
3 -1 -1 I -1 -1 1 19 1 -1 1 -1 -1 -1
4 1 1 1 -1 -1 1 20 -1 1 1 -1 -1 -1
5 -1 -1 -1 1 -1 1 21 1 -1 -1 I -1 -1
6 1 1 -1 1 -1 1 22 -1 1 -1 1 -1 -1
7 1 -1 1 1 -1 1 23 -1 -1 1 1 -1 -1
8 -1 1 1 1 -1 1 24 1 1 1 1 -1 -1
9 -1 -1 -1 -1 1 1 25 1 -1 -1 -1 1 -1
10 1 1 -1 -1 1 1 26 -I 1 -1 -1 1 -1
u· 1 -1 1 -1 1 1 27 -1 -1 1 -1 1 -1
12 -1 1 1 -1 1 1 28 1 1 1 -1 1 -1
13 1 -1 -1 1 1 1 29 -1 -1 -1 1 1 -1
14 -1 1 -1 1 1 1 30 1 1 -1 1 1 -1
15 -1 -I 1 I I 1 31 1 -1 1 1 1 -1
16 1 1 1 1 1 1 32 -1 1 1 1 1 -1
The blocks can be chosen using the Ievels of blocking variables shown in Table
3.13.
The design is shown in Table 3.14. Smaller blocks can also be obtained but at the
price of aliasing second order interactions with the blocking variable.
DESIGN OF REGRESSION EXPERIMENTS 135
One of the most important characteristics of the response surface methodology is that it
is based on a sequential approach. The information obtained at the initial stages of the
experimentation is used in the next stages to improve tl].e experimental design. An
element of the response surface methodology is the steepest ascent procedure (Box and
Wilson (1951)).
The initial operating point in the factor space chosen by the experimenter to start
a research program can be far from the optimal one. An attempt to fit a polynomial over
the whole operability region will usually Iead to a very complex model with too many
coefficients. A great nurober of observations will be needed to fit this model. Usually
investigators are not interested in far from the optimal working conditions. Therefore,
they need a simple procedure to move the process from the initially chosen operating
point towards a smaller region of interest where optimal operating conditions are
expected tobe found. Ifthe initial point is far from the extremum one can expect that the
response surface in a small region ofinterest araund it is approximately linear. Using full
or fractional factorial design a model of the form
136 CHAPTER3
m m-1 m
y=b0 + Lbixi + L 'Lbiixixi (3.37)
i=l i=l j=i+l
can be fitted.
Typically when the initial operating point is far from the optimum the coefficients
bi1 are smaller than bi . The sum of squares due to the first order terms in (3. 37) is much
larger than the sum of squares due to the second order terms and (3.37) can be
considered as approximately linear model.
The idea of steepest ascent is to move the operating point in the direction of the
gradient of the response surface
. . ~J
It is known that the gradient is a vector that is orthogonal to the contour of TJ in a given
point (Figure 3.11). As over the contours TJ does not change, one can expect that in
direction of the gradient the change of the response function would be largest.
The gradient can be easily found from (3.37). As is shown in Section 2.3.3 a
polynornial model can be considered as a Taylor expansion of the response function. If
the response can be approximated by first order polynornial then the coefficients bi in
(3.37) are estimates ofthe gradient components
0." I
ß.-- 8x
I Io
x 1+1 =x 1 ±agrady,
DESIGN OF REGRESSION EXPERIMENTS 137
where a > 0 is a parameter which determines the length of the step. Sign (+) is used if
one needs to find a maximum, for a movement towards the minimum (-) should be
chosen.
X'
2
60 50
X'1
yI • yI
a)
y
X
1
b)
Figure 3.11. Steepest ascent for two factors (a) Path of steepest ascent (b) Section of the response surface
along the path of steepest ascent
138 CHAPTER3
The factors can be rewritten in scalar form for each component as follows:
For practical use it is convenient to convert the factors in their natural measuring scale
using formula (2.28) as follows:
or
The steps for all factors must be chosen so that the movement is in the direction
of the gradient, i.e. they must be proportional to b;. For example, the step y max for a
factor for which the value of b;OJ; is maximal can be chosen on the basis of
experimenter's intuition. The steps for the other factors have to be calculated using the
equation
Y; = lb bw II I
I I
maxOJmax
Ymax . (3.38)
cooling in chamotte melting pot. The intervals of variation of factors in natural and
coded va1ues are given in Table 3.15.
T ABLE 3.15. Design of experiments and path of steepest ascent for the alumirrum
.
. expenment
modifiIcauon
Factars Molybdenum Superheat Superheat Cooling Number
content temperature time conditions ofgrains
1. Dimension % oc min -
2. Basic Ievel (X; = 0) 0.40 840 60 -
3. Halfinterval of 0.15 100 60 -
variation, W;
4. Upper Ievel (X; = 1) 0.55 940 120 graphite
5. Lower Ievel (X; = -1) 0.25 740 0 chamotte
6. Code XI Xz x3 x4 y
7. Run 1, Runorder 4 1 1 1 1 100
8. Run 2, Runorder 3 -1 I I -1 81
9. Run 3, Runorder 8 1 -I 1 -I 95
10. Run 4, Runorder 5 -1 -I 1 I 36
11. Run 5, Runorder 7 1 1 -1 -1 130
12. Run 6, Runorder 2 -1 1 -I 1 69
13. Run 7, Runorder 1 1 -1 -1 I 90
14. Run 8, Runorder 6 -1 -1 -1 -1 64
15. Coefficients, b, 20.625 11.875 -5.125 -9.375
16. b;w; 3.094 1187.5 -307.5 -
17. Step size 0.03 10 -3.0 -
18. Computed values 0.43 850 57 chamotte
19. Computed values 0.46 860 54 chamotte
20. Run 9 0.49 870 51 chamotte 108
21. Computed values 0.52 880 48 chamotte
22. Computed values 0.55 890 45 chamotte
23. Run 10 0.58 900 42 chamotte 196
24. Run 11 0.61 910 39 chamotte 366
25. Run 12 0.64 920 36 chamotte 313
26. Computed values 0.67 930 33 chamotte
27. Run 13 0.70 940 30 chamotte 142
Steepest ascent is used for finding optimal conditions. A half fraction of the full
factorial design is used in the experiment. The generating relation is X4 = x1x2x3 and the
corresponding defining contrast is x1x2x3x4 = 1. Therefore, the effects are aliased as
follows:
I40 CHAPTER3
b4 ~ ß4 + ßl23,
The design of experiments, the Observations and the path of steepest ascent are
shown in Table 3 .I5. The regression coefficients are computed using formulae (3 .I7),
(3.I8) and (3.I9).
For example
I N I
bl =- ~>iuYu =-(I00-8I+95-36+130-69+90-64)= 20.625.
N u=l 8
Table 3.I6 shows the regression coefficients and the analysis ofvariance. One can
see that the interaction effects are much smaller than the main effects. The total sum of
squares is computed by formula (2.I 0):
N )2
~>u
t; Y: -
(
N
Q= u=JN = 60739- 55278.I25 = 5460.875.
The sums of squares due to effects are computed as shown in subsection 3.3.3.
For example
N
The sum of squares due to main effects is 5444.50 and is much higher than this of
the interaction effects (16.375). One can assume that the interaction effects are
negligible. This assumption is checked by significance test for regression model
coefficients. Three independent observations are conducted at the center of the design
and following response values are observed: 80, 82 and 78. The mean value is y = 80
and an independent estimate of the response variance with 2 degrees of freedom is
obtained:
1-±(y. -8oY = 4.
s; = -3 -1 u~l
are insignificant. In this formula Ir = t(0.025,2)= 2.92 is the critical value of Student's
distribution.
This test confirms the assumption that all interaction effects are insignificant.
Ignoring the interactions we obtain the following regression model:
u=l
N
u=l
The F-ratio is
142 CHAPTER3
It is rounded to 0.03 %. In a similar way the step size for the superheat time is
r3 =-3 min.
Starting from the basic Ievels of factors the operating point moves by steps in the
direction ofthe gradient as follows:
3.7.1. INTRODUCTION
If the region of interest includes the extremum of a response surface a model containing
only main effects and interactions is not good enough. In this case a second order
polynomial of the form
DESIGN OF REGRESSION EXPERIMENTS 143
I H I I
.Y(x)= b0 + _'Lb;X; + L _'Lbifxixf + _'Lb,;X;2 (3.39)
i:::l j=i+1 i=l
ho ~ ßo + L ß;; ·
i:::::l
The intercept b0 can be considered as value of the response in the origin of the
coordinate system when N 0 . Hence an independent estimate of this value from repeated
observations at the design center should provide the same response. Suppose that there
are N 0 repeated observations at the origin and Yo is their arithmetic mean. If b0 is much
different from .Yo we conclude that there are high values of ß;; and therefore, a model of
the form (3.39) has to be used. Note, that this comparison may not discover the
curvature when ß;; are with different signs.
It was shown in Section 3.3.1 that the quadratic effects can not be estimated from
observations obtained through two Ievel full or fractional factorial designs. A second
order design must have at least three Ievels of all factors. However, two Ievel full and
fractional factorials are often used as building blocks of second order designs. Addition
of new points allows to estimate pure quadratic effects.
The points of many second order designs are symmetrically allocated throughout
the region of interest. This symmetry provides some attractive properties. For example,
the main and interaction effects can be estimated independently if two Ievel full factorial
design or high-resolution fraction is used as a building block. They can be split into
144 CHAPTER3
blocks for elimination of non-random time trends and they can provide a satisfactory
distribution of the predicted response variance over the region of interest.
An important dass of symmetric designs are the so-called centrat composite designs. If a
full or fractional factorial experiment was conducted and a linear model or model with
interactions did not fit weil the data one can use the already obtained observations and
add new ones tobe able to estimate the coefficients ofmodel (3.39). This is the idea of
centrat composite designs. They consist ofthree types ofpoints:
i) Points of a full or fractional factorial design ("cube points"). Their number is
2 1- 1 . The rules for the construction of these designs are given in subsections 3.3.1 and
3.4.1.
ii) "Star" or axial points. They are allocated at equal distances a from the center
ofthe design and on the coordinate axes. All their coordinates except one (which is ±a)
are equal to zero. The number ofthe star points is N = 2/.
iii) Centrat points with all coordinates equal to zero. Denote their number by N 0 .
In fact this is one point at which N 0 observations are conducted.
The cube and the star points can be replicated t;, and r, times, respectively.
Consequently, the total number of runs of a centrat composite design is
N = 2 1-r rc + 2/r, + N 0 .
Figure 3.12 shows a central composite design for I= 2, t = 0, a. = 1, N 0 = 4 .
• •
No=4
a=l
• •
Figure 3.12 Centtal composite design with two factors: t = 0, a = 1, N0 = 4
DESIGN OF REGRESSION EXPERIMENTS I45
a =( i~~reJ.
The values of a for 2 to 8 factors and re = rs =I are given in Table 3.I6. N 0 can
be chosen Iager than the minimal value which is 2 to 4. Values of N 0 providing uniform
precision rotatable designs are given by Box, G. E. P. and Hunter, J. S. (1957), seealso
Table 3.I7.
If experimenters expect nonrandom changes, they can avoid the aliasing of the
time trend with the effects of factors and interactions by blocking. Box and Hunter
(1957) obtained the conditions for orthogonal blocking.
A central composite design can be split into two blocks, one of which is formed
by Ne cube points plus Neo center points, and the other of Ns star points plus Nso center
points. The total number of the center points is N 0 = Neo + Nso. Minimal values of N cO =2
(except form= 8, when Ne 0 =4) and Ns 0 =I (Ns 0 =2 form= 5, t = 1). are recommended.
The value of a can be computed from the following equation (Box and Draper (1987)):
This a will not achieve exact rotatability but the variance contours are close to spherical
ones.
If smaller blocks are needed then the cube part of the design can be split in the
same way as in Section 3. 5. The number of the center points Neo must be a multiple of
the number of the smaller cube blocks and these points must be equally divided among
146 CHAPTER3
the blocks. If the star is replicated, then each replication with some N 80 star points can be
considered as an orthogonal block.
For more information on rotatable designs see Herzberg (1964), Herzberg
(1966), Herzberg (1967), Draper and Herzberg (1968).
1 -1 -1 -1 -1 9 -1 0 0 0
2 1 -1 -1 1 10 1 0 0 0
3 -1 1 -1 1 11 0 -1 0 0
4 1 1 -1 -1 12 0 1 0 0
5 -1 -1 1 1 13 0 0 -1 0
6 1 -1 1 -1 14 0 0 1 0
7 -1 1 1 -1 15 0 0 0 -1
8 1 1 1 1 16 0 0 0 1
Hartley's designs
These designs have small number of points and are useful when the experiments are
expensive or time consuming. Hartley (1959) assumed that in the cube part of a
composite design the main effects could be aliased with two-factor interaction because
the star points provide additional information for them. The condition for the cube
portion is two-factor interaction to be not aliased with other two factor interactions.
Hartley's designs are formed according to the rules for central composite designs
with a = ±1. Versions with one (N0 = 1) or zero (N0 = 0) center points are possible.
Table 3.19 shows recommended generators for Hartley's designs. In this table are given
the value oft for the fractional design, the number of observations N and the number of
DESIGN OF REGRESSION EXPERIMENTS 147
model coefficients k as weil. The numbers in the brackets in the column for N are for a
design without center points (N0 =0).
Table 3.20 shows a Hartley's design for 6 factors with one central point and
generators chosen according to Table 3.19.
. fior 6 f:actors
T ABLE 3 20 Hartle I s des1gn
No. XI x2 x3 x4 Xs x6 No. XI x2 x3 x4 xs x6
I -I -I I -I -I I I7 -I 0 0 0 0 0
2 I -I -I -I -I I I8 I 0 0 0 0 0
3 -I I -I -I -I I I9 0 -I 0 0 0 0
4 I I I -I -I I 20 0 I 0 0 0 0
5 -I -I I I -I -I 2I 0 0 -I 0 0 0
6 I -I -I I -I -I 22 0 0 I 0 0 0
7 -I I -I I -I -I 23 0 0 0 -I 0 0
8 I I I I -I -I 24 0 0 0 I 0 0
9 -I -I I -I I -I 25 0 0 0 0 -I 0
IO I -I -I -I I -I 26 0 0 0 0 I 0
11 -I I -I -I I -I 27 0 0 0 0 0 -I
I2 I I I -I I -I 28 0 0 0 0 0 I
13 -1 -I l l l l 29 0 0 0 0 0 0
I4 I -I -I I I I
I5 -I I -I I I I
I6 I I I I I I
Box and Behnken (1960) proposed second order designs formed by combining two-level
factorials with incomplete block designs. They are symmetric (except for I = 6). The
designs for I = 4 and I = 7 are rotatable, the designs for I = 4 and I = 5 can be split into
blocks.
Small composite designs are proposed also by Westlake (1965), Hoke (1974),
and Draper (1985). Saturated designs with N = k are developed by Rechtshafner (1967)
and M. Box and Draper (1974). Note however, that if N = k the degrees offreedom for
the residual variance are zero ( vR =N- k =0) and the adequacy ofthe model can not be
tested.
148 CHAPTER3
Plackett and Burman (1946) proposed two Ievel designs for studying up to
I = N- 1 variables in N runs, where N is a multiple of 4 0 The main effects for these
designs are aliased with many two-factor interactionso
As was noted the centrat composite designs and some other designs given in Section
30703 are symmetrico For many symmetric second order designs the information matrix
FTF has a specific form with many off-diagonal elements equal to zeroo Rearrange for
convenience the terms in model (3039) as follows:
Denote
where r = 1(1- 1) I 2 0
With these notations the information matrix of a symmetric design is of the form
N 0 0
0 All 0
F F= [
T
0 0 BI,
a 0 0
and
N
The lxl matrix H has diagonal elements A and offdiagonal ones Bo 11 and I, are
I x I and r x r identity matriceso
The inverse matrix of FTF is
DESIGN OF REGRESSION EXPERIMENTS 149
s; s;
s 2 (b0 )= d 1 ,s 2 (bJ = d 5 ,s 2 (bJ= d 6 s;, (3 .40)
s2 (bJ = d s; ,cov(bo.bii )= d s; ,cov(bii>bii )= d4s;.
3 2
I 1-1 I I
jl(x) = h0 + :~);x; +
i=l
L L h;1x;x1 + Lh;;X;
i=l j=i+l i=l
2
I 1-1 I I
F= i3z
;;·
R
include third order interactions x;x1x1, X;2 x 1 and X;X~ can be tested even for three Ievel
designs.
Normal and half-normal plots can be used if the effects are independently
estimated. It was shown above that for symmetric designs the estimates of the intercept
and ofthe coefficients h;; are correlated. Box and Draper (1987) give a simple procedure
for orthogonalization of F. We consider it in the context of three Ievel designs. In this
case the matrix F can be orthogonalized through replacement in the model of f.q = x~; by
- 2
f.q = x.;- -2
X; , where q = 2, 3, ... , I and
-2-
X; -
_!_ ~ 2
L...Jxui.
N u=1
After this change the transformed information matrix FrF becomes diagonal and
the corresponding model can be written in the form:
I 1-1 I I
ao = ho + Lh;;X;2 ·
i=l
Y. = LB.L,
i::::l
where ].; = f.; for all terms of the model except for the quadratic ones and
~ ~ ~ -r-
(}1 = a0 , B2 = b1 ,. .. , (}k = b11 . As the transformed information matrix F F is diagonal,
(3.41)
where f; is the i-th column of the transformed matrix F. The sum of squares due to the
effect (}; is equal to
(3.42)
152 CHAPTER3
The proof of(3.42) is similar to the proof of(3.24) which is given in Appendix A.3.1.
The effects estimated by formula (3 .42) can be used for construction of half-
normal plots.
A D-optimal composite design is used. The difference between the highest and
the smallest values of the performance characteristic are very big. That is why
logarithmic transformations are used for both of them. The design and the logged
observations are shown in Table 3.23. The following models are obtained by ordinary
least squares:
y = -1.563 -1.089x
1 1- 0.111x2 - 0.055x3 - 0.052x4 + 0.170x12 -
and
y2 = 3.097- 0.644x1 - 0.184x2 - 0.131x3 - 0.119x4 + 0.019x12 +
DESIGN OF REGRESSION EXPERIMENTS 153
TABLE 3.23. Experimentaldesign and logged Observations for cement raw material slime experiment
The analysis ofvariance for j/1 and j/2 is shown in Table 3.24.
In this section we consider procedures for generating D - optimal designs. They are
based on a simple idea. First an initial design with non-singular information matrix
FrF is chosen and then the design is sequentially augmented with new points, which
maximize the information matrix' determinant at each iteration.
154 CHAPTER3
Suppose that at a given step a design has N points and its non-singular
information matrix is F/FN' Assurne that at the next step an experiment is conducted for
a point xN+l· Denote fN+l a vector offunctions in the regressionmodelthat corresponds
to x N +1 . As is shown in Appendix A. 3 .2 following equations are true:
(3.43)
and
(3.44)
The idea for D-optimal designs construction comes from (3.44). For a given
!F~FN! maximal increase of the information matrix deterrninant is achieved if fN+l is
chosen so that to maxirnize the value of f~+ 1 (F/FNtrN+l· According to (2.44) this
value is proportional to the variance of the predicted response. Therefore, the next
observation must be conducted in the point with highest uncertainty of prediction. This
way the information about the response is increased as much as possible. This idea was
first proposed by Sokolov (1965) and Box and Hunter (1965) in different contexts and
was developed further for D-optimal designs generation by Vuchkov (1968), Vuchkov
and Krug (1969a,b), Kono and Sakaguchi (1969) and Wynn (1970).
The algorithm for generation ofD- optimal designs is:
1. Start with an initial non-singular design with N points and information matrix
F/FN.
3. Compute
information matrix of the initial design is rather high. For more detail on the starting
procedure see Vuchkov (1977).
There are many modifications of the algorithm for D-optimal designs generation,
see for example Fedorov (1972), Mitchell (1974), Silvey, Titterington and Torsney
(1978), Galil and Kiefer (1980), Welch (1984), Yonchev (1988), Vuchkov, Damgaliev,
Donev (1989), Hardin and Sloane (1993). Most of them improve D-efficiency of the
design for a given N as compared with the procedure given above at the price of more
complicated computations. However, all algorithms including the procedure given in this
section very quickly produce designs with D - efficiency in the interval between 0.88 -
0.99. The lower bound of this interval is for !arge number of factors (/ == 7 or 8), the
upper bound is for 2 factors. Table 3.25 shows the D-efficiencies of designs obtained by
the procedure given above for N chosen to provide a reasonable number of the degrees
offreedom for the residual variance.
As more sophisticated algorithms can not improve designs significantly (the upper
bound for D.ff is 1) for practical use we prefer the algorithm given in this section which
is very simple. Tak:ing into account that D-optimal designs are dependent on the model
structure and on the shape ofregion ofinterest (see Section 3.9) achieving very high D-
efficiency may be an illusion in many practical situations. Second and third order designs
developed by use ofthis algorithm are given in Vuchkov et al. (1978).
The composite designs and the other symmetric designs considered in subsections
3.7.2 and 3.7.3 have many attractive properties and they should be used whenever the
practical problern corresponds to the assumptions under which they were developed.
However, the procedure for sequential generation of D - optimal designs has some
advantages in some non-standard situations, for example:
• augmentation of a set of experimental runs with new points,
• generation of designs for models which are not full second order polynornials,
• generation of designs for qualitative and quantitative factors with different
number of Ievels of the qualitative factors,
• generation of designs in regions of interest with irregular shape,
• flexible choice of the number of runs according to the needs of the
experimenter.
There are also other applications of the procedures for sequential generation of
D-optimal designs. Same ofthem are mentioned in the bibliography (Section 3.2).
156 CHAPTER3
The procedure of subsection 3. 8.1 requires search of global maximum of the function
f~+l (F~FN t f N+l which is rather complicated. Therefore, some simplifications of the
search procedures are necessary. One possibility to avoid the global search over the
entire region of interest is to compute the values ·of f~+ 1 (F~FN t f N+l only for the
support points of a continuous D-optimal design. For a given model there might be
several continuous D-optimal designs with different supports. The supports of some
second order designs are found by Kiefer (1961), Farrel, Kiefer and Walbran (1967),
Kono (1962). The most often used set of candidate-points for second order designs on a
cube is three Ievel full factorial design which has 31 points. Another support proposed by
Kono (1962) contains following points: i vertices ofthe /-dimensional cube, li- 1 edge
centroids of the cube and the overall centroid of the cube. Other sets of candidate points
can also be used (Vuchkov, Damgaliev, Donev (1989)).
Second order designs are developed by Vuchkov et al (1978). Vuchkov (1982)
gives camparisans between sequentially generated designs and other designs.
The D-optimal designs generated by sequential procedures are usually
asymmetrical. The experimenter can choose any number of runs N ~ k. Standard
regression analysis procedures are used for estimation and statistical inference.
The design and the observations are shown in Table 3.27. Logarithmic
transformation is used for the specific resistance: y 1 =log10 p.
DESIGN OF REGRESSION EXPERIMENTS 157
TABLE 3.27. Experimentaldesign and Observations for the textile material experiment
No. Proportion of Proportion of Logged specific Electromagnetic wave
CuS04 ,x1 (%) resin, X 2 (%) resistance, y 1 = log 10 p damping,, y 2 (dB)
I I 5.365 0.3
2 -I 6.396 0.7
3 -I 6.216 0.6
4 -I -I 6.585 0.5
5 0 I 5.244 1.6
6 I 0 6.970 1.3
7 0 -I 6.021 1.7
8 -I 0 7.609 1.3
9 0 0 6.333 2.3
10 I I 5.392 0.4
11 1 -1 6.339 0.7
12 -1 1 6.271 0.5
(3 .45)
and
(3.46)
The effect of x1 in the model for j/2 is insignificant and can be ignored.
TABLE3.28. ignifi s
· 1cance test of regresston coeffi'
1c1ents fior modesI (3.45) and (3.46)
Model (3.45) Model (3.46)
Pre- Coeffi- St. t-ratio p Sum Degr. Coeffi- St. t-ratio p Sumof Deg
die- cient devi- of of cient devi- squares of
tor ation squares free- ation free
dom dom
Cons- 6.698 0.043 156.39 .000 - - 2.359 0.037 63.01 .000 - -
tant
XI -0.286 0.020 -14.22 .000 0.686 1 - - - - - -
x2 -0.349 0.020 -17.37 .000 1.322 1 -0.070 0.017 -4.05 .005 0.159 .1
xzI
0.559 0.039 14.15 .000 0.485 1 -1.088 0.034 -31.57 .000 2.994 .1
xlx2 -0.155 0.023 -6.79 .001 0.076 1 -0.099 0.020 -5.04 .001 0.035 .1
x22 -1.099 0.039 -27.28 .000 2.652 1 -0.738 0.034 -21.42 .000 1.202 .1
The statistical analysis of these models is shown in Table 3.28 and Table 3.29.
The critical points of F-distribution at significance Ievel 0.05 are
Fr =F(0.05,5,6)=4.39 and Fr =F(0.05,4,7)=4.12. Both multiple correlation
coefficients are significant and the models can be used for optimization. Analysis of the
corresponding response surfaces is given in Examples 3.12 and 3. 13.
158 CHAPTER3
•
3.8.3. SYMMETRICAL SECOND ORDER D-OPTIMAL DESIGNS
The procedure for sequential generation of D-optimal designs can be started with a
symmetrical design and then small sets of points (blocks) can be sequentially added to
obtain designs with symmetrical structure and information matrix ofthe type (3.40).
Consider an example for two-factor design. Table 3.30 shows the support points
that form a 32 factorial design.
T ABLE 3 30 3 2 facton·a1 desum
.
No. XI x2 No. XI x2
1 1 1 6 0 1
2 1 -1 7 1 0
3 -1 1 8 0 -1
4 -1 -1 9 -1 0
5 0 0
For example, if the experimenter wants to conduct 27 runs he/she must repeat
points I, 2, 3, 4 four times, point 5- three times, and points 6, 7, 8, 9- two times.Other
symmetrical block-designs are given in Vuchkov et al (1978).
Pesotchinsky (1972) proposes symmetrical non-sequential D-optimal designs.
Table 3.32 presents a design of this type for 4 factors. It consists of 6 sets of points as
shown in the table, the total number of points is 42 and the D-efficiency of the design is
0.967.
TABLE 3.32. Sets ofpoints ofPesotchinsky's design for 4 factors
Sets of points Generators Number of points
1. Fractional factorial design with 2 4 - 1 points x4 = xlx2 8
2. x1= 0, Full factorial design for x2 , x3, x4 - 8
3. x2 = 0, Full factorial design for x1, x3, x4 - 8
4. x3 = 0, Full factorial design for .X\,X2,x4 - 8
5. x4 = 0, Full factorial design for x1, x2 , x3 - 8
6. x1 = x2 = x3 = x4 = 0 - 2
Pesotchinsky (1972) gives designs of this type for 5 and 6 factors. Their D-
efficiencies are 0.978 and 0.976, correspondingly.
3.9.1. MODEL-DEPENDENCE
0.5
-0.5
-1
4.94
3
2
1.696
X
X
1 ...
b)
Figure 3.13. Variance contours for two-factor 13 point rotatable design (a) for complete second order
polynomial (3.47) (b) for incomplete second order polynomial (3.48)
DESIGN OF REGRESSION EXPERIMENTS 161
6.27
0.8
0.6
0.4
ix2 002
6.27
-0.5 0 0.5
•1
X
--4
a)
5.63
2.6
-1 -0.5 0 0.5
X
-----4-
b)
Figure 3.14. Variance contours for a 13 point nearly D-optimal design (a) for complete second order
polynomial (3.47) (b) for incomplete second order polynomial (3.48)
3 .14a, while the contours of variance obtained for estimation of model (3 .48) using the
same design are shown in Figure 3 .14b.
The exact D-optimal design for model (3.48) has 6 support points and allocates
measure 116 at each ofthem (Vuchkov and Krug (1969a)). The corresponding variance
contours are shown in Figure 3. 15.
4 4
0.8
0.6
i
0.4
0.2
X] 0
-0.2
-0.4
-0.6
-0.8
~
Figure 3.15. Variance contours for the exact D-optimal design for model (3.48)
The contours given in Figure 3.13a and Figure 3.14a can not be directly compared
because they are obtained for different regions of interest. In the rotatable design the
factors are allowed to vary in the interval [-1.41, 1.41], while for the D-optimal design
they vary in the interval [-1, 1]. The properties of both rotatable and D-optimal designs
are strongly dependent on the choice of the region of interest.
Consider an example. The response depends on two factors: temperature (x;,oc)
and acidity of a solution pH ( xD. The intervals of variation of both factors are as
follows:
i) The region of interest and the operability region coincide and they are de.fined
by (3.49). Hence, experimentation is not allowed outside this region.
DESIGN OF REGRESSION EXPERIMENTS 163
If experimenters want to use a rotatable design they must code the variables so
that their boundary values in natural measuring scale to correspond to coded
valuesa = ±1.414. This is possible if the half-interval of variation is computed by the
following formula:
1( , , )
W; =- X;max - X;o ·
a
1
(1)2 =--(6.6-5.9)=0.495.
1.414
The coded factor Ievels and the corresponding natural values of the factors are
given in Table 3.33. They are computed by the formula
The allocation of points of a D-optirnal design with I3 points for the same
problern is shown in Figure 3.I6b. Two observations are allocated at each of the cube
vertices ( ±1, ±I), one observation is at the edge centers (±I, 0 ), (0, ±I) and one is at the
overall center (0,0). The D-efficiency ofthis design is 0.998.
a) b)
Figure 3.16. Two factor second order designs with 13 points (a) rotatable design, D.ff = 0.476
(b) nearly D-optimal design, D.ff = 0.998
In order to cornpare both designs we rnust use the sarne rnethod of scaling.
Assurne that the natural boundary values of the factors correspond to ± I in coded scale.
Then the "cube" points ofthe rotatable design have coordinates ±1/1.414 = ± 0.707,
whiie the "star" points have coded coordinates (± 1, 0) and (0, ±I). The center point is
with coordinates x1 = x2 =·o. The D-efficiency ofthis design is 0.476.
One can see that the D-efficiency of the rotatable design for this case is rnuch
srnaller than for the nearly D-optirnal one. This result is due to the fact that after shifting
the "cube" points of the rotatable design inside the region of interest the corners of the
square with vertices (±I, 0) and (0, ±I) arenot covered by the design points. In fact one
rnust extrapolate in this area. In contrast sorne of the points of the nearly D-optirnal
design are allocated in the vertices and the whole region of interest is covered by the
design.
Figure 3. I 7 shows the variance contours for two factor rotatable design for
rnodel (3.47). In cornparison with the variance for the nearly D-optirnal design with 13
points (Figure 3.I4a) the variance in the corners of the square defined by inequalities
-I:;:; X; :;:; I for the rotatable design is rnuch higher.
For rotatable designs the higher the nurnber offactors the higher the distance ±a
of axial points frorn the origin. As the operability region in natural rneasurernent scale is
fixed the "cube" points are nearer to the design center. For exarnple, if I = 7 and a
rotatable design with half-fraction is used then the star points are at distance a = ±2.828
frorn the overall center. With N0 = I4 points at the center the rotatable design has 92
runs. The D-efficiency of this design is 0.040. A sequentially generated design obtained
DESIGN OF REGRESSION EXPERIMENTS 165
by use of the procedure of subsection 3.8.1 with the same number of points has D-
efficiency 0.976. The D-efficiencies for uniform precision rotatable central composite
designs with different number of factors in cubical and spherical regions of interest are
shown in Table 3.34. The designs with t = 0 are with two Ievel full factorial design as a
building block, while if t = 1 a halffraction is used.
Figure 3.17. Variance contours for two-factor rotatable design for model (3 .47) and operability region
defined by (3.49)
ii) The region of interest is a sphere with a given radius. Suppose we are not
interested in the response at the corners of the cube and want to explore the response
surface only inside a sphere with radius a. Table 3.34 shows that the uniform precision
rotatable designs have much higher D-efficiencies when they are used for a spherical
region of interest.
For spherical regions of interest the rotatable designs given in Table 3.17 have
very good predictive properties. For instance a rotatable design with two factors has D-
efficiency 0.969 for spherical region ofinterest, i. e. this design is nearly D-optimal. The
166 CHAPTER3
D-efficiency for the rotatable designs with larger number of factors is also very high for
spherical regions ofinterest. They are shown in Table 3.35.
iii) The operability region is /arger than the region of interest. In this case
experiments outside of the region of interest defined by the inequalities (3.49) are
allowed. The star points of a rotatable design can be chosen outside of this region. The
variance contours for a two factor rotatable design in this case are shown in Figure
3.13a.
DESIGN OF REGRESSION EXPERIMENTS 167
X
1 ...
Figure 3.19. Variance contours for a second order nearly D-optimal design on a cube with
vertices at ± 1 • Region of interest defined as -1.41 < X1 , x2 < 1. 41
The variance contours for a nearly D-optimal design for the same case are shown
in Figure 3 .19. The rotatable design provides better prediction at the center because 5
points are allocated at it. It has of course the advantage that the information is uniformly
distributed in this area. However, the prediction provided by the D-optimal design is
better at the borders of the region of interest.
The experimenter must not forget that the extension of the region of interest
through choice of larger intervals of variation for the D-optimal designs or !arge a for
the rotatable designs may make the model more complicated.
The dependence of the design properties on the model assumption and on the
region of interest shows that the initial choice of a design may not be optimal. The design
can be improved using a sequential procedure in which the runs are taken in blocks, the
data are analyzed after each block and the next block is generated taking into account
this analysis.
168 CHAPTER3
U sing a second or third order model the engineer can explore the response surface within
the region ofinterest and find the optimal product or process parameters. Well-developed
optirnization methods can be used. They are given in many books, for example
Luenberger (1989), Reklatis et al. (1983), Gill et al. (1981), M. J. Boxet al.(1969). There
are many software packages for optimization, for example MATLAB optirnization
toolbox, IMSL Math/Library, NAG-Libniry, Conn et al. (1992), Ladsan et al. (1978),
Liebman et al. (1986). However, engineers usually want to know more about the product
or process than they can obtain from a formal optimization procedure. Engineering
decisions often are made taking into account considerations which can not be easily
formalized through models. Such considerations are for example company's policy with
respect to vendors and customers, varying prices of components and raw materials,
existence of stocks of components and raw materials, time and expenses for
transportation of raw materials, etc. That is why the engineers often prefer to have a set
ofvariants for discussion, rather than only one "optimal" solution. The problern is how to
select variants which are worthy to be considered further. The models obtained through
design of experiments can be very useful in this respect. They make possible development
of tables that show the predicted values of one or several performance characteristics for
different sets of product/process parameters. They can also be used for plotting contours
of constant values of the performance characteristics which are very useful for a visual
interpretation of the properties of the product.
1\
'
/
An often-used method for development of variants is the grid search. Though time
consuming, it provides sufficient information about the performance characteristics. The
idea of this method is shown in Figure 3. 20 for two factors, x1 and x2 • The procedure
starts with computation of the predicted response y(x1 , x2 ) for constant value of x2 and
increasing values of x1 by steps Ax1 . When x1 becomes equal to its upper Iimit, x2 is
increased by a step Ax2 and x1 is varied again by steps from its minimal to its maximal
values. The values of y(x1 , x2 ) are compared at each step, the best values are arranged
and stored together with the corresponding values of x1 and x2 • This way the response
function is computed for the knots of a grid and the best of the obtained responses are
selected for further discussion.
This method can be applied for more than two factors. For !arge number of
factors and small steps the computations grow too fast. The number of grid knots is
q1q2... qP where q1,q2, ... ,q1 are the numbers ofvalues given to xpx2, ... ,x1, respectively.
For instance, for three factors (/=3) with 11 values each (q1 = q2 =% = 11) the number of
response function computations is 113 = 1331. For more factors the amount of
computations rapidly increases.
If the number of combinations of factor Ievels is great the interpretation of the
results is difficult. Eliminating all factor combinations for which the response does not
meet the standards can decrease the number of variants for discussion. If there are still
too many variants, the engineer can intentionally tighten the intervals in which the
responses are allowed to change. Another way of decreasing the number of combinations
is to apply grid search successively with different length of the steps. First the steps of the
factors are chosen !arge so that the number of factor values combinations is small. After
identifying the most promising areas within the region of interest, new iterations with
smaller steps can be carried out for a more precise computation of the optimal factor
values. The parameter sets and their corresponding performance characteristic's values
which are selected are shown on the computer's screen or printed for further discussion.
Generally, if the function is too complicated, the grid search can omit very sharp
peaks, especially if the steps are not small enough. However, for second order
polynomials which are the most used models in response surface methodology such sharp
peaks do not exist. Depending on the step, the solution found by the grid search may be
more or less close to the stationary point of the response surface. We should remernher
however, that the grid search is only the first step of the optimization procedure. A
detailed exploration of the response surface in the neighbourhood of the point chosen by
the grid search can be done by contour plots or canonical (ridge) analysis ofthe response
surface (see subsection 3.10.2).
Contours of constant values of the response are very useful for making
engineering decisions. They can help engineers to see the direction in which the parameter
changes are most useful and to understand the behaviour of the product or process
performance characteristics. Contours of several performance characteristics can be
superimposed on each other to identify areas of admissible values of product parameters.
170 CHAPTER3
In cases with more than two factors the usual practice is first to find some optimal
factor values through optimization procedures and then to fix all factors except two on
their optimal values. The obtained contours are sections of the response surface but
nevertheless they are very useful for the engineers. One can make as much as necessary
different sections and can "see" the behaviour of as many performance characteristics as
necessary. In case of several factors the canonical analysis which is considered in the next
section can also be very useful.
The experimenter should keep in mind that the predicted optimal factor values, as
weil as the contours are not exact because estimates of the model coefficients are used.
That is why the "optimal" solution is a random vector. The true optimal solutionlies in a
region around the best point found by a regression model. Box and Hunter (1954)
proposed a formula for computation of 100(1- a)% confidence region on the stationary
point location for a second order response surface. We consider this problern in details in
subsection 3.10.3.
Sometimes the product has more than one performance characteristic. Let the
number ofthe performance characteristics be r and Iet they be denoted yPy2 , ... ,y,. Their
optimal values are usually obtained for different factor values. Figure 3.21 shows an
example with two performance characteristics y 1 and y 2 that depend on one factor x.
Suppose that we want to maximize y1 and y 2 . One can see that the optimal factor value
x1opt for y 1 corresponds to poor values of y 2 and vice versa. In this case it is better to use
some compromise performance characteristic Yc· The optimal factor value xcopt for the
compromise characteristic is not the best neither for y 1 nor for y 2 but it provides some
reasonable compromise solution.
A compromise solution can be obtained using the so-called desirability .functions
Values dj called desirabilities are juxtaposed to some given values of the response yj.
The values of dj vary from 0 to I, with zero corresponding to the undesirable values of
the performance characteristic and 1 - to its most desirable value. The choice of dj is
subjective and depends on the engineer's judgment. For example, engineers can specity
what value of the performance characteristic is considered as undesirable, bad,
DESIGN OF REGRESSION EXPERIMENTS 171
satisfactory, good and very good. Then they assign to them desirability Ievels as shown in
Table 3.36.
The values of D are in the interval between 0 and 1. The higher the desirability
values of the performance characteristics the higher the generalized desirability D.
Contrary, D = 0 if at least one value of d; is zero. This is due to the fact that if at least
one of the performance characteristics is out of specifications then the product is
defective.
In the region of interest there might be areas where D = 0, i.e. this function may
be discontinuous. Forthis reason it is not recommended to fit regression models for D.
Instead, regression models can be obtained for each performance characteristic and then
D to be computed during the grid search. Gradient methods for optimization may be
unsuitable because ofthe possible discontinuity ofthe generalized desirability function.
The values of D are printed in tables of variants. The computer program can be
organized to select a number of parameter sets with highest values of D which to be used
by engineers for discussion.
The desirability functions must be used cautiously. As we noted, their choice is
subjective. Rarrington (1965) and Derringerand Suich (1980) propose other methods for
choosing desirability functions. Polynomial approximation of the desirability function can
be used for one-sided transformation of the response. If the response has both lower and
upper Iimit two-sided desirability function should be used (Derringer and Suich (1980)).
An unrealistic definition of desirabilities can be a cause for a poor solution of the
optimization problem. In particular this can happen if for some performance characteristic
the value d; = 1 is assigned to a value of Y, which can never be obtained in practice.
Therefore, it is better to use the desirability functions for selecting variants for further
discussion, rather than for making final decisions.
172 CHAPTER3
and
d 2 = -0.8703 + 0.4026ln Y2 - 0.001283(ln YzY.
Maximum ofthe generalized desirability D = ~d1 d2 is found through grid search.
The best six values obtained are shown in Table 3.38.
DESIGN OF REGRESSION EXPERIMENTS 173
-0.5 X 0 0.5
opt
Figure 3.22. Contour plot for generalized desirability Das function of X2 and X3 for XI = X4 = -1
The contour plots for ~ and Ji are shown on Figure 3.23a and Figure 3.23b,
respectively.
174 CHAPTER3
0.8
0.6
0.4
r: -0.2
0.5
0.6
r:
0.4
-0.2
-1 L-~----~~--~--~~~~~~~
-1 -0.5 X 0 0.5
opt
Figure 3.23. Contour plot for the performance characteristics as function of X 2 and X3 for
x1 = x4 =-1. (a) plot for the structural viscosity ( ~)
(b) contour plot for the ultimate shearing stress ( ~ )
•
DESIGN OF REGRESSION EXPERIMENTS 175
Useful method for exploring second order response surfaces is the so called canonica/
analysis of the models.
A second order polynomial of the form
I 1-1 I I
y(x)=bo + Lb;x; + L Lb;,x;xi + Lb;;X;2 (3.50)
i=l ;=i+1 i=l
(3.51)
where x = Y Y
(x1 x2 • • • x 1 is l vector of factors, b = (b1 b2 . • • b1 is l vector of
coefficients in the linear terms of (3.50) and Bis l x l matrix with elements
biiifi = j
[Bl ={ -1 b 1if.1* J. .
2 I)
It is known (Gantmacher (1959)) that the derivatives ofthe scalar product oftwo
vectors xrb and of a quadratic form xrBx with respect to vector x are
and
dxrBx = 2Bx.
dx
o.Y
-=b+2Bx=O. (3.52)
OX
1 ~]
xs = --B
2 b (3.53)
I76 CHAPTER3
or
I
Bx s =--b
2 . (3.54)
(3.55)
Further on we consider two canonical forms of (3. 50). Let the eigenvalues of B be
denoted ~ , it2 , ... , it1 . They are defined as roots of the following characteristic equation:
IB- itii = 0.
Define also the eigenvectors t; that satisfY the equation
(3.56)
BT=TA. (3.57)
Using (3.57) we can write Bin the form B = TATT. Substitutins this matrix in (3.5I) we
obtain
(3.58)
Denote
(3.59)
or
x=Ts. (3.60)
(3.6I)
DESIGN OF REGRESSION EXPERIMENTS 177
where r = TTb.
The stationary point in (-coordinates can be obtained from (3.53) and (3.59) as
follows:
response surface.
4 °1
I
I
I
I
X
2s '8,~
1;; .,._ - ----=2•------'----------.
2 X
Js
W e can shift the center of the coordinate system at the stationary point by the substitution
(3.62)
or
(3.63)
negative then y has maximum at the stationary point and for positive definite matrices B
the stationary point corresponds to the minimum of y. The values of Y; in the A-
canonical form characterize the slope of the response surface in direction of Ö;- axis,
while A; depends on the curvature in this direction.
where
0 ) ( 1 088 -0.0498).
bo = 2"359 ' b = ( -0.07 and B = --0.~498 -0.738
T=(-0.138112 0.990417)·
0.990417 0.138112
Compute the vector
and
DESIGN OF REGRESSION EXPERIMENTS 179
.Y.=b0 +2.x~b=2.359+2.(o.002188
2 2
-0.047822)( 0 )=2.36029.
-0.07
0.8
0.6
0.4
~2~2 ..
-0.2-
-0.4
-0.6
-0.8
-1
-1 -0.5 0.5
Figure 3. 25. Contours for the logged electromagnetic wave damping coefficient
Figure 3.25 shows the contours for .Y(S') and y(o). They are ellipses with
maximum at the stationary point, because both coefficients in second order terms of these
equations are negative.
If all eigenvalues are non-zero but some of them are positive, while others are
•
negative, the contours ofthe response surface have a saddle point (Figure 3.26).
The eigenvalues of B are 2 1 =-1.1023 and 2 2 =0. 5622 and the matrix of eigenvectors is
T = (0.0467 0.9989 J.
0.9989 -0.0467
X
s
=( -0.1753
0.2315 J
0.8
0.6
0.4
-0.8
-1 -1 -0.5 I 0 ; 0.5
X
--4
Figure 3.26. Cantours for the logged specific resistance with saddle point
•
DESIGN OF REGRESSION EXPERIMENTS 181
Consider again the elliptical contours on Figure 3.24. If one of the eigenvalues
(for exarnp1e, A2 ) is very small, while the other (A 1) is !arge then the maximum is
attenuated in the direction of 82 . In the Iimit case when A1 = 0 the contours are straight
lines parallel to 82 and the response surface is stationary ridge (Figure 3.27).
Real response surfaces are rarely of this form because the models are obtained
from data subject to errors and the eigenvalues might be small but they are usually not
exactly equal to zero. However, the contours may be so attenuated in one direction that
the response surface may be practically looking like stationary ridge.
Example 3.14.
Consider the following model
The eigenvalues of B are A1 = 0. 0018 7 and A2 = 1. 323 1. As they are very different in
magnitude the response has a ridge. The matrix of eigenvectors is
T = (- 0.4109 0.9117)
0.9117 0.4109
and Ys = 3.1905.
182 CHAPTER3
0.8
F\t;
0.6 ~/ 'öl
\
r
\
0.4 \
\
\
\ /
x2 0 \/ /
"\
\
-0.2 \
\
-0.4 \
\
-0.6 \
\
-1
-1 -0.5 0 0.5 1
X
~
Figure 3.28. Stationary ridge for the model ofExample 3.14.
•
3.10.3. CONFIDENCE REGION ON THE LOCATION OF THE STATIONARY
POINT
Suppose that a second order model of the form (3 .51) is fitted to data and the
coordinates ofthe stationary point are computed using equation (3.53). Box and Hunter
(1954) found that the 100(1- a )% confidence region for the stationary point consists of
all points that satisfy following inequality:
(3.65)
DESIGN OF REGRESSION EXPERIMENTS 183
where
;r = b + 2Bx
o=_2::'_
iJx '
In the general case V8 may be difficult for computation. However, for the widely used
symmetric second order designs, which have a special structure of the C matrix given in
subsection 3.7.4, the elements ofV8 can be computed relatively easy. Let the matrix Cis
As is shown in Appendix 3.4 the i-th diagonal element of V8 is the variance of t1;
and can be estimated as follows
Carter, Chinchilli, Myers and Campbell (1986) and Peterson (1993) consider
confidence intervals in connection with ridge analysis. They propose confidence intervals
for the eigenvalues of matrix B which play an important role in the ridge analysis.
Stablein, Carter and Warnpier (1983) and Myers and Montgomery (1995) considered
ridge analysis with confidence intervals.
3.11. Bibliography
There is a vast literature on design of regression experiments. Here we note some texts
that are relevant as additional reading on the problems considered in Chapter 3. Classical
books on principles of design of experiments are Fisher (1966) and Cox (1958).
Response surface designs are given in Box, Hunterand Hunter (1978), Box and Draper
(1987), Khuri and Cornell (1987), Montgomery (1991), Myers and Montgomery (1995).
The theory of optimal designs is considered in Fedorov (1972), Vuchkov (1978), Silvey
(1980), Bandemer and Näther (1980), Kiefer (1985), Pazman (1986), Ermakov and
Zhigliavsky (1987), Atkinson and Donev (1992), Pukelsheim (1995). Books on
applications of experimental designs to industry and science are Nalimov and Chernova
(1965), Box and Draper (1969), John (1971), Adler, Markova and Granovsky (1975),
Ghosh (1990), Mead, R. (1990), Deming and Morgan (1993), Goupy (1993).
Bibliographies on experimental design are written by Herzberg and Cox (1969),
Atkinson (1982), Atkinson (1988), Myers, Khuri and Carter (1989).
Vuchkov et al. (1978) and Nalimov (1982) developed catalogues of experimental
designs. Specialized software packages for design and analysis of response surface
experiments areDESIGNEXPERT and ECHIP.
and
DESIGN OF REGRESSION EXPERIMENTS 185
(2.46)
Using (A.2.4.1) and (A.2.4.3) one can rewrite this equation in the following form:
u::::l
As y= FB and FrFB= Fry the above equation can also be written as follows:
u::::l
or in scalar form:
z>:
N k N
= I~I.t.IYu +QR.
u=l i=l u=I
N N
LXu;Yu LXu;XuJYu
b =""""="-'-!_ _ or b =""""~=..!,_! _ __
I N I] N
Suppose that at a given step a design has N points and its nonsingular information matrix
is F/FN. Assurne that at the next step an experiment is conducted at a point xN+I'
Denote f N+I a vector of fi.mctions in the regression model which corresponds to xN+I·
Then following equations are true:
(3.43)
and
(3.44)
186 CHAPTER3
(A.3.2.1)
H=(~ :}
then following relationship exists:
(A.3.2.2)
(3.51)
(3.62)
where
(3.53)
b = -2Bx•. (A.3.3.1)
As (= Trx, B = TATr and r = Trb, equation (A.3.3.1) can also be written as follows:
b = -2TATr x. = -2TA(.,
or
(A.3.3.2)
As x. = T(. we obtain
or
(A.3.3.3)
o
In the coordinate system 81 , 8 2 , ... , m all coordinates are transformed using the
o o
equation = (- (., or ( = + (.. Substituting ( in the A-canonical form one obtains
(A.3.3.4)
1
At;.=-- y. (A.3.3.5)
2
(A.3.3.6)
We shall prove that the elements of matrix V8 = E{o- E(o}lo- E(o)f} for symmetric
second order designs are:
(A.3.4.1)
and
(A.3.4.2)
where d3 ,d4 ,d5 and d6 are elements ofmatrix C, defined in subsection 3.7.4 as follows:
(A.3.4.3)
The covariance terms in this equations are zero because b; and bij are not correlated with
any other coefficient. The variance terms can be computed taking into account that
(2.42)
and
DESIGN OF REGRESSION EXPERIMENTS 189
(2.43)
Substituting estimates for the variances and taking into account the definition of
matrix C we obtain
Removing the brackets and taking into account that for the covariance between two
random variables ~ and BJ is
after some tedious algebraic manipulations one can obtain the following result:
•
CHAPTER4
4.1. Introduction
Since 1982 the so-called Taguchi method for off-line quality control has attracted the
attention of many engineers and statisticians all over the world. Many successful
applications were reported and high quality products were developed at low cost. At the
same time many statisticians expressed some concems about the methods for design of
experiments and data analysis used by Taguchi. This initiated research aimed at
integrating the Taguchi's engineering ideas with well-established statistical methodology.
This process is still going on and many debates on Taguchi method can be found in the
specialized engineering and statistical joumals.
In this chapter we describe the main ideas of the Taguchi method. They are used
as a basis for development of a model-based approach to quality improvement which is
presented in the next chapters of this book. Therefore, our aim is not to give a detailed
description ofTaguchi's approach, such description can be found in Taguchi (1986a) and
Taguchi (1987). Many books and papers by other authors present details and
applications of Taguchi method. References to most popu1ar ones are given in Section
4.15.
Consider a product coming from three different production lines, say 1, 2 and 3. W e are
interested in the same performance characteristic y, the desired target value is -r , the
lower and upper specification limits (LSL and USL) are the same for all three lines. The
probability density functions of the performance characteristic for lines 1, 2 and 3 are
shown in Figure 4.1. The values of the performance characteristic y are within the
specification limits for all three lines. However, the quality of products coming down
from these lines is not the same and the customers will probably discover this. The mean
value of y is centered on the target -r for distributions 1 and 3 but a larger part of the
products have performance characteristic y closer to -r for the production line 3 rather
than for 1. The values of y are almost uniformly distributed within the interval (LSL, USL)
for production line 1, while a large part of the production of 1ine 3 has values of y closer
to -r. The mean value of y for production line 2 is shifted to the left from T and therefore,
190
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 191
great percentage of the products have lower than the target value of y. The quality of
products coming out from line 3 is superior to those from lines 1 and 2.
f(y)
LSL USL y
Figure 4.1. Probability density functions of Observations from three production lines
Taguchi takes into account these differences defining a /oss .function as follows:
(4.1)
l(y)
AB
LSL 't USL y
If one takes into account only the specification Iimits it can happen that two
products A and B have very similar performance characteristics, while A is rejected as
defective and B is accepted because y is within specifications for this product. The loss
192 CHAPTER4
function has a little bit greater value for A than for B. This reflects the fact that the
quality of A and B is not much different.
For calculation ofthe constant kc in (4.1) Taguchi proposes the following idea.
Suppose that the specification Iimits are as follows:
LSL = r-A,
USL= -r+A,
and the customers interval is (r- A, r + A). Denote by C the cost to the customer for
repairing or discarding a product that is outside this interval. Putting these notations in
(4.1) we obtain
and
There are two problems with the practical application of the loss function. One of
them isthat often it is not easy to define the cost C and therefore, kc is not easy to be
calculated. The second problern is that the target value r is often unknown or
unknowable. Sometimes one can choose an ideal value of r which is in fact unattainable.
In many cases the performance characteristic y can take only positive values
(y > 0). For such characteristics Taguchi defines the loss function for the smaller the
better case (when one wants y tobe as small as possible) and the /arger the better case
(when a value ofy as large as possible is wanted).
For the smaller the better case we can put r= 0 and the loss function becomes
l(y)= k;.
y
Y]
L is usually called expected lass, while E[(y- T is called mean squared error (MSE).
It is shown in Appendix A.4.2 that the expected loss can also be presented in the
form
(4.3)
where 17 is the expectation of the performance characteristic (77 = E(y )) , r is the desired
target value and o; is the variance ofthe performance characteristic:
(4.4)
One can see from (4.3) that there are two sources that form the expected loss:
• The deviation of the performance characteristic's mean value from the target
described by the term (77- T Y'
• The variations of the performance characteristic around its mean value
characterized by the variance cJ
y"
It is interesting to note that Taguchi considers the loss imparted by the product to
the society. He says (Taguchi (1986a), p.l):" Quality is the loss a product causes to the
society after being shipped, other than any Iosses caused by its intrinsic functions ".
We consider the design stage of the product/process life cycle. Taguchi divides it into
three parts: system design, parameter design and tolerance design.
System design is in fact the conceptual design of a product or process which is
based on underlying engineering knowledge. It starts with certain scientific or
engineering idea and comes to completion with the development of the product
prototype. The conceptual design has nothing in common with the statistical design of
experiments.
Product design 's objective is to choose values of product parameters that ensure
minimal variability of the performance characteristics, while keeping its mean value on a
target. This rninimizes the expected loss as weil. As 17 and o;in (4.3) are functions of
product/process parameters they can be chosen to minirnize the expected loss.
To study how the product or process parameters affect the values of 17 and o;
one can use design of experiments. This stage comprises some optimization procedures
194 CHAPTER4
as weil. The product design is the most important tool for quality improvement at low
cost.
If the variance reduction obtained by parameter design is insufficient one can go
to the next stage called by Taguchi tolerance design. That means tightening the
tolerances of the most influential parameters or external noise factors to reduce the
performance characteristic's variability. However, keeping the product parameters within
narrow tolerance Iimits means using high quality elements or raw materials or using
automatic control systems. This could be expensive and therefore, tolerance design must
be applied only if the parameter design can not provide satisfactory results. In this
connection Taguchi (Taguchi (1986a), p.79) says:" Narrow tolerances should be the
weapon of last resort, to be used only when parameter design gives insufficient results
and never without careful evaluation of the loss due to variability. Cost calculations
determine the tolerances."
The idea of the parameter design is simple. In order to study the joint effects of product
or process parameters and noise factors one can conduct experiments in which the Ievels
of product or process parameters are taken in different combinations with the Ievels of
the noise factors. The data obtained from these experiments can be used to predict the
best parameter values.
Taguchi uses designs called orthogonal arrays. We discuss their properties in
Section 4.5. Taguchi's parameter design is a cross product of two orthogonal arrays -
one for the product or process parameters and another for the noise factors (errors in the
factor Ievels or external rtoises). The first orthogonal array is called parameter design
matrix, the second one - noise matrix. The experiments of the noise matrix are repeated
for each row of the parameter design matrix. Therefore, both product parameters and
noise factors must be controllable during the experiment. Sometimes this is not the case
with the noise factors and repeated observations are used instead ofthe noise matrix (see
Example 4.2).
For example, consider a product with three parameters p 1 , p 2 , p 3 , whose
performance characteristic depends on two noise factors: n1 and n2 . The full factorial
designs at two Ievels, considered in Chapter 3, areorthogonal arrays. We can choose the
parameter design matrix to be a two-level full factorial with 23 = 8 runs and the noise
matrix a full factorial design with 2 2 =4 runs. The parameter design for this case is
shown in Figure 4.3 and Table 4.1.
The example given in Figure 4.3 and Table 4.1 is very simple. In many cases the
number of product parameters and noise factors is high, the number of factor Ievels can
also be higher than two Ievels (for example, 3, 4, 5, etc.).
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 195
No "I '2
I -I -I
2 I -I
3 -I I
No 1'\ Pz 1 3 f 4 I I
I -I -I -1
2 I -1 -I
3 -1 1 -1 •
4 I I -I •
5 -1 -1 1
•
I~
1 -I 1
-I I I
18 1 I I
~I -1 -I
Parameter design 2 1 -1
matrix (NP runs) 3 -I 1
4 1 1
Noise design
matrix (Nn nms)
Figure 4.3. Cross product design for three product parameters and two noise factors
If the number of runs in the parameter design matrix is equal to NP and of the
noise matrix is Nn then the number of runs for the Taguchi's product design is equal to
N = NpNn. In the example ofTable 4.1 NP= 8,Nn =4 and N = 8.4 = 32.
There are some differences in the notations used in the literature on combinatorial
designs and on response surface designs. In the combinatorial design literature the
factors are usually denoted by Latin letters: A, B, C. ... and their levels - by consecutive
integer numbers: 1, 2, 3, ....
In the response surface methodology the notations of factors are usually
xi , x 2 , x 3 , ... ( we shall also use PI , p 2 , p 3 , . . . or ni , n2 , n3 , ... ) and the levels of factors are in
the intervals -1 :s; X; :s; 1, i = 1, 2, ... ,/.
Table 4.2 showsanorthogonal array for 4 factors, each ofthem having 3 levels,
while Table 4.3 shows the same design written in response surface notations. The levels
1,2,3 in Table 4.2 correspond to -1,0, 1 in Table 4.3.
A design matrix is called orthogonal array if for every pair of colurnns each
combination of factor's levels appears the same number of times. In Tables 4.2 and 4.3
each combination oflevels appears only once for every pair of colurnns.
When the factors are coded like in Table 4.3 an indication of orthogonality isthat
the information matrix FrF is diagonal. For example, if we consider Table 4.3 as a F-
matrix, the corresponding information matrix is
G=F T F= 0 0 6 [~ ~ ~ ~ I
0 0 0
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 197
In Taguchi's designs the columns of the orthogonal arrays are used for studying
the main effects of the factors. If there are free columns, they can be used to estimate the
interaction effects. Here the problern is that experimenters must know in advance which
interactions are significant. U sually such information is not available.
Taguchi says that the interactions can be eliminated through a good engineering
reformulation of the problern and especially of the response function. This is a point of
disagreement with most statisticians. In general a reformulation of the problern in order
to eliminate the interactions is a very difficult task. U sually the engineering knowledge
before the experiment is insufficient to allow such transformation of the response.
Some of the designs, which we considered in the previous chapters, are
orthogonal arrays. For example, two-level full and fractional factorial designs given in
Chapter 3 are orthogonal arrays. A thorough examination of Table 4.2 shows that this
design is in fact a Graeko-Latin square (compare Table 4.2 with Table 2.14). The so-
called Plackett-Burman designs are orthogonal arrays as weil. A great number of
orthogonal arrays are developed and catalogued, see for example Taguchi (1986a, 1987),
Logothetis and Wynn (1989). Taguchi and Konishi (1987) catalogued orthogonal arrays.
Special notations are used for orthogonal arrays. For example, the notation
L9 (3 ) in Table 4.2 means that this is a design with 9 rows (9 experimental runs) for 4
4
factors, each factor with 3 Ievels. The notation ~ 8 (3 7 x 2 1 ) refers to a design with 18
rows, 7 factors changed at 3 Ievels and one factor at two Ievels.
It is important to consider the aliasing of the effects in an orthogonal array. For
example, consider the L8 (2 7 ) orthogonal array shown in Table 4.4. Putting -1 for 1 and
1 for 2 one can see that this design is a 1116 replication of a two-level full factorial
design. The aliasing of this design is studied in Chapter 3.
At the bottom ofTable 4.4 are written all two factor interactions aliased with the
factors corresponding to the array columns. They can also be written in the form of an
upper triangular matrix as shown in Table 4.5. In the catalogues the orthogonal arrays
are usually given tagether with this upper triangular matrix of interactions.
For example, if one wants to study the aliasing of factor C (column 3 of Table
4.4) he/she can see froro Table 4.5 that entry 3 can be found on the crossing of 1 and 2
(A x B), 5 and 6 (Ex F) and 4 and 7 (D x G). Therefore, the factor C is aliased with
A x B, E x F and D x G.
The saroe orthogonal array can be used for less than 7 factors, using the free
colurons for studying soroe interactions. Suppose that there are 4 factors of interest: A,
B, C and D. We can assign thero to colurons I, 2, 4 and 7 ofthe Lg(2 7 ) array as is shown
on Table 4.6. In this table the free columns are assigned to the interactions A x B, A x C
and BxC.
The use ofthe orthogonal array L8 (2 7 ) given in Table 4.4 for studying the roain
effects roeans that all interactions are supposed to be negligible. Sirnilarly the interaction
A x B can be studied using the Ls (2 7 ) orthogonal array of Table 4.6 if the interactions
D x G and Ex F are insignificant. In subsection 4.9 is shown that Taguchi's roethods for
data analysis are based on the independent interpretation of the roain effects. This is the
reason why Taguchi insists on the elirnination of interactions.
This is a resolution IV design in which no main effect is aliased with any other main
effect or with any two-factor interaction, but two factor interactions are aliased with
each other.
The orthogonal arrays can comprise factors with different number of levels. Table
4.7 shows an ~ 8 (2 1 x 37 ) orthogonal array in which one ofthe factors has two levels and
7 factors have 3 Ievels.
lt is not always possible to study the interactions with orthogonal arrays. For
example, the only interaction which can be interpreted when ~ 8 (2 1 x 37 } airay is used is
that between the first two columns. There are some orthogonal arrays for which the
study of interactions is not at all possible, for example ~ 2 (2 11 } and ~6 (2 11 x3 12 ).
However, for the most of the orthogonal arrays triangular matrices of interactions are
available.
200 CHAPTER4
Changing factor Ievels is not always equally easy for all factors. Let us consider for
example an experiment in which there is one factor named equipment and which must be
changed at two Ievels (type I and type 2). There are 7 other factors, one of which is
measuring device (types l, 2, 3), the remaining 6 factors are at 3 Ievels each and it is
relatively easy to change their Ievels (for example, they are voltages and resistances of
some potentiometers). Forthis experiment one can apply the orthogonal array given in
Table 4.7. A good choice is to assign the factor equipment to the first column of the
array, not only because it has two Ievels but also because the change of its Ievels during
the experiment is most difficult and the Ievels in the first column are changed only once.
The next difficult to change factor is measuring device. We assign it to the second
column of the array because the changes of the Ievels in this column happen more rarely
than for the other columns of the array. The Ievels of the other factors are equally easy to
change and it doesn't matter to which column (3 to 8) they are assigned.
Split plot design is a design in which additional treatments are introduced by
dividing each plot into two or more portions. In the example considered above the
difficulty ofchanging the factor Ievels was considered as "additional treatment".
In Taguchi's experiments split plot (or split unit) designs are used when there are
factors with Ievels which are difficult for change. The factors are divided into groups of
order I, 2, 3, etc. depending on their changing ability. The highest order group includes
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 201
the easiest to change factors. The groups are indicated at the bottarn of the orthogonal
array (see Table 4.7).
4. 7. Linear graphs
Taguchi uses linear graphs to denote the changing ability of the factors and to show
which factors and interactions can be studied through a given design. For this purpose he
uses the following symbols:
0 for indication of colurnns of group 1,
~ for indication of columns of group 2,
® for indication of colurnns of group 3,
• for indication of columns of group 4.
Consider once again the 4,(2 7 ) orthogonal array given in Table 4.4. The linear
graphs corresponding to this design are given in Figure 4.4.
7
3 5 @
I
\
\
\
'
2 6 4
a) b)
Figure 4.4. Linear graphs for orthogonal array 4, (2 7 )with 4 factors and 3 interactions (a) for the
design ofTable 4.6 (b) for the design ofTable 4.8
Figure 4.4 shows that if we have 4 factors: A,B,C,D there are two possible
ways to assign them to the orthogonal array colurnns. The first one corresponds to
Figure 4.4a and to Table 4.6. If factor A is most difficult to change, it is assigned to
column 1, the next difficult to change factor Bis assigned to column 2 and the factors C
and D, which are equally difficult to change, are assigned to columns 4 and 7,
respectively. The interaction between the factors in the columns 1 and 2 (A x B) is
202 CHAPTER4
The graph on Figure 4. 4b indicates another assignment of the factors among the
columns which is shown in Table 4.8. The only difference with the above given design is
that in Table 4.6 column 6 corresponds to the interaction Bx C, while for the design in
Table 4.8- to AxD.
Conducting experiments according to the parameter design shown in Figure 4.3 one
obtains N = N pNn values of the performance characteristic and has to consider them
when making decisions. Specialperformance measures (or perjormance statistics) are
used in Taguchi method for estimating the factors' effects.
As a performance measure Taguchi uses signal-to-noise ratio which is defined as
follows:
(4.5)
where
(4.6)
are estimates ofthe mean value and the variance ofthe performance characteristic.
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 203
The signal-to-noise ratio is computed for every row ~ = 1,2, ... ,N) of the
parameter design matrix and the computations are based on the results of experiments
obtained from the noise design matrix (u = I,2, ... ,NJ.
This signal-to-noise ratio is used when a specific target value is best. To obtain a
robust product one can maximize the signal-to-noise ratio (i.e. to minimize the variance),
while keeping the mean value on a target. W e consider this problern in the next section.
Note that the signal-to-noise ratio has the meaning of a square of the inverse of
the variation coefficient which is defined as a I 17, a being the standard deviation and 17
- the mean value of a random variable y. The logarithm and the scale coefficient I 0 in
(4.5) do not change this interpretation.
The signal-to-noise ratio takes into account both components ofthe expected loss
in (4.3). However, as the target value r is not explicitly included in ; some special
considerations are needed depending on the definition of r.
For the smaller the better case (see Section 4.2) Taguchi recommends the
following definition ofthe signal-to-noise ratio:
N.
; = -Iolog[ -L-
1 I ] N.
2 (4.9)
N. u=I Yu
Taguchi has defined many other signal-to-noise ratios appropriate for various
special cases. The definitions given above are the most used ones.
204 CHAPTER4
Our discussion in this section will be illustrated by a parameter design for a product with
4 parameters p 1 ,p2 ,p3 ,p4 and 2 noise factors: n1 and n2 . Assurne that a L9 (3 4 )
orthogonal array is used as a parameter design matrix (see Table 4.2) and for the noise
design is chosen a full factorial experiment L4 (2 2 ). The corresponding cross-product
design, which is also called crossed array, is given in Table 4.9.
expensive. Taguchi recommends pooling together the small sources of variation until the
error variance corresponds to almost half of the available degrees of freedom.
This approach has been criticized by Box and Ramirez ( 1986) who noticed that
the pooling could result in an extreme bias in the statistical analysis. They propose using
half-normal plots as a safer procedure for selecting significant effects.
If some of the sources are pooled together with the residual then for the
computation of the contribution ratio Taguchi calculates the pure variation of a given
source of variation. Let for example, the sum of squares that corresponds to source A is
QA, while the total sum of squares is Q.
The significance of Ais tested through F-criterion:
If the effect of A is insignificant then the nominator and the denorninator of this ratio are
almost equal. The error sum of squares can be estimated by vAs~. Hence, the pure error
ofA is:
The contribution of a factor or interaction to the total variation is estimated using the so
called contribution ratio. For instance the contribution ratio of source A is defined as a
ratio expressed in percents between the sum of squares Q~ and the total sum of squares
Q:
PA= Q~ 100 %. (4.10)
Q
TI
3
I~ 3
2
3
13
PI p2 p3 p4
2
A type A plot is shownon Figure 4.5. The signal-to-noise ratio is put against the
product parameters and vertical lines are plotted. They show the significance of the
factor' s effects for ~. The points on the verticallines correspond to the mean value of ~
computed for all rows of the parameter design matrix in which a given factor has the
same Ievel. For example, the point 1 on Figure 4.5 for the factor p 2 corresponds to the
mean value of ~ calculated from rows 1, 4 and 7 ofTable 4.9, where p 2 takes Ievel 1:
The other points on the verticalline for p 2 are found by computing the following values:
and
The verticallines for the other factors (p1 , p 3 , p 4 ) are found in a similar way.
The distance between the end points on a vertical line measures the effect of a
factor. Looking at Figure 4.5 one can say that factor p 2 has the greatest effect, the
second greatest effect is that of factor p 4 followed by p 3 and p 1 . The points are
asymmetrically allocated on the vertical lines. That means that the corresponding
quantitative factor has a non-linear effect on the signal-to-noise ratio. For example, the
effect of p 2 is non-linear. However, a symmetrical allocation of the points does not
necessarily mean that the effect is linear. This is explained below (see Figure 4.8).
T AGUCHI'S APPROACH TO QUALITY IMPROVEMENT 207
Some ofthe main effects in Figure 4.5 arerather small, for example PI and p 3 .
Analysis ofvariance (ANOVA) is used to determine their significance.
Assuming that the interactions are negligible Figure 4.5 can be used to choose
optimal parameter values. In order to maximize ; one has to choose Ievel 2 for p 2 and
Ievel I for p 4 . If PI and p 3 have significant main effects then their optimal Ievels are I
and 3, correspondingly. Iftheir main effects are insignificant, the choice oftheir Ievels is
not important and other engineering considerations can be taken into account.
Separate plot can be drawn for each factor as shown in Figure 4.6. In these
graphs, which we call type B plots, the mean value of signal-to-noise ratio is put against
the factor Ievels.
2 3
a) b)
/
p
3
2 3 2 3
c) d)
Figure 4. 6. Type B plots
Construction of type B plots and their interpretation are similar to those for type
A plots. The interactions can be studied by putting the values of ; corresponding to one
factor (say p 4 ) against the Ievels ofthe other factor in the interaction (for example, p 2 ).
The obtained lines are parallel as shown in Figure 4. 7a. That means that the interaction
p 2 x p 4 is insignificant. If these lines are not parallel like in Figure 4. 7b, then the
208 CHAPTER4
2 3 2 3
~ ~
Figure 4. 7. Plots for interaction effects (a)Insignificant interaction between p 2 and p4
(b) Significant interaction between p 2 and p 4
Figure 4.8 shows an effect presented by type A and B plots. Though the points on
the plot A are symmetrically allocated, plot B shows a strong non-linear effect.
TypeA Type B p
ii) Adjustment parameters that have a great effect on the mean value but almost
no effect on the variance.
Some authors (Logothetis and Wynn (1988)) consider a third subset offactors:
iii) Neutral parameters. They do not affect either the mean value or the variance
ofthe response, and consequently, the signal-to-noise ratio too.
If such separation is possible one can maximize the signal-to-noise ratio by
changing the c.ontrol factor values. Then the mean value can be adjusted to the target T
by changing only the adjustment parameters. The neutral parameters must be set at their
cheapest Ievels, this way reducing the product cost.
The separation of the parameters into three groups is possible using plots type A
or B. Two graphs are plotted - one for the signal-to-noise ratio and another for the mean
value.
For example, consider the type A plots shown in Figure 4.9. The factors most
affecting the signal-to-noise ratio are p 3 ,ppp4 and p 7 (Figure 4.9. a). They are control
factors. One ofthem, p 1 , barely affects the mean value. Factars with great effects on the
mean value are p 6 ,p1 ,p4 and p 3 (Figure 4.9.b). One ofthem, p 6 , has almost no effect on
~. It is called adjustment factor. Factars p 2 and p 5 have almost no effects neither on the
mean value nor on the signal-to-noise ratio. They areneutral factors.
If there are no interactions, one can choose the factor Ievels as follows:
• Set p 1 at Ievel 2, p 3 at Ievel 3, p 4 at Ievel 3 and p 7 at Ievel 2. This will
maximize the signal-to-noise ratio ~.
• Adjust the mean value of the performance characteristic to the target value T
by changing p 6 .
• Set p 2 and p 5 at Ievels that make the product cheap.
2
3 2
2
mean %I TI I~ ...
(S/N ratio) 3 ~; • 2 ~;
PI p2 p3 p4 Ps p6 1 p7
2
a)
210 CHAPTER4
y 2
2
2
rz
3 1
overall '2 ,
mean 2 ~ 1
2
3 1
3
p1 p2 p3 p4
3
Ps P6 p7
b)
Figure 4.9. C1assification offactors: control, adjustment, neutral (a) plot for the signal- to-noise ratio
(b)plot for the mean value
The objective of Taguchi method is to develop a high quality product at low cost. This
can be achieved using the fact that the performance characteristic is non-linearly
dependent on the product or process parameters. A cost-effective solution of this
problern can be obtained if inferior grade materials and component parts are used to
make the prototype and then to decrease the variability through proper choice of the
operating point on the non-linear performance characteristic.
According to Taguchi and Wu (1980, p.32) "nothing is more foolish than
research using high priced raw materials or component parts". Such research can not
provide a cost-effective product because it misses the opportunity to obtain good
performance characteristics at low cost. The opportunity to develop a high quality
product using inferior quality materials and components must be realistically assessed by
engineers before starting the experiments.
The Ievels of the product parameters must be chosen to cover the region in which
an improvement of the performance characteristic is expected. The region of interest for
the noise factors should be selected to cover a wide region of manufacturing
imperfections that can appear in mass production and large tolerances of component
parts or raw materials. This can ensure cost effectiveness of the product.
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 211
If the ehosen toleranees of raw materials or eomponents are too wide and ean
not ensure satisfaetory quality of the produet they ean be tightened at the last researeh
stage ealled by Taguehi tolerance design. We eonsider it in Seetion 4. 11.
A priori information about the mean values m; and the standard deviations s; of
the quantitative noise faetors is neeessary for the ehoiee of their Ievels during the
experiment. F or faetors assumed to have linear effeets on the performanee eharaeteristie
Taguehi reeommends to use two faetor Ievels: (m; - s;) and (m; + sJ If the noise
faetors are supposed to have quadratie effeets on y then three faetor Ievels are
reeommended by Taguehi: ~; - .)3 I 2s; ), m; and ~; + .)3 I 2s; ). This ehoiee of the test
Ievels is based on the assumption that the noises have symmetrie distributions. It is
important to note that this ehoiee of test Ievels is seleetive rather than random. That is
why noise design matrix does not provide test eonditions that exaetly eorrespond to the
real produetion proeess. In mass produetion noise distributions are usually eontinuous,
while in the Taguehi eross-produet design they are approximated by two-point or three-
point diserete distributions, aeeording to the number of test Ievels for the noise faetörs.
This faet is illustrated by Figure 4.10.
>
m·-'/Js·
I ~2 I
m·I
a) b)
Figure 4.1 0. Continuous error distributions and their approximation with discrete distributions
(a)two-point discrete distribution (b)three-point discrete distribution
If it is supposed that the noise distribution is non symmetrie or if the noise faetors ean
not be kept on given Ievels during the experiment, or are unknown, Taguehi recommends
to use repeated observations for eaeh trial run of the parameter design.
Engineers must be eautious with repeated observations. Consider an example in
whieh the rubber mixture tensile strength is being tested. There are two different ways of
taking samples for repeated testing:
212 CHAPTER4
• A rubber plate is taken and the samples are cut of it and tested,
• Each test sample is obtained startins the preparation of the rubber mixture from
the very beginnins (dosing, mixing, forming, curing, etc.).
In the first case the differences in the tests results are only due to the
heterogeneity of the rubber plate, while in the second one the variation is due to all noise
factors existing in mass production such as imperfections in dose fixing, mixing, building-
up, curing process, etc.
lt is always necessary to run a confirmatory experiment to verify that the new parameter
settings really improve the performance characteristic. It may happen that some
important factors are not taken into account in the parameter design, or the assumptions
of no-interactions or those related to the noise distributions are wrong. In this case the
conclusions may be wrong. That is why a confirmatory experiment is a compulsory part
of the research.
If there is no improvement or if the improvement is not satisfactory then the
hypotheses about the product or process should be reassessed. New factors or
interactions can be introduced in the parameter design, the response function can be
reformulated to avoid interactions, if possible, etc. Then a new iteration of the parameter
design may be necessary. When this can not improve the quality to a desired extent one
has to go to the next step called tolerance design.
ratios introduced in Section 4.9. These contribution ratios are computed on the basis of
an experiment with product parameters varied around their optimal values according to
the noise design matrix. After tightening the tolerances a new confirmatory experiment is
conducted in order to see if the performance characteristic is within the desired Iimits.
We summarize the Taguchi method, though some variations are possible which can
exceed the bounds given below. A quality improvement problern can be solved using the
following scheme ofthe Taguchi method:
1. Define the problern and determine the objective. Identify the product
parameters and noise factors and specify their intervals of variation.
2. Choose orthogonal arrays for parameter design and noise matrix and form
Taguchi's cross-product design. Assign the product parameters or their interactions to
the columns of the parameter design matrix and the noise factors to the columns of the
noise matrix.
3. Conduct the experiment and collect data.
4. Analyze the data. Evaluate the performance characteristic for each run of the
parameter design matrix and analyze the results using analysis of variance and the
graphical methods given in Section 4.9.
5. Select new parameter values using the methods described in Section 4.9.
6. Confirm that the new parameter settings do improve the performance
characteristic. Forthis purpose the parameter values are set to their optimal values found
in step 5 and a noise design is conducted in order to evaluate the signal-to-noise ratio for
the new parameter settings. The mean value obtained on the basis of the confirmatory
experiment is compared with the target.
7. Apply tolerance design ifnecessary.
Taguchi was not the first who developed and applied methods for design of experiments.
There is a tradition that started with Fisher and was continued by Yates, Box, Kiefer and
many others. However, Taguchi introduced some new elements in this tradition which
are of great importance to engineering and especially to quality improvement. The most
important ofthese ideas are:
• Taguchi introduced the so-called loss function which shows how close the
performance characteristic is to the target. He showed that the performance criterion for
high quality product must be "closeness to target" rather than "within specifications"-
• Taguchi was not the first who applied experimental design to study product or
process variability as a function ofthe parameters. Morrison (1957) and Michaels (1964)
considered the problern of making a product robust to errors transrnitted from its
214 CHAPTER4
4.14. Examples
In this Section we consider two case studies with application ofthe Taguchi method. The
first case study is a successful application of this method, while the second one shows
that the difficulties of taking into account the interaction effects in Taguchi method can
be a reason for not reaching the optimal solution to the quality improvement problem.
4 3
Consider the assembly process of two parts shown in Figure 4.11: body (1) and
plate (2). The plate is transported by a fixture (3) against the body. Using a pusher (5)
the plate is pushed through an assembly jig (4) with a large input chamfer. The jig is
directing the elastic legs of the plate towards the openings of the body. The assembly
force y(N) applied to the pusher is considered as a performance characteristic. It
depends on three parameters: form ofthe pusher PI with two Ievels (form 1 and form 2),
width of the jig's passage p 2 (mm), and height of the jig's passage p 3 (mm). The
variations of the performance characteristic are mainly due to the following external
noise factors:
• error ni (mm) in the relative position of the jig' s passages and the fixture in the
direction of Z-axis,
• width ofthe plate n 2 (mm),
• height of the plate n3 (mm) and
• error n 4 (mm) in the relative position of the jig's passages and the fixture in the
direction of Y-axis.
The standard deviation of the performance characteristic must not exceed 1 N.
The objective of the parameter design is to find optimal values of PI, p 2 and p 3
which provide minimal value of the performance characteristic and minimize the variation
caused by the noise factors. This corresponds to "the smaller the better" case in
Taguchi's terrninology. Columns 1, 2 and 4 of the L8 (2 7 ) orthogonal array given in
(3
Table 4.6 are used as parameter design matrix. The noise matrix is chosen to be a L 9 4 )
three-level orthogonal array shown in Table 4.2. The correspondence between the coded
and the natural values ofthe factors is shown in Table 4.10.
218 CHAPTER4
The cross-product design and the observations are given in Table 4.11. The
values of the performance characteristic are given in the point of intersection of a row of
parameter design matrix and a column of the noise matrix. For example, the value
y 45 = 9.4 corresponds to row No. 4 ofparameter design matrix and colurnn No. 5 ofthe
noise matrix. lt is obtained for the following coded factor Ievels:
Y
and n=(n1 n 2 n 3 n4 =(2 2 3 1Y.
The free colurnns 3, 5 and 6 ofTable 4.6 are used für studying the effects ofthe
interactions p 1p 2 , p 1p 3 and p 2 p 3 , correspondingly. The mean value Y; and the signal-to-
noise ratio ~;, i = 1, 2, ... , 8 are calculated by formulae (4. 6) and (4. 7) on the basis of 9
values of the performance characteristic for each row of parameter design matrix. The
results of these computations are shown in Table 4.12. The total means of the
performance characteristic y and signal-to-noise ratio ~ for all rows of the parameter
design matrix are given in the last row ofTable 4.12.
For example, the values of j/1 and ~1 are computed as follows:
YI = ~(4.3 +9.8 +9.1 + 5.2+ 6.2 + 6.0+ 6.7 + 4.6 + 8.1) = 6.67
and
~~ = -10log 10 [~(4.3 2 + 9.8 2 + 9.1 2 + 5.2 2 + 6.2 2 + 6.0 2 + 6.7 2 + 4.6 2 + 8.1 2 )] = -16.80.
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 219
Table 4.12. Values of y1 and ~i for the rows ofparameter design matrix
No. PI P2 P1P2 P3 P1P3 P2P3 Y; ~i
1 2 3 4 5 6 7 8 9
1 1 1 1 I I 1 6.67 -16.80
2 1 1 1 2 2 2 4.93 -14.08
3 1 2 2 I I 2 15.19 -23.96
4 1 2 2 2 2 I 12.93 -23.09
5 2 I 2 1 2 I 7.14 -17.39
6 2 I 2 2 I 2 10.42 -20.49
7 2 2 I I 2 2 14.24 -23.50
8 2 2 I 2 I I 17.76 -25.21
y = 11.16 ~= -20.56
Table 4.13. Average values ofthe perfonnance characteristic and signal-to-noise ratio
Product Performance characteristic Signal-to-noise ratio
parameters and Levels Levels
interactions 1 2 I 2
PI 9.93 12.39 -19.48 -21.65
p2 7.29 15.03 -17.19 -23.94
P1P2 10.90 11.42 -19.89 -21.23
P3 10.81 11.51 -20.41 -20.72
P1P3 12.51 9.81 -21.62 -19.51
P2P3 11.12 11.20 -20.62 -20.51
For example, consider the average values of the performance characteristic and
signal-to-noise ratio computed for the Ievels of p 3 • The data are taken from columns No.
5 and No. 8 ofTable 4.12. For p 3 = 1 the mean value of Y; for rows No. 1, 3, 5 and 7 is
computed as follows:
while for p 3 = 2 the values of yj for rows No. 2, 4, 6 and 8 are averaged:
Similarly the average values of the signal-to-noise ratio for the levels of p 3 are
computed using data from columns No. 5 and No. 9 ofTable 4.12 as follows:
Figure 4.12 showstype A graphs based on the data from Table 4.13. They are
used for ordering the effects of product parameters and their interactions. The product
parameters p 2 , p1 and the interaction p 1p 3 have strong effects on the signal-to-noise ratio
q, while the effects on the mean value of the performance characteristic are ordered as
follows: p 2 ,p1p 3 and p 1•
15.03 a
1:P1
2:P2
3:P1P2
4:P3
5:P1P3
6:P2P3
11.16
?.29 1
-1?.19 ...
t
V -P•rf'. Ch•r.
Figure 4.J2.Effects of factors and interactions for the assembly of two parts
y- mean assembly force, .; - signal-to-noise ratio
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 221
Analysis of variance is used to test effects' significance. Table 4.14 shows the
results ofthe ANOVA procedure for the signal-to-noise ratio.
The total sum of squares Q is computed using the data of column No. 9 of Table
4.12:
Q =I (~i- ~J =[(-16.8 + 20.56Y + (-14.08 + 20.56) 2 + .. + (- 25.21 + 20.56Y ]= 114.43.
z::::l
v=NP-1=8-1=7,
222 CHAPTER4
The mean squares can be computed as a ratio between a given sum of squares
and the corresponding degrees offreedom. For example
2- 2(pI )-
SI - S -
Ql-
-
9.39-939
- .
V1 1
and
2 -
SR-
QR -- 1.31 -- l . 31 .
VR 1
The mean squared errors for the factors and the interactions are compared with
the residual mean squared error through the F-ratio. For example, the F-ratio for p 2 is
F 2 = F(p 2 )= s~ = 9 1. 09 = 69.45.
SR 1.31
As noted in Section 4.9 Taguchi recommends pooling together the small sources
of variation until the error variance corresponds to almost half of the available degrees of
freedom. In this example we pool together with the residual p 1p 2 , p 3 and p 2p 3 . Table
4.15 is the pooled ANOVA table for the signal-to-noise ratio.
After pooling the contributions of the factors and the interactions are computed
using the corrected sums of squares as shown in Section. 4.9. For example, the
contribution ratio for pi is computed as follows:
The critical value ofthe F-ratio for significance Ievel a= 0.05 is the same for the
effects of PP p 2 and pip3 included in Table 4.15: Fr =F(0.05,1,4)= 7.7086. Hence,
only p 2 has a significant effect on the signal-to-noise ratio.
The analysis of variance for the performance characteristic y is slightly different
because in this case repeated observations are available. The results are shown in Table
4.16. They are based on data from Tables 4.11 and 4.12.
The total sum of squares is:
8 9
Q= LL (yi}- .Y) 2 = (4.3 -1u6Y +(9.8-11.16) 2 + ... +
i~I }~I
The sum of squares due to the effect of product parameter PI is based on Table
4.13 and is computed as follows:
224 CHAPTER4
The other sums of squares for the effects included in Table 4.16 are computed in
a sirnilar way. The residual sum of squares is:
There are 9 Observations for each row of the parameter design matrix, which
allows to resolve the residual sum of squares into two parts: Qn 1 which is due to the inter
experimental error and Qn 2 due to the replication error. They are calculated as follows:
= [(4.3- 6.7Y + (9.8- 6.7Y + ... +(8.1- 6.7Y + ... + (16.7 -17.76Y ]= 934.83.
v=kr-1=8x9-1=71,
The variance estimates are obtained as ratios between the sums of squares and
their corresponding degrees offreedom. For example
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 225
and
chosen to be equal to 1 the level for p 3 must be equal to 2. Hence, the optimal
combination of the factors is PI = 1, p 2 = l and p 3 = 2. In natural measuring scales the
optimal geometric form of the pusher is form 1, the width of the jig's passage has to be
set at 15 mm and the height ofthe jig's passage must be equal to 4.1 mm.
One can see that the optimal parameter values are the same as for row No. 2 of
the parameter design matrix (Table 4.11 ). In spite ofthat a confirmatory experiment is
conducted with parameters fixed at their optimal values PI= 1, p 2 = 1 and p 3 = 2, while
the noise factors are varied according to the noise matrix. The observations are shown in
Table 4.18.Using the data from Table 4.18 following values are computed: y = 4. 94,
s~ = 1.3 8 and ; = -14. 09 . They are very close to the values obtained for run No. 2 of
the parameter design shown in Table 4.11.
.. ·a1 toIerances
. h t he Imti
Table 4 19 P00 led ANOVA tabl e fior expenments w1t
Source of Sumsof Degrees of Mean F-ratio p(%)
variation squares freedom squares
ni 1.076 2 0.538 6.63 8.27
n3 8.I76 2 4.088 50.40 72.57
n4 1.629 2 0.8I4 10.04 I3.28
Residuals 0.162 2 0.08I - 5.88
Total Il.043 8 100
(the effect of n3 ). For technological reasons the height tolerance can not be decreased
much. A new tolerance that is equal to 0.7 ofthe initial one is chosen.
To test the effect of tolerance design the parameters are fixed on their optimal
Ievels p 1 = 1, p 2 = 1 and p 3= 2 and new experiments with the noise matrix are conducted.
In these experiments the Ievels offactors n1 ,n2 andn4 are the same as in Table 4.10,
while for n3 the values corresponding to 1, 2 and 3 are 4.015 mm, 4.05 mm and 4.085
mm. The observations can be arranged as in Table 4.18. The only difference is in the
performance characteristic values, which are now equal to 4.1, 5.1, 6.8, 5.3, 5.0, 3.9,
5.7, 4.3 and 4.2. Using these. data the following estimates of the expectation and
standard deviation of the performance characteristic are obtained: y = 4. 933N and
sY = 0. 9314N. The standard deviation is now below the desired value of IN.
~i = 10log[~i~}
i = 1,2, ... ,18.
T ABLE 4.22. Average values ofthe performance characteristic and signal-to-noise ratio
f1thf1
or e actor leve
lfs o"wm dow-f1.
onmng process
Performance characteristic Signal-to-noise ratio
Parameters Levels Levels
1 2 3 1 2 3
PI 3.081 3.193 . 34.119 34.641 .
p2 3.007 3.470 2.934 35.437 35.294 32.409
P3 3.117 3.124 3.170 34.840 36.081 32.219
p4 3.059 3.218 3.134 34.182 35.742 33.215
Ps 3.210 3.040 3.161 34.758 35.089 33.293
P6 3.210 3.129 3.071 35.512 38.024 26.603
230 CHAPTER4
. nal-to-nmse rat1o
TABLE 4 23 ANOVA table tior the sigJ
Source of Sumsof Degrees of Mean F-ratio
variation squares freedom squares
P! 1.224 I 1.224 0.04
p2 35.031 2 17.516 0.61
P3 46.664 2 23.322 0.81
P4 19.504 2 9.752 0.34
Ps 10.953 2 5.477 0.19
P6 224.280 2 112.140 3.89
Residual 172.99 6 28.832 -
Total 510.63 17 - -
3.47
Factars
1 P1
8 PS
3 P3
4 P4
:> P:>
o; po;
~.14
2.'33
38.02 2
~.60
J. 2 4 s
.. ( -Perf. Stat .
Figure 4.13. Effects of factors and interactions for the window forming process
y- mean assembly force, .;- signal-to-noise ratio
Analysis of variance is carried out to study the significance of the factor's effects. The
ANOVA results for the signal-to-noise ratio are shown in Table 4.23. The critical values
of the F-criterion at significance Ievel a=0.05 are F(ü.05,2,6)=5.1433 and
F(0.05,1,6)= 5.9874. Table 4.23 shows that there are not product parameters with
significant effects on the signal-to-noise ratio. Analysis of variance is carried out for the
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 231
case when all effects except p 3 and p6 are pooled to the residuals. The results are shown
in Table 4.24. Factor p 6 is now significant because F(0.05,2,13)= 3.80.
Table 4.25 shows ANOVA results for the performance characteristic. The
residual sum of squares is resolved into two parts: QRI and QR 2. The corresponding F-
ratio is
F.' = s~ 1 = 1.1 74 = 150.31
Rl ~2 0.008 '
effects of p 3 , p 4 , p 5 and p 6 are pooled to the Residual 1. The variance of the pooled
residual is computed as follows:
Comparing the pooled residual with the replication error we find that it is
significant because
Taking into account the plots and the pooled ANOVA tables for y and f one can
choose the following values of the factors PI= 2, p 2 = 2, p 3 = 2, p 4 = 2, p 5 = 2 and
p 6 = 2. This combination of factor levels provides a maximum of signal-to-noise ratio.
A confirmatory experiment under these conditions shows the following values of the
performance characteristic, its standard deviation and the signal-to-noise ratio:
For comparison the product parameter values before the experiment were PI = 1,
P2 = 2, P3 = 2, p4 = 2, Ps = 1, P6 = 1 and
The original Taguchi method is described in Taguchi (1986a), Taguchi (1987), Taguchi
and Wu (1979), Taguchi (1993a), Taguchi (1993b). The terminology used in these books
often differs from those adopted in the statistical literature. For description of the
correspondence between Taguchi's and statistical terminology see Kackar (1985). This
paper also presents an introduction to Taguchi method. Taguchi and Konishi (1987)
developed a catalogue of orthogonal arrays, and tables of orthogonal arrays are also
given in the books by Taguchi quoted above, by Logothetis and Wynn (1989), Phadke
(1989) andin other books.
Several books are devoted entirely to relative simple descriptions of the Taguchi
method and give many examples, for instance Barker ( 1985), Ross ( 1988), Phadke
(1989), Gien and Peace (1993), Fowlkes and Creveling (1995), Park (1996). The book
234 CHAPTER4
by Logothetis and Wynn (1989) presents the Taguchi method with many extensions and
comments, including critical remarks. Same model-based methods for quality
improvement arealso included. The book by Grove and Davis (1992) describes the main
ideas of Taguchi method and Taguchi's cross-product designs. Data analysis and decision
making are based on use of normal and half-normal plots.
Several books contain one or more chapters devoted to Taguchi method, for
example Ryan (1989), DeVor, R., Chang T. and Sutherland, J. W. (1992), Myers and
Montgomery (1995). The books edited by Bendell, Disney and Pridmore (1989), Dehnad
(1989) and Ghosh (1990) are collections of papers presenting case studies based on
Taguchi method. Case studies are presented also by Taguchi (1993c) and Wu (1993),
Lochner and Matar (1990).
There are hundreds of papers on Taguchi method. Most of them present
applications, but there are papers discussing the theoretical basis of the method as weil.
Opinions vary from appraisal to rejection. Nair (1992) edited a balanced discussion on
the Taguchi method, see also the discussion on the paper by Kackar ( 1985) and the
paper by Box, Bisgaard and Fung (1988).
TAGUCHI'S APPROACH TO QUALITY IMPROVEMENT 235
The loss due to deviation of the performance characteristic y from the target value T can
be presented in the form
(4.1)
Prooj The loss can be expanded by Taylor series araund T up to second order
term as follows:
There is not any lass if the performance characteristic is equal to the target and
1(-r) = 0. The firstderivative
o~~)ly=< = o,
because the minimum of the loss function is at y = -r. Putting
k
c
= _!..c~ 2/(y
2! oy2 y=<
)I
we obtain (4.1).
•
Appendix A.4.2. Expected loss
Proof Denote E(y) = Tl and rewrite the expected loss in the form
L =kß[(y-q+q--rY ]=
E((y -7]x7]- r)) = (7J- T )E(y -7]) = (7J- T XE(y)-7J] = (7J- rX7J -7]) = 0.
Consequently,
In this chapter we suppose that the performance characteristic's variability is only due to
errors in factors and that external noise factors do not exist. This makes possible to
reveal some specific properties of error transmission from product parameters to the
response. In Chapter 6 we consider optirnization procedures which are specific for that
case. In Section 7.3 we show that these properties can also be useful for reducing the
number of experimental runs.
Let us consider again Example 1.1 of Section 1.3 for the tensile strength y of a rubber
composition that depends on the amount of a component p. In the production process p
can not be fixed exactly on a given value and varies within a tolerance interval as shown
in Figure 5. 1. Therefore, in the production process the real value of the amount of the
component is p + e where e is random error. One can see from Figure 5.1 that the error
in product parameter settings is transmitted to the response. Distribution of transmitted
error depends on the distribution of e and on the form of the function 17 = 17(p) .
237
238 CHAPTER5
mean to compute the performance characteristic's variance one needs only an estimate of
the error variance. This approach is much easier than to estimate the product
performance variability through cross-product designs and signal-to-noise ratio as in
Taguchi method. We discuss this problern in details in Section 5.3.
One can choose the optimal parameter value using a model of the performance
characteristic. Suppose that the experimenter wants y to be equal to a given target value
T. If there was no random error in the product parameter and if Tl = ry(p) was a
deterministic function then the response would be equal to T for parameter values p = PI
and p = p 2 . However, in presence of error in the parameter the variation of the
performance characteristic y = ry(p + e) is not the same for p = PI and p = p 2 . One can
see from Figure 5.1 that the choice p = PI provides smaller variability of the performance
characteristic than p = p 2 . Consequently, the product quality can be improved by
choosing a parameter value which minimizes variance while keeping the mean value on a
target. In our case this is p = PI .
In fact the situation is more complicated. The mean value of the performance
characteristic depends on errors in several product parameters. Suppose we have a
regression model obtained through an experiment without errors in the factors p and Iet
y(p) be the predicted value. The mean value of the performance characteristic in mass
production is y(p) = y(p) + bias. The bias is caused by errors transmitted from the
product parameters to the response. In many cases it is negligible.
In order to find the optimal parameter values one needs two models describing
the mean value and the variance of the perjormance characteristic in mass production.
With these models quality improvement can be defined as an optimization problern of
variance minimization while keeping the mean value on a target.
In this chapter the idea illustrated by Figure 5. 1 is generalized for several product
parameters. We consider the mechanism of error transmission and derive models of the
mean value and variance in mass production.
5.2. Models of the mean value and the variance in mass production
In this chapter we consider the case when there are errors in product or process
parameters in mass production but external noise factors do not exist. Usually these
errors are raw material variations or manufacturing imperfections.
Denote the vector of product or process parameters by p = (pi p 2 ... Pm f
and the vector of errors by e = (ei e 2 emf. The true parameter values in mass
production are:
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 239
Let us start with an example in which the performance characteristic 1J depends on two
parameters p 1 and p 2 . Suppose that in absence of errors the following second order
polynornial model is true:
(5.2)
240 CHAPTER5
Assurne that the rnodel coefficients are known constants. The effect of their
replacernent with least square estirnates is ignored for the rnornent. This problern is
considered in Section 5.5.3.
In rnass production the product parameters are subject to errors e1 and e2 . That is
why the rneasured value ofthe performance characteristic in rnass production is
(5.4)
(5.6)
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 241
This equation shows that for linear relationships between the performance characteristic
and the product parameters the variance can not be reduced using parameter design.
Variance reduction through optimal choice of parameters is possible only if the product
performance characteristic is nonlinear. This is shown in Figure 5.2. For linear
performance characteristic (Figure 5.2a) the translation of the operating point from
p' top" does not change the variance of y, while for the nonlinear characteristic in
Figure 5.2b this causes a substantial reduction of a 2 .
~
;;:;"'"""....
p' p" p
a)
/
y
I
Q p' p" p
b)
Figure 5. 2. Transmission of errors from product parameters to the response (a) linear perforrnance
characteristic (b) nonliner perforrnance characteristic
Figure 5.2 shows the error transmission from a single parameter to the response.
The equations for mean value and variance for this case can be obtained from ( 5.4) and
(5.5) putting ß2 = ß12 = ß22 = 0. This interpretation is valid for any number offactors as
weil. This is shown in the next section.
242 CHAPTER5
Using equations (5.4) and (5.5) an engineer can find parameter values that
rninirnize the variance a 2 while keeping the mean value y(p) on a target r. In Chapter 6
we discuss this problern in details. Here we confine ourselves to model building.
Example 5. 1. Chernical reactor example (continued)
Consider again the chernical reactor example of Section 2.3.5. In Example 2.7 the
following regression model is obtained:
y = 59.89+2.67p1 +3.33p2 + 2.5p1p 2 - 4.33p12 +3.67p;. (5.6a)
0.
0.
0.
1 p
2
-0.
0.
-0.
-0.
-0.
-1'---'----""'---....:;;_--'---""------'--'------'--'
-1 0.5
Figure 5.3. Mean value contours for the chemical reactor example
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 243
Only the intercept of this model differs from the intercept of the original
regression model by 60.05- 59.89 = 0.16.
The variance in mass production is found by (5.5):
•
5.2.3. GENERALIZATIONS
The results ofSection 5.2.2 can be generalized for an arbitrary number offactors.
Models of mean value and variance in mass production based on second and third
order polynomials
One of the most frequently used models in response surface methodology is the second
order polynomial. Form factors it is written as follows:
m m-1 m m
Using the same ideas as in Section 5.2.2 we obtain the following models
describing the performance characteristic and its variation in mass production:
• Model of the expected value (the mean value) of the performance
characteristic:
(5.9)
i=l
J
• Model of the variance
Both models are obtained under the assumption that the errors in factors are
independent. The proofs of ( 5. 9) and ( 5.1 0) are given in Appendix A. 5.1.
Model (5.10) can significantly be simplified for some widespread cases. For
symmetric distributions of the errors in factors pi3 =0. If the error distributions are
normal, then f.1; 3 = 0 and f.1; 4 = 3u: .
In this important case formula ( 5. 10) becomes:
(5.11)
m m-I m
HOT= i"l.Jf;.u~ +
i=l
L 'L/fijcl;dj.
i=l j=i+1
(5.12)
This value does not depend on product parameters and therefore, it does not affect the
choice of optimal parameter values.
Replacing cl;, a; and ß;1 with their estimates s12 , s~ and b;1 we obtain
HOT= 2[(- 4.33 )2 X 0.04 2 +3.67 2 X 0.09 2 ]+ 2.5 2 X 0.04 X 0.09 = 0.3 .
Therefore, a variance model in which high order terms are taken into account is
0.
0.
0.
r p
0.
0
2
-0.
-0.
-0.
-0.
-1"'--->----'---"'----"'o..-'-------'---L---L-..l
-1 0.5
Sometimes the high order terms are negligible, but it may be dangeraus to ignore
•
them in all cases. A discussion on this problern is given in Section 5.5.2, see also the
friction-welding example in Section 6.8.
A carefullook at equations (5. 9) and ( 5.l 0) shows that the conclusions in Section
5.2.2 are applicable for any number of product parameters. If the regression model is
linear with respect to pi (i.e. ßii = A = 0) the variance does not depend on product
parameters and therefore, only tightening their tolerance intervals can reduce response
variation.
In some cases for a given region of interest the response may depend only on
main and interaction effects, while all quadratic effects are negligible (ßii = 0). The
elimination of quadratic effects does not change the order of the variance model.
Equation (5. 11) shows that with or without quadratic effects the variance model is a
second order polynomial with respect to product parameters.
To make decisions engineers need to know whether a product parameter has an
effect on the mean value or on the variance, or on both of them. We say that a factor pi
has location e.ffect when the change of pi results in a considerable change of the
performance characteristic's mean value. Similarly Pi has dispersion e.ffect when its
change causes profound changes in the performance characteristic's variance. Figure 5.3
and Figure 5 .4 show that p 1 and p 2 in Example 5. 1 have both location and dispersion
effects.
246 CHAPTER5
lt may happen that a given factor pi has a strong effect on the mean value and
almost no influence on the variance. For example, if A =0, while ßi and ßii in equations
( 5. 7) and ( 5. 11) are large and the error variance c1; is small, we say that the factor Pi has
a strong location effect and a negligible dispersion effect. Conversely, if ßi1 = 0 and
ßi and ßii are relatively small, but c1;
is large the dispersion effect is strong while the
location effect is negligible. This fact is used by Taguchi to divide the product parameters
into control and adjustment factors. lt is in conformity with Taguchi's idea that the
interactions have to be eliminated. However, often all factors have both dispersion and
location effects. Therefore, the classification of factors as control or adjustment ones
may be an impossible task.
Models of the mean value and variance can also be derived for the case when the
performance characteristic is described by a third order polynomial as follows:
m m m
+ 'Lßiiip: + L 'LßiijP 2 iPj. (5.13)
i=l i=l j=l.j*i
Assurne that the errors in product parameters are independent and normally
distributed and E(eJ = E(e) = 0, i =1,2, .. .,m . Under these assumptions the model of the
mean value in mass production is:
(5.14)
+HOT+a;, (5.15)
where
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 247
HOT= 2f(ßii
i=l
+3ßwP; + fpiiJPJJ
j=l,j;t:.i
2
U;4 +
(5.16)
Equations ( 5.15) and ( 5.16) are obtained as special cases of the results derived in
Appendix A.8.1 ofChapter 8. Their proofs are given in Appendix A.8.3.
In cantrast with the case when the performance characteristic's model is a second
order polynomial one can see that the high order terms ( 5. 16) depend on product
parameters. The variance is a fourth order polynomial with respect to parameters
p;, i = 1,2, ... ,m.
or p = {pl P2 ··· Pm Y·
• vector of coefficients in the linear part of ( 5. 7):
248 •CHAPTER5
The coefficients in the nonlinear part of (5. 7) are written in a m x m matrix 11 with
following elements:
ß;; fori = j
{
11ij= 1ß ti . . .
-2 .,.. or1 -:1= J
(5.17)
For example, model (5.6a) of Example 5.1 can be presented in the form (5.17)
with ß0 ,ß and 11 defined as follows:
In the widespread case of independent errors the covariances are equal to zero
(p;p;ai=O) and ~. is diagonal matrix: ~. = diag(a 12 ,a; , ... ,a!).
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 249
Compute
(5.20)
The trace of a square matrix is equal to the sum of its diagonal elements. Hence
(5.21)
(5.22)
Formula (5.12) for the high order terms can be written in matrixform as follows
ß;;2 tior 1. = 1.
[2i'2 ];}
{
= 1 ß2 ti .......
-4 ,,.. or 1 +- 1
Example 5.4. Matrix form of the variance model for second order regression
model with two parameters
We will show that for second order regression model with two parameters (5.17)
the expression (5.22) is equivalent to (5.11), while (5.23) coincides with (5.12). Forthis
case the vector (ß + 2 g' p) can be presented in the form
In the case of two product parameters the variance matrix of uncorrelated errors
is :E. = diag(u 12 , u;). Therefore, the first term of (5.22) can be written in the form:
(5.24)
1 2 .
4ßl2 J
p;2
= 2Iß~p~+~2~a; ·
i=l
In order to use the formulae of Section 5.2 one needs the estirnates of the error
distribution rnornents: the rnean value E(ei ), the varianced; and the third and fourth
order rnornents pi3 and pi 4 . They can be found in two ways: through observations or on
the basis of tolerance intervals.
1 n
E;=-Lz;., i=1,2, ... ,n. (5.25)
n u~I
(5.26)
(5.27)
The estimates of the third and fourth moments of the error distribution are
(5.28)
with k = 3 or k = 4. However, these estimates are biased. The unbiased estimates are:
(5.29)
and
~ _ n(n 2 -2n+3) ~ _ 3n(2n-3) ~2
f.J;4 - ( n-1 X X )
n-2 n-3 f.J;4 ( XV
n-l n-2 1,.._n-3 ) P,;2 '
(5.30)
TABLE 5..
1 E sumauon
. of· product parameters moments
No. Zlu Ziu -zi (zlu - zl y (zlu - zl )3 (zlu - zl t Z2u Z2u -z2 (ziu -z;)x
(z2• -z2)
1 2 3 4 5 6 7 8 9
1 20.7 -1.71 2.9241 -5.000 8.550 10.3 -1.71 2.9241
2 23.1 0.69 0.4761 0.329 0.227 10.6 -1.41 -0.9729
3 20.4 -2.01 4.0401 -8.121 16.322 11.2 -0.81 1.6281
4 20.8 -1.61 2.5921 -4.173 6.719 13.2 1.19 -1.9159
5 21.1 -1.31 1.7161 -2.248 2.945 14.5 2.49 -3.2619
6 21.7 -0.71 0.5041 -0.358 0.254 13.0 0.99 -0.7029
7 27.8 5.39 29.0521 156.591 844.024 10.5 -1.51 -8.1389
8 21.8 -0.61 0.3721 -0.227 0.138 11.4 -0.61 0.3721
9 21.3 -1.11 1.2321 -1.368 1.518 13.3 1.29 -1.4319
10 25.4 2.99 8.9401 26.731 79.925 12.1 0.09 0.2691
Sum 224.1 51.849 162.155 960.624 120.1 -11.231
Biased estimates of third and fourth order moments of the error in the first factor
are computed by (5.28):
Using (5.29) and (5.30) one can compute the corresponding unbiased estimates as
follows:
10 2
A
p 13 =( X )x18.0172=25.02389
10-1 10-2
and
A = 10(10 2 -2x10+3) x 106736 _ 3x10(2x10-3) x 5 761 2 =
P 14 (lo-1X1o-2X1o-3) · (10-1X1o-2X1o-3) ·
The moments of the error e2 are computed similarly from the data in Table 5.1.
They are
s 22 .4 89 = 2.0543
= 1810-1 ,
- = 8.2361 = 0.9151
f..l 23 10-1 ,
- = 62.4471 = 6.9386
f..l 24 10-1 ,
p = 1. 3889
23 X 0. 9151 = 1. 27098,
p = -11.231 = -0.3627.
12
(10 -1}J5.761 v'2.0543
•
5.3.2. V ARIANCE ESTIMATION USING TOLERANCE INTERV ALS
If the errors are assumed to be normally distributed the mean values and variances of raw
material or component parts performance characteristics can be calcu1ated on the basis of
tolerance intervals.
Suppose that the nominal value of a product/process parameter expressed in
some physical scale is p;. Its tolerance interval can be given in two ways:
• with constant tolerance Iimits : (p; - r;;, + r;; ), p;
• with tolerance Iimits given in percents (K; %) ofthe nominal va1ue:
It is known that if the errors in parameters are norrnally distributed, their true
va1ues in mass production z;
belong to the interval p; ± 30: with probability 0.997. It is
clear that r;; = 30: and d; = ~ I 9. If the tolerance interval is given in percents the
2
corresponding value of the error variance is u; 2 = (K;p; /300Y. In this case the error
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 255
variance depends on the nominal parameter value. This fact must be taken into account
in the optimization procedure because the value of 0: 2 changes tagether with p;.
0: 2 = (5 x 1200/300) 2 = 4oo n 2 .
•
In a design of experiments usually all factors are coded by formula (2.28) and
coded values of error standard deviations are used in all models of Section 5.2. Suppose
that in an experiment some parameter p; is measured in natural (physical) units in the
interval p;min ~ p; ~ p;rrw<. The half-interval ofvariation used in (2.28) is
where p;0 is the centre of the interval p;min ~ p; ~ p;max. The factor values are coded
using formula P; = (p; - P;o )Im; .
In mass production factors are subject to errors. Their true values in coded form
p; p;
are equal to z; = (z; - 0 )Im; . As 0 and m; are constants it is easy to see that
(5.31)
(5.32)
5oo n~p;~15oo n.
256 CHAPTER5
The coded value of the variance of a resistor with nominal value p; = 1000 Q is
computed by (5.32) as follows:
Note that these variances computed on the basis of tolerances differ from the
Taguchi's values given in subsection 4. 9 .1.
•
5.4. Further generalizations
In Chapter 2 was given a more general form of the regression model which can be used
for any degree of the polynomial. Models of mean value and variance of a performance
characteristic in mass production can be derived on the basis of (2.30). They are useful
for studying the effect of true model coefficient replacement with least square estimates.
This problern is considered in the next sections.
Suppose that a regression model is given in the form
k
77(p)= l:B;.t;(p)= fr B, (5.33)
i::::l
y(z) = fr B + gr B + e . (5.35)
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 257
All polynomial models can be presented in the form (5.35), see Example 5.7.
Taking into account (5.33) one can compute the mean value of the performance
characteristic in mass production:
Denote also 1f1 = g- E(g) and 'P = E(lfllflr). As rr Bis non-random, the variance can be
computed as follows:
(5.37)
Usually e and & are uncorrelated and 2cov(gr B,c )= 0. Consequently, the
variance ofthe performance characteristic in mass production is
(5.38)
If the regression model is a second order polynomial (5.36) and (5.38) are
equivalent to (5.4) and (5.5). This is demonstrated by the following example.
The matrix 'P, corresponding to a full second order polynomial model, is derived
in Appendix A.5.3 under the assumptions that errors in factors are with zero
expectations and are uncorrelated. All distribution moments of order higher than second
one are neglected as weil.
Example 5.8.
Consider a second order polynomial model with two parameters:
Defining
and
one can present this model in the form (5.33). In this example the distribution moments
are taken into account. If the product parameters are subject to errors the vector f is
transformed into
or
258 CHAPTER5
The vector g = h - f is
Provided that E(e1 ) = E(e 2 ) = 0, one can compute the expectation of g as follows:
lfl = g - E(g) =
-'o
-~ eI
This equation can be simplified taking into account that E(e1 ) = E(e 2 ) = 0 and assuming
that the errors in factors are uncorrelated, i.e. p 12 = OandE(e1e 2 )= 0. For the sake of
simplicity we also neglect all moments of order higher than second:
The other elements of 'I' can be computed under the same assumptions in a
sirnilar way. The matrix 'I' is
0 0 0 0 0 0
0 a2I 0 p2al2 2p1a{ 0
0 0 a22 pla; 0 2pp;
'I'=
0 P2a12 P1a2
2 p2a2
I 2 + p2a2
2 I 2plp2al2 2plp2a;
0 2plal2 0 2plp2al2 4p2a2
I I 0
0 0 2pp; 2plpp; 0 4p2a2
2 2
Putting (} and 'I' into (5.38) one obtains a formula for prediction ofthe variance in mass
production:
Example 5.9.
•
In Example 5. 1 we obtained a variance model provided that the regression model
is:
and the estimates of error variances are s12 = 0. 04 and s; = 0. 09. This model can be
written in the form (5.38). Forthis purpose we define
The matrix 'I' is obtained in Example 5.8. Substituting s12 and s; for ~ and a;
one can write
0 0 0 0 0 0
0 0.04 0 0.04p2 0.08pl 0
0 0 0.09 0.09pl 0 0.18p2
'i'=
0 0.04p2 0.09pl 0.09pl2 +0.04p; 0.08plp2 0.18plp2
0 0.08pl 0 0.08plp2 0.16pl2 0
0 0 0.18p2 0.18plp2 0 0.36p;
260 CHAPTER5
The response error variance iss;= 1.148. Hence, the variance rnodel (5.38) can
be written as follows:
Carrying out the rnatrix cornputations one can obtain the variance rnodel in the
form given in Exarnple 5. 1:
•
5.5. Accuracy of the predicted mean value and variance in mass production
There are three rnain sources of inaccuracy in the prediction by the rnodels introduced in
subsection 5.2:
1. Incorrect structure of the regression rnodel,
2. Neglecting high order distribution rnornents and high order terms in the rnean
and variance rnodels,
3. Substitution of estirnates b with coefficients ß in the regression rnodel.
If significant terms are ornitted in the regression rnodel this affects the rnodels of the
rnean value and variance of the perforrnance characteristic in rnass production. They rnay
give inaccurate predictions and the product pararneters chosen on the basis of these
rnodels rnay not be optimal. It is irnportant therefore, to pay attention to building a high
quality regression rnodel. Stepwise regression, all possible regressions or Daniel plots
can be used for a correct rnodel structure deterrnination.
Let us consider the consequences of an incorrect structure selection for rnodel
(5.33) considered in Section 5.4. Suppose that the true rnodel structure is 77= fr (), while
the postulated rnodel is 711 =fiT~' where rr (rr :r;)
= and ()T = (or :o;). Therefore,
the postulated rnodel has fewer terms than the true one.
Using the notations of Section 5.4 we present the vectors g and 71 = f r (} in
block form as follows:
(5.39)
(5.40)
The conclusion is that if some of the significant factors or functions J; are not in
the model their transmitted error causes a bias 11ci" in the variance estimate. The model
(5.39) can be written in the form
(5.41)
where c?N = !1ci" + 0: is variance of some noise incorporating all noise components
which are not taken into account in the variance model (CT ~ ~ CT; ) .
The variance c?N depends on the regression coefficients and on the functions f
that are included in the term 2 01r '1'12 02 through '1'12 . Therefore, if the postulated model
has less terms than necessary ~ can not be considered as a constant. In any case the
Observations are heteroscedastic but the true model of variance explains the sources of
heteroscedasticity, while a model with incorrect structure incorporates them into a single
term c?N which can not be used for making decisions.
An important question that should be addressed is how good the models of mean
value and variance are in mass production. Starting this discussion we should not forget
that these models are derived analytically on the basis of a regression model. The
experiments used to estimate the regression are without errors in product parameters,
while models ( 5. 9), ( 5.1 0) and ( 5.11) provide predicted mean values and variances in
mass production where errors can not be eliminated. Therefore, the values predicted by
these models should not be compared with the experimental data used for derivation of
262 CHAPTER5
the regression equation. The obtained optimal parameter values can be verified through a
confirmatory experiment in mass production. Several Observations should be made for
the optimal parameter values and the obtained sample mean and variance should be
compared with the values predicted by models (5.9) and (5.10) or (5.11).
The qua1ity of mean value and variance models in mass production depends on
goodness-of-fit for the regression model. For an accurate regression model and good
estimates of noise variances the models (5.9), (5.10) and (5.11) provide good prediction
of the mean value and variance in mass production if the assumptions for their derivation
are met. That is why the attention of the engineer or the scientist should be focused on
obtaining an accurate regression model using the recommendations for design of
experiments and statistical data analysis given in Chapters 2 and 3. The assumptions
about the noise distributions should also be checked. If there is no evidence that the
noise is normally distributed formula (5.10) must be used to compute the performance
characteristic's variance.
Now consider in details the case of second order regression model. High order
terms in the model of variance in mass production may not be negligible in the following
cases:
• The error distribution is not normal or is at least non-symmetrical. One can
see from ( 5.1 0) that in this case the high order terms of the variance model depend on
third and fourth error distribution moments f.J; 3 and f.J; 4 . In this equation pi3 is multiplier
of an expression containing product parameters. Hence, the optimal parameter values
depend on third order moments.
• The error variance depends on the values of the product parameters. As
shown in Section 5.3.2 this is typical when tolerance intervals or component parts
depend on the nominal parameter values. There can be other reasons for such
dependence as weil.
In the case considered in Section 5.3.2 the error variance can be· written as
follows:
Asp;= OJ;P; + p;0 , the error variance can be presented in the form
(5.42)
where
(5.43)
This is a fourth order function of the product parameters. Therefore, Cl has more
than one extremum and global optimization methods should be used for variance
minimization. The high order terms can be neglected only if all interaction and second
order effects are negligible (p;; = ßii =o).
264 CHAPTER5
Note however, that contrary to the case with constant error variance, if cl, is
given by (5.42) then the response variance in mass production depends on P; even ifthe
regression model is linear. This can be seen after substitution of ß;; = ßif = 0 into (5.43).
The following model is obtained:
The bias in the mean value of the performance characteristic depends also on the
product parameters when error variances are their functions. This becomes evident after
substituting (5.42) into (5.9):
i=l i=l
a) b)
p' p" p
c)
Figure 5.5. Transmission of errors to the response (a) linear performance characteristic (b) and (c)
nonlinear performance characteristic
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 265
Figure 5.5 shows how a 2 depends on variable error variance in case of single
product pararneter p. Suppose that the error variance a;is higher for p = p" than for
p = p' . Figure 5. 5a shows that for linear regression model the variation of y is smaller
for p' than for p". Consequently, p' is better choice of the product parameter value
than p". The situation is not so clear with performance characteristics that are non-
linearly dependent on product parameters. Figure 5.5b shows a case when the effect of
the increased error variance is compensated by the response function non-linearity and
p" is the better choice despite of the fact that the error variance for this point is !arger
than for p'. However, if the nonlinear effect is not significant as in Figure 5.5c the
growth of error variance can not be compensated and p' remains the better choice for
the product parameter value than p".
r
In Sections 5.2 and 5.4 we assumed that the regression model coefficients in (5.7) and
(5.33) are known constants. In fact only estimates 0 = (01 02 Ok are available
and they are substituted in the models for (} =(B1 B2 ... Bk Y. This causes some
variability of the predicted mean value y(p) and variance a (p) .
2
Consider equation (5.33). Suppose that the factor Ievels are set without errors in
the experiment. The least square estimates of the regression coefficients are
where F is N x k matrix of regressors. Putting B for (} in the model of the mean value
in mass production (5.36) one obtains
(5.44)
The first term in the right band side of(5.44) can be presented in the form
(5.45)
The expectation of :e with respect to the output noise & is computed putting (5.45) into
B
( 5 .44) and taking into account that are unbiased estimates: E = 0 . The following
E (8)
relationship is obtained:
of Bfor 0 causes an increase of s 2 compared to Cl. The bias term in ( 5 .46) can be
estimated as follows:
(5.47)
1=1 1=1 1=1+1
( ~)
V\0 =\F F
{ T )-1 aE2=CaE,2
or
and
{ ~ ~ ) 2
cov\01 , 01 = c11 a E,
where c11 and c11 are diagonal and off-diagonal elements of C, correspondingly. Putting
these equations in (5.47) we obtain
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 267
k k~l k J (5.48)
= 52 - s; ( ~ 'I'ucu - 2~ t~l'Pitclt '
The model coefficients can be estimated on the basis of data obtained through two level
full or fractional factorial designs. As shown in Chapter 3 in this case Cu = 1I N and
clt = 0 for l,t = 1,2, ... ,k, l t: t. Hence, the last term in the brackets of(5.48) is zero. The
elements lf/u are computed in Appendix A.5.2. They are:
'Pll = 0,
'Pu= a7 for i = 1,2, ... ,m; l =2, ... ,m+ 1,
'l'z1 = a7 p~ + djp;2 + a7dj for i = 1,2, ... ,m -1;) = 1,2, ... ,m,i t: j and l = m + 2, ... ,k.
Using these elements and equation (5.11) in (5.48) one obtains the unbiased estimate of
the variance:
268 CHAPTER5
(5.50)
s s;
The bias in 2 can be neglected if is small and N is high.
Unbiased estimate of Cl can also be obtained for a dass of symmetrical second
order designs with covariance matrix ofthe form:
(5.5I)
The proof of this equation is given in Appendix A.5.2. The bias changes only the
intercept and the quadratic terms in the model.
Example 5.IO. Unbiased variance model for the chemical reactor example
In Example 5.2 we found the following model of variance which takes into
account all distribution moments up to fourth:
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 269
1 1 1
Therefore, d3 =2,d5 ="6 and d6 ="4 The error variances are s12 = 0.04 and
s; = 0.09and s; = 1.148. The bias is
bias = 1.148{!(0.04+0.09)+!(o.04pi +0.09p12 + 0.04x 0.09)+
6 4
•
Example 5.11. Molybdenum extraction from nitrate-sulphate solutions
In this example we consider as a performance characteristic the degree of
molybdenum extraction y [%]. It depends on four factors: consumption of extractive
substance p{[%), concentration of NO;: p~[gll], concentration of SO~': p;[gll],
duration of phase contact: p;[ min]. A three Ievel experiment is carried out. The
correspondence between the natural and coded factor Ievels is shown in Table 5.2.
270 CHAPTER5
An orthogonal composite design is used. The design and the observed values are
given in Table 5.3.
TABLE53
.. Des1gn fior molbdenum extracuon expenment
No. PJ p2 P3 p4 y No. PJ P2 P3 p4 y
1 -1 -1 -1 -1 68.6 13 -1 -1 1 1 76.4
2 1 -1 -I -I 97.5 I4 I -I I I 94.2
3 -1 I -1 -1 69.2 15 -I I 1 I 68.0
4 1 1 -1 -I 95.0 I6 1 1 I 1 96.0
5 -I -I I -I 65.0 I7 -1 0 0 0 73.2
6 1 -1 1 -1 96.6 18 1 0 0 0 90.3
7 -1 I I -I 69.0 I9 0 -1 0 0 77.3
8 1 1 I -I 98.0 20 0 I 0 0 81.5
9 -1 -I -1 1 69.0 21 0 0 -1 0 80.0
10 1 -I -I I 97.0 22 0 0 1 0 78.5
11 -1 1 -1 I 77.5 23 0 0 0 -1 65.5
I2 I I -I 1 98.1 24 0 0 0 1 75.3
The main effects of only p 1 and p 4 and their interaction p 1p4 as weil as all quadratic
effects are significant.
The covariance matrix C = (FrFt
is ofthe form (5.51). The non-zero elements
of this matrix are d1 = 0.229167,d2 = -0.0625A = 0.395833, d 4 = -0.104167,
d 5 = 0.055556 and d 6 = 0.0625. Because ofthe specialform ofthe covariance matrix C
main and interaction effects are not correlated with any other effects. Therefore, the
significance of the main effects p 1 and p4 and the interaction p1p 4 can also be checked
by t-test as shown in subsection 2.3.6. An independent estimate of response error
variance based on 5 independent observations at point p 1 = p 2 = p 3 = p 5 = 0 is obtained.
lt iss~ = 3.35 with V6 =5-1= 4 degrees offreedom. For example, the t-statistic for the
effect of p 4 is
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 271
t4=_1J_= 1. 51 =35
s&,Jd; .J3.35x0.055556 · '
and for the effect ofthe interaction p 1p 4 is
- lbl41 - l-1.311 -
tl4- u- -2.86.
se...;d6 .J3.35 x 0.0625
The t-statistic for the effect of p 1 is 11 = 29.21. The critical value of t-statistic at
significance Ievel 0.05 is tr =t(v8 ,a!2)=2.116. Therefore, the effects of p 1 , p 4 and
p 1p 4 are significant.
Using the independent estimate of response error variance one can test the model
adequacy by formula (2.54). Compute
F= s~2 = 10.66 = 3. 18 .
S8 3.35
The degrees of freedom are vR = 16, V =4 and the critical value of F-distribution for
8
significance Ievel 0.05 is Fr = F(0.05,16,4) = 5.84. As F <Fr this test shows that the
model is adequate.
The multiple correlation coefficient is 0.975 and the corresponding F-ratio
computed by (2.58) is F=43.59. The degrees of freedom for this test are
vM = k - 1 = 7 , vR = N - k = 16 . The critical value of F-distribution for significance Ievel
0.05 is Fr = F(0.05,7,16)= 2.65. The multiple correlation coefficient is significant
because F > Fr.
In mass production there are errors in three of the process parameters: p 1 , p 2 , p 3 .
The standard deviations ofthese errors in coded values are s1 = 0.15,Sz = s 3 = 0.05. The
fourth factor is free of errors in mass production and we put s4 = 0. The errors cause
bias in the mean value in mass production which is equal to
m
Ir (ß:E.) = ~);;S;2 = 5.96 X 0.15 2 + 3.61 X 0.05 2 + 3.46 X 0.05 2 = 0.1518.
i=l
For example, compute the predicted response in mass production for the
following parameter set: P 1 = P 2 = P 3 = p4 = -1:
"'
HOT= 2"
L..J b s
2 4
m
i=1
ll I
+""' ""'b
m-1
,i..J
m
L..J s s
i=l j=i+l
I)
2
I
2
)
=
s; = 3.35,
• bias
or
The transmitted variance varies over the factor space. In some of the points it
may be much higher than the bias. For example, if PI = 1 and p 2 = p 3 = p4 =-1
y =95.84+0.15 =95.99
and
s = st; + s; - bias = 15.2687+0.0363+3.35-0.1697 = 18.4853.
2
The contour plots of mean and variance of the molybdenum extraction degree are
shown in Figure 5. 6 and Figure 5. 7. The corresponding three dimensional surfaces are
given in Figure 5. 8 and Figure 5. 9. The values of the parameters PI and p 2 are fixed as
follows: PI = 1, p 2 = -1.
0.
0.
rPI
0.
0.
-0.
-0.
-0.
-0.
-0.5 0 0.5
p3
~
Figure 5. 6. Mean values contour plot for molybdenum extraction degree (PI =1, p 2 =-1)
274 CHAPTER5
0. E 15.8 -
0.
EHI..
op. 16.3 -
-
0. ~
16.6
0'- 17 -
-0
~ 17.5 -
-0
18
-0
1
-1
~
-0.5
18.3 -
Figure 5. 7. Variance contour plot for molybdenum extraction degree (Pi =1, p 2 =-1)
100
98
96
94
92
1
Figure 5.8. Three dimensional surface for molybdenum extraction degree mean value
(Pi =1, P2 =-1)
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 275
18.5
18
17.5
17
16.5
16
15.5
1
Figure 5. 9. Three dimensional surface for molybdenum extraction degree variance ( p 1 = 1, p 2 = -1)
•
5.6. Bibliography
Morrison (1957) gives an early example of using error transmission formulae to study
variability in engineering designs. Box (1963) and Draper & Beggs (1970) considered
error transmission from factors to the response in connection with experimental design.
Model based approach, which takes into account transmitted errors, is used by
Stancheva and Papazov ( 1972) for quality improvement of electric motors. Vuchkov and
Boyadjieva (1981) give models (5.36), (5.37) and (5.38) ofmean value and variance ofa
performance characteristic, affected by errors in factors. Formula (5.22) without the high
order terms (HOl) is proposed by Box and Fung (1986) for minimizing transmitted
errors.
A.5.1. Derivation of mean value and variance models for second order polynomials
with m factors
Suppose that the performance characteristic's model is a second order polynomial of the
form
(5.16)
(A.5.1)
or
A.5.2)
We assume that:
• Errors in factors e = (e1 , e 2 , ... ,em J are independent,
• e and & are independent of each other,
• All error distribution moments higher than fourth order are negligible.
• E(e)= O,E(e)= 0, E(eer )= L., E(e;2 )= CJ;2 ,E(e( )= f.J; 3 , E(e: )= f.J; 4 ·
1. Model of the mean value in mass production
We will prove the following equation:
5.10)
5.22)
where
(5.23)
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 277
a 2 = var[y(p)]=var[7J(p)+erß+2era'p+era'e+e ]=
A.5.3)
(A.5.4)
(A.5.5)
iii)
(A.5.7)
where v includes all possible covariances between pairs of errors, for example
eiej and e.er for i,j,r,s = 1,2, ... ,m and i,j not coinciding with r, s.
One can write the variances in the first two terms of(A.5.8) as follows:
(A.5.9)
278 CHAPTERS
(A.5.10)
To compute the covariances u between pairs of errors e;e i and e,e1 one must
consider all possible combinations ofindices i,j,r,s=l,2, ... ,m for i,j not coinciding
with r, s. The following four combinations are possible:
•i:t:.j:t:.r:t:.s,
• i=j, r :t:. i and r :t:. s,
•i=j=randr:t:.s,
• i = j and r=s, buti :t:.r.
Taking into account the independence of errors one can find that for the first
three combinations
For the fourth combination ofindices one can write the following equation:
Consequently u = 0 .
(A.S.ll)
i=l i=l j=i+l
(A.5.14)
The last termoftbis expression is zero, because E(e;) = 0. Hence, (A.5.14) is rewritten
as follows:
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 279
m m m m
~ = E(e;,'LL 1: rsere.) = LL 1: rsE(eiere.}.
~1~1 r~s~
By definition 1: u= ßjj and 1: iJ= }_A· Hence, Jri = ßufl.i 3 and (A.5.13) is equal to
2
m m
2nr (ß+21:pf = 2L(ßi +2L 1: iip)ßufl.i3 =
i~l j~l
(A.5.15)
m m-1 m
az = P"L.ß + 4pr 1: "L.1l P + 4P"L.1l P + L pi; (ui4 - a: )+ L L ßi:aiza: +
i=l i=l j=i+l
= (ß+21:pf'E.(ß+21lp) +
J,
Taking into account that
(5.50)
(5.48)
m m-1 m
and a full or fractional two Ievel design is used. As for this design C = (Fr F t = N- I
1
In order to get the final ·form of (A.5.16), one must compute 1fF = g- E(g) and
\f = E(lfFifFr) using the notation g =h -f where for model (5.49)
Therefore,
QUALITY IMPROVEMENT THROUGH REDUCTION OF THE ERRORS 281
Taking into account that E(ei) = 0, i =1,2, ... ,m, and that the errors in factors are
independent one can see that E(g) = 0 and lf/ = g - E(g) = g. The diagonal elements of
'I' =E(lfl'lfl' r ) are
where k is the number of elements for g. As the errors ei are independent '!'11 can be
computed as follows:
~~=0,
~1 = d; for i = 1,2, ... ,m,/ = 2, ... ,m+ 1,
(A.5.17)
~1 = d;p: +djpi2 +d;dj for i = 1,2, ... ,m-1;j=i+1, ... ,m
and l=m+2, ... ,k.
Formula (5.50) is obtained after substitutins sj2 for aj2 and the values of '!'11 in
(A.5.16).
m m-1 m m
If symmetrical second order design is used the unbiased estimate of the variance in mass
production is
(5.52)
282 CHAPTER5
Proof We use sorne elernents of the proof of equation (5.50) given above. The
m
polynornial (5.7) differs frorn (5.49) only by the terrns ·Lß;;P;2 . Therefore, the dirnension
i=I
(5.48)
For syrnrnetrical second order designs the rnatrix Cis ofthe form
where lfll' t = 1,2, ... ,k are elernents ofvector 1f1 and 1{11 = 0.
Therefore, the last terrn in the brackets of (5.48) is
(A.5.19)
1=1 1=1+1 l=m+2+r 1=1+1
(A.5.20)
The first 1+ m + m(m - 1) I 2 diagonal elements of the matrix 'I' are given by
(A.5.17). The other diagonal elements ofthis matrix are
(A.5.21)
for I= m + 3 + m(m -1)/2, ... ,k. For normally distributed errors J..li3 = O,J..l; 4 = 3at and
(A.5.22)
Putting (A.5.17) and (A.5.22) into (A.5.20) and taking into account the diagonal
elements ofC we obtain (5.52) after replacing d, with their estimates s;2 .
•
A.5.3. Derivation of matrix 'I' for a full second order polynomial model
m m-1 m m
Suppose that the errors in the factors are with zero expectations, they are uncorrelated
and all distribution moments of order higher than second are neglected.
Proof The proofis similar tothat given in Example 5.8.
The vector of regressors is
284 CHAPTER5
=[1 .A-H; ··· Pm+em AA+JJ.f?.+Pf1-f-i1f?. ··· P",..JJm+P",_lem+Pnf",..l+e",..lem If+2J.ll1-+if ··· Jl..+7p.fm+i..]r
Under the assumptions for uncorrelated errors and zero means of their distributions the
expectation of g can be computed as follows:
Q is mxl vector with elements deterrnined by the errors in the quadratic terms of
the polynornial. An element of Q is
i=1,2, ... m.
'I'u is mxl matrix determined by errors in the linear terms (pk) and interactions
{pi p i). An element of this matrix is
'I'LQ is a mxl matrix determined by errors in the linear terms (pi) and quadratic
terms (p;). An element of this matrix is
0if i*-}*k*l
PiPICTi if i *- j *- l,i = k
2
'1'1Q is a /x/ matrix determined by errors in the interaction and quadratic terms.
An elementoftbis matrix corresponding to the interaction PiPi and the quadratic term
p; is:
286 CHAPTER5
•
CHAPTER6
6.1. Introduction
287
288 CHAPTER6
(6.1)
where y is a given value of the performance characteristic and r is the target value. The
scale coefficient kc can be computed as shown in Section 4.2. Its value doesn't play any
role in the optimal parameter settings if only one performance characteristic is of interest.
However, it is important in multicriterion optimization problems.
The expected lass is defined as follows:
As shown in Appendix A.4.1 the expected loss can be presented in the form
(6.3)
(6.4)
The optimality criterion (6.4) is useful in cases when a specific target value r is
considered as best choice of the performance characteristic. Further on we call them
specific target value is best cases.
The expected loss (6.4) consists of two parts: one of them ( kc ~(p )- r] 2) is due
to the deviation of the performance characteristic y from the target r and another
( kß 2 (p)) is due to
the variation of the performance characteristic caused by errors in
product parameters in mass production. Note that if the target r is not specified well this
can cause uncertainty in the solution. If r is much higher or much smaller than it can be
realistically achieved, the first term in the brackets of (6.4) is prevailing over the variance
and determines the choice of the product parameters. That is why when using (6.4) r
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 289
(6.5)
Engineers can find the locus of the minimal values of R(p) for different a and
make decision about a taking into account engineering considerations that are not
formalized in the models.
Sometimes the value of the performance characteristic must be as small as
possible. For example, concentration of some unwanted substance can be a performance
characteristic in a medicine production. Obviously the target must be absence of this
substance and • == 0. If the performance characteristic can not be negative as this is in
this example, then putting • == 0 in (6.4) one obtains the following optimality criterion
for the smaller the better problem:
(6.6)
(6.7)
290 CHAPTER6
If T = 0 for the smaller the better case and r-1 =0 for the !arger the better case
are unattainable then it is better to use target values closer to the realistic ones and to
apply formula (6.4).
The definition of the quality improvement optimization problem, which was already
given in Chapter 5, is:
Minirnize the performance characteristic's variance er
under the condition that
the mean ji is equal to a target value T.
a)
b)
Figure 6.1. Components of the expected loss (a) extremal points of mean and variance do not coincide
(b) coinciding extremal points of the mean and variance
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 291
That means to find such values p. of product parameters which satisfy the
following conditions:
y(p.)= r (6.8)
In general the optimal solutions obtained by the criteria (6.4) and (6.8) do not
coincide. This can be seen in Figure 6.1a where the components of the expected lass
kc ~- r J and kJi are shown. In this figure kc is equal to I.
Let p. and p L be the optimal solutions chosen according to criteria (6.8) and
(6.4), correspondingly. If the extremums of kc ~- r J
and B(k) do not coincide
(Figure 6.1a) then the variance obtained by (6.8) is 0: which is !arger than &:in. The
solution p L chosen by minimizing the lass function (6.4) does not provide neither
minimal variance nor minimal deviation of the performance characteristic from the target,
but finds a comprornise between these components of the lass function.
As shown in subsection 6.3.2 in a very important case the solutions obtained
through lass function minimization and conditional rninimization of variance are the same
(Figure 6.1.b).
Taguchi's signal-to-noise ratio can also be used as optimality criterion with the model-
based approach. For the specijic target value is the best case it is defined by Taguchi as
follows:
(6.9)
Substituting y(p) for 17(p) and 0' 2 (p) for a 2 (p) we obtain
(6.10)
(6.11)
(6.12)
292 CHAPTER6
6.3. Robustness against errors in product parameters: the larger the better and the
smaller the better cases
In this section we consider a relatively simple but important case, characterized by the
following asswnptions:
• Only errors in the factors are taken into account, i.e. there are no external noise
factors. There are many practical problems corresponding to this assumption, see the
friction welding example of Section 6.8, Wheatstone bridge example and band-pass filter
examples considered in Chapter 8, etc.
• The product performance characteristic is fitted by second order polynomial
model.
• The variances of the errors in factors are constant over the region of interest.
This means that the variances do not depend on the nominal values of the product
parameters.
• The errors in product parameters are independent and normally distributed.
• The bias in the variance model is negligible.
In this section we consider optimization procedures without constraints on
parameters and for one performance characteristic of the product. Though narrowing the
applications these assumptions form a dass of problems with considerable practical
value. They help to understand better the mechanism of error transmission from product
parameters to the performance characteristic. Relatively simple solutions are possible for
this case. We defer the more complicated optimization problems for the next sections.
Suppose a second order polynomial model is obtained through experiments with
a product proh)type:
m m-1 m m
(6.14)
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 293
and
(6.16)
... m m-1 m
HOT= iiA:CT,4 +
1=l
L Lhi~if;a7.
i=l j=1+l
(6.17)
In this section we use the matrix forms of equations (6.15) and (6.16) which are
(6.18)
and
iP = (b +2Bp) I.(b +2Bp) +HOT +CT}' (6.19
,CT;).
where I. = diag(CT 12 ,CT; , ...
Under the assumption that the variances of the errors in the parameters are
constant over the region of interest the high order terms (HOl) are constant. Therefore,
the performance characteristic's variance in mass production{6.19) consists oftwo parts:
• A part that can not be minimized through choice of product parameters:
' 2
HOT+CTc.
• Apart Ci. which can be minimized with respect to product parameters p:
Consider the case when the performance characteristic depends on a single parameter of
the product. Suppose that a second order regression model is obtained through
experiments with a prototype
(6.21)
and
(6.22)
. Let us find the extremum of y(p) by putting its first derivative equal to zero:
JY{p)
- - = b1 +2bnp=O. (6.23)
dp
(6.24)
(6.25)
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 295
From (6.23) and (6.25) one can see that the stationary point of cr 2 {p) coincides
with that of y(p) and its abscissa can be computed by formula (6.24).
The second derivative of the variance is
(6.26)
These results are illustrated by Figure 6.2. It shows that the higher the slope of
J(p) the higher the variance cr (p).
At the stationary point, where cJY(p )I dp = 0 the
2
variance depending on the parameters is zero ( if. = 0), while the performance
characteristic's variance is
-2 -2 2 2 2 4 2
a =a. +HOT+a, =HOT+a, =2b11 aP+a,.
A A
This result is very interesting from practical point of view but we defer this discussion for
the next section.
-
A
y
y=4+p-2l.
Suppose there is an error in the product parameter p in rnass production with
zero rnean and variance er!= 0.2, while the response error variance is er!= 0.1.
The rnean value of the perforrnance characteristic in rnass production can be
cornputed by (6.21) as follows:
y=y+b er; =4+ p-2p
11
2 -2x0.2
or
y = 3.6 + p- 2p 2•
or
ii? =(1-4py x0.2.
ii 2 = 0.2(1-4py +0.42
Figure 6.3 shows the curves corresponding to y,Y,ii?,ii 2 and HOT+er;. The
coordinate ofthe stationary point is cornputed by (6.24) as follows:
b 1
P, =- 2~11 =- 2x{-2) = 0.25.
In this point the pararneter dependent part of the variance is zero ( ä; = 0) and the total
variance is minimal:
-2 2
er =HOT +er. = 0.42.
A
Figure 6.3 shows that the higher the slope of y, the higher the variance ä;.
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 297
s~-------------------------.
-1 0 l1s
•
6.3.3. MULTIVARIABLE ANALYTICAL SOLUTION
The results obtained in subsection 6.3.2 can easily be generalized for products depending
on several parameters.
Consider product/process performance characteristics subject to the assumptions
of subsection 6. 3. 1. In this section we consider only the smaller the better and the !arger
the better cases, i.e. the cases when the target value coincides with the extremum
(minimum or maximum) ofthe performance characteristic.
Assurne that the product performance characteristic depends on m parameters
p = (p1 p 2 • . • p m Y and it can be fit by second order polynornial ( 6. 14). Equations
(6.18) and (6.19) give the mean value and the variance ofthe performance characteristic.
Equating to zero the vector of first derivatives of y(p) with respect to p we
obtain
1 -1
p,=--B b. (6.28)
2
07(p) = 2B.
opopT
oii 2
- =4B~.(b+2Bp.)=O. (6.29)
op
Comparing (6.27) with (6.29) one can see that the stationary point of variance
surjace coincides with the stationary point of the perjormance characteristic's mean
va/ue surjace. The matrix ofsecond derivatives ofvariance is 8Br~.B. It is always non
negative definite because the matrix ~. is non negative definite. Consequently, the
stationary point P. a/ways corresponds to the minimal variance.
Substituting oy(p )I op = b + 2Bp in (6.20) we obtain
(6.30)
... m m-1 m
cP(pJ=nor +a-; = i'fA7a-; + L :Lbi~a;o; +0:.
4 (6.31)
j=:l j:::;} j::;J+}
Further decrease of the variance is only possible through tightening the tolerances
( decreasing of 0:, i = 1, 2, ... , m) but not through choosing different pararneter values.
At the stationary point P. the value of the expected loss L, given by (6.4) is
minimaland .Y. = r. Therefore,
6.4. Model based robust product design in cases when a specific target value is best
Assurne we want to attain a specific target value r for the performance characteristic's
mean ji in mass production and that r does not coincide with the extremum of y(p). In
this case the method of subsection 6.3.3 can not be applied. For the moment we do not
consider the case when some ofthe errors in product parameters have zero variances, i.e.
in this section we assume that :E. is positive definite matrix. The case with non- negative
definite :E. is considered in subsection 6.4.2.
Suppose that the models ofthe mean value (6.18) and variance (6.19) in mass
production are given. Using (6.18) one can find the contours of y(p) and in particular
the one for which y(p) = r. For the case with two parameters this contour is shown in
Fig 6.4.
Figure 6.4. Mean and variance contours for a product with two parameters
(6.33)
3. Find the maximum eigenvalue f-lmax ofmatrix _!_:E- 1B- 1 which has the same sign
4 e
as r- ji,. For example, if r- ji, < 0 then f-lmax is the maximal negative eigenvalue.
Denote tmax the eigenvector that corresponds to f-lmax .
4. Compute Ci.min as follows:
(i.min = T- Ys . (6.34)
f-lmax
p.1 ,2 -- p s -+_!_B-1 - -
2 tmax a.min -
(6.35)
302 CHAPTER6
(6.36)
•A second order response surface with a saddle point (Myers and Carter (1973)):
(6.38)
For both cases we assume standard deviations O't = 0. 05, 0'2 = 0. 025 and correlation
coefficient p 12 = 0. 6. Therefore, the error covariance matrix is
0.0025 0.00075 )
(
L.. = 0.00075 0.000625 .
Table 6.1 shows how the solutions are obtained. Since in both cases r- Ys < 0,
1
f..lmax is chosentobe the highest negative eigenvalue of -L.:tB-t
4
The solutions are shown in Figure 6.5 and Figure 6.6. They were confirmed by
grid search.
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 303
78.8988 82.17
(2.272 3.946) (-1.01 - 8.61)
y, = h _!_brB- 1 b+tr(B~.)
0 _ 79.991 83.724
4
Stationary point coordinates
P =-_!.B- 1b
s 2 (0.2, 0.5) (-0.439, -0.311)
1 ..--1 -1
Eigenvectors of - <... B
4 • 14.728 - 20.201) 3.900 - 24.694)
( (
T=(t 1 t 2 ) 14.648 47.806 34.831 35.872
1 ..--1 -1
Eigenvalues of: - <.... B
4
(- 28.648 - 545.414) (- 25.205 98.302)
(u1 p2)
Maximal eigenvalue Pmax -545.414 -25.205
Eigenvector corresponding to
Pmax: t:,. ( -20.201, 47.806) (3.900, 34.831)
Target value T 76.00 75.00
(j •min = .Jrr(T----=-y,......,.)--,--fP-max- 0.085 0.588
Optimal parameter va1ues
(1.567, -0.874) (-1.503, -1.043)
P•1 2 -- Ps +_!.
-
~ T- Y. B-1 tmax (-1.167, 1.874) (0.625, 0.421)
• 2 Pmax
Minimal value of variance O:rnin
0.0073 0.346
304 CHAPTER6
2.5
1.5
rs
A
~(p)=76
p 0
2
-0.5
-1
-1.5
-2
-2 -1.5 -1 -0.5 0 0.5 1.5 2
p
I
~
Figure 6.5. Optimal solutions for second order model with negative definite matrix B
2.5~------~----------~----------------~
0.5
tP, 0
-0.5 a2
*min
=0.346
-1
-2
As shown in subsection 6.3.3 the variance ä; is equal to zero ifthe target coincides with
the extremal value of the performance characteristic. In general, when this condition is
not satisfied, ä; > 0. However, in a special case when the nominal value of at least one
product parameter can be set without error in mass production one can obtain ä; = 0
even if r does· not coincide with the extremum of y(p).
Let us consider a product or process for which mi of all m parameters can be set
without errors in mass production and consequently, they are non-random. Their
variances and their covariances with other errors in the parameters are zero. Hence, the
covariance matrix of errors in factors can be written in the following block-matrix form:
where L m-m is a matrix of rank m - mi and 0 are matrices with zero elements.
I
(6.40)
Figure 6. 7 illustrates an example with two product parameters PI and p 2 . Suppose that in
mass production the parameter PI is set with errors, while p 2 is free of errors. In this
case the contours of variance a; (p) are parallel straight lines. The value of ä; is
constant at each point of a given line. The line that passes through the stationary point of
y(p) corresponds to the crest ofthe ridge. As shown in subsection 6.3.3 the value of ä;
at the stationary point p_, is equal to zero. Consequently, all over the ridge crest the
variance ä; is zero. In order to solve the robust product design problern one must find
306 CHAPTER6
the points of intersection of the ridge crest with the contour y(p) = r. In Figure 6. 7
they are denoted by u\~~~ and u~:,~1 .
~
I
I
I
-----1
l-----
---~---
--·---
1
I
I
1
I
I
L---
__ ,e---
------1
I
y=r
u
1
Figure 6. 7. Parameter optimization for a product with two parameters, p 2 free of errors
(6.41)
where Ä. 1 ,Ä. 2 , .•• ,Ä.m are the eigenvalues of Le and R,,Pz, ... ,Pm are new canonical
variables. The following equations link u= (u 1 u2 •.. uJr with vector
p= (R, p2 Pm J:
p = sru, (6.42)
or
u=SP. (6.43)
where S is a matrix of the eigenvectors corresponding to .Ä.P .Ä. 2, ... , Ä.m. If Le is diagonal
matrix then S is identity matrix and the axes u; and P; coincide.
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 307
It follows from (6.41) that the value of 0: in the direction of a given principal
axis, say Pm, is equal to AmP,;. Suppose without Ioss of generality that the parameter Pm
is free of errors in mass production. In this case d., = 0, the corresponding diagonal
element of L 0 is zero and the covariances ofthe errors in Pm with the errors in the other
factors are zero and hence, Am =0. Consequently, the variance 0: at each point along
the principal IDP.s Pm is equal to zero, i.e. 0: = AmP,; = 0.
For the algorithm we need the matrix R = !srB- S 1 (see Appendix A.6.2).
4
Denote by R,. the diagonal element of R which corresponds to Am= 0 and
Consequently, to the parameter Pm which can be set without errors.
The algorithm runs as follows:
1. Compute the eigenvalues A1 ,A2 , •.• ,Am of L 0 and the corresponding matrix of
eigenvectors S. Specify the eigenvalues which are equal to zero.
2. Find the coordinates ofthe stationary point of y(p) using (6.28):
1 -1
P.=--B b.
2
(6.44)
308 CHAPTER6
(6.45)
2.5---...-----r--.----,,..----.----,----r---,
1.
-0.
-1.
Figure 6.8. Optimal solutions for rnodel (6.37) with lti = 0 or lt2 = 0 in coordinate systern PI, p 2
For the minimax surface (equation (6.38)) the variance ;;; is equal to zero only in
direction of ~ axis if -r<ji, (colurnn 3 ofTable 6.2), or in the direction of ~ if -r> .Y.
(colurnn 6).
Consider colurnns 3 and 4 of Table 6.2 that correspond to the case when p 2 can
be set without errors in mass production. In this case Rm = -0.0139 and the mean value
for the stationary point is .Y. =83.734. Suppose that the target value is •= 75. We
obtain z-- .Y. =75-83.734 = -8.734. The signs of z-- .Y. and Rm coincide and the
solution can be found as shown in colurnn 3. However, if z-= 88, then z-- .Y. =4.266 and
there is no solution in the direction of ~ axis providing ;;; = 0.
These results are shown in Figure 6. 9 in coordinate system PI , p 2 . If p 2 is free of
=
errors then ;;; = 0 along the axis ~ u2 • The maximum of ji along this axis is in the
stationary point, where .Y. = 83.734. Moving the operating point from p,along ~ axis
one can only decrease ji. That is why if the target is z-= 75, there are two optimal
solutions p.I (1t2 = 0) and p. 2 (1t2 = 0) for which the target is achieved , while ;;; = 0.
However, for the case when z-= 88, the target value is higher than the greatest
possible response along ~ axis (ji, =83. 734). Therefore, in this case a solution
providing ;;; =0 does not exist.
For z-= 88 a solution can be found along the axis ~ ui. In this direction =
.Y. =83.734 is the minimal possible value. Consequently, for z-= 88 the variance Cl. is
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 311
equal to zero only if p 1 is free of errors in mass production. The corresponding solutions
are denoted in Figure 6. 9 by p. 1 (A-1 = 0) and p. 2 (A-1 = 0). The results of computations
are given in column 5 for r= 75 and column 6 for r= 88.
Ifthe achievement of r is impossible (for example, if r= 88, but p 1 can not be
set to a given value without errors in mass production), then the optimal solution can be
obtained through the algorithm of subsection 6.4.1 or by numerical procedures.
2.5,-----~-----------,------------------,
1.5
i
0.5
P, o
-0.5
-1
-1.5
-2
-2.5L___________________ _ L_ _ _ _l-~----------~
-5 5
Figure 6.9. Optimal solutions for model (6.38) with A1 = 0 or A2 = 0 in coordinate system p 1,p2
•
6.5. Model based decision making in quality improvement
However, if the optimal value p 1opt corresponding to y 1 is chosen, the values of y 2 and
o;, are unsatisfactory and vice versa.
Figure 6.1 0. Compromise optimization for two performance characteristics of a single parameter
product
In this situation if both performance characteristics are equally important for the
customer a compromise product parameter value Pc can be chosen as shown in Figure
6.10. Neither the mean values, nor the variances for both performance characteristics are
the best ones, but Pc provides some reasonable values for each ofthem.
If one of the performance characteristics (say y 1) is more important than the other
(yJ, then weights w1 and w2 can be given to each ofthem (w 1 > w2 ) so that the solution
p cw is shifted towards p 1opt. The choice of the weights is a matter of customer or
engineer's preferences and it can be formalized by the quality function deployment (QFD)
approach (Mizuno and Akao (1994)).
The problern of compromise optimization for more than one parameter or more
than two characteristics is not much different. Consider a product or a process with r
performance characteristics: 171' 172 , ... , 17,. The target values for each of them are denoted
<1 , <2 , ... , <,, correspondingly. They can be achieved by choosing appropriate values of
the product (process) parameters p = (p 1 p2 ... PmY. Suppose there are errors in
parameters in mass production e = (e 1 e2 eJr, so the measured values of the
performance characteristics in mass production are
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 313
Models of the mean values y(p) and variances ii 2 (p) of the performance
characteristics can be obtained using the methods in Chapter 5. They can be employed
for optimization in two different ways:
• Through compromise optimization,
• Through search of extremum of one performance characteristic, while keeping
the others within prescribed intervals.
Consider first the compromise optimization The criteria for compromise
optimization can be defined by generalizing the criteria considered in Section 6.2.
Generalized loss function. A compromise optimality criterion can be obtained by
generalizing the loss function as follows:
(6.46)
This criterion is based on (6.4) and can be used when a specific target value is the
best. For the smaller the better and the larger the better cases generalized compromise
criteria based on (6.5) and (6.6) are possible.
The weights kci can be chosen as in Section 4.2. Their dimension is
currency unit
(dimension of the performance characteristic Y'
so that Lg is the loss imparted to the society by the products, measured in the
corresponding currency units.
Forasingleperformance characteristic, the coefficient k in the loss function is not
influencing the optimal solution. In contrast, in the generalized loss function the
coefficients k; plays an important role in making the final decision because they put
different weights to each performance characteristic.
There are two problems which can arise with this generalized criterion:
• lt may be difficult to calculate the weights k ci because it may not be easy to
define the cost of repairing or replacing the product which is necessary for these
calculations.
• The generalized loss function Lg is at least of fourth order with respect to the
product parameters and the analytical methods developed in the previous sections of this
chapter can not be used. In this case the solution can be found using the numerical
procedures given in subsection 6.5.2.
314 CHAPTER6
where w; are weights chosen to take into account the importance of the performance
characteristics for the customer. If all performance characteristics are equally important,
then w1 = w2 = ... = w, = w and (6.47) gives the center of a polyhedron with vertices at
the individual parameter vectors for each performance characteristic.
Compromise based on generalized performance characteristics or desirability
functions. Desirability functions for the mean values of the performance characteristics
d~J and for the variances d(a 2 (y;)) can be defined as in subsection 3.10.1. A
desirability function which takes into account both the mean value and variance of the
performance characteristic is
Obviously 0 :;::; cf; :;: ; 1. The generalized desirability function can be defined as in
subsection 3. 10.1 :
Vuchkov and Boyadjieva (1988)), but the computational methods are different. Vining
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 315
and Myers (1990) use Lagrangemultipliers and add a constraint for spherical region of
interest.
For the larger the better and the smaller the better cases Vining and Myers
consider the mean value y as a primary function. Then they propose to establish several
possible values for the variance, find the resulting optimum values for y subject to these
constraints on the variance, and select the best compromise combination of factors.
Under the conditions of Section 6.3 the stationary points of mean and variance surfaces
coincide and the procedure of Vining and Myers is not necessary. However, if there are
external noise factors the optimal points for mean value and variance do not coincide and
the dual response approach is applicable. This problern is considered in details in Chapter
7.
Dei Castillo and Montgomery (1993) noted that the dual responseproblern could
be solved by standard non-linear programming techniques. They use the generalized
reduced gradient algorithm. This approach is more general and allows finding the optimal
solution in regions of interest with different shape.
Lin and Tu ( 1995) noted that sometimes it is better to use loss function
minimization than to employ dual response surface approach. Their argument is that it
may happen that for small deviations of y from the target T the variance can decrease
substantially. In this case the parameters that correspond to the minimal variance are a
y
better solution than the set of parameters corresponding to = -r and the minimization
of loss function better suits this choice. This problern does not exist if the target is
defined as minimum or maximum and the performance characteristic's variation is due to
errors in product parameters. As under the conditions of Section 6.3.3 the stationary
points of the mean and variance surfaces coincide, the optimal parameter values found by
loss function minimization and by conditional minimization ofvariance also coincide. For
a specific target value is best case small changes of the mean value correspond to small
values of the variance which are proportional to the gradient of the mean value function.
One can see from (6.15) and (6.16) that iffor some parameter changes the corresponding
changes of y are small, the changes of the values in the brackets of (6 .16) are small as
weil. In this case a big change can occur in Cl if the variances ~ of some errors in
product parameters are very high. In reality this usually is not the case and the problern
of Lin and Tu (1995) can hardly occur if the variation is due to errors in product
parameters. However, if some external noises also contribute to it then this problern can
really exist.
The analytical optimization procedures that we considered in Sections 6.3, 6.4 and 6.5
are easy for implementation. However, they have some limitations. They are
inconvenient in the following cases:
• The variances of errors in the product parameters are not constant throughout
the region of interest. In this case the contours of the variance function are not ellipses or
316 CHAPTER6
ellipsoids. The contours of the mean value of the performance characteristic also change,
because their models are no Ionger second order polynomials of product parameters. The
friction-welding example considered in Section 6.8 is oftbis type.
• The model of the performance characteristic is not a second order polynomial.
Though widely used, the second order polynomials sometimes do not fit weil the data
and third order terms have to be added to improve the goodness of fit. Other examples
are the mechanistic models. In Chapter 8 we discuss quality improvement procedures
based on mechanistic models.
• The bias in the variance model due to Substitution of estimates for the true
model coefficients is not negligible. In this case the variance is given by (5.48) and the
results obtained in Sections 6.3 and 6.4 arenot applicable. In subsection 5.5.3 is shown
that the bias depends on product parameters. That is why the stationary points of mean
and variance surfaces do not coincide. This is shown in Example 5.11. The
corresponding contour plots are given in Figure 5. 6 and 5. 7.
• Loss function is chosen as optimality criterion. The solutions proposed in
Sections 6.3 and 6.4 are based on the use of the specific properties of second order
response surfaces. The loss function
is a second order function of product parameters only if y{p) is linear with respect to p.
Even when y(p) comprises only linear and interaction terms L is of fourth order with
respect to the parameters.
The case of linear dependence of y(p) on product parameters is not interesting
for the quality engineer, because as shown in subsection 5.2.2, in this case response
variance reduction can be achieved only by tightening the tolerances and not by
parameter optirnization This is an expensive solution of a quality improvement problem.
• Engineers want to explore the response surface near the optimum. The
analytical methods can provide an optimal set of product parameters, but they can not
explain what happens with performance characteristics when the parameters are changed.
Engineers usually prefer to have a set of variants for discussion rather than only one
solution. In this case they can take into account some considerations which are not
formalized in the equations of the mean value and variance of the performance
characteristic or in the loss function. Such considerations can be prices of raw materials
or components, policy of management with respect to the vendors or the market, etc.
Contour plots of the mean values and variance of the performance characteristics in the
neighbourhood of the optimal point are very useful in making the final engineering
decision. Of course two-dimensional plots provide a limited insight on the process or
product characteristics and decisions made only on the basis of contour plots in case of
more than 2 factors may be incorrect. Nevertheless several well-chosen two-dimensional
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 317
plots can provide engineers with interesting information about the product. They can
compare the results with their own experience and eventually discover some unknown to
them features ofthe product/process.
• There are constraints on factors. This is a typical situation where numerical
procedures can be applied easier than analytical ones. Sometimes the constraints form
regions of interest with rather complicated form. Examples can be found in mixture
experiments (Vuchkov and Yonchev (1979), Cornell (1990)).
Analytical solutions of the quality improvement problern for constrained factor
spaces are possible (Vuchkov and Boyadjieva (1992)). However, they are rather
complicated and the numerical optimization may be preferable.
There are many algorithms for numerical optimization (Wilde and Beightler
(1967), Fiacco and McCormick (1967), Box, Davies and Swann (1969), Himmelblau
( 1972), Biles and Swain (1980), Gill, Murray and Wright (1981 ), Steuer (1986),
Luenberger (1989), Stoyanov (1990), Rustagi (1994)). They differ in speed of
computation, accuracy, reliability of finding extremum, ability for global optimization or
optimization in special conditions (for example, for ridge-type response surfaces). For
relatively small number of product parameters (up to 5-6) the grid search is very simple
and convenient method, because it can be used to obtain a number of variants which are
a basis for discussions. When the number of factors is great a quick solution may be
obtained by the methods in Sections 6.3 and 6.5. When they are not applicable fast
optimization gradient, random search or other procedures can be used.
A reduction of the dimension of the optirnization problern is possible in cases of
conditional minimization of variance and when a specific target value is best. If the target
value is r one of the factors can be expressed as a function of the others using the
condition y = r. This reduces the space dimension by one. Suppose for example, that the
mean value ofthe performance characteristic in mass production is given by (6.18) which
can also be presented as follows:
,.. m m-1 m m m
where
m m-1' m-2 m-1 m-1
m-1
h, =bm + "f..bzmPz'
i=l
318 CHAPTER6
F or a given iteration of the grid search procedure one can give constant values of
PI , p 2 , ... , Pm- I and can compute Pm by the formula
Substituting pi,p2 , ... ,Pm-I•Pm into {6.19) one can rninirnize Cl under the
condition y = -r.
If p m(1) or p m(2) are outside the interval of definition of Pm or if ~2 - 4hrflz < 0
then the corresponding values of ppp2 , .. . ,Pm-PPm are ignored and the algorithm goes to
the next iteration ofthe grid search.
A reduction of the dimension of the optirnization problern is possible in a sirnilar
way for other search algorithms.
Both the analytical and numerical optimization procedures given in this chapter
are developed for quantitative factors and for cases with performance characteristic's
variation due to errors in product or process parameters. Several extensions for cases
with qualitative factors or external noise factors are given in Chapters 7 and 9.
There are several important problems with the applications of the model based approach
for quality improvement. They are as follows:
Computation of the loss function and the cost of the product. We should not
forget that in quality improvement the emphasis is put on rninirnizing the performance
characteristics' variability at a low cost. Therefore, one can use the loss function and the
cost function as additional criteria for selecting variants for further discussion. The loss
function takes into account the loss for the society, while the cost function measures only
producer's costs. The computation ofthe loss function is considered in Sections 4.2 and
6.5.1. Sometimes it is possible to define a cost function C(p) depending on the product
parameters. A typical example is a chernical product which is a mixture of m
components. Let the proportions of the components in the rnixture be PP p 2 , •.• , p m.
Suppose that the cost of a unit of the i-th component is c;, i = 1, 2, ... , m . Denote also the
permanent costs which do not depend on the values of product parameters by c0 . Then
the cost of a piece of the final product is
In addition, upper Iimits vi can be imposed on the values of the variances for
each performance characteristic as follows:
(6.49)
Note that if the operating point p is chosen so that Y; (p) is equal or near to one
ofthe specification Iimits defined by (6.48), it can happen that a large part ofthe product
will be defective because of the random variation of the response around its mean value
y;(p). This can be avoided ifthe inequality (6.48) is strengthened as follows:
(6.50)
where Y; (p) ± K; is the interval within which random variations of the response are
expected.
If Y; is normally distributed one can roughly choose K; =3a[yi (p )], where
a[yi (p)] is the predicted standard deviation of the response in mass production. It can
be computed using (6.16). In generat the distribution of Y; in mass production may be
not normal because the errors in product parameters are transmitted to the response
through a nonlinear relationship. However, if the intervals of variation of the errors in
product parameters are not very large, this relationship can be considered as
approximately linear around the operating point. Therefore, if the errors in factors are
320 CHAPTER6
Use of contour plots. Contour plots of the mean value and variance are useful
tools for presenting experimental results and can help engineers to better understand the
underlying process and the error transmission mechanism. They are sections of the
response surface. In a contour plot only two product/process parameters are varied,
while the others are kept constant. However, several contour plots can provide engineers
with thought provoking information from which useful conclusions can be drawn.
The number of contour plots which can be obtained using models of the mean
value and the variance, is enormous. Two simple rules can be used for initial selection of
some of them:
• Usually the bestvariant is selected using optimization procedures or the Iist of
variants (Table 6.3). Then the constant values of the parameters for a contour plot are
chosen to be equal to their selected optimal values.
• Cantours can be plotted for some specific set of parameters which are of special
interest to the engineer. Such sets can be defined on the basis of previous experience or
some hypothesis about the product's behaviour.
Areas of admissible values of product parameters can be outlined taking into
account the admissible values of the response. If there are several responses of interest,
their contours can be superimposed to provide areas in which all inequalities of the type
(6.49) and (6.50) are satisfied.
Decisions based on the results of subsection 6.4.2. As shown in subsection 6.4.2
if the target does not coincide with the extremum, the variance CY; can be set equal to
zero if at least one parameters is kept without errors in the mass production. It can be
worthy to put special efforts to this end, for example, by purchasing high quality raw
material or by introducing an automatic control system designed to keep one of the
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 321
parameters constant. The decision which parameter to be chosen can be made using the
models.
If the parameter p 2 is free of errors during the production process the variance
contours form a ridge system oriented as shown in Figure 6.7. The optimal solutions,
allocated on ~ -axis, are u~!~~ and u~~pr· When p 1 is kept without errors the ridge system
is oriented as in Figure 6.11 and the optimal solutions on ~-axis are u~~pt and u~1~pr· For
example, if p 1 is a very expensive component, used in a chemical reaction, while the
other component p 2 is relatively cheap, then the solution must be chosen to contain
minimum amount of PI.
I
\
u2 \ \
\ \
\ \
\ u(I) I \
\ \
\ 2opt\
\ \
-
-·----
\ \
\ \ \
\ l--_l----
\ \ 1 \
\ \
--~--- \ \
\ \ \
\
\
\
\
Figure 6.11 Parameter optimization for a product with two parameters, PI free of errors
and
322 CHAPTER6
and
Consider first the Iogged specific resistance. Its mean value in mass production
can be computed by formula (6.I8):
where
b0 = 6.698, b = (- 0.286 - 0.349Y,
B= ( 0.559 - 0.0775J
andL = (O.OI 0 J
- 0.0775 -1.099 e 0 0.0 I .
The coordinates of the stationary point and the eigenvalues of matrix B for this
model are computed in Example 3.13 as follows:
The eigenvalues have different signs and therefore, the performance characteristic
has a saddle point.
The mean value of the performance characteristic in the stationary point is:
- =b
Ys 1 T B -! b+tr ( B.:...
-4b ._, )
= 6.69.
0
m m-1 m
a (p.) = HOT+a! = 2I;b;;u; + L
2 I;b;~a;~ +a!.
i=l i=l j=i+l
-0.
-0.
-0.
Figure 6.12. Contours of the mean value (bold lines) and variance of the logged specific resistance
The minimum of the mean value surface is not in the saddle point and therefore,
the optimal solution does not correspond to the stationary point.
Similar analysis is carried out for the logged electromagnetic wave damping
coefficient y 2 (Figure 6.13).
324 CHAPTER6
-0.
-0.
-0.
0.5
0.6
0.4
t X
2
0.2
0
-0.
-0.
-0.
-0.
-0.5 0 0.5
X
-4
b)
Figure 6.13. Conteurs of the mean and variance of the electromagnetic wave damping (a) mean value
(b) variance
The maximum of the mean value and the minimum of the variance for this
performance characteristic are obtained at the stationary point.
As the requirements with respect to y 1 and y 2 are controversial for a good
solution we need a compromise. It can be found using a desirability function as shown in
Chapter 3.
•
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 325
U sing an optimization procedure one can substantially decrease the variance of the
performance characteristic. Consider for example, the variance in mass production (5.11)
defined for a second order regression model provided that the bias due to estimation is
negligible:
t; ;~;~~if;~
2
a;.
m [ m ) m m-1 m
er 2 = t;erj2 ßi + 2ßiipi + ,'J::f!iJPi +2~~er~ + + (6.51)
Allterms in the right hand side of(6.51) are positive. The last two ofthem do not
depend on parameter values and can not be eliminated by parameter design.
Consequently, the parameter design can only help the elimination of the first term in the
right hand side of (6.51). Note that the first term is usually the greatest one and this
emphasizes the importance of parameter design.
The lower bound of the variance er 2 with given values of i = 1, 2, ... ,m is if;,
m m-1 m
er2 = 2r~er: +
i=l
L
i=l
L~cl;~.
j::::i+l
2
[ )
s 2(p.)= rs;
m m m m-1 m
bi + 2biiPi• + LbiJpj. +2Lbi;s: + L Lbi:si2 s:+s&2 . (6.52)
i=l j=l,j~i i=l i=l j=i+l
(6.53)
(6.54)
. Using (6.54) one can judge forthebest choice of 9;. Tightening the tolerances
makes the product more expensive. Therefore, the values of 91' 9 2 , ... , 9 m in (6.54) have
to be chosen on the basis of calculated cost reduction.
Suppose that low cost tolerances are used during the parameter design and the
variance s2 is not small enough. Suppose one wants to reduce it to s; .
Let the cost of .9;
times reduction of the i-th tolerance interval is y(9ji = 1,2, ... ,m. The cost of variance
reduction of the performance characteristic is
c(9) = fr(9; ).
i=l
C = USL-LSL
P 6u y
where USL and LSL are upper and lower specification Iimits correspondingly.
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 327
In this section only the problern of reduction of the transmitted variation from the
product parameters to the response is considered. Extensions for other cases are given in
the next chapters.
The quality improvement of a product or process can be done in the following
steps:
1. Provide a clear statement of the problem. Identify the responses to be studied
and determine the measurement method.
2. Determine the factors which will be varied during the experiment and their
intervals of variation. ·
3. Choose a design ofexperiments (see Chapters 2, 3 and 4).
4. Conduct the experiments without errors in factors and record the data.
5. Fit a regression model using the methods ofChapter 2. Pay a special attention
to model structure selection.
6. Carry out a test for Iack of fit and correct the model if necessary (see
Chapter 2).
7. Estimate the moments of the errors in the product parameters in mass
production as shown in Section 5. 3.
8. Put the estimated regression coefficients and error moments in the models of
the mean value and variance in mass production. Use the formulae of subsections 5.2.3
or 5.5.3.
9. Define an optimization problem. Compute the optimal parameter values p. and
the corresponding mean values jii (p.) and variances s/ (p.) for all performance
characteristics. Make an engineering decision on the basis of the numerical and graphical
information provided by the models of the performance characteristics' mean values and
variances in mass production. A discussion on these problems is given in the preceding
sections oftbis chapter.
Ifnecessary, iterate the previous steps.
10. Choose new tolerance intervals if s/ (p.) are higher than the desired values
si~ for some performance characteristics. Use the formulae given in Section 6.6.
11. Conduct a confirmatory experiment for verification of the obtained optimal
parameter values and new tolerance intervals.
deviation below 20. The factors are listed in Table 6.4. Denote by p;, i = 1, 2, ... , 6 the
values of factors measured in a physical scale. Their ranges are p;min :s; p; :s; p;rnax. Coded
values of the factors are used in the design of experiments which is shown in Table 6. 5.
They are denoted by P; and are computed by formula (2.28).
T ABLE 6.4. Factor levels, low cost tolerances and half ranges offactor variations for the friction
weld"mg example.
l
Speed Heating Upset Length Heating Upset
Factors pressure pressure time time
(rpm) (psi) (psi) (thous) (sec) (sec)
p; p~ p; p; p; p~
1-st level (p = -1) 1000 4000 8500 -30 2.8 3.2
2-nd level (p = 0 ) 1200 4400 9000 0 3.2 3.6
3-rd level (p = 1 ) 1400 4800 9500 30 3.6 4.0
Low cost
tolerances K; (%) ±10 ±15 ±15 ±10 ±20 ±20
Halfranges of
variationm; 200 400 500 30 0.4 0.4
1 2 3 4 5 6 7 8 9 10 11
1 -1 -1 -1 -1 -1 -1 104.3 38.04 104.32 85.20
2 -1 -1 0 0 0 0 135.1 27.89 135.09 111.43
3 -1 -1 1 1 1 1 128.6 45.16 128.63 99.78
4 -1 0 -1 0 0 1 123.8 42.41 123.78 95.01
5 -1 0 0 1 1 -1 134.6 45.59 134.57 115.43
6 -1 0 1 -1 -1 0 134.7 27.06 134.67 110.96
7 -1 1 -1 1 1 0 150.6 38.88 150.61 126.89
8 -1 1 0 -1 -1 1 116.2 43.24 116.22 87.40
9 -1 1 1 0 0 -1 151.2 45.03 151.17 132.09
10 0 -1 -1 0 1 0 134.2 31.73 134.15 109.69
11 0 -1 0 1 -1 1 134.1 41.28 134.10 104.49
12 0 -1 1 -1 0 -1 132.0 40.67 132.01 112.09
13 0 0 -1 1 -1 -1 125.8 38.47 125.76 105.85
14 0 0 0 -1 0 0 140.9 28.67 140.91 116.41
15 0 0 1 0 1 1 158.5 46.85 158.57 128.99
16 0 1 -1 -1 0 1 129.6 44.86 129.61 99.99
17 0 1 0 0 1 -1 164.5 50.00 164.51 144.64
18 0 1 1 1 -1 0 156.1 29.91 156.10 131.61
I9 I -I -I I 0 I 111.7 43.96 111.72 81.17
20 1 -I 0 -I I -I 109.6 46.74 I09.59 88.73
2I 1 -I 1 0 -1 0 146.7 30.60 146.73 121.36
22 1 0 -I -1 1 0 I25.6 37.59 125.63 100.19
23 1 0 0 0 -1 1 I28.3 44.40 128.29 97.79
24 1 0 1 1 0 -1 139.1 44.84 139.11 118.26
25 1 1 -1 0 -1 -1 119.9 44.07 119.94 99.15
26 1 1 0 1 0 0 148.0 36.84 148.01 122.58
27 1 1 1 -1 1 1 150.1 53.13 150.05 119.49
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCT.S 331
Column 8 of Table 6.5 shows the values of the tensile strength y obtained by
Barker and Clausing through an experiment without errors in factors. U sing noise arrays
they computed the standard deviations s"" which are shown in Column 9. Through
ANOVA technique Barker and Clausing studied linear and quadratic effects offactors on
the strength and on the signal-to-noise ratio. The interaction effects are not studied in
their paper.
We used the data given in columns 2 to 8 of Table 6.5 and applied stepwise
regression to obtain the following model in coded factors:
This model is quite good. The residual variance calculated with 17 degrees of
freedom is a; R::S; =0.00154, while the multiple correlation coefficient is 0.9999. Note
that the interaction between p 2 and p 5 can not be neglected.
For the model-based approach we need the variance matrix of errors in factors
Taking into account (2.28) the variances of the errors in coded factors can be
obtained as follows:
They depend on p/and vary in the factor space. Column 10 ofTable 6.5 shows
the predicted values of the tensile strength y computed by (6.56) for the case without
errors in factors. In presence of errors the mean value of the strength is y + tr( B:E.) . It
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 333
is estimated by substituting y and B for 17 and '8, correspondingly and are given in Table
6.5. One can see that the bias due to errors in factors is considerable.
Experimental data confirming these computations are available in Barker and
Clausing's paper (1984) only for run No. 1. The average value based on the experiment
is 85.23 for this run, while y + tr(B~.) = 85.20. The coincidence is satisfactory.
Substituting in (6. 51) the least square estimates b;, b,1 ,b,. for ß; ,ßl] ,ß;, we obtain
an estimate s2 for the performance characteristic's variance in mass production if. The
values of s2 and the corresponding standard deviations s are given in columns 13 and 12
ofTable 6.5. The fourth order terms are computed as follows
,.. m m-1 m
HOT= i'L,b,;a: + L Lb;~0';2 a~
J:=] i:::::] j:::::z+l
and are given in column 14 of Table 6.5. One can see that the value of HOT is rather
!arge (up to 64.7% ofthe value of s2 in run No.14) and can not be neglected.
Grid search is used to minimize s2 , while keeping the mean value of the tensile
strength approximately equal to 160. There are many solutions. Three ofthem are shown
in Table 6.6. The third one coincides with the solution given by Barker and Clausing
(1984). The results obtained by minimizing the lass function are almost the same.
Figure 6.14 and Figure 6.15 show the mean and variance of the tensile strength
respectively for p 1 = 0, p 2 = 1, p 3 = 1, p 5 = 1 and for low cost tolerances of the
parameters.
As the value of s does not meet the condition s< 20, a tolerance design is
carried out using the method described in Section 6.7. The contributions of the factors
and their interactions to total variance in mass production are computed by (6.53), where
p. = (o 1 1 0 1 oY. The contributions of p 2 , p3 , p5 , p6 and p 2 p 5 are significant.
They are given in Table 6.7. In this table we are using the following notation:
m
D
-D.5 0 0.5
~
Figure 6.14. Mean tensile strength for friction welding example (low cost tolerances of the parameters)
'"""
0.8
3300
0.6
3000
0.4
2700
{~
Pqx
QL.'<J
""""'
-{).4
~~
""'""'
-D.6
2700
-D.8
-1
-1 -D.5 0 0.5
~
Figure 6.15. Tensile strength variance for friction welding example (low cost tolerances of the
parameters)
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 335
Tbe tolerances are reduced (Table 6. 7). Tbe reduction is smaller tban tbat given
by Barker and Clausing ( 1984). As a result of tolerance design tbe following values of
tbe tensile strengtband its standard deviation are obtained: y+tr(m::.) = 180.19 and s
= 19.06. Witb tolerance of beating time reduced to 1/4 of tbe original tbe standard
deviation becomes 17.11 instead of 19. 06, while tbe mean value of tbe tensile strengtb is
180.20.
Same additional comments can be made on tbe basis of tbese results. Tbe
standard deviation at tbe optimal point p. = (o 1 1 0 1 oy
was found by Barker
and Clausing (1984) to be ssv = 40.15, wbile tbe computed by tbe variance model is s =
50.29. From Table 6.5 one can also see tbat tbe standard deviation estimated by Taguchi
metbad (s.. , column 9) is always significantly smaller tban tbe standard deviation
obtained througb model (s, column 12). This difference is due to a bias in tbe standard
deviation estimates obtained through Taguchi metbod. Usually tbe errors in product
parameters bave continuous distributions , while tbeir Ievels in tbe noise matrix are fixed.
Tberefore, in Tagucbi metbad tbe estimates of standard deviations are based on data
whicb are not representative for continuous error distributions.
Contour plots for mean and variance for tbe tensile strengtb witb tbe new
tolerances are given on Figure 6. 16 and 6. 17, correspondingly.
336 CHAPTER6
0.
0
0.
0.
180
0.
Po
-0.
-0.
-0.
-0.5 0 0.5
~
Figure 6.16. Mean tensile strength for friction welding experiment (new tolerances ofthe parameters)
0.8 400
0.6
0.4 - 370
300
0.2 v-
-......
{:r------
P<P
•
-0.4
370 ----
-0.6
t---- 300
-0.8
400
-1
-1 -0.5 0 0.5
A_____.
Figure 6.17. Tensile strength variance for friction welding experiment (new tolerances ofthe
parameters)
Some simulations were carried out to study this problem. Taking into account
that the multiple correlation coefficient for model (6.56) is very high and the
corresponding residual variance is very small, (6.56) was used to stimulate data close to
the physical data of Barker and Clausing (1984). Two simulation experiments were
carried out as follows:
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 337
(i) The values of factors were chosen according to L 27 and L 18 orthogonal arrays
which correspond to parameter and noise designs of Barker and Clausing. The only
difference with their experiments is that the data were simulated. For simulation of the
response we used (6.56) and normally distributed noise e with zero mean and variance
0: = O.OOI54 is added. These data were used for computation of the mean y{I8) and
the standard deviation s sv {I8) for each run of the parameter design. They are shown in
columns I5 arid I6 of Table 6. 5 respectively.
Mean values of the performance characteristic are not available in Barker and
Clausing's paper (I984) except for run No. I. However, one can see that y{I8) is very
close to the prediction y + tr(m::.) obtained by model (6.I8) which is shown in column
II. The standard deviations s sv (18) are very close to ssv obtained by Barker and
Clausing on the basis of physical experiments (column 9). Therefore, the conclusions
about the standard deviations based on simulated data are the same as for the data from
physical experiments.
(ii) In the second simulation experiment one thousand simulations were carried
out for each parameter design point. Noise design was not used. The values of the
performance characteristic Yu1 , u = I, 2, ... , 27, j = I, 2, ... , I 000 were generated using the
following procedure. In (6.56) Z 1" 1 = p + e,"1 was substituted for
1" p1" , i = I,2, ... ,6, where
e1"' is normally distributed random variable with zero mean, and 0: = O.OOI54 is added
to each value. The mean values y(1000) and the standard deviations s sv {1000)
calculated from I 000 simulations for each run of the parameter design are shown in
column (16) and (I7) ofTable 6.5. One can see that the mean values y{1000) shown in
column II are almost the same as these obtained by model (6.I8) and by the noise design
( column I5) The Standard deviations s"' (1000) agree weil with .~ computed by the
model (column I2) and are rather different from the obtained through noise matrix
(columns 9 and I6).
In the first simulation experiment the data were taken for discrete Ievels of the
noise according to the L 18 array, while in the second one a continuous distribution is
simulated. The conclusion is that the standard deviation computed on the basis of a noise
design is underestimated. Huele (1998) gives an explanation ofthis result. He shows that
if the high order terms (HOl) are significant Taguchi method underestimates the
variance. This can be seen from Table 6.5. In this table the ditferences between the
standard deviations computed by the noise matrix (column I2) are greatest for
experiments containing great high order terms (HOl) values, see for example run
No. I4.
338 CHAPTER6
6.9. Bibliography
It was shown in Section 6.3 that in case of constant error variances the steepest ascent
procedure can be employed for both mean value optimization and variance minimization
This method is described in more details in many books, For example, Box, Hunterand
Hunter (1978), Box and Draper (1987). Box and Fung (1986) consider steepest ascent in
quality improvement context. Several papers consider quality improvement problern as
dual response approach. Vuchkov and Boyadjieva (1988) discuss procedures based on
ridge analysis of mean and variance models for products with errors in factor Ievels.
Vining and Myers (1990) employ dual response approach for problems with both errors
in factor Ievels and external noise factors. Dei Castillo and Montgomery (1993), Lin and
Tu (1995), Myers and Montgomery (1995), Dei Castillo (1996) further employed this
approach.
a; =(b+2Bpfl:.(b+2Bp) (6.20)
is developed below, under the condition that the mean in mass production is equal to a
target value -r, i.e.
y(p) = b0 + prb + pTBp + tr (Bl:.) (6.18)
There are no constraints on the factor values.
First we introduce a new coordinate system by means of the Substitution
u=b+2Bp. (A.6.1)
Putting p from (A.6.2) in (6.18) we can write the performance characteristic's model in
the following form
(A.6.4)
This expression is written taking into account that B is a symmetric matrix and
(B-1 } = B- 1 . Removing the brackets in (A.6.4) we obtain the following expression:
(A.6.5)
(A.6.6)
This equation coincides with formula (6.33) of the algorithm. Putting (A.6.6) into
(A.6.5) we can obtain
(A.6.7)
(A.6.8)
A weH known result from the matrix algebra (Gantmacher (1959)) is that two
quadratic forms z,.; and Y, one of which ( ä;) is positive definite, can be simultaneously
reduced to canonical forms
(A.6.9)
and
- 2
(1'• =.)!2,1 + .,2)!2 +. ··+.,m,
)!2
(A.6.10)
where f.Jpf.J 2 , . .. ,f.Jm are the eigenvalues ofthe matrix ~ l::1B- 1, i.e. the roots offollowing
characteristic equation:
340 CHAPTER6
(A.6.11)
where T is the matrix ofthe eigenvectors of _!_ ~- 1 B- 1 . Putting (A.6.11) into (A.6.2) and
4 e
(A.6.12)
Denote by .Umax the largest eigenvalue of the matrix ±~:1 B- 1 and by ~max its
corresponding co-ordinate. lt is weil known (Gantmacher (1959)) that .Umax is the
maximum ofthe ratio Y I o-;, i.e.
y
max 0: = .Umax. (A.6.13)
This maximum is attained on the principal axis of Y. corresponding to J.lmax . One can see
from (A.6.9) that in the direction ofthis axis
Let the target for the mean value be ji = T. Then Y = T- Ys and from (A.6.14)
follows that
(A.6.15)
Finding the maximum of the ratio Y I o-; under the constraint Y = r- Ys =const means
determining such a value of ~max which minimizes o-; under the condition y = T. One can
find this coordinate from (A.6.15) as follows:
~ =+~ T-
max
Ys
f.lmax
-
(A.6.16)
The signs of •- Ys and .Umax must always coincide so that the value under square root is
always non negative.
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 341
There are two solutions for ;max , because the central second order hyper surfaces
are symmetric with respect to the stationary point. The minimal variance under the
condition y = -r can be found from (A.6.10) and (A.6.16):
(6.34)
(6.35)
Figure 6.18. Simultaneous transformation of mean and variance of the performance characteristic
342 CHAPTER6
The Substitution (A.6.11) transforms er~ into a hypersphere (A.6.10) and Y into a
second order hypersurface (A.6.9). Equation (A.6.16) defines the radius of a sphere
which is tangent to the hypersurface corresponding to ji = r.
•
Appendix A.6.2. Development of the algorithm of subsection 6.4.2
Consider the case when in mass production ~ of the product parameters can be set on
their Ievels without errors. Their covariance matrix is defined as:
(6.41)
where Ap A2 , ... , Am are the eigenvalues of ~. and ~,~, ... ,Pm are new canonical
variables. The following equations link vector P = (~ P.z ... Pm) with
U =(u1 U2 ••. UmJ:
P= sru, (6.42)
or
u=SP, (6.43)
(A.6.17)
In (A.6.17) it is taken into account that all axes in P-space are orthogonal.
Any point on Pm-m, +1 , J!._.., +2 , ... , Pm axes provides er~ =0, but the solution should
also satiscy the condition ji = r. As for ~ > 1 the number of solutions is infinitely great,
we can choose any of them. For example, we can Iook for a point which is on Pm axis
and satisfies the condition ji = r.
Substituting u from (6.43) in (A.6.8) we obtain
OPTIMIZATION PROCEDURES FOR ROBUST DESIGN OF PRODUCTS 343
(A.6.18)
where Pm is the last element of P which corresponds to A.m = 0, Rm is the last diagonal
e1ement of Rand Rm_ 1 is (m -1)x (m -1) matrix and r is (in -1) vector column. One
can easily verify that
- T T 1 -1 T
Y= r- y =PS -B SP=P RP=
. s 4
(A.6.19)
As noted above, the needed solution must lie on Pm axis and all other coordinates of P
must be zeros, i.e. P",_ 1 =0. Consequently, from (A.6.19) follows that
and
p = ±~T- Ys. (A.6.20)
m R
m
Substituting (6.43) into (A.6.2) and taking into account (6.28) we can compute the p-
coordinates ofthe optimal solution as follows:
Taking into account that all coordinates of the vector Popt except Pm are equal to zero,
we obtain:
1 -1 p (A.6.21)
POJ2 =ps ±-ß Sm m
. 2
7.1. Introduction
In chapters 5 and 6 we considered only the effects of the noise in product or process
parameters on the response characteristics. However, products and processes are often
subject to external noises which can appear both in production process and product' s
usage. Taguchi calls them outer noises.
Consider For example, the truck tyre production. An external noise during the
production process is the environmental temperature. After moulding the tyre is left to
cool down. During this time the molecules of the rubber mixture continue to tie together
and the performance characteristics of the rubber are changing. These changes depend on
the rate of cooling which is a function of the environmental temperature. This
temperature depends on the season and varies within a twenty-four hours period
During usage ofthe truck tyre external noise factors are:
• Pressure. Despite of the fact that the producer prescribes a nominal air pressure
the real pressure usually varies within some intervals.
• Environmental temperature. If the environmental temperature is too high, the
cooling conditions are worsening which Ieads to loss of performance.
• Road conditions. The performance of a tyre is different on dry, wet and icy
surface. The road conditions in the mountains are much harder than for a straight
asphalt-paved road.
Though external noises are often determined by the environmental conditions
they can also be of a different nature. External noises can be some properties of the raw
materials used in a production process.
In this chapter the influence of both external noise factors and errors in product
or process parameters on the performance characteristics is studied. The following
notations are used:
Y
p = (p 1 P 2 . . . p m is a vector of product and/or process parameters,
e--\:I
fe em Y is a vector of the errors in the parameters,
344
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 345
Engineers can reduce the external noise effect by choosing properly the product
parameters. This can be seen in Figure 7.1 where the contour plots of a performance
characteristic y are shown as a function of a product parameter p and an external noise
factor n.
p pmax
Pmin
Suppose that the external noise factor n varies within the interval nmin ~ n ~ nrmx,
while the variation interval of the product parameter is Pmin ~ p ~ Prmx . If the parameter
is chosentobe Pmin the value ofthe performance characteristic varies between 44 and 80
units and the range ofvariations is 36 units. For p = Prmx the variation ofy is between 39
and 56 units and the range is 17 units.
An opportunity to reduce the performance characteristic's variance exists due to
the interactions between the product parameters and the noise factors. In order to study
them engineers can collect information about the variation ofthe external noises and then
conduct an experiment varying simultaneously the product parameters and the external
noise factors. Suppose that during the experiment the product parameters and the
external noise factors are set without errors, so that the measured value of the
performance characteristic in the experiment is
In order to solve a quality improvement problern we can follow the same idea as
in Chapters 5 and 6 and can derive models of both mean and variance in mass
production. Using the optimality criteria of Section 6.2 one can find the optimal
346 CHAPTER 7
parameter values that make the product robust against noises. Very often, however, an
exploration of mean and variance surfaces is more valuable for the engineer or the
scientist, giving them an opportunity to combine the results obtained through models
with their own experience and with some practical requirements, that can not be
formalized by equations. Canonical analysis of both mean and variance surfaces and
numerical search procedures are very useful.
In this chapter we do not consider models of mean and variance derived as
regression equations directly from the data. Such approach is possible if there are
replicated runs in the design or ifit is a Taguchi's crossed array. We defer this discussion
for Chapter 10.
Suppose that during the experiment the factors are varied within the following intervals:
(7.3)
and
(7.4)
and
(7.7)
The variances of the coded noise factors can be computed on the basis of (7. 6) as
follows:
(7.8)
where er 2 (n;) is variance of the i-th noise factor expressed in the original measurement
scale.
Assurne additionally that n;
is a normally distributed noise. It is weil known that
in this case all observations fall within the interval ± 3cr i (n;) around the mean value n;0
with probability 0. 997. Hence, the length of the interval (7 .4) can be accepted equal to
wni = 3cri(n;). Putting this value in (7.8) we obtain that the following variance of the
coded extemal noise factors:
(7.9)
design matrix and N" in the noise matrix. In the case with compounded noise N" = 2 and
the total number of runs is 2 NP.
The problern with compounded noise is that it is difficult to decide which noise
combinations are the best and which are the worst. What is best and what is worst
depends on the interactions between the noise factors and the product parameters which
are not known before the experiments. The best noise combination for a given run of the
parameter design matrix therefore, could be the worst for another.
Another approach is to use the so-called combined arrays which are proposed by
Welch, Yu, Kang and Sacks (1990) for computer experiments and arealso used for real
experiments by Shoemaker, Tsui, and Wu (1991). In this case the product parameters
and the noise factors' Ievels are combined in a single array and the response is modelled
by regression equations. Using combined arrays one can reduce the number of
experimental runs.
Vuchkov and Boyadjieva (1992) also use combined arrays for experiments with
external noise factors. There are, however, two important differences between their
approach and those ofWelch, Yu, Kang and Sacks (1990):
• Only product parameters and external noise factors but not the errors in
product/process parameters are assigned to the colurnns of the combined arrays. This is
possible because we can take into account the transmitted error in a similar way as in
Chapter 5.
• Repeated observations are not necessary for building a model of variance in
mass production.
These differences decrease the number of experimental runs in comparison with
the combined arrays ofWelch et al. (1990).
TABLE 7 1 Comb.med array or two product parameters and twoextemal n01se actors
No. pl Pz nl nz y No. P! Pz nl nz y
1 -1 -1 -1 -1 Y! 13 -1 -1 1 1 YB
2 1 -1 -1 -1 Yz 14 1 -1 1 1 Y14
3 -1 1 -1 -1 Y3 15 -1 1 1 1 Y1s
4 1 1 -1 -1 Y4 16 1 1 1 1 Y16
5 -1 -1 1 -1 Ys 17 -1 0 0 0 Y!7
6 1 -1 1 -1 Y6 18 1 0 0 0 Y1s
7 -1 1 1 -1 Y1 19 0 -1 0 0 Y19
8 1 1 1 -1 Ys 20 0 1 0 0 Yzo
9 -1 -1 -1 1 y9 21 0 0 -1 0 Yz1
10 1 -1 -1 1 ylO 22 0 0 1 0 y22
11 -1 1 -1 1 Yn 23 0 0 0 -1 Yz3
12 1 1 -1 1 yl2 24 0 0 0 1 Yz4
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 349
The design of experiments can be chosen depending on the number of factor Ievels, the
method of data analysis, the expected model structure and other specific features of the
product or the process. The choice of the experimental design is discussed in Chapters 3
and 4. In this design m of the columns correspond to the product parameters P; and q
columns - to the noise factors n; . lt is important to remernher that during the experiment
P; and n; must be set on given Ievels without errors.
Table 7.1 showsanoptimal composite design for two product parameters (m =2)
and two external noise factors (q = 2).
Consider a product or process with performance characteristic 17. Assurne that in mass
production it depends on the following variables:
• Product or/and process parameters p = (p1 p 2 ... PmY.They can be kept
on given values when experiments are carried out but in mass production they are subject
to errors e = (e1 e 2 emY .
r.
...
• Externat noise factors n = (nl n2 ... nq In the design Stage they can be
varied within given intervals but during the mass production and product's usage they
are uncontrollable and their values are random.
• Random noise & .
Assurne that e, n and & are independent and normally distributed with zero
expectations and the following covariance matrices:
t
m-1 m m
(7.10)
(7.11)
where
and is m x q matrix with elements riJ' i = 1,2, ... ,m; j = 1,2, ... ,q.
9
In mass production errors ej occur in the parameters pj and during the
production process and/or after shipping there are random extemal noises nj. The
measured value of the performance characteristic is therefore, different from that
obtained in the experiment. It can be expressed as follows:
(7.12)
Taking expectation with respect to e and n one can find the following model of
the performance characteristic's mean value:
(7.13)
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 351
(7.14)
where HOT denotes high order terms and can be computed by the formula
m m-1 m q
(7.15)
The proofs offormulae (7.14) and (7.15) are given in Appendix A.7.1.
For normally distributed external noises and coded factors we take into account
(7.9) and rewrite (7.13), (7.14) and (7.15) in the form:
i=l
(7.13a)
(7.14a)
and
m m-1 m
m q
1 q-1 q 1
2Z:ai;
q
+- (
+L:L:ai~ )
+- ~~ r~d;. (7.15a)
81 ;~1 ;~1 j~1 9 L..J L..J
;~1 j~1
352 CHAPTER 7
There are many similarities between (7.13), (7.14) and (5.19), (5.22). It is not
surprising that (5.19) and (5.22) can be obtained as special cases of (7.13) and (7.14)
putting :En = 0. However, there are also important distinctions between (7.13), (7.14),
(7.15) and the models of Chapter 5. They are considered in Sections 7.3.3 and 7.4.
Before that in the next section we discuss the case of error free parameters in mass
production with performance characteristic's variation only due to extemal noises and
response errors.
As with cases with errors only in product parameters, if one merely replace a,ß,
.1'1, a', fJ and a!
in (7.14) and (7.15) by the corresponding estimates, a biased estimate of
variance in mass production is obtained. An unbiased estimate can be found as in
subsection 5.5.3 using the results of Section 5.4 with vectors fand B defined as follows:
Suppose that the production process is weil controlled, the components and the raw
materials are with low variation and the main engineer's concem are the extemal noises.
In this case errors in product parameters do not exist (e = 0).
The models of mean value and variance of product performance characteristic in
usage can be obtained by putting :E. = 0 in equations (7.13), (7.14) and (7.15) as
follows:
• model of the mean value
(7.16)
(7.17)
where
q q-1 q
HOT= 2L a~a:; +L L a~O:iO:j.
i=l i=l j=l
(7. 18)
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 353
There is some similarity between equations (7 .16), (7. 17), (7 .18) and ( 5. 19),
(5.22) and (5.12), respectively. However, there are important differences between them:
i) The bias term tr(A :En) in the mean value (7.16) depends only on the non-
linearity of the regression model with respect to the external noise factors. Neither the
non-linearity with respect to product parameters (represented by matrix i$'), nor the
interactions between the product parameters and external noise factors (matrix t;) cause
bias in the mean value. Moreover, if the external noise factors are not correlated with
each other and :E n is a diagonal matrix, then
(7.19)
Therefore, in this case the bias in the mean value does not depend on product
parameters and can not be eliminated by their choice.
ii) Equations (7.17) and (7.18) have similar form as (5.22) and (5.12). However,
the vectors and the matrices in these equations are different and this is a reason for
different properties. The following conclusions can be drawn from (7.17) and (7.18):
a) interactions between the product parameters and the external noises which are
taken into account by the elements of matrix fj. Choosing proper values of product
parameters p one can minirnize or even eliminate a;(p) .
b) slope ofthe response surface with respect to noise factors (via a). Ifthere are
no interactions between the product parameters and the external noises (9 = 0) a
increases only the constant term of the variance, because in this case (7.20) is
independent of p.
Note that the parameter dependent part of the variance (7.20) does not depend
on the quadratic effects of the noise factors. It is shown in Section 7.3 that this can
simplify the experimental design.
• The lower bound ofthe variance attainable through parameter design is
q q-I q
A comparison with (7.11) shows that this model does not contain interactions
between noise variables and quadratic terms in noise variables. In (7.11) they are taken
into account by the term nr ,4n. Not surprisingly Myers and Montgomery (1995) obtain
a model ofvariance which differs from (7.17) only by Iack ofhigh order terms (HOT).
As shown by (7.18) the high order terms (HOT) depend only on the interactions
between noise variables and on the second order terms with respect to the noise factors
in the regression model. These terms are not included in the model of the performance
characteristic postulated by Myers and Montgomery (1995) and this is the reason why
HOTare not in their variance model.
In contrast with Myers and Montgomery (1995) we prefer to make clear
distinction between errors in product parameters and external noise factors because of
the following reasons:
• A separate consideration of the nominal values of product parameters and their
errors as factors unnecessarily increases the number of factors and hence, the number of
runs in a response surface design. As shown in Chapter 5 experiments with errors in the
product parameters are not necessary to predict the variance that they transmit to the
response.
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 355
• Models of rnean value and variance in rnass production have different properties
with respect to errors in product parameters and extemal noise factors. As shown in
Chapter 6 when extemal noise factors do not exist and the variation is only due to errors
transmitted frorn product parameters to response then the stationary points of rnean and
variance surfaces coincide. In this case the optimization procedures can be sirnplified and
sorne interesting properties ofthe quality irnprovernent problern can be revealed. In cases
with extemal noise factors such coincidence between the stationary points of rnean and
variance surfaces does not exist and this rnakes the optimization procedures different.
This problern is considered in the next section.
"(p,n)= 5 +2p-1.5n
TJ
10
9
8
7
6
5
4
3
2
1
0
-1 0 1
p
Therefore, according to (7.I8) the high order terms are zero (HOT= 0) and formula
(7. I 7) reduces to
-1 0 1 p
Figure 7.3. Performance characteristic's variance for a model with linear effects
•Model with linear effects and interaction between the product parameter and the
external noise:
ry{p,n)= 5 + 2p -1.5n +3pn.
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 357
T]
10
9
~ -.!:~--~
fj A~-~-///
:r
2 1 ~~v
;1-----+~--r----------.~
Figure 7. 4. Performance characteristic's variation due to parameter x noise interaction
The values ofthe performance characteristic as function ofp for n == -1,n == 0 and
n = 1 are shown in Figure 7.4. The variation of 1J due to the external noise is greatest for
p == -1, where
1](-1, -1)-1](-1,1) == 7.5- (-1.5) == 9 .
(7.22)
For p == -1 we obtain cr 2 ( - 1) == 2.25 which coincides with the value given above.
The optimal value of the product parameter can be found by equating the first derivative
of cr 2 (p) to zero:
dcr 2 (p) 6
dp == 9 (-1.5+3p)==O,
or Popt == 0.5.
358 CHAPTER 7
-1 0 1 p
Figure 7.5. Variance minimisation for a model with parameter x noise interaction
T)
10 (n=-1)
9
8
6
7 ///
5
4
3
2
(n=1)
1
0
-1 0 1 p
=(-1.5+3py x.!.+0.055555.
9
The only difference between this equation and (7.22} is coming from the term
2a: a:
1 = 0.055555. This is the part of the variance which can not be removed by
parameter design.
The optimal parameter value is p = 0. 5 as for the case of a model with linear and
interaction effects.
2\
-1 0 1 p
Figure 7. 7. Performance characteristic's variance for a second order model
An experiment with 4 factors is carried out. The factor Ievels in the experiment
are shown in Table 7.2. They are set without errors. A ~ 8 orthogonal array is shown in
Table 7.3 together with the observed (y) and predicted (j/) response, and the variance
due to the temperature variations (s 2 ).
U sing the best subset regression program of MINITAB following model was
obtained:
The models of mean value and variance of the performance characteristic in mass
production can be computed by (7.16) and (7.17). Asthereis only one noise factor :E. is
scalar, i.e. :E. = 1I 9. Quadratic effect of the extemal noise does not exist in the model
(a •• = 0) and Ais a scalar equal to zero.
The standard deviations of the errors in factors p 1 and p 2 are depending on their
nominal values and can be computed by formula (5.32):
(p ') sp;
(jl I = 3QQ X 50
This standard deviation can be expressed also through the coded value of the factor,
p;
taking into account that P; = (p;- 0 )! m;, i= 1,2. Therefore,
and
In a similar way one can find the standard deviation ofthe second factor as follows:
The regression coefficients estimates, substituted in the mean and variance models (7.13)
and (7.14a) are
l
b0 = 81.05, b = (3.14 5.44 7.25Y ,a = a = 2.08,
0 0 0
A=O,B= [ o o 1.48 ,f=(O -0.57 -1.24Y.
0 1.48 0
One can see that tr(A:Eo} = tr(B:E.) = 0 and according to (7.13) the predicted
values in the experiment and in mass production coincide. The mean value of the
performance characteristic in mass production can be computed as follows:
362 CHAPTER 7
because for the coded noise E(n) = 0. For exarnple, the predicted degree of extraction in
point No.2 of the design with Coordinates p = (-1 0 oY is
y(p) = 81.05-3.141 = 77.91.
Formulae (7.14a) and (7.15a) can be used to compute the variance in mass
production. This model is of fourth order with respect to p 1 and p 2 because ~ and o;
depend on p:
and Pi,
correspondingly. Compute for exarnple, the variance for point
p = (-1 0 of. The error variances at this point are:
cr12 =[0.01666(-1 + s)Y = 0.00444
and
er;= [0.01666(0+4.6666}Y = 0.00605.
Substituting the regression coefficient estimates in (7.14a) we obtain
I
s (p)= (b+ 2Bp )r ~.(b+ 2Bp )+-g(a+fr
2 pf (a+ rr p) +HOT +s;.
~
(b+2Bp)r ~.(b+2Bp)=
l[;~J+{~ ~~ ~H~~Jl
=(3.14Y x0.00444 +(5.44Y x0.00605=0.2230,
and
~ 1 X (-0.57 )2
HOT=- X 0.00605 = 0.00022.
9
The variances at the design points are given in the last column ofTable 7.3.
As the target is to obtain maximal value of the response, the optimal parameter
values are found through grid search under the condition that ji- 3a> 95%. Three of
the best parameter combinations satisfying this condition are shown in Table 7.4.
If the product is subject to effects of both errors in the parameters and external noises
then the response variation depends on parameters in a more complicated way. Recall
equation (7.13):
(7.23)
depends on the nonlinearities both with respect to errors in product parameters e and
external noise factors n. In the widely spread case ofuncorrelated noises it is equal to
m q
Yc = Lß;;d; + L aiicl.i ·
j::;} i=l
364 CHAPTER 7
Provided that a:
and a;j
are constant over the region of interest, the bias does not
depend on product parameters and can not be removed by their choice.
Under the same assumption of constant noise variances, the variance of the
performance characteristic in use (7.14) can be expressedas a sum ofthree terms:
(7.24)
where
cr;(p)=(ß+2g'pf ~.(ß+2g'p) (7.25)
is the variance of the error transmitted from the product parameters to the response,
(7.26)
m m-1 m q
(7.27)
is a constant part ofthe performance characteristic's variance. Only cr;(p) and cr!,(p)
can be reduced or eliminated by parameter design because er; does not depend on
product parameters.
All terms in (7.24) are non-negative. Therefore, the joint action of the errors in
product parameters and the extemal noises always results in an increase of the total
variation. However, the effect ofnon-linearity ofthe performance characteristic, which is
essential for the variation transmitted from product parameters to the response, is not so
clearly expressed in the case und er consideration. The values of the product parameters p
which minimize (7.25) and (7.26), usually do not coincide with each other and the
minimum of the performance characteristic's variance er 2 (p) is a compromise between
the minima of er,; (p) and er!, (p). That is why the optimal parameter value does not
correspond to the extremum of the performance characteristic with respect to p unlike
the case without extemal noises. This is illustrated by Example 7.3. The effect of the
extemal noises on the performance characteristic's variance can be reduced if they
interact with product parameters. Nonlinearities with respect to the extemal noises ( a;;n;2
and a;1n;n) have no effect on the parameter dependent part of the variance
er; (p) + er!, (p). They only increase the constant part er; .
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 365
Suppose that the product parameter is subject to error with variance d. = 0.5,
while the variance of the external noise is c?. = I I 9 . The components of the performance
characteristic's variance in usage are
• variance ofthe transmitted error (7.25):
2 2 I 2 I
=2x1x0.5 +2xl.5 x-+3 x0.5x-=1.05555.
81 9
The minima of a ,; (p) and 17(p) correspond to the same parameter value p = -1.
The part of variance due to external noise a!, (p) is minimal for p = 0. 5. The minimal
performance characteristic's variance is obtained for a compromise parameter value equal
to Popt = -0.5. It can be computed by equating to zero the firstderivative of a 2 {p)as
follows:
6
= 4 X 0.5(2+ 2p)+-(-J.5 + 3p)= 0.
9
366 CHAPTER 7
02
aJ.
10
9
8
7
6
5
~
4
3
2
1
0
0 p
-1 1
(a)
11
10
/
9
8
7
6
5
3
2
1
0
-1 0 1 p
(b)
Figure 7.8. Performance characteristic and its variance for mode1 (7.28)
•
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 367
7.4.1. PROBLEMFORMULATION
Optimal values of the product parameters can be found on the basis of models (7 .13) and
(7.14) using the optimality criteria defined in Section 6.2. Consider first the conditional
minimization of variance , while keeping the mean value on a target.
Let us see whether the analytical solutions of Chapter 6 can be used for
optimization in cases with both errors in product parameters and external noise factors.
Assurne that the variances of errors in factors are constant over the region of interest.
Denote b0 ,b,B,a,r the estimates of ß0 ,ß, 'E,a and tJ, correspondingly. Putting these
estimates in the models of the mean value and variance we obtain
(7.29)
and
(7.30)
ctY(p) = b + 2Bp.
dp
Putting it equal to zero we obtain the stationary point ofthe mean values surface:
I -I
p 11)1 =--B
2 b. (7.31)
The stationary point of the variance surface can be found from the equation
(7.32)
368 CHAPTER 7
The stationary points of the mean value and variance surfaces do not coincide. As
the analytical solutions of Chapter 6 are based on their coincidence we see that these
results are inapplicable to this case. This conclusion is also true for the case when the
product is subject only to external noise factors and not to errors in product parameters.
In this case the stationary point of the mean value surface is given by (7. 31 ), while the
stationary point ofvariance surface can be obtained by (7.32) putting ~. = 0 as follows:
Pss =- ( f~"r
T )-I f~"ll. (7.33)
This is always a non-negative definite matrix because such are ~. and ~". Therefore, in
this case the contours ofvariance are ellipses (ellipsoids) or stationary ridges.
Figure 7. 9. Optimization of a two parameter product in the case when a specific target value is best
minimal at the stationary point p,,. However, the solution must also satisfY the condition
y = T, where T is the target value for the performance characteristic. That is why the
y
solution is a tangent point of the contour corresponding to = T and the contours of
variance. There are two such tangent points: p 1 and Popt. The optimum is at Popt which is
nearer to p •• and corresponds to smaller variance than that for p 1 .
Note t~at the stationary points of mean and variance surfaces coincide if there are
not interactions between product parameters and noises (r = 0 ). In this case the
analytical solutions of Chapter 6 are valid.
Introduction
A solution of the optimization problern defined in Section 7.4.1 can be found using
Lagrange multipliers. lt is based on the ridge analysis of response surfaces, developed by
Draper (1963) and Myers and Carter (1973). Vining and Myers (1990) and Vuchkov and
Boyadjieva ( 1990a, 1992) employed these ideas for development of optimization
procedures for quality improvement. This approach is called by Vining and Myers dual
response optimization and is briefly discussed in Section 6.5.
In the next subsections we give the algorithms developed by Vuchkov and
Boyadjieva (1990,1992). For the sake of convenience we rewrite models (7.29) and
(7.30) in the following form:
(7.34)
and
(7.35)
Substitution of d1 , () and D into (7.34) and (7.35) can verify these equations.
The solution is a minimum if(7.37) isapositive definite matrix, i.e. -pB + D > 0 or
and Amax the minimaland the maximal eigenvalue ofthe matrix n- 1Band by T-a matrix
of its eigenvectors.
As shown in Appendix A.7.2 the algorithm is as follows:
I. Compute Amin and Amax and the eigenvectors tj> i = 1, 2, ... ,m of n-IB.
2. Choose one of the inequalities:
(7.39)
(7.40)
(7.41)
(7.42)
find numerically or graphically values of f.J and p opt = (p opti p opt2 .. . p optm y which
satisfy one of the inequalities (7.39) or (7.40) or (7.41) and minimize the variance '§ 2
under the condition = 1:. y
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 371
• Put s 2 (p)=so~,(p) in the plots of p1 (s 2 (p)Jp 2 (s 2 (p)J ... ,pJs 2 (p)] to find
Putting the coefficients ofthis model into (7.34) and (7.35) predicted mean value
y and standard deviation s are computed provided that the moments of errors in
product parameters and external noises are as follows:
The results ofthese computations are given in colurnns 5 and 6 ofTable 7.6. Columns 7
and 8 show the mean value y and the standard deviation s obtained by 100 simulations
of the performance characteristic for each design point. The coincidence between y and
y as weil as between s and s is satisfactory.
372 CHAPTER 7
Assurne that the target value is -r= 4. Using the estimates of the regression
coefficients we have found the following matrices:
[ h"
B= bi 2 12
h" I
b22
2 bh" I122J= [-0 97
0.56
0.56
2.15
263]
23 1.47 '
bi 3 I 2 b23 I 2 b33 2.63 1.47 3.16
The minimal and the maximal eigenvalues of n-IB are A,mm = -7.1847 and
A,max = I. 0380. According to (7.41) the Lagrange multiplier f.J should be within the
interval -0.1392 < f.J < 0. 9633. Giving several values of f.J within this interval and using
(7.42), (7.34) and (7.35) we obtain the plots shown in Figure 7.10.
One can see that for y= T = 4 the variance is so~t = 1.2051 and the optimal
parameter values are Popti = -0.6482, Popt 2 = 0.5639 and PoptJ = -0.2595.
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 373
1 2 3 4 5 6 7 8
1 0 -1 1 7.29 3.09 7.21 3.10
2 1 1 -1 -3.11 2.07 -3.06 2.06
3 0 1 1 14.83 5.18 14.49 5.15
4 0 -1 -1 7.48 3.75 7.61 3.44
5 0 1 -1 3.27 1.45 3.36 1.71
6 -1 0 1 3.96 2.27 4.05 2.37
7 1 0 1 11.92 5.58 12.43 5.80
8 1 -1 0 0.87 2.95 0.97 2.67
9 1 0 1 11.92 5.58 11.81 5.70
10 -1 0 1 3.96 2.27 4.16 2.32
11 -1 1 0 5.08 1.69 5.13 1.86
12 1 1 0 4.77 4.26 4.67 4.35
13 -1 -1 0 5.65 2.65 5.70 2.74
14 1 1 0 4.77 4.26 4.79 4.29
15 -1 0 0 3.21 1.29 3.17 1.41
16 -1 1 0 5.08 1.69 5.17 1.92
17 -1 1 1 8.77 3.70 8.71 3.49
18 1 -1 1 9.19 4.74 8.97 4.76
19 1 1 1 18.96 6.82 19.26 6.79
20 -1 -1 1 3.46 1.79 3.42 1.79
21 -1 1 -1 7.72 2.55 7.67 2.66
22 1 -1 1 9.19 4.74 9.10 4.85
23 -1 -1 -1 14.16 5.03 14.37 4.87
24 1 1 1 18.96 6.82 19.00 6.80
25 -1 -1 1 3.46 1.79 3.53 1.92
26 1 -1 -1 -1.13 3.07 -1.06 2.35
27 0 0 1 8.91 3.86 8.81 3.75
28 0 -1 0 4.23 2.14 4.28 2.01
29 0 1 -1 3.27 1.45 3.25 1.51
.Q.B .Q.7 .Q.6 0.1 0.2 0.3 0.4 0.5 0.6 .Q.4 .Q.3 .Q.2 .Q.1 0.0 0.1 3 4
~z ~
Figure 7.1 0. Variance s versus parameters P1.P2 , p 3 and Y
•
374 CHAPTER 7
L P;2 = R2.
m
PT p = (7.43)
i=l
Another condition is
y=r. (7.44)
A minimum ofvariance under the conditions (7.43) and (7.44) can be obtained by
minimizing the following function:
2. Compute the minimal eigenvalues A.jmin of the matrices D- pjB and the
corresponding matrices of eigenvectors T1 .
3. Choose a set of q values of o for each f.J j, For example, oj 1 , oj 2 , ... , ojq .
4. Compute
(7.45)
5. Put p ß into (7.34) and (7.35) and compute y jl and j = 1,2, ... ,r;
I= 1,2, ... ,q.
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 375
AND EXTERNAL NOISE FACTORS
For detailed description of this method and for an example see Vuchkov and
Boyadjieva (1992).
ith
TABLE 7.7. Factor evelsfor e electrode XlSJt of copperQJate expenment
Levels
Factors Dimension Code -I 0 1
Thickness of the board mm PI 1.5 - 2.0
Diameter of the holes mm Pz 0.35 0.6 0.9
Current density A/dm 2 p3 I 2 3
Concentration of CuS04 gll p4 60 80 100
Concentration of H 2 S04 gll Ps 160 180 200
Temperature of the solution oc n 18 23 28
376 CHAPTER 7
An ~ 8 orthogonal array is chosen for the experiment. The diameters of the holes (p2 ) are
0.35 mm, 0.6 mm and 0.9 mm. The coded value of p 2 which corresponds to 0.6 mm is
The multiple correlation coefficient for this model is R = 0. 9973 . The corresponding F-
ratio is equal to 76.55, while the critical point ofF-distribution for significance Ievel 0.05
and degrees of freedom 12 and 5 is 4.6777. Therefore, the multiple correlation
coefficient is significant. The residual variance is s~ = 1.4362. It is considered as an
estimate of the output noise variance ~.
In this example only one external noise factor exists and there are no errors in the
product parameters. In this case the models of mean value and variance in mass
production are (7.16), (7.17) and (7.18). Substituting estimates for model coefficients
one can rewrite them as follows:
y(p) = b0 + bT p + pr Bp +tr(AL 0 ), (7.16a)
(7.17b)
and
q q-l q
0 0 -0.925 0 -2.144
0 0 6.709 7.367 0
B=_!_ -0.925 6.709 0 -2.579 4.523
2
0 7.367 -2.579 0 -9.333
-2.144 0 4.523 -9.333 0
of CuS04 (p4 ) because according to (7.46) only these parameters interact with the
temperature ofthe solution (n).
Equations (7 .46) and (7 .4 7) can be used to minimize the variance of the coat
thickness under the condition that its mean value is within the interval (30 fJ11Z,50 fJ11Z) for
given solution temperature n, board thickness Pi and diameter of the holes p 2 . Consider
the case with solution temperature equal to the nominal (23 °C) which corresponds to
n = 0. The optimal process and product parameters can be found by grid search.
Assuming that n has normal distribution one can expect that with high probability the
values of the coat thickness to vary within the interval J± 3s . Hence, the parameter
values should be chosen to satisfy the inequality 30 Jlm :::; y ± 3s :::; 50 Jlm .
For the 2 mm thick board the value of Pi is fixed equal to 1. For Pi = the
minimal value of s} is s.~in = 0.080945 and is obtained with p4 = ' while the
maximum is s?max = 5.8199 is calculated by (7.47) and corresponds to p 4 = -1.
Therefore, for this board the values of Pi and p 4 are fixed to be equal to I and the
parameter p 2 takes only three values: p 2 = -1, p 2 = -0.091, and p 2 = 1, which
correspond to diameters of the holes 0.35 mm, 0.6 mm and 0.9 mm, and a search
J
procedure is used to fmd values of p 3 and p 5 for which 30fJ11l:::; ± 3s:::; 50J1m. These
computations show that the concentration of H2S04 should be fixed at its lower Ievel
J
160 g/1 (or p 5 = - ). The inequality 30 Jlm :::; ± 3s :::; 50 Jlm is satisfied for values of p 3
within the interval-(-0.025, 0.25) for all diameters of the hole. Table 7.9 shows the
results of the computations for three values of p 3 • For these three cases s? = 0.08
(mmY and s 2 = 1.52 (mmY.
TABLE 7..
9 Parameter values chosen for 2 mm thick board
Coded values Natural values
Pi P2 P3 P4 P5 p{ p~ Pi p~ p~ y ~ +3s,y- A
{-~~9
r9.87,47.25)
I - I -I 2 r5
0.6 1.75 100 160 r56
39.79 (36.10,43.48)
0.25 0.9 35.25 (31.56,38.94)
I
{ -i
-0;09 0 I -1 2 r5
0.6
0.9
2 100 160 r42
38.i7
35.47
r6.73.44.11)
(34.48,41.86)
(31.78,39.16)
1 {" -i
-0;09
0.25 I -I 2
r5
0.6
2.25 100 160 r29
36.56
35.69
t3 60,40.98)
(32.87,40.25)
0.9 (32.00,39.38)
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 379
AND EXTERNAL NOISE FACTORS
The mean and variance contours for the coat thickness for 2 mm board, diameter
of the holes 0. 9 mm and concentration of CuSO 4 I60 gll are shown in Figure 7 .II and
Figure 7.I2, respectively.
Popt
0.
0.
r
0
p4 0
-0.
-0.
-0.
-0.
-1
-1 -0.5 0
... 0.5
p3
Figure 7.11. Contours for mean coating thickness, 2 mm board, diameter of the holes 0. 9 mm and
concentration of CuS04 160 gll
For 1. 5 mm thick board (p1 = -I) is impossible to find a point for which the
Y
variance attains its minimal value s2 = I. 52 (mm and the condition the thickness to be
between 30 f.D11 and 50 f.D11 is fulfilled. The optimal process parameter values are: current
density - 3 A I dm 2 (p3 = I), concentration of CuSO 4 - 80 g/1 (p4 =0) and concentration
of H2 S04 - 190 gll (p5 = 0.5). For these parameter values the variance of the
performance characteristic in mass production is s2 = 3. 24. The mean values of the coat
thickness corresponding to different diameters ofthe holes are given in Table 7.IO.
380 CHAPTER 7
1
1.52
0. !
0. f 2
1
0. <
0.
3
p4 0
-0. 4
-0.
5
-0. E
6
-0. !
7
-1
-1 -0.5 0 0.5
~
p3
Figure 7.12. Variance contours for coating thickness, 2 mm board, diameter ofthe holes 0.9 mm and
concentration of CuS04 160 g/l
•
7.5. Bibliography
Most authors do not make difference between errors in product parameters and extemal
noise factors when they design experiments for quality improvement. Taguchi (1986,
1987) clearly notes that these two types offactors exist, but in his crossed arrays they are
included in the same way. In the combined arrays proposed by Welch, Yu, Kang and
Sacks (1990) and used by Shoemaker, Tsui and Wu (1991) the authors do not make
difference between these two types of factors. In the approach used in this book the
errors in product parameters are not considered as separate noise factors in the combined
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 381
AND EXTERNAL NOISE FACTORS
array, because the variation due to the transmitted error can be predicted without
experiments with these errors, provided that the moments of their distributions are
known. As noted in Section 7.2 this allows to reduce the number of experimental runs.
Vuchkov and Boyadjieva (1990,1992) propose models of mean value and
variance with both errors in product parameters and external noise factors. Myers, Khuri
and Vining ( 1992) propose models of variance in which the nominal values of product
parameters are considered as non-random factors, while the errors in product parameters
and external noises are noise factors. Box and Jones (1992) consider models for cases
when only environmental factors exist. Chapter 10 of the book by Myers and
Montgomery (1995) is devoted to robust product and process design, including model
based approach and the related problems of the design of experiments and optimization
Vuchkov and Boyadjieva (1995) consider algorithms for multicriterion optimization in
the problems of robust product and process design based on models derived in this
chapter.
Several papers are devoted to the so-called dual response approach, based on the
work ofVining and Myers (1990). We defer the discussion on these papers for Chapter
10.
Appendix A.7.l. Development of models for mean value and variance with both
errors in product parameters and external noise factors
Consider a second order polynomial model of the type (7 .1 0), which can be written in
matrixform as follows:
(7.11)
The notations used in this equation are the same as in Section 7. 3. 1. If the performance
characteristic is subject to both errors in product parameters and external noise factors
its measured value is given by the equation
Removing the brackets one can rewrite equation (7.12) in the form:
(7.14)
where
m m-1 m q
(7.15)
Proof The variance of y(p +e,n) can be computed from (A.7.1) as follows:
+ 2[cov(17., 11. )+ cov(17., 11•• )+ cov(17., 11•• )]+ 2cov[(17. + 11. + 11•• 1e]. (A.7.3)
The terms in the right hand side of(A.7.3) can be computed as follows:
m m-1 m
(A.7.5)
(A.7.6)
q q-1 q
(A.7.8)
because the third order terms of normally distributed vector n are equal to zero.
(A.7.10)
(A.7.11)
m q
= trAL Ar
'? n '?'
L e = "\' ""r2a2a2
L..,.; L..,.; lJ z nJ ·
(A.7.13)
i~1 J~1
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 385
AND EXTERNAL NOISE F ACTORS
Taking into account that n and e are independent one can rewrite this equation as
follows:
(A.7.15)
(A.7.16)
2.8. cov[(ll. + lln + 11.J, &] = cov(ry., &)+ cov(ryn, &)+ cov(ll.n, &) =0, (A.7 18)
In a sirni1ar way one can see that cov{ryn, & ) = cov{ll.n, & ) = 0.
Putting (A. 7.4), and (A. 7.12) - (A. 7.18) into (A. 7.3) we obtain (7.14).
•
Appendix A. 7.2. Derivation of algorithm for unconstrained optimization by
Lagrange multipliers
Consequently,
386 CHAPTER 7
d2rp
- - = 2(- ,uB + D). (7.37)
dpdpT
The solution is a minimum if (7.37) isapositive definite matrix, i.e. -.uB + D > 0
or
,uB-D<O. (7.38)
(A.7.19)
and
(A.7.20)
p=T~ (A.7.21)
Taking into account (A. 7.19) and (A. 7.20) one can rewrite this equation as follows:
and
(A.7.22)
p = -T(uA- It TT {,ub -B)/2 =-~ t t;t; (u;.,i -It {,ub- B), (A.7.23)
where t; are the eigenvectors of D- 1ß.
ROBUSTNESS AGAINST BOTH ERRORS IN PRODUCT PARAMETERS 387
AND EXTERNAL NOISE FAC TORS
Denote by ltmin and ltmax the minimaland maximal eigenvalues of D- 1B. We obtain from
(A.2.24) the following relationships:
JL> Xmin
1 if ;t mm. <lt max <0 · (A.7.25)
(A.7.26)
and
(A. 7.27)
s
Consequently, the indefinite multiplier f.1 must be chosen to minimize 2 under the
conditions (A.7.25), (A.7.26), (A.7.27) and y = r. This explains the algorithm ofSection
7.4.2.
•
CHAPTER8
8.1. Introduction
Product or process models, based on specialized knowledge in the field of interest can be
used in computer-aided robust product design. This can be done before making a
prototype of the product. This way the ambiguity about the initial choice of the
parameter settings can be avoided and the product development stage can be shortened.
It also gives better understanding of the product behaviour under raw material or
component variations, manufacturing imperfections and environmental noises. In this
chapter, except Section 8.4, the factors arenot coded.
Consider For example, a very simple circuit (Figure 8.1).
Uz _ B(D+C)
U:- A(B+C +D)+B(D+C)"
(8.1)
The output voltage U 2 can be computed exactly if the input voltage U 1 and the
resistances A, B, C, D are exactly equal to certain nominal values. However, in real
production resistor values usually vary within some tolerance intervals. As a result
different circuits of the type shown in Figure 8.1 give different values of the output
388
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 389
valtage U2 for a given inputvaltage U1, in spite ofthe fact that the nominal values of A,
B, C, D are the same for all circuits.
In practice models can describe much more complex products. Noises can also be
of different nature. For example, if a circuit comprises transistors it may be sensitive to
environmental temperature.
Robust parameter design can be carried out by experiments on a prototype as this
was shown in Chapters 4, 5, 6 and 7. However, the existence of an analytical model
makes things much easier. Computer experiments are much easier, safer, eheaper and
more convenient than real ones. They can be conducted even if the real product does not
exist. This is especially important for complex products or processes, such as complex
electronic circuits, sophisticated machines, cars, chemical processes and products, etc.
Several specific features of the computer experiments make them different from
the physical ones. The most important ofthem are:
• Analytical models are often complicated and the number of factors can be
higher than the usual for physical experiments.
• There is no random error in the response. Repeated observations give the same
result.
• An analytical model may not be an adequate approximation to the real process
or product.
The last problern is specific for each case and is beyond our scope. Further on we
assume that the analytical model is a good approximation to the reality.
If the mean values and variances of the noises in mass production are known, a
model based robust product design is possible. There are several alternative approaches
to quality improvement through mechanistic models:
• Use Monte Carlo computer Simulations of the performance characteristic in
presence of noises and compute its mean value and variance.
• Use Taguchi method as it was described in Chapter 4, but with simulated data
instead of data obtained from physical experiments. ·
• Use response surface methodology (Chapters 5, 6 and 7) with simulated data
instead of physical experiments.
• Use analytical models of the mean value and variance of the performance
characteristic in combination with optimization procedures.
In this chapter we focus our attentiop on the last two approaches.
390 CHAPTER8
First consider the case when the product or the process is subject only to errors in its
parameters. Assurne that a deterrninistic rnodel of the perforrnance characteristic
17 = 17(p, ß) is known. It rnay be nonlinear in parameters ß =(ß 1 ß 2 ßk Y.
The value ofthe perforrnance characteristic in rnass production is
(8.3)
(8.4)
where J.l; 3 and J.l; 4 are the third and the fourth order rnornents of the error distribution,
and
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 391
m m m-1 m
(8.5)
where H is m x m matrix with diagonal elements H;; and offdiagonal elements H;1 . The
formulae for variance and mean value in mass production for second order polynomials
(5.10), (5.11) and (5.19) can be considered as special cases of(8.4), (8.5) and (8.6),
correspondingly. To show this it is enough to put all third orderderivatives equal to zero
(rY>
n
= r<i>
JJ
= r<i>
IJ
= o) and to notice that in this case
Ai = ßi + 2ßiipi + L/3;,P,
i=l.jt:.i
(8.7)
and
H=B. (8.8)
around a given point defined by the search procedure. This region depends on the ranges
of parameter's random variations which are usually not !arge. For example, if the errors
in parameters are normally distributed one needs an accurate model within the interval
±3 ai around the nominal value of the parameter pi . Therefore, at a given step first or
second order Taylor expansion in which all derivatives are computed for the current
values ofproduct parameters is satisfactory. This simplifies the computational procedure.
Another interesting feature of the approach based on analytical models is that it
does not use simulations of the noises. The moments of the errors if; ,Jii3 ,Jii4 are
estimated on the basis of observations from the real production process or from tolerance
intervals as this is shown in Section 5.3. Putting them in equations (8.4), (8.5) and (8.6)
one can obtain the mean value and variance of the performance characteristic in mass
production without any simulations. This feature of the procedure is useful in several
respects:
• lt can save time when the models are !arge and complex.
• The simulation errors due to use of limited sample size or low quality of
random number generators are avoided.
• Experimental designs, regression analysis, ANOVA., Daniel plots and other
statistical procedures arenot necessary. This saves time and efforts .
.• The values of derivatives can be used for analyzing product performance
characteristics in a given point of factor space.
The listings, containing different variants, contour plots, graphs of the effects of
factors and interactions, given in Chapters 3, 5 and 6 are applicable in analytical models
as weil.
Tolerance design can be carried out in the same way as in Section 6.6 with small
changes of the formulae for computing factor contributions. For example, if the original
analytical model is approximated by second order Taylor expansion, one can use for
tolerance reduction formula (6.54) with contributions computed as follows:
(8.9)
vA',,B
(~:;J>
Ce F E
The problern is how to set the nominal values of the bridge parameters so that the
unknown resistance y to be measured accurately. The following notations are used: A, B,
C, D and F are resistances, E is power supply valtage and X is the reading of the
galvanometer.
An analytical model ofthe bridge is:
Ifthe resistor Bis adjusted so that X= 0, the value ofthe unknown resistor is
BD
y=-.
c
For different bridges the deviations of A, B, C, D, E, Fand X from their nominal
values cause errors in the value of the unknown resistance y. That is why Taguchi
considers the deviations in the bridge parameters A. B. C, D, E, F and X as noise factors.
The nominal values of the resistances A, C, D, F and the valtage E must be chosen to
minimize the error variance. The value of the resistor B is not subject to optimization,
because it isavariable and is tuned to obtain X= 0. Taguchi uses orthogonal arrays L36
for both parameter and noise designs and computes the performance characteristic by the
analytical model with simulated noises. As each of the arrays has 36 rows, the total
number of runs is N = 36 x 36 = 1296. Further on Taguchi uses his method and obtains
optimal values ofthe bridge parameters.
Consider the solution of this problern by the model-based approach described
above. Denote p 1 , p 2 , p 3 , p 5 the values of the resistances A, C, D, F, correspondingly
and Iet p 4 be the voltage. Their intervals of variations during the simulations are chosen
by Taguchi and shown in Table 8.1. The true values for the parameters A, C. D, F vary
within intervals ±0.3% oftheir nominal values and for the parameter E within ±5% of
its nominal value. The error in the galvanometer reading X varies within an interval of
±0.2.10-3 A ofits nominal value,
394 CHAPTER8
TABLE 8..
I Factor Ievesm
I . Wheatstone bn"dIge examp1e
Factor Ievels
Factor Dimension Pimin = 1 P;o = 2 Pimax =3
A (pl) n 20 100 500
C(p2) n 2 10 50
D(p) n 2 10 50
E(p4) V 1.2 6 30
F(Ps) n 2 10 50
In order to obtain comparable with the Taguchi method results we compute the
values ofthe standard deviations in the same way as it wasdonein Taguchi's book:
where K;is the tolerance half interval given in percents, while P; are the values offactors
A, B, C, D, F. The values of o; change tagether with P; and this should be taken into
account in the computations by formulae (8.5) and (8.6).
As the error in the galvanometer reading X does not depend on the nominal value
its standard deviation can be computed as follows:
The variance obtained by the model-based approach is slightly smaller than for
the Taguchi method but it is 114 times smaller then for the initial nominal parameter
values. The 3 a half-interval of variation of the performance characteristic y is decreased
approximately I 0 times. This is achieved only by choosing proper parameter values.
The signal-to-noise ratio obtained as a result of optimization is equal to 47.23.
Further increase oftbis ratio can be achieved through tolerance design, i.e. by tightening
the tolerance intervals of the bridge parameters. The parameters' contributions to the
total variation are computed by formulae (8. 9). The contributions of C and D are almost
equal to each other (about 30%), while the contribution ofthe galvanometer Xis about
10%. The rest of the parameters practically do not affect the variance. lt is decided to
tighten the tolerance intervals of C and D ten times, and of the galvanometer X - 5 times.
With the new tolerances the standard deviation of y decreased from 0. 008699 to
0.0009311, i.e. more than 9 times and the signal-to-noise ratio is increased to 66.64. The
half-interval 3a is now equal to 0.0027933. For the parameters obtained by Taguchi
method with the same tolerance intervals the value of the standard deviation is
0.0010193, while signal-to noise ratio is 65.85.
To allow comparison the values of the signal-to noise ratio are computed by
Taguchi method and by the model based approach in the points of a ~ 6 orthogonal array
shown in Table 8.3. These computations arenot normally needed for the algorithm. The
bridge is adjusted so that X = 0 and in this case y = 2Q. The derivatives for the model-
based approach are computed analytically and numerically. The results do not differ too
much. The bias due to the term tr(HE.) in the model of mean value is equal to about
10-sn at all points ofthe ~ 6 array. In the last column ofTable 8.3 are given the signal-
to-noise ratios computed by simulations based on a crossed array containing 1296 runs
according to Taguchi method.
From Table 8.3 one can see that the signal-to-noise ratio obtained through
model-based approach with analytical derivatives almost always coincide or is very close
to the corresponding values obtained by Taguchi method. However, the results for model
based approach were obtained without simulation of noises, while the results for Taguchi
method are based on 1296 runs ofthe cross product design. The values ofthe signal-to-
396 CHAPTER8
noise ratio computed with numerical derivatives differ slightly from the values obtained
with analytical derivatives which is a result of inaccuracies in the numerical
computations. However, for practical use these results are quite satisfactory.
TABLE 8.3. Standarddeviationsand signal to noise ratios for the points of L 36 array
Analytical derivatives Numerical derivatives Taguchi
method
Standard Signal to Standard Signal Signal
deviation noise deviation to noise to noise
No. A c D E F ratio ratio ratio
a .; a .; .;
I I I I I I 0.04869 32.3 0.04818 32.4 32.2
2 2 2 2 2 2 0.09278 26.7 0.09252 26.7 26.7
3 3 3 3 3 3 0.32106 15.9 0.32098 15.9 15.9
4 I I I 2 2 0.02999 36.5 0.02918 36.7 36.4
5 2 2 2 3 3 0.07440 28.6 0.07408 28.6 28.6
6 3 3 3 I I 0.87118 7.2 0.87090 7.2 7.2
7 I 2 3 I 2 0.29997 16.5 0.29980 16.5 16.5
8 2 3 I 2 3 0.44724 13.0 0.44716 13.0 13.0
9 3 I 2 3 I 0.07922 28.0 0.07891 28.1 28.0
10 I 3 2 I 3 0.35312 15.1 0.35296 15.1 15.0
II 2 I 3 2 I 0.30064 16.5 0.30054 16.5 16.4
12 3 2 I 3 2 0.10564 25.5 0.10541 25.6 25.5
13 I 3 I 3 2 0.01291 43.8 0.01089 45.3 43.8
14 2 I 2 I 3 5.20123 -8.3 5.19970 -8.3 -8.3
15 3 2 3 2 1 0.37254 14.6 0.37244 14.6 14.6
16 1 3 2 1 1 0.07111 29.0 0.07076 29.0 29.0
17 2 I 3 2 2 0.90159 6.9 0.90149 6.9 6.9
18 3 2 I 3 3 0.36846 14.7 0.36856 14.7 14.7
19 1 1 3 3 3 0.16874 21.5 0.16859 21.5 21.5
20 2 2 1 1 1 0.27035 17.4 0.27018 17.4 17.4
21 3 3 2 2 2 0.40018 14.0 0.40009 14.0 14.0
22 1 2 3 3 I 0.00947 46.5 0.00645 49.8 46.5
23 2 3 I 1 2 1.05910 5.5 1.05877 5.5 5.5
24 3 1 2 2 3 5.11964 -8.2 5.11925 -8.2 -8.2
25 1 2 I 2 3 0.08584 27.3 0.08556 27.4 27.3
26 2 3 2 3 1 0.01339 43.5 0.01146 44.8 43.4
27 3 1 3 1 2 22.18513 -20.9 22.17860 -20.9 -20.9
28 I 2 2 2 1 0.01242 44.1 0.01030 45.8 44.1
29 2 3 3 3 2 0.02151 39.4 0.02037 39.8 39.3
30 3 1 1 1 3 14.22120 -17.0 14.21702 -17.0 -17.0
31 1 3 3 2 3 0.14059 23.1 0.14041 23.1 23.0
32 2 I I 3 I 0.01229 44.2 0.01015 45.9 44.2
33 3 2 2 I 2 2.20532 -0.8 2.20466 -0.8 -0.9
34 1 I 2 3 2 0.01339 43.5 0.01146 44.8 43.4
35 2 2 3 I 3 4.83077 -7.7 4.82935 -7.7 -7.7
36 2 3 1 2 I 0.79936 8.0 0.79927 8.0 8.0
•
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 397
Consider the case when the product or the process is subject to both errors in pararneter
settings and external noises. Suppose for exarnple, that a perforrnance characteristic of a
serniconductor-based circuit depends on cornponent tolerances and environrnental
ternperature. In real production process or usage this ternperature can not be controlled
and should be considered as randorn variable. If the range of ternperature variations is
known the problern of robust product design can be resolved using rnethods which are
sirnilar to those given in Chapter 7.
Denote the analytical rnodel by 17 = 17(p + e, n, B), where p = (p 1 p 2 .. . p m Y
is a vector of product pararneters, e = (e 1 e2 eJr is a vector of errors with zero
rneans, D = (n 1 n2 nq f is a vector of the external noise factors with rnean
E(D)= Da and B = (B1 B2 BkY is a vector ofknown pararneters.
...
(8.10)
where
and
As shown in Appendix A.8.3 the models of mean value and variance of the
performance characteristic in use ofthe product can be written as follows:
(8.11)
and
(8.12)
where
-at}+4L L/f;~d;o; +
m q m m-1 m
(8.12a)
For normally distributed errors f..li3 = f-lni3 =O,f..l; 4 =3a~, and f..ln; 4 =3a~;. Hence,
theHOTare
m m-1 m q
HOT= 2'LH;7a;4 +
i=l
Li=l j=i+l
L/f;~O:o; +2 'LH:p:j +
i=l
(8.12b)
i=l j=l i=l j=l
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 399
It is easy to see that if the model q(p,n 0 ,l/) is a second order polynomial as
(7.11) then equations (8.11) and (8.12) coincide with (7.13) and (7.14), respectively. For
this purpose it is sufficient to compute the derivatives for n = O,e = 0 and to notice that
A = ß+2t: p, An= a+fJ r p, H =t: and Hn =1'1. Therefore, using these notations, all
discussions and optimization procedures given in Chapter 7 for polynomial models can
be employed for analytical models as weil. The advantages mentioned in Section 8.2.1
arealso applicable for models (8.11) and (8.12).
Often engineers have an analytical model of the product or process but find it difficult to
incorporate the extemal noises in it. Forthis purpose mixed models, which are partially
based on mechanism ofthe product (or process) and partiallyonexperimental data, can
be useful. In these models a part of variability (or bias in the mean value) can be
expressed analytically, while another part which is due to extemal noises can be
described through a regression equation. This is possible because the models of mean
value and variance (7. 13) and (7 .14), which are based on regression equations, are
special cases ofmodels (8.11) and (8.12).
A mixed model ofthe performance characteristic can be written as follows:
71n = ao + L
j::::}
aini + LL
i=l j;:i+}
ai)ninj + L
i=l
aiini2 + LL
j~J j~J
rijpinj. (8.14)
The vector a and the matrices 1'1 and fJ in (8.13) are defined as follows:
~ is mxq matrix with elements Yu, i = 1,2, ... ,m andj = 1,2, ... ,q.
The errors in product parameters e are with zero mean and the mean of the
externa1 noise vector is E(n) = n 0 .
The performance characteristic's mean value is:
(8.15)
where 7J(p, ()) is an analytical model of the performance characteristic depending only on
product parameters and H is a m x m matrix of second derivatives of 7J(p, ()) with
respect top. The elements ofthis matrix are defined in Section 8.2.1.
The derivative of the polynomial part of the model is:
(8.16)
Two Ievel factorial designs can be used in estimating the coefficients in (8 17), so
that independent estimates of a, and y,1 are obtained. As the elements of t1. and H can be
computed from the analytical model 7J(p, ()) they do not need to be estimated on the
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 401
basis of experiments. This allows the use of highly fractionated factorial designs. For
example, a resolution III fractional factorial design can be used with generator n1 = pipi,
if the interaction between pi and pi is negligible. Information about the significance of
interactions between product parameters can be obtained from the off-diagonal elements
of matrix H computed from the analytical model. If the value of ßii = 2[H];i is small, the
interaction PiPi can be considered as negligible.
_ (p;rl945
17- 4 .05 74 (p;)o 17676
The parameter values in this equation are given in natural measurement scales.
In the production process the parameters can be set without ercors on their Ievels.
However, the surface roughness variance is considerable due to the noise factors. The
target is to minimize the variance keeping the roughness below 4 ,um.
y= '1"/+(/J,
where 71 is computed by the known mechanistic model and rp is the part of the
performance characteristic due to the noise factors and their interactions with p{ and p~ .
lt can also be written in the form
It was found that all model coefficients are significant at Ievel 0.05. The multiple
correlation coefficient R = 0.978 is also significant because F = 63. 74, while the critical
value at Ievel 0.05 is F(0.05,4,11) = 3.36.
Using the regression model we define the following estimates of the vectors and
matrices:
As in the equation we use the natural measurement scales for the pararneters and the
noises, the expectations of the noises are not zeros and are equal to
As I:. = 0 and tr(AI:.) = 0 one can use (8.15) in order to compute the mean
value ofthe surface roughness in the production process:
~ (p' y41945
y(p',n')= 4.0574 (p:r 7676 +4.790-0.08173n~ 1 -0.00656p{n~2 +
+0.32008p~n~ 1 +6.146p~n~ 2 .
(p'y41945
y(p',n') = y(p',n') = = -0.1138- 0.005248p{ + 24.1216p~ + 4.0574 (p;y 17676 .
Putting the values of the elements of a, G and I:. in this equations we obtain the
following equation:
The optimal parameter values are found through grid search based on the mean and
variance models. The optimal parameter values are P{opt = 241 m. min - 1 and P~opt = 0.1.
The corresponding mean value and standard deviation are y= 1. 092 J.lffi and
s = 0.956f.1111. One can see that for the optimal parameter values
y + 3s =l.092+3x0.956=3.959 f.111l,
404 CHAPTER8
y
that satisfies the constraint + 3s < 4 f.JTfl . Parameters providing smaller values of the
variance do exist but they do not satis:ty this inequality. Contour plots ofthe performance
characteristic's mean and variance are shown in Figure 8.3.
Sometimes analytical models are rather complicated and computing derivatives can be
difficult. In this case it may be easier to approximate the performance characteristic by a
polynomial model. Computing first the performance characteristic from the analytical
model in a number of points in the factor space one can then use the least squares
method to fit the data to a polynomial model. The choice of the design and the data
analysis procedures must take into account the fact that the simulated data do not
contain errors. A more detailed discussion on these problems is given in Section 8.5. On
the basis of a regression model of the performance characteristic one can use the
methods given in Chapters 5 and 6. In this section an application of this approach to
robust design of an electric circuit is given.
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 405
The tolerances of C1, C2 and C3 are constant, for C4 and C5 they depend on the
nominal values and are given in percents, while for ~ and 1--z the tolerances are zero, i.e.
the inductances are not subject to random deviations.
The experimental design and the results of computations are shown in Table 8.6.
The following model is fit using stepwise regression procedure:
To build models of mean value and variance in mass production one needs the
variances of errors in product parameters. Assurne that they are normally distributed and
their tolerance intervals are Ki = ±3ai. For the factors p 1 , p 2 and p 3 the tolerance
intervals lengths are independent of the nominal values of factors and are equal to
±0.4pF. Taking into account (5.31) one can compute the standard deviation of the
errors as follows:
CTi = 0: I mi = Ki I 3mi =0.413x 1=0.13333,i=1,2,3.
Consequently, during the optimization phase when ai depends on nominal value p'.
Putting these values and the regression model coefficients in (5.14) and (5.15)
one can compute the mean value and standard deviation of the performance
characteristic in mass production. For the design points they are given in Table 8.6.These
models are used for variance minimization through grid search, under the condition that
y:s; 0.05. The best result is obtained for p. =(-1,-1,0,0,0,1,-1Y. The optimal values
y s
are: = 0.010969and = 0.019601. The optimal parameter values in natural measuring
scale are: C1 = 2 pF, C2 = 2 pF, C3 = 3 pF, C4 = 100 pF, C5 = 60 pF, ~ = 220 nH,
Lz =90 nH.
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 407
•
408 CHAPTER8
This approach was developed in a number of papers on design and analysis of computer
experiments published by Sacks, Welch, Mitchell and Wynn (1989), Welch, Yu, Kang
and Sacks (1990), Welch and Sacks (1991). Shoemaker, Tsui and Wu (1991) give a
discussion about the applicability of this method for experiments with product prototype,
or with a physical process.
There are two important distinctions between this approach and the Taguchi
method based on simulations:
• Combined arrays are used instead of Taguchi's cross-product designs. Product
parameters and noise factors are combined in a single array. It is similar to the combined
array used in Section 7.2 with the difference that the errors in product parameters are
considered as separate noise factors. Therefore, these combined arrays have more
columns (and number ofruns) than the ones considered in Section 7.2.
• The response itself is modelled rather than the signal-to-noise ratio or expected
loss. A prediction model of the expected loss based on the fitted regression model is
used. In contrast Taguchi does not use response surface models.
The method can be described for r responses as follows:
I. Postulate a model (usually a polynomial) for each response y, (p, e, n ),
i = 1, 2, ... , r.
2. Design an experiment with product parameters (p) and noise factors (n) and
run a simulator to obtain the corresponding performance characteristics' values.
3. Use the data to estimate the unknown coefficients ofthe following polynomial
models:
k,
4. Assess the model adequacy and improve the model if necessary. Contour plots
can be used at this step and if the prediction has low accuracy, they can suggest
subregions in the factor space for the next experiments. If necessary, reduce the size of
the input space to obtain a better prediction.
5. For given p predict the value ofthe loss statistics defined as follows:
i(p) = fJ i[Y1 (p, e, n}y2 (p, e, n} ... ,y1 (p,e, n)]g(n, e';tedn, (8.18)
where g(e, n) is probability density function of the noises and /[.] is loss function.
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 409
Authors usually employ quadratic loss and the density g(e, n) is tak:en as uniform
on a finite set of noise Ievels representing typical and extreme conditions. Monte Carlo
simulations are used for solving the integrals in (8.18}.
6. Minimize i(p) as function ofp.
7. Conduct a confirmatory experiment to evaluate the obtained optimal parameter
vector Popt· Forthis purpose a Monte Carlo experiment with fixed p = Popt and noise
factors varying according to g(e, n) can be carried out. Altematively a small computer
experiment can be conducted varying n and e for p = p opt.
A Straightforward method for solving the quality improvement problern is to plug the
loss statistic into a numerical optimizer and to use Monte Carlo simulators to compute
the integral (8.18}. However, due to the complexity of the analytical model Monte Carlo
experiments can be very expensive. Sacks, Welch, Mitchell and Wynn (1989) give
examples of such complex experiments from fluid dynamics, chemical engineering,
integrated circuit fabrication and other processes. For example, they consider a chemical
engineering problem, for which a single run of the code tak:es up to 20 minutes on Cray
X-MP. Running a Monte Carlo experiment with such models is impossible.
Further on we do not consider this approach. For more details on applying Monte
Carlo experiments for quality improvement see Logothetis and Wynn (1989}.
Taguchi method can be applied for quality improvement based on analytical models just
in the same manner as for physical experiments but taking the data from a simulator.
However, this approach is rather expensive. It has the same disadvantages as those
discussed in connection with physical experiments in Section 4.13.
There are some specific problems of quality improvement based on mechanistic models.
They arise in connection with the simulation codes complexity and absence of random
output errors.
The complexity of the models mak:es the solutions expensive. Therefore, it is
important to avoid simulations with several values of the noise factors. This can be
achieved by the methods given in Sections 8.2, 8.3 and 8.4. Ideally the cheapest solution
can be obtained if the analytical model is approximated well enough by second order
Taylor expansion. In this case equations (8.5) and (8.6) can be employed with the third
derivatives set equal to zero. A subsequent application of the analytical optimization
410 CHAPTER8
procedures of Chapter 6 gives an accurate and quick solution. Canonical analysis and
graphical tools can be used for exploring mean value and variance surfaces.
Models (8.5) and (8.6) arealso applicable for third order Taylor approximation of
the analytical model. In this case numerical optimization procedures must be used. They
are not very expensive because the computations based on models (8.5) and (8.6) are
much simpler than those with the original simulation code.
The method of Section 8.4, which is based on the use of response surface
approach, has similar properties with respect to computational complexity. The main
difference from the methods of Sections 8.2 and 8.3 is that the computation of
derivatives is replaced by least- squares estimation of polynomial model coefficients.
The method of Section 8. 5. 1 based on combined arrays is more complicated than
this of Section 8.4 in two respects:
• Considering the errors in product pararneters as separate noise factors increases
the nurober of terms in the regression model, the nurober of design runs and the
dimensions of the matrices in least squares estimation procedures.
• The computation of loss statistic (8.18) is connected with Monte Carlo
simulations, while it is computed much simpler using the models of Sections 8.2, 8.3 and
8.4.
Computational problems are more difficult when the original analytical model can
not be satisfactorily approximated with second or third order Taylor expansions or
polynomials, respectively. To some extent the methods of Sections 8.2 and 8.3 can avoid
this difficulty by combining modeHing and optimization procedures. As it was shown in
Section 8.2.1 low order Taylor expansions can be used at each optimization step. This
allows the use of the simple formulae (8.5) and (8.6) for computing mean value and
variance, respectively. First or second order Taylor expansions are usually satisfactory.
However, the cost of computations in this case is increased due to the necessity of
numerical evaluation of derivatives at each optimization step.
Sacks, Welch, Mitchell and Wynn (1989) try to overcome this difficulty by
considering the response Y; (p, e, n} as a realization from a stochastic process or random
function of n and e. They compute the correlation function of this process and use it for
generalized least squares estimation of model parameters. Welch and Sacks ( 1991)
report several examples when this approach improved the results with regard to fitting
polynomials by least squares. In this case the problern of specifYing the correlation
function arises. The computational burden also increases because the correlation matrix
R in the generalized least squares estimator
A third criterion appeared in a panel discussion (Nair (1992)) and was proposed
by G. Box and by Sacks and Welch. lt minimizes the loss function over the space of
noise factors:
min J(y- rY g(e, n }dnde,
Rn,e
8. 7. Bibliography
The idea ofusing analytical models for quality improvement appeared in Taguchi's books
(Taguchi (1986, 1987)). Box and Fung (1986) proposed a variance equation based on
mechanistic model of the performance characteristic. They used second order Taylor
expansion and neglected high order moments of the variance distribution. The method of
Section 8.1 appeared in Vuchkov and Boyadjieva (1990b, 1994). An application of this
approach is given in Vuchkov and Boyadjieva (1995}. The approach of Section 8.5 was
developed in a series of papers: Sacks, Welch, Mitchell and Wynn (1989}, Welch, Yu,
Kang and Sacks (1990}, Welch and Sacks (1991}. For additional reading on design of
computer experiments see Sacks, Schillerand Welch (1989}, Currin, Mitchell, Morris,
and Ylvisaker (1991}, Welch, Buck, Sacks, Wynn, Mitchell and Morris (1992}, Morris,
Mitchell, and Ylvisaker (1993}, Mitchell, Morris and Ylvisaker (1994).
where e = {e1 e 2 •.• eJr is m vector of errors in factors. Suppose that errors
e; and ei,i :t:. j are independent with zero means and diagonal covariance matrix:
Denote the third and the fourth moments of the errors J-1; 3 = E(e;) and
J-1; 4 = E(e: ), correspondingly. Assurne that the output error e is independent between
runs and its moments are E(e)= 0, var(e)= cr;,cov(e.,e.)= 0, where u and v stand for
different runs. Under these assumptions the performance characteristic's variance in mass
production can be computed as follows:
m m m-1 m
CT 2 = LA~CT;2 + LH;~(ui4 -CTn+4L LH;~CT;2 CT: +
i=l i=l i=l j=IJ#
(8.4)
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 413
H =_!_ o27]{p,ß)
2 opopT
where
Proof The proofis similar tothat given in Appendix A.5.2 for formula (5.10). A
third order Tay1or series expansion of 17(p + e, ß) is
(A.8.2)
e
With mutually independent 6 and e, cov[e, (.1T e + eTHe + e)] = 0. Neglecting all
moments of error distribution higher than fourth, one can also write
var(ee) = 2cov(eTHe,ee) = 0. Under these conditions equation (A.8.2) can be
rewritten as follows:
414 CHAPTER8
(A.8.5)
i=l j.::::l J=i+l
where v includes all possible covariances between pairs of errors., For example, e,e1 and
eA ( i,j,r,s = 1,2, ... ,m and i,j not coinciding with r,s).
Substituting H,, and H,, with ßii and ßu I 2 in (A.8.5) we obtain (A.5.11).
Therefore, the proof of formula (A.5.14) of Appendix A.5.2 can be used to rewrite
(A.8.5) as follows:
(A.8.6)
i=l i=) j.::::i+l
Taking into account that E(e)= 0 the third term in (A.8.3) can be written as
follows:
As the errors are independent, the i-th element ofvector E(eerHe) can be presented in
the form
) {J.I.. H .. for i = j = s .
L Lm E ( e .e .e H . = 13 u
m (A.8.8)
j = 1s = 1 l J s JS Ofor all othercases
From (A.8.7) and (A.8.8) one can obtain the following formula:
(A.8.10)
Taking into account that t =(erT( 1)e erTc 2le ... erT(mlef and having in mind the
independence of errors in factors we obtain the following expression for the i-th element
ofvector E(ee e):
where the i,j-th element of matrix T(J) is denoted by z;jll. Using this formula one can
rewrite (A.8.10) as follows:
Putting (A.8.4), (A.8.6), (A.8.9) and (A.8.12) into (A.8.3) we obtain (8.4).
und er the same assumptions and using the same notations as in Appendix A. 8 .1.
Proof Under the same assumptions as above one can find the expectation of
(8.2) from (A.8.1) as follows:
(A.8.14)
416 CHAPTER8
Taking into account that the errors in factors are independent we can also write
=E{~e[~e
L.."" l
2
L.."" T(;)+2~.-~l
~T(ile.e
L.."" L..""
)] ;k J k
]}= r'l3
11. rUl
ll .
(A.8.15)
i~l j~l pl k~ j+l
(8.10)
Suppose that the noises are with zero expectations ( E(e) = 0, E(n) = n 0 ) and
covariance matrices
and
Under these assumptions the models of mean value and variance of the performance
characteristic in product's usage can be written as follows:
(8.11)
and
(8.12)
where
(8.12a)
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 417
(8.16)
Taking expectation with·respect to e and n and bearing in mind that the noises
arenot corrected and E(e)= O,E(n)= n 0 , E(e)= 0 we obtain
Taking into account that q(p,n 0 ,(1) is non-random and that the noises are not
corrected and with zero expectations this equation can be simplified because
var[p, n 0 , B] =
=2cov [ q{p,n 0 ,B)+!J.r e+erHe+ tl"(n -n 0 )+(n -noY HJn- n 0 )+er 9(n- n 0 ),e]=
The values of var(!J.r e) and var(erHe) are computed in Appendix A.8.1 ((A.8.4)
and (A.8.6)). The same way one can compute var[Lir (n- n 0 )] as follows
and
·
m q
The next term 2 cov[ /).r e, er t; (D- Da)] equal to zero because
cov( tl e,er t;(D- D0)] = E[ /).r eer t;(D- Da)]- E(/'l e)E[ er t;(D- Da)]=
(A.8.25)
Putting (A.8.4), (A.8.6) and (A.8.18)- (A.8.26) into (A.8.17) we obtain (8.12)
•
Appendix A.8.3. Derivation of mean and variance models for third order
polynomials
Below are given the proofs of equations (5.14) and (5.15) of subsection 5.2.3.
Suppose that the model of the performance characteristic is a third order
polynomial:
L Lßil)P; Pi.
m m m
+ Lß;,;P; + 2 (5.13)
i=l i:=l j-zl,j+:-i
Under the assumptions that the errors in product parameters e, are independent
and normally distributed the models of mean value and variance in mass production are:
and
420 CHAPTER 8
+HOT+d ;, (5.15)
where
(5.16)
Proof
The proofis based on equations (8.5) and (8.6). Compute following expressions:
1 02 (p) m
17 -=ß +3ß . p + "ß
H n = -;+,2
- Pj
2 vy i
u ni z . ~ . ii.f '
r=-1,}~1
QUALITY IMPROVEMENT THROUGH MECHANISTIC MODELS 421
Putting these expressions in (8.5) and (8.6) we obtain (5.14) and (5.15),
respectively.
•
CHAPTER9
9.1. Introduction
422
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 423
obtained by experiments without errors in factors, we use them to induce new models of
the mean values and variances in mass production. Optimization procedures based on
these models allow finding the optimal parameter values.
The models of performance characteristics depending on both qualitative and
quantitative factors are reduced to the forms given in chapters 5 and 7. This way the
results of chapters 5, 6 and 7 with minor changes can be used in the case under
consideration.
9.2.1. DUMMYVARIABLES
In this chapter the quantitative product parameters are denoted by P;, the qualitative
ones - by v; and the quantitative external noise factors by n;. Assurne that the qualitative
parameters can be set without errors in the production process. Further on they are
represented by dummy variables w, . Cases with categorical external noise factors, which
are rather artificial for real production processes, are not considered. For the sake of
simplicity we start with models that do not contain external noise factors.
424 CHAPTER9
w, (l·)={lfors=t , (9.1)
Ofor s ;t: t
where s = 1,2, ... ,r; -1, t = 1,2, ... ,r;. The total number of dummy variables is
r ='i + r 2 +... +rd - d.
Consider second order polynomials with quantitative and dummy variables. Two
important features ofthese polynomials should be taken into account:
• One can see from (9.1) that w, = w;. Therefore, one must not include in the
models both linear and quadratic terms with respect to dummy variables. Further on we
omit the quadratic terms .
.• Interactions between dummy variables representing the same qualitative
variable must be excluded from the model.
If linear and quadratic terms of any dummy variable are included in the
polynomial F has columns that coincide. If any of the interactions between dummy
variables, representing the same qualitative variable, is not excluded from the polynomial
there are columns in the extended matrix F which consist only of zeros. In both cases the
information matrix FrF is singular and the least squares estimates of the model
coefficients can not be computed.
Denote by
LLoi,wiwJ
i j>i
the sum of all admissible pairwise interaction terms between dummy variables.
A second order polynomial with quantitative factors and dummy variables is
written as follows:
where 8 is output random noise with zero mean and constant variance 0:. The values of
8 are independent between the Observations.
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 425
Example 9 .1.
Consider a model with two quantitative factors p 1 , p 2 and two qualitative factors
vl' v 2 . Factor v1 has two levels and is represented by a dummy variable w 1, while factor
v2 has three levels and is represented by dummyvariables w2 and w3 . They are shown in
Table 9 .1. It is easy to see that wj = wj2 and the product w2 w3 is equal to zero for any
combination of levels of w 2 and w 3 . A second order polynomial for this case is
1 Leve Is of quar1tat1ve
T ABLE 9.. . f1actors anddummyvan·ab1es
Qualitative factors Dummy variables
VI Vz wi w2 w3
1 1 1 I 0
1 2 1 0 1
1 3 1 0 0
2 1 0 1 0
2 2 0 0 1
2 3 0 0 0
•
Model (9.2) can be rewritten in matrixform as follows:
(9.3)
I I
ßJI 2ß12 2_ß1m
I I
11= 2ß12 ß22 2_ß2m
... ...
I I
2_ß1m 2_ß2m ßmm
/12
.L=
['" /21
/r1
/22
Jr2
It is possible to rewrite model (9.3) in the form (5.I7) and therefore, to use the
results of Chapters 5 and 6. We will make distinction between the following two groups
ofmodels:
i) There are no interactions between qualitative and quantitative factors. although
there might be interactions within each group.
In this case the model (9.3) reduces to:
(9.4)
where
(9.5)
One can see from (9.4) that the intercept ß0 , is different for every combination of
the qualitative factors. The linear and the second order terms with respect to product
parameters do not depend on qualitative factors.
ii) There are interactions between qualitative and quantitative factors.
The model (9.3) is ofthe form
(9.6)
where
ß, = ß+.LT W, t = I,2, ... ,r 0 . (9.7)
The only difference between this model and (9.4) is that in (9.6) both intercept
ß01 and the linear terms ß/ p depend on qualitative factors. The second order terms with
respect to product parameters do not depend on qualitative factors.
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 427
Example 9.2.
Consider an example with 3 quantitative (pi,p 2 ,pJ and two qualitative (vi,v2 )
factors. The first qualitative factor has two Ievels: vi (!} vi (2) and the second one has
three Ievels: v2 (1), v2 (2), v2 (3). The first categorical factor can be represented by one
dummyvariable wi, while for the second we need two dummyvariables w2 and w3 . The
correspondence between the Ievels of the qualitative factors and the dummy variables is
shown in Table 9.1.
Suppose that the product performance characteristic is given in the form (9.2) as
follows:
(9.8)
This model can be written in the form (9.6). The intercept of(9.8) is ßo = 15, while the
vectors and matrices in (9.6) are defined as follows
p=(pi Pz P3Y,w=(wl Wz w3Y,
•
Models with external noise Jactors
Suppose that both product parameters p = (pi p2 pm Y and external noise
factors n = (n 1 n2 nq f are measured without errors in the experiment and the
model ofthe performance characteristic is a second order polynomial:
m m-l m m q
+I f
q q-I q q
+
m
or in matrix form:
(9.10)
where ßot>ßt>w and 11 are defined as in the previous subsection. The interactions between
qualitative and quantitative factors are taken into account through ß 1 which is defined by
(9.7).
The other notations are as follows:
• a = (a 1 a 2 ... aq f is q-vector of coefficients in the linear part of the
model with respect to the external noise factors.
• .4 is q x q matrix which contains the coefficients in the second order terms with
respect to noise factors:
1 1
aii -ai2 2alq
2
1 1
... 2a2q
.4= -ai2
2
a22
1 1
2aiq 2a2q aqq
I
and external noise factors:
rl2 ...
9=
['"Y21
Yml
r
r
22
m2
...
...
...
r,.
Y2q
r mq
'
[~" ~'']
rpl2 ...
...
'?= (/).21
(/)22 rp2q
(9.11)
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 429
where
(9.12)
The changes oflevel combinations ofthe qualitative factors affect only ß 01 ,ß1 and
a 1 in equation (9.11). The quadratic effects and the interactions of product parameters
and external noise factors do not depend on them.
Example 9.3.
Suppose that the performance characteristic depends on 3 quantitative and two
qualitative factors defined as in Example 9.1 but there are two noise factors as well:
n1 and n2 . Let a model ofthe form (9.9) is given as follows:
(9.13)
Model (9.13) can be written in matrixform through equations (9.11) and (9.12).
Note that the terms, which do not contain noise factors, are the same as in Example 9.2.
That is why we use the definitions of the transformed coefficients ß01 ,ß1 , the vectors
p,w,o,ß, and the matrices 8', V and 9which are given in Example 9.2. Additionally for
presenting (9.13) in the form (9.11) and (9.12) the following vectors and matrices are
defined:
9.3. Design and analysis of experiments with both qualitative and quantitative
factors
In Section 9.1 we mentioned two problems linked to the design of experiments with both
qualitative and quantitative factors:
• The number of factor Ievels of the existing response surface designs might not
coincide with the number of Ievels of the qualitative factors,
• The Ievels of the qualitative factors have to be assigned in an appropriate way
to the Ievels of the design.
In this case, as usual for the model-based approach, Taguchi's crossed arrays are
not needed. If there are errors only in the product parameters a combinatorial or
response surface design is used. For cases with errors in parameters and external noise
factors the combined designs described in Section 7.2 are suitable.
It might happen that the number of Ievels of qualitative factors coincides with the
number of Ievels in a response surface design. Consider for example, an experiment with
two quantitative factors: p 1 and p 2 and one qualitative factor v with two Ievels. If one is
interested only in main effects and interactions among factors a two Ievel full factorial
design is appropriate. In this case it is convenient to code the Ievels of v by -1 and 1.
If the number of Ievels of qualitative factor does not coincide with the number of
Ievels of the design one has to adapt the design, not the problern. There are two possible
choices:
i) Use an existing multilevel design. There are many combinatorial designs with
different number of Ievels like Latin and Graeko-Latin squares, Youden squares,
Balanced Incomplete Block designs (BIB-designs) and orthogonal arrays (see for
example, Taguchi and Konishi (1987)). Areal example (Example 9.5) is given in Section
9.7, where an orthogonal array L18 (2 1 x 37 ) is used.
ii) Generate a special design for the problern under consideration. Draper and
John (1988) give examples of a thorough analysis of the problern, which can show how
to choose a design, taking into account the sequential nature of the experimentation.
A computer-based approach is also possible which ensures a flexibility of the
design choice and can be used in rather generat context. It employs algorithms for
sequential generation of D-optimal designs. Some of them are considered in Chapter 3.
In order to apply these procedures with both qualitative and quantitative factors one can
use polynomials with quantitative variables and allow dummy variables to take only
values 0 and 1 during the search. As mentioned in Chapter 3, the designs can be
generated in blocks to use the information obtained at the earlier stages of the
experiment. In example 9.4 of Section 9 7 a sequentially generated nearly D-optimal
design is employed. Atkinson and Donev (1992) consider in details algorithms for
computer generation of D-optimal designs for models with both qualitative and
quantitative factors.
The choice of a design is linked with the methods for data analysis used after the
experiment. If decisions are based on examination of the main effects and interactions
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 431
orthogonal designs are appropriate, because they ensure uncorrelated estimates. These
effects can be studied using ANOVA or normal plots.
Usually the sequential procedures for generating D-optimal designs produce
asymmetrical designs. In this case the information matrix FTF and the covariance matrix
(Fr F )- 1 are non-diagonal and the regression coefficients estimates are correlated which
makes the independent interpretation of the effects impossible. This would not cause
serious problems if numerical procedures are used for model building and finding optimal
product parameter values. For example, regression analysis and numerical procedures for
optirnization, contour plots or canonical analysis can be employed. Fora choice between
models (9.4) and (9.6) partial F-ratio (see Section 2.3.7), best possible regression or
stepwise regression is suitable. Of course a combination between numerical methods and
individual interpretation ofthe effects is always desirable.
9.4. Models of mean value and variance in mass production and usage of the
product
The models of the performance characteristic given in Section 9.2.2 formally coincide
with the models used in Chapters 5 and 7. Therefore, the results obtained in those
chapters can be easily adapted to cases with both qualitative and quantitative factors.
Suppose that in mass production the product or process is subject to errors in
parameters e =(e1 e2 ..• eJr and extemal noises n =(n1 n2 ... nqf. Suppose
that experiments without errors in the factors were conducted and a regression model of
type (9.11) was obtained. In presence of errors in the parameters and extemal noise
factors the performance characteristic can be written as follows:
(9.14)
This equation Iooks exactly like (7.12). The only difference is thata"p, and Pot are
substituted in (9.14) for a,p and Po of (7.12). As the categorical variables do not
contain errors in production process or usage, a"p, and Por are not random and the
models of mean value and variance in mass production and usage do not differ from
those obtained in Chapter 7. They can be written sirnilarly to (7.13) and (7.14) as
follows:
y(p)= E(y(p, w,n)]= 17(p, w)+ tr (ß'~.)+tr~n)=
(9.15)
and
432 CHAPTER9
(9 I6)
where if; , i = I, 2, ... , m and if.; , i = I, 2, ... , q are variances of the errors in factors and of
the external noise factors, correspondingly.
The following conclusions can be drawn from (9.I5) and (9.I6):
• The bias in the mean value tr('?i L.) + tr(rl L.) does not depend on
qualitative jactors. However, the mean value itself is their function via ß0 , and ß,.
• The variance u 2 (p) depends on qualitative jactors only through their
interactions with the quantitative and the noise factors. Actually formula (9.I6) shows
that u 2 {p) is linked with the qualitative factors only through a; = ar + w r-; and
ß, = ß+ wr L .. The matrices -; and L contain regression coefficients of the qualitative
factors' interactions with the noise factors and the product parameters, correspondingly.
If there are no such interactions then a, = a, ß, = ß and the qualitative factors do not
affect the variance. This conclusion shows how important the interactions in quality
improvement problems are.
• High orderterms (HOT) are not affected by qualitative factors.
In cases when only errors in product parameters exist and there are no external
noises, the models of the mean value and the variance in mass production can be
obtained as special cases of (9.I5), (9.I6) and (9.17)
(9.I8)
The conclusions about the influence of qualitative factors given above are true for these
cases as weil.
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 433
Formulae (9.18) and (9.19) are similar to (5.19) and (5.22). They are suitable
when error distribution is normal. If this condition is not satisfied one can use (5.10)
instead of ( 5.19), substitutins ß 1 for ß.
Formula (9.19) is applicable ifthe bias due to substitution ofestimates for model
coefficients is negligible. Otherwise we can use formula (5.48) as in subsection 7.3.1,
with vector (} defined as follows:
o =<Pot ß11 ... Pmt ß12 Pm-l.m ß11 ... Pmm a11 aqt
9.5.1. INTRODUCTION
With the models obtained in Section 9.4 the robust product design reduces to an
optimization problem. The optimality criteria of Section 6.2 are suitable for the case
under consideration as weH. The optimization methods described in Chapter 6 and
Section 7. 4 have to be modified to take into account some specific features of the
optimization problem, when both qualitative and quantitative variables are included in the
model. Two ofthem are ofspecial interest:
• The fact that the qualitative factors can take only discrete values makes some
differences in the application of the analytical optimization procedures of Chapter 6 and
Section 7.4.
• Qualitative variables do not affect the second order terms of performance
characteristic's models (9.15) and (9.18) and the models ofvariance (9.16) and (9.19).
The smaller the better and the !arger the better cases
As shown in Chapter 6 the stationary points of the mean value and variance surfaces
coincide with each other. If there are no constraints on the parameter values pi>
i = 1,2, ... ,m, their coordinates can be obtained similarly to (6.28) by putting the first
derivatives of(9.18) and (9.19) equal to zero:
(9.21)
As b, depends on qualitative factors, there are distinct stationary points for each
combination of their Ievels. That is why the optimization procedure in this case is as
follows:
i) Compute P., using (9.2I) for all possible combinations of qualitative factors
Ievels.
ii) Put p ., in (9 .I8) and choose that combination of qualitative factors Ievels
which corresponds to an extremum of y(p). This combination minimizes the variance as
weil.
The procedure is much simpler if there are no interactions between qualitative
and quantitative factors. In this case the coefficients in the linear terms of the regression
model do not depend on qualitative factors and b, = b for all possible combinations of
their Ievels. Hence, the variance does not depend on qualitative factors and the stationary
point coordinates are
I -1
p s = --B
2 b. (9.22)
In this case the extremum of the performance characteristic is found from (9.18)
by separating the terms with qualitative and quantitative factors, because only the
intercept b0, depends on the categorical variables. The optimization procedure is:
i) Using (9.5) find the combination of qualitative factors' Ievels that provide
extremal (minimal or maximal) value of b01 , taking into account that
If the target is the same for all combinations of qualitative factor Ievels r is
substituted for T1 in (9.24) and (9.25) and the best combination of Ievels is the one
a
providing a minimum of 2(p.l) computed by (9.25).
A geometric interpretationoftbis solution related to the discussion in Section 6.4
is relevant here as weil (see Figure 6.3). Orientation of the axes of mean value and
variance surfaces depends on the matrices of the following quadratic forms: pTBp and
pTBT:E.Bp. The directions of these axes are the same for all Ievel combinations of the
qualitative factors, because B and BT:E.B do not contain categorical variables. Formulae
(9.23), (9.24) and (9.25) show that the optimal parameter values p.11 •2 , the value ofthe
performance characteristic at the stationary point ji81 and the minimal value of the
a
variance 2 (p •t ) depend on the qualitative factors.
For cases when there are no interactions between qualitative and quantitative
factors the following equality exists: b 1 = b. Therefore, the stationary point (9.22) is the
a
same for all combinations of qualitative factor Ievels. However, Ysl' P•t!2 and 2 (p.r)
depend on categorical variables via b01 • In spite of the fact that the variance surface is
independent from qualitative factors, its tangent points with the mean value surface are
functions ofthe dummyvariables via Yst and b01 •
It is tempting to use the ridge analysis of Section 6.4.2, as the qualitative factors
can be set without errors in mass production. However, this method can not be applied
because of the discrete character of the qualitative factors. The method of Section 6. 4. 2
can produce values of the categorical variables that do not really exist. This method is
appropriate only if some quantitative factors can be kept without errors during the
production process.
If both errors in product parameters and external noise factors are of interest one can
apply the procedures of Section 7.4 taking into account that at>ßt and ß 01 in equations
(9.15) and (9.16) are functions of the qualitative factors. If there are no interactions
between qualitative and quantitative factors the search of optimum parameter values can
be separated with respect to each of them, because in this case the variance does not
depend on categorical variables. When there are such interactions an optimal set of
quantitative factors is found for each combination of qualitative factor Ievels. The best
combination provides extremal value of the performance characteristic's mean value.
If constraints are imposed on the factor Ievels the stationary points might be
outside the region ofinterest. In this case the methods described in Section 9.5.2 arenot
436 CHAPTER9
suitable and methods like grid search or contour plots can be used for finding optimal
parameter values.
9.7. Examples
The design and the observed values of the performance characteristic are given in
Table 9.3.
TABLE 9.3. Experimental design, Observationsand predicted mean values and standard deviations of
.
the transm1tted error m mass product10n
No. PI Pz w y y{p,w) s;,(p, w)
1 -1 1 l 4.08 3.987 0.03092
2 1 -l 1 3.64 3.546 0.11268
3 -1 -l l 4.35 4.312 0.05777
4 1 1 1 3.30 3.221 0.10156
5 -l -l 0 32.60 32.479 0.12835
6 l l 0 28.00 28.434 0.19604
7 -l 1 0 30.50 31.074 0.07868
8 l 0 0 29.00 28.953 0.20551
9 0 -l 0 31.00 31.464 0.16990
10 0 0 l 3.53 3.889 0.04194
11 l -l 0 30.00 29.839 0.22071
12 0.1 l 0 30.75 29.924 0.14224
13 -1 0 0 31.90 31.593 0.09996
14 -l -l l 4.35 4.312 0.05777
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 437
Stepwise regression procedure is used for model building. The interaction effect
between the quantitative factors is insignificant and is eliminated by a stepwise regression
procedure. The following model with interactions between qualitative and quantitative
factors is obtained:
and
The notations h01 , b 1 , B, d, D, band L are used for the estimates of ßol' ßl' lf,
o, V, ß and L , respectively.
The intercept of y is b0 = 30.5773. The vectors and the matrices are defined as:
b=(-13195 -o.1o25y,
B= (
-0.3053 0 J
,L=(0.9368 0.5398).
0 0.1831
Since there is only one qualitative factor, d has only one element d 1 = -26.6913
and the matrix of the interactions between the qualitative factors has elements equal to
=
zero: D 0.
The residual variance s~ = 1. 7077 I 6 = 0.28462 can be considered as an estimate
of error variance 0:. The standard deviations of errors in the coded quantitative factors
are a 1 = 0.1 and a 2 = 0.1 and their covariance matrix is L. = diag(0.01,0.01). They are
computed taking into account ( 5.31) and the fact that the standard deviations in natural
measurement scales for both factors are a;
= ~ = 1, while the intervals of variation are
w 1 = w 2 = 10. Therefore, a; = cT, I w; = 0 .1, i = 1, 2.
The mean values of the performance characteristic in mass production are
calculated by (9.18) after substituting y for 77(p). First we compute
438 CHAPTER9
where
m m-1 m
HOT= 2Lß;;cr: + L Lb;;cr; cr;.
j:=l •=1 j~i+l
2
First compute
r
b1 =b+Lw= (-1.3195) + (0.9368) w= (-1.3195+0.9368wJ
-0.7025 0.5398 -0.7025+0.5398w
and
2Bp = 2(- 0.3053 0 J(p1 J = (- 0.6106p1 J.
.
0 0.1831 p2 0.3662p2
Hence,
-1.3195 + 0.9368w- 0.6106p1 J
(
~~-= -0.7025 + 0.5398w + 0.3662p 2
As the interaction between p 1 and p 2 is insignificant ( bii = 0 ), the second term in the
formu1a for high order terms is zero and
s 2(p, w) = o.28462+
(
-1.3195+0.9368w-0.6106pl
+ -0.7025+0.5398w+0.3662p2
0.01
)T( 0
0
X
-1.3195+0.9368w-0.6106pl
0.01 -0.7025+0.5398w+0.3662p2
)
or
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 439
s 2 (p,w)= 0.01(-1.3195+0.9368w-0.6106py +
. { 3.8660, for w = 1
h01 = b0 + d1w = 30.5773- 26.6913w= ;
30.5773, for w = 0
b ={(-0.3827 -0.1627},forw=l
t (
-1.3195 -0.7025 )T ,forw=O '
The coordinates of the stationary point and the corresponding predicted value of
the performance characteristic are computed using (9.21) and (9.23). The following
values are obtained:
-{(-0.6267 0.4444},forw=l
Pst - (-2.1609 1.9189 )T , for w = 0 '
_ _ { 3. 9685, for w = 1
Yst- ·
31.3278, for w = 0
440 CHAPTER9
--y
1.5
0.5
P2 0
-0.5
-1
-4 -3 -2 -1 0
P1
Figure 9.1. Contour plots of the resistance y and the transmitted error variance s,;
in mass production, type I ceramies (w = 1).
The optimal parameter vectors (Figure 9.1, Figure 9.2) and the corresponding
minimal values of s;(p.1 ) are computed using (9.24) and (9.25):
_ 2 (p )={0.0002557,forw=1
s. ~,w .
0.0162418, for w = 0
One can see that the optimal solutions for w = 0 (r = 300) areoutside the region
of interest defined by the inequalities -1 ~ P; ~ 1, i = 1, 2. For this case a graphical
solution (Figure 9.2) is found at the point P•opt = (0.043 1f. Formula (2.28) is used for
calculating the optimal parameter values in natural measuring scales:
For the first factor (i = 1) the values w1 = 10 A and p{0 = 45 Aare found from Table 9.2
and the optimal solution in natural units is computed using (2.28):
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 441
The optimal value of the second factor is P~opt = 50 s which corresponds to p 2 opt = 1. In
the optimal point s,; (p opt> w) = 0. 01926 ' while the variance of the performance
characteristic is s (p, w) = s,; (p, w)+ s; = 0.01926 + 0.28462 = 0.30388.
2
P• • Pst
1.5
-2 -1 0
P1
Figure 9. 2. Contour plots of the resistance y and the transmitted error variance ~;
in mass production, type II ceramies (w = 0).
Subsequent experimental work can be done for the case when w = 0 because it
can be expected that the optimal parameter values for this case wou1d be out of the
initially chosen region of interest. It can be expected that the variance would be reduced
more if p 2 is shifted towards higher values.
For the case w = 1 ( r= 40) both analytical solutions are within the region of
interest and the variance is one and the same for each of them. The following optimal
parameters are chosen: Popt = (- 0.6267 0.0299 Y.
The optimal parameter values in
natural measuring units are: P{opt = 38.733 A and P~opt = 40.299 s. At this point the
variance of the performance characteristic is S 2 (p, w) = s,; (p, w)+ s; =0.28488' while
the variance ofthe transmitted error is s;(pop" w )= 0.0002557.
1
•
442 CHAPTER9
Table 9.4 shows the qualitative factors and their natural and coded Ievels as weil
as their presentation through durnrny variables.
The Ievels of the quantitative factors are shown in Table 9.5. Apriori is known
that in the production process the embedding force is varying in constant tolerance
interval ± 13 N, while the tolerance interval ofthe embedding depth is ± 0.1 rnrn.
Second order models (9.11) with dummy variables and external noise factor are fit to
data. They are of the form
(9.26)
The coefficients b01' b 1 and a 1 depend on the qualitative factors and are equal to
b 1 =b+Cw (9.28)
and
a 1 = a + <I>r w. (9.29)
B= [ b"
1
-bl2
2
~b J
2
b
12
22
,L = [/"
/21
/31
1"
/22
/32
l
, D =
1
0
-dl2
2
1
-dl3
1
-dl2
2
0
0
1
-dl3
2
0
0
2
Taking into account these notations we can rewrite equation (9.26) in the form:
r21 - 348.830 -
d1 0.0061 - 0.1869
d2 -0.0020 -227.966 -
d3 -0.0020 -119.524 -
d12 - -174.167 -0.1793
dl3 -0.0104 101.551 -
/ll - - 0.3929
/12 -0.0152 - -
/21 0.0145 - 0.4751
/22 - - -0.2262
/31 - 16.8952 -
/32 - - -0.1772
9'n -0.0066 61.267 -
9'21 - -440.617 -
9'31 - - -
The multiple correlation coefficients R for these models are significant. They are
given in Table 9.8 where the following values are shown: F-ratio (F), critical point OfF-
distribution for significance Ievel 0.05 (Fr), residual degrees of freedom (v R), residual
variance (s~) and degrees of freedom for the variation explained by the model (v M).
Regression models are used for building models of mean and variance of the
performance characteristics during the production process. The noise variance matrices
L. and L are found as follows. The coded standard deviations of the errors in product
0
a = a; =___!I_= 0.07222
1
lü 1 3 X 60
and
0'2 = a;
lü 2
=____Q:.!__=0.04167,
3x0.8
The variance matrix of the noise factors Ln has only one element. As shown in
Section 5.3.2 it is equal to a~ = (113Y = 1/9.
The estimates given in Table 9.7 are substituted for the coefficients in (9.15),
(9.16) and (9.17). Computed mean values y
and standard deviations s of the
performance characteristics for the design points are given in Table 9.6. The variation of
the performance characteristic can be resolved into two components: variation due to
transmitted error with standard deviation str and output variation. The values of s1r are
given in Table 9.6 for comparison with the standard deviation of the total variation s.
One can see that the transmitted variation is prevailing.
The optimal parameter values are found through grid search. Only combinations
of parameters, which satisfy the requirements yi < 0. 045 mm, y 2 > 550 N, y 3 < 1 s, are
taken into account. As noted in Chapter 6 if Y; in these inequalities was replaced by y,
then for "optimal" values near the borders of these intervals (y
1 "'"'0.045 mm,
y 2 "'"'550 N,y 3 "'"' 1 s ) a Iot of the products would be defective. In order to avoid this
y
problern the following inequalities are used in the grid search procedure: 1 + 3s1 < 0. 045
y y
mm, 2 - 3s2 > 550 N and 3 + 3s3 < 1s. Few combinations of the parameter values are
found, which satisfy these inequalities, two of them with the smallest standard deviations
are given in Table 9.9. The second combination is chosen because it ensures higher
pressing-out force of the prototype part and lower standard deviation than the first one.
With respect to the other performance characteristics both combinations provide almost
the same results. The optimal parameters are embedding force p;apt = 178 N
(p1apt = -0. 7), embedding depth p;apt = 4.52 mm (Pzapt = 0. 4 ), the meta! pin must be
chosen as vibrating component, and form 1 ofthe pin provides best results.
448 CHAPTER9
The following models of the mean values and variances of the performance
characteristics were obtained for wi = w3 = 0 and W 2 = l:
~ 2
y3 =0.472790-0.048199pi +0.109973p2 +0.356400pi +0.126500pip2.
They are used to obtain the contour plots shown in Figure 9.4, Figure 9.5 and
Figure 9.6. The contour plot of yi + 3si is given in Figure 9.4a, while the plot of the
standard deviation si - in Figure 9.4b. The bold line in Figure 9.4a corresponds to
YI + 3si = 0.045 and divides the region of interest into two parts. The left one contains
the admissible values ofproduct parameters for which yi + 3si < 0.045.
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 449
0.5
-0.5
·'~----~--~~~----~--~~
.J -0.5 0 0.5
a)
PI
..
j_ j 1
I
I
I
I
I
I
I
-0.5 I
I
I
I
I
I
.J I
I I I
.J -0.5
b)
Figure 9.4. Contour plots for the deviation from parallelism (a) }\ + 3s1 (b) standard deviation s1
Figure 9. 5 and 9. 6 have similar meaning for the second and the third performance
characteristic.
450 CHAPTER9
a)
b)
Figure 9.5. Contour plots for the pressing-out force (a) Y2 + 3s2 (b) standard deviation s2
QUALITY IMPROVEMENT OF PRODUCTS DEPENDING ON BOTH 451
a)
I I I
0 05 o ob
o.\01 0 0~
0. h·
~11o
0.5
I I
IL &l
I
Icl.c &
I
1d.c 91I
I
P.P
I
I
-0.5 I
.1
I
I I
I
·I -0.5
b)
Figure 9. 6. Contour plots for the embedding time (a) y + 35
3 3 (b) standard deviation 532
Figure 9.4 shows that the variation of the deviation from parallelism does not
depend on the embedding depth p 2 . This can also be seen from the equation for s12 . The
value of this performance characteristic should be as small as possible. One can see from
452 CHAPTER9
y
Figure 9.4 that the minimal value of 1 in the region of interest corresponds to
p 1 = p 2 =1. The variance is minimal for p 1 =-1. However, for this value the embedding
time y is out of specifications.
3
The contour of the standard deviation s2 for the pressing out force is shown in
Figure 9.5b. lt corresponds to a ridge and along the ridge crest the standard deviation is
almost zero. Therefore, a minimal variation ofthe pressing out force can be obtained for
many different parameter values combinations. The optimal operating point is chosen at
the ridge crest and corresponds to small value of the embedding force p 1 . This parameter
y
can not be great because this violates the constraint with respect to 1 •
The surface of standard deviations for the embedding time (Figure 9. 6b) also has
a ridge. The variance is almost zeroing for values of the embedding force p 1 within the
interval -0.4 ~ p 1 ~ -0.15. The optimal operating point is chosen to satisfY the
constraints with respect to embedding time j/3 . It can not be chosenon the ridge crest of
the surface shown in Figure 9.6b because this will cause violation of the constraint for
the deviation from parallelism j/1 .
•
9.8. Bibliography
Regression analysis with dummy variables is given in many books, for example Draper
and Smith (1981), Chaterjee and Price (1977), Johnston (1972), Kleinbaumet al. (1988),
etc.
For construction of classical combinatorial designs see Fisher ( 1966), Cox
(1958), Hicks (1982). More theoretical texts on combinatorial designs are Raktoe et al.
(1981) and Street and Street (1987). Combinatorial designs, which are often used in
quality improvement problems, are the orthogonal arrays, see Taguchi and Konishi
(1987). Metbads for formation of multilevel orthogonal arrays, using two-level and
three-level orthogonal arrays are given by Logothetis and Wynn (1989). Wong and Wu
(1992) developed new orthogonal arrays.
Some problems ofthe application ofresponse surface designs for quantitative and
qualitative variables are discussed by Draper and John (1988). Atkinson and Donev
(1992), Donev ( 1988) and Donev ( 1989) considered methods for construction of nearly
D-optima1 designs with both qualitative and quantitative factors. Theoretical texts on this
problern are written by Kurotchka (1981), Wierich, W. (1986) and Lim et all (1988).
Many software packages include regression analysis with dummy variables, for
example MINITAB, DESIGN EXPERT and ECHIP (see Wheeler et al. (1993)). ECHIP
contains some Taguchi's orthogonal arrays as weil.
CHAPTER10
10.1. Introduction
Experimenters are not always aware of what the noise variables are, or can not organize
an experiment with them. However, they know that the observations are heteroscedastic,
i.e. their variance varies with the factor Ievels. In this situation once again we come
across the problern of variance minimization, while keeping the mean value on a target.
A model-based solution is readily obtainable on the basis of repeated observations. They
make possible the estimation of mean value and variance at each design poini as well as
the subsequent derivation of regression equations, which can give the solution of quality
improvement problern by optimization procedures similar to one of those, considered in
Chapters 6 and 7. This approach is discussed further in Section 10.2. Graphical tools for
studying individual location and dispersion effects of factors and their interactions are
presented in subsection 10.2.4. They use repeated observations, but with some
modifications are applicable for non-replicated experiments (Section 10.3) as well. In
Section 10.4 we discuss how the information about the individual effects can simplify the
optimization procedures using PERformance Measures Independent of Adjustment
(PERMIA). Multiple response optimization via constrained confidence regions is briefly
discussed in this section as well.
All methods considered so far are based on the assumption that the product
parameters and the external noise factors can be set to given Ievels without errors during
the experiments. If the data is collected from a production process this assumption may
not be correct. Methods for mean and variance model derivation for this case are
described in Section 10.5.
Taguchi (1986) considers a problern for quality improvement of a product (or a
system) depending on a factor which changes according to the intentions ofthe operator.
He named this factor signal jactor and the corresponding system dynamic system. The
problern is how to achieve a reliable signal factor effect, while minimizing the error
effects on the performance characteristic. An idea how the response surface
methodology can be employed for solving this problern is introduced in Section 10.6.
453
454 CHAPTERIO
Suppose, the experimenter is aware that the variance is non-homogeneaus over the
factor space, but the noise factors can not be identified nor an experiment to study them
can be conducted. The variance heterogeneity can be discovered through residual
analysis (see Section 2.3.8). lt can also be revealed ifrepeated observations at the design
points are available and dot-plots ofthe response values are obtained for allexperimental
configurations.
Suppose that an experiment is conducted on the basis of a response surface
design with N distinct points and n observations are taken at each point. The total
number of observations is Nn. The experiments can be conducted in two different ways:
• The design variables are set to their Ievels before each measurement of the
response. Errors in the settings ofthe factor Ievels, ifany, will affect the response.
• All n repeated observations are made before re-setting the product parameters
or process variables to new Ievels corresponding to a new design point. In this case the
errors · in product parameters are not taken into account and the variance non-
homogeneity is only due to some external noise factors, for example material
heterogeneity, environmental factors, etc.
The models considered in the next subsections can also be applied when the
experiments are conducted according to a Taguchi crossed array, consisting of N 1- point
parameter design and N 2 -point noise design. In this case substitute N = ~ and n = N 2 in
all formulae given in this section.
Using repeated observations one can compute the sample means and variances at each
design point as follows:
I
.Y. =- LY.,
n
(IO.I)
n J=I
and
2 1 ~'-·
s.=n-I~v.,-Yj),
- \2 (I0.2)
The values of .Y. and s;, u = 1, 2, ... , N can be considered as two responses at the
design points and ordinary least squares can be used to fit the two mode1s as follows:
•_Model ofthe mean value
(10.3)
(10.4)
where ()y;, ()m are estimates ofthe regression coefficients and fr./m are known functions
ofproduct parameters. Equations (10.3) and (10.4) are usually polynomials.
Often log-variances or Iog-standard deviations are used to fit a model instead of
(10.4), see Logothetis and Wynn (1989), Myers and Montgomery (1995). This choice is
based on the following idea. lf the observations are normally distributed, then their
variance CF 2 has a i - distribution (Figure 10.1). By cp(•) in Figure 10.1 we denote
probability density function. A log transformation shrinks the long tail of i -
distribution making the distribution of lnCF 2 approximately normal. It is known that the
least squares estimates are efficient for normally distributed observations. Hence,the Iog-
transformation improves the efficiency of the estimates. Bartlett and Kendall (1946) give
a proof of the approximate normality of log-variance distribution.
:e = L e.J.; .
k,
log (10.5)
i=l
456 CHAPTERIO
A problern with the approach given in subsection I0.2.2 is that the sample variance
computed by (I 0.2) is inefficient for small number of observations. Rao and
Subrahmaniam (I97I) recommend using (10.2) only if the number of observations at
each design point is n ~ 10 .
Some improvement of the efficiency of the variance estimates can be obtained
through the residuals of the performance characteristic's model. The idea of the
improvement comes from the fact that the residuals are computed on the basis of all
observations, while for the sample variance (10.2) only n observations are used.
Horn, Horn and Duncan (1975) and Carroll and Ruppert (1988) give surveys on
variance estimates based on residuals. The idea and some of these estimates are herewith
briefly discussed.
Suppose that a design with N points is conducted and n Observations are made at
each design point (n ~ 2). Using unweighted least squares one can compute the
following estimates:
(10.6)
where y = (y y
1 2 YN) is Nvector ofmean values computed by (10.1).
The residuals are
u = 1, 2, ... , N; i = I, 2, ... , n,
2 1~ 2
S1u =- ,t_. eui, (10.7)
n u~I
OTHER METHODS FOR MODEL- BASED QUALITY IMPROVEMENT 457
(10.8)
where
The notation (• t means that each element of the vector or the matrix (•) is
squared.
According to the MINQUE, developed by Rao (1971a, 1971b, and 1972) the
matrix T is non-singular for n ~ 2, provided that M is also a non-singular matrix. Some
of the elements of si may have negative values. That is why if an estimate is smaller than
a positive number 8, it should be replaced by the corresponding sample variance (10.2)
or by 8. As shown by Horn and Horn (1975), the probability of obtaining negative
elements of si decreases, as n increases.
Horn and Horn (1975) compare the variance estimates. All estimates computed
by (10.2), (10.7), (10.8) and (10.9) are consistent. Unbiased are the sample variance
(10.2) and MINQUE (10.9). The estimate (10.8) is unbiased only if the regression
coefficient estimates are computed by the weighted least squares with weights equal to
the true values ofvariances. The estimate (10.7) is biased.
The standard deviations of(10.8) and (10.9) are higher than for (10.7).
12
- I~
Yu =uLJYui
I~ I
and
The design of experiments, the average window size y and the sample variance s! are
shown in Tabie 10.1.
T ABLE 10.1. L, 8 array, average window size y and sample variance s! for window
fiormmg process.
No. PI P2 p3 P4 Ps p6 YI =y s2
SV y 2 =Ins!
Using the data of Tabie 10.1 and best subset regression tooi in MINITAB two
modeis are fitted: one for the mean vaiue (y1 = y) and one for the Iogged sampie
variance (y2 =Ins!):
OTHER METHODS FOR MODEL- BASED QUALITY IMPROVEMENT 459
and
The analysis ofvariance for these models is shown in Table 10.2. The computed
F-ratio is high and shows that the prediction ofthe responses is accurate enough.
..
TABLE102ANOVAfi or themodl
e s of averaEe s1ze andlogg<ed vanance
Model Multiple Degrees of Residual Residual F- ratio Critical
correlation freedom for the degrees of variance value for
coefficient R model vM freedom Vn s2R F(a=0.05)
The optimal parameter values are found using a grid search under the following
conditions:
and
The second constraint is based on the results of a preliminary grid search and is
chosen to eliminate most of the parameter combinations with great variance values. Five
ofthebest operating conditions are shown in Table 10.3.
The last column of Table 10.3 shows the values of the standard deviations. The
parameter values corresponding to the third row of Table 10.3 are chosen as a final
solution. Therefore, the optimal process parameters are distance between the wafer and
the hot plate - 2.45 mm, photoresist thickness - 11500 A, bake temperature- 106 °C,
aperture - 185 divisions, exposure time - 0.11 s, bake temperature after development -
110 °C. Under the assumption of normality of errors the window size is expected to vary
within the interval y1 ±3ssv = 3.247± 0.02,um.
Table 10.4 presents a comparison between product parameter combinations
chosen in 4 different ways: the initial operating conditions, operating conditions chosen
through Taguchi method (see Example 4.2), the best operating conditions in the design
(point No. 10 of the design) and operating conditions chosen through model based
approach.
Figure 10.2. Contours ofthe mean values j\ (p 2 , ps) for P 1 = -0.7, P 3 = -0.8, P 4 = 1, P6 = -1
Figure 10. 3. Contours of the standard deviations Ssv (P2 , Ps) for
p 1 =-0.7,p 3 =-0.8,p4 =1,p6 =-1
•
462 CHAPTER10
The half-normal plot is an appropriate tool for studying individual location and
dispersion effects.
Suppose that a two-level experiment is conducted with m product parameters and
the model ofthe mean (10.3) takes the form
m m-1 m
y=ho + "f.bipi + L "f.bifpipi, (10.10)
i=l i=l j=i+l
where -1 ~ pi ~ 1, i = 1, 2, ... , m .
A factor has location e.ffect if the average response changes considerably as the
factor Ievel changes from low to high. The location e.ffect of a factor pi is equal to 2hi,
while the location effect of an interaction pipi is 2hii.
Often a small part of the factors or interactions has significant effects. This
property is known as sparsity of e.ffects. Factors or interactions with !arge effects are
called active, the rest are inert.
A half-normal plot can graphically present the location effects. Roughly the inert
effects willlie on a straight line, while the active effects will deviate from the straight line
and occur in the corners of the plot. Note that the plot may Iook roughly linear and
without effects in the corners in two cases:
• if there are no active effects,
• if all effects are active.
Therefore, if all effects arrange as a straight line, a significance test on them is
recommended.
Dispersion effects can be studied in a similar way. Consider the following log-
variance model obtained through a full or fractional design with replications:
m m-1 m
logs 2 = c0 + "f.cipi +
i=l
L "f.cifpipi.
i=l J=i+l
(10.11)
The effects of factors and their interactions are 2ci and 2ciJ, correspondingly. A
half-normal plot ofthe dispersion effects will show which ofthem are active.
The location and dispersion effects can also be studied using dot-plots.
admixture copper antimonid, p~, [mg/1], temperature, p;, [ 0 C], alkalinity (pH) of the
solution, p~ and zinc powder concentration, p;, [g/1]. A half fraction of a two Ievel full
factorial design is carried out. The generating relation is chosen to be p 5 = p 1p 2 p 3p 4 .
Four repeated observations are made at each design point. Table 10.5 shows the
correspondence between the natural and the coded factor Ievels.
The location and the dispersion effects of the factors and their interactions can be
studied using the design of experiments, the observations, as weil as the mean value and
sample variances for each design point, given in Table 10.6.
The following models ofmean value (y1 = y) and logged variance (y2 =Ins!) are
obtained:
and
The location effects of the factors P; and the interactions P;PJ are equal to the
doubled values of the regression coefficients in the first model: 2b; and 2b;1 . The
dispersion effects are obtained in a similar way from the second model.
Half-normal plots of the location and the dispersion effects are shown in Figure
10.2 and Figure10.3, respectively. Factars p 1 and Ps and interactions P1P5 ,P2P3 ,P4 P5
have significant location effects, while only factors p 1 and p 5 have significant dispersion
effects.
~
0'1
~
TABLE 10.6. E ·- 1d
---------~------. -- -- d data for cobal - ---------- ------
No. Factors Repeated observations Mean Variance
PI P2 P3 P4 Ps YI =y s2 y 2 =Ins~
SV
Absolute po
1
effects
of Y1 10 -
5 -
0
Normal scores
Absolute35 -
effects 3.o
of y2 2.5
2.0
1.5
1.0
1
Normal scores
Using these plots and taking into account the signs of the coefficients in the
models we come to the following conclusions. Both residual concentration of cobalt (y1 )
and the logged variance (y2 = ln s!) can be decreased by choosing p 1 = -1, p 2 and p 3
with different signs and p 5 = 1. This is confirmed by point No. 11 of the design which
provides minimal mean value and variance of the residual concentration of cobalt as
compared to the other design points: y = 0.212,s! = 0.00123.
466 CHAPTER 10
Box and Meyer (1986a, 1986b) showed that location and dispersion effects could be
studied also from non-replicated experiments. Suppose that a two-level design is
conducted and one observation is made at each design point. A regression equation
y
similar to (10.10) can be fitted with replaced by y. Using this equation one can study
the location effects by a half-normal plot. However, since no repeated observations were
taken an equation for log-variance oftype (10.11) can not be obtained. Box and Meyer
(1986) showed that in this case the dispersion effects could be studied on the basis of a
ratio ofthe following type
F = ln(s 2
'
(i+)J
s2(i-) ,
(10.12)
which is computed for a given regressor f. Fora two Ievel design j is a factor P; or an
interaction p1p 1 •
In (1 0.12) s 2 (i+) is the sample variance ofthe residuals when the elements of i-th
column ofthe design are at Ievel +I and s 2 (i-) corresponds to factor level-1. Box and
Meyer note that the assumptions for the usual F-test arenot satisfied for F; and hence,
(10.12) can not be used for test of significance. A high positive value of F; shows that
s 2 (i+) is much !arger than s 2 (i-) and vice versa, while a !arge negative value indicates
that s 2 (i-) is much greater than s 2 (i+). In both cases the dispersion effect is !arge. The
dispersion effects F; can be graphically presented through half-normal or dot-plots.
The location effects influence the dispersion effects computed on the basis of
residuals because the residuals depend on the mean value. The values F; would be
appropriate measures for the dispersion effects only if all of the location effects,
including the overall mean, were known to be zero. As this usually is not the case, the
location effects should be removed by computing the residuals after eliminating all
suspected location effects.
An alternative approach to find the active factors is to use Bayes plots of
posterior probabilities. For details on computing posterior probabilities see Box and
Meyer (1986a), Box (1988) and Box and Meyer (1993).
Analysis of the individual location and dispersion effects is particularly useful
under the hypothesis for sparsity of effects. A screening experiment based on factorial or
Placket-Burman designs is usually carried out to identify the active factors. Box (1988)
notes that "fractional factorial designs and other orthogonal arrays, of course, provide us
with only a very elementary idea of the nature of the relationships 77(x) and u(x). The
"optimization" procedure used to exploit location and dispersion effects from such
designs is, therefore, necessarily cruden Further on Box (1988) notices that when more
OTHER 'METHODS FOR MODEL- BASED QUALITY IMPROVE'MENT 467
is known about the functional relationships (e.g. from response surface studies or from
mechanistic modeling) more precise conclusions can be drawn.
Half-normal plots are given for y 1 and y 2 in Figure 10.6 and 10.7, respectively.
They show that most of the location effects are insignificant. After removing the
insignificant terms the following models are obtained
refractory expenments
interactions Responses Residuals
P1P2P3 P1P2P4 P1P3P4 P2P3P4 YJ y2 res y1 resy2
12 13 14 15 16 17 18 19
-1 -1 -1 -1 25.7 11.15 1.225 -0.3600
1 1 1 -1 19.8 13.80 0.275 0.5650
1 1 -1 1 13.7 12.15 -4.675 -0.3550
-1 -1 1 1 12.2 13.92 -1.225 -0.3100
1 -1 1 1 14.7 14.38 -0.950 -0.1000
-1 1 -1 1 11.7 16.84 1.000 0.6350
-1 1 1 -1 8.4 16.29 -1.150 0.8150
1 -1 -1 -1 10.1 16.31 5.500 -0.8900
-1 1 1 1 43.7 1.36 1.375 -1.0775
1 -1 -1 1 38.6 5.88 1.225 -0.2875
1 -1 1 -1 37.3 5.07 1.075 1.6375
-1 1 -1 -1 32.0 7.35 0.725 0.1875
1 1 -1 -1 35.7 5.68 2.200 0.2725
-1 -1 1 -1 22.2 9.49 -6.350 0.3525
-1 -1 -1 1 28.3 5.57 0.900 -0.8325
1 1 1 1 21.3 9.88 -1.150 -0.2525
2.60 3.33 2.78 3.35
2.94 2.16 2.40 2.04 for y 1
0.24 -0.87 -0.29 -0.99
0.68 0.80 0.54 0.75
0.77 0.63 0.83 0.51 for y2
0.23 -0.49 0.87 -0.79
470 CHAPTER 10
20~--------------------------,
Absolute
effects
ofY1
10-
P,
0
0 0 0 0 °
o- ooooo
Normal scores
Figure 10. 6. Half normal plot of the eroshing strength after pressing, y1
Absolute 8
effects 7 -
of y2 s -
5 -
4 -
3 -
P,
0
2 -
1- 0 0
o- oooooooo o
1
Normal scores
Figure 10. 7. Half normal plot ofthe bulk porosity after pressing, y2
T ABLE 10 9 ANOVA table for models (10 13) and (10 14)
Residual Degrees of freedom Multiple F-ratio Critical value of
variance correlation F-distribution
coefficient
Model s2R Residual, vR Model, VM R F Fr (a=0.05)
(10.13) 10.07 ll 4 0.971 45.50 3.36
(10.14) 0.746 10 5 0.989 89.46 3.33
The multiple correlation coefficient is significant for both models. As the critical
values of F-distribution are much !arger than the calculated F-ratios one can think that
the models are satisfactory.
OTHER METHODS FOR MODEL- BASED QUALITY IMPROVEMENT 47I
The half normal plots and the models show that by the magnitude of the loeation
effeet on YI the faetors are arranged as follows: P4.P3 , p 2, pi. The loeation effeets on y 2
are arranged as follows: P4.P3,pi,pip4,p2.
Before plotting the dispersion effeets the influenee of faetors with signifieant
loeation effeets should be eliminated. Compute the residuals ei = YI - YI and e2 = Y2 - Y2
and the ratios
2
F =ln(s (i+)) =2ln(s{i+))·
' s 2 (i-) s(i -)
for eaeh response. The values of the residuals are shown in eolumns (I8) and (I9) of
Table I 0. 8 and the values of s(i + ~ s(i -) and F; are given in the last 6 rows of the same
table. Half-normal plots of F; for YI and y 2 are shown in Figure 10.8 and 10.9,
respeetively. Only the interaetions p 2p 4 and PIP3 have signifieant dispersion effeets on
the eroshing strength YI . This ean be seen also from the plots of residuals ei against p 2p 4
and pip3 whieh are given in Figure 10.IO and I0.1I, respeetively. The dispersion effeets
for the bulk porosity y 2 are rather small and ean be ignored.
If the target was to obtain only maximal eroshing strength yi and minimal bulk
porosity y 2 , the best ehoiee would be point No. 9 from the design (PI= p 2 = p 3 =-I,
p 4 =I and YI = 43. 7,y2 = I.36). Figure IO.IO and IO.II show that the dispersion effeets
of the interaetions on the eroshing strength YI are minimized if PIP3 =-I (the faetors
PI and p 3 have different signs) and p 2 p 4 =I (p2 and p 4 are with eoineiding signs).
Points No. 2,5,12,15 from the design satisfy these eonditions. However, as one ean see
at these points the value of the eroshing strength is mueh smaller and the value of bulk
porosity is mueh higher than for point No. 9. As a eompromise one ean ehoose points
No. IO or I I.
3~--------------------~~2~p'4o~
Dispersion
effects F.i
ofY 1 2-
1- 0 0
0 0 0 0 0
0
oo
oo
Normal scores
Dispersio'l 5 _
effects F.i
ofy
2 1.0-
0
0.5 - 0 0
0
0
0
0.0 -<-r------,-----------,--J
Normal scores
Figure 10. 9. Half normal plot of the dispersion effects, F; for bulk porosity y 2
Residuals
ofy1 5 -
0
0 -
0
8
0 @
-5- 0
0
-1
Residuals
ofy1 5 -
0
•
0
0-
§
ö 0
-S- 0
·1 0
~p3
10.4. More about the optimization procedures for robust product design
The analysis oflocation and dispersion effects by graphical methods can be supplemented
with more formal optimization algorithms on the basis of equations (10.3) and (10.5).
The optimality criteria, the dual response approach and the other optimization algorithms
of Chapters 6 and 7 can also be used in this case. Two remarks related to Sections 10.2
and 10. 3 are appropriate
The first remark concerns so-called PERformance Measures Independent of
Adjustment (PERMIA) introduced by Leon et al. (1987). As shown in Chapter 4
Taguchi's optimization procedure reduces the variance of the performance characteristic
by minimization of a performance measure and then adjusts the mean to a desirable
value. This procedure is effective when the vector of design parameters p can be
considered as consisting of two subvectors PI and p2 and the existence of a PERMIA is
assumed, influenced only by PI . The Ievels of parameters PI are chosen to minimize the
PERMIA and the mean is set to the desired value through a subsequent adjustment of
p 2 . Note that PERMIA may not exist.
The model-based optimization procedures do not need the concept of PERMIA
and that is one oftheir advantages. However, if a PERMIA exists, this can simplify them.
Suppose there are some parameters p2 which have considerable effects on the mean
values, but almost no effects on the variance. In this case the variance function can be
considered as PERMIA. The optimization procedure can be divided into two stages.
First the variance can be minimized with respect to PI. Second the mean is adjusted to
the target through p2 . As the number of parameters is reduced, the optimization
procedure at the second stage is simplified.
Studying location and dispersion effects through the graphical tools given in
Sections 10.2 and 10.3 can help to answer the question whether the variance can be
considered as PERMIA or not.
The second remark is about an opportunity to use models of type (10.3) and
(10.5) for multiple response optimization. Dei Castillo (1996) developed a procedure to
this end. It can be used if the region of interest is sphere with radius p which is presented
by the inequality pr p :-::; p 2 •
Suppose there are r performance characteristics of interest. Adequate quadratic
models of means and variances are known for each of them, so the total number of
models is 2r. Each mean or variance can be considered as a response and the constrained
stationary points of the response surfaces can be found by Lagrange multipliers (see
subsection 7.4.2) together with the corresponding optimal values of the 2r Lagrange
multipliers f.J; .
Dei Castillo (1996) defines a primary response (for example, yP) and optimizes it
subject to the following constraints:
(10.15)
474 CHAPTER IO
j = I,2, ..,2r
and
where mis the number of product parameters and sJ are the response variance estimates.
Inequalities (I 0 .I5) are introduced in subsection 3. I 0. 3 where definitions of the
vectors of 81 and the matrices Va1 are also given. These inequalities define a region in
the factor space which is an intersection of the 2r stationary points PsJ. Notice that this
region may not exist. The values of a 1 are chosen so that
I0.5.1. INTRODUCTION
The variance models in Chapters 5 to 9 are derived analytically, provided that a model of
the performance characteristic is known. This model is usually obtained by regression
analysis under the assumption that the observations come from an experiment without
errors in the factor Ievels. In this case the model is estimated by unweighted least
squares.
The assumption for Iack of errors in factor Ievels is realistic in many cases. lt
considerably simplifies data analysis, improves reliability of inference and makes usage of
experimental designs with relatively small number of runs possible. That is why methods
based on the assumption for Iack of errors in factor Ievels during the experiments are
recommended whenever such experiments are possible. Taguchi method is based on the
same assumption, because both parameter and noise designs are supposed to be
conducted with fixed Ievels of the factors.
In some cases the assumption for Iack of errors in the factors during the
experiment may not be correct. This can happen when the experiments can not be carried
out in a Iabaratory and the only way to get information is to make observations on a real
production process.
The mechanism of error transmission from factors to response has been
considered in Chapters 5 and 6. For a product with performance characteristic, whose
dependence on factors is non-linear and the factors are subject to errors, the observations
are heteroscedastic, i. e. their variances at the design points are not equal. In this case the
OTHER METHODS FOR MODEL- BASED QUALITY IMPROVEMENT 475
unweighted least squares method provides biased and inefficient estimates of the
regression coefficients.
Unbiased and efficient estimates can be obtained through weighted least square
method, provided that the variances of observations a 2 (y. ~ u = 1,2, ... ,N are known.
These estimates rninirnize the sum ofsquares ofthe weighted residuals:
(10.16)
(10.17)
Suppose that the experimental design has N points with n observations at each of them.
In this case the elements of :E can be estimated from the repeated Observations. They can
be computed as sample variances (10.2) of the repeated Observations or as variances
based on residuals (subsection 10.2.3).
Several authors study the properties of the weighted least squares estimates
(WLSE) based on repeated observations. Rao and Subrahrnaniam (1971) noted that the
sample variances (10.2) should be used only when n ~ 10. Vuchkov and Boyadjieva
(1987) showed that ifthe weights are estimated through sample variances (10.2) or are
based on residuals as in Section 10.2.3 then the WLSE are unbiased provided that the
response errors are symmetrically distributed. They are also consistent under some mild
conditions given by Vuchkov and Boyadjieva (1987). The use of sample variances (10.2)
476 CHAPTER 10
with small number of observations may give less efficient estimates than the unweighted
least squares. They are comparable in terms of efficiency with the estimates based on
residuals only for great number of repeated observations (n ~ 10). Simulations showed
that best efficiencies could be obtained by weights calculated through almost unbiased
estimates ofvariances (10.8).
If the observations are not replicated or if the number of replications is small estimates of
the regression coefficients can be obtained by considering the response variance as a
function of factors and using it for estimation of the elements of L: . This function can be
either analytically derived or obtained as an empirical model. Carroll and Ruppert (1988)
review various methods for variance function estimation. Most commonly used are the
generalized least squares and maximum likelihood methods. We will only show how
generalized least squares can be used in quality improvement problems without going
into details of this specialized field.
Assurne that the mean of the performance characteristic in mass production or
use can be modeled as follows:
k
Yu =E(yJ=lJu(B,pJ= LBifu =Brfu, (10.18)
l"'-1
(10.19)
case a generalized least squares algorithm (Carroll and Ruppert (1988), p.13) can be
adapted. We obtain the following procedure:
1. Start with an initial estimate 00 . Usually this is an estimate obtained by
unweighted least squares.
2. Substitute 00 into (5.22) or (7.14) to find an estimate of ~. Denote it by :t.
3. Compute WLSE as follows:
(10.20)
calls products of this type "dynamic systems". Grove and Davis (1992) give other
examples of such systems.
Taguchi uses cross product designs (see Section 4.4) with an outer array and
Iayout of signal and error factors for this type of problem. U sing ANOVA and graphical
representations of the effects he chooses the product parameters to minimize the signal
to noise ratio. Ifthe relationship between the performance characteristic y and the signal
factor s is linear:
y=ßs,
where
~:S.Y.
/3 = ..:.:•;::-:~,------
~>=
u;J
Note that Taguchi uses the term "dynamic characteristics" in a different way from
engineers. In engineering a dynamic characteristic is always a function of time or
frequency and describes the transient process of the system. Fixing the signal factor to
given Ievels during the experiment, Taguchi studies the steady state and not the transient
process.
We will try to find a response surface solution to the problern for quality
improvement of signal-dependent systems. We consider the signal s as non-random. The
system should be sensitive to it in order to provide proper reaction to the user' s intention
and it should be insensitive to noise factors.
Using the notations of the previous chapters we can write down a second order
polynomial model of the performance characteristic as a function of product parameters
P;, external noise factors n; and signal factor s as follows:
OTHER METHODS FOR MODEL- BASED QUALITY IMPROVEMENT 479
m mm q qq mq
m q m q
+ß s+ß
S SS
S2 + ""ß(ps)ps+
.L.... I
""a(ns)n
L....
I J
s+ """"l('(pns)pn.s+s
.L.... .L....
j I) l } •
(10.21)
F1 }=1 1=1 }=1
+a:ns+prs~n+s, (10.22)
where ßo ,ß, a, n, p,.4. 'E.t; have the same meaning as in Chapter 7, while ans ,ßps and ~
are defined as follows:
~ . m x q matnx
1s · . - 12
. -1 , 2, ... ,m,J-
' w1t' h elements K'if(pns) ,1- , , ... ,q.
(10.23)
where
a~ = a+a~ss,
tJs =tJ+~.
480 CHAPTER10
a;
Equation (10.23) can be obtained frorn (7.11) by substitutins ß0 , , ,/f". ,tJ • for
ßo, ar ,{!", tJ, correspondingly. One can see that (10.23) is a second order polynornial of
product pararneters and extemal noise factors with coefficients dependent on a non
randorn signal factor s. That is why equations (7.13) and (7.14) can be used to cornpute
the rnean value and variance after appropriate substitution of the coefficients. They can
be written as follows:
(10.24)
and
(10.25)
1
I=-
n
Lan
i=l
2
(p,si),
where si are equally spaced values of the signal factor, their nurnber in the interval of
variation being n.
The product pararneters can be chosen to rninirnize I under the condition that
y(p, s) is within a given interval when s changes.
This approach can be extended for rnore than one signal factor. All coefficients in
( 10. 23) should be considered as continuous functions (for exarnple, polynornials) of the
signal factors. They can be substituted in (10.23) and the coefficients of the obtained
rnodel can be estirnated by the least squares algorithrn. With these estirnates (10.24) and
(10.25) can be used in the sarne way as for the single factors.
OTHER METHODS FOR MODEL - BASED QUALITY IMPROVEMENT 481
10.7. Bibliography
More on variance function estimation can be found in Carroll and Ruppert (1988).
Design of experiments for estimating both mean and variance functions is proposed by
Vining and Schaub (1996). Methods for estimation oflocation and disperssion effects are
considered by Walfingerand Tobias (1998), Pau (1999), Bergman and Hynen (1997),
Ferrer and Romero (1995) Wiklander (1998), Rosenbaum (1999), Pau and Santer
(1998), Steinberg and Bursztyn (1998), Ghosh and Duh (1992). Engel (1992), Hamada
and Neider ( 1997), Engel and Hueie (1996), Lee and Neider ( 1998) consider the use of
generalized linear models in quality improvement. Taguchi (1986), Grove and Davis
(1992), Park (1996) give examples for parameter design for dynamic characteristics.
Robust design of products with dynamic characteristics is discussed by Miller and Wu
(1996), Lunani, Nair and Vasserman (1997), McCaskey and Tsui (1997).
482
BIBLIOGRAPHY
Draper, N. R. and W. Beggs (1970). Errors in the factor Ievels and experimental design.
Ann. Math. Statist., 41, 46-58.
Draper, N. R. and John J. A. (1988). Response-surface designs for quantitative and
qualitative variables. Technometrics, 30, 423-428.
Draper, N. R. and Herzberg A. M. (1968). Further second order rotatable designs. Ann.
Math. Statist., 39, 1995-2001.
Draper, N. R. and Smith, H. (1981). Applied regression analysis (2nd edn). New
York, Wiley.
Engel, J. (I 992). ModeHing variation in industrial experiments. Applied Statistics, 41,
579-593.
Engel, J. and Huele, A. F. (1996). Taguchi parameter design by second- order
response surfaces. Quality and Reliability Engineering International, 12, 95 -
100.
Engel, J. and Huele, A. F. (1996). Generalized linear models and process variation.
Proceedings ojthe I I International Workshop on Statistical Modelling.
Orvieto, Italy, Eds: Forcina, A., Marchetti, G. M., Hatzinger, R., Galmecci, G.,
137-143.
Engel, J. and Huele, A. F. (1996). A generalized linear modeling approach to robust
design. Technometrics, 38, 365-373.
Ermakov, S. M. and Zhigliavsky, A. A. ( 1987). Mathematical theory of optimum
experiments, Moscow, Nauka (Russian).
Farrel, R. H., Kiefer, J. and Walbran, A. (1967). Optimum multivariate designs.
Proceedings of the 5-th Berkeley Symposium on Probability and Statistics, 1,
113-132.
Fedorov, V. V. (I 972). Theory of optimal experiments. New York, Academic Press.
Ferrer, A.J. and Romero, R. (1993). Small samples estimation ofdisperssion effects
from unreplicated data. Communications in Statistics - Simulation and
Computation. 22, 975-995.
Ferrer, A.J. and Romero, R. (1995). A simple method to study disperssion effects from
non-necessary replicated data in industrial context. Quality Engineering, 7, 747-
755.
Fiacco, A. V. and McCormick, G. P. (1967). Nonlinear programming. New York,
Wiley
Fisher, R. A. ( 1966). The Design of experiments (8-th edn. ). Edinburgh, Oliver &
Boyd.
Fisher, R. A. and Yates, F. (1963). Statistical tab/es jor biological, agricultural and
medical research. Edinburgh, Oliver and Boyd.
Fowlkes W. Y. and Creveling C. M. (1995). Engineering methodsfor robust product
design using Taguchi methods in technology & product development.
Reading, MA, Adison-Wesley.
Freeny, A. F. and Nair, V. N. (1992). Robustparameterdesign with uncontrolled noise
variables. Statistica Sinica, 2, 313 - 314.
488
Galil, Z. and J. Kiefer (1980). Time and space saving methods, related to Mitchell's
DETMAX, for finding D-optimum designs. Technometrics, 22, 301-313.
Gantmacher, F. R. (1959). The theory ojmatrices, vols. I and II, New York, Chelsey.
Ghosh, S. (1990). Statistical design and analysis ofindustrial experiments. New York,
Dekker.
Ghosh, S. and Duh, Y.-J. (1992). Determination ofoptimum experimental conditions
using dispersion main effects and interactions of factors in replicated factorial
experiments. Journal oj Applied Statistics, 19, 367-378.
Gill, P. E., Murray, W. and Wright, M. A. (1981). Practical optimization. London,
Academic Press.
GienS., Peace, G. S. (1993). Taguchi methods: A hands on approach. Reading, MA,
Adison-Wesley.
Goupy, J. ( 1993). Methods jor experimental design: principles & applications jor
physicists & chemists. Elsevier Science, Amsterdam.
Grieko, J. M. (1993). Generalized linear models and process variation. Journal of
quality technology, 25, 288-295.
Grove, D. M. and Davis, T. P. (1992). Engineering, qualityand experimental design.
New York, Longman Scientific & Technical.
Hadjihassan, S., Walter, E. and Pronzato, L. (1996). Quality improvement via
optimization of tolerance intervals during the design stage. App/ications of
Interval Computations. Eds. Kearfott, R.B. and Kreinovich, V., Dordrecht,
Kluwer, 91-131.
Hadjihassan, S., Walter, E., Vuchkov, 1., Boyadjieva, L. and Pronzato, L. (1997).
Robust product design through experimental modelling, with application to
geotextile materials. Proceedings of IMACSIIFAC Second International
Symposium on Mathematical Modelling and Simulation in Agricultural and
Bioindustries. Ed. I. Farkas, Budapest, 81-86.
Hadjihassan, S., Pronzato, L., Walter E. and Vuchkov, I. (1997). Robustdesign for
quality improvement by ellipsoidal bounding. Mathematics of Stochastic
manujacturing Systems. Lectures in Applied Mathematics, v.33, Eds. G.G.Yin,
Quing Zhang, American Mathematical Society, Providence, Rhode Island, 127-
138.
Hahn, G. J. and Shapiro, S. S. (1967). Statistical models in engineering. New York,
Wiley.
Hamada, M., Neider, J.A. (1997). Generalized linear models for quality improvement
experiments. Journal ofQuality Technology, 29, 292-304.
Hartley, H. 0. (1959). Smallest composite designs for quadratic response surfaces.
Biometrics, 15, 611-624.
Hardin, R. H. and Sloane, N. J. A. (1993). A new approach to the construction of
optimal designs. J. Statist. Plan. lnference. 37, 339-369.
Harrington, E. C. (1965). The desirability function. Indus. Qual. Control., 21, 494-498.
489
Herzberg, A. M. (1964). Two third order rotatable designs in four dimensions. Ann.
Math. Statist., 35, 445-446.
Herzberg, A. M.(1966). Cyllindrically rotatable designs.Ann. Math. Statist.,37,242-247.
Herzberg, A. M. (1967). A method for the construction of second order rotatable
designs in k dimensions. Ann. Math. Statist., 38, 177-180.
Herzberg, A. M. and Cox, D. R. (1969). Recent work on the design ofexperiments: a
bibliography and review. Journal ojthe Royal Statistica/ Society, A, 132,29-67.
Hicks, C. R. (1982). Fundamental concepts oj design of experiments. (3rd edn). New
Y ork, Reinhart and Winston.
Himmelblau, D. M. (1972). Applied nonlinear programming. New York, McGraw
Hili.
Hake, A. T. (1974). Econornical second order designs, based on irregular fractions of
3n factorials. Technometrics, 17, 1974, 375-384.
Horn, S. D. and Horn, R. A. (1975). Camparisan ofestimators ofheteroscedastic
variances in linear models. J. Amer. Statist. Assoc., 70 ( 352 ), 872- 879.
Horn, S. D., Horn, R. A. and Duncan, D. B. (1975). Estimating heteroscedastic
variances in linear models. J. Amer. Statist. Assoc., 70 ( 350 ), 380-385.
Hou, X.(l998). Parameterdesign with monotone lass function. Statistica Sinica, 8,
87-100.
Huele, A. F. ( 1998). Statistical robust design. PhD Thesis, Amsterdam.
Hurley, P. (1994). Interactions: ignore them at your own risk (how Taguchi's
confirmation run strategy can Iead to trouble). Quality Engineering, 6, 451-
457.
Jeang, A. ( 1995). Economic tolerance design for quality. Quality and Reliability
Engineering International, 11, 113-121.
Jeang, A. (1999). Optimal tolerance design by response surface methodology.
International Journal oj Production Research, 37, 3275-3288.
Jebb, A. and Wynn, H. P. (1989). Robustengineeringdesign post Taguchi. Phi/. Trans.
R Soc. London, A327, 605-616.
John, P. W. M. (1971). Statistical design and analysis oj experiments. New York,
Macmillan.
Johnston, J. ( 1972). Econometric methods, 2-nd edition, New York, McGraw Hili.
Kackar, R. N. (1985). Off-line quality control, parameter design and the Taguchi
method. Journal ofQuality Technology, 17, No. 4, 176-209.
Kalagnanam, J. R. and Diwekar, U. M. (1997). An efficient sampling technique for
off-line quality control. Technometrics, 39, 308-319.
Kendall, M. G. and Stuart, A. (1973). The advanced theory ojstatistics.,vo/.2.:
Injerence and relationship. London, Griffin.
Khatree, R. (1996). Robustparameter design: a response surface approach. Journal of
Qua/ity Technology, 28, 187 - 198.
Khuri, A. I. (1996). Response surface models with mixed effects. Journal ojQuality
Technology, 28, 177 - 186.
490
Nazaret, W., and Liu, L. (1990). Computer aided design for quality (CADQ). AT&T
Technical Journal. 69, 46-60.
Neider, J. A. and Lee, Y. (1991). Generalized linear models for the analysis ofTaguchi-
type experiments. Applied Stochastic Models and Data Analysis, 7, 107-120.
Novik, F. S. (1972}. Mathematical methods jor design of experiments in Meta/
Science (part !), Moscow Institute ofSteel and Alloys, Moscow (Russian).
O'Donnel, E.M. and Vining G.G. (1997). Mean squared error ofprediction approach
to the analysis ofa combined array. Journal ofthe Applied Statistics, 24, 733-
746.
Park, S. H. (1996). Robust desiin and analysisjor quality engineering. London,
Chapman and Hall.
Pau, G. ( 1999). The impact of unidentified Iocation effects on disperssion effects
identification from unreplicated factorial designs. Technometrics, 41, 313-326.
Pau, G. and Santer, T.J. (1998). Selection and screening procedures to determine
optimal product design. Journal of Statistical Planning and Injerence, 67,
311-330.
Pazman, A. (1986). Foundations of optimumexperimental designs., Dordrecht, Reidel.
Pesotchinsky, L. (1972). D-optimal and nearly D-optimal designs for quadratic
regression on cube and sphere. Preprint No. 42, LSM, Moscow State University
(Russian).
Peterson, J. J. (1993). A general approach to ridge analysis with confidence intervals.
Technometrics, 35, 204-214.
Phadke, M. S. (1989}. Quality engineering using robust design. Englewood Cliffs, NJ,
Prentice Hall.
Phadke, M. S., Kackar, R. N., Speeny, D. V. and Grieco M. J. (1983). Off-line quality
control in integrated circuit fabrication using experimental design. The Bell
System Technical Journal, 62, No. 5, 1273-1309.
Pignatiello, J. J. and Ramberg, J. S. (1985). Discussion of: " Off-line quality control,
parameter design and the Taguchi method. Journal of Quality Technology, 17,
198-206.
Plackett, R. L. (1960). Principles of regression analysis. Oxford, Oxford University
Press.
Plackett, R. L. and Burman, J. P. (1946). The design of optimum multifactor
experiments. Biometrika, 33, 305-325 and 328-332.
Pledger, M. (1996). Observable uncontrollable factors in parameter design. Journal of
Quality Technology, 28, 153- 162.
Pukelsheim, F. (1995). Optimumdesign oj experiments. New York, Chapman & Hall.
Raktoe, B. L., Hedayat, A. and Federer, W. T. (1981). factorial designs. New York,
Wiley.
Rao, C. R. (1971a}. Estimation of variance and covariance components - MINQUE
theory. J. Multiv. Analysis.!, 257
494
SUBJECT INDEX
Residual45, 73,90,119,456
analysis 73
deleted 77
jack-knife 77
plot 74
standardized 76
studentized 76
variance 59
Resolution 126
Response 41
Response surface 10, 41
Response surface methodology 9, 11
Ridge analysis 435
Robustness 292
Robust product 203
Screening experiments 466
Sequential generation ofdesigns 153,185
Signal-to-noise ratio 202,291
Sparsity of effects 462
Stationary point 11,175,301,434
Steepest ascent 135
Sum of squares 17
pooled 204
residual 17, 22, 45, 57, 82
total 16, 57
Taguchi method 9,190
Taylor series expansion 42, 391
Test for Iack of fit 56, 60, 88, 118
Transformation 80
Box-Cox 81
logarithmic 81
non-linear 81
power 81
square root 81
Transmission of errors 23 7,292
Variation
between groups 17
within groups 17
pure 205
residual 16
Youden square 38
503
AUTHOR INDEX