Professional Documents
Culture Documents
Pág31 Sensory Evaluation Manual
Pág31 Sensory Evaluation Manual
SENSORY EVALUATION
MANUAL
Sensory Evaluation
ACKNOWLEDGEMENTS
These notes form the basis of a practical workshop presented for personnel at Naresuan
University, Phitsanulok, Thailand in July, 2002. We would like to thank Michael OMahony
for his permission to include copies of the statistical tables from his book Sensory
Evaluation of Food: Statistical Methods and Procedures and material supplied by the Centre
for Food Technology, DPI, Brisbane.
COPYRIGHT
Sensory Evaluation
TABLE OF CONTENTS
ACKNOWLEDGEMENTS ....................................................................................................2
PROGRAM ............................................................ ERROR! BOOKMARK NOT DEFINED.
INTRODUCTION....................................................................................................................5
THE HUMAN SENSES IN SENSORY EVALUATION .....................................................7
THE SENSES - AN INTRODUCTION .................................................................................7
SENSE OF SIGHT- .................................................................................................................9
THE SENSE OF SMELL......................................................................................................13
THE SENSE OF TASTE.......................................................................................................15
THE SENSE OF HEARING.................................................................................................19
THE SENSE OF TOUCH .....................................................................................................20
SENSORY INTERACTION .................................................................................................21
OPERATIONAL PRINCIPLES OF SENSORY TESTING .............................................23
DESIGN OF A SENSORY TESTING AREA.....................................................................31
STATISTICAL PRINCIPLES .............................................................................................34
SENSORY EVALUATION METHODS .............................................................................38
AFFECTIVE TESTS .............................................................................................................38
SPECIFIC TEST METHODS ..............................................................................................39
PAIRED PREFERENCE TEST.......................................................................................39
RANKING FOR PREFERENCE.....................................................................................41
RATING FOR PREFERENCE ........................................................................................44
SENSORY EVALUATION IN CONSUMER TESTING ..................................................46
ANALYTICAL SENSORY TESTS: ....................................................................................53
DIFFERENCE TESTING.....................................................................................................53
SIMPLE DIFFERENCE TEST ............................................................................................53
TRIANGLE TEST .............................................................................................................53
DUO-TRIO TEST..............................................................................................................55
TWO-OUT-OF-FIVE TEST.............................................................................................59
COPYRIGHT
R L Mason and S M Nottingham
3
Sensory Evaluation
A NOT A TEST.......................................................................................................59
DIFFERENCE-FROM-CONTROL TEST (DFC) .........................................................59
DIRECTIONAL DIFFERENCE TESTS ............................................................................65
PAIRED COMPARISON TEST ......................................................................................65
RANKING TEST ...............................................................................................................67
RATING TEST ..................................................................................................................69
STATISTICS FOR SENSORY: DIFFERENCE TESTING .............................................73
DESCRIPTIVE TESTING ...................................................................................................76
STATISTICS FOR SENSORY: DESCRIPTIVE TESTING ............................................81
SELECTION, TRAINING AND MOTIVATION OF A PANEL .....................................86
REPORTING .........................................................................................................................91
SELECTED BIBLIOGRAPHY............................................................................................92
JOURNALS ............................................................................................................................94
STATISTICAL TABLES......................................................................................................95
COPYRIGHT
Sensory Evaluation
INTRODUCTION
Sensory evaluation - A scientific discipline used to evoke, measure, analyse and interpret
reactions to those characteristics of foods and materials as they are perceived by the senses of
sight, smell, taste, touch and hearing.
Sensory evaluation was one of the earliest methods of quality control and it is still widely
used in industry. However, the level of application depends on the situation (e.g. beer and
wine tasting to operators sampling of products from production line).
Four variables affect sensory evaluation:
The Food
The People
The Testing Environment
Methods
Sensory evaluation
Sensory Analysis
Organoleptic Analysis
Taste Testing
Psychophysics
Subjective Evaluation
Advantages
Disadvantages
Time consuming
Expensive to run
Method selection
Analysis
Interpretation
Product development
Product matching
Product improvement
COPYRIGHT
Sensory Evaluation
Process change
Cost reduction
New raw materials selection
Quality control
Storage stability
Product grading / rating
Consumer acceptance
Consumer preference
Panel selection / training
Correlation subjective / objective
Sensory Standards
Aus Standard
Year
Title
AS 2542.0
1995
AS 2542.1.1
1984
AS 2542.1.2
1984
Sensory analysis of foods - General guide to methodology Types and choice of test
AS 2542.1.3
1995
AS 2542.2.1
1982
AS 2542.2.2
1983
AS 2542.2.3
1988
AS 2542.2.4
1988
AS 2542.2.5
1991
AS 2542.2.6
1995
AS 2542.3
1989
COPYRIGHT
Sensory Evaluation
THE HUMAN SENSES IN SENSORY EVALUATION
THE SENSES - AN INTRODUCTION
The sensory properties of foods are related to three major attributes:
These attributes are expressed as a continuum and not as finite properties. It is impossible to
rate each one individually unless special precautions are taken, e.g. blindfolds, nose clips,
coloured lights, purees.
COPYRIGHT
Sensory Evaluation
Humans possess about 30 different senses. However, the sensory properties of foods are
perceived through the senses of:
Sight;
Smell;
Taste;
Touch; and
Hearing.
Stimuli
A stimulus is any chemical or physical activator that causes a response in a receptor, e.g. eye
is receptor for light, ear is receptor for sound.
An effective stimulus produces a sensation, the dimensions of which are:
Intensity/strength;
Extent/separation;
Duration/retention; and
Hedonics/like-dislike.
Receptors
Receptors are the stimuli detecting cells of the sense organ, e.g. taste buds on tongue, light
receptors in retina of eye.
Perception
Perception is the psychological interpretation of sensations determined by comparison with
past experiences, e.g. the sour taste of lemons is the perception of the sensation received by
the receptors (taste buds) from a chemical stimulus (citric acid).
COPYRIGHT
Sensory Evaluation
SENSE OF SIGHTThe appearance of food
Stimuli = visible light
Receptor= retina of the eye
Perception=sight, vision, appearance
The appearance of foods is a major factor governing its acceptability and can be subdivided
into three main categories:
Optical properties- colour, gloss and translucency
Physical form-shape and size
Mode of presentation-lighting packaging etc
Optical properties
Vision
Vision is a complex phenomena consisting of several basic components. A stimulus, light,
from an external source interacts with the object and is brought to focus on the retina of the
eye. The retina is the receptor of vision and contains two types of cells. The rods are
responsible for vision in dim light and the cones are responsible for colour vision. Light
incident on these cells causes a photochemical reaction that generates an electrical impulse
which is transmitted to the brain via the optic nerve. Colour blindness is caused by loss or
lack of colour receptor cells in the cones. Approximately 8% of the population have some
defect with relation to colour; mostly males.
COPYRIGHT
Sensory Evaluation
Light
Visible light is that part of the electromagnetic spectrum which radiates between wavelengths
of 380 - 770 nm.
Different wavelengths produce different colours
380
450
500
575
590
- 450 nm
- 475 nm
- 575 nm
590 nm
770 nm
=violet
=blue
=green
=yellow
= red
[NOTE: All electromagnetic radiations are physically the same. However, the optical system
of the eye is such that only the visible range of wavelengths is absorbed by the lens.]
Light sources
Incandescent lights consist of a tungsten filament which is heated in an inert gas. The higher
the temperature, the more light produced. Light from this source tends to be harsh and tends
to highlight the red end of the spectrum.
Fluorescent lights operate by electrical excitation of atoms that produces spectral lines at
specific wavelengths which then impinge onto fluorescent materials which convert the
incident light into light at a longer wavelength. Light produced is softer but can produce
colour distortion at particular wavelengths.
Natural light is too variable for use in evaluating appearance of foods.
Light - Object interactions
Light incident on an object may be:
Absorbed;
Reflected;
Transmitted; and
Refracted.
The relationship between and within each of these components is responsible for the colour
and gloss characteristics of the food. The main light/object interactions produced are:
Lightness/value;
Colour/hue;
Chroma/purity; and
Gloss.
Physical form
The second class of product appearance is physical form that can be subdivided into three
parts:
COPYRIGHT
10
Sensory Evaluation
Shape;
Surface texture; and
Visual consistency.
Shape and size are important from a food technologist's point of view because these can be
altered during the manufacture of processed products. Some examples include:
Setting of a jelly
Syrups of different concentrations
Pastes and purees
Mode of presentation
This aspect should be considered from a marketing point of view and is important because it
influences sales. Mode of presentation is applicable on the supermarket shelf (at retail level)
and also in terms of presentation at the table (home and restaurant).
Factors to be considered are:
COPYRIGHT
11
Sensory Evaluation
Summary
Appearance is an important aspect of food quality as it is the first subjective evaluation made
of food quality. The product has to pass the visual assessment before the consumer can or
will consider the other parameters such as taste and texture.
Factors that should be considered in evaluating product appearance include:
COPYRIGHT
12
Sensory Evaluation
THE SENSE OF SMELL
(Odour/olfaction)
Stimuli = volatile chemicals
Receptors= olfactory cells in the nose
Perception=smell, odour, aroma, flavour
Smell is one of our most primate senses.
influenced by smell than other senses.
The human nose is capable of detecting thousands of different odour substances. However,
our sensitivity is much less than other animals. (Animals use smell - food, mating, territory
etc).
Smell is detected both before and during eating. Smell is an important aspect of flavour.
There are 20x106 olfactory receptors, but only about 1000 taste receptors.
Odour description requires the development of an odour/flavour memory, e.g. fishy, flowery,
woody. This is the basis of flavour/odour memory development by wine judges and
milk/cheese graders. Individuals vary a great deal in their sensitivity to different
odours/aromas.
Anatomy of olfactory system
From the diagram it can be seen that most of air misses the olfactory area. Only 5-10% of
inspired air passes over olfactory receptors. However, this amount can be increased by
sniffing harder; obviously the more air which passes over the receptors the better the
COPYRIGHT
13
Sensory Evaluation
response.
The large number of olfactory receptors (20x106) enable detection of :
Substance needs to be volatile enough to get into air in the sensory region.
Substance needs to be partially soluble in mucus covering of receptors.
Minimum number of odorous molecules need to be present.
Need to be in contact with receptors for minimum time.
Olfactory intensity
Human nose is about 10-100 times more sensitive to odours than any physico-chemical
analysis (e.g. gas chromatography). It has been demonstrated that human nose is capable of
detecting ethyl mercaptan at a concentration of 0.01 mg/230m3 of air, which is equivalent to
about 8 molecules/receptor.
Olfactory threshold
Detection threshold is the concentration where smell is detected.
Recognition threshold is the concentration where the smell is recognised.
Olfactory interactions
Nature of the response may change with concentration (e.g. perfumes at low concentration
are pleasant but at strong concentration may be unpleasant).
Interaction of odours:
Additive - increase intensity;
Suppressive - decrease intensity; and
Blending - when new odour unrelated to originals.
Olfactory adaptation
Initial sensation maybe strong - but weakens and makes identification difficult; this is due to
adaptation of olfactory receptors.
In testing we therefore need to allow for this by:
Taking first impression of odour and/or
Waiting between tests to allow receptors to recover.
COPYRIGHT
14
Sensory Evaluation
Summary
Foods contain numerous compounds of varying volatility that can make analytical
interpretation difficult (e.g. strong peaks may produce weak odour whereas weak
peaks may produce a strong odour).
Taste;
Smell;
Touch; and
Temperature.
Strictly speaking taste involves only those sensations mediated by the Gustatory Nerve Fibres
and these sensations have five (5) basic qualities:
Salt;
Sweet;
Sour; and
Bitter.
Umami
Taste stimuli
Taste response requires an aqueous solution of the substance (stimulus) to contact the taste
buds. Therefore, saliva secretions are important in terms of ensuring contact between the
product and the taste buds. Saliva production is generally stimulated by chewing, as well as
the appearance and odour of the food. The tongue is important as it brings the food into
contact with the taste buds and also provides a mixing action which enables an even
distribution of food about the taste buds as well as preventing the development of
concentration gradients.
COPYRIGHT
15
Sensory Evaluation
Taste receptors
The receptors for taste are the taste buds and these are mounted on papillae (folds in the skin
of the tongue). The area of greatest response is the top of the tongue. Other areas in the
mouth and throat where taste buds are situated include: palate, pharynx, larynx, tonsils,
epiglottis, lips, cheeks, underside of tongue and floor of mouth.
Taste buds are mainly located at the tip, sides and rear of tongue. There is very little response
in the centre of the tongue. Different areas of the tongue are most responsive to different
sensations.
Tip
Sides Sides Rear -
sweet
salty
sour
bitter
Taste cells constantly degenerate and regenerate. Their life cycle is 10 days and they are
easily destroyed by heat.
The tongue itself is important as it brings the food into contact with the taste buds and also
provides a mixing action which enables an even distribution of food about the taste buds as
well as preventing the development of concentration gradients.
COPYRIGHT
16
Sensory Evaluation
The five basic tastes
A basic taste is one for which specific taste buds have been identified as being
physiologically responsible for the particular taste sensation.
Sourness
This is the simplest taste as only acids (H+) produce sourness and as the (H+) increases the
sourness increases
However there are some anomalies to this:
Sweetness
The common substances that produce the sweet taste are the sugars and other hydroxy
compounds such as alcohols and glycols. Other substances such as lead salts, amino acids,
proteins, non-nutritive sweeteners (cyclamates, saccharin and aspartame ) also taste sweet.
Saltiness
Many crystalline water-soluble salts yield a salty taste, but only sodium chloride gives a pure
salty taste. Other substances taste salty but also bitter, alkaline, sweet and salt in various
combinations.
Bitterness
Many chemically different compounds have a bitter taste. However, bitterness is mainly
associated with alkaloids such as caffeine, quinine, strychnine and nicotine. Originally it was
thought that bitterness was an indication of danger (poison). However, many alkaloids are
used as drugs (e.g. codeine) and many other bitter substances are harmless (glycosides, esters
and aldehydes and tannins in wines and tea).
Bitterness is generally perceived at very low concentration and a relationship appears to exist
between sweet and bitter as many sweet substances produce a bitter aftertaste (saccharin).
Bitterness is the taste which most people have difficulty in detecting and response level varies
greatly from individual to individual.
COPYRIGHT
17
Sensory Evaluation
Umami
Umami is the taste that has been shown to be associated with substances that contain
glutamate. The most notable example is mono-sodium glutamate (MSG). MSG is well
known as a flavour enhancer and can cause adverse reactions in some sensitive individuals.
However, there are many other compounds which contain glutamate and which are capable of
producing the savoury, spicy, brothy taste associated with MSG. Many foods contain
naturally high levels of glutamate.
Taste interactions
Having described the 5 basic tastes it is obvious that foods are a very complex system which
contain many different taste compounds and therefore many different tastes. The fact that
there are only 5 basic tastes and yet we are able to detect hundreds of different taste
sensations is due to a series of complex taste interactions that can range from simple 2 way
interactions to complex 5 way interactions
Interactions between the 4 basic tastes were previously described simplistically by the taste
tetrahedron.
Adaptation and fatigue
During exposure to a stimulus, sensitivity decreases due to adaptation and fatigue. This loss
in sensitivity varies considerably with the taste (sweet, sour, salty or bitter) and also with the
compound. For example, tasting a series of acids causes the sensitivity to be reduced by the
preceding acids. However, recovery is usually rapid because most common organic acids are
very soluble.
Taste thresholds and sensitivity
There is great variability between individuals in their levels of sensitivity. Sensitivity is
affected by:
Temperature;
Sleep;
Hunger;
Age; and
Sex.
Absolute/Detection threshold - Concentration of stimulus at which a subject can detect a
difference between two samples in a paired test.
Recognition threshold - Concentration at which the specific taste can be identified.
Recognition threshold is generally higher than detection threshold.
Both absolute threshold and recognition threshold will vary between individuals. Most
people can detect taste within 0.2 - 0.6 seconds and therefore if there is no response within
this time the level is sub-threshold. However, recognition times vary between the basic tastes
COPYRIGHT
18
Sensory Evaluation
Salt = 0.3s
Sweet = 0.4s
Sour = 0.5s
Bitter = 1.0s
Vision = 0.02s
Hearing = 0.01s
Touch = 0.005s
Reaction times also relate to retention times for example; bitterness has the longest reaction
time (1.0s) and the sensation lingers considerably after tasting.
Summary
Five types of taste receptors - salt, sweet, sour, bitter and umami.
COPYRIGHT
19
Sensory Evaluation
Negative aspects:
noisy environment may distract tasters or mask product sounds.
THE SENSE OF TOUCH
(Texture, Kinesthetics)
Stimuli = physical contact between the food and body tissue
Receptors= muscles and nerves in mouth and fingers
Perception=touch, feel, texture, viscosity
Texture usually relates to solid food while viscosity relates to homogeneous liquid foods and
consistency relates to non-homogeneous liquid foods.
Instrumental methods only measure one aspect of "texture" and again cannot relate the
complex interactions which produce the perception of food texture.
Finger feel
Firmness/Softness indicates the eating quality of some food products:
Juiciness can be used as a subjective quality index (eg the thumbnail test for corn).
Mouth feel
Liquids
Solids
Classification of textural characteristics - assessed mainly by chewing.
Textural Terminology
Mechanical Characteristics
Hardness
Brittleness
Chewiness
Grittiness
COPYRIGHT
Sensory Evaluation
Fibrousness
Moistness
Oiliness/Greasiness
SENSORY INTERACTION
As has been indicated previously when eating or tasting food there is a continuous
relationship between the senses and unless steps are taken to separate the individual senses or
stimuli, interactions may occur. It is not known whether interactions occur at the receptor
site or the brain. However, the second option would appear to be more likely.
Interaction between senses
This is the ability of a response from one modality to influence or affect the response from
another. There are two aspects of this:
Positive - interactions giving clues to possible identity, e.g. pink milkshake being strawberry
flavoured.
Negative - If clues are not correct this may lead to confusion and a wrong judgement, e.g.
pink milkshake with pineapple flavour.
Types of sensory interactions
Taste - odour
Receptors for these two senses are very close so that interactions between these senses are
highly likely and these may be important in classifying a particular taste.
Taste - tactile
The taste threshold for sugar, salt, caffeine have all been shown to be lower in water than in
tomato sauce. This may be due to the fact that in more viscous solutions the chemicals do not
react with the receptors as easily as in pure solutions.
Taste - sight
This is a very important aspect because vision is the first sense affected and appearance of a
product will have a major influence on absolute quality. Bright colours indicate strong
flavours whereas dull colours indicate mild flavours.
Other interactions include:
Odour - Sight
Odour - Tactile
Taste Hearing
COPYRIGHT
21
Sensory Evaluation
Odour - Hearing
Multiple interactions
Multiple interactions between more than two modalities are also possible.
Example: Tasting food pureed, blindfolded and with nose clips gives a different response
than when interactions are allowed.
Interactions between stimuli
These interactions are more difficult to define and measure but are just as important as
interactions between the senses. Some examples include:
no effect; original flavours are distinct and separate, e.g. fruit in cheese;
COPYRIGHT
22
Sensory Evaluation
OPERATIONAL PRINCIPLES OF SENSORY TESTING
When evaluating properties of foods using people as measuring instruments it is important to
control the methods and conditions of testing as rigidly as possibly. This helps to eliminate
the numerous errors or biases that can be caused by psychological and physiological factors.
The mental attitude and physical condition of a taster, and the atmosphere of the testing
environment all influence their judgements. There are therefore a number of basic rules
which should always be applied, as stringently as circumstances allow, when running taste
panels. These relate to:
Selection of panellists;
Preparing the testing environment;
Designing the experiment;
Preparing samples;
Serving samples.
23
Sensory Evaluation
Serve tasters promptly and make sure they have everything they need.
Run a taste panel as you would expect a good restaurant to be run, i.e. give courteous
friendly service, be efficient, and serve good food.
Keep accurate records of any cooking or preparation methods used. Record temperatures and
size of samples served and any special conditions (e.g. coloured lighting).
It is important that panellists do not see the samples being prepared as this may indicate
quality difference.
Sample preparation should be uniform:
Temperature
Cooking
Thawing
Size and shape (provided this is not a variable)
Avoid bias
Overcome any nonuniformity
30g solids
30mL liquids
Flavour loss
Discoloration
Textural changes
COPYRIGHT
24
Sensory Evaluation
Containers for presentation
Containers for presentation and tasting should be:
Clean
Identical for all samples and sessions
Disposable containers or reusable
Coloured to mask product appearance (if required)
Relevant to product
Serving temperature
Expensive
Time consuming
Variable quality
Difficult to control product/carriers ratio uniformity.
For example: developing a cake icing individually may not allow for interaction with flavour
or it may be incompatible with the cake (affects texture or falls off).
Number of samples
Samples / Sessions
The number of samples presented at any testing session will depend on:
COPYRIGHT
25
Sensory Evaluation
Monday and Friday are recognised as being bad days for tasting
Normally taste 1 hour before meals and 1 - 2 hours after
Sometimes this becomes difficult in practice due to:
Unavailability of tasters
Number of sessions
Illness
Sensitivity of people suffering from illness is reduced -particularly those with colds or flu
(physical and psychological)
Likes / Dislikes
In preference testing a series of treatments within a specific product type, it is legitimate to
eliminate people who dislike the product (or those who are not discriminatory).
Palate Clearing
COPYRIGHT
26
Sensory Evaluation
It is a good idea to get panellists to cleanse their palate:
Sample Coding
Remove possible bias or influence from samples codes. Do not use.
27
Sensory Evaluation
Devise your own system for remembering orders, e.g. 3 digit numbers - put in sequence of
one of digits. Keep it a secret!
Always work systematically in coding, labelling, setting up, e.g. as in reading a page
(1)
(2)
Left to Right
Top to Bottom
2 samples A, B.
3 samples
-
4 samples
4 samples
COPYRIGHT
28
Sensory Evaluation
Four sample balanced orders
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
A
A
A
A
A
A
B
B
B
B
B
B
C
C
C
C
C
C
D
D
D
D
D
D
B
B
C
C
D
D
A
A
C
C
D
D
A
A
B
B
D
D
A
A
B
B
C
C
C
D
B
D
B
C
C
D
A
D
A
C
B
D
A
D
A
B
B
C
A
C
A
B
D
C
D
B
C
B
D
C
D
A
C
A
D
B
D
A
B
A
C
B
C
A
B
A
Expectation Error
Any information a panellist receives before a test will influence the results. This is called
expectation error. To overcome this:
Sensory Evaluation
situation ie: consumers do not taste every aspect separately.
Suggestion
Influence of other panellist may bias or influence results. This can be prevented by:
Using booths
Not allowing talking in tasting area
Reducing outside distractions
Questionnaire design
Questionnaire design should be simple and easy to follow in terms of design and language
and make sure tasters know how to use it. You may need to include some instructions on the
scoresheet itself, but it is usually better to give instructions verbally to your panel first. The
questionnaire should generally not be more than one page and include:
Name
Date
Time
Product
Sample codes
Instructions
Comments section
COPYRIGHT
30
Sensory Evaluation
DESIGN OF A SENSORY TESTING AREA
The main considerations to keep in mind when preparing an area for sensory testing concern
the requirements for an atmosphere conducive to concentration, where conditions can be
controlled. Sensory panellists need somewhere comfortable and free from distractions if they
are to be able to "tune in" to the sensations triggered by the stimuli in the food products they
are tasting. Product characteristics can be markedly affected by temperature and humidity,
and appearance is affected by lighting intensity.
The conditions should be controlled in order to :
Reduce bias
Improve accuracy
Improve sensitivity
(compare to the conditions used in an analytical laboratory)
International standard (ISO 8589-1988)
The standard looks at the design of the testing area for both new and existing buildings. It
also specifies which recommendations are considered essential and which are only desirable.
Important points summarised from the standard are listed below. If designing an area that is
to be dedicated solely to taste panel work, these should be seriously considered.
Total area should include:
Location - close proximity to preparation area, but separate entrance, and with
complete "close-off" capability.
Odours - keep area free from odours (air conditioner with carbon filters, slight
positive pressure).
COPYRIGHT
31
Sensory Evaluation
Decoration - use neutral, light colours for walls and furniture (e.g. off-white, light
grey).
Booths
Number - minimum three, normally five to ten - six is a useful number since it fits in well
with balanced ordering of 3 samples.
Space - allow sufficient space for movement of tasters and for serving samples.
Set-up - permanent booths recommended. Temporary acceptable. If adjacent to preparation
area include openings in the wall to pass samples through. Size and style specified. Consider
space for samples, utensils, spittoons, rinsing agents, scoresheets and pens, computerised
equipment. Include comfortable seats.
Lighting - uniform, shadow-free, controllable, adequate intensity for assessing appearance.
Devices to mask appearance (e.g. dimmers, coloured lights or filters).
Group work area
General
Necessary for discussion and training purposes. Include large table and
several chairs. "Lazy Susan" useful. Include board for discussion notes, etc.
Lighting
Preparation area
General
Located close to assessment areas but no access to tasters. Design for efficient
work-flow. Well ventilated. Flexible services (i.e. plumbing, gas, electricity).
Equipment
Depending on testing required. Include working surfaces, sinks, cooking
equipment, refrigerator, freezer, dishwasher, etc. Storage space for crockery etc. Crockery,
glassware etc for serving samples.
Office area
General
Separate but close to testing area, reasonable size, desk, filing cabinet,
computer, bookcase. Photocopying service needed.
Additional areas
Useful to include rest room, cloak room and toilets.
Practical alternatives
The requirements specified in the International Standard (ISO 8589) will obviously provide a
suitable area, but they are not always feasible, either from the point of view of financial
COPYRIGHT
32
Sensory Evaluation
resources or physical space available.
Very few industries are able to start from scratch, designing new premises solely dedicated to
sensory analysis work. I therefore would like to abbreviate the list proposed in the standard
to one which I consider includes the bare essentials.
Minimum of 2 areas:
Preparation area and office area. If possible position these at opposite ends of the room to
avoid messy paperwork!
Testing area with entrance separate from preparation area.
Preparation area requires
A system using collapsible booths can work quite well if it is not possible to keep an area
solely for sensory work. These may be made of painted wood, heavy duty cardboard, or
"corflute". They can be made specifically to fit any available benches or tables and folded
and stored when not in use.
The type of facility will depend on:
Finance
Available space
Frequency of use
Tests conducted
COPYRIGHT
33
Sensory Evaluation
STATISTICAL PRINCIPLES
This section looks at the role of statistics in sensory evaluation and introduces some terms
and concepts required to correctly apply statistical methods in evaluating sensory type data.
Why do we need statistics in sensory evaluation?
When we measure something (eg salt level in cheese) we find there is variation in what we
are measuring. This variation is called natural variation or experimental error and implies
that there is some true measurement but because of our limitations we cannot reproduce the
correct readings every time. This is a fact of life and we have limited control over this sort of
error.
Because of this variation there is some risk in making decisions about changing formulations
or introducing new products onto the market. Using statistics we have rules to estimate and
minimise the risk and enable us to extrapolate our results from an experiment to a more
general situation.
What is an experiment?
It is any process that generates raw data.
There are many sources of error in sensory data. Some of these include.
COPYRIGHT
34
Sensory Evaluation
6
Frequency
5
4
3
2
1
0
3
Flavour Acceptability
Looking at the graph or distribution we ask what is the best single estimate of the panels
score and what is a good measure of their variability? The best or most likely single
estimates are called measures of central tendency. The three most commonly used are:
mean - or average (sum of all data values divided by number of observations)
median - 50th percentile or middle value
mode - most frequent value, good for categorical data
Measures of variability include the range, standard deviation and variance. The range is
simply the difference between the smallest and the largest. The standard deviation is
probably the most common and is calculated by using the formula below.
s=
( X M )2
( N 1)
where M is the mean or average of X scores and N is the number of scores. This formula
calculates the deviation of each score from the mean and squares it to take into account
positive and negative values and the square root is then taken to bring it back to the original
units. The variance is simply the square of the standard deviation and is used in a number of
statistical formulas.
COPYRIGHT
35
Sensory Evaluation
The normal distribution
Many things we measure about a group of people will be normally distributed. This means
they will form a bell shaped curve described by an equation usually attributed to Gauss.
How does the standard deviation relate to the normal distribution?
Standard deviations describe discrete percentages of observations at certain degrees of
difference from the mean. So for a normal distribution about 66% of our data will be within
one standard deviation of the mean and about 95% will be within two standard deviations.
For our mango flavour data with a mean of 6.0 and standard deviation of 1.89 then 66% of
our data lie between 4.11 and 7.89. If the standard deviation had been 1.00 then 66% of the
values would be between 5.0 and 7.0, a smaller range indicating less variability.
In addition any score, X can be described in terms of a z-value, which describes how far the
score is from the mean in standard deviation units.
Z = X-/
Since z-scores are related to percentages under the normal curve they can predict how far a
score is from the mean and how likely or unlikely it is. So the z-score can be converted to a
probability value or p value. This p - value is found from the area under the curve outside
the z score and is the chance with which we would see a score of that size or greater. Tables
are often used to convert z - scores to p values.
An important concept
When we do an experiment we are using results from a sample taken from a larger
population of possible results. Since we cannot take all possible results from the population
we infer from our sample results what should happen in the rest of the population. By
making this generalisation we often express our results in terms of probability or p- values.
This is our safety margin or level of confidence about our result. It is often quoted like this the flavour score for naturally ripened mango was significantly higher (P<0.05) than that for
artificially ripened mango.
But what does this mean?
We are at least 95% certain that based on our experimental conditions the naturally ripened
mango will have more flavour than artificially ripened mangoes. This conclusion will be
wrong about five times out of 100. Sometimes a 1 % value or 0.01 is used for greater
precision.
How does all this help?
We need to identify some more concepts before we can be confident in using statistics.
Experiments need to be planned and carried out correctly before we can use statistics and two
important principles are replication and randomisation.
Replication is the assessment of each treatment more than once. A treatment can be the
addition of sweetener to a product or the storage temperature of a fruit. With replication we
COPYRIGHT
36
Sensory Evaluation
can assess the natural variability and separate this from our variability due to treatment
differences. This is like a signal to noise ratio. Is our signal greater than the background
noise (natural variation)?
Random allocation of treatments to samples or products ensures each sample has an equal
opportunity of receiving any treatment, and that this chance is unaffected by the treatments
assigned to other samples. For example if two products are tasted by 24 tasters and they all
taste product A first then this may well bias the results, as the first product tasted may tend to
be preferred regardless of which it is. Subjective allocation of treatments in a haphazard way
is not a satisfactory alternative to randomisation.
COPYRIGHT
37
Sensory Evaluation
SENSORY EVALUATION METHODS
There are two main types of sensory methods:
Affective :tests which involve consumer preference or acceptance
Analytical : tests which are involved with analyzing specific product attributes in terms of:
discrimination/difference
description
AFFECTIVE TESTS
Preference infers a preference for one product over another; either overall or in relation to a
particular parameter.
Acceptance infers actual utilisation/purchase of the product.
Panel selection
Select panel on basis of end use:
Age
Race
Religion
Sex or
Random selection for overall
Panel training
No need for training, in terms of technique or ability.
instructed/briefed in terms of:
Method
Questionnaire
Length of trial
Number of samples
Panel size
1. 20 to 100 people
2. 20 = pilot consumer panel
3. 100 = consumer panel
COPYRIGHT
38
Sensory Evaluation
SPECIFIC TEST METHODS
PAIRED PREFERENCE TEST
(Reference: AS 2542.2.1; 1982.)
Application: to establish whether there is a preference between two samples.
Principle: a pair of samples (one may be a control) is presented to each assessor. The
assessors are asked to choose the sample they prefer.
This test is a forced choice ie: the assessors must select one sample as being more
preferable. Responses indicating no preference are not permitted.
Statistically based on null hypothesis that there is no preference between the samples.
ie:PA = PB = 50%= 0 5
Bilateral Test - no expectation of preferences
Specimen Answer form for bilateral paired preference test
PRODUCT.DATE..TIMEASSESSOR
..
Which sample do you prefer?
Please examine code 349 first.
Please tick the appropriate box.
Code
Place tick
922
349
Conclusions
no preference
A preferred to B
B preferred to A
Question which of the two samples do you prefer? Count the number of replies citing one
of the two samples the more frequently.
Conclude that this sample is significantly preferred to the other if the number obtained is
greater than or equal to that shown in Table 4.
Example:
Two drinks A and B, are offered to a panel of 30 assessors. The two samples are
presented under random number eg: 789 and 379. The test supervisor accepts a 5% level
COPYRIGHT
39
Sensory Evaluation
of significance (ie: P < 0.05%). It is not known which of the two samples contains more
sugar.
Question - Which sample do you prefer?
Replies:
22 prefer A
8 prefer B
From Table 4 it can be concluded that Drink A is preferred to Drink B.
Unilateral Test - expect one sample to be preferred.
Specimen Answer form for unilateral paired preference test
PRODUCT.DATE..TIMEASSESSOR
..
Do you prefer sample 186 to sample 592?
Please examine code 592 first.
Please tick the appropriate box.
YES
NO
Conclusion
no preference
the declared sample is preferred
From Table 3 it can be concluded that there is preference for drink A over drink B.
COPYRIGHT
40
Sensory Evaluation
N.B. If uncertain always use the bilateral test.
Advantages
Disadvantages
Only suitable for 2 products (note multiple Comparisons can be used but other
preferences tests are more commonly used. See ASTM manual on sensory testing
method, STP 434; 1968)
No magnitude of preference is given ie they both may be disliked but one can still be
preferred.
Applications
Product Development
Product Matching
Process Change
COPYRIGHT
41
Sensory Evaluation
Statistical analysis
Kramers tables, which have been used in the past to analyses differences between rank sums,
should not be used due to questions of accuracy and statistical validity.
When there is no expectation of a specific rank order being made (eg when ranking
preference of new product prototypes) the Friedman Test should be used (see statistical
method s section for details).
Example
Twelve households were presented with four samples of meat seasoning to be used in
cooking. They were asked to use the samples as directed and to rank them in order of
preference. The results are shown below:
COPYRIGHT
42
Sensory Evaluation
Rankings for the preference of four meat seasonings
HOUSHOLD
1
2
3
4
5
6
7
8
9
10
11
12
Rank sums
Seasoning
A
1
2
1
1
2
3
3
3
1
1
1
1
20
B
3
1
4
4
3
4
4
4
2
2
2
3
36
C
2
3
2
2
1
2
2
1
3
3
3
2
26
D
4
4
3
3
4
1
1
2
4
4
4
4
38
B
36b
C
26ab
D
38b
Rank sums that do not have a common superscript are significantly different (P<0.05)
COPYRIGHT
43
Sensory Evaluation
RATING FOR PREFERENCE
(Australian Standard 2542.2.30
Principle
Assessors are asked to evaluate one or more samples and indicate the degree of liking for the
product or some characteristic of the product.
When performing preference testing it is important to include as many panellists as possible.
Personal preferences in foods are being measured which are purely subjective, so the variance
in the data is large. This makes it more difficult to obtain statistically significant results. The
larger the panel, the more chance there is of obtaining a significant result. Only untrained
panellists are used and should be selected at random or from a targeted group related to the
product.
Pilot consumer panel = 20-25
Consumer panel = 100
Types of response scale
Category scale/structured scale
The response scale is divided into categories or boxes.
The response scale is usually divided into an arbitrary number of categories - usually between
7 and 13
Category scales must be bipolar.
Verbal descriptors or facial expressions may be used to identify the levels of acceptance.
Hedonic category rating
AROMA
FLAVOUR
TEXTURE
Like extremely
Like very much
Like moderately
Like slightly
Neither like nor dislike
Dislike slightly
Dislike moderately
Dislike very much
Dislike extremely
COPYRIGHT
44
Sensory Evaluation
Facial hedonic scale
7 point facial hedonic scale
Appearance
Aroma
Flavour
Mouth-feel
Graphic rating scale
Disadvantages
COPYRIGHT
45
Sensory Evaluation
Applications
storage trials
product development
consumer testing
quality control
subjective/objective correlations
research
Example:
Three samples of frozen chicken casserole were presented to a 24 member panel who
assessed the appearance, flavour, texture and general acceptability of the products using a 13
point
The following results were obtained:
CHICKEN CASEROLE
Appearance
Flavour
Texture
General Acceptability
A
10.8 a
9.9a
10.4 a
10.3 a
B
8.5 d
9.4 b
9.6 b
9.2 b
C
10.3 b
9.2 b
9.1 b
9.4 b
Scores within each row that do not have a suffix in common are significantly different.
SENSORY EVALUATION IN CONSUMER TESTING
Introduction
The personal response by current or potential customers of a product, a product concept, or
specific characteristics of a product is collectively grouped under what we call consumer
testing. However, it is important to define the terms acceptance and preference often
associated with consumer testing. Acceptance refers to the degree of liking or disliking for a
particular product or the ability of the product to meet expectations of consumers while
preference refers to a choice made by panellists among several products on the basis of
liking or disliking. Unfortunately preference is widely used as a generic term to describe
both acceptance and preference judgements. It is important to note for example in paired
preference testing that although one product may be preferred to another, neither product may
be liked to any degree. The term hedonic is an adjective associated with degrees of
pleasure or displeasure and is applied to both acceptance and preference testing.
COPYRIGHT
46
Sensory Evaluation
Applications of Consumer Testing
The reasons for conducting consumer tests usually fall into one of the following categories:
Product maintenance
Product improvement/optimization
Development of new products
Assessment of market potential
Product maintenance
Research and development projects may involve cost reduction, substitution of ingredients,
process and formulation changes and packaging modifications without affecting the product
characteristics and overall acceptance. Usually difference tests would be used to determine
whether a difference was perceived or not but it is necessary to take the product out to the
consumer to determine if the reformulated product will achieve at least parity with the current
product.
Product maintenance is also a key issue with quality control/quality assurance and shelf-life/
storage projects. Feedback on consumer response gives important information on those
sensory characteristics that are most important to consumer choice and which should
therefore be rigorously controlled. A combination of in-house profile testing on the
magnitude and type of change over time, condition, production site, raw material sources etc
can be used in conjunction with consumer testing to determine how large a difference is
sufficient to change the acceptance rating.
Product improvement/optimisation
The intense competition among consumer products drives companies to constantly improve
and optimise products so that they can deliver what the consumer is really looking for and
therefore increase market share.
In product improvement, prototypes are made, tested by a trained panel to verify that the
desired attribute differences are perceptible, and then tested with consumers to determine the
degree of perceived product improvement and its effect on overall acceptance or preference
scores.
For product optimisation, ingredients or process variables are manipulated and a trained panel
identifies the key sensory attributes affected and consumer tests are conducted to determine if
consumers perceive the change in attributes and if such modifications improve the overall
acceptability.
Development of new products
During the new product development from concept to a range of trial samples to a modified
sample range and finally a choice to launch, consumer testing should be used throughout in
conjunction with trained panel assessment.
COPYRIGHT
47
Sensory Evaluation
Assessment of market potential
In addition to the use of sensory evaluation to gather information about key attributes of a
product, typical marketing questions such as intent to purchase, purchase price, current
purchase habits, consumer food habits, effects of packaging, advertising and convenience are
critical for the acceptance of branded products. It is often convenient for these marketing
type questions to be included in a questionnaire presented to consumers when assessing the
sensory characteristics of the product.
Conducting Consumer Tests
There are a number of factors to consider when conducting consumer tests and these are:
Test design
Test subjects
Test location
Test questionnaire
Test Design
There are two main types of design, one is qualitative measuring subjective responses while
the other is quantitative determining the responses of a large group to a set of questions
regarding preference, liking, sensory properties etc.
Qualitative Tests include focus groups, focus panels and one-on-one interviews. Each of
these has their use in a particular situation depending on what is required and how sensitive
the topic is. Essentially small groups are used to uncover as much specific information from
as many participants as possible. It is frequently recorded either by video and or audio and a
summary is made.
Quantitative Tests
Essentially all the good practice principles used in sensory evaluation as described in the
difference and descriptive testing should be followed here such as 3 digit random codes for
product and presentation in a balanced order. Some typical designs used include:
Monadic test where only one product is assessed which makes it fast and the least
expensive but is relatively insensitive and requires large numbers of consumers (at
least 200).
Sequential monadic where one product is assessed, removed and then replaced by a
second product in a balanced design giving it greater sensitivity.
Paired preference testing where two products are assessed simultaneously and a
direct comparison is made making it quite sensitive.
Acceptability testing. Usually the nine-point hedonic scale is used to determine
consumers liking of a product and if required the relative ratings for liking can be
used as a measure of preference.
COPYRIGHT
48
Sensory Evaluation
Example of nine-point hedonic category rating
AROMA
FLAVOUR
TEXTURE
Like extremely
Like very much
Like moderately
Like slightly
Neither like nor dislike
Dislike slightly
Dislike moderately
Dislike very much
Dislike extremely
Example of seven point facial hedonic scale often used for children
Appearance
Aroma
Flavour
Mouth-feel
Graphic rating scale - the response is recorded by marking a position on a line (also called
visual-analogue scale, line mark scale or unstructured scale) - physical lengths 100-150 mm.
This scale may also use facial expressions for measurement.
Attribute testing can be used to gain information on the reasons underlying overall
preferences and usually category or line scales are used. These can be hedonic type
attributes or sensory attributes in the form of just right scales as shown below.
not sweet enough
just right
too sweet
Test Subjects
If information on the acceptance of the product by consumers is required, then it is they who
should do the tasting. However, this is not always practical in preliminary testing of products, so
a compromise can be made by using large numbers staff who assess fairly infrequently.
However, it should always be remembered that this is a compromise, and results are best
COPYRIGHT
49
Sensory Evaluation
interpreted only in relative, not absolute, terms.
representative sample of the target population.
Product usage. It is important to determine if you are looking for low, medium or
high users of the product. For speciality products or niche markets, the cost of
consumer testing increases as more people must be contacted before the required
number are found.
Gender. It is not always necessary to get equal numbers as purchasing or usage
habits vary between products. Researchers should use current market information.
Age. If a product has broad age appeal then consumers should be selected by age in
proportion to their representation in the user population.
Nationality. Products targeted towards a specific part of the community or for export
ideally should be tested in that environment. However, it is possible to use foreign
nationals resident here but it depends on how long they have been residing in their
adopted country as they can develop the likes and dislikes characteristic of the
adopted country.
Social class. This can be based on income or occupation although sometimes it is
difficult to get consumers to reveal such information.
Others including race, religion, education level, employment, geographic
location, etc. If any of these are important in defining the target audience then the
researcher should consider them.
Source of Consumers
As mentioned it is important to sample properly from the consuming population but because
of cost restraints employees and local residents may be used for things such as product
maintenance. However, for new products or product optimisation or improvement the correct
audience should be selected. These can come from a database of consumers willing to assess
products, telephone survey, leaflet drop, shopping centres, embassies, colleges or door to
door.
Test Location
It is possible to conduct consumer testing in a number of locations depending on the
resources and the results can vary greatly. Locations include:
Company laboratory facilities, which give good control of the environment and
rapid feedback of results but the sensory booths, are clinical and atypical of a real
COPYRIGHT
R L Mason and S M Nottingham
50
Sensory Evaluation
domestic environment.
Central location such as school or church halls or shopping centres are convenient as
large numbers can be tested at one time and on a number of products. However the
conditions are artificial compared to normal use at home and the number of questions
that can be asked may be limited.
Home use tests represent the ultimate in consumer testing as the product is tested
under its normal conditions of use. In addition to the product itself, a check on the
packaging can also be determined. Generally more information can be gathered as the
consumer gets more time and can perform repeated assessments. However it is time
consuming and uses a smaller number of people and the possibility of nonresponse is
great unless consumers are continually reminded.
Test Questionnaire
It is very important that the test questionnaire format is simple, unambiguous, easy to read
and understand. You need to consider the objective of the test and any constraints such as
time, funding etc. In essence you need to be:
Brief
Use simple plain English (provide translation for studies involving foreigners)
Be specific
Multiple choice questions should be mutually exclusive
Avoid ambiguity
Watch the effects of wording
Dont ask what they dont know
Try and pre-test the questionnaire
For example
How satisfied or dissatisfied were you with the product?
Very satisfied
Slightly satisfied
Neither satisfied nor dissatisfied
Slightly dissatisfied
Very dissatisfied
The product looks like how it is shown on the package.
Agree strongly
Agree
Neither agree nor disagree
Disagree
Disagree strongly
What did you like about the product?
This open-ended question allows for the consumer to add something you may have forgotten
but it is sometimes hard to read the answer (handwriting) and some people do not bother with
answering.
COPYRIGHT
51
Sensory Evaluation
The question order should go from the more general to the more specific and ask overall
acceptability first before biasing the consumer with more specific issues. Ask the more
sensitive demographic questions last.
Data Analysis
All quantitative data should be subjected to some form of statistical analysis from simple
summary statistics and graphical representation to t-tests and analysis of variance with
pairwise comparisons. Further advanced multivariate methods such as principal components
analysis and cluster analysis along with regression methods to relate consumer data to other
data such as linear regression, partial least squares and preference mapping can also be used.
COPYRIGHT
52
Sensory Evaluation
ANALYTICAL SENSORY TESTS:
In general, analytical panels are used as measuring instruments and therefore need to be:
Reproducible
Simple difference tests are those that have no direction or characteristic associated
with the difference between the products. Examples of simple difference tests are:
Triangle test
Duo Trio test
Two-out-of-five test
A not A
Difference from control
Directional difference tests are those that have a direction or characteristic associated
with the difference between the products. Examples of directional difference tests
are:
53
Sensory Evaluation
the selection and monitoring of panellists.
With products that produce sensory fatigue, carryover effect or adaptation effects, the triangle
test has limited application.
Principle
Three samples, two of which are identical, are presented simultaneously to each panellist for
testing in a predetermined order. The panellist is told that two samples are identical and one
is different (odd). The panellist is required to identify the different sample. The triangle test
is a forced choice test.
Preparation and Procedure
The samples should be representative of the product and all prepared in exactly the same
way. Select four 3-digit random number codes, two for each product. Prepare scoresheets to
provide equal numbers of the following orders:
AAB
ABA
BAA
BBA
BAB
ABB
Make up sets of 3 samples to match the score sheets so that half contain 2 samples of product
A and half contain 2 samples of product B (Total number of sets should be a multiple of 6.).
Make up sets in multiples of the six arrangements as required for the number of panellists.
If total number of panellists or quantity of products available is insufficient to provide equal
numbers of the 6 orders, you still need to make sure there is a balance between sets with 2
As and 2 Bs. The triangle tests should be presented at random to the panellists. Instruct
each panellist to examine in the specified order (e.g. left to right) and remind them that they
must make a decision.
Count the number of correct responses (those that select the odd sample) and compare the
result with those presented in Table 2.
Questionnaire
Specimen answer form for the triangle test
Product
Date
Time
Assessor
One of the three samples presented is different from the other two.
Please examine in the order requested and place a circle around the code of the sample
which is different.
293
Analysis of results
COPYRIGHT
594
862
Sensory Evaluation
The total number of correct responses is counted as well as the total number of responses and
compared to the statistical tables (Table 2). This is based on the probability that if there is no
real difference the odd sample will be chosen a third of the time.
Example
A company wishes to put a new dessert topping on the market. The product development
section has two different thickening agents available to them, one which is considerably
cheaper. They wish to know if there is any difference in the products made using the 2
different thickeners.
Two batches (A, B) are prepared using the two different thickening agents and samples are
presented to 17 assessors. As each assessor will only make one assessment, it will be
necessary to prepare 27 samples of A and 27 samples of B, and arrange them to provide three
of each of the six possible arrangements as indicated above. One set is discarded and the
remaining 17 sets are randomly distributed between the assessors.
The number of correct responses is 10, ie the number of panelists who correctly selected the
odd sample from the 3 samples presented.
The test organizer will accept a risk of error of 5% (P<0.05), that the test will reveal a
difference when there is none.
Table 2 indicates that for 17 assessors at P<0.05, 10 correct responses are required for
significance.
It can be concluded that the product from the two thickening agents are significantly different
(P<0.05).
What should the test organizer do next???
DUO-TRIO TEST
(Australian Standard 2542.2.4 - 1988)
Scope and Application
Used to determine whether a difference exists between two samples. The difference can
involve one or several sensory attributes, but no direction or magnitude of the difference is
measured. A duo-trio test can be used when one of the products is an existing standard or
reference.
A duo-trio test can be applied to determine whether changes in ingredients, processing,
packaging or storage have resulted in differences between products. The duo-trio test finds
application in the selection of panellists, product and process development, product matching,
quality control and as a preliminary test prior to analytical descriptive testing.
Statistically the duo-trio test is less powerful than the triangle test because the chance of
guessing a correct result is one in two. The Duo-Trio test is therefore only used when it is
COPYRIGHT
55
Sensory Evaluation
required to form a judgement. This is the case when tasting a product with a lingering aftertaste such as bitterness, spicy or chilli.
Principle
Three samples, two of which are identical, are presented to each panellist. One sample is
identified as the reference sample and panellists are instructed to assess the reference sample
first and then identify which of the two samples is the same as the reference. It is a forced
choice test.
Preparation and Procedure
The samples prepared should be representative of the product and prepared in exactly the
same way. If possible, samples are presented simultaneously or if required, sequentially.
There are two forms of this test: balanced reference mode and constant reference mode.
Balanced reference mode
This is used when both the samples are unfamiliar and so both the samples are used as the
reference sample.
Select two 3-digit random codes, one for each product. Prepare scoresheets to provide equal
numbers of the following orders:
RA A B
RA B A
RB B A
RB A B
Make up sets of 3 samples (reference plus two samples) to match the scoresheets so that half
contain 2 samples of product A and half contain 2 samples of product `B'. (Total number of
sets should be a multiple of 4).
If total number of panellists or quantity of products available is insufficient to provide equal
numbers of the 4 orders, then you will still need to check that there is a balance between sets
with 2 As and 2 Bs.
The sample sets are allocated at random to the panellists. Instruct each panellist to assess the
reference sample first, followed by the two other samples in order (e.g. left to right). Remind
them that they must make a decision.
Constant reference mode
The constant reference duo-trio test is useful when you have trained panellists. In this test,
one of the samples is a familiar product or designated standard. It is therefore the only one
used as a reference sample. The number of possible presentation orders is thus restricted to:
RA A B
RA B A
Select two 3-digit random codes, one for each product and prepare the scoresheets so that
equal numbers of the two orders are presented. (Total sets should be a multiple of 2).
COPYRIGHT
56
Sensory Evaluation
Randomly allocate sample sets to panellists and instruct each panellist to assess the reference
sample first, followed by the two other samples in order (e.g. left to right). Remind them that
they must make a decision.
Analysis of results
Count the number of correct responses as well as the total number of responses and use the
statistical Tables 3.
Questionaire
DUO-TRIO TEST
Name:
Date:
Time:
Product:
You are provided with three samples. The left-hand one is a reference; one of the other two is
the same as the reference.
Taste the samples in the order left to right and circle the number of the sample which is the
same as the reference.
Reference
Sample code:. . . . . . . . .
Sample code: . . . . . . . . .
Example
A duo-trio test was used to determine if methional could be detected when added to cheddar
cheese in amounts of 0.125 ppm and 0.250 ppm. Each tray had a control sample marked R
and two coded samples, one with methional added and one with no methional. The duo-trio
test was used in preference to the triangle test because less tasting is required to form a
judgment. This fact is important when tasting a substance with a lingering aftertaste, such as
methional.
The test was performed on two successive days using eight judges. Each day the judges were
presented with two trays. One tray contained a sample with 0.125 ppm methional and two
control samples and the other contained a sample with 0.250 ppm methional and two control
samples. A total of 16 judgments were made at each level. The results are shown in the
following table.
COPYRIGHT
57
Sensory Evaluation
Duo trio test on cheddar cheese containing methional. 0.125 and 0.250 ppm.
Day1
Day 2
JUDGES
0.125
0.250
0.125 0
.250
1
2
3
4
5
6
7
8
TOTAL
X
R
X
R
R
X
R
R
5
R
R
R
X
R
R
R
R
7
R
R
X
X
R
X
R
R
5
R
R
R
R
R
X
R
R
7
X = wrong
R = right
0.125 ppm = 10 out of 16 correct judgments
0.250 ppm = 14 out of 16 correct judgments
Consult Table 3 for 16 judges in a two sample test. This chart shows that 12 correct
judgments are significant at the 5% level.
The conclusion is that methional added to cheddar cheese can be detected at the 0.250 ppm
level but not at the 0.125 ppm level. What would you do next??
Advantages
Disadvantages
Applications
COPYRIGHT
58
Sensory Evaluation
TWO-OUT-OF-FIVE TEST
Used to determine whether there is a sensory difference between two samples and to select
and monitor panellists. It is statistically very efficient as the probability of guessing correctly
the different two samples from the five samples presented is low. It can be useful when only
a small number of panellists are available. However, sensory fatigue and memory effects
may affect the test.
As with the triangle and duo-trio tests, assign 3-digit random codes to the samples and then
make up the scoresheets, taking care to prepare the samples in an identical fashion. There
will be 20 possible combinations.
Panellists are instructed to assess each product from left to right and select the two samples
that are different from the other three. Statistical tables exist to determine the significance of
the result.
A NOT A TEST
(Australian Standard 2542.2.5 - 1991)
Used to determine whether test samples in a series are the same as or different from the
reference sample. It is an especially useful test where triangle and duo-trio tests cannot be
used. This may be the case where comparisons are required between products that have a
strong or lingering flavour/aftertaste when you will need to control the time between sample
presentation or if there are differences in appearance. It is also useful to determine assessor
sensitivity to a stimulus.
Initially, panellists require familiarisation with the reference or A sample. Panellists are
then presented with a series of samples, some of which are the reference sample A and
some not-A. Generally, the panellist does not have access to the reference A while
evaluating the test samples. The panellist must determine whether the sample is the same
(A) or different (not-A) so it is a forced-choice test. Only one type of not-A sample
exists per test series. Panellists may test one, two or up to 10 samples in series (depending on
fatigue factors). The samples are presented randomly with 3-digit codes and one at a time (an
assessment is made and recorded before proceeding to the next sample). All samples are
prepared in an identical way and are representative of the product. The analysis of the data is
quite complex.
DIFFERENCE-FROM-CONTROL TEST (DFC)
Also called the degree of difference (DOD) test.
There is no Australian Standard this test however further information can be obtained in
Meilgaard, Civille and Carr and Aust et al. 1985.
Scope and Application
This test is used to determine whether or not a perceptible overall difference exists between
one or more samples and a control sample and also to give an indication as to the size of any
difference perceived. In quality control situations, trained panellists may also be able to rate
the degree of difference for individual attributes.
COPYRIGHT
59
Sensory Evaluation
A difference from control test is a useful test to use when other difference tests, such as
triangle or duo-trio are not suitable because of the normal heterogeneity of the products to be
tested. For example with products such as meats, baked goods and horticultural products it
can be difficult to obtain a homogeneous sample which is necessary for a triangle or duo trio
test.
When used in conjunction with consumer acceptability testing and descriptive testing using a
trained panel, the DFC test is useful for quality control and shelf life testing. In these cases
the relative size of the difference is important for deciding whether the product is an accept or
reject. It can be used to check production samples for the degree of difference from a
recognised control or standard product. In this situation the panellists must be familiar with
the range of differences expected and will require some training with reference samples and
the use of the scale. The test can also be useful in product development situations to
determine which sample is closest to a target product.
Principle
Each panellist is presented with an identified control sample plus one or more test samples.
The panellists are asked to rate the size of difference between each test sample and the
identified control sample. Panellists are informed that some of the test samples may be the
same as the control sample. The mean difference from control for each test sample is
compared with the mean difference from control obtained from the blind presentation of the
control sample.
The blind control sample is included as a measure of the placebo effect as it is very rare that
the blind control will actually be rated as absolutely identical to the identified control.
Panellists
Generally 20-50 people are required. Panellists may be trained or untrained but not a mixture
of the two. For some applications such as in a quality control, the panellists would require
some training. All panellists should be familiar with the test format, how to use the scale and
also be aware that some of the samples will be blind controls.
Preparation and Procedure
All samples should be representative of the product and all prepared in exactly the same way.
Label an identified control sample for each panellist. Label additional blind control samples
as well as the test sample(s) with 3 digit blinding codes. Where possible the control sample
and samples for assessment should be presented simultaneously. Each panellist evaluates the
identified control sample first. The panellists then rate the degree of difference for each test
sample of which some samples will be the blind control. The order of presentation of the test
and blind control samples should be balanced.
For example, half the panellists assess the samples in the order:
1. Identified control vs blind control
60
Sensory Evaluation
2. Identified control vs blind control
Examples of scales that may be used for the difference from control test:
Verbal Category
No difference
Very slight difference
Slight/moderate difference
Moderate difference
Moderate/large difference
Large difference
Very large difference
COPYRIGHT
0 = No difference
1
2
3
4
5
6
7
8
9 = Very large difference
61
Sensory Evaluation
Line scale
no difference
Analysis of Results
Calculate the mean difference from the identified control for each of the test samples and the
blind control samples. If several samples have been evaluated, use a randomised block
analysis of variance using the panellists as blocks. If only one test sample has been evaluated
use a paired t-test to analyse the results.
Example
A company suspects a flavouring ingredient may have been left out of a batch of chunky
vegetable soup. They want to know if this batch of soup is perceived to be different or not
from a control batch of soup. Due to the natural degree of batch to batch variability with the
product, a triangle test or other forced choice difference would be unsuitable due to the risk
of yielding false positives or false negatives.
Date:
Time:
Product:.........................................................................................
Assess the sample marked control first.
Assess sample 386 and score the overall sensory difference between the two samples using the
scale below.
not different
very different
COPYRIGHT
64
Sensory Evaluation
DIRECTIONAL DIFFERENCE TESTS
PAIRED COMPARISON TEST
(Australian Standard 2542.2.1 - 1982)
Scope and Application
Used to determine how a specific sensory property differs between two samples. It can be
applied to determine a directional difference (e.g. which sample is sweeter).
A paired comparison test has numerous applications in product or process development, eg
substitution of a new low-calorie sweetener, in quality assurance as well as in storage tests
and in product matching. A paired comparison test can also be used to determine if a more
advanced sensory test should be applied.
The paired comparison test can be used for multiple comparisons, but this results in a large
number of pairs to assess which uses a lot of sample and can cause sensory fatigue. In this
situation it is better to use a rating test.
Before the sensory testing commences, it is necessary to decide whether the results will be
treated as a unilateral or bilateral test. The most common paired comparison tests are twosided (bilateral) where there is no prior expectation of the result. Conclusions that can be
drawn are that there is no evidence of a difference or that one sample has a greater intensity
of the chosen attribute or is preferred.
One sided tests (unilateral) also exist when there is prior expectation of the direction of
difference. Conclusions to be drawn include that there is no evidence of a difference or that
the previously declared sample is greater in the attribute intensity or is preferred. The
wording used on bilateral and unilateral score sheets is different.
Test principle
Two coded samples are presented. The panellists complete the scoresheet questions that have
been previously determined by the test objective.
Panellists
The test is fairly simple requiring minimal training but the panellists must understand the
attribute that is being tested. However, trained panellists may be selected if appropriate.
Twenty is a reasonable number when the panellists have been screened. Statistically,
numbers can be reduced to 7 for a trained panel, but when using completely untrained tasters
such as consumers, then much larger numbers (100+) are needed.
Preparation and Procedure
Two 3-digit randomly coded samples, one of each product, are presented. The sample
presented is representative of the product and all samples are prepared identically. Equal
numbers of AB and BA are randomly allocated to the panellists. Panellists are instructed to
COPYRIGHT
65
Sensory Evaluation
assess the samples in a specific order (left to right) and identify which has the higher level of
a particular attribute or is preferred. The test is a forced choice test and no difference
responses are not allowed.
Analysis of results
Use standard statistical tables for unilateral tests (Table 3) and bilateral tests (Table 4). Count
the number of replies identifying a particular sample most frequently. Compare this value
with the number shown in the statistical table for the number of panellists used.
Questionaires
BILATERAL PAIRED COMPARISON TEST
Name.Date.Time
In front of you are two coded samples of orange juice.
Please assess them in the order shown below from left to right and indicate which
sample
016
sample 016
NO
COPYRIGHT
66
Sensory Evaluation
Examples
Bilateral test
Two drinks A and B, are offered to a panel of 30 assessors. The two samples are
presented under a random number, eg: 789 and 379. The test supervisor accepts a 5%
level of significance (ie: P< 0.05%). He does not know which of the two samples contains
more sugar.
Question: Which sample is sweeter?
Replies 18 opt for sample A
12 opt for sample B
From Table 4 it can be concluded that there is no significant difference in the sweetness of
the two drinks.
Unilateral test
Two drinks, A and B, are offered to a panel of 30 assessors. The two samples are
presented under a random number eg: 789 and 379. The test supervisor accepts a 1% level
of significance (ie: P<0.01%). He knows that drink A contains more sugar than drink B.
Question: Is sample A sweeter than sample B?
Replies 22 yes and 8 No.
From Table 3, it can be concluded that drink A is significantly sweeter than drinkB.
Advantages/Disadvantages
See paired preference.
Applications
Product Development
Quality Control
Shelf Life Measurement
RANKING TEST
(Australian Standard 2542.2.6)
Scope and Application
The ranking test can be considered an extension of the paired comparison test. It is used to
place a series of three or more samples in a rank order to determine whether differences exist
between samples. Samples are ranked for a specified criterion, e.g. an attribute (bitterness,
crunchiness, hardness) or a preference. The criterion needs to be understood by the
panellists. The data obtained is ordinal and therefore provides directional differences between
samples but does not provide information about the degree of difference.
The ranking test is a simple way to compare samples and is useful for reducing the number of
COPYRIGHT
67
Sensory Evaluation
test samples prior to performing another test and to evaluate panellist ability. In product
development, a ranking test can be used as a quick method of indicating the effects of
different raw materials, processing, or packaging and storage treatments.
Test principle
Samples are presented to the panellists simultaneously and are placed by the panellists into a
rank order relative to one another according to the specified criterion.
Panellists
Minimum of 8 but a larger number of panellists is better.
Preparation and Presentation
Three or more 3-digit random coded samples are presented to panellists simultaneously for
assessment in a balanced or random (if more than 4 samples) order. All the samples are
prepared and presented identically. The maximum number of samples will depend on the
type of product. They are instructed to arrange the samples in rank order according to the
level of the specified criterion, and are instructed whether to assign rank 1 for the lowest or
highest level. It is a forced choice test and tied rankings are not permitted. A separate
scoresheet is used and completed separately if the rank order is required for more than one
criterion.
As a panellist, it is often easier to perform this test by arranging the samples in a provisional
order first and then to re-evaluate them before assigning final ranks.
Analysis of results
Rank totals are calculated for each sample and used to generate test statistics which are
compared to statistical tables. As samples are evaluated only in relation to each other, results
from one test cannot be compared to those from another unless they both tested the same
samples.
Example
A cordial manufacturer has been provided with two new samples of lemon flavour that are
cheaper than the existing flavour. The manufacturer wants to know if cordials are made at
the same flavour intensity, would it be cheaper to use either of the two new flavours.
Samples are prepared at the same concentration but in order to test this from a sensory
perspective the 3 samples are presented to 30 assessors who are asked to rank them in order
of flavour intensity. The results are presented below:
COPYRIGHT
68
Sensory Evaluation
Assessor
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
Rank Sums
A
1
1
2
1
3
1
1
3
3
3
2
3
1
3
3
1
2
1
1
2
3
2
1
3
1
1
3
3
1
2
58
Cordial Samples
B
2
3
1
3
2
2
3
2
2
2
1
2
2
1
2
2
3
3
2
1
1
1
3
2
3
3
2
2
2
1
61
C
3
2
3
2
1
3
2
1
1
1
3
1
3
2
1
3
1
2
3
3
2
3
2
1
2
2
1
1
3
3
61
12
(58 2 + 612 + 612 ) 3 30(3 + 1)
30 3(3 + 1)
= 360.2-360
= 0.2
F=
From Table 7 the critical value for F with 2 degrees of freedom (df = number of samples 1)
is 5.99.
The technician must retain the null hypothesis that there is no difference between the flavour
strength of the three products.
RATING TEST
COPYRIGHT
69
Sensory Evaluation
(Australian Standard 2542.2.3 1988)
(AS2542.1.3 1995: 7.6 Selection of assessors for rating methods and 7.7 Selection of
assessors for descriptive analysis)
The rating test can be used to measure the perceived intensity of sensory characteristics eg
degree of strawberry flavour in a strawberry milkshake.
For this type of test the basic principles of sensory evaluation should be followed eg coded
samples, controlled test environment, number of samples tested. Panellists should be selected
based on their ability to give consistent ratings to the same sample and to discriminate
between samples checked by statistical analysis. The number of panellists used depends on
the degree of training but generally a minimum of eight highly trained, more if less trained.
Selection and training of panellists will be discussed later in a separate section.
The response scale used for rating may be in the form of a category scale or a line scale. A
category scale is a series of 7 15 boxes labelled to identify levels of intensity. With a line
scale, panellists respond by marking a position on a horizontal line labelled with anchors at
each end. An advantage of this type of scale is that panellists responses are not limited to a
number of categories on the scale and therefore it may be possible to identify more
differences between samples.
COPYRIGHT
70
Sensory Evaluation
Example of a category scale.
Strawberry flavour
Sample number
Extremely strong
495
128
Very strong
Moderate
Slight
Absent
An example of a line scale.
Strawberry Flavour
None
Very strong
Analysis of results
Ratings must be converted to numerical scores for analysis and interpretation. For category
scales, successive integers are assigned to successive categories and these are used in
analysis, e.g. with a 9-point scale, the integers 1-9 would be used. For graphic scales, the
distance, e.g. in mm, between the response mark and one end of the scale serves as the
response score.
The arithmetic mean and standard deviation, when obtained for each sample, serve as
measures of central tendency and variability, respectively. For statistical analysis, the
analysis of variance technique is appropriate (or a t-test in the case of one or two samples).
Correlation or regression analysis may be used for subjective/objective correlations.
Advantages
Disadvantages
COPYRIGHT
71
Sensory Evaluation
Applications
Product Development
Quality Control
Storage Trials
Research
Example
The scoring method was used to determine if there was a difference in bitterness in cheddar
cheese made using three different milk-coagulating enzymes. Samples of cheese from each
treatment were coded and presented to 12 judges for evaluation. The order in which the three
samples were tasted was balanced so that each possible order was used twice: ABC, ACB,
BAC, BOA, CAB, CBA.
The ratings assigned by the judges were given numerical values, ranging from 0 points for
not bitter to 5 points for extremely bitter. The results are shown in the following table.
Judges
1
2
3
4
5
6
7
8
9
10
11
12
Total
Mean
Samples
A
3
2
3
1
3
2
3
2
3
4
1
2
29
2.42a
B
0
2
1
1
1
1
2
0
1
2
1
2
14
1.17b
C
1
2
2
0
3
1
2
1
2
3
0
2
19
1.58b
Total
4
6
6
2
7
4
7
3
6
9
2
6
62
COPYRIGHT
72
Sensory Evaluation
STATISTICS FOR SENSORY: DIFFERENCE TESTING
What kind of data do we have?
When data are graded, as in rating scales then the t-test or other parametric statistics such
as analysis of variance are applied. These are discussed in a different section. In other
situations when we categorise performance into right or wrong answers and count numbers of
people who get tests correct or incorrect or those who make one choice over the other we call
this discrete categorical data. For these data we use a special distribution called the binomial
distribution and is useful for tests based on proportions. Some examples of how these tests
are used in sensory tests is given below.
Triangle Test
The triangle test is used when we want to know if there is a detectable difference between
two samples or products. Three samples are presented where two are the same and one is
different. Panellists are asked to pick the odd one out. Purely by luck the panellist has a one
in three chance of getting it right.
This forms the basis of the normal approximation to the binomial test. Lets accept that
p pexp
z = obs
where
pq / N
pobs is the proportion who answered correctly ie X/N
pexp is the proportion of people who we expect by chance ie 1/3
q = 1 - pexp
z is obtained from tables and for a one tailed risk of 5% is equal to 1.65.
By substituting into the equation and solving for X we get
X 1
N
3
1.65 =
and
12
N
33
X = 0.778 N + N / 3
Now for a range of N values (ie number who sit the test) we can get a range of X values (ie
the minimum number who must get the test right). These values have conveniently been
calculated and are already tabulated for use (see tables 1, 2).
For example if we have N = 30 panellists we must have at least
or
0.77830 + 30/3
14.26 correct to achieve significance.
Since we cannot have 0.26 of a person so we round up to 15. Therefore 15 out of 30 people
must get the triangle test right in order to reject the null hypothesis and conclude there is a
difference among the samples.
COPYRIGHT
73
Sensory Evaluation
Duo Trio
This is similar to the triangle test except that a standard is presented and two other samples,
one of which is the same as the standard, are also given. The panellist has to pick which of
the two is the same as the standard so has a one in two chance of being correct. Tables for
this are also available but as it is less efficient than the triangle test it is not usually preferred
over the triangle test (see table 3).
Paired Comparison Test
Only two samples are given and panellists are asked to pick which sample is, for example,
sweeter than the other. The same tables as for the duo trio test can be used and a one-tailed
test is used when you expect one sample to be sweeter (for example) than the other. When
there is no preconceived idea of which sample may be sweeter then two-tailed test is
appropriate (see tables 3, 4).
Freidman Test
This is best demonstrated by example. Suppose 18 panellists are asked to rank three orange
juices in order of preference. What we want to know is. Are the ranked values for all
panellists the same? The results are as follows.
Panellist
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
Sum (column total)
Tp
COPYRIGHT
RANK
A
1
2
1
1
3
2
3
1
3
3
2
2
3
2
2
3
3
2
39
B
3
3
3
2
1
3
2
3
1
1
3
3
2
3
3
2
2
3
43
74
C
2
1
2
3
2
1
1
2
2
2
1
1
1
1
1
1
1
1
26
Sensory Evaluation
The Freidman value F needs to be calculated as follows.
F=
12
Tp2 3J ( P + 1)
JP ( P + 1)
JP ( P + 1)
6
at the 5% level.
Two samples are different if the difference between their rank sums is greater than or equal to
11.76.
Difference from Control Test
Although the difference from control test is a form of difference testing, the data we collect is
not discreet data so the analysis follows the parametric tests that we use on rating scales.
These are discussed in the Statistics for Sensory: Descriptive Testing section.
COPYRIGHT
75
Sensory Evaluation
DESCRIPTIVE TESTING
Descriptive testing is used to identify and provide a picture or profile of the important
sensory characteristics of a product. With sensory profiling more than two samples can be
assessed simultaneously. This type of test has the advantage of not only being able to tell you
whether or not there is a difference between samples but also the nature and magnitude of
these differences. Appearance, odour, flavour and texture can all be assessed in this way and
the characteristics can be quantified using various techniques and scales as outlined in this
section.
Applications:
Tracking changes in the sensory characteristics of a product over time for shelf-life
evaluation
Examining the sensory properties of a target product for new product development
Examining sensory characteristics of different varieties of a product eg to look at
several varieties of apples in order to identify which varieties are sweetest, crunchiest
etc.
Sensory diagnostics of ingredient, process or packaging changes
Correlations with instrumental methods
COPYRIGHT
76
Sensory Evaluation
Questionnaire for flavour profile of beer
Product
Name
Date
AROMA
Characteristic
Hoppy
Fruity
Sour
Yeasty
Malty
Amplitude(overall aroma)
Intensity
Characteristic
Tingly
Sweet
Fruity
Bitter
Malty
Yeasty
Metallic
Astringent
Amplitude(overall flavour)
Intensity
FLAVOUR
Comments
COPYRIGHT
77
Sensory Evaluation
Profile Attribute Analysis
The Flavour Profile method was renamed the Profile Attribute Analysis with the introduction
of numerical scales. Mean scores could then be calculated and the data statistically analysed.
However consensus methods are still employed by some people. Again, this runs the risk of
a result being skewed by a dominant personality in the group.
The Texture Profile Method
This method was developed at General Foods in the 1960s. It was based on the principles of
the Flavour Profile method to assess the textural characteristics of a product. Textural
characteristics are categorised into three groups, mechanical, geometrical and other
characteristics.
1. Mechanical: relating to the reaction of food to stress eg. hardness, chewiness and
adhesiveness
2. Geometrical characteristics: relating to the size, shape and orientation of the particles
within the food eg. grainy, fibrous and aerated
3. Other characteristics: relating to the perception of the moisture and fat contents of the
food
The order in which the characteristics are assessed is also very important. The order of
assessment is first bite, chewing or masticatory second phase and residual or third phase.
Panellists are selected on their ability to discriminate between textural differences in the
product area to be trained. Six to ten panellists are suggested.
Standardised terminology and rating scales are used for the assessments and each scale point
is anchored with a specific food. Initially the technique used an expanded version of the
Flavour Profile scale, however more recently category and line scales have been used.
Panellists each make their own individual judgement and then depending on the type of scale
used, a consensus decision is reached or statistical analysis is performed on the data.
Quantitative Descriptive Analysis (QDA)
This method of descriptive analysis was developed in the 1970s. Ten to twelve panellists are
selected by screening for ability to discriminate between products, their ability to verbalise
their perceptions and to work as a group. The first step is to expose the panellists to a wide
range of products from the product category to be assessed. Each panellist individually lists
as many descriptive words possible that describe differences between the products. Hedonic
terms such as nice, good, bad, etc are not allowed. Through a group discussion, the list of
descriptive words is narrowed down to remove duplications and redundant terms until a
standardised vocabulary is reached. This standardised vocabulary then needs to be defined
with verbal definitions or reference standards and anchor points for the scale agreed upon.
The panel also decides the order in which the terms are to be assessed. During this process
the panel leader only acts to facilitate the discussion and provide references but does not
influence or lead the panel. Trial evaluations are then carried out using the agreed vocabulary
and refinements may be made until the panel is happy with the terms used. The panel leader
evaluates the results from these trial sessions and once confident the results are reliable and
COPYRIGHT
78
Sensory Evaluation
repeatable the actual assessment can take place. The assessment and trial sessions are
completed in sensory booths following the basic principles of sensory evaluation. An
unstructured 6-inch or 15cm line scale is used to measure the intensities of the agreed
characteristics. Several replicates (3+) are required to validate the data. Data is then
analysed using an analysis of variance. The results are often displayed visually on a spider
web or star diagram.
Results of ANOVA of orange jelly using QDA
Attribute
Brand A
Brand B
SEM
Probability
Orange colour
Orange aroma
Firmness
Tartness
Orange flavour
Foreign flavour
Sweetness
Rate
of
breakdown
10.2
7.6
9.6
8.6
7.6
4.3
7.1
5.1
7.9
6.9
6.6
6.9
6.9
4.8
9.6
6.1
0.62
0.50
0.64
0.66
0.72
0.48
0.42
0.60
0.011
0.325
0.001
0.072
0.494
0.464
<0.001
0.242
COPYRIGHT
79
Sensory Evaluation
Other Methods
Other methods which you may come across in literature but which will not be discussed in
detail in this workshop are:
Spectrum Method
This is a descriptive analysis technique developed by Civille to cover any or all of
appearance, aroma, flavour, texture or sound characteristics. Panellists use a standardised
lexicon of terms to evaluate the products. This method requires extensive training of the
panel to use standardised scales anchored with multiple reference points and panellists are
trained to use the scale identically. Data is analysed in a similar way to QDA.
Example of intensity scale values (0 to 15) for firmness.
Scale
value
3.0
5.0
8.0
11.0
14.0
Reference
Aerosol whipped cream
Miracle whip
Cheese whiz
Peanut butter
Cream cheese
Sample size
Redi whip
Kraft
Kraft
CPC Best Foods
Kraft/Philadelphia
1oz
1oz
1oz
1oz
1oz
Time Intensity
This is used to track the changes in perception of a particular attribute of a product over time.
For example you might rate the intensity of mint flavour perceived in chewing gum over a 3
minute period. This can be measured using pencil and paper or using one of the sensory
software packages with time intensity facilities.
Free Choice Profiling
Unlike other descriptive testing techniques this method does not use an agreed vocabulary to
assess the samples. Each panellist generates their own list of terms and scales, although they
must use these consistently for all samples. The data from this type of assessment is then
analysed using Generalised Procrustes analysis. The main advantage of this technique is the
time saved on training a panel, however interpretation of individual attributes can be
subjective as the terms are not defined as with other descriptive testing methods.
COPYRIGHT
80
Sensory Evaluation
STATISTICS FOR SENSORY: DESCRIPTIVE TESTING
We mentioned earlier about different types of data and how they are analysed using different
statistical methods. In this section we will look at the most common form of statistical
analysis for rating or preference type data, the paired t-test and Analysis of Variance (one
way and two way also known as repeated measures analysis of variance). We will also
introduce some advanced methods for separating data into logical groups using Principal
Component Analysis.
The paired t test
A common question we have in sensory evaluation is when we are comparing two products
or samples and we want to know if they are the same or different. We can use statistics and
in particular the paired t test to determine statistical difference. We calculate a t value from
the formula below and compare it to some tabled values for probabilities less than our
accepted risk, usually a probability of 0.05.
t=
Here is an example taken from OMahony. Intensity scores for two products are measured by
10 panellists on a 25 point scale.
Panellist
1
2
3
4
5
6
7
8
9
10
N = 10
Score
Product A
20
18
19
22
17
20
19
16
21
19
Product B
22
19
17
18
21
23
19
20
22
20
d2
4
1
4
16
16
9
0
16
1
1
Difference - d
2
1
-2
-4
4
3
0
4
1
1
d = 10
= 68
d =1
Now for some calculations.
mean of difference scores, d, = 1
the standard deviation of d is computationally =
(( d ) 2 / N )
N 1
58 / 9 = 2.538
1
= 1.25
2.538 10
The degrees of freedom term
so t =
COPYRIGHT
81
Sensory Evaluation
If we had four flavour scores, and knew three of them plus the variance or standard deviation
then we could calculate the value of the fourth unknown data point. In general, degrees of
freedom are equal to the sample size, minus the number of parameters we are estimating. We
need this value when looking up statistical tables.
The tabulated t value for df = 9, two-tailed, p=0.05 is 2.262 (see table 6). A word on twotailed and one-tailed test. If we simply wish to test whether a mean is different from the
population then we use a two-tailed test. However, if it is directional ie. greater than or less
than then we need a one-tail test. Our value of 1.25 is less than this so we do not reject our
notion that product A is the same as product B. Our data indicates that the difference scores
are not significantly different from 0.
Analysis of Variance
If we have only two samples we want to compare then we can use the paired t-test as
described earlier and establish a difference if it is there. If we have four samples then we
could do six paired t-tests and cover all possible pairings of the four treatments. This
however becomes very inefficient and unreliable as the number of samples increase. An
alternative to this is to use a technique known as analysis of variance to compare several
samples at the same time.
Analysis of variance looks at the amount of variance attributed to the samples or treatments
and also estimates the error variance or natural variation. By then doing a ratio of these
variances (ie signal to noise) we can then compare this to tabulated values. The distribution
is known as the F distribution and we calculate a F ratio or F test.
Most computer packages now do analysis of variance, sometimes described as one-way
analysis of variance and two-way analysis of variance. A two-way analysis of variance is
used when the same judges or panellists rate the same samples (sometimes called repeated
measures). Lets look at an example.
Suppose we have 10 panellists rating three samples of mango for mango flavour intensity on
a nine point scale. Their results are entered into a computer that then completes a two-way
analysis of variance and gives the following table.
COPYRIGHT
82
Sensory Evaluation
Source of variation
d.f
Mean square F
Samples
Panellists
Error
2
9
18
23.456
3.667
1.987
11.80 **
1.84 ns
The F ratio for samples is 11.80 which is greater than the tabulated value of 6.01 at P=0.01
with 2 and 18 degrees of freedom (often ** indicates significance at P=0.01, * for P=0.05).
This means that the variability due to samples is greater than that occurring naturally so there
are differences between the mango flavour intensity of the three mango samples.
Now to further test which pairs are significantly different we have a number of options. The
most common test would be the least significant difference (lsd) test which is based on the ttest. The lsd tells us what the minimum difference between two means must be for there to
2
be a significant difference. The formula is t ,df ems where ems is the error mean
n
square and n is the number of observations per sample and t come from tables of calculated
values.
In this example the lsd (P=0.01) is 1.32, so any two means with a difference greater than this
lsd are significantly different.
Other pairwise comparison tests are Duncans multiple range and Tukeys honestly significant
difference (HSD). Formulas for these tests can be found in most statistical textbooks or in
some cases the computer package may do the test for you.
You will also note that the panellists F ratio is not significant (ns), indicating the average
score given by any one judge is not that different to another judges score. Quite often the
panellists F ratio is significant, indicating that they are using different parts of the scale. With
highly trained panels, panellists tend to agree on the use of the scales.
An extension to the two-way analysis of variance is the three-way analysis of variance where
we add replicates to the AOV table to provide a complete analysis of the experimental data.
Difference from Control Test
The analysis of the data from this test can take a number of forms but I will outline the most
common and simplest to use. If you have a blind control and one test sample then you can
perform a paired t-test. If you have more than two samples then you can use analysis of
variance techniques and pairwise comparisons to determine differences.
An example taken from Meilgaard et al is given below. Forty-two judges are asked to
measure the perceived overall sensory difference between two prototypes (samples F and N)
and the regular analgesic cream (control). A category scale is used where 0 is no difference
and 10 is extreme difference.
COPYRIGHT
83
Sensory Evaluation
Difference from control test Analgesic cream
Judge
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
Mean
Blind
control
1
4
1
4
2
1
3
0
6
7
0
1
4
1
4
2
2
4
0
5
2
Product
F
4
6
4
8
4
4
3
2
8
7
1
5
5
6
7
2
6
5
3
4
3
Product
N
5
6
6
7
3
5
6
4
9
9
2
6
7
5
6
5
7
7
4
5
3
Judge
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
Blind
control
3
3
4
0
2
2
2
3
1
4
1
3
1
4
2
3
0
4
0
1
3
2.4
Source of variation
d.f
Mean square F
Judge
Product
Error
41
2
82
6.183
105.365
1.024
Product
F
6
5
6
3
5
5
6
5
4
6
4
5
4
6
3
4
4
8
5
5
4
4.8
Product
N
7
6
6
3
1
5
4
6
7
7
5
5
4
5
6
6
4
7
6
5
4
5.4
6.04 **
102.93 **
Product F
4.8b
Sample
Blind control
Mean response
2.4a
Product N
5.4b
Within a row, means not followed by a similar letter are significantly different at the 95%
confidence level.
It is concluded that both samples F and N were found to be significantly different from the
control. Descriptive testing is recommended to determine the nature of these differences.
COPYRIGHT
84
Sensory Evaluation
Advanced analysis using principal components
When we have lots of data and have asked our panel a number of questions we need a
technique for reducing the data down to a manageable size. The aim of the technique is to
reduce the number of variables that describe a sample to that of fewer dimensions. If we
reduce to two then we can plot the results onto a graph. These two dimensions are called
principal components and are a linear combination of the original variables.
It is useful for classifying a number of products by grouping them according to the variables
that describe them. Below is an example of the separation of nine coffees using principal
components.
Fragrant
Sweet
Oil
Cereal
Wetwood
Roast
COPYRIGHT
85
Sensory Evaluation
SELECTION, TRAINING AND MOTIVATION OF A PANEL
When developing a sensory panel, there are several areas that need to be addressed that include:
Selection
Training
interest in participating in the screening and training program as well as ongoing work
availability
general good health (note any illnesses or allergies and permanent impairment to the
senses)
any food idiosyncrasies (strong food dislikes or reactions to foods)
other information that might be relevant (age, sex, nationality, cultural and religious
background, previous sensory experience, smoking habits)
86
Sensory Evaluation
criteria. If you make all the questions optional you will find that the majority of people
respond truthfully. Pre-screening questionnaires can also be used to select individuals who
can describe sensory concepts. Record all the information you receive in some form of
database. Based on the above criteria, decide which prospective panellists are to proceed in
the screening process.
Interview
Individual interviews are required to determine whether prospective panellists will work well in
a group situation as well as for the analytical approach required in descriptive testing. An
interview is also used to confirm interest and availability.
For a descriptive sensory panel, there is a large investment involved in terms of both time and
money. It is best to complete a thorough screening process rather than training unsuitable
subjects. During the selection process, it is important to make note of both attendance and
personalities of panellists. A panellist who is repeatedly late or unavailable can be more trouble
than they are worth. Someone who distracts other panellists by talking or making comments,
despite repeated requests to remain silent while testing, is a liability, not an asset.
However, it is recognised that the best panellists available may need to be used although they
may not necessarily meet all the requirements.
Sensory screening tests
Screening is completed to obtain information on prospective panellists who need to be able
to:
Detect differences in attributes present and their intensities
Describe the attributes using verbal descriptors and scaling methods for the different
intensities
Be able to recall and apply attribute references when required
Prior to the first screening test, a preliminary session is a good idea to set the rules that may need
to be enforced politely but firmly.
Instructions for panellists
Avoid eating, drinking, smoking or chewing gum for 30 minutes before testing.
Do not talk or distract other panellists while testing.
Read any instructions on the scoresheet before starting to evaluate samples.
Make sure you evaluate the samples in the required order.
Don't forget to fill in your name and the date.
Do not discuss samples with other panellists until after they have evaluated the samples.
Have confidence in your own judgement.
Ignore your personal likes and dislikes.
Sensory screening tests also give the prospective panellists an indication of the methods used in
sensory analysis. The screening tests used should be chosen with the envisaged sensory
program in mind. Basic tastes and odours are commonly used for screening tests as well as
materials that illustrate the attributes that may be included in the sensory program. Samples of
COPYRIGHT
87
Sensory Evaluation
the actual food products may also be used.
A series of triangle or duo-trio tests may be completed to assess the ability of the potential
panellists to detect small differences between stimuli at supra-threshold levels. Preferably,
potential panellists should respond correctly 100% of the time.
Matching tests may be used to evaluate the ability of a prospective panellist to distinguish
between different sensory stimuli.
In order to evaluate the ability of the panellists to describe sensory responses, a series of
products can be presented and potential panellists asked to describe the sensory impression. The
products used should be related to those that will be used in the envisaged sensory testing.
For example, a range of odours may be presented:
Chemical name
Benzaldehyde
Octene-3-ol
Phenyl-2 ethyl acetate
Diallyl sulfide
Camphor
Menthol
Eugenol
Anethol
Vanillin
Geosmin
Beta-ionone
Butyric acid
Acetic acid
Isoamyl acetate
Dimethylthiophene
Panellists are given these samples to assess one at a time and asked to describe the odour using
his/her own words. A system of marking can be devised e.g. 4 points for absolutely correct, 3
points for correct in general terms, 2 points for a vague association, 1 point for a wrong
association and 0 points for no response. A satisfactory level for selection of panellists needs to
be specified in relation to the materials used. Similar techniques can be applied for taste and
texture.
The potential panellists may be screened for their ability to rank or rate products for selected
attributes using the same technique as the final panel will use. All potential panellists are
presented with the samples in the same order. Panellists are chosen if a satisfactory level is
attained which will depend on the intensities of the samples used. Also check that they have
used most of the scale.
COPYRIGHT
88
Sensory Evaluation
Training
In this phase, it is important that the panellists develop confidence as well as the skills for
product assessment. Panellists must be taught the correct procedures for evaluating samples
and ways to reduce or eliminate sensory adaptation. They must also learn to disregard their
personal preferences. Between 40 and 120h of training are required for a descriptive sensory
panel which will depend on the product, the number of attributes as well as the validity and
reliability required. A trained panel usually consists of 10-20 panellists.
The initial stage of training involves vocabulary development. The entire range of products
is presented to the panellists. They are instructed to individually assess the sensory
differences between the samples and record any differences as descriptive words. On
completion of this task, the panellists each list the attributes used to describe each sample. At
this time, it is very important that the panel leader does not lead or judge the descriptive
words generated by the panellist although they can ask for clarification. The panellists
themselves will usually start to move towards a general consensus once the total attribute list
has been generated.
It is then the role of the panel leader to provide reference standards for the attributes that have
been previously selected by general panel consensus. The references can be used to help the
panellists to identify and remember a sensory attribute found in the sample. The references
may be chemicals, ingredients or products. The panellists then assess the samples alongside
the references until a consensus is reached regarding the sensory attributes, reference
standards and definitions. This process should continue until the panellists are all happy and
understand the terms used.
Towards the end of training, a scoresheet is created by the panellists. The panellists decide
on the order in which the attributes are to be assessed. Generally the panel leader decides on
the type of scale used, although the panel decides on the verbal anchors to be used.
Once the panellists have become familiar with the samples, references and definitions, panel
evaluation sessions are completed that should be similar to the final testing situation. The
panellists are presented with coded samples in triplicate and asked to rate them using the
scoresheets and attribute scales they have trained with. By statistically analysing the data, the
panel leader will be able to determine if further training is required or if the evaluation phase can
begin.
Like any instrument, the performance of individual panellists as well as the panel as a whole
needs to be monitored to check they are producing reliable results. Reliability is checked by
completing test replications and the descriptive data obtained is analysed statistically using an
analysis of variance.
Motivation of panellists is one of the most important factors in maintaining an efficient trained
sensory panel. If panellists are motivated and interested they will perform well. For panellists, a
sense of completing meaningful work is an important source of motivation. When appropriate
on completion of a project, feedback should be given to the panel as to the project objectives and
outcomes and the contribution of the sensory results. Individual panellist feedback is also
important. They should be made to feel that attendance at sensory evaluation sessions is
COPYRIGHT
89
Sensory Evaluation
important. This can be reinforced by running sessions strictly and efficiently to keep their time
input to a minimum.
Throughout training as well as during ongoing sensory evaluation sessions, it is important to
keep the channels of communications open through panel discussion at the completion of a
training session or a sensory testing session.
Ongoing records of panellists' training and experience are invaluable. In some instances training
can occupy more time than the actual experimental testing sessions, especially when you first
start. However, if the job is done correctly right from the start, your trained panel will be one of
the most valuable resources in the company. Make sure you look after them.
An aside: Expert panels
Panellists who have a great deal of experience in assessing a particular product are often referred
to as "Expert tasters". Commodities that utilise expert tasters include the tea, coffee, wine and
dairy industries. These panels usually include only 2 or 3 highly trained tasters. These tasters
are particularly sensitive to the nuances of a specific product. They also have the ability to carry
the characteristics of standard samples in their sensory memory. It takes a great deal of practice
to develop the skill and requires continued tasting to stay "tuned". They are usually responsible
for arranging the tasting conditions and samples themselves, in addition to actually tasting and
making a final report. This type of panel is most frequently used to assign a quality grade to a
finished product, as in butter and cheese grading. In the wine and coffee industries one expert
may use these skills to blend individual components to produce a final product with the desired
characteristics.
COPYRIGHT
90
Sensory Evaluation
REPORTING
As with any other scientific experiment your sensory testing needs to be reported in a clear
and concise manner. The Australian standards for each test type details what should be
included in the report.
The results obtained should be interpreted and conclusions drawn using all the information
gathered in the experiment. Recommendations may also need to be included depending on
the nature of the work.
Remember that it is much easier to write the report if you keep a record as you go along!
COPYRIGHT
91
Sensory Evaluation
SELECTED BIBLIOGRAPHY
American Meat Science Association, Guidelines for Cookery and Sensory Evaluation of
Meat, AMS, USA, 1978.
Amerine, M A, Pangborn, R M and Roessler, E B, Principles of Sensory Evaluation of
Food, New York: Academic Press, 1965.
ASTM, Manual on Sensory Testing Methods, STP 434, Am. Soc. Test. Makr.,
Philadelphia, Pennsylvania, 1968.
Aust, L B, Gacula, M C, Beard,S A and Washam, R W. Degree of Difference Test Method
in Sensory Evaluation of Heterogeneous Product Types. Journal of Food Science, 50: 511
513, 1985
Bartoshuk, L, Separate worlds of taste Psychology Today 14 (9): 48-57, 1980.
Bartoshuk, L M, The biological basis of food perception and acceptance Food Quality &
Preference 4: 21-32, 1993.
Bourne, M C, Food Texture and Viscosity: Concept and Measurement, Academic Press
Inc., California, 1982.
Chi-Tang Ho, Manley, C H, Flavor Measurement, Marcel Dekker, Inc. 1993.
Gacula M C., Design and analysis of Sensory Optimization, Food & Nutrition Press. 1993.
Gacula, M C and Singh, J, Statistical Methods in Food and Consumer Research, New York:
Academic Press, 1984.
Jellinek, G, Sensory Evaluation of Food:
Horwood; 1985.
Chichester:
Ellis
Lawless, H T, Pepper potency and the forgotten flavour sense Food Technology 43 (11):
52, 57-58, 1989.
Lawless, H T & Heymann, H, Sensory Evaluation of Food: Principles and Practices,
Chapman & Hall, New York, 1998.
Lyman, B, A Psychology of Food, Van Nostrand Reinhold Co. Inc., New York, USA,
1989.
Lyon, D H, Francombe, M A, Hasdell, T A and Lawson, K, (editors) Guidelines for
Sensory Analysis in Food Product Development and Quality Control. Chapmann and Hall,
London, UK, 1992.
McBride, R L, The Bliss Point Factor, Sun Books, Australia, 1990.
COPYRIGHT
92
Sensory Evaluation
McBride, R L, (editor), Psychological Basis of Sensory Evaluation, Elsevier Applied
Science, London, UK, 1990.
McRae, R, Robinson, R K & Sadler, M J (eds) Encyclopedia of Food Science, Food
Technology and Nutrition, Volume 6, Academic Press, London, 1993.
Meilgaard, M, Civille, G V and Carr, B T, Sensory Evaluation Techniques: Boca Raton,
Fla: CRC Press, 1999. (3rd Edition)
Miflora Minoza-Gatchalian, Sensory Evaluation Methods with Statistical Analysis (for
Research Product Development and Quality Control). 1981.
Moskowitz, H R, New Directions for Product Testing and Sensory Analysis of Foods, Food
& Nutrition Press, Inc. 1985.
Moskowitz, H, Applied Sensory Analysis of Food, Volumes 1 and 2, CRC Press, Florida,
USA, 1988.
OMahoney, M, Sensory Evaluation of Food: Statistical Methods and Procedures, New
York: Marcel Dekker, Inc, 1986.
OMahony, M & Ishii, I Do you have an umami tooth? Nutrtion Today May/June, 1985.
Piggott, J R, Sensory Analysis of Food, London: Elsevier Applied Science, 1988 (2nd
edition now available).
Piggott, J R, Statistical Procedures in Food Research, London: Elsevier Applied Science,
1986.
Piggott, J R, Paterson, A Understanding Natural Flavors.
Professional. 1994.
COPYRIGHT
93
Sensory Evaluation
JOURNALS
Gacula, M C, Journal of Sensory Studies. Food & Nutrition Press, Inc.
MacFie, H J., Meiselman, H L., Food Quality and Preference. Elsevier Applied Scien
COPYRIGHT
94
STATISTICAL TABLES
Table 1: Probability of X or More Correct Judgments in n Trials (one-tailed, p = 1/3)a
n\x
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
47
38
39
40
41
42
43
44
45
46
47
48
49
50
1
868
912
941
961
974
983
988
992
995
997
998
998
999
999
COPYRIGHT
2
539
649
737
805
857
896
925
946
961
973
981
986
990
993
995
997
998
998
999
999
999
3
210
320
429
532
623
701
766
819
861
895
921
941
956
967
976
982
987
991
993
995
996
997
998
999
999
999
4
045
100
173
259
350
441
527
607
678
739
791
834
890
898
921
940
954
965
974
980
985
989
992
994
996
997
998
998
999
999
999
5
004
018
045
088
145
213
289
368
448
524
596
661
719
769
812
848
879
904
924
941
954
964
972
979
984
988
991
993
995
996
997
998
998
999
999
999
6
001
007
020
042
077
122
178
241
310
382
453
522
588
648
703
751
794
831
862
888
910
928
943
955
965
972
978
983
987
990
992
994
996
997
997
998
999
999
999
999
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
003
008
020
039
066
104
149
203
263
326
391
457
521
581
638
690
737
778
815
847
874
897
916
932
946
957
965
973
978
963
987
990
992
994
995
996
997
998
998
999
999
999
999
001
003
009
019
035
058
088
126
172
223
279
339
399
460
519
576
630
679
725
765
801
833
861
885
905
922
937
949
959
967
973
979
983
987
990
992
994
995
996
997
998
998
001
004
009
017
031
050
075
108
146
191
240
293
349
406
462
518
572
623
670
714
754
789
821
849
873
895
913
928
941
952
961
968
974
980
984
987
990
992
994
995
001
002
004
008
016
027
043
065
092
125
163
206
254
304
357
411
464
517
568
617
662
705
744
779
810
838
863
885
903
920
933
945
955
963
970
976
980
984
987
001
002
004
008
014
024
038
056
079
107
140
178
220
266
314
364
415
466
516
565
612
656
697
735
769
800
829
854
876
895
912
926
938
949
958
965
972
001
002
004
007
013
021
033
048
068
092
121
154
191
232
276
322
370
419
468
516
562
607
650
689
726
761
791
820
845
867
887
904
919
932
943
001
002
004
007
012
019
028
042
058
079
104
133
166
203
243
285
330
376
422
469
515
560
603
644
683
719
753
783
811
836
859
879
896
001
002
003
006
010
016
025
036
050
068
090
115
144
177
213
252
293
336
381
425
470
515
558
600
639
677
713
745
776
803
829
001
002
003
006
009
014
022
031
043
059
078
100
126
155
187
223
261
301
32
385
428
471
514
556
596
635
672
706
739
001
002
003
055
008
013
019
027
038
051
067
087
109
135
164
196
231
268
307
347
389
430
472
514
554
593
631
001
002
003
005
007
011
016
023
033
044
058
075
095
118
144
173
205
239
275
313
352
392
433
473
513
001
001
002
004
066
010
014
020
028
038
051
066
083
104
127
153
182
213
246
282
318
356
395
001
001
002
004
006
009
012
018
025
033
044
057
073
091
111
135
161
189
220
253
287
001
001
002
003
005
007
011
016
021
029
038
050
063
079
098
119
142
168
196
001
001
002
003
004
007
010
014
019
025
033
043
055
070
086
105
126
001
001
002
003
004
006
008
012
016
022
029
038
048
061
076
001
001
001
002
003
005
007
010
014
019
025
033
042
001
001
002
003
004
006
009
012
017
022
001
001
002
003
004
006
008
011
001
001
002
002
003
005
001
001
001
002
001
Table 2:
0.05
0.04
0.03
0.02
0.01
0.005
10
11
10
12
10
13
10
11
14
10
10
11
15
10
10
10
11
12
16
10
10
10
11
11
12
17
10
10
10
11
11
12
13
18
10
11
11
11
12
12
13
19
11
11
11
12
12
13
14
20
11
11
12
12
13
13
14
21
12
12
12
13
13
14
15
22
12
12
13
13
14
14
15
23
12
13
13
13
14
15
16
24
13
13
13
14
15
15
16
25
13
14
14
14
15
16
17
26
14
14
14
15
15
16
17
27
14
14
15
15
16
17
18
28
15
15
15
16
16
17
18
29
15
15
16
16
17
17
19
30
15
16
16
16
17
18
19
31
16
16
16
17
18
18
20
32
16
16
17
17
18
19
20
33
17
17
17
18
18
19
21
34
17
17
18
18
19
20
21
35
17
18
18
19
19
20
22
36
18
18
18
19
20
20
22
37
18
18
19
19
20
21
22
38
19
19
19
20
21
21
23
39
19
19
20
20
21
22
23
40
19
20
20
21
21
22
24
41
20
20
20
21
22
23
24
42
20
20
21
21
22
23
25
43
20
21
21
22
23
24
25
44
21
21
22
22
23
24
26
45
21
22
22
23
24
24
26
46
22
22
22
23
24
25
27
47
22
22
23
23
24
25
27
48
22
23
23
24
25
26
27
49
23
23
24
24
25
26
28
50
23
24
24
25
26
26
28
60
27
27
28
29
30
31
33
70
31
31
32
33
34
35
37
80
35
35
36
36
38
39
41
90
38
39
40
40
42
43
45
COPYRIGHT
96
0.001
Table 3:
No of trials (N)
0.05
0.04
0.03
0.02
0.01
0.005
0.001
10
10
10
10
11
10
10
10
11
11
12
10
10
10
10
11
11
12
13
10
11
11
11
12
12
13
14
11
11
11
12
12
13
13
15
12
12
12
12
13
13
14
16
12
12
13
13
14
14
15
17
13
13
13
14
14
15
16
18
13
14
14
14
15
15
16
19
14
14
15
15
15
16
17
20
15
15
15
16
16
17
18
21
15
15
16
16
17
17
18
22
16
16
16
17
17
15
19
23
16
17
17
17
18
19
20
24
17
17
18
18
19
19
20
25
18
15
18
19
19
20
21
26
18
18
19
19
20
20
22
27
19
19
19
20
20
21
22
28
19
20
20
20
21
22
23
29
20
20
21
21
22
22
24
30
20
21
21
22
22
23
24
31
21
21
22
22
23
24
25
32
22
22
22
23
24
24
26
33
22
23
23
23
24
25
26
34
23
23
23
24
25
25
27
35
23
24
24
25
25
26
27
36
24
24
25
25
26
27
28
37
24
25
25
26
26
27
29
38
25
25
26
26
27
28
29
39
26
26
26
27
28
28
30
40
26
27
27
27
28
29
30
41
27
27
27
28
29
30
31
42
27
28
28
29
29
30
32
43
28
28
29
29
30
31
32
44
28
29
29
30
31
31
33
45
29
29
30
30
31
32
34
46
30
30
30
31
32
33
34
47
30
30
31
31
32
33
35
48
31
31
31
32
33
34
36
49
31
32
32
33
34
34
36
50
32
32
33
33
34
35
37
60
37
38
38
39
40
41
43
70
43
43
44
45
46
47
49
80
48
49
49
50
51
52
55
90
54
54
55
56
57
58
61
100
59
60
60
61
63
64
66
Source : .E.B .Roessler et al.. Journal of Food Science, 1978, 43, 940-947
COPYRIGHT
97
0.05
0.04
0.03
0.02
0.01
0.005
0.001
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
60
70
80
90
100
7
8
8
9
10
10
11
12
12
13
13
14
15
15
16
17
17
18
18
19
20
20
21
21
22
23
23
24
24
25
25
26
27
27
28
28
29
29
30
31
31
32
32
33
39
44
50
55
61
7
8
8
9
10
10
11
12
12
13
14
14
15
16
16
17
17
18
19
19
20
20
21
22
22
23
23
24
25
25
26
26
27
27
28
29
29
30
30
31
31
32
33
33
39
45
50
56
61
7
8
9
9
10
11
11
12
13
13
14
15
15
16
16
17
18
18
19
19
20
21
21
22
22
23
24
24
25
25
26
27
27
28
28
29
30
30
31
31
32
32
33
34
39
45
51
56
62
7
8
9
10
10
11
12
12
13
14
14
15
15
16
17
17
18
19
19
20
20
21
22
22
23
23
24
25
25
26
26
27
28
28
29
29
30
30
31
32
32
33
34
34
40
46
51
57
63
8
9
10
11
11
12
13
13
14
15
15
16
17
17
18
19
19
20
20
21
22
22
23
24
24
25
25
26
27
27
28
28
29
30
30
31
31
32
33
33
34
34
35
41
47
52
58
64
9
10
11
12
12
13
14
14
15
16
16
17
18
18
19
20
20
21
22
22
23
24
24
25
25
26
27
27
28
29
29
30
30
31
32
32
33
33
34
35
35
36
42
48
53
59
65
11
12
13
14
14
15
16
17
17
18
19
19
20
21
21
22
23
23
24
25
25
26
27
27
28
29
29
30
31
31
32
32
33
34
34
35
36
36
37
37
44
50
56
61
67
COPYRIGHT
98
1
161.40
18.51
10.13
7.71
6.61
5.99
5.59
5.32
5.12
4.96
4.84
4.75
4.67
4.60
4.54
4.49
4.45
4.41
4.38
4.35
4.32
4.30
4.28
4.26
4.24
4.22
4.21
4.20
4.18
4.17
4.08
4.00
3.92
3.84
2
199.50
19.00
9.55
6.94
5.79
5.14
4.74
4.46
4.26
4.10
3.98
3.88
3.80
3.74
3.68
3.63
3.59
3.55
3.52
3.49
3.47
3.44
3.42
3.40
3.38
3.37
3.35
3.34
3.33
3.32
3.23
3.15
3.07
2.99
3
215.70
19.16
9.28
6.59
5.41
4.76
4.35
4.07
3.86
3.71
3.59
3.49
3.41
3.34
3.29
3.24
3.20
3.16
3.13
3.10
3.07
3.05
3.03
3.01
2.99
2.98
2.96
2.95
2.93
2.92
2.84
2.76
2.68
2.60
4
224.60
19.25
9.12
6.39
5.19
4.53
4.12
3.84
3.63
3.48
3.36
3.26
3.18
3.11
3.06
3.01
2.96
2.93
2.90
2.87
2.84
2.82
2.80
2.78
2.76
2.74
2.73
2.71
2.70
2.69
2.61
2.52
2.45
2.37
5
230.20
19.30
9.01
6.26
5.05
4.39
3.97
3.69
3.48
3.33
3.20
3.11
3.02
2.96
2.90
2.85
2.81
2.77
2.74
2.71
2.68
2.66
2.64
2.62
2.60
2.59
2.57
2.56
2.54
2.53
2.45
2.37
2.29
2.21
6
234.00
19.33
8.94
6.16
4.95
4.28
3.87
3.58
3.37
3.22
3.09
3.00
2.92
2.85
2.79
2.74
2.70
2.66
2.63
2.60
2.57
2.55
2.53
2.51
2.49
2.47
2.46
2.44
2.43
2.42
2.34
2.25
2.17
2.09
8
238.90
19.37
8.84
6.04
4.82
4.15
3.73
3.44
3.23
3.07
2.95
2.85
2.77
2.70
2.64
2.59
2.55
2.51
2.48
2.45
2.42
2.40
2.38
2.36
2.34
2.32
2.30
2.29
2.28
2.27
2.18
2.10
2.02
1.94
12
243.90
19.41
8.74
5.91
4.68
4.00
3.57
3.28
3.07
2.91
2.79
2.69
2.60
2.53
2.48
2.42
2.38
2.34
2.31
2.28
2.25
2.23
2.20
2.18
2.16
2.15
2.13
2.12
2.40
2.09
2.00
1.92
1.83
1.75
24
249.00
19.45
8.64
5.77
4.53
3.84
3.41
3.12
2.90
2.74
2.61
2.50
2.42
2.35
2.29
2.24
2.19
2.15
2.11
2.08
2.05
2.06
2.00
1.98
1.96
1.95
1.93
1.91
1.90
1.89
1.79
1.70
1.61
1.52
8
254.30
19.50
8.53
5.63
4.36
3.67
3.23
2.93
2.71
2.54
2.40
2.30
2.21
2.13
2.07
2.01
1.96
1.92
1.88
1.84
1.81
1.78
1.76
1.73
1.71
1.69
1.67
1.65
1.64
1.62
1.51
1.39
1.25
1.00
Source : Table 9 is taken from Table V of Fisher and Yates : 1974 Statistical Tables for Biological, Agricultural and Medical
Research published by Longman Group UK Ltd. London (previously published by Oliver and Boyd Ltd. Edinburgh) and by
permission of the authors and publishers.
COPYRIGHT
99
n2\n1
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
40
60
120
8
1
4052
98.49
34.12
21.20
16.46
13.74
12.25
11.26
10.56
10.04
9.65
9.33
9.07
8.86
8.68
8.53
8.40
8.28
8.18
8.10
8.02
7.94
7.88
7.82
7.77
7.72
7.68
7.64
7.60
7.56
7.31
7.08
6.85
6.64
2
4999
99.00
30.81
18.00
13.27
10.92
9.55
8.65
8.02
7.56
7.20
6.93
6.70
6.51
6.36
6.23
6.11
6.01
5.93
5.85
5.78
5.72
5.66
5.61
5.57
5.53
5.49
5.45
5.42
5.39
5.18
4.98
4.79
4.60
3
5403
99.17
29.46
16.69
12.06
9.78
8.45
7.59
6.99
6.55
6.22
5.95
5.74
5.56
5.42
5.29
5.18
5.09
5.01
4.94
4.87
4.82
4.76
4.72
4.68
4.64
4.60
4.57
4.54
4.51
4.31
4.13
3.95
3.78
4
5625
99.25
28.71
15.98
11.39
9.15
7.85
7.01
6.42
5.99
5.67
5.41
5.20
5.03
4.89
4.77
4.67
4.58
4.50
4.43
4.37
4.31
4.26
4.22
4.18
4.14
4.11
4.07
4.04
4.02
3.83
3.65
3.48
3.32
5
5764
99.30
28.24
15.52
10.97
8.75
7.46
6.63
6.06
5.64
5.32
5.06
4.86
4.69
4.56
4.44
4.34
4.25
4.17
4.10
4.04
3.99
3.94
3.90
3.86
3.82
3.78
3.75
3.73
3.70
3.51
3.34
3.17
3.02
6
5859
99.33
27.91
15.21
10.67
8.47
7.19
6.37
5.80
5.39
5.07
4.82
4.62
4.46
4.32
4.20
4.10
4.01
3.94
3.87
3.81
3.76
3.71
3.67
3.63
3.59
3.56
3.53
3.50
3.47
3.29
3.12
2.96
2.80
8
5981
99.36
27.49
14.80
10.29
8.10
6.84
6.03
4.47
5.06
4.74
4.50
4.30
4.14
4.00
3.89
3.79
3.71
3.63
3.56
3.51
3.45
3.41
3.36
3.32
3.29
3.26
3.23
3.20
3.17
2.99
2.82
52.66
2.51
12
6106
99.42
27.05
14.37
9.89
7.72
6.47
5.67
5.11
4.71
4.40
4.16
3.96
3.80
3.67
3.55
3.45
3.37
3.30
3.23
3.17
3.12
3.07
3.03
2.99
2.96
2.93
2.90
2.87
2.84
2.66
2.50
2.34
2.18
24
6234
99.46
26.60
13.93
9.47
7.31
6.07
5.28
4.73
4.33
4.02
3.78
3.59
3.43
3.29
3.18
3.08
3.00
2.92
2.86
2.80
2.75
2.70
2.66
2.62
2.58
2.55
2.52
2.49
2.47
2.29
2.12
1.95
1.79
8
6366
99.50
26.12
13.46
9.02
6.88
5.65
4.86
4.31
3.91
3.60
3.36
3.16
3.00
2.87
2.75
2.65
2.57
2.49
2.42
2.36
2.31
2.26
2.21
2.17
2.13
2.10
2.06
2.06
2.01
1.80
1.60
1.38
1.00
Source : Table 9 is taken from Table V of Fisher and Yates : 1974 Statistical Tables for Biological, Agricultural and Medical
Research published by Longman Group UK Ltd. London (previously published by Oliver and Boyd Ltd. Edinburgh) and by
permission of the authors and publishers.
COPYRIGHT
100
df
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
40
60
120
00
0.01
0.005
0.0005
0.02
31.821
6.965
4.541
3.747
3.365
3.143
2.998
2.896
2.821
2.764
2.718
2.681
2.63
2.624
2.602
2.583
2.567
2.552
2.539
2.528
2.518
2.508
2.5
2.492
2.485
2.479
2.473
2.467
2.462
2.457
2.423
2.39
2.358
2.326
0.01
63.657
9.925
5.841
4.604
4.032
3.707
3.499
3.355
3.25
3.169
3.106
3.055
3.012
2.977
2.947
2.921
2.898
2.878
2.861
2.845
2.831
2.819
2.807
2.797
2.787
2.779
2.771
2.763
2.756
2.75
2.704
2.66
2.617
2.576
0.001
636.619
31.598
12.941
8.61
6.859
5.959
5.405
5.041
4.781
4.587
4.437
4.318
4.221
4.14
4.073
4.015
3.965
3.922
3.883
3.85
3.819
3.792
3.767
3.745
3.725
3.707
3.69
3.674
3.659
3.646
3.551
3.46
3.373
3.2
The value listed in the table is the critical value of t for the number of degrees of freedom listed in the left
column for a one- or two-tailed test at the significance level indicated at the top of each column. If the
observed t is greater than or equal to the tables value, reject Ho.
Source: Table III of Fisher and Yates, Statistical Tables for Biological, Agricultural and Medical
Research, published by Longman Group Ltd, London (previously published by Oliver and Boyd Ltd,
Edinburgh) and by permission of the authors and publishers.
COPYRIGHT
101
df
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
32
34
36
38
40
44
48
52
56
60
0.01
0.005
0.0005
0.02
5.41
7.82
9.84
11.67
13.39
15.03
16.62
18.17
19.68
21.16
22.62
24.05
25.47
26.87
28.26
29.63
31
32.35
33.69
35.02
36.34
37.66
38.97
40.27
41.57
42.86
44.14
45.42
46.69
47.96
50.49
53
55.49
57.97
60.44
65.34
70.2
75.02
79.82
84.58
0.01
6.64
9.21
11.34
13.28
15.09
16.81
18.48
20.09
21.67
23.21
24.72
26.22
27.69
29.14
30.58
32
33.41
34.8
36.19
37.57
38.93
40.29
41.64
42.98
44.31
45.64
46.96
48.28
49.59
50.89
53.49
56.06
58.62
61.16
63.69
68.71
73.68
78.62
83.51
88.38
0.001
10.83
13.82
16.27
18.46
20.52
22.46
24.32
26.12
27.88
29.59
31.26
32.91
34.53
36.12
37.7
39.29
40.75
42.31
43.82
45.32
46.8
48.27
49.73
51.18
62.62
54.05
55.48
56.89
58.3
59.7
62.49
65.25
67.99
70.7
73.4
78.75
84.04
89.27
94.46
99.61
The table lists the critical values of chi square for the degrees of freedom shown at the left for tests
corresponding to those significance levels heading each column. If the observed value of xobs2 is greater
than or equal to the tabled value, reject Ho.
Source: Table IV of Fisher and Yates, Statistical Tables for Biological, Agricultural and Medical
Research, published by Longman Group Ltd, London (previously published by Oliver and Boyd Ltd,
Edinburgh) and by permission of the authors and publishers.
COPYRIGHT
102