Professional Documents
Culture Documents
Papers published in this section do not necessarily reflect the opinion of the Editors, the
Editorial Board and the Publisher.
Marina Patriarca
Ferdinando Chiodo
Estimates of uncertainty of measurement
Marco Castelli from proficiency testing data: a case study
Antonio Menditto
Abstract The results obtained by a for lead in blood over the last 2 years.
Received: 31 October 2005
Accepted: 26 April 2006 laboratory over a number of We also investigated how different
Published online: 27 June 2006 proficiency testing/external quality PT/EQAS features (frequency of
C Springer-Verlag 2006 assessment schemes (PT/EQAS) trials and number of samples) would
rounds can give information on the affect a laboratory estimate of its
Presented at the Eurachem PT Workshop uncertainty of its measurements for a uncertainty. Such information can be
September 2005, Portorož, Slovenia.
given test, provided that conditions helpful in improving PT/EQAS
such as full coverage of the routine organisation and define, for a given
analytical range, traceability, and test: (a) the state of the art of the
small uncertainty of the assigned uncertainty of current measurement
values (compared to the spread of the procedures, (b) identify needs for
results) are met and provided that improvement of analytical
M. Patriarca () · F. Chiodo · systematic deviations and any other methodologies and (c) set targets for
M. Castelli · A. Menditto
Department of Food Safety and sources of uncertainty are considered. acceptable uncertainty values.
Veterinary Public Health, Istituto As organisers of the Italian EQAS
Superiore di Sanità, (ITEQAS) in occupational and Keywords Uncertainty estimate .
Viale Regina Elena 299, environmental laboratory medicine, Lead in blood . External quality
00161 Rome, Italy we tested this hypothesis using as assessment . Uncertainty of assigned
e-mail: marina.patriarca@iss.it
Tel.: + 390649902562 model data from well-performing values
Fax: + 390649903686 laboratories taking part in ITEQAS
Introduction tainty for a given test should be compatible with the spread
of results obtained by that laboratory over a number of
The international standards on accreditation of calibration, PT/EQAS rounds [4]. This implies the PT/EQAS to offer
testing [1] and clinical [2] laboratories require the appli- full coverage of the routine analytical range for real sam-
cants to have procedures for the estimate of the uncertainty ples, traceability to stated references and small uncertainty
of the tests to be accredited. The theoretical principles and of the assigned values, compared to the spread of the re-
methods for the expression of uncertainty in measurement sults. Participants wishing to make such use of PT/EQAS
are laid down in the GUM [3]. Practical applications of data should make sure that systematic deviations from the
those principles to quantitative measurements in analyti- assigned values and any other source of uncertainty not
cal chemistry are described in the EURACHEM/CITAC covered by EQAS (e.g., sampling or storage of real sam-
Guide [4]. This last document, starting from measurement ples) are taken into account by separate assessments.
models and identification of possible uncertainty sources, However, PT/EQAS are organised in different ways, even
exploits the concept of extracting information about un- in the same field of analysis. In a survey of European EQAS
certainty of measurement from data already available in in occupational and environmental laboratory medicine
laboratories operating under a quality management system, (OELM) [5], frequency of rounds, number of samples/trial
such as data from validation studies, internal quality con- and methods to define assigned values varied widely. Few
trol and participation in proficiency testing (PT)/external schemes estimated the uncertainty of the assigned value
quality assessment schemes (EQAS). The rationale for the and/or took it into account in the evaluation of participants’
use of PT/EQAS data to check or estimate a laboratory’s performance. As organisers of the Italian EQAS (ITEQAS)
uncertainty is based on the fact that the estimated uncer- in OELM, we wished to assess whether participants could
475
use their results in EQAS to check or estimate the uncer- the last two years. According to [9], the uncertainty of the
tainty of their measurements for lead in blood. In addition, assigned values was estimated as:
we investigated how different PT/EQAS features would
affect a laboratory estimate of its uncertainty. Such infor- 1.23 × s ∗
mation can be helpful in improving PT/EQAS organisation, ux = √
p
to define the state of the art of the uncertainty of current
measurement procedures and to set targets for acceptable where s ∗ is the robust standard deviation estimated from
uncertainty values for a given test. the results provided by all participants using the Algorithm
A described elsewhere [9 , 10]. The number of participants
in each trial ranged from 55 to 64. All participants used ana-
Materials and methods lytical methods based on electrothermal atomic absorption
spectrometry. Expanded uncertainty was calculated using
Scheme organization a covering factor k = 2 for a confidence interval of 95%.
The METOS Project (from “Metalli Tossici”, Toxic Met-
als) is an EQAS for laboratories performing specialised Estimate of uncertainty of participants’ results
analyses in OELM including lead in blood [6]. All con- from cumulative EQAS data
trol materials for this scheme are prepared in-house, from
animal blood spiked with known amounts of lead. The fre- We used the results obtained by well-performing partici-
quency of trials is currently four times a year with three pants (laboratories, n = 14) in the ITEQAS for lead in blood
unknown samples distributed in each trial. Unknown du- over the last 2 years. Laboratories were selected according
plicates are included on a regular basis. Samples distributed to (a) participation in all trials (n = 8) reporting results for
to laboratories are labelled with randomly generated con- all samples (n = 24); (b) sum of |z-score| ≤ 48; (c) sum of
ventional codes in order to prevent the disclosure of the |z-score| in each trial ≤ 9; (d) average of z-scores not signif-
sample identity to participants. Assigned values are defined icantly different from zero, when compared to its standard
as consensus values (median of all participants’ results for deviation. For each laboratory, the relative difference (Drel )
each sample). The report distributed to participants at the between the result provided (xi ) and the assigned value of
end of each trial includes sample statistics and performance the respective EQA sample (Xref ) was calculated as:
indices for each analysed sample (compliance with stated
acceptability limits and z-score). Acceptability limits are xi − X ref
set by the organisers for each measurand at two concen- Drelx =
X ref
trations and are wider at lower concentrations. Limits for
lead in blood are set as ± 20% at 100 µg l–1 and ± 10% and the uncertainty of measurement was estimated as the
at 800 µg l–1 . Limits at any intermediate concentration standard deviation of the relative differences. Expanded
are obtained by interpolation. Z-scores are calculated in uncertainty was calculated taking a covering factor k = 2
the usual way [7], taking half of the acceptability limit as for a confidence level of 95%.
the standard deviation for proficiency assessment at that For each laboratory, the contribution of imprecision to to-
concentration [6]. tal uncertainty was estimated as pooled standard deviation
of duplicates from the results provided in two different oc-
casions for each of seven samples distributed as unknown
Agreed acceptability limits for lead in blood duplicates, according to the formula:
As part of collaborative work, the Network of Organisers of
EQAS in OELM has agreed common acceptability limits n
(xi1 − xi2 )2
for lead in blood as ± 4 µg dl–1 or ± 10% of the target i=1
concentration, whichever is greater, with a view to reduce SDpooled =
2n
them to ± 3 µg dl–1 or ± 10%, whichever is greater [8].
A huge (compared to the others) difference (184 µg l–1 )
between one set of duplicates (Lab #1) was attributed to
Uncertainty of assigned values occasional contamination of the sample and the set of data
removed.
Since assigned values in ITEQAS are obtained as consen-
sus values (medians) of the participants’ results, it was
not possible to apply a rigorous procedure such as that de- Assessment of the effect of simulated variations of
scribed in the GUM [3] for the estimate of their uncertainty. scheme features on the estimate of laboratory
However, a procedure to obtain an estimate of the uncer- uncertainty
tainty of consensus values has been reported in ISO/FDIS
13528 [9] and this was applied to the values assigned to To test whether the frequency of trials (Tn ) or the number
the samples distributed in ITEQAS for lead in blood over of samples/trial affects the estimate of uncertainty, varia-
476
Table 1 Experimental design. Two sets of data (A bold line and B dashed line) were obtained from the whole database, by selecting trials
(T) or samples (S)
Variation %
1 year apart (set A1: all samples from T1 and T5 ; set A2: 30
all samples from T2 and T6 ; set A3: all samples from T3
and T7 ; set A4: all samples from T4 and T8 ); (B) three 20
sets each including only one sample from each of the eight
trials (set B1: all the S1 Tn ; set B2: all the S2 Tn ; set B3 10
all the S3 Tn ) where S1 , S2 , and S3 are the 1st, 2nd and
3rd samples (ordered by code), respectively. For each sub- 0
set, laboratories uncertainty estimates were obtained as de- 0 200 400 600 800
scribed above. Data were analysed by means of two-ways Assigned values, [Pb], µg l
-1
200
a 0.25
160
0.20
Relative u or SDpooled
120
-1
[Pb], µg l
0.15
80
0.10
40
0.05
0
0.00 0 100 200 300 400 500 600 700 800
#1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #14
Median, [Pb], µg l-1
Laboratora code
Fig. 3 Expanded uncertainty of measurement of two laboratories
b 100
(heavy and light solid line) compared over the concentration range
90 with: a the expanded uncertainty of the assigned values (dotted line);
80
b the acceptability limits defined in ITEQAS (dashed/dotted line);
c the acceptability limits agreed by the Network of Organisers of
(relative DSpooled/u relative)%
50
0 .1 2
40 Variation of uncertainty estimate 0 .1 0
30
0 .0 8
20
0 .0 6
10
0 .0 4
0
#1 #2 #3 #4 #5 #6 #7 #8 #9 #10 #11 #12 #13 #1 4
Laboratory code
0 .0 2
– Set B, laboratories: F = 3.1286 (Fcrit = 2.1192), Fig. 4 Variations of the uncertainty estimates, for each laboratory
(—— lab#1, — ◦ — lab#2, —— lab#3, — — lab#4, ——
p = 0.064, df = 13; sets of samples F = 6.5591 lab#5, —•— lab #6, —— lab#7, —— lab#8, ––lab#9, – ◦ –
(Fcrit = 3.3690), p = 0.049, df = 2. lab#10, ––lab#11, – – lab#12, –– lab #13, –•– lab#14) observed
when data from subsequent exercises (trials: 1 and − 2; 1, 2 and 3;
For each laboratory, the changes of the uncertainty esti- and so on, until all trials from 1–8) were cumulated. Changes are
mates, observed when data from subsequent exercises (tri- shown as the difference to the estimate of the first trial
als: 1 and 2; 1, 2 and 3; and so on, till all trials from 1 to
8) were cumulated, are shown in Fig. 4, as the differences
from the uncertainty value estimated from the data of the Discussion
first trial only.
According to the EURACHEM/CITAC Guide [4], pre-
requisites for the use of PT/EQAS data to estimate the
Assessment of the reliability of the uncertainty uncertainty of measurement are full coverage of the rou-
estimate based on previous trials tine analytical range for real samples, traceability to stated
references and small uncertainty of the assigned values,
The comparison of laboratories’ results accompanied by compared to the spread of the results. In ITEQAS, the
their estimated expanded uncertainty and the assigned concentrations of samples distributed to participants cov-
values showed that, in all cases, the expanded uncer- ered the range of blood Pb levels that may occur following
tainty intervals around the data provided by the labo- environmental or occupational exposure. Robust statistics
ratory overlapped the interval assigned value ± its ex- (consensus median of the participants’ results) are used as
panded uncertainty. An example for the sample at con- the assigned values, checked against sample formulation
centration 401 µg l–1 is given in Fig. 5. Similar plots and the results of homogeneity tests. Consensus values are
were obtained for the samples at concentrations 105.8 and not traceable to SI, and, in some cases, there may be no
205.9 µg l–1 . real consensus amongst the participants or the consensus
may be biased by the general use of faulty methodology.
478
700 Table 2 Ratio between uncertainty of the assigned value (uref ) and
ITEQAS standard deviation for proficiency testing (σ) in comparison
600 with the recommended ISO criteria (uref /σ ≤ 0.3)
σ
[Pb], µg l -1
ally obtained after four consecutive trials. When treating ternational activities are in place to provide the means to
data according to this model, a significant variation of the improve traceability in chemistry and laboratory medicine,
estimated uncertainty suggests a corresponding change in at present in many situations PT/EQAS may still represent
laboratory performance or an analytical problem. In this the only way to demonstrate equivalence of measurements
study, the variations observed for four laboratories when and the only means available to a routine laboratory to trace
data from trial no. 8 were cumulated with the others are its results to an independently agreed value.
due to the greater analytical difficulties experienced with This study also provides some information on the uncer-
a sample at the lowest extreme of the concentration range tainty of measurement of lead in blood in routine labora-
(22 µg l–1 ) included in that trial. The reliability of these tories, in relation with stated acceptability limits for such
uncertainty estimates was independently confirmed by the tests at national and international level. Such information
analysis of the data reported by some of the laboratories in is valuable in order to identify needs for improvement of
a new trial (Fig. 5). analytical methodologies and to set targets for acceptable
uncertainty values for a given test, to the benefit of all
parties, i.e., the laboratory itself, accreditation bodies and
Conclusions end-users of the data.
Provision of information that can be used for a labora-
A model for the estimate of uncertainty from PT/EQAS tory’s estimate of its uncertainty and reduction of the uncer-
data was applied to data from well-performing laboratories tainty of the assigned values could substantially increase
taking part in ITEQAS for lead in blood. The effect of the value of participation in PT/EQAS schemes. This last
frequency of trials and number of samples analysed was issue is of particular value for laboratories with the best
simulated by sampling of datasets. Reliable estimates were performances, in order to obtain full benefit from their par-
obtained when data from at least four subsequent trials were ticipation in PT/EQAS, but the additional costs may result
combined. in reduced effectiveness of the scheme (fewer participants).
The approach of using PT/EQAS data to estimate the In the field of OELM, extensive collaboration is already in
uncertainty of routine analysis is particularly valuable as place among scheme organisers [17, 18] and one of the main
it provides independent information on the performance of issues is to develop common approaches to traceability and
the laboratory over the entire analytical range. Establishing reduced uncertainty of the assigned values at acceptable
traceability and evaluating uncertainty are the most difficult cost [17].
issues to be implemented in routine analytical laboratories.
Surveys of participants in ITEQAS have shown that only Acknowledgements The authors acknowledge the financial sup-
30% of them estimate the uncertainty of their measure- port from the Italian Ministry of Health, Project “Analysis of risk
connected with the presence of residues in food of animal origin -
ment [16]. Similar situations may occur in other countries SARA”, 2003–2006.
or fields of clinical analysis. Although a number of in-
References
1. ISO/IEC 17025 (2005) General laboratory medicine. Ann Super Sanità for the determination of the precision of
requirements for the competence of 32(2):191–316 a standard measurement method. ISO,
testing and calibration laboratories. 6. Patriarca M, Chiodo F, Castelli M, Geneva, Switzerland
ISO, Geneva, Switzerland Corsetti F, Menditto A (2005) 11. Thompson M, Ellison SLR, Wood R
2. ISO 15189 (2003) Medical laboratories Microchemical J 79:337–340 (2006) Pure Appl Chem
– Particular requirements for quality 7. ISO/IEC Guide 43-1 (1997) Proficiency 78(1):145–196
and competence. ISO, Geneva, testing by interlaboratory comparisons 12. Boyer KW, Horwitz W, Albert R (1985)
Switzerland – Part 1: Development and operation of Anal Chem 57:454–459
3. BIPM, IEC, IFCC, ISO, IUPAC, proficiency testing schemes. ISO, 13. Thompson M (2000) Analyst
IUPAP, OIML (1993) Guide to the Geneva, Switzerland 125:385–386
expression of uncertainty in 8. Taylor A, Angerer J, Claeys F, 14. Kristiansen J, Molin Christensen J,
measurement. ISO, Geneva, Kristiansen J, Mazarrasa O, Menditto Nielsen JL (1996) Mikrochim Acta
Switzerland (corrected and reprinted in A, Patriarca M, Pineau A, Schoeters I, 123:241–249
1995) Sykes C, Valkonen S, Weykamp C 15. Patriarca M, Castelli M, Corsetti F,
4. EURACHEM/CITAC (2000) (2002) Clin Chem 48(11):2000–2007 Menditto A (2004) Clin Chem
Quantifying uncertainty in analytical 9. ISO FDIS 13528 (2005) Statistical 50(8):1396–1405
measurement. Second Internet Edition methods for use in proficiency testing 16. Chiodo F, Patriarca M, Menditto A
(last access June 2005) by interlaboratory comparisons. ISO, (2005) Microchemical J
5. Morisi G, Menditto A, Patriarca M, Geneva, Switzerland 79(1–2):341–347
Taylor A (eds) (1996) European 10. ISO 5725-5 (1998) Accuracy (trueness
external quality assessment schemes in and precision) of measurement methods
occupational and environmental and results: Part 5: Alternative methods
480
17. Taylor A, Arnaud J, Angerer J, Claeys and Laboratory Medicine, Portoroz, performance among external quality
F, Jones RL, Mazarrasa O, Menditto A, Slovenia. Accred Qual Assur (this assessment schemes in occupational
Parsons PJ, Patriarca M, Pineau A, issue) and environmental laboratory medicine.
Valkonen S, Weber J-P, Weykamp C 18. Taylor A, Arnaud J, Angerer J, Claeys Eurachem 5th Workshop Proficiency
(2005) Occupational and environmental F, Jones RL, Mazarrasa O, Menditto A, Testing in Analytical Chemistry,
laboratory medicine: a network of Parsons PJ, Patriarca M, Pineau A, Microbiology and Laboratory
EQAS organisers. Eurachem 5th Valkonen S, Weber J-P, Weykamp C Medicine, Portoroz, Slovenia. Accred
Workshop Proficiency Testing in (2005) Quality specifications for Qual Assur (this issue)
Analytical Chemistry, Microbiology evaluation and comparison of