Professional Documents
Culture Documents
a r t i c l e i n f o abstract
Article history: This paper introduces the concept of safety-related (SR) uncertainty and the methodology to measure
Received 26 November 2010 SR uncertainty. SR uncertainty is concerned with the effect of parameter uncertainty on the uncertainty
Received in revised form of system unsafety (defined with respect to achieve safety integrity level), which is in direct contrast to
14 September 2011
the effect on overall system uncertainty. The properties of SR uncertainty are discussed and its
Accepted 30 October 2011
significance in analyzing safety systems is highlighted. The conventional global sensitivity analysis
Available online 7 November 2011
(GSA) to handle overall uncertainty is inappropriate when SR uncertainty is of interest. We present and
Keywords: discuss four methods to measure SR uncertainty. Three examples are used to demonstrate the
Safety system effectiveness of the proposed methods in comparison with GSA.
Importance measure
& 2011 Elsevier Ltd. All rights reserved.
Safety-related uncertainty
Global sensitivity analysis
Safety integrity level
0951-8320/$ - see front matter & 2011 Elsevier Ltd. All rights reserved.
doi:10.1016/j.ress.2011.10.015
16 M. Xu et al. / Reliability Engineering and System Safety 99 (2012) 15–23
Fig. 2. Density fY(y) (solid) and conditional density f Y9X i ¼ xn ðyÞ (dashed).
i
achieved safety level is affected by parameter uncertainty. For concluded to be more influential than X2. Two completely opposite
example, safety systems that need to comply with the IEC 61508 results may be obtained, depending on either overall uncertainty or
should reach a certain SIL and thus be considered safe (otherwise SR uncertainty is considered. In practice, GSA techniques become
they are considered unsafe). Usually, a point estimate (i.e. inappropriate when SR uncertainty is of concern. Next, the methods
average) of the probability of failure on demand (or safety to quantify the SR uncertainty are proposed.
probability of a dangerous failure per hour) is used to judge
whether the system achieves the required SIL. However, when the
uncertainty of parameters is considered, the probability of failure 3. Safety-related uncertainty measures
on demand itself becomes a random variable, and its distribution
may enclose more than one SIL. For example in Fig. 1, if SIL-2 is The relevant notations used in this paper are as follows.
the required safety level, the region to the right of the dashed line
(i.e. the failure probability Y 4pSIL2) is considered unsafety. For (1) X ¼ ðX 1 ,X 2 ,. . .,X n Þ A Rn is the set of uncertain input parameters.
safety systems, a minimal unsafety region is desired. The primary (2) Y ¼ gðX Þ, gðX Þ : ED Rn -R is the function relationship
objective of this paper is to identify which parameter influences between output Y and input parameters X , i.e. the known
the unsafety region the most. The uncertainty of the unsafety system model.
region due to parameter uncertainty is called SR uncertainty. (3) x ¼ ðx1 ,x2 ,. . .,xn Þ is a realization of X .
Fig. 2 illustrates the fundamental concept of SR uncertainty. The di (4) f X ðxÞ is the joint density of X .
measure from GSA (Eq. (3)) calculates the overall difference between (5) fXi(xi) is the marginal density of xi.
fY(y) and f Y9X i ¼ xn ðyÞ (the shaded area in Fig. 2(a)), while the SR (6) fY(y) is the density function of the model output Y.
i
uncertainty concerns with the change of unsafety probability when (7) fY9Xi(y) is the conditional density of Y given one parameter Xi
the uncertainty in Xi is eliminated. As shown in Fig. 2 (b), if given being fixed.
X i ¼ xni , the size of conditional unsafety region SY9X i ¼ xn equals to the (8) pSILx is the upper bound under safety integrity level x (x ¼1, 2,
3, 4). Table 1 gives pSIL1 ¼10 1, pSIL2 ¼10 2, pSIL3 ¼10 3 and
i
original unsafety region, we say that the parameter Xi in the value xni
has no contribution to SR uncertainty. Moreover, it is possible that SR pSIL4 ¼10 4 for the low demand mode of system operation.
uncertainty will increase by reducing the uncertainty of certain
parameters, which is impossible for overall uncertainty. These para-
meters have adverse effect on reducing SR uncertainty and should be 3.1. Method 1
ranked as the least important to SR uncertainty, since the existence of
their uncertainty is desired for reduced system unsafety. Therefore, Assume that SILx is the required safety integrity level. Let S be
no effort may be needed to reduce these parameters’ uncertainty. the failure probability of safety system above pSILx:
Z 1
2.3. Overall uncertainty and SR uncertainty may rank the SY ¼ f Y ðyÞdy ð5Þ
pSILx
importance of parameters differently
Further, let SY9X i ¼ xn be the failure probability of safety system
i
Since GSA is focused on the overall uncertainty while SR above pSILx given X i ¼ xni :
Z 1
uncertainty is only concerned with the uncertainty that is
SY9X i ¼ xn ¼ f Y9X i ¼ xn ðyÞdy ð6Þ
directly related to achieved safety, these two methods may i
pSILx i
Fig. 3(a) and (b) show DSdY9X 1 ¼ xn o DSdY9X 2 ¼ xn , and the measure In Eq. (7), xni may simply be taken as the expected value of Xi, i.e.
1 2
xi ¼E(Xi). Note that when considering SR uncertainty, the safety
n
by Borgonovo [1] indicates d1 o d2, i.e. X2 is more influential than
system should satisfy the required SILx (i.e. E[Y]rpSILx). Method
X1. However, the SR uncertainty measure, as shown in
1 quantifies the change in the probability of unsafety if the uncer-
Fig. 3(c) and (d), shows DSY9X 1 ¼ xn 4 DSY9X 2 ¼ xn , i.e. the shift
1 2 tainty in Xi is eliminated.
between fY(y) and f Y9X 1 ¼ xn ðyÞ is greater than the shift between Since SY and SY9X i ¼ xn are the failure probabilities, SYA[0,1] and
1
i
fY(y)and f Y9X 2 ¼ xn ðyÞ with regard to system unsafety. Hence, X1 is SY9X i ¼ xn A ½0,1, and thus M1i takes values in ( N,1]. M1i ¼1
2 i
18 M. Xu et al. / Reliability Engineering and System Safety 99 (2012) 15–23
means complete reduction of the system unsafety (the shaded ‘‘direction’’ of the influence, also similar to M1 i. Therefore,
area in Fig. 4(a)) if the uncertainty in Xi is eliminated and M1i ¼0 the parameter with the highest M10i value is ranked as the
indicates that the uncertainty of Xi has no effect on the system most influential with regard to SR uncertainty.
unsafety. In contrast, M1i o0 denotes increase in the system
unsafety (the shaded area in Fig. 4(b)) if the uncertainty in Xi is 3.2. Method 2
eliminated. In this case, we may prefer to keep the existing
uncertainty in Xi. Hence, the parameter with the highest M1i The definition of Method 2 is given by
value is ranked as the most influential as far as reducing unsafety
probability is concerned. VðX i Þ @SY
M2i ¼ ð9Þ
One natural extension of M1i is to replace SY9X i ¼ xn by the SY @VðX i Þ
i
expectation of SY9Xi with respect to Xi, giving rise to a new
measure M10i : This method measures the rate of change in system unsaf-
R R1 ety due to the change in the variance of X i. If M2i 4 0, reducing
SY f X i ðxi Þ p f Y9X i ðyÞdydxi SY E½SY9X i
M10i ¼ SILx
¼ ð8Þ the uncertainty of X i will reduce system unsafety S Y . On the
SY SY contrary, a negative M2 i suggests an increase of system
In analogous to M1 i, M10i takes values in ( N,1] and its unsafety by reducing the uncertainty of X i. Therefore, the
magnitude quantifies the influence of parameter uncertainty parameter with the highest M2 i value is ranked as the most
on the system unsafety. The sign of M10i denotes the influential with regard to SR uncertainty.
Fig. 3. Density fY(y) and conditional density f Y9X i ¼ xn ðyÞ, i¼ 1,2. (a) and (b) overall uncertainty; (c) and (d): SR uncertainty.
i
3.3. Method 3 is desired, such as 0.001. However, Eqs. (12) and (13) are
stochastic functions, and thus using such a small a is numerically
The third method proposed in this paper is a variance-based unstable unless an extremely large number of MC samples are
measure, and the definition is given by used. In this study, a relatively large value a ¼0.2 is adopted based
@VðSY9X i Þ on empirical study, which will be further discussed along with the
M3i ¼ VðX i Þ ð10Þ results in the next section.
@VðX i Þ
In addition, the sample size is taken as m ¼10 000. To ensure
where V(SY9Xi) is variance of system unsafety with respect to Xi. the robustness of the MC method, N ¼100 replicated simulations
This method measures the change in the variance of system are performed and the average values of the importance measures
unsafety due to the change in the variance of Xi. It should be noted are reported. The choice of these settings gives reliable results,
that M3i measures the absolute change in the variance of system and it is consistent with those reported in the literature [19].
unsafety by varying the uncertainty in Xi, while M2i measures the
relative change in system unsafety by changing the uncertainty in
Xi. If M3i 40 (or M3i o0), the reduction in uncertainty of Xi will 4. Examples
decrease (or increase) the uncertainty of SY. Thus, the parameter
with the largest M3i value poses the greatest influence on the Three examples are selected to demonstrate the application of
uncertainty of SY. the proposed methods, including two simple models and a two-
out-of-three system.
3.4. Numerical computation
4.1. Example 1: A simple example for illustration
The proposed SR uncertainty measures are computed using
Monte Carlo (MC) simulation. For each simulation run, m MC To understand the relationship between the proposed meth-
samples are generated from the distribution of input parameters ods and GSA, consider a simple example given below
f X ðxÞ, based on which the output distribution fY(y), and thus the Y ¼ ðX 1 þ X 2 Þ=11 ð14Þ
system unsafety in Eq. (5) can be approximated. To calculate
SY9X i ¼ xn , we may replace the i-th parameter of all the m samples where the uncertainty of X1 and X2 are given by the following
i
by E(Xi), followed by the computation of the conditional output probability density function:
distribution and thus its integration as in Eq. (6). Then, M1i can be f X 1 ðx1 Þ ¼ Betaðx1 ,2,16Þ ð15Þ
obtained for each input parameter. Similar procedure can be used
for obtaining M10i . f X 2 ðx2 Þ ¼ Betaðx2 ,16,2Þ ð16Þ
The partial derivatives in Methods 2 and 3 are approximated
The distributions of X1 and X2 are positively and negatively
by finite difference. Specifically, the variance of Xi is reduced by a
skewed respectively as shown in Fig. 5. The corresponding
small amount (and denoted by X 0i ):
distribution of model output Y is shown in Fig. 6. The statistical
VðX i Þ ¼ VðX 0i Þ þ aVðX i Þ ð11Þ properties of the parameters and the model output are summar-
and the measures are calculated as follows: ized in Table 2.
Assume that the required SIL is level 1 and the failure
VðX i Þ @SY VðX i Þ SY SY 0 probability greater than pSIL1 ¼10 1 is considered unsafe.
M2i ¼ ð12Þ
SY @VðX i Þ SY aVðX i Þ Table 3 shows the results of M2i and M3i when varying the
parameter a in the finite difference method (Eqs. (12) and (13)).
@VðSY9X i Þ VðSY9X i ÞVðSY9X 0i Þ
M3i ¼ VðX i Þ VðX i Þ ð13Þ The last column in the table refers to the percentage that the
@VðX i Þ aVðX i Þ rankings (from N ¼100 repeated MC simulations) are consistent
where SY0 and VðSY9X 0i Þ are respectively system unsafety and the with the final ranking (from the average of these 100 repetitions).
variance of system unsafety with the variance of Xi being reduced A larger percentage indicates a more stable calculation. Clearly,
by 100a%. when a small a (0.001 or 0.01) is used, finite difference does not
Normally, in finite-difference method where the function to be give stable approximation to the partial derivatives. This phe-
differentiated is deterministic, a small value for a (yet not small nomenon can be rectified, in theory, by using a very large number
enough to be comparable with the computer’s numeric precision) of MC samples. Nevertheless, it is practically more desirable to
choose a relatively large a to achieve reasonable calculation while Table 4 shows that X1 and X2 are equally important according
maintaining a low computational cost. Based on the results in to di and IHi. The two GSA measures are unable to distinguish the
Table 3, a ¼0.2 appears to be a good choice and is adopted for this importance of the two parameters, so is the proposed M10i
example. Furthermore, the same procedure has been carried out measure. However, M1i, M2i and M3i suggest that X1 is more
for all the three examples presented in this paper and the results influential than X2 with regard to SR uncertainty. M1i indicates
all supported the choice of a ¼0.2 (details not reported for the rest that if the uncertainty of X1(X2) is eliminated, the system unsafety
two examples for the sake of conciseness). region is reduced by 90% (34%). M2i denotes higher relative
Besides the four proposed measures, two GSA indicators (d and reduction of system unsafety by reducing the variance of X1
IH) are also calculated and the results are shown in Table 4. (0.42) than that by reducing the variance of X2 (0.29). M3i also
supports the conclusion that X1 is more important than X2 in the
view of SR uncertainty. Note that M10i cannot distinguish X1 and
X2 in this example because SY EE(SY9Xi).
The importance ranking may be potentially used to improve
the system safety by reducing parameter uncertainty. Following
the ranking based on SR uncertainty, the uncertainty of X1 may be
reduced. As an example, suppose that the standard deviation of X1
is reduced from 0.72 10 1 to 0.23 10 1, and the original and
reduced distribution of model output Y are shown in Fig. 7(a). For
comparison, we may choose to reduce the standard deviation of
X2 from 0.72 10 1 to 0.23 10 1 instead of changing that of X1,
and the original and reduced distribution of model output Y are
shown in Fig. 7(b).
As shown in Fig. 7(a), the white and blue bars compose the
original distribution of model output Y, while the white and green
bars represent the distribution of Y with reduced uncertainty of
X1. The reduced SR uncertainty in case 1 is DS1(blue bars on the
right of the dashed line in Fig. 7(a)). Similarly, DS2 in Fig. 7(b) is
the reduced unsafety probability in case 2. Clearly DS1 4 DS2,
suggesting that reducing the uncertainty in X1 is more effective
than reducing the uncertainty in X2 towards reducing the SR
Fig. 6. Distribution of model output Y. uncertainty. In comparison, the overall shift (the blue bars and
green bars) in the two cases are the same, and this is why the GSA
measures, di and IHi that consider overall uncertainty of the
Table 2
system, are unable to distinguish the two input parameters. This
Statistical properties of the parameters and the model output.
example also indicates that the M10i measure may not be appro-
Distribution Mean Standard deviation priate to assess SR uncertainty.
Table 4
Uncertainty importance measures and their ranking (bracketed, ‘‘E’’ refers to equal ranking).
X1 0.35 (E) 0.43 (E) 0.90 (1) 0.00 (E) 0.42 (1) 0.49 (1)
X2 0.35 (E) 0.43 (E) 0.34 (2) 0.00 (E) 0.29 (2) 0.12 (2)
M. Xu et al. / Reliability Engineering and System Safety 99 (2012) 15–23 21
Fig. 7. Distribution of model output Y with reduced uncertainty of (a) X1 and (b) X2.
Table 7
Uncertainty importance measures and their ranking.
Fig. 8. System with two components in series.
IHi M1i M101 M2i M3i
Parameter di (10 1)
(10 8) (10 1) (10 3) (10 1) (10 2)
Table 5
Uncertainty importance measures and their ranking (bracketed, ‘‘E’’ refers to equal lD 2.28 (1) 9.74 (1) 4.09 (1) 0.96 (2) 3.97 (1) 3.19 (1)
ranking). b 1.87 (3) 7.05 (2) 2.00 (2) 0.46 (3) 2.83 (2) 1.92 (2)
bD 0.00 (5) 0.00 (5) 0.00 (5) 0.04 (5) 0.00 (5) 0.00 (5)
Parameter di IHi (10 4) M1i M101 M2i M3i (10 1) DCD 2.22 (2) 3.54 (3) 1.66 (3) 3.37 (1) 0.19 (3) 0.41 (3)
MTTR 0.01 (4) 0.00 (4) 0.00 (4) 0.13 (4) 0.02 (4) 0.00 (4)
X1 0.33 (2) 0.36 (2) 1.00 (1) 0.00 (E) 0.55 (1) 0.45 (1)
X2 0.36 (1) 0.41 (1) 0.40 (2) 0.00 (E) 0.34 (2) 0.10 (2)
The mean of the output distribution, fYPFD(y), is 5.6 10 4. based
on which we assume that the safety system requires SIL-3. The
Table 6 results of the five measures are shown in Table 7.
Parameters used in the 2oo3 model. Table 7 shows that M1i, M2i, M3i and IHi (hereafter the ‘‘four
measures’’) give the same ranking. lD ranks 1st for all the
Parameter Mean Value
measures except M10i . b ranks 2nd according to the ‘‘four mea-
lD (/h) Dangerous failure rate 4 10 6 8 10 7
sures’’ while it ranks 3rd according to di and M10i . DCD ranks 3rd
b Common cause factor for dangerous 0.02 0.2 according to the ‘‘four measures’’ while it ranks 2nd based on di
undetected failures and 1st based on M10i . For all the measures, MTTR ranks 4th and bD
bD Common cause factor for dangerous detected 0.01 0.1 ranks 5th, and their values are far less than the values of other
failures
parameters. Hence, the effect on both overall and SR uncertainty
DCD Safe diagnostic coverage coefficient 0.2 0.9
MTTR (h) Mean Time to Restoration 4 24 due to uncertainty of MTTR and bD is negligible. In the view of the
T1 (y) Proof-test interval 1 most important parameter, the rank given by M10i is very different
from those given by other measures. It appears that M10i can only
identify the group of the most influential parameters (DCD,lD and
b), but it cannot distinguish them in detail. As a result, M10i is not
and observing the change of unsafety probability, similar to the recommended to measure SR uncertainty.
method presented for Example 1. Above discussion also shows that the proposed methods
(expect M10i ) and the two GSA measures give similar results,
4.3. Example 3: Two out of three (2oo3) system though they focus on different aspects of system uncertainty. This
is because the effect of parameter uncertainty with regard to
In this example, a more practical system with a 2oo3 (two-out- overall uncertainty and SR uncertainty is similar in the example.
of-three) architecture, which is widely used in industry, is To further illustrate the importance of SR uncertainty, the
considered [20]: probability distributions of two parameters, b and DCD, are
modified to beta distribution with the following density func-
Y PFD ¼ 3T 1 ðlD ð1bÞð1DC D ÞÞ2 ðT 1 =3 þ MTTRÞ tions:
þ lD ð1DC D ÞðT 1 =2 þ MTTRÞð6lD DC D þ bÞ f b ðxÞ ¼ Betaðx,2,16Þ ð19Þ
þ3ðlD DC D MTTRð1bD ÞÞ2 þ bD lD DC D MTTR ð18Þ
f DC D ðxÞ ¼ Betaðx,1:2,1:8Þ ð20Þ
Table 6 and the values are within the recommend ranges of IEC
61508 standard. The proof-test interval T1 can be fixed to one year which are heavily skewed when compared with the original log-
according to [14]. The other parameters are assumed to follow the normal distribution. By keeping all other settings unchanged, the
conventional lognormal distribution [11], whose mean and var- results are given in Table 8.
iance can be obtained by converting the range in Table 6 (See Table 8 shows that the ranking given by the proposed methods
Appendix A for detail). Subsequently, MC simulation is used to and the two GSA measures are different with regard to the most
calculate the output distribution and the importance measures. and least important parameters. DCD ranks 1st according to M1i
22 M. Xu et al. / Reliability Engineering and System Safety 99 (2012) 15–23
Traditionally, quantitative risk assessment has been focused In addition, the definition of the Gaussian error function is
Z x
on investigating how the uncertainty of input parameters affects 2
that of system output in an overall sense. This paper introduces erf ðxÞ ¼ pffiffiffiffi expðt 2 Þdt ðA:4Þ
p 0
the concept of safety-related uncertainty and highlights its
From Eqs. (A.2)–(A.4) we have
relevance for the analysis of safety systems. The conventional
GSA that provides information about the overall uncertainty is 1
P ¼ erf pffiffiffi lnðFÞ ðA:5Þ
inappropriate to measure SR uncertainty. Therefore, four new 2s
methods are developed in this paper to quantify and rank the Now s can be calculated with the help of the inverse function
impact of individual parameters on SR uncertainty, and they are of the error function
demonstrated through the application to three examples. In the
first two examples, the proposed SR uncertainty measures cor- lnðFÞ
s ¼ pffiffiffi ðA:6Þ
rectly rank the parameters with regard to achieved safety, while 2inverf ðPÞ
the GSA measures either are unable to distinguish the importance The error function and its inverse function are available in
of the two parameters (example 1), or give the opposite conclu- many computation software packages, e.g. Matlab.
sion by considering the overall uncertainty (example 2). In the If the form of duple (minimum, maximum) is given, a similar
third example, the proposed methods and GSA measures obtain approach can be followed by
inconsistent results in particular regarding the most and least rffiffiffiffiffi
important parameters when the distributions of b and DCD are pffiffiffiffiffiffiffiffiffi M
T ¼ mM and F ¼ ðA:7Þ
heavily skewed. The results indicate the need of the proposed m
measures when SR uncertainty is considered. Nevertheless, the
measure M10i appears to be incapable of assessing the parameters’
importance appropriately in the studied examples. References
[8] Saltelli A, Marivoet J. Non-parametric statistics in sensitivity analysis for [14] International Electrotechnical Commission. Functional safety of electrical/
model output: a comparison of selected techniques. Reliability Engineering & electronic/programmable electronic safety-related systems. IEC 61508, Parts
System Safety 1990;28:229–53. 1–7, 1st ed., Geneva, Switzerland; 1998.
[9] Iman RLA. Matrix-based approach to uncertainty and sensitivity analysis for [15] Hora SC, Helton JC. A distribution-free test for the relationship between
fault Trees1. Risk Analysis 1987;7:21–33. model input and output when using Latin hypercube sampling. Reliability
[10] Anand FS, Realff MJ, Lee JH. A risk based approach to estimate key Engineering & System Safety 2003;79:333–9.
uncertainties. In: Proceedings of the 9th international symposium on [16] Christopher Frey H, Patil SR. Identification and review of sensitivity analysis
dynamics and control of process systems, DYCOPS 2010, June 5, 2010–July methods. Risk Analysis 2002;22:553–78.
7, 2010. Leuven, Belgium: Mayuresh Kothare, Moses Tade, Alain Vande [17] Iman RL, Hora SCA. Robust measure of uncertainty importance for use in fault
Wouwer, llse Smets; 2010. p. 569–74. tree system analysis. Risk Analysis 1990;10:401–6.
[11] Rouvroye J. Enhanced markov analysis as a method to assess safety in the [18] Homma T, Saltelli A. Importance measures in global sensitivity analysis of
process. Dutch: Technische Universiteit Eindhoven; 2001. nonlinear models. Reliability Engineering & System Safety 1996;52:
[12] Rouvroye JL, van den Bliek EG. Comparing safety analysis techniques. 1–17.
Reliability Engineering & System Safety 2002;75:289–94. [19] Liu Q, Homma T. A new computational method of a moment-independent
[13] Torres-Echeverrı́a AC, Martorell S, Thompson HA. Design optimization of a uncertainty importance measure. Reliability Engineering & System Safety
safety-instrumented system based on RAMSþ C addressing IEC 61508 2009;94:1205–11.
requirements and diverse redundancy. Reliability Engineering & System [20] Oliveira LF, Abramovitch RN. Extension of ISA TR84.00.02 PFD equations to
Safety 2009;94:162–79. KooN architectures. Reliability Engineering & System Safety 2010;95:707–15.