Professional Documents
Culture Documents
Figure 6: Instantaneous unavailability of a component models which did not involve an explicit state space
generation. The applicability of such models, how-
component is and the repair rate is , then the in- ever is limited. They assume stochastic independence
stantaneous unavailability of the component is given among failure and repair of various components. In
1.000000
0.999980
model the temporal behaviour of a system along with
its functional behavior.
0.999970 4.2 Generalized Stochastic Petri Nets
Generalized stochastic Petri nets are an extension
to SPNs which allow transitions to have zero ring de-
0.999960 lay or exponentially distributed ring delay [1]. Both
0.0 50.0 100.0 150.0 200.0 SPNs and GSPNs have been shown to be equivalent
time to CTMCs.
Figure 8: Instantaneous Availability of the network Figure 9 shows the GSPN model for our example
network when no repairs are performed. The left most
practical situations, this assumption does not hold. A
single repair facility shared among all components, dif-
ferent priorities assigned to repair of dierent compo-
nents (i.e., it may be more important to repair a par- a1
ticular node rather than a failed link in our network)
are examples which introduce dependencies among the
a1.dn
Name Function
if (G5 ^ G6 ^ G7 ^ G8)
G1 (#(a1.dn) == 1) (#(a2.dn) == 1) return 0; (system is unavailable)
G2 (#(b.dn) == 1) (#(be.dn) == 1) (#(e.dn) == 1) else
G3 (#(bd.dn) == 1) (#(d.dn) == 1) (#(df.dn) == 1) return 1 (system is available);
G4 (#(ac.dn) == 1) (#(c.dn) == 1) (#(ce.dn) == 1) We saw earlier that both reliability and availability
G5 G1 (#(ab.dn) == 1) (#(b.dn) == 1) G3 evaluation is possible using Fault trees and reliability
graphs as long as the individual component behaviors
G6 G1 G4 (#(e.dn) == 1) (#(ef.dn) == 1)
are stochastically independent. Unfortunately, in real
G7 G1 G2 (#(ab.dn) == 1) (#(ef.dn) == 1) world situations, this is not always true. Let us con-
G8 G1 G2 G3 G4 sider our network again but with only a single repair
facility. This means that whenever a component fails,
Halting Condition it might have to wait for the start of repairs. Fig-
ure 12 shows the SRN model. The repair policy is
if (G5 G6 G7 G8) then disable all the transitions
priority based but non-preemptive. For example, if
the link df failed rst and while it is undergoing re-
Figure 10: SRN model without repair pairs, the node b as well as the links be and ce failed
thus causing the communication to stop. Upon com-
contains places and transitions which represent fail- pletion of the current repair, the facility must choose
ure behavior for each individual component. System one of the already failed nodes or links. Note that if
Single repair (SRN)
RBD
To P 0.999990 Independent Repair (FTREE)
a1 a1.dn a1.rp
From P
To P 0.999980
a2 a2.dn a2.rp
A(t)
From P
To P
0.999970
b b.dn b.rp
From P
0.999960
To P
c c.dn c.rp
From P
0.999950
To P 0.0 100.0 200.0
d d.dn d.rp
time
Figure 13: Instantaneous Availability of the network
From P
To P
e e.dn e.rp
ing its past behavior [29]. If the past history of the pro-
ce ce.dn ce.rp
To P
cess is completely summarized in the current state and
From P
is independent of the current time, then the process is
said to be (time-) homogeneous. Otherwise, the exact
To P
characterization of the present state needs the associ-
ated time information, and the process is said to be
df df.dn df.rp
From P
non-homogeneous. A wide range of real problems fall
To P in the class of Markov models (both homogeneous and
ef ef.dn ef.rp
From P non-homogeneous). However, some important aspects
of system behavior in a dependability model cannot
Figure 12: SRN model with dependent repair be easily captured in a Markov model. The common
characteristic these problems share is that the Markov
property is not valid (if valid at all) at all time in-
the link be is repaired next, the communication will stants. This category of problems is jointly referred
still be stopped. However, if the node b or the link ce to as non-Markovian models and can be analyzed us-
is repaired, the communication can be resumed. In the ing several approaches:
modeled policy, the repairs are performed according to supplementary variables;
preassigned priorities. Other possible policies include
FCFS or even marking dependent repair. For further phase-type expansions; and
examples of dependability modeling using SRN's, the
reader is referred to [22, 21]. Markov renewal theory.
4.4 Results 5.1 Supplementary Variables
The SRN models were solved using the SPNP This method, originally discussed in [12], allows for
(Stochastic Petri Net Package). SPNP [10] provides the solution of dependability models when the lifetime
support for specifying the SRN using a \C" like lan- and/or repair distributions of network components are
guage and allows for the modeler to do steady state, non-exponential. It is the most direct method of solv-
transient, cumulative transient and sensitivity anal- ing the modeling problem and is based on the inclu-
ysis. Figure 13 shows the instantaneous availability sion of sucient supplementary variables in the spec-
for network with a single repair facility overlayed on ication of the state of the system to make the whole
process Markovian. In dependability models the sup- alternate phases (parallel connection) then the over-
plementary variables are the times expended in repairs all distribution is hyperexponential. The basic instru-
and ages of network components. The purpose of the ment when sellecting one of these distributions to rep-
added suplementary variables is to include all neces- resent a non-exponential interval is given by the coef-
sary information about the history of the stochastic cient of variation. The coecient of variation, CX ,
process. The resulting Markov process is in continuous of a random variable is a measure of deviation from
time and has a state space which is multidimensional the exponential distribution and is given by
of mixed type, partly discrete and partly continuous.
Since, after the inclusion of the supplementary vari- X ;
CX = E[X]
ables, the stochastic process describing the system be-
havior is now memoryless, then it is possible to de-
rive the Chapman-Kolmogorov equations describing where X is the standard deviation of the random
the dynamic behavior for such a process. The resul- variable and E[X] is its expectation. This coecient
tant set of ordinary or partial dierential equations varies as follows according to the sellected distribu-
can be dened together with boundary conditions and tion:
analyzed.
5.2 Phase Type Expansions CX Distribution
> 1 Hyperexponential
The use of phase type distributions dates back to 1 Exponential
the pionner work of Erlang on congestion in telephone < 1 Hypoexponential
systems at the beginning of this century [7]. His ap- Erlang
proach (named method of stages), although simple, 0 Deterministic
was very eective in dealing with non-exponential dis-
tributions and has been considerably generalized since Important generalizations of the basic stage devices
then. The age (repair time) of a component is as- are the Coxian distributions, Phase Type (PH), and
sumed to consist of a combination of stages each of Generalized Hyperexponential (GH).
which is exponentially distributed. The whole process
becomes Markovian provided that the description of
the state of the system contains the information as to 5.2.1 Coxian Distributions
which stage of the component state duration has been Cox [13] extended the concept of stages by consider-
reached. The division into stages is an operational ing the class of distributions having rational Laplace
device and may not necessarily have any physical sig-
nicance, and any distribution with a rational Laplace transforms. He showed that the method of stages can
transform can, in principle, be represented exactly by still be employed for this class if one is willing to tol-
a phase type expansion. erate stages having complex roots. The basic struc-
The application of this technique involves the fol- ture of an Coxian distribution is depicted in Figure
lowing steps [27, 19]: 14. In SHARPE terminology, such distributions are
called exponomials.
Selection of a Stage Combination: When the dis-
tribution has a rational Laplace transform, the a a
stage combination can be found by examining the
a1 a 3 n
2
EXP( λ 1) EXP( λ ) EXP( λ )
roots of this tranform. In other cases of known
2 n
b1 b2 b3 bn bn+1
probability distributions or of directly tting the
data, a suitable guess has to be made. Several
stage approximations are described in detail in
[19]. where: a i + bi = 1 for 1<i<n
derivation of its parameters from those of the dis- Figure 14: Coxian distribution structure.
tribution being approximated. There are no gen-
eral explicit formula for directly deriving the stage
model parameters and a numerical solution is re-
quired. 5.2.2 Phase-type Distributions
The basic phase type expansion techniques approx- Another important category of phase type expansions
imate a non-exponential distribution by connecting are the PH distributions. Neuts [23] popularized the
dummy stages with independent and exponential so- class of PH distributions, which correspond to the time
journ time distribution in series or parallel (or com- until absorption in nite dimensional Markov chains
bination of both). A process with sequential phases with at least one absorbing state. That is, F(t) is PH
gives rise to hypoexponential or an Erlang distribu- if it can be written as
tion, depending upon whether or not the phases have
identical distributions. Instead, if a process consists of F(t) = 1 ; eQt 1
where Q is the innitesimal generator matrix of (MRS's), and two other important classes of stochastic
an (n+1)-state CTMC with absorbing state (n+1). processes with embedded MRS's, named semi-Markov
The vector = (1; 2; :::; n) is the vector of initial processes (SMP's) and Markov regenerative processes
state probabilities at t = 0, and the vector 1 is an n- (MRGP's). In this subsection we review the deni-
dimensional column vector of all ones. The entries in tions and some of the concepts of Markov renewal
the generator matrix (qij ; i 6= j) represent the instan- theory. We start with the mathematical denition of
taneous rate of the transition from state i to state j. Markov renewal sequences. We then proceed with a
Each component of eQt 1 corresponds to a phase-type study of processes in which an embedded MRS's can
be identied. We conclude illustrating the equations
distribution that results from starting at a particular that provide the solution of Markov renewal problems
state. Therefore, the CDF F (t) can be interpreted as and suggest references for especic applications of the
a mixture of phase-type distributions, that is, theory solving computer network problems.
n
X
F(t) = i[1 ; (eQt 1)i ]: 5.3.1 Markov Renewal Sequence
i=1 Assume the system we are modeling is described by
The major advantage of using PH distributions a stochastic process Z = fZt; t 2 R+ = [0; 1)g, and
is computational, instead of dealing with dierential we observe that at these particular times the stochas-
equations, complex variables and numerical integra- tic process Z exhibits the Markov property. In this
tion, they can be handled using matrix methods [5]. A scenario we are dealing with a countable collection
drawback of PH distributions is their non-uniqueness of renewal processes progressing simultaneously such
of representation. Many dierent combinations of that successive renewals form a discrete-time Markov
dening parameters lead to the same CDF. chain (DTMC). The superposition of all the identi-
ed renewal processes gives the points fSn ; n 2 Ng,
known as Markov renewal moments, and together with
5.2.3 Generalized Hyperexponential the states of the DTMC denes a Markov renewal
Generalized hyperexponential distribution functions sequence.
were proposed by Botta and Harris [5] and are of the In mathematical terms, the bivariate stochastic
form process (X; S) = fXn; Sn ; n 2 Ng is a Markov re-
newal sequence if it satises
n
F (t) =
X
ai (1 ; e; t);
i
P rfXn+1 = j; Sn+1 ; Sn t j X0 ; :::; Xn; S0 ; :::; Sng =
i=1 P rfXn+1 = j; Sn+1 ; Sn t j Xn g
with
Pn
ai = 1; ai 2 R; i > 0. They are ex- for all n 2 N , j 2 E , t 2: R+ , and such that S0 S1
tensions ofi=1
the hyperexponential distributions, which S2 :::, assuming S0 = 0. In practical problems usu-
are of the same form but without the additional re- ally the MRS is assumed time-homogeneous; that is,
quirements that the coecients fai g be positive. This the conditional transition probabilities Ki;j (t), where
added freedom makes the GH distributions extremelly Ki;j (t) =: PrfXn+1 = j; Sn+1 ; Sn t j Xn = ig
versatile. The GH family has the same computational
advantage over transform methods (e.g., Coxian dis- are independent of n for any i; j 2 E , t 2 R+ . There-
tributions) that the PH family has, namely, avoidance fore, we can always write
of complex arithmetic. Ki;j (t) = P rfX1 = j; S1 t j X0 = ig;
5.2.4 Additional Information and Examples The matrix of transition probabilities K(t) =
Botta, Harris and Marchal [6] provide an excellent fKi;j (t) : i; j 2 E ; t 2 R+ g is called the kernel
comparison among several commonly used classes of of the MRS.
approximating distributions, including Erlang, Cox-
ian and PH types. For examples on how to incor- 5.3.2 Semi-Markov Processes
porate non-exponential distributions into stochastic Given an MRS (X; S) with state space E and kernel
Petri nets we suggest [8]. [19] discusses a complete K(t), we can introduce the counting process
approach to phase approximation, including choice of
phase approximation class, numerical tting of appro-
priate parameters, and implementation of the approx-
N(t) =: sup fn : Sn tg; 8t 2 R +
imation approach in a modeling toolkit. to count the number of Markov renewal moments up
5.3 Markov Renewal Theory to time t, but not considering the one at zero. Using
the counting process just dened, we introduce the
A set of techniques that proved very powerful for process Y = fYt ; t 2 R+ g dened by
the solution of non-Markovian models of computer
networks is based on concepts grouped under the um-
brella of Markov renewal theory [15, 11], a collec- Yt =: XN(t)
tive name that includes Markov renewal sequences = Xn ; if Sn t < Sn+1
called semi-Markov process. An SMP is a stochas- Then, the set of integral equations Vi;j (t) denes a
tic process which moves from one state to another Markov renewal equation, and can be expressed
within a countable number of states with the suc- in matrix form as
cessive states visited forming a discrete-time Markov Z t
chain, and that the time required for each successive V(t) = E(t) + dK(u)V(t ; u)
move is a random variable whose distribution function
may depend on the two states between which the move 0
is being made. From the SMP denition it should be where the Lebesgue-Stieltjes integral is taken term by
observed that the process only changes state at the term. The Markov renewal equation represents a set
Markov renewal moments Sn . of coupled Volterra integral equations of the second
kind and can be solved in time-domain or in Laplace-
5.3.3 Markov Regenerative Processes Stieltjes domain. For a discussion of approaches to
solve these equations see [14, 28]. References for the
A stochastic process Z = fZt ; t 2 R g is called regen- application of Markov renewal theory in the solution
of performance and realiability/availability models of
+
erative if there exist time points at which the process
probabilistically restarts itself. Such random times computers networks are [17, 16, 18]
when the future of Z becomes a probabilistic replica 6 Conclusion
of itself are named times of regeneration for Z. This
concept may be weakened by letting the future after In this paper, we reviewed reliability and availabil-
a time of regeneration depend also on the state of an ity modeling techniques by applying them to a commu-
MRS at that time. We then say that Z is a Markov nication network example. Two categories of models
regenerative process. viz., the non-state space based models which include
MRGP's are stochastic processes fZt ; t 2 R+ g that reliability block diagrams, fault trees and reliability
exhibit embedded MRS's (X,S) with the additional graphs and fault trees and the state space based mod-
property that all conditional nite distributions of els which include stochastic Petri nets and its variants
fZt+S ; t 2 R+ g given fZu ; 0 u Sn ; Xn = ig were discussed. We showed powers and limitations of
are the same as those of fZt ; t 2 R+ g given X0 = i.
n
various models by solving for reliability and availabil-
21
As a special case, the denition implies that ity of the network example. In all of the above, the
failure times and the repair times are assumed to be
PrfZt+S = j j Zu ; 0 u Sn ; Xn = ig = exponentially distributed which may not hold. There-
n
fore, nally, we also described three methods of mod-
PrfZt = j j X0 = ig eling a system which has nonexponentially distributed
failure or repair times.
In contrast to SMP's, state changes are allowed be-
tween two consecutive Markov renewal moments in References
MRGP's. [1] Ajmone-Marsan A., Balbo G. and Conte. G.,
5.4 Solution of Problems A class of generalized stochastic Petri nets for
the performance evaluation of multiprocessor sys-
Let Z = fZt ; t 2 R+ g be an MRGP with
state space F , whose embedded MRS is (X; S) = tems. ACM Trans. Comp. Systems, Vol 2. No. 2,
fXn ; Sn; n 2 Ng with kernel matrix K(t). For such a May 1984, pp. 93-122
process we can dene a matrix of conditional transi- [2] ATM Forum, ATM User-Network Interface Spec-
tion probabilities as: ication, Version 3.0, Prentice Hall (ISBN 0-13-
Vi;j (t) =: PrfZt = j j Z0 = ig; 8i 2 E ; 8j 2 F ; 8t 2 R+ 225863-3), September, 1993.
In many practical problems involving Markov re- [3] ATM Forum, ATM User-Network Interface Spec-
newal processes, our primary concern is nding ways ication, Version 3.1, July 21, 1994.
to eectively compute Vi;j (t) since several measures of [4] Balakrishnan, Meera and Trivedi, K. S., Com-
interest (e.g., reliability and availability) are related to ponentwise Decomposition for an Ecient Re-
the conditional transition probabilities of the stochas- liability Computation of Systems with Re-
tic process. pairable Components, Proc. Twenty-fth Inter-
At any instant t, the conditional transition proba- national Symposium on Fault-Tolerant Comput-
bilities Vi;j (t) of Z can be computed as: ing, Pasadena, CA, July 1995.
Vi;j (t) = PrfZt = j; S1 > t j Z0 = ig + [5] Botta, R.F. and Harris, C.M., Approxima-
XZ t
tion with Generalized Hyperexponential Distri-
dKi;k (u)Vk;j (t ; u) butions: Weak Convergence Results, Queueing
Systems - Theory and Applications, 1(2):169{190,
k2E 0
1986.
for all i 2 E , j 2 F , and t 2 R+ . If we dene matrix [6] Botta, R.F., Harris, C.M., and Marchal, W.G.,
E(t) by Characterizations of Generalized Hyperexponen-
tial Distribution Functions, Communications in
Ei;j (t) =: PrfZt = j; S1 > t j Z0 = ig: Statistics - Stochastic Models, 3(1):115{148, 1987.
[7] Brokemeyer, F., Halstron, H. S. and Jensen, A., [18] Logothetis, D. and Trivedi, K., Time{Dependent
The Life and Works of A.K. Erlang, Transactions Behavior of Redundant Systems with Determinis-
of the Danish Academy of Technical Sciences, 2, tic Repair, in Computations with Markov Chains,
1948. edited by W.J. Stewart, Kluwer Academic Pub-
lishers, Norwell, pp. 135{150, 1995.
[8] Chen, P., Bruel, S.C., and Balbo, G., Alterna-
tive Methods for Incorporating Non-Exponential [19] Malhotra, M. and Reibman, A., Selecting and
Distributions into Stochastic Timed Petri Nets, Implementing Phase Approximations for Semi-
PNPM'89, Kyoto, Japan, pp. 187{197, Decem- Markov Models, Stochastic Models , 9(4), 1993.
ber 11-13, 1989.
[20] Malhotra, M. and Trivedi, K. S., Power-Hierarchy
[9] Ciardo G., Blakemore A., Chimento P. F., Mup- of Dependability Model Types, IEEE Transac-
pala J. K. and Trivedi K. S., Automated genera- tions on Reliability, Vol. 43, No. 2, pp. 493-502,
tion and analysis of Markov reward models using Sept. 1994.
stochastic reward nets. In Linear Algebra, Markov
Chains, and Queueing Models, Carl Meyer and [21] Malhotra, M. and Trivedi, K. S., Dependability
R.J. Plemmons (eds.), IMA Volumes in Mathe- Modeling Using Petri-Nets, IEEE Transactions
matics & its Applications, Vol. 48, pp 145-191, on Reliability, Vol. 44, No. 3, pp. 428-440, Sept.,
Springer Verlag, Heidelberg, 1993. 1995.
[10] Ciardo, G., Muppala J. and Trivedi, K. S., SPNP: [22] Muppala, J., Ciardo, G. and Trivedi, K. S.,
Stochastic Petri Net Package, Proc. Third Int. Stochastic Reward Nets for Reliability Predic-
Workshop on Petri Nets and Performance Models tion, Communications in RMS, July 1994.
(PNPM89), Kyoto, pp. 142 - 151, 1989. [23] Neuts, M.F., Renewal Process of Phase Type,
[11] Cinlar,
E., Introduction to Stochastic Processes, Naval Research Logistics Quartely, 25(3):445{
Prentice-Hall, Englewood Clis, 1975. 454, 1978.
[12] Cox, D.R., The Analysis of Non-Markovian [24] Onvural, Raif O., Asynchonous Transfer Mode
Stochastic Processes by the Inclusion of Sup- Networks: Performance Issues, Artech House,
plementary Variables, Proc. Camb. Philos. Soc., 1995.
51(3): 433{441, 1955. [25] Peterson, J. L., Petri net theory and the modeling
[13] Cox, D.R., Use of Complex Probabilities in the of systems, Prentice Hall, 1981.
Theory of Stochastic Processes, Proc. Camb. Phi- [26] Sahner, Robin A., Trivedi, Kishor S., and Puli-
los. Soc., 51:313{318, 1955. ato, Antonio, Performance and Reliability Anal-
[14] German, R., Logothetis, D., and Trivedi, K., ysis of Computer Systems: An Example-Based
Transient Analysis of Markov Regenerative Approach Using the SHARPE Software Package,
Stochastic Petri Nets: a Comparison of Ap- Kluwer Academic Publishers, 1996.
proaches, in Petri Net Performance Models, [27] Singh, C., Billington, R., and Lee, S.Y., The
PNPM'95, 1995. Method of Stages for Non-Markovian Models,"
[15] Kulkarni, V.G., Modeling and Analysis of IEEE Transactions on Reliability, 26(2):135{137,
Stochastic Systems, Chapman & Hall, London, June 1977.
1995. [28] Telek, M., Bobbio, A., Jereb, L., and Trivedi, K.,
[16] Logothetis, D. and Trivedi, K., Reliability Anal- Steady state analysis of Markov Regenerative
ysis of Various Station Attachment Schemes in a SPN with age memory policy, in Performance
FDDI Token Ring, Proceedings of the IEEE IN- Tools and MMB '95, Heidelberg (Germany),
FOCOM 93, San Francisco, CA, March 1993. 1995.
[17] Logothetis, D. and Trivedi, K., The Eect of De- [29] Trivedi, Kishor S., Probability & Statistics with
tection and Restoration Times for Error Recovery Reliability, Queuing, and Computer Science Ap-
in Communication Networks, MILCOM, 1996. plications, Prentice-Hall, 1982.