Professional Documents
Culture Documents
www.elsevier.com/locate/dsw
a
Faculty of Business Administration, Simon Fraser University, Burnaby, BC, Canada V5A 1S6
b
University College of the Fraser Valley, Abbotsford, BC, Canada
c
Faculty of Business Administration, Simon Fraser University, Burnaby, Canada
d
Department of Statistical Sciences, University of Cape Town, Cape Town, South Africa
Received 1 July 1997; accepted 1 October 1998
Abstract
The state of a machine (system) that may experience failures is characterized by the real age of the machine and the
number of failures incurred to date. On failure, the unit may undergo a repair which can partially reset the failure
intensity of the unit. In this paper, it is assumed that repairs can, at most, shift back the failure intensity so as to remove
the most recent run-time sojourn. The other alternative at a failure is to conduct a major overhaul that serves to refresh
the failure intensity of the unit. General cost structures, depending upon both real age and number of failures are
permitted. The decision, on failure to repair or renew is formulated as a discrete semi-Markov decision process with real
age and number of failures as the state space. Optimal decisions are of the threshold type. That is, on the nth failure, if
the real age is above a predetermined threshold value, refresh; otherwise conduct an imperfect repair. Ó 2000 Elsevier
Science B.V. All rights reserved.
0377-2217/00/$ ± see front matter Ó 2000 Elsevier Science B.V. All rights reserved.
PII: S 0 3 7 7 - 2 2 1 7 ( 9 9 ) 0 0 0 0 9 - 0
C.E. Love et al. / European Journal of Operational Research 125 (2000) 398±409 399
virtual age of the system. The repair, following a ure/Type I repairs as a g-renewal density.
failure, serves us to reset the virtual age which Integrating this density up to an arbitrary time, T,
determines the revised failure intensity. In his yields a g-renewal function. The concept of a
original paper, Kijima proposed two repair eect g-renewal density and its integral, the g-renewal
models although in this paper we will focus only function follows directly from the well-known or-
his Type I imperfect repair model leaving attention dinary renewal functions of renewal processes (see
to his Type II imperfect repair model (and exten- Refs. [1,2]). Determination of the g-renewal func-
sions) to a subsequent paper. tion provides a direct estimate of the expected
Consider a system that has experienced its number of failures from time 0 to time T and hence
n ÿ 1th failure and has been repaired. After the an optimal cost rate C
T can be found by mini-
repair, denote its virtual age as v
tnÿ1 , where mizing the function 1=T
C0 C1 H
T , with re-
the denotes virtual age at failure but after spect to T; where C0 is the cost of a single
repair. Upon restart, if the nth system failure replacement, C1 is the cost per repair and H
T is
occurs after a sojourn of xn time units, then the the expected number of failures in
0; T (i.e. the
virtual age before repair at the nth failure can be g-renewal function).
denoted as To determine the g-renewal function, Kijima
proposed a numerical approximation procedure
vÿ
tn v
tnÿ1 xn ;
1 and applied the scheme to an assumed gamma
where tn is the real age of the system at the nth failure process with Type I repairs. We return to
repair, and ÿ denotes virtual age at failure but his numerical work later in this paper by way of
before repair. comparing his results to the semi-Markov decision
In Kijima's Type I imperfect repair model, he analysis developed in this paper.
suggested that upon failure, the repair undertaken Makis and Jardine [7] formulated this failure/
could serve to reset the intensity only as far back Type I repair process as a semi-Markov decision
as the virtual age at the start of the last sojourn. process (for a thorough discussion of such struc-
That is: tures, see Ref. [11]) and demonstrated that, under
suitable conditions, an optimal stationary policy
v
tn v
tnÿ1 hn xn :
2
exists. A two dimensional state space was utilized
Here, hn re¯ects the impact of the nth repair in their formulation, de®ned as (number of failures,
0 6 hn 6 1. If hn 1, then we have a minimal real age). This provides considerable ¯exibility in
repair. If hn 0, then there has been no aging from modelling such problems since, in addition to the
the
n ÿ 1th repair. It is obvious from the above failure process itself, the costs of repair and re-
equation that, since a repair can reset, at a maxi- placement can depend upon this state space. A
mum, the most recent sojourn time, the post-repair further distinction between the Kijima [5] analysis
virtual age is non-decreasing as is the real age. and that of Makis and Jardine [7] is the assump-
Furthermore, if hnP h; 8n; one can immediately tion regarding cycle lengths. Kijima assumed that
n
see that v
tn h j1 xj htn : Thus, in this case, if the prescribed time T had been reached then the
there is a one-to-one correspondence between real system would be immediately shut down and re-
age and post-repair virtual age. placed. We will refer to this as a ®xed-T policy.
The purpose of the original paper of Kijima [4] Makis and Jardine in contrast assumed that re-
was to provide a procedure whereby bounds on the placement occurs at the ®rst failure after time T.
expected real age of a system could be determined This latter assumption follows from the observa-
when repairs follow Type I or Type II processes. In tion of Phelps [9,10] following work of Muth [8]
a subsequent paper Kijima [5] sought to determine that, if replacements upon failure cost the same as
the optimal cycle time between replacements for a replacements at schedule times, then it is always
system subject to a sequence of Type I repairs. To optimal to wait until the ®rst failure following T
solve this problem, utilizing his notion of virtual to carry out the replacement. Such a policy we
aging, Kijima [5] formulated the sequence of fail- will refer to as a T-plus policy, in contrast to the
400 C.E. Love et al. / European Journal of Operational Research 125 (2000) 398±409
Markovian property of the system and the di- ®rst failure after the maximum age (M). The
mensionally of the problem would become un- probability of making such as transition is
manageable.)
Z1
If at state
n; i a repair action
a 1 is taken,
P
n;i
1;M
a 0 f
x dx:
7
the (discretized) p.d.f. of xn1 can be written as
M
4
3. A search algorithm
We also have transitions arising when the sys-
tem, following a repair, restarts in Region 1 but Utilizing state transitions de®ned by Eqs. (4)±
next fails on or beyond the maximum age M. (7) and mean durations de®ned by Eqs. (8) and
These are de®ned by (9), it is clear that we have a semi-Markov state
Z1 transition process. Since in each state, a decision to
either repair or replace must be taken, the resul-
P
n;i
n1;M
a 1 f
hi=n
x dx; n 6 N ÿ 1:
tant structure is a uni-chain, SMDP. Chose an
Mÿi
initial set of sn values. De®ne this set as an initial
5 policy (R). For this arbitrary initial feasible policy
Notice here that we have consolidated all cases (R), solving the following set of simultaneous
where j P M into one of the states (n; M), such that equations will yield the value of such a policy in
the failures occuring beyond M are put at M. terms of the cost per unit time (real age):
Next we have a machine that has traversed into
Regions 2 or 3 and fails. This initiates a replace- w
n; i C1
n; i ÿ g
Rs
n;i
1
ment. One case is that the machine on replace- X
M
ment, has its ®rst failure before the maximum age P
n;i
n1;k
1w
n 1; k;
10
(M) is reached. This probability is ki
Z
j1 where i 6 sn ÿ 1; 1 6 n 6 N ÿ 1;
P
n;i
1;j
a 0 f
x dx; 0 6 j 6 M ÿ 1:
6 w
n; i C0 ÿ g
Rs
n;i
0
j
X
M
The other situation, again beginning with a P
n;i
1;k
0w
1; k;
11
machine that has traversed into Regions 2 or 3 and k0
w
N ; i C0 ÿ g
Rs
N ;i
0 Step 3: Test for state
n; sn ; where sn sn , to
X
M ®nd if:
P
N ;i
1;k
0w
1; k;
12
k0 C1
n; sn ÿ g
Rs
n;sn
1
w
1; 0 0;
13 X M
P
n;sn
n1;k
1w
n 1; k < w
n; sn :
15
where g
R represents the gain or cost per unit ksn
time of this policy and w
n; i represents the value, If Eq. (15) does not apply for
n; sn , then go to
(relative to state
1; 0), of beginning in state
n; i Step 4. Otherwise, determine the largest sn with
and following this policy for an in®nite amount of sn < sn 6 M such that Eq. (15) holds. (If sn M,
time. enlarge the bound M and retest.)
With this as an initial policy (over all states), we Step 4: n n 1, if n 6 N ÿ 1 go to Step 2,
can proceed to determine a policy improvement otherwise go to Step 5.
algorithm to identify improved policies. One ap- Step 5: The new control limit rule R consists of
proach would be to proceed by determining, for the
the parameters values s1 ; s2 ; . . . ; sN ÿ1 . If R R,
each state, whether it is optimal to repair (a 1) algorithm is stopped. Otherwise, go to Step 1 with
or replace (a 0). However since optimal policies R replaced by R. (If sN ÿ1 6 0, enlarge the bound
must be of the control-limit (threshold) type, it is N.)
clear that one can search directly for optimal With this algorithm, we can numerically deter-
control limits (sn ). The following search algorithm mine the optimal control-limit (threshold-type)
can be used to ®nd the parameter values of the policy for the replacement problem under general
optimal control-limit policy. repair. In Fig. 2, we present a ¯owchart of this
algorithm.
3.1. Algorithm
Following the parameter assumptions of of 1 (i.e. repair costs do not vary with n or i). In
Makis and Jardine as well as Kijima, we present Table 1, we provide a comparison of our discrete
analyses for the case of k 3, a ®xed replacement SMDP model with the g-renewal approach of
cost (C0 ) of 2 and a constant repair cost (C1
n; i) Makis and Jardine. These restrictions on repair
406 C.E. Love et al. / European Journal of Operational Research 125 (2000) 398±409
Table 2
a k 3; C0 2; C1
n; i p
i=n1:5 1
p s1 s2 s3 s4 s5 s6 s7 sa8 g
R
0.05 1.0 1.0 1.0 1.0 1.0 1.0 1.0 0 1.836
0.2 0.75 0.75 0.75 0.80 0.80 0.80 0.80 0 1.869
0.5 0.65 0.65 0.65 0.65 0.65 0.65 0.65 0 1.903
a
With the assumed data, the largest failure number, N, needed was 8. Thus on the 8th failure the threshold real time is 0, implying
replacement.
C.E. Love et al. / European Journal of Operational Research 125 (2000) 398±409 407
Table 3
a k 3; C0 2; C1
n; i 0:1nr 1
r s1 s2 s3 s4 s5 s6 s7 sa8 g
R
1.05 1.15 0.80 0.55 0.35 0.20 0.10 0 0 1.839
1.2 1.15 0.80 0.60 0.45 0.30 0.20 0.15 0 1.836
a
With the assumed data, the largest failure number, N, needed was 8.
and the degree of repair
h. We see that, in gen- Appendix A. Transition structure for failure pro-
eral, the form of the optimal (repair/replacement) cesses with Type I repairs
policy is of the threshold type such that at the nth
failure, if the real age is above a critical value, We assume that repairs are of Type I such that
replace, otherwise undertake an imperfect repair. the virtual age of the system following the nth re-
We also see that this discrete SMDP is very ¯ex- pair is htn : In order to discretized our process we
ible, permitting state-dependent cost structures for introduce a parameter n to represent the number
both repair and replacement costs. In this paper of time slices in a unit of time. With this parame-
we have assumed the failure intensity depends ter, real age, tn is discretized into in non-overlap-
only on the virtual age of the machine. Although ping ages such that the discretized virtual age after
we have not explored the issue in this paper, with repair is denoted as hi=n (where here we have
few changes, the failure intensity could be made to suppressed the failure number, n for ease of pre-
depend on the state (age, number of failures) as sentation). Clearly then, increasing n allows us to
well. increase the state space along the time dimension
We have also limited this paper to ®nding op- (in ) and to achieve any level of accuracy desired. Of
timal T-plus policies [9]. It is clear however that course there is no need to discretize n (the number
with few modi®cations to the transition probabil- of failures to date) since it is a discrete valued
ities and duration times (Eqs. (4)±(9)) we could function.
determine optimal ®xed-T policies as well. Our numerical work in Section 4 utilizes the
We have only addressed the case of Type I Gamma distribution. Utilizing this n discretization
imperfect repairs. Kijima also proposed a Type II approach, Eq. (3) for the Gamma distribution,
imperfect repair model in which the repair eect becomes
served to operate on the full virtual age of the
system up to that time. Other virtual age models f
hi=n
x
a 1 aÿ1
are also possible and may be important in par- k
1 k
n
hi x exp
ÿ n
x hi
ticular situations (see Refs. [3,6]). It is clear that R khi=n :
C
a 1 ÿ C
a exp
ÿu uaÿ1 du
for Type II repairs, the semi-Markov construc- 0
tion would require the introduction of a virtual The transition probabilities (Eqs. (4)±(7))
age variable into the state space. The general become:
formulation here then would require either two P
n;i;
n1;j
1
state variables; real age and virtual age and
possibly a third state variable (number of failures R
a;
k=n
j 1 ÿ
1 ÿ hi ÿ R
a;
k=n
j ÿ
1 ÿ hi
;
to date) in order to properly characterize the 1 ÿ R
a;
khi=n
system. Kijima Type II imperfect repairs and
where R
a; x is the
R x incomplete Gamma function
extensions appear to have considerable potential
de®ned by 1=C
a 0 exp
ÿu uaÿ1 dt;
in capturing repair eects. Research into the
construction and computation demands of ap- P
n;i;
n1;M
1
propriate discrete SMDP models for this case are
1 ÿ R
a; kn
M ÿ
1 ÿ hi R
a; khi
n
ongoing and the results will be presented in a ;
subsequent paper. 1 ÿ R
a; khi
n
1 ÿ R
a; khi
n
408 C.E. Love et al. / European Journal of Operational Research 125 (2000) 398±409
k kj P
n;i;
n1;j
1
P
n;i;
1;j
0 R a;
j 1 ÿ R a; ;
n n a a
i
j ÿ i hi
exp kh exp ÿ k
kM n n
P
n;i;
1;M
0 1 ÿ R a; : a
n
j ÿ i 1 hi
ÿ exp ÿ k ;
n
The mean duration times (Eqs. (8) and (9)) are:
P
n;i;
n1;M
1
2 a
khi
exp
ÿ khi a a
n 4 n n i
M ÿ i hi
s
n;i
1 exp kh exp ÿ k ;
1 ÿ R
a; khi
n
kC
a n n
3 P
n;i;
1;j
0
a a
a hi khi 5; i
j 1
ÿ 1 ÿ R a; exp ÿ k ÿ exp ÿ k ;
k n n n n
a
M
an P
n;i;
1;M
0 exp ÿ k :
s
n;i
0 : n
k
Finally, the mean duration times (Eqs. (8) and
These transition probabilities and times for (9)) are:
the Gamma distribution can be used directly in
our search algorithm of Section 3 and are utilized s
n;i
1
a a
in Section 4 of the paper to determine optimal exp
khi=n
n=kaC
1=a
1 ÿ R
1=a;
khi=n
policies and optimal cost per unit time for the and
system.
One sees very often in the published literature s
n;i
1
the use of the Weibull distribution in reliabilities a a
exp
khi=n
n=kaC
1=a
1 ÿ R
1=a;
khi=n
studies. For example, Phelps [9] utilized this dis-
tribution in his comparisons of various repair/re- respectively.
placement policies under the assumption of
minimal repairs. Given its frequent application, in
the interests of completeness, we brie¯y present References
here the necessary transition structure for this
distribution as well. We assume the lifetime dis- [1] H. Ascher, H. Feingold, Repairable Systems Reliability:
Lecture Notes in Statistics, Marcel Dekker, New York,
tribution of a new system following a Weibull 1984, pp. 33±34.
a
distribution has density f
t ka ataÿ1 exp
ÿ
kt ; [2] C.L. Chaing, An Introduction to Stochastic Processes and
a
distribution function F
t 1 ÿ exp
ÿ
kt and their Applications, Krieger Publishing, New York, 1980,
mean ®rst passage time of
1=k
C
a 1=a pp. 172±206.
where k is the scale parameter of the Weibull, a is [3] R. Guo, C.E. Love, Simulating non-homogeneous Poisson
processes with proportional intensities, Naval Research
the shape parameter and again, C
is the Gamma Logistics Quarterly 41 (1994) 507±522.
function. [4] M. Kijima, Some results for repairable systems with
Utilizing the same discretization approach, general repair, Journal of Applied Probability 26 (1989)
Eq. (3) for the Weibull distribution, becomes, 89±102.
aÿ1 a [5] M. Kijima, H. Morimura, Y. Suzuki, Periodical replace-
f
hi=n
x ka
k=n
x hi exp
ÿ
k=n
x hi
a ment problem without assuming minimal repair, European
khi=n . Journal of Operational Research 37 (1988) 194±203.
The transition probabilities equivalent to [6] C.E. Love, R. Guo, Simulation strategies to identify the
Eqs. (4)±(7) are: failure parameters of repairable systems under the in¯u-
C.E. Love et al. / European Journal of Operational Research 125 (2000) 398±409 409
ence of general repair, Quality and Reliability Engineering [9] R.I. Phelps, Replacement policies under minimal repair,
International 10 (1994) 37±47. Journal of the Operational Research Society 32 (1981)
[7] V. Makis, A.K.S. Jardine, A note on optimal replacement 549±554.
policy under general repair, European Journal of Opera- [10] R.I. Phelps, Optimal policy for minimal repair, Journal of
tional Research 69 (1993) 75±82. the Operational Research Society 34 (1983) 425±427.
[8] E.J. Muth, An optimal decision rule for repair vs [11] H.C. Tjims, Stochastic Models: An Algorithmic
replacement, IEEE Transactions on Reliability R26 Approach, Wiley, New York, 1994, pp. 218±248.
(1977) 179±181.