Professional Documents
Culture Documents
7.1 Introduction
where τ is defined as the change point or threshold of the failure rate function
r(t).
This change point is different from the discrete change point discussed in
the usual literature of the change point models. It is a continuous time in
nature, and its “discrete” realization is the change point location we want to
estimate.
For testing whether a new leukemia therapy produces a departure from
a constant relapse rate after induction of remission, Matthews and Farewell
(1982) proposed to study the above model for the relapse rate of leukemia
patients and estimated the threshold τ by a numerical method. Nguyen,
Rogers, and Walker (1984) estimated the parameters in the above model
using the analysis of mixture model. Matthews, Farewell, and Pyke (1985)
J. Chen and A.K. Gupta, Parametric Statistical Change Point Analysis: With Applications 189
to Genetics, Medicine, and Finance, DOI 10.1007/978-0-8176-4801-5_7,
© Springer Science+Business Media, LLC 2012
190 7 Hazard Function Change Point Model
Fig. 7.1 A typical hazard rate function with one change point
Note that there are several classical estimators of the location τ of the
change point model (7.1) in the literature.
First, an estimator of τ , denoted τ̂ , was obtained in Nguyen et al. (1984)
as the value of τ̂ such that the stochastic process Yn (τ ) approaches 0 at τ̂ ,
where for 0 ≤ r ≤ n − 1, Yn (tr ) = Xn (tr ) with
n
Xn (t) = S(t) (n − r) log − r n−1
n−r
M (t) n
+r − Zn log n−1 ,
n n−r
n
ti
M (t) = ,
i=r+1
n −r
n
t2i 2
S 2 (t) = − M (t) ,
i=r+1
n − r
and
Zn = (t1 + · · · + tn )/n.
For tr ≤ t < tr+1 , Yn (t) is defined by linear interpolation. For t ≥ tn , Yn (t)
is defined by
8
n
L(a, b, τ |D) = fTi (ti ; a, b, τ )
i=1
where
n
R(τ ) = I[Ti ≤τ ] ,
i=1
n
M (τ ) = Ti I[Ti ≤τ ] ,
i=1
where ∞
g(b) = a−1 e−anτ da.
b
(n − 1)! T j j−1 −T b
n−1
R.H.S. of (7.5) = b e . (7.6)
T n j=0 j!
which diverges. For this reason, a restriction on the parameter b was imposed
by Ghosh et al. (1993). This restriction is that for a positive constant b0 ,
we require 0 < b0 ≤ b. With this restriction and the above-obtained joint
posterior π(b, τ |D) of (b, τ ), the posterior density of τ , π(τ |D), was obtained
in Ghosh et al. (1993) for the following three cases.
Note that the integral given by expression (7.7) does not have a closed
form, therefore, the posterior of τ can only be obtained numerically. To assure
the existence of π(τ |D) in this case, Ghosh et al. (1993) have shown that
π(τ |D) is finite, along with some other properties of π(τ |D) evidenced by
Propositions 4.1–4.5 in Ghosh et al. (1993). During the course of establishing
these propositions, another restriction, namely, τ ≥ c > 0 for a constant c
was imposed on τ as if τ = 0; the model is nonidentifiable.
∞
(i − 1)! (Q(τ ))j n−i+j−1 −T b
i−1
π(τ |D) ∝ b e db
b0 (Q(τ ))i j=0 j!
n−i+j−1
(i − 1)! −T b0 (T b0 )k (n − i + j − 1)! (Q(τ ))j
i−1
= e .
(Q(τ ))i j=0
k! j! T n−i+j
k=0
(7.8)
Case 3. When τ ≥ Tn .
In this case, π(τ |D) was obtained by integrating expression (7.6) with
respect to b over the range [b0 , ∞):
(n − 1)! T j j−1 −T b
∞ n−1
π(τ |D) ∝ b e db
b0 T n j=0 j!
⎡ j−1 ⎤
(n − 1)! ⎣
n−1 (T b0 )k
1⎦
= h(b0 ) + e−T b0 , (7.9)
Tn j=1
k! j!
k=0
where ∞
h(b0 ) = b−1 e−T B db.
b0
Due to the constancy of π(τ |D) over the interval [Tn , ∞), Ghosh et al.
(1993) placed a third restriction on τ : τ ≤ d < ∞ for a finite constant d.
Summing all of the considerations above, the prior π(a, b, τ ) on the para-
meters (a, b, τ ) should now be completely restated as
1
π(a, b, τ ) = , 0 < b0 ≤ b < a < ∞, 0 < c ≤ τ ≤ d < ∞.
ab
Under this prior, the posterior π(τ |D) is given by expressions (7.7), (7.8),
and (7.9), respectively, for the three cases of τ . Then, the Bayesian estimate
of the change point τ is obtained by either the posterior mean or posterior
mode. Ghosh et al. (1993) have given computational details on how to obtain
such an estimate using (7.7)–(7.9). Note that the computation itself is quite
tedious as is the case for Bayesian computation. So, in the next section,
an alternative estimate of τ , given in Chen (2003) is presented in light of the
Schwarz information criterion (SIC).
H0 : τ = 0 against H1 : τ > 0.
7.3 The Informational Approach 195
Several test statistics and estimates were proposed for the inference. The most
interesting aspect of the inference is to estimate the true change point τ . For
this reason, a model selection method using SIC proposed in Chen (2003) is
presented herein.
Recall from the previous chapter that the SIC is defined as
+ p log n,
SIC = −2 log L(Θ)
where p is the number of free parameters that need to be estimated under the
model. Its penalty term p log n takes the information from the sample size.
Now, we introduce an estimate for τ based on the SIC. If a lifetime random
variable T has a hazard function given by (7.1), then it is clear that the
probability density function of T is
a exp{−at} 0≤t≤τ
f (t) = . (7.10)
b exp{−aτ − b(t − τ ) τ <t<∞
L(a, b, k)
= L( a, b, k| t(1) , . . . , t(n) )
= fT(1) ,...,T(n) (t(1) , . . . , t(n) )
!⎛ n ⎞
8 k 8
= n! f (t(i) ) ⎝ f (t(j) )⎠
i=1 j=k+1
⎧ ⎫
k ⎨
n ⎬
= n!ak exp −a t(i) · bn−k exp −(n − k)aτ0 − b (t(j) − τ0 ) ,
⎩ ⎭
i=1 j=k+1
where t(1) , . . . , t(n) are the sample realizations of the order statistics T(1) ≤
T(2) ≤ · · · ≤ T(n) . The log of the likelihood function is clearly
a, b, τ0 ) + 3 log n
SIC(k; τ0 ) = −2l(
⎧ ⎡ ⎤⎫
⎨ n ⎬
= −2(n − k) log (n − k) ⎣ t(j) − (n − k)τ0 ⎦
⎩ ⎭
j=k+1
k
− 2k log k (n − k)τ0 + t(i) − 2 log n! + 2n + 3 log n,
i=1
k nj=k+1 t(j) − (n −
k) ki=1 t(i)
τ = , (7.12)
n(n − k)
This estimate (7.12) is optimal by the model selection principle. Three choices
of τ0 are suggested here:
A simulation study is given to show the feasibility of these new estimates τ1 ,
τ2 , and τ3 (with respect to τ01 , τ02 , and τ03 ) in comparison with the estimates
obtained in Ghosh and Joshi (1992; GJ) and in BGJ. The simulation results
are given in Table 7.1, where τ1 , τ2 , and τ3 are the new estimates proposed
in Section 7.2 and τ is the estimate in Table I of GJ. One can observe that
the new estimates τ1 , τ2 , and τ3 are quite compatible with the estimate τ in
GJ and τ1 , τ2 in BGJ; and they are better than the estimates τ3 and τ4 in
BGJ (see the simulation results given in BGJ). However, the new estimates
perform better only when a ≥ 3, a − b ≥ 1, and τ ≥ .15. Thus, it is still
desirable to find good estimates for other values of a, b, and τ . A further
study for the properties of the new estimates τ1 , τ2 , and τ3 is also needed.