Professional Documents
Culture Documents
Among the recent proposed continuous distributions, the truncated inverse Lomax distribution benefits of an
overall simplicity from the mathematical point of view and offers a more realistic scenario for statistical modelling
in comparison to the former inverse Lomax distribution (originally defined on the semi-infinite real line). A new
use of the truncated inverse Lomax distribution is investigated in this study; we exploit its simple structure to
generate original continuous distributions, constituting what we call the “truncated inverse Lomax generated
family.” We expose the merits of this new truncated family, proving several results on stochastic ordering, stress-
strength reliability coefficient, incomplete probability weighted moments, order statistics, and entropy. Also, some
bivariate extensions with the use of copula are discussed. Then, we derive an extension of the (type I) gener-
alized half logistic distribution having pliant probability density and hazard rate functions, making it desirable for
any lifetime data analysis. The parametric estimation of the related model is discussed by the three following
methods: maximum likelihood, least square and weighted least square methods. A simulation study is carried
out, showing the efficiency of these methods. Finally, the practical use of the new model is demonstrated by the
analyzes of two biomedical data. Furthermore, two agricultural data sets related to wheat production in Pakistan
are considered.
Keywords: Truncated Inverse Lomax Distribution, Parametric Estimation, Entropy, Copula, Biomedical Data
Analysis, Agricultural Data Analysis.
J. Med. Imaging Health Inf. Vol. 11, No. 9, 2021 2156-7018/2021/11/2425/015 doi:10.1166/jmihi.2021.3733 2425
RESEARCH ARTICLE J. Med. Imaging Health Inf. 11, 2425–2439, 2021
2426
J. Med. Imaging Health Inf. 11, 2425–2439, 2021 RESEARCH ARTICLE
1.5
gx = e− x 1−e− x −1 x>0
1.0
this distribution can be found in the survey of Ref. [34]. Thus,
based on Eqs. (2) and (3), we introduce the TILEE distribution,
hrf
defined by the following cdf and pdf:
−1 −
F x = 1−2 1+ 1−1−e− x
0.5
x > 0 (7)
α= 2 β = 0.3 θ = 0.2
with F x = 0 for x ≤ 0, and α= 0.5 β = 2 θ = 2
α= 2 β=3 θ=1
−2 α= 0.6 β = 0.6 θ = 2.2
f x = 2 e− x 1−e− x −1 1−1−e− x α= 5 β = 5 θ = 0.7
0.0
−1 −−1
× 1+ 1−1−e− x x > 0 (8)
0.0 0.5 1.0 1.5 2.0 2.5 3.0
x
with f x = 0 for x ≤ 0, respectively. Thus defined, by
taking = 1, the TILEE becomes the type I generalized half Fig. 2. Plots of the hrf of the TILEE distribution for different values of ,
logistic distribution. Hereafter, a random variable X having the and .
cdf given as Eq. (7) (or, equivalently, the pdf given as Eq. (8))
will be denoted by X∼TILEE. with hx = 0 for x ≤ 0, respectively. This function is plot-
As a visual approach, the pdf of the TILEE distribution is ted for different values of , and in Figure 2.
plotted for different values of , and in Figure 1. In Figure 2, we see that the hrf can be (monotonically) increas-
Thus, Figure 1 illustrates the pliant asymmetry of the TILEE ing and decreasing, concave or convex, as well as bathtub shaped.
distribution, also giving some ideas on the data that are poten- These properties are demanding for fitting various kinds of real
tially well fitted by the TILEE model. life data.
Also, the sf and hrf of the TILEE distribution are defined by Last but not least, the quantile function of the TILEE distribu-
−1 − tion is obtained as
F¯ x = 2 1+ 1−1−e− x x>0
1 1/
Qu = − log 1− 1−21−u−1/ −1−1
with F¯ x = 1 for x ≤ 0, and
u ∈ 01 (10)
− x − x −1
− x −2
e 1−e 1−1−e
hx = In this study, we will mainly use it for simulation purposes.
1+1−1−e− x −1
x>0 (9)
3. FUNDAMENTAL PROPERTIES
The TIL-G family possesses interesting mathematical and statisti-
1.0
α = 9 β = 12 θ = 0.6
α = 5 β = 0.02 θ = 0.2
3.1. Stochastic Ordering Results
Two stochastic ordering results of the TIL-G family are now
proved. For further detail on the concept of stochastic ordering,
0.6
we may refer the reader to the book of Ref. [43]. The following
pdf
2−Gx − 2−Gx
0 1 2 3 4 F x = 2 log >0
x 1−Gx 21−Gx
Fig. 1. Plots of the pdf of the TILEE distribution for different values of , Therefore, F x is an increasing function with respect to ,
and . proving the desired result.
2427
RESEARCH ARTICLE J. Med. Imaging Health Inf. 11, 2425–2439, 2021
With the notations and vocabulary of the stochastic ordering Remark 1. In the context of the TILEE distribution as
concept, one can say the following: Let X1 ∼TIL-G with param- described in Subsection 2.2, thanks to the above results, when x
eter 1 and X2 ∼TIL-G with parameter 2 . Then, if 2 ≥ 1 , tends to 0, we have
Proposition 1 gives the following inequality in terms of sfs:
F¯2 x ≤ F¯1 x, meaning that X2 is smaller than X1 in the first
F x ∼ x f x ∼ x−1
2 2
stochastic order: X2 ≤st X1 .
hx ∼ x−1
2
The following result is about a likelihood ratio order property
inherent to the TIL-G family. implying different limits for f x and hx , i.e.,
+, /2, or 0, depending on < 1, = 1 or > 1, respectively,
Proposition 2. Let f x be the pdf given as Eq. (3). Let 2 ≥ and when x → +,
1 . Then, the following ratio function:
F x ∼1−2 e− x f x ∼2 e− x
f x
rx = 2
f1 x hx ∼
is decreasing with respect to x. revealing that f x tends to 0 with an exponential decay.
The shape properties of f x and hx can be analyzed via
Proof. First of all, let us notice that
mathematical formula. That is, the modes of f x are contained
− −1 in the set of roots of the following non-linear equation:
22 2 gx1−Gx−2 1+1−Gx−1 2
rx =
21 1 gx1−Gx−2 1+1−Gx−1 −1 −1 dgx/dx gx gx
−−1 ++1 =0
− gx 1−Gx 2−Gx
= 22 −1 2 1+1−Gx−1 1 2
1 The uni/multimodality nature depends on the number of maxi-
Therefore mum points/modes among these roots, which also depend on
and the parent distribution. A mode, say x0 , satisfies U x > 0 for
d gx
rx = 22 −1 2 1 −2 all x < x0 and U x < 0 for all x > x0 , where U x is the function
dx 1 1−Gx2 defined by
− −1
× 1+1−Gx−1 1 2 < 0 d2 gx/dx2 gx−dgx/dx2
U x =
This ends the proof of Proposition 2. gx2
dgx/dx1−Gx+gx2
With the notations and vocabulary of the stochastic ordering −−1
concept, one can say the following: Let X1 ∼TIL-G with param- 1−Gx2
eter 1 and X2 ∼TIL-G with parameter 2 . Then, if 2 ≥ 1 , dgx/dx2−Gx+gx2
Proposition 2 says that X2 is smaller than X1 in the likelihood ++1
2−Gx2
ratio order: X2 ≤lr X1 .
Propositions 1 and 2 give precious information of immediate Also, the critical points of hx are given as the roots of the
hierarchy between the statistical models derived from the TIL-G following simple non-linear equation:
family, with respect to the parameter . Also, these results are
dgx/dx gx gx
valid for any parent distribution. They are fulfilled for the TILEE + + =0
distribution in particular. gx 1−Gx 2−Gx
They depend on the parent distribution only; has no role here.
3.2. Equivalences and Shapes The number and nature of these solutions are precious indicators
Let us now investigate some analytical properties of the main on the possible shapes of the hrf. The nature of a critical point
functions of the TIL-G family. We begin with the asymptotic of hx, say x∗ , can be determined by the analytical study of the
behavior of F x, f x and hx, aiming to identify the role of following function:
in this regard. When Gx tends to 0, we have
d2 gx/dx2 gx−dgx/dx2
V x =
F x∼ Gx f x∼ gx hx∼ gx gx2
2 2 2
dgx/dx1−Gx+gx2
Therefore, the parameter has proportional influence of the limit +
1−Gx2
of these functions. On the other side, when Gx tends to 1, we
have dgx/dx2−Gx+gx2
+
2−Gx2
F x∼1−2 1−Gx f x∼2 gx1−Gx−1
depending on its sign according to x < x∗ and x > x∗ , or x = x∗ .
hx∼
gx Remark 2. In the TILEE distribution setting, the above equa-
1−Gx tions hold with gx = gx , Gx = Gx , implying
that
Hence, the parameter impacts as a shape parameter on the limit
of f x. It remains a coefficient of proportionality for the limit d
gx = − 2 1−e− x e x −e x −1−2
of hx. dx
2428
J. Med. Imaging Health Inf. 11, 2425–2439, 2021 RESEARCH ARTICLE
where f1 2 x1 x2 denotes the joint pdf of the random vector where Hu x = gxu Gx and
X1 X2 .
−u+v−u
Under the independence assumption between X1 and X2 , the k uv = 2u+v u
following result shows that R can reduce to a simple expression, k
independently of the parent distribution. k+u+v−u
× −1 (11)
Proposition 3. Let us assume that X1 and X2 are independent.
Then, we have Proof. Let Ḡx = 1−Gx be the sf of the parent distribution.
2 Then, after some simplifications, we get
R=
1 +2 f xu F¯ xv
Proof. Let f1 x be the pdf of X1 , and F2 x, F¯2 x and −u+1
= 2u u gxu 1−Gx−2u 1+1−Gx−1
f2 x be the cdf, sf and pdf of X2 , respectively. Then, since X1 −v
and X2 are independent, we have f1 2 x1 x2 = f1 x1 f2 x2 , ×2v 1+1−Gx−1
implying that −u+v−u
= 2u+v u gxu 1−Gx−2u 1+1−Gx−1
+ x1 + −u+v−u
R= f1 x1 f2 x2 dx2 dx1 = F2 x1 f1 x1 dx1 = 2u+v u gxu Ḡxu+v−u 1+ Ḡx
− − −
+ Then, it follows from the generalized binomial theorem applied
= 1− F¯2 x1 f1 x1 dx1 two times in a row that
− −u+v−u
Ḡxu+v−u 1+ Ḡx
Now, based on Eqs. (3) and (4), we get
+
−u+v−u
= Ḡxk+u+v−u
F¯2 x1 f1 x1 k=0
k
−1 +2 −1 +
= 21 +2 1 gx1 1−Gx1 −2 1+1−Gx1 −1 −u+v−u k+u+v−u
= −1 Gx
1 k=0
k
= f x
1 +2 1 +2 1 Hence,
where f1 +2 x denotes the pdf given by Eq. (3) with parameters f xu F¯ xv
1 +2 . Hence, +
−u+v−u
= 2u+v u gxu
1 + 1 2 k=0
k
R = 1− f1 +2 x1 dx1 = 1− =
1 +2 − 1 +2 1 +2 k+u+v−u
× −1 Gx
This ends the proof of Proposition 3.
+
Proposition 3 reinforces the fact that the TIL-G family can = k uvHu x
k=0
be used for various reliability analysis. In particular, the expres-
sion of the stress-strength reliability coefficient is rather simple This ends the proof of Proposition 4.
to make it easily estimable if necessary. One can notice that In particular, Proposition 4 can be applied to have a series
the expression of R holds for any parent distribution; it can be expansion for f x by taking u = 1 and v = 0, f xu by taking
applied for the TILEE distribution in particular. v = 0 and F¯ xv by taking u = 0.
2429
RESEARCH ARTICLE J. Med. Imaging Health Inf. 11, 2425–2439, 2021
Remark 3. In the TILEE distribution framework, we can Table I. Measures of X∼TILEE based on moments for some values of
express the function Hu x = Hu x as , and at =05 and =05.
where
+u−u
m uv = −1m u
u Then, the r-th incomplete moment of X can be derived
m
as r t = EX r 1X∈−t = r0 t. The incomplete moments
Hence, we can derive a series expansion for
allow us to see the shape of the moments of a distribution, which
f x u F¯ x v in terms of simple exponential
is of interest for many areas, including econometrics, finance and
functions. Applications of this result will be presented in the
reliability. See, for instance, Ref. [14].
next.
Also, the r-th crude moment of X can be derived as r =
EX r = r → +. As usual, they are used to provide some
3.5. Incomplete Probability Weighted Moments crucial characteristics of a distribution, including the mean given
This section is devoted to various kinds of moments and related as = 1 , variance obtained as 2 = 2 −2 , in addition to some
measures of the TIL-G family. For the sake of generality, we measures such as the coefficients of variation, skewness and kur-
focus on the incomplete probability weighted moments which tosis defined by
include several other kinds of moments, and appear in the deter-
mination of the incomplete moments for order statistics, as dis- 3 −32 +23
CV = CS =
cussed later. That is, for any integers r and s, and t ∈ , the 3
rs-th incomplete probability weighted moment of X∼TIL-G 4 −43 +62 2 −34
can be defined by CK =
4
t
rs t = E X r F¯ Xs 1X∈−t = xr F¯ xs f xdx respectively. Wide numerical fluctuations of these measures
− ensure the flexibility of a distribution, and pliant statistical mod-
Owing to the expressions in Eqs. (2) and (3), depending on the els can be derived as well. In this regards, Tables I and II present
complexity of the parent distribution, we can consider the direct the values of the first four crude moments, 2 , CV , CS and CK
calculation of rs t. In all the situations, we can provide a of X∼TILEE for selected values of the parameters.
numerical evaluation of it by using a mathematical software. The range of possible values in Tables I and II provide numeri-
Following the methodology presented in Ref. [17], in full gen- cal evidence of the flexibility of the TILEE distribution regarding
erality, a less opaque approach consists in using the series expan- the involved measures. Also, some monotonic comportments are
sion of f xF¯ xs as presented in Proposition 4 (with u = 1 and observed. For instance, for the considered values of the param-
v = s), which gives eters, in Table I, we see that CV , CS and CK increase as
increases and, in Table II, we see that CS and CK decrease as
+ increases.
rs t = k 1s rt
k=0
3.6. Order Statistics
t
where rt = − x H1 xdx. This representation is of inter-
r For n random variables X1 Xn independent and identically
est because all the involved terms are often simply expressible distributed, the m-th order statistic is given by the m-th smallest
and, in most of the cases, a correct approximation of rs t is random variable. The order statistics are involved in the mod-
possible by replacing the infinite limit by any large integer. elling of various lifetime phenomena (see Ref. [9]). Here, some
Remark 4. An application for the TILEE distribution is results on order statistics in the context of the TIL-G family
described below. Based on Remark 3, thanks to the series expan-
sion of H1 x = H1 x , we can express rt as Table II. Measures of X∼TILEE based on moments for some values of
, and at =05 and =05.
+
−r+1
rt = m 1vm+1−r+1 r +1m+1 t =10 =15 =20 =25 =30 =35
m=0
1 3.365 3.704 3.918 4.063 4.165 4.238
2 18.668 21.469 23.467 24.978 26.163 27.111
where 3 123.459 144.933 161.044 173.778 184.161 192.804
+1−1
m 1v = −1m 4 903.311 1072 1202 1308 1396 1470
m 2 7.345 7.749 8.115 8.468 8.813 9.149
x CV
and ax = 0 t a−1 e−t dt is the lower incomplete gamma func- 0.805 0.752 0.727 0.716 0.713 0.714
CS 0.563 0.371 0.238 0.141 0.068 0.013
tion. Hence, a series expansion of rs t follows, depending on CK 2.321 2.123 2.019 1.953 1.903 1.861
lower incomplete gamma functions.
2430
J. Med. Imaging Health Inf. 11, 2425–2439, 2021 RESEARCH ARTICLE
are derived. As first result, the pdf of the m-th order statistic of Remark 5. In view of the previous result, the r-th incomplete
X∼TIL-G, denoted by Xm , is defined as moment of Xm for X∼TILEE can be deduced from Remark 4.
n−1
fXm x = n f xF xm−1 F¯ xn−m x ∈ (12) 3.7. Rényi Entropy
m−1
The Rényi entropy is employed to quantify the overall random-
In expanded form, from Eqs. (2) and (3), fXm x can be ness of a distribution. It finds applications in various fields such
expressed as as statistics, ecology, and quantum information. We may refer the
n−1 1+n−m reader to Ref. [41] and the overview of the entropy concept in
fXm x = n 2 gx1−Gx−2 Ref. [8]. Formally, the Rényi entropy of X∼TIL-G is defined by
m−1
−1+n−m−1 +
1 1
× 1+1−Gx−1 I = log Ef X−1 = log f x dx
− m−1 1− 1− −
× 1−2 1+1−Gx−1 x∈
where > 0 and = 1. For some parent distributions, thanks to
In particular, the pdf of the maximum order statistic, i.e., Xn = Eq. (3), one can hope to determine I via standard integration
supX1 Xn , is given as techniques. A numerical approximation of I is always possible,
−−1 as soon as it exists. For further manipulation or analysis, one can
fXn x = n2 gx1−Gx−2 1+1−Gx−1 provide a series expansion of I through the use of Proposition 4
− n−1 (with u = and v = 0) as
× 1−2 1+1−Gx−1 x∈
+
Also, the pdf of the minimum order statistic, i.e., X1 = 1
I = log k 0
infX1 Xn , is obtained as 1− k=0
−n−1 +
fX1 x = n2n gx1−Gx−2 1+1−Gx−1 where = − H xdx. Again, by replacing the infinite
by any large integer, we obtain a possible precise approximation
x∈
of I .
For further analytical use, a series expansion of fXm x is wel- Remark 6. If the TILEE distribution is considered, based
come, which is the object of the following result. on Remark 3, thanks to the series expansion of H x =
H x , we can express as
Proposition 5. The following series expansion holds:
+
m−1 +
−1
fXm x = jk H1 x = m vm+−1
m=0
j=0 k=0
where where
+−
n−1 m−1 m v = −1m
jk = n −1j k 1j +n−m m
m−1 j Thus, an expression of I involving a series expansion can be
and H1 x = gxGx derived.
Table III displays some numerical values for the Rényi entropy
Proof. Owing to the standard binomial theorem and Proposi-
of X∼TILEE, i.e., I = I , for different values of the
tion 4 (with u = 1 and v = j +n−m), we get
parameters.
fXm x Several other measures of entropy can be defined in a simi-
lar manner, such as the Shannon entropy, q-entropy (also called
n−1 m−1 m−1 Tsallis entropy) or Mathai-Haubold entropy (see Refs. [8] and
=n −1j f xF¯ xj+n−m
m−1 j=0 j [45]). One can also mention the concept of extropy introduced by
Ref. [30]. It represents a complementary dual of entropy, and is
+
n−1 m−1 m−1 defined by J = −1/2e−I2 . Thus, it can be easily deduced from
=n −1j k 1j +n−mH1 x
m−1 j=0 j k=0 the above results.
m−1 +
= jk H1 x Table III. Numerical results of Rényi entropy of X∼TILEE for different
j=0 k=0
values of the parameters.
2431
RESEARCH ARTICLE J. Med. Imaging Health Inf. 11, 2425–2439, 2021
3.8. Bivariate Extensions where = 1 2 , = 1 2 , = 1 2 and ∈ −11, and
One can extend the TIL-G family for bivariate probabilistic or F xy = 0 for x ∈ 0+2 . This new bivariate continu-
statistical purposes through the use of several techniques. Some ous distribution with seven parameters can serve to the modelling
of them are presented below. The most simple way is to consider of two dimensional phenomena or to analyze any bivariate data
a bivariate parent distribution, defined by a bivariate cdf, say with positive values. It also offers an alternative to some bivari-
Gxy. That is, one can define the bivariate TIL-G family by ate extensions of the EE distribution, such as the one developed
the following cdf: by Ref. [29], for instance.
−
F xy = 1−2 1+1−Gxy−1 xy ∈ 2
4. INFERENCE
provided that the parent distribution satisfies the necessary In this section, various kinds of estimates are obtained for the
assumptions. Then, if a vector of random variables XY has parameters of the TILEE model. We consider the three following
this cdf, then X∼TIL-G with the parent distribution Gx→ + methods of estimation: maximum likelihood (ML), least square
and Y ∼TIL-G with the parent distribution G→ +y. (LS) and weighted least square (WLS) methods. All of them are
Another approach consists in using the notion of copula. Thus, described in Table IV, with suggested references, and the follow-
one can define the bivariate TIL-G family by the following cdf: ing notations are adopted: x1 xn are n (independent) observa-
tions from X∼TILEE, and x1 xn correspond to their values
F xy = CF1 xF2 y xy ∈ 2 in increasing order.
The performance of the ML, LS and WLS estimates of
where F1 x and F2 x are defined as Eq. (2), with possible the parameters of the TILEE model is assessed in terms of
different parent distribution, and Cuv is a copula, i.e., a bivari- the sample size n. In this regard, in addition to the esti-
ate function defined on 012 and with values in 01 satisfying mates, we evaluate the mean square errors (MSEs). The sim-
C0u = 0, Cu0 = 0, C1u = u, Cu1 = u and Cu2 v2 + ulation procedure is achieved by using the MATHCAD (14)
Cu1 v1 −Cu1 v2 −Cu2 v1 ≥ 0 for all 0 ≤ u1 ≤ u2 ≤ 1 and package.
0 ≤ v1 ≤ v2 ≤ 1 . Then, if a vector of random variables XY has • A random sample x1 xn of sizes n = 50, 100, 200, 500
the cdf F xy, then X∼TIL-G with the cdf F1 x and Y ∼TIL-G and 1000 are generated from the TILEE distribution.
with the cdf F2 x. Also, the copula Cuv can be defined to • Three sets of the values of parameters are considered
modulate the dependence of X and Y . As examples of copula,
as Set I ( = 0 5 = 0 5 = 0 5), Set II ( = 0 8 = 0 8 =
there are the Farlie-Gumbel-Morgenstern copula defined by
0 5) and Set III ( = 0 3 = 0 3 = 0 5). Then, the ML, LS
and WLS estimates of the parameters of the TILEE model
Cuv = uv +uv1−u1−v uv ∈ 01
are evaluated based on the different sets of parameters and
where ∈ −11 (including the independence copula for = 0), sample sizes.
the Clayton copula defined by • Repeat this process 5000 times and then obtain the means
and MSEs of the ML, LS and WLS estimates for the con-
−1/
Cuv = maxu− +v− −10 sidered values of the model parameters. Empirical results are
F xy
Table V. Estimates and MSEs of the parameters of the TILEE model
−1 −1 for Set I, Set II and Set III at n =50.
= 1−21 1+ 1−1−e− 1 x 1
Set I Set II Set III
−1 −2
× 1−22 1+ 1−1−e− 2 y 2 Method Estimates MSE Estimates MSE Estimates MSE
−1 −1 ML 0.703 0.261 1.176 0.535 0.641 0.257
+21 +2 1−21 1+ 1−1−e− 1 x 1 0.49 0.088 0.792 0.109 0.367 0.044
0.626 0.152 0.784 0.19 0.413 0.056
−1 −2
× 1−22 1+ 1−1−e− 2 y 2 × LS 0.425 0.013 0.45 0.137 0.384 0.014
0.532 0.589 0.531 0.208 0.456 0.084
−1 −1 0.713 0.523 0.792 0.645 0.614 0.269
× 1+ 1−1−e− 1 x 1 WLS 0.356 0.041 0.427 0.149 0.391 0.036
−1 −2 0.423 0.062 0.544 0.961 0.331 0.272
× 1+ 1−1−e− 2 y 2 xy ∈ 0+2 1.167 1.344 0.809 1.35 0.416 0.377
2432
J. Med. Imaging Health Inf. 11, 2425–2439, 2021 RESEARCH ARTICLE
Table VI. Estimates and MSEs of the parameters of the TILEE model Table IX. Estimates and MSEs of the parameters of the TILEE model
for Set I, Set II and Set III at n =100. for Set I, Set II and Set III at n =1000.
reported in Tables V–IX, for n = 50, 100, 200, 500 and 1000, We follow the same numerical setting as above (same sam-
respectively. ple sizes, same sets of parameters…). Various values for
are considered ( = 1 2, 1 5 and 1 8), and the relative bias
From Tables V–IX, the following observations can be formu-
(RB) is used as a benchmark. The obtained numerical results
lated. The MSEs decrease as sample sizes increase, for all the
are collected in Tables X–XII for Set I, Set II and Set III,
estimates, illustrating the efficiency of the methods. Globally, the
respectively.
three methods perform well; it is hard to identify the best one.
From Tables X–XII, we observe that the ML estimates are
However, in the next, thanks to its well-known desirable statisti-
close to the corresponding values of the Rényi entropy, at least
cal features (consistency, asymptotic normality…), we adopt the
for n large. Also, the RBs decrease as sample sizes increase,
ML method for further developments with the TILEE model.
in all cases. This illustrates the nice behavior of the proposed
We complete this part by a simulation study on the Rényi
method.
entropy of X∼TILEE. Here, we estimate I = I by Iˆ =
ˆ ˆ, where ,
ˆ
I ˆ ˆ and ˆ denote the ML estimates of ,
and , respectively. Owing to the invariance property of the ML 5. APPLICATIONS
estimator, it is worth mentioning that Iˆ is also the ML estimate Four data sets are now analyzed in order to assess the
of I , and all the standard properties of the ML estimators can goodness-of-fit of the TILEE model compared with some strong
be applied in this regard (consistence, asymptotic normality…). competitors.
Table VII. Estimates and MSEs of the parameters of the TILEE model 5.1. Biomedical Data
for Set I, Set II and Set III at n =200. In this part, two biomedical data sets of interest are considered.
Set I Set II Set III These data have been used by several authors to show the appli-
Method Estimates MSE Estimates MSE Estimates MSE cability of other competing models. We also provide a forma-
tive evaluation of the goodness of fit of the models and make
ML 0.689 0.151 1.105 0.293 0.579 0.184
0.485 0.05 0.805 0.034 0.332 0.029
comparisons with other models. Then, in order to compare the
0.686 0.136 0.767 0.154 0.452 0.035 models, the following criteria are applied: Akaike information
LS 0.425 0.011 0.457 0.13 0.38 0.013 criterion (AIC), consistent AIC (CAIC), Hannan-Quinn infor-
0.479 0.067 0.602 0.332 0.437 0.047 mation criterion (HQIC), Anderson-Darling (A) and Cramér-von
0.715 0.407 0.749 0.439 0.593 0.229 Mises (W). The model with the minimum values of AIC, CAIC,
WLS 0.367 0.033 0.464 0.12 0.376 0.023 HQIC, W and A is considered to be the best model to fit the
0.455 0.04 0.678 0.245 0.318 0.221
proposed data. The R software is used for all the calculations.
0.905 0.757 0.486 0.478 0.419 0.278
Data set: Survival Times. The first data set was studied by
Ref. [12]. It represents the survival times (in days) of 72 guinea
pigs infected with virulent tubercle bacilli. Based on this data
Table VIII. Estimates and MSEs of the parameters of the TILEE model
for Set I, Set II and Set III at n =500. set, the MLEs of the parameters , and of the TILEE model
are given as
Set I Set II Set III
Method Estimates MSE Estimates MSE Estimates MSE ˆ = 0 593 ˆ = 3 645 ˆ = 1 877
ML 0.664 0.133 1.092 0.284 0.516 0.181
0.475 0.042 0.795 0.033 0.279 0.027 respectively, with the corresponding standard errors (SEs)
0.683 0.135 0.766 0.152 0.453 0.033 given as
LS 0.428 0.011 0.465 0.122 0.382 0.013
0.463 0.062 0.581 0.27 0.443 0.046 SEˆ = 0 373 SEˆ = 1 271 SE ˆ = 0 825
0.682 0.372 0.472 0.385 0.565 0.198
WLS 0.368 0.03 0.756 0.115 0.364 0.017 respectively.
0.466 0.036 0.753 0.134 0.315 0.195
0.889 0.682 0.493 0.451 0.387 0.238
For this data, we compare the fit of the TILEE model with
the following models: Topp-Leone Lomax (TLL) (see Ref. [4]),
2433
RESEARCH ARTICLE J. Med. Imaging Health Inf. 11, 2425–2439, 2021
Table X. ML estimates and RBs for the Rényi entropy of the TILEE model for Set I.
Table XI. ML estimates and RBs for the Rényi entropy of the TILEE model for Set II.
Table XII. ML estimates and RBs for the Rényi entropy of the TILEE model for Set III.
Weibull Lomax (WL) (see Ref. [47]), beta Lomax (BL) (see the estimated and empirical cdfs against each other with the
Ref. [19]), exponentiated Lomax (EL) (see Ref. [1]), Marshall- adjustment of the scatter plot by a suitable line.
Olkin exponential (MOE) (see Ref. [7]), Kumaraswamy expo- In Figures 3–5, the obtained fits are quite acceptable; In
nential (KwE) (see Ref. [16]), generalized Marshall-Olkin Figure 3, we see that the blue line fits the corresponding his-
exponential (GMOE), beta exponential (BE) (see Ref. [36]) togram quite well, the same for the blue line over the correspond-
and Kumaraswamy Marshall-Olkin exponential (KwMOE) (see ing empirical cdf in Figure 4, and in Figure 5 the points of the
Ref. [20]) models. The results giving AIC, CAIC, HQIC, A and
W for all the models are presented in Table XIII.
Since it has the smallest AIC, CAIC, HQIC, A and W, the Histogram and theoretical densities
TILEE model is the best.
Figures 3–5 illustrate the good fits of the TILEE model; TILEE
Figure 3 considers the superposition of the estimated pdf of the
0.4
2434
J. Med. Imaging Health Inf. 11, 2425–2439, 2021 RESEARCH ARTICLE
Empirical and theoretical CDFs Table XIV. Fit behavior criteria for cancer patient.
1.0
Model AIC CAIC HQIC A W
TILEE 836.118 836.311 839.594 0.7699 0.1238
BEBXII 841.268 841.760 855.528 0.900 0.134
0.8
For this data, we aim to compare the fit of the TILEE model
with the following models: the beta exponentiated Burr XII
(BEBXII) (see Ref. [32]), generalized inverse gamma (GIG) (see
0.2
additive Weibull (TAW) (see Ref. [18]) models. The results giv-
0 1 2 3 4 5 6 7 ing AIC, CAIC, HQIC, A and W for all the models are presented
data in Table XIV.
We observe that the TILEE model has the smallest AIC, CAIC,
Fig. 4. Empirical cdf of Survival Times along with the curve (in blue) of the
HQIC, A and W. For this reason, it can be considered as the best.
estimated cdf of the TILEE model.
Figures 6–8 illustrate the accuracy of the TILEE model;
Figure 6 considers the superposition of the estimated pdf of the
P–P plot are near aligned, and not too far from the corresponding TILEE model over the histogram of the data, Figure 7 shows the
P–P line. All those observations demonstrate the usefulness of superposition of the estimated cdf of the TILEE model over the
the TILEE model for the analysis of the considered data. empirical cdf of the data, and Figure 8 displays the related P–P
Data set: Cancer Patient. This data set describes the remission plot with the P–P line.
times (in months) of a random sample of 128 bladder cancer In Figures 6–8, the obtained fits are visually satisfying.
patients studied by Ref. [31]. Based on this data set, the MLEs
of the parameters , and of the TILEE model are given as
5.2. Agricultural Data
ˆ = 21 257 ˆ = 1 03 ˆ = 0 01 Now, the data sets are related to the wheat production in Pak-
istan. As in numerous countries, agriculture is an essential com-
respectively, with the corresponding standard errors given as ponent of the economy of Pakistan, employing 42.3% of the total
SEˆ = 16 738 SEˆ = 0 08 SE ˆ = 0 0072 country’s labor force. One of main agricultural production in
Pakistan remains the wheat production which attains 25 750 mil-
respectively. lion tons during 2016–17. Among all the provinces of Pakistan,
P−P plot
Histogram and theoretical densities
1.0
0.08
TILEE
0.8
Empirical probabilities
0.06
0.6
Density
0.04
0.4
0.02
0.2
0.00
TILEE
0.0
Fig. 5. P–P plot (in blue) and P–P line (in black) for Survival Times for the Fig. 6. Histogram of cancer patient along with the curve (in blue) of the
estimated TILEE model. estimated pdf of the TILEE model.
2435
RESEARCH ARTICLE J. Med. Imaging Health Inf. 11, 2425–2439, 2021
Empirical and theoretical CDFs Table XV. Fit behavior criteria for wheat Punjab.
1.0
Model AIC CAIC HQIC A W
TILEE 638802 662802 63703 0.2728 00414
EKE 659278 703722 65691 0.2788 00428
0.8
respectively.
For this data set, we compare the fit of the TILEE model with
TILEE the following models: exponentiated Kumaraswamy-exponential
0.0
(EKE) (see Ref. [44]), beta exponential (BE) (see Ref. [37]), type
0 20 40 60 80 I generalized half logistic (TIGHL) (see Ref. [39]), exponentiated
data exponential (EE) (see Ref. [22]) and the standard exponential (E)
Fig. 7. Empirical cdf of cancer patient along with the curve (in blue) of the
model. The results giving AIC, CAIC, HQIC, A and W for all
estimated cdf of the TILEE model. the models are presented in Table XV.
From this table, it is clear that the TILEE model has the small-
the Punjab province contains the main production of wheat crop est AIC, CAIC, HQIC, A and W; It is the best for the considered
(around 72%) (and mostly grown on irrigated land), just followed data set.
by the Sindh province (around 14%). Hence, any data analysis For a visual analysis of the fits of the TILEE model, as for
of the wheat production of Pakistan is of great interest, and spe- the previous data sets, we consider the superposition of the esti-
cially, the construction of statistical models, which is the purpose mated pdf of the TILEE model over the histogram of the data
of this part of our study. The two following data sets are consid- in Figure 9, the superposition of the estimated cdf of the TILEE
ered, distinguishing the Punjab and Sindh provinces. model over the empirical cdf of the data in Figure 10, and the
Data set: Wheat Punjab. This data set contains wheat averages in corresponding P–P plot with its P–P line in Figure 11.
Punjab (average yield in mds/acers) from 2006–2019. The data In Figures 9–11, the obtained fits are quite acceptable, sup-
are: 26.03, 28.05, 30.09, 26.43, 29.21, 28.10, 30.85, 29.67, 30.95, porting the results of Table XV.
31.01, 29.95, 33.32, 31.70, 30.67. For Wheat Punjab, the MLEs Data set: Wheat Sindh. This data set contains wheat average
of the parameters , and of the TILEE model are given as in Sindh (average yield in mds/acers) from 2006–2019. The
data are: 29.82, 35.12, 34.87, 34.73, 34.34, 37.91, 36.27, 34.4,
ˆ = 169 03 ˆ = 178 3676 ˆ = 0 1216 36.10, 33.56, 33.6, 33.83, 33.79, 31.88. For this data set, the
TILEE
0.20
0.8
Empirical probabilities
0.15
0.6
Density
0.10
0.4
0.05
0.2
0.00
TILEE
0.0
Fig. 8. P–P plot (in blue) and P–P line (in black) for cancer patient for the Fig. 9. Histogram of Wheat Punjab along with the curve (in blue) of the
estimated TILEE model. estimated pdf of the TILEE model.
2436
J. Med. Imaging Health Inf. 11, 2425–2439, 2021 RESEARCH ARTICLE
TILEE
0.8
0.15
0.6
Density
0.10
CDF
0.4
0.05
0.2
0.00
TILEE
0.0
26 28 30 32 30 32 34 36 38
data data
Fig. 10. Empirical cdf of Wheat Punjab along with the curve (in blue) of the Fig. 12. Histogram of Wheat Sindh along with the curve (in blue) of the
estimated cdf of the TILEE model. estimated pdf of the TILEE model.
1.0
0.8
0.8
Empirical probabilities
0.6
0.6
CDF
0.4
0.4
0.2
0.2
TILEE TILEE
0.0
0.0
Fig. 11. P–P plot (in blue) and P–P line (in black) for Wheat Punjab for the Fig. 13. Empirical cdf of Wheat Sindh along with the curve (in blue) of the
estimated TILEE model. estimated cdf of the TILEE model.
2437
RESEARCH ARTICLE J. Med. Imaging Health Inf. 11, 2425–2439, 2021
Conflicts of Interest
The authors declare no conflicts of interest.
0.6
cial support.
TILEE
References and Notes
0.0
0.0 0.2 0.4 0.6 0.8 1.0 1. Abdul-Moniem, I.B. and Abdel-Hameed, H.F. 2012. On exponentiated Lomax
distribution. International Journal of Mathematical Archive, 3, pp.2144–2150.
Theoretical probabilities 2. Abid, A.H. and Abdulrazak, R.K., 2017. [0, 1] truncated Fréchet-G generator
of distributions. Applied Mathematics, 7, pp.51–66.
Fig. 14. P–P plot (in blue) and P–P line (in black) for Wheat Sindh for the 3. Afify, A.Z., Nofal, Z.M. and Ebraheim, A.N. 2015. Exponentiated transmuted
estimated TILEE model. generalized Rayleigh: A new four parameter Rayleigh distribution. Pakistan
Journal of Statistics and Operation Research, 11, pp.115–134.
4. Al-Shomrani, A., Arif, O., Shawky, A., Hanif, S. and Shahbaz, M.Q., 2016.
Topp-leone family of distributions: Some properties and application. Pakistan
Table XVI. Fit behavior criteria for Wheat Sindh. Journal of Statistics and Operation Research, 12(3), pp.443–451.
5. Aldahlan, M.A., 2019. Type II truncated Fréchet generated family of distribu-
Model AIC CAIC HQIC A W tions. International Journal of Applied Mathematics, 7, pp.221–228.
6. Aldahlan, M.A., Jamal, F., Chesneau, C., Elgarhy, M. and Elbatal, I., 2019. The
TILEE 63245 65645 63068 0384 0.0656
truncated Cauchy power family of distributions with inference and applications.
EKE 649324 693768 64696 03965 0.0671
Entropy, 22, p.346.
BE 639846 663846 63807 0469 0.0788 7. Alice, T. and Jose, K.K., 2004. Marshall-Olkin exponential time series mod-
TIGHL 1302446 1313355 130126 04606 0.0774 elling. STARS International Journal, 5(1), pp.12–22.
EE 850128 861037 84894 05303 0.0887 8. Amigo, J.M., Balogh, S.G. and Hernandez, S., 2018. A brief review of gener-
E 128984 1293173 128925 04626 0.0777 alized entropies. Entropy, 20, p.813.
9. Balakrishnan, N. and Cohen, A.C., 1991. Order Statistics and Inference, New
York, Academic Press.
10. Bantan, R.A.R., Jamal, F., Chesneau, C. and Elgarhy, M., 2019. Truncated
inverted Kumaraswamy generated family of distributions with applications.
6. CONCLUSION Entropy, 21, p.1089.
11. Bantan, R.A.R., Chesneau, C., Jamal, F. and Elgarhy, M., 2020. On the anal-
The constant flow of new data requires the use of increasingly
ysis of new Covid-19 cases in Pakistan using an exponentiated version of the
flexible statistical models. This paper contributes to the subject M family of distributions. Mathematics, 8(953), pp.1–20.
by developing new models deduced from a new family of dis- 12. Bjerkedal, T., 1960. Acquisition of resistance in guinea pigs infected with dif-
ferent doses of virulent tubercle bacilli. American Journal of Epidemiology, 72,
tributions, called the truncated inverse Lomax generated (TIL-G)
pp.130–148.
family of distributions. The main properties of the TIL-G fam- 13. Brito, C.R., Rêgo, L.C., Oliveira, W.R. and Gomes-Silva, F., 2019. Method for
ily are derived and discussed, with an emphasis on a special generating distributions and classes of probability distributions: The univariate
case. Hacettepe Journal of Mathematics and Statistics, 48, pp.897–930.
distribution extending the generalized half logistic distribution,
14. Butler, R.J. and McDonald, J.B., 1989. Using of incomplete moments to mea-
called the truncated inverse Lomax exponentiated exponential sure inequality. Journal of Econometrics, 42(1), pp.109–119.
(TILEE) distribution. A complete parametric estimation study 15. Casella, G. and Berger, R.L., 1990. Statistical Inference, Brooks/Cole Pub-
lishing Company, Bel Air, CA, USA.
of the model derived from the TILEE distribution is proposed
16. Cordeiro, G.M. and de Castro, M., 2011. A new family of generalized distribu-
through maximum likelihood, least squares and weighted least tions. Journal of Statistical Computation and Simulation, 81, pp.883–893.
squares methods, with efficiency validated by simulation study. 17. Cordeiro, G.M., Silva, R.B. and Nascimento, A.D.C. 2020. Recent
Advances in Lifetime and Reliability Models, Bentham Books,
A part is also devoted to the Rényi entropy, exploring the theo-
DOI: 10.2174/97816810834521200101.
retical and practical uses. Two biomedical and two agricultural 18. Elbatal, I. and Aryal, G., 2013. On the transmuted additive Weibull distribution.
data sets of modern interest are analysed. It is observed that the Austrian Journal of Statistics, 42, pp.117–132.
19. Eugene, N., Lee, C. and Famoye, F., 2002. Beta-normal distribution and its
new model fitted these data sets better than a plethora of com-
applications. Communications in Statistics-Theory and Methods, 31, pp.497–
peting models. Altogether, we may claim that the TIL-G family 512.
can serve to construct nice models for analyzing various types of 20. George, R. and Thobias, S., 2019. Kumaraswamy marshall-olkin exponen-
tial distribution. Communications in Statistics-Theory and Methods, 48(8),
data sets.
pp.1920–1937.
21. Gilchrist, W., 2000. Statistical Modelling with Quantile Functions, Chapman
Data Availability Statement and Hall/CRC.
22. Gupta, R.D. and Kundu, D., 2001. Exponentiated exponential family: An
The agricultural data used to support the findings of this study alternative to Gamma and Weibull distributions. Biometrical Journal, 43,
are included within the article. pp.117–130.
2438
J. Med. Imaging Health Inf. 11, 2425–2439, 2021 RESEARCH ARTICLE
23. Hassan, A.S., Sabry, M.A.H and Elsehetry, A.M., 2020. A new family of upper- 39. Olapade, A.K., 2014. The type I generalized half logistic distribution. Journal
truncated distributions: Properties and estimation. Thailand Statistician, 18(2), of the Iranian Statistical Society, 13(1), pp.69–82.
pp.196–214. 40. Raqab, M.Z., Madi, M.T. and Kundu, D., 2008. Estimation of P Y < X for
24. Jamal, F., Bakouch, H.S. and Nasir, M.A., 2019. A truncated general-G class the three-parameter generalized exponential distribution. Communications in
of distributions with application to truncated Burr-G family, RevStat, in press, Statistics-Theory and Methods, 37, pp.2854–2864.
https://hal.archives-ouvertes.fr/hal-01956892/document. 41. Rényi, A., 1960. On Measures of Entropy and Information. Proc. 4th Berkeley
25. Khan, M.S. and King, R., 2013. Transmuted modified Weibull distribution: A Symposium on Mathematical Statistics and Probability, Vol. 1, pp.47–561.
generalization of the modified Weibull probability distribution. European Jour- 42. Rezaei, S., Tahmasbi, R. and Mahmoodi, M., 2010. Estimation of P Y < X
nal of Pure and Applied Mathematics, 6, pp.66–88. for generalized Pareto distribution. Journal of Statistical and Planning Infer-
26. Kleiber, C. and Kotz, S., 2003. Statistical Size Distributions in Economics and ence, 140, pp.480–494.
Actuarial Sciences, Vol. 470, New York, Wiley. 43. Shaked, M. and Shanthikumar, J.G., 2007. Stochastic Orders, New York,
27. Klein, J.P. and Moeschberger, M.L., 2003. Survival Analysis: Techniques for Wiley.
Censored and Truncated Data, Springer, 2nd edition. 44. Silva, R., Silva, G.F., Ramos, M., Cordeiro, G., Marinho, P. and de Andrade,
28. Kumar, D., Singh, U., Singh, S.K. and Mukherjee, S., 2017. The new proba- T.A.N., 2019. The exponentiated Kumaraswamy-G class: General properties
bility distribution: An aspect to a life time distribution. Mathematical Sciences and application. Revista Colombiana de Estadística, 42(1), pp.1–33.
Letters, 6(1), pp.35–42. 45. Singh, S. and Tiwari, S., 2013. An application of generalized entropy in queu-
29. Kundu, D. and Gupta, R.D., 2009. Bivariate generalized exponential distribu- ing theory. Journal of Applied Science and Engineering, 16(1), pp.99–103.
tion. Journal of Multivariate Analysis, 100, pp.581–593. 46. Swain, J.J., Venkatraman, S. and Wilson, J.R., 1988. Least-squares estima-
30. Lad, F., Sanfilippo, G. and Agro, G., 2015. Extropy: Complementary dual of tion of distribution functions in Johnson’s translation system. Journal of Sta-
entropy. Statistical Science, 30(1), pp.40–58. tistical Computation and Simulation, 29(4), pp.271–297.
31. Lee, E.T. and Wang, J., 2003. Statistical Methods for Survival Data Analysis, 47. Tahir, M.H., Cordeiro, G.M., Mansoor, M. and Zubair, M., 2015. The weibull-
Vol. 476, New York, Wiley. lomax distribution: Properties and applications. Hacettepe Journal of Mathe-
32. Mead, M., 2014. A new generalization of Burr XII distribution. Journal of matics and Statistics, 44(2), pp.461–480.
Statistics: Advances in Theory and Applications, 12, pp.53–73. 48. Yadav, A.S., Shukla, S. and Kumari, A., 2019. Statistical inference for trun-
33. Mead, M., 2015. Generalized inverse gamma distribution and its application cated inverse Lomax distribution and its application to survival data. Annals
in reliability. Communications in Statistics-Theory and Methods, 44, pp.1426– of Data Science, to appear, DOI: 10.1007/s40745-019-00235-2.
1435. 49. Yadav, A.S., Singh, S.K. and Singh, U., 2016a. On hybrid censored
34. Nadarajah, S., 2011. The exponentiated exponential distribution: A survey. inverse Lomax distribution: Application to the survival data. Statistica, 76(2),
AStA Advances in Statistical Analysis, 95, pp.219–251. pp.185–203.
35. Nadarajah, S. and Gupta, A.K., 2004. The beta Fréchet distribution. Far East 50. Yadav, A.S., Singh, S.K. and Singh, U., 2016b. Reliability estimation for
Journal of Theoretical Statistics, 14, pp.15–24. inverse lomax distribution under type-II censored data using markov chain
36. Nadarajah, S. and Kotz, S., 2006. The beta exponential distribution. Reliability monte carlo method. International Journal of Mathematics and Statis-
Engineering and System Safety, 91(6), pp.689–697. tics, 17(1), pp.128–146.
37. Nadarajah, S. and Samuel, K., 2006. The beta exponential distribution. Reli- 51. Yadav, A.S., Singh, S.K. and Singh, U., 2019. Bayesian estimation of R =
ability Engineering and System Safety, 91(6), pp.689–697. P Y < X for inverse Lomax distribution under progressive type-II censoring
38. Nelsen, R.B., 2006. An Introduction to Copulas, Springer-Verlag, Berlin, 2nd scheme. International Journal of System Assurance Engineering and Man-
edition. agement, 10(5), pp.905–917.
2439