Professional Documents
Culture Documents
https://doi.org/10.1007/s10700-018-9298-z
Abstract
Time series analysis is a method to predict future values based on previously observed
values. Assuming the observed values are imprecise and described by uncertain vari-
ables, this paper proposes an approach of uncertain time series. By employing the
principle of least squares, a minimization problem is derived to calculate the unknown
parameters in the uncertain time series model. In addition, residual and confidence
interval are also proposed. Finally, some numerical examples are given.
1 Introduction
Time series is a sequence of observed values arranged in chronological order, and the
main goal of time series analysis is to predict future value from the knowledge of the
time series at its current time and also previous time. That is, via time series analysis,
the inner relationships reflected from observed values will be obtained, and the future
tendency can be forecasted. The early time series model can be traced back to the
work of Yule (1927), in which he proposed an autoregressive model to represent time
series. Later on, Walker (1931) developed a moving-average model. But time series
analysis still does not attract the attention of researchers until Box and Jenkins (1970)
published a book about time series analysis, in which they developed a systematic
approach that enables practitioners to use time series analysis in forecasting.
B Baoding Liu
liu@tsinghua.edu.cn
Xiangfeng Yang
yangxf@uibe.edu.cn
123
X. Yang, B. Liu
123
Uncertain time series analysis with imprecise observations
2 Preliminaries
This section introduces some fundamental concepts and properties in uncertainty the-
ory including uncertain measure, uncertain variable and expected value.
In order to provide the operational law, Liu (2009) defined the product uncertain
measure on the product σ -algebre L, called product axiom.
Axiom 4 Let (Γk , Lk , Mk ) be uncertainty spaces for k = 1, 2, . . . . The product
uncertain measure M is an uncertain measure satisfying
∞ ∞
M Λk = Mk {Λk }
k=1 k=1
{ξ ∈ B} = {γ ∈ Γ |ξ(γ ) ∈ B}
is an event.
123
X. Yang, B. Liu
The operational law of uncertain variables was proved by Liu (2010) to calculate
the inverse uncertainty distribution of strictly monotonous function as the following
theorem.
ξ = f (ξ1 , . . . , ξm , ξm+1 , . . . , ξn )
The expected value operator of uncertain variable, proposed by Liu (2007), is the
average value of uncertain variable in the sense of uncertain measure, and represents
the size of uncertain variable.
Definition 4 (Liu 2007) Let ξ be an uncertain variable. Then the expected value of ξ
is defined as
+∞ 0
E[ξ ] = M{ξ ≥ x}dx − M{ξ ≤ x}dx
0 −∞
For an uncertain variable ξ with regular uncertainty distribution Φ(x), its expected
value can be obtained by
1
E[ξ ] = Φ −1 (α)dα. (1)
0
Theorem 2 (Liu 2010) Let ξ and η be independent uncertain variables with finite
expected values. Then for any real numbers a and b, we have
Definition 5 (Liu 2007) Let ξ be an uncertain variable with finite expected value e.
Then the variance of ξ is defined by
V [ξ ] = E[(ξ − e)2 ].
123
Uncertain time series analysis with imprecise observations
For an uncertain variable ξ with regular uncertainty distribution Φ(x) and finite
expected value e, its variance can be obtained by
1
2
V [ξ ] = Φ −1 (α) − e dα. (2)
0
Theorem 3 (Sheng and Kar 2015) Let ξ be an uncertain variable with regular uncer-
tainty distribution Φ(x). Then
+∞
E[ξ 2 ] = (Φ −1 (α))2 dα.
−∞
Traditional time series analysis is a method to predict future values based on previously
observed values which are precisely observed. However, in an environment where the
previously observed values are evaluated imprecisely through domain experts’ belief
degrees, the traditional time series model cannot be applied directly since imprecise
observed values cannot be treated as precise observed values. By interpreting the
observed values as uncertain variables, traditional time series model can be naturally
extended to become uncertain time series model.
An uncertain times series is a sequence of imprecisely observed values each of
which is described by an uncertain variable. Mathematically, an uncertain time series
is represented by
X = {X 1 , X 2 , . . . , X n }
k
X t = a0 + ai X t−i + t (3)
i=1
123
X. Yang, B. Liu
Denoting the optimal solution by a0∗ , a1∗ , . . . , ak∗ , the fitted autoregressive model is
determined by
k
X t = a0∗ + ai∗ X t−i . (5)
i=1
k
X t = a0 + ai X t−i + t
i=1
n
2
1
k
−1
min Φt−1 (α) − a0 − ai Υt−i (α, ai ) dα (6)
a0 ,a1 ,...,ak
t=k+1 0 i=1
where
−1
−1 Φt−i (1 − α), if ai ≥ 0
Υt−i (α, ai ) = −1
Φt−i (α), if ai < 0
for i = 1, 2, . . . , k.
Proof For each index t, we obtain that the inverse uncertainty distribution of
k
X t − a0 − ai X t−i
i=1
is
k
−1
Ψt−1 (α) = Φt−1 (α) − a0 − ai Υt−i (α, ai ).
i=1
Therefore, the minimization problem (4) is equivalent to (6). This theorem is thus
proved.
123
Uncertain time series analysis with imprecise observations
4 Residual analysis
k
X t = a0∗ + ai∗ X t−i .
i=1
Then for each index t (t = k + 1, k + 2, . . . , n), the difference between the actual
observed value and the value predicted by the fitted model,
k
ˆt := X t − a0∗ − ai∗ X t−i
i=1
Then a suitable approach to estimate the expected value of disturbance term is the
average of expected values of residuals, i.e.,
1
n
ê := E ˆt ,
n−k
t=k+1
1
n
σ̂ 2 := E (ˆt − ê)2
n−k
t=k+1
k
X t = a0∗ + ai∗ X t−i .
i=1
123
X. Yang, B. Liu
Then the estimated expected value of disturbance terms under the iid hypothesis is
n 1
1
k
−1 ∗ ∗ −1 ∗
ê = Φt (α) − a0 − ai Υt−i (α, ai ) dα (7)
n−k 0 t=k+1 i=1
n 1
2
1
k
−1 ∗ ∗ −1 ∗
σ̂ =
2
Φt (α) − a0 − ai Υt−i (α, ai ) − ê dα (8)
n−k 0 t=k+1 i=1
where
−1
−1 Φt−i (1 − α), if ai∗ ≥ 0
Υt−i (α, ai∗ ) = −1
Φt−i (α), if ai∗ < 0
for i = 1, 2, . . . , k.
Proof Since
k
ˆt = X t − a0∗ − ai∗ X t−i
i=1
and X t , X t−1 , . . . , X t−k are independent, we obtain that the inverse uncertainty dis-
tribution of ˆt is
k
−1
Ψ̂t−1 (α) = Φt−1 (α) − a0∗ − ai∗ Υt−i (α, ai∗ )
i=1
and
2 1
2
E ˆt − ê = Ψ̂t−1 (α) − ê dα,
0
123
Uncertain time series analysis with imprecise observations
This section proposes a confidence interval to predict the next value X n+1 for k-order
uncertain autoregressive model (3). From Sects. 3 and 4, we already have obtained the
fitted autoregressive model
k
X t = a0∗ + ai∗ X t−i
i=1
and the estimators ê and σ̂ 2 of disturbance term t . We further assume that disturbance
term n+1 follows normal uncertainty distribution N(ê, σ̂ ), and n+1 is independent
of X 1 , X 2 , . . . , X n .
For predicting the next value X n+1 , a forecast uncertain variable X̂ n+1 is defined
as follows,
k
X̂ n+1 := a0∗ + ai∗ X n+1−i + n+1 .
i=1
And the expected value of X̂ n+1 is called forecast value of X n+1 . From independence
of X n , X n−1 , . . . , X n+1−k and n+1 (Theorem 2), we can obtain that the forecast value
is
μ = E X̂ n+1
k
∗ ∗
= E a0 + ai X n+1−i + n+1
i=1
k
(9)
= a0∗ + ai∗ E X n+1−i + E n+1
i=1
k
= a0∗ + ai∗ E X n+1−i + ê.
i=1
k
−1 −1
Φ̂n+1 (α) = a0∗ + ai∗ Υn+1−i (α, ai∗ ) + Φ −1 (α)
i=1
123
X. Yang, B. Liu
Since
μ ± b.
6 Numerical examples
X1 X2 X3 X4 X5 X6
L(335, 347) L(338, 350) L(340, 354) L(343, 359) L(344, 364) L(346, 366)
X7 X8 X9 X 10 X 11 X 12
L(350, 366) L(355, 369) L(360, 372) L(362, 376) L(365, 381) L(370, 384)
X 13 X 14 X 15 X 16 X 17 X 18
L(373, 390) L(379, 391) L(380, 398) L(384, 402) L(388, 410) L(390, 415)
123
Uncertain time series analysis with imprecise observations
18
2
1
3
−1
min Φt−1 (α) − a0 − ai Υt−i (α, ai ) dα
a0 ,a1 ,a2 ,a3 0
t=4 i=1
where
−1
−1
Φt−i (1 − α), if ai ≥ 0
Υt−i (α, ai ) =
−1
Φt−i (α), if ai < 0
1
18 1 3
−1
ê = Φt−1 (α) − a0∗ − ai∗ Φt−i (1 − α) dα,
15 0
t=4 i=1
2
1
18 1 3
−1
σ̂ 2 = Φt−1 (α) − a0∗ − ai∗ Φt−i (1 − α) − ê dα,
15 0
t=4 i=1
ê = 0.0000, σ̂ 2 = 88.7226,
3
μ = a0∗ + ai∗ E X 19−i + ê = 401.6407.
i=1
Furthermore, take the confidence level α = 95%, and 19 is assumed to follow normal
uncertainty distribution N(ê, σ̂ ). We find that
b = 27.4935
123
X. Yang, B. Liu
X1 X2 X3 X4 X5
Z(600, 612, 620) Z(615, 630, 650) Z(640, 666, 685) Z(700, 725, 740) Z(735, 758, 770)
X6 X7 X8 X9 X 10
Z(740, 755, 760) Z(730, 745, 755) Z(705, 713, 720) Z(680, 685, 692) Z(630, 636, 655)
X 11 X 12 X 13 X 14 X 15
Z(605, 611, 620) Z(560, 568, 590) Z(505, 520, 550) Z(500, 510, 512) Z(485, 494, 508)
where uncertainty distribution Φ̂19 can be derived from its inverse uncertainty distri-
bution
√
3
σ̂ 3 α
−1 −1
Φ̂19 (α) = a0∗ + ai∗ Φ19−i (α) + ln .
π 1−α
i=1
401.6407 ± 27.4935.
15
2
1
3
−1
min Φt−1 (α) − a0 − ai Υt−i (α, ai ) dα
a0 ,a1 ,a2 ,a3 0
t=4 i=1
where
−1
−1
Φt−i (1 − α), if ai ≥ 0
Υt−i (α, ai ) =
−1
Φt−i (α), if ai < 0
123
Uncertain time series analysis with imprecise observations
1
1
15
−1 −1
ê = Φt−1 (α) − a0∗ − a1∗ Φt−1 (1 − α) − a3∗ Φt−3 (α) dα,
12 0
t=4
1
2
1
15
−1 −1
σ̂ 2 = Φt−1 (α) − a0∗ − a1∗ Φt−1 (1 − α) − a3∗ Φt−3 (α) − ê dα,
12 0
t=4
ê = 0.2480, σ̂ 2 = 403.3562,
Furthermore, take the confidence level α = 95%, and 16 is assumed to follow normal
uncertainty distribution N(ê, σ̂ ). We find that
b = 61.6727
where uncertainty distribution Φ̂16 can be derived from its inverse uncertainty distri-
bution
√
−1 −1 −1 σ̂ 3 α
Φ̂16 (α) = a0∗ + a1∗ Φ15 (α) + a3∗ Φ13 (1 − α) + ê + ln .
π 1−α
507.6092 ± 61.6727.
X t = a0 + a1 X t−1 + a2 X t−2 + t .
123
X. Yang, B. Liu
X1 X2 X3 X4 X5 X6 X7 X8
16
2
1
2
−1
min Φt−1 (α) − a0 − ai Υt−i (α, ai ) dα
a0 ,a1 ,a2 0
t=4 i=1
where
−1
−1 Φt−i (1 − α), if ai ≥ 0
Υt−i (α, ai ) = −1
Φt−i (α), if ai < 0
1
16 1 2
−1
ê = Φt−1 (α) − a0∗ − ai∗ Φt−i (1 − α) dα,
14 0
t=3 i=1
2
1
16 1 2
−1
σ̂ =
2
Φt−1 (α) − a0∗ − ai∗ Φt−i (1 − α) − ê dα,
14 0
t=3 i=1
ê = 0.0000, σ̂ 2 = 29.2464,
3
μ = a0∗ + ai∗ E X 16−i + ê = 89.0357.
i=1
123
Uncertain time series analysis with imprecise observations
Furthermore, take the confidence level α = 95%, and 17 is assumed to follow normal
uncertainty distribution N(ê, σ̂ ). We find that
b = 14.7727
where uncertainty distribution Φ̂17 can be derived from its inverse uncertainty distri-
bution
√
2
σ̂ 3 α
−1 −1
Φ̂17 (α) = a0∗ + ai∗ Φ17−i (α) + ln .
π 1−α
i=1
89.0357 ± 14.7727.
7 Conclusion
This paper firstly proposed a methodology of uncertain time series analysis to predict
future values based on previously observed values which are described by uncertain
variables. According to the principle of least squares, a minimization problem was
derived to calculate the unknown parameters in the uncertain time series model. Fur-
thermore, residual of disturbance term was analyzed, and a confidence interval was
also defined to predict the future value.
Acknowledgements The authors gratefully acknowledge the financial support provided by National Natural
Science Foundation of China (Grants Nos. 61573210 and 61873329).
References
Box, G. E. P., & Jenkins, G. M. (1970). Time series analysis: Forecasting and control. San Francisco:
Holden-Day.
Cai, Q., Zhang, D., Zheng, W., & Leung, S. C. (2015). A new fuzzy time series forecasting model combined
with ant colony optimization and auto-regression. Knowledge-Based Systems, 74, 61–68.
Chen, S. M. (2002). Forecasting enrollments based on high order fuzzy time series. Cybernetics and Systems,
33(1), 1–16.
Domańska, D., & Wojtylak, M. (2012). Application of fuzzy time series models for forecasting pollution
concentrations. Expert Systems with Applications, 39(9), 7673–7679.
Egrioglu, E., Yolcu, U., Aladag, C. H., & Kocak, C. (2013). An ARMA type fuzzy time series forecasting
method based on particle swarm optimization. Mathematical Problems in Engineering, 2013, Article
ID 935815.
Huarng, K. (2001). Effective lengths of intervals to improve forecasting in fuzzy time series. Fuzzy Sets
and Systems, 123(3), 387–394.
123
X. Yang, B. Liu
Lee, L. W., Wang, L. H., Chen, S. M., & Leu, Y. H. (2006). Handling forecasting problems based on
two-factors high-order fuzzy time series. IEEE Transactions on Fuzzy Systems, 14(3), 468–477.
Lee, H. S., & Chou, M. T. (2014). Fuzzy forecasting based on fuzzy time series. International Journal of
Computer Mathematics, 81, 781–789.
Lio, W., & Liu, B. (2018a). Uncertain data envelopment analysis with imprecisely observed inputs and
outputs. Fuzzy Optimization and Decision Making, 17(3), 357–373.
Lio, W., & Liu, B. (2018b). Residual and confidence interval for uncertain regression model with imprecise
observations. Journal of Intelligent and Fuzzy Systems, 35(1), 2573–2583.
Liu, B. (2007). Uncertainty theory (2nd ed.). Berlin: Springer.
Liu, B. (2009). Some research problems in uncertainty theory. Journal of Uncertain Systems, 3(1), 3–10.
Liu, B. (2010). Uncertainty theory: A branch of mathematics for modeling human uncertainty. Berlin:
Springer.
Nejad, Z. M., & Ghaffari-Hadigheh, A. (2018). A novel DEA model based on uncertainty theory. Annals
of Operations Research, 264, 367–389.
Sheng, Y. H., & Kar, S. (2015). Some results of moments of uncertain variable through inverse uncertainty
distribution. Fuzzy Optimization and Decision Making, 14(1), 57–76.
Song, Q., & Chissom, B. S. (1993a). Forecasting enrollments with fuzzy time series: Part I. Fuzzy Sets and
Systems, 54, 1–9.
Song, Q., & Chissom, B. S. (1993b). Fuzzy time series and its models. Fuzzy Sets and Systems, 54, 269–277.
Song, Q., & Chissom, B. S. (1994). Forecasting enrollments with fuzzy time series: Part II. Fuzzy Sets and
Systems, 62, 1–8.
Sullivan, J., & Woodall, W. H. (1994). A comparison of fuzzy forecasting and Markov modeling. Fuzzy
Sets and Systems, 64(3), 279–293.
Tseng, F. M., Tzeng, G. H., Yu, H. C., & Yuan, B. J. C. (2001). Fuzzy ARIMA model for forecasting the
foreign exchange market. Fuzzy Sets and Systems, 118(1), 9–19.
Wen, M. L., Zhang, Q. Y., Kang, R., & Yang, Y. (2017). Some new ranking criteria in data envelopment
analysis under uncertain environment. Computers and Industrial Engineering, 110, 498–504.
Yao, K. (2018). Uncertain statistical inference models with imprecise observations. IEEE Transactions on
Fuzzy Systems, 26(2), 409–415.
Yao, K., & Liu, B. (2018). Uncertain regression analysis: An approach for imprecise observations. Soft
Computing, 22(17), 5579–5582.
Yule, G. U. (1927). On a method of investigating periodicities in disturbed series with special reference to
Wolfer’s sunspot numbers. Philosophical Transactions of the Royal Society of London, 226, 267–298.
Walker, G. T. (1931). On periodicity in series of related terms. Proceedings of the Royal Society of London,
Series A, 131, 518–532.
123