Professional Documents
Culture Documents
8, AUGUST 2018
Abstract— With accurate traffic prediction, future cellular Aiming at exploiting the spatial dependence of different
networks can make self-management and embrace intelligent and cells, [9] established a strategy combining auto-encoder and
efficient automation. This letter devotes itself to citywide cellular Long Short Term Memory (LSTM) network [10]. However,
traffic prediction and proposes a deep learning approach to model
the nonlinear dynamics of wireless traffic. By treating traffic the learned features through auto-encoder are lossy represen-
data as images, both the spatial and temporal dependence of cell tation of original data [11], which may fail to fully characterize
traffic are well captured utilizing densely connected convolutional the spatial dependence of neighboring cells. Besides, the above
neural networks. A parametric matrix based fusion scheme is methods mainly concentrate on predicting traffic for single
further put forward to learn influence degrees of the spatial cell. They are computationally expensive if applied to citywide
and temporal dependence. Experimental results show that the
prediction performance in terms of root mean square error can be scale networks, because hundreds even thousands of models
significantly improved compared with those existing algorithms. need to be trained simultaneously.
The prediction accuracy is also validated by using the data sets Motivated by the aforementioned problems, this letter
of Telecom Italia. proposes a new method for citywide traffic prediction by
Index Terms— Cellular traffic prediction, big data, deep exploiting the powerful capability of deep convolutional
learning, intelligent traffic management. neural network (CNN). More specifically, densely connected
CNN [12], which is the most advanced deep learning archi-
I. I NTRODUCTION tecture, is utilized to collectively model the spatial and
T HAS been well known that adopting traffic prediction temporal dependence of traffic in different cells. The spa-
I can facilitate resource allocation, enhance energy effi-
ciency, and finally enable intelligent cellular networks [1], [2].
tial dependence is naturally captured by the convolution
operation. Two temporal dependence, i.e., closeness and
Recently, a lot of work has been done to investigate period, are modeled using two CNNs and the results are
the dynamic characteristics of wireless traffic, e.g., non- further fused by a parametric-matrix-based scheme. Experi-
stationary and seasonality [3], thus to make an accurate mental results and comparisons demonstrate that the proposed
prediction [4], [5]. Cellular traffic prediction in these work can method is effective and outperforms those existing traffic
be treated as a time series analysis problem, the performance prediction methods. The source code is available through
of which depends on its linear statistical models, such as https://github.com/zctzzy/traffic_prediction.
AutoRegressive Integrated Moving Average (ARIMA) and
alpha-stable. The pattern of cellular traffic is actually very II. DATASET AND S OME K EY O BSERVATIONS
complex due to various factors, e.g., user mobility, arrival A. Wireless Big Traffic Dataset
pattern and diverse user requirements. It becomes increasingly The wireless traffic data analyzed in this letter comes from
clear that those linear models are not suitable for such kind a large telephony service provider in Europe, Telecom Italia,
of applications [6]. as part of the “Big Data Challenge” [13]. The dataset consists
To capture the complex and nonlinear dependence hidden of time series of aggregated cell phone traffic, i.e, short
in wireless traffic data, recent advances on machine learning message service (SMS) and call service, sent or received by
models [7] have established themselves as strong competitors users within a specific area over the city of Milan. The city
to classical statistical models in traffic prediction [8], [9]. is divided into a grid with size of H × W and each square
[8] proposed a deep belief network based prediction method of the grid is referred to as a “cell”1 . H and W refer to
to model the long-term dependence of cellular traffic. the number of rows and columns of the grid. In the dataset,
Manuscript received March 22, 2018; revised May 3, 2018; accepted
H = W = 100, which means the whole city area is divided
May 16, 2018. Date of publication May 29, 2018; date of current version into 100 × 100 cells. The traffic is recorded during the period
August 10, 2018. This work was supported in part by the National Science from 00:00 11/01/2013 to 00:00 01/01/2014 with a temporal
Foundation of China for Excellent Young Scholars under Grant 61622111 and
in part by the National Natural Science Foundation of China under Grant
interval of 10 minutes. At the tth time slot, the in and out
61671278. The associate editor coordinating the review of this paper traffic in all cells can be denoted as a tensor Xt ∈ R2×H×W
and approving it for publication was M. Caleffi. (Corresponding author: where (Xt )0,i,j = xin,i,j
t , (Xt )1,i,j = xout,i,j
t . Furthermore,
Haixia Zhang.) i,j
The authors are with the Shandong Provincial Key Laboratory of
xi,j = {xt }, ∀t is denoted as the in/out traffic of cell (i, j)
Wireless Communication Technologies, Shandong University, Jinan 250100, without distinguishing them unless otherwise specified.
China (e-mail: chuanting.zhang@gmail.com; haixia.zhang@sdu.edu.cn;
dfyuan@sdu.edu.cn). 1 This is the best approximation of a cell tower from publicly available
Digital Object Identifier 10.1109/LCOMM.2018.2841832 dataset.
1558-2558 © 2018 IEEE. Personal use is permitted, but republication/redistribution requires IEEE permission.
See http://www.ieee.org/publications_standards/publications/rights/index.html for more information.
ZHANG et al.: CITYWIDE CELLULAR TRAFFIC PREDICTION BASED ON DENSELY CONNECTED CNNs 1657
V. C ONCLUSION
This letter investigates the spatial and temporal dependence
of traffic among different cells and has proposed a deep
learning approach to collectively model these two kinds of
dependencies for traffic prediction. To exactly model the
Fig. 4. The change of training loss with each epoch.
influence degree of these two dependences, a parametric
matrix based fusion scheme has been introduced. It has
been shown that by treating traffic data as images, the city-
wide traffic can be efficiently predicted. Experimental results
have demonstrated that the proposed CNN based approach
obtains better performance in terms of RMSE when com-
pared with some existing methods. It should be noted that
to achieve the best performance with the proposed approach
large amount of training data is needed. The code is available
at https://github.com/zctzzy/traffic_prediction.
R EFERENCES
[1] R. Li et al., “Intelligent 5G: When cellular networks meet artificial
intelligence,” IEEE Wireless Commun., vol. 24, no. 5, pp. 175–183,
Fig. 5. Prediction results of a random selected cell. Oct. 2017.
[2] N. Saxena, B. J. R. Sahu, and Y. S. Han, “Traffic-aware energy
optimization in green LTE cellular systems,” IEEE Commun. Lett.,
fusion scheme (XL+1 c + XL+1
p ), denoted as Proposed, is also vol. 18, no. 1, pp. 38–41, Jan. 2014.
included in Fig. 3. It can be observed that the fusion scheme [3] A. D’Alconzo, A. Coluccia, F. Ricciato, and P. Romirer-Maierhofer,
can enhance the performance in terms of RMSE, which “A distribution-based approach to anomaly detection and application
means that learning the weights of closeness and period can to 3G mobile traffic,” in Proc. IEEE Global Telecommun. Conf.,
Nov./Dec. 2009, pp. 1–8.
better describe their function in traffic prediction. For com- [4] B. Zhou, D. He, and Z. Sun, Traffic Modeling and Prediction Using
parison purpose, three existing algorithms, Historical Average ARIMA/GARCH Model. Boston, MA, USA: Springer, 2006, ch. 5,
value (HA), ARIMA and LSTM, are adopted as baselines as pp. 101–121.
[5] R. Li, Z. Zhao, J. Zheng, C. Mei, Y. Cai, and H. Zhang, “The
they have been widely used for traffic prediction in current learning and prediction of application-level traffic data in cellu-
research work [4], [9]. The obtained RMSE shows that the lar networks,” IEEE Trans. Wireless Commun., vol. 16, no. 6,
proposed method achieves the most accurate prediction. This pp. 3899–3912, Jun. 2017.
[6] J. G. D. Gooijer and R. J. Hyndman, “25 years of time series forecast-
is because the spatial and temporal dependences of cellular ing,” Int. J. Forecasting, vol. 22, no. 3, pp. 443–473, 2006.
traffic among different cells are collectively modeled through [7] H. Zhang, L. Cao, and S. Gao, “Locality correlation preserving based
utilizing CNNs. Moreover, the RMSE values on the Call one-class support vector machine,” Pattern Recognit., vol. 47, no. 9,
dataset are relatively lower than that of the SMS dataset, as the pp. 3168–3178, 2014.
[8] L. Nie, D. Jiang, S. Yu, and H. Song, “Network traffic prediction based
pattern hidden in the Call dataset is more “regular”, as seen on deep belief network in wireless mesh backbone networks,” in Proc.
from Fig. 1a, than the SMS dataset, which in turn makes it IEEE Wireless Commun. Netw. Conf. (WCNC), San Francisco, CA, USA,
easier to be learned by the deep model. It should be noted that Mar. 2017, pp. 1–5.
[9] J. Wang et al., “Spatiotemporal modeling and prediction in cellular
for real communication systems re-training is needed to cope networks: A big data enabled deep learning approach,” in Proc. IEEE
with the new coming data. Conf. Comput. Commun. (INFOCOM), Atlanta, GA, USA, May 2017,
To show the convergence speed of the proposed method, pp. 1–9.
the training loss after each epoch is given in Fig. 4. It is shown [10] Y. LeCun, Y. Bengio, and G. E. Hinton, “Deep learning,” Nature,
vol. 521, pp. 436–444, May 2015.
that the loss decreases quickly during the first 20 epochs and [11] G. E. Hinton and R. R. Salakhutdinov, “Reducing the dimensionality of
then gradually converges to a stable status after 40 epochs, data with neural networks,” Science, vol. 313, no. 5786, pp. 504–507,
indicating that the training process of the proposed method is 2006.
[12] G. Huang, Z. Liu, L. van der Maaten, and K. Q. Weinberger,
time efficient. “Densely connected convolutional networks,” in Proc. IEEE Conf.
Comput. Vis. Pattern Recognit. (CVPR), Honolulu, HI, USA,
C. Prediction Results Jul. 2017, p. 3.
[13] G. Barlacchi et al., “A multi-source dataset of urban life in the city
Finally, both the predicted in and out traffic of a ran- of milan and the province of trentino,” Sci. Data, vol. 2, Oct. 2015,
domly selected cell are given and compared with the ground Art. no. 150055.