
Mobile Networks and Applications
https://doi.org/10.1007/s11036-019-01241-7
QoS Prediction for Service Recommendation with Deep Feature Learning in Edge Computing Environment
Yuyu Yin 1,2 & Lu Chen 1 & Yueshen Xu 3,4 & Jian Wan 2,5 & He Zhang 3 & Zhida Mai 6
© Springer Science+Business Media, LLC, part of Springer Nature 2019
Abstract
Along with the popularity of intelligent services and mobile services, service recommendation has become a key task, especially recommendation based on quality-of-service (QoS) in edge computing environments. Most existing service recommendation methods have serious defects and cannot be directly adopted in edge computing environments. For example, most existing methods cannot learn deep features of users or services, whereas edge computing environments contain a variety of devices with different configurations and functions, so it is necessary to learn the deep features behind those complex devices. In order to fully utilize hidden features, this paper proposes a new matrix factorization (MF) model with deep feature learning, which integrates a convolutional neural network (CNN). The proposed model is named Joint CNN-MF (JCM). JCM is capable of using the learned deep latent features of neighbors to infer the features of a user or a service. Meanwhile, to improve the accuracy of neighbor selection, the proposed model contains a novel similarity computation method. The CNN learns the neighbors' features, forms a feature matrix and infers the features of the target user or target service. We conducted experiments on a real-world service dataset under a range of data densities, to reflect the complex invocation cases in edge computing environments. The experimental results verify that, compared to counterpart methods, our method consistently achieves higher QoS prediction accuracy.
Keywords Service recommendation · QoS prediction · Edge computing · Deep feature learning · Convolutional neural network · Matrix factorization
* Yueshen Xu: ysxu@xidian.edu.cn
Yuyu Yin: yinyuyu@hdu.edu.cn
Lu Chen: chenlu@hdu.edu.cn
Jian Wan: wanjian@hdu.edu.cn
He Zhang: zhh@stu.xidian.edu.cn
Zhida Mai: top.mark@e-live.cn

1 School of Computer, Hangzhou Dianzi University, Hangzhou 310018, China
2 Key Laboratory of Complex Systems Modeling and Simulation of Ministry of Education, Hangzhou 310018, China
3 School of Computer Science and Technology, Xidian University, Xi’an 710126, China
4 Provincial Key Laboratory for Computer Information Processing Technology, Soochow University, Suzhou 215006, Jiangsu, China
5 School of Information and Electronic Engineering, Zhejiang University of Science and Technology, Hangzhou 310023, China
6 Xanten Guangdong Development Co., Ltd, Foshan 528200, China

1 Introduction

At present, many information technologies, such as cloud computing, big data, Internet-of-things, mobile computing and artificial intelligence, are being widely applied [1, 2]. Meanwhile, as an emerging technique, edge computing is developing fast [3]. In edge computing, services are provided in a decentralized architecture, and are processed by local computing devices in priority, such as local computers, smart phones or sensors [4]. Such a trend promotes the rapid development of services in three directions (i.e., intelligence, personalization and integration), and spawns a large number of new services. At the same time, due to the large number of available services, it is a difficult task for a user to select services that match his or her requirements. So it is an urgent task to solve how to recommend suitable services to meet users’ requirements, especially in edge computing environment.

Service recommendation technique is the key to achieving personalization for intelligent services. The service recommendation task is to recommend services that can meet the functional requirements of users, and also requires that the recommended services should meet the non-functional requirements (quality-of-service, QoS for short), especially for those services with the same or similar functions. However, in a real service invocation scenario, the number of services that a user has invoked before is usually quite limited, resulting in a serious lack of historical QoS values. Such a case is also serious in edge computing environment. Therefore, in order to carry out intelligent service recommendation, it becomes an inevitable task to predict missing QoS values with high accuracy.

In recent years, research on QoS prediction has received extensive attention both in academia and in industry [5]. Among all prediction methods, the methods extended from collaborative filtering (CF) have been deeply studied and applied, mainly because of their simplicity and effectiveness. The basic CF method does not depend on structural information of the recommended task, and can achieve good prediction accuracy. The CF-based prediction methods can be divided into two categories, i.e., neighborhood-based methods and model-based methods [6]. The neighborhood-based method uses historical QoS records to compute the similarity between users or services, to form the neighbors set, and then computes missing QoS values by utilizing historical QoS records of neighbors. The prediction accuracy of CF-based methods depends on the quality of neighbors selection, but in the case of high data sparsity, the quality of neighbor selection becomes a bottleneck due to the limited available historical QoS records [7]. Model-based methods use historical QoS records to learn latent features in service invocation process, and obtain the prediction results with the learned features. Model-based methods do not depend on neighbors selection. Under high data sparsity, it has been verified that the prediction accuracy of model-based methods is usually better than that of neighborhood-based methods [8].

Matrix factorization (MF for short) [9] is a widely-studied model-based CF method. Matrix factorization assumes that the QoS value of a service invocation is determined by several hidden factors. For example, QoS is easily affected by network condition or geographical location. In MF-based methods, user QoS records and service QoS records are mapped to a common latent feature space to learn and obtain prediction results [10]. It has been verified that MF-based models can obtain more accurate prediction results than neighborhood-based methods. However, note that MF-based methods are not capable of learning deep features of users and services, as MF is a shallow model. In MF, features are designed manually, and the scalability is very limited, which in turn leads to a decline in prediction accuracy [11, 12].

In recent years, deep learning has made great progress in learning deep hidden features in many artificial intelligence tasks, such as image recognition, text mining and natural language processing [13–15]. In those tasks, convolutional neural network (CNN for short) is particularly prominent. Recently, CNN has also been applied in recommendation systems [16], but its prediction results are not competitive enough.

To attack the existing problems, in this paper, we propose a new neighborhood feature learning method, and further propose a novel MF model, which combines the advantages of neighborhood-based method, model-based method and deep learning technique. The proposed model can achieve high accuracy in neighbors selection under high data sparsity and is capable of learning deep features. First, the proposed model is inspired by the neighborhood-based method. The historical invocation QoS values of neighbors are key to the prediction of the target invocation QoS, as the invocation features of neighbors can be used to infer the target invocation features. Therefore, this paper proposes to use the neighbors features to compute the latent features of the target user, enhancing the learning ability of matrix factorization. Second, the traditional neighbors selection methods easily lead to overestimation. This paper analyzes the invocation preference in service invocation process and finds that if two users invoke services that are not frequently invoked by other users, these two users tend to invoke more similar services. Considering that in edge computing environment, the invocation records among users or services are sparse, we propose two new concepts, which are inverse invocation frequency of users (IIFU for short) and inverse invocation frequency of services (IIFS for short). IIFU and IIFS are used to compute similarity, to obtain a more accurate neighbors set. Finally, in order to learn deep features, the proposed model integrates convolutional neural network (CNN) into matrix factorization model. We employ CNN to learn deep features from the invocation records of the selected neighbors, and also learn the relationship with features of the target user or target service. The missing QoS values are predicted with our model and the recommendation is conducted subsequently.

The main contributions of this paper are as follows.

i. To promote the quality in neighbors selection in edge computing environment, we propose two concepts, i.e., IIFU (inverse invocation frequency of users) and IIFS (inverse invocation frequency of services). The two new concepts are extended to a new similarity computation method.
ii. This paper proposes a novel QoS prediction model with neighbors feature learning, based on CNN and MF, named Joint CNN-MF (JCM). The proposed JCM model has a stronger ability than basic MF model in features learning.
iii. The experiments conducted on the real-world dataset demonstrate that the proposed JCM model is superior to the compared traditional and state-of-the-art methods in prediction accuracy, and has high stability regarding parameter settings.

The remaining sections of this paper are organized as follows. Section 2 summarizes the work related to QoS prediction. Section 3 explains the whole framework. Section 4 elaborates the details of the proposed model. Section 5 gives the experimental results, and Section 6 concludes the paper.

2 Related work

Much research has been done on QoS prediction for Web service or Web-based services, where most of the proposed methods are extended from collaborative filtering (CF) algorithm [17, 18]. The traditional CF algorithms can be classified into two categories: neighborhood-based CF and model-based CF [6].

Neighborhood-based CF algorithms usually have the advantages of easy implementation and high scalability, and are also widely used in QoS prediction [19]. Zheng et al. [20] introduced a similarity computation method for improving the similarity computation of similar neighbors. Yao et al. [21] integrated content features into collaborative filtering. The authors analyzed the description content of Web services and used semantic analysis technique to learn the preferences of users’ invocation. Xin et al. [22] established a location-aware hidden Markov model, which utilized the geographical information. Ren et al. [23] proposed a support vector machine (SVM) based CF method, which filtered the services that might not be truly preferred by a user. Although the neighborhood-based CF methods can finish the prediction task, they cannot effectively solve the problem in high data sparsity. Besides, the existing methods are not specific for edge computing environment.

Model-based CF methods have received wide attention in recent years, especially in high data sparsity [24]. As a dominant model-based method, matrix factorization (MF for short) has attracted a lot of attention [10]. Chen et al. [24] proposed an MF model combined with the knowledge of geographical neighborhoods, which could alleviate the cold-start problem. To better characterize the QoS data, Xu et al. [25] introduced the probabilistic matrix factorization (PMF) method into service recommendation. He et al. [26] proposed a location-based hierarchical matrix factorization model (HMF) to perform QoS prediction.

Li et al. [27] found that the existing methods ignored the text information associated with services, and proposed a QoS-aware service recommendation method utilizing text features. Zhou et al. [28] integrated the user’s network location information into MF model. Zhou et al. [29] proposed to add time factors into QoS prediction, and built a time-user-service tensor to perform tensor factorization. At that time, there were some problems existing in the proposed methods, such as too many parameters and slow convergence speed. Wu et al. [30] proposed an improved prediction method named CAP (Credibility-Aware Prediction), which was based on clustering algorithm. Tian [31] collected a dataset containing a large number of user-service invocation records, and proposed a model that combined implicit feedbacks. Although many studies have improved the prediction accuracy from a series of aspects, the existing methods have a serious limitation that only the shallow features are extracted or learned. The existing methods that are purely based on neighborhood computation or matrix factorization are not capable of learning deep features that impact QoS values.

In recent years, deep neural networks have emerged as effective models in deep feature extraction, and some researchers have tried to apply neural networks in recommendation systems [16], especially the convolutional neural network (CNN). Tuan et al. [32] proposed to combine session clicks and content features to generate recommendations, using CNN. Xing et al. [33] proposed a convolutional matrix factorization model, jointly modeling user profiles, social relationships and geographical location. Huan et al. [34] proposed a content-based MF-CNN joint model that exploited text contents in recommendations, especially in solving the cold-start problem. Similarly, Kim et al. [16] also proposed a content-aware recommendation model that was constructed based on CNN. Compared to the pure CF or MF methods, the neural network methods achieve better performance in traditional recommender systems. However, few methods have been proposed for QoS prediction, service recommendation or service selection, and the existing methods in traditional recommender systems cannot be directly used in edge computing. The reason is that in edge computing, QoS is determined by physical configurations in user side, service side and network link, but ratings in e-commerce are determined by users’ preferences.

In this paper, we propose a joint MF and CNN model, which takes full advantage of neighborhood features, common latent features and deep latent features of the service invocation process. The proposed model solves the problems of how to improve neighbors selection quality and how to learn deep latent features from QoS records. Based on the observation of service invocation records, we propose two new concepts, i.e., IIFU (inverse invocation frequency of users) and IIFS (inverse invocation frequency of services), and propose a new similarity computation method based on IIFU and IIFS, to improve neighbors selection quality. As for feature learning, our model extends convolutional neural network to learn deep latent features of user neighbors and service neighbors. Matrix factorization model is performed to predict missing QoS values. The proposed model improves the ability in high data sparsity, and improves the prediction accuracy.
3 The whole framework

In order to recommend high-quality services to users in edge computing environment, this paper focuses on achieving highly accurate QoS prediction. To this end, this paper proposes a QoS prediction method based on deep feature learning, as shown in Fig. 1.

Because of the sparsity problem and characteristics of traditional shallow models, the pure MF model cannot obtain deep features. To alleviate the sparsity issue, the model uses the neighbor features to learn the latent features of the target user. At the same time, to learn deep latent features, the proposed model combines a convolutional neural network, and is named Joint CNN-MF model (JCM for short). As for neighbors selection and neighbors feature learning, the procedure is as follows.

1. Neighbors selection. First, we compute the inverse invocation frequency of users (IIFU) for each service, and IIFU takes the role of the weight for that service. We also compute the inverse invocation frequency of services (IIFS) for each user, and IIFS serves as the weight for that user. IIFU and IIFS are integrated into Euclidean distance to compute the similarity between two users or two services, and the neighbors set is formed according to the computed similarities.
2. Neighbor features learning. The selected neighbors are collected to form a neighbors feature matrix, which will be reconstructed by CNN, and finally be factorized to predict the missing values.

4 QoS prediction with neighbors feature learning

This section gives a detailed explanation of our proposed joint CNN and MF model (JCM for short), which is based on neighbors feature learning.

4.1 Matrix factorization model with neighbors feature learning

Due to the high sparsity and limitations of shallow models, the latent feature learning in pure MF is not accurate enough. In this paper, we propose to combine the convolutional neural network and matrix factorization to learn deep latent features. The convolutional operation is used to learn deep neighbors features and reshape MF model using deep latent features. A new matrix factorization model is constructed, named joint CNN-MF (JCM for short).

Different from the traditional MF model, the feature matrix in joint CNN-MF is reconstructed according to neighbors features. Therefore, the first key step is to obtain the reconstructed user feature matrix and service feature matrix, which are denoted as P' and Q', and the user-service QoS matrix is denoted as R. The objective is to fit the product of the two matrices to R.

$$R \approx P' (Q')^{T} \tag{1}$$

To obtain P' and Q', it is assumed that there are M users and N services. In matrix factorization, the matrix R (M × N) is decomposed into two low-dimensional matrices, i.e., user feature matrix P (M × F) and services feature matrix Q (N × F), where F denotes the number of features.

Features learning requires neighbors similarities and selection, but the traditional Euclidean distance suffers from overestimation problem. In this paper, we propose two measures, inverse invocation frequency of users (IIFU) and inverse invocation frequency of services (IIFS), to improve similarity computation. After that, the most similar K neighbors of a user or service are selected, and the neighbors set of user u is notated as N'(u) and neighbors set of service i is notated as N'(i).

Based on the neighbors set, the neighbor feature matrix can be constructed. The construction of user neighbors set is shown in Fig. 3, and the obtained neighbors feature matrix is notated as $P_v^{N'(u)}$. The construction of service neighbors set is similar, and the resulting matrix is notated as $Q_j^{N'(i)}$, where $v \in N'(u)$, $j \in N'(i)$.
Fig. 1 The whole framework for QoS prediction in edge computing
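Equation (1) fits R by the product of two low-dimensional feature matrices. The sketch below illustrates that idea with plain matrix factorization trained by stochastic gradient descent on the squared error of the observed entries, which is the loss and update scheme the paper states in Eqs. (3) to (5). It is a simplification and not the JCM model: P and Q start from random values instead of CNN-reconstructed features, and the function names `sgd_mf` and `mse`, the hyperparameter values, the NaN encoding of missing entries, and the toy matrix are all assumptions.

```python
import numpy as np

def sgd_mf(R, F=2, alpha=0.05, epochs=500, seed=0):
    """Fit R ~= P @ Q.T on the observed (non-NaN) entries with SGD."""
    rng = np.random.default_rng(seed)
    M, N = R.shape
    P = rng.normal(scale=0.1, size=(M, F))   # user feature matrix (M x F)
    Q = rng.normal(scale=0.1, size=(N, F))   # service feature matrix (N x F)
    observed = np.argwhere(~np.isnan(R))     # (u, i) pairs with a known QoS value
    for _ in range(epochs):
        rng.shuffle(observed)                # shuffle the pairs in place
        for u, i in observed:
            err = R[u, i] - P[u] @ Q[i]      # q_ui minus the current prediction
            p_old = P[u].copy()
            # The gradient of err**2 w.r.t. P[u] is -2 * err * Q[i], and symmetrically for Q[i].
            P[u] += 2 * alpha * err * Q[i]
            Q[i] += 2 * alpha * err * p_old
    return P, Q

def mse(R, P, Q):
    """Mean squared error over the observed entries only."""
    mask = ~np.isnan(R)
    return float(np.mean((R[mask] - (P @ Q.T)[mask]) ** 2))

# Toy QoS matrix with hypothetical values; NaN marks a missing invocation.
R = np.array([[1.0, 0.5, np.nan],
              [0.9, np.nan, 0.3],
              [np.nan, 0.6, 0.4]])
P, Q = sgd_mf(R)
print("training MSE:", mse(R, P, Q))
print("predicted value for user 0, service 2:", P[0] @ Q[2])
```

Only observed entries contribute to the updates, so the missing entries are filled in purely by the learned features.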

Fig. 2 Matrix factorization (MF) model (labels: user-service QoS matrix with users and services; two feature matrices, users features and services features)
With convolutional operation, latent features are learned for every user neighbors feature matrix, and every service neighbors feature matrix, and the reconstructed feature matrix is as follows.

$$P' = \begin{bmatrix} \vdots \\ P'[u] \\ \vdots \end{bmatrix}, \quad Q' = \begin{bmatrix} \vdots \\ Q'[i] \\ \vdots \end{bmatrix} \tag{2}$$

The reconstructed feature matrices P' and Q' are composed of feature vectors $P'_u$ and $Q'_i$, which are learned by stochastic gradient descent algorithm.

Finally, to fit the matrix R by the product of P' and Q', a loss function L is built to minimize the predicted errors.

$$L(P', Q') = \min \sum_{u=1}^{M} \sum_{i=1}^{N} \left( q_{u,i} - P'_u \cdot (Q'_i)^{T} \right)^{2} \tag{3}$$

where $q_{u,i}$ is the QoS value of service i invoked by user u. In optimization of L, $P'_u$ and $Q'_i$ are solved by stochastic gradient descent, which is conducted as follows.

$$P'_u = P'_u - \alpha \frac{\partial L}{\partial P'_u} \tag{4}$$

$$Q'_i = Q'_i - \alpha \frac{\partial L}{\partial Q'_i} \tag{5}$$

where α is the learning rate, and the loss function gradually decreases along with iterations, so as to achieve the optimization. The iterative process is repeated until convergence, and the prediction result is

$$\hat{q}_{u,i} = P'_u \cdot (Q'_i)^{T} \tag{6}$$

4.2 Similarity computation

As the traditional selection of neighbors has overestimation problem, we propose an improved similarity computation method, which is enhanced by inverse invocation frequency of users (IIFU) and inverse invocation frequency of services (IIFS). The similarity between users is computed by the services that they both invoke. IIFU of service i ($\lambda_i$) is computed with

$$\lambda_i = \log \frac{|u|}{|u_i|} \tag{7}$$

where ∣u∣ is the total number of users, and ∣u_i∣ is the number of users who have invoked service i. A higher IIFU of service i indicates a higher discrimination of service i. Similar to IIFU, the inverse invocation frequency of services (IIFS, $\lambda_u$) is
Fig. 3 User feature matrix and neighbors feature matrix (labels: user feature matrix P with users and features; neighbors feature matrix $P^{N'(u)}$ formed from the neighbors N'(u))

$$\lambda_u = \log \frac{|i|}{|i_u|} \tag{8}$$

where ∣i∣ is the total number of services, and ∣i_u∣ represents the number of services invoked by user u. Similarly, a higher value of $\lambda_u$ means a higher discrimination of user u. Using the IIFU of each co-invoked service i as the weight, the proposed user similarity is based on Euclidean distance, and is computed as

$$S_{u,v} = \frac{1}{1 + \sqrt{\dfrac{\sum_{i \in M} \left( (q_{u,i} - \bar{q}_u) - (q_{v,i} - \bar{q}_v) \right)^{2} \cdot \lambda_i}{|M|}}} \tag{9}$$

where $S_{u,v}$ is the similarity between user u and user v. To model the common behavior in service invocation, we build a co-invoked services set $M = M_u \cap M_v$, which is the set of services invoked by both user u and user v. $M_u$ denotes the set of services invoked by user u and $M_v$ denotes the set of services invoked by user v. $q_{u,i}$ is the real QoS value generated after the target service i is invoked by the target user u, and $q_{v,i}$ is the real QoS value of the target service i invoked by user v. To prevent the denominator from being 0, we add 1 in denominator. $\bar{q}_u$ is the average QoS value of user u, and $\bar{q}_v$ is the average QoS value of user v. The computation of service similarity is similar to that of users, with the IIFS of each co-invoking user u as the weight, and is computed with

$$S_{i,j} = \frac{1}{1 + \sqrt{\dfrac{\sum_{u \in N} \left( (q_{u,i} - \bar{q}_i) - (q_{u,j} - \bar{q}_j) \right)^{2} \cdot \lambda_u}{|N|}}} \tag{10}$$

where $N = N_i \cap N_j$ is the set of users that have invoked both service i and service j before, $\bar{q}_i$ is the average QoS value of service i, and $\bar{q}_j$ is the average QoS value of service j. $N_i$ is the set of users that have invoked service i before, and $N_j$ is the set of users that have invoked service j before. Following similarity results, the top K most similar neighbors are selected to form the neighbors set N'(u) of user u, and the neighbors set N'(i) of service i.

4.3 Neighbors feature learning

In our model, the convolutional operation is to learn deep features from the neighbors feature matrix, and further construct user feature matrix or service feature matrix. Figure 4 shows a 4-layer convolutional network architecture, containing input layer, convolution layer, pooling layer and output layer.

The convolutional layer is the core of CNN, and is used for convolutional computation of input features and learning key features. In our model, the convolutional layer receives feature matrices $P_v^{N'(u)}$ generated by neighbors of user u and $Q_j^{N'(i)}$ generated by neighbors of service i. Assuming that there are n neurons in convolutional layer, each neuron $n_t$ performs convolutional computation on inputs by sliding convolutional kernel $C_{n_t}$, and obtains a new set of features. The convolution kernel is also known as a filter. In convolution computation, each convolutional kernel slides along the width and height of the input data, and computes the inner product of the entire convolutional kernel and the input data. Take the user side as an example. Assuming that the size of convolutional kernel is s and $C_{n_t} \in \mathbb{R}^{s \times s}$, the feature map represented by $n_t$ is obtained by

$$g_{n_t} = f\left( P_v^{N'(u)} * C_{n_t} + b_v \right), \quad v \in N'(u) \tag{11}$$

where the operator ∗ represents the convolutional computation, and $b_v$ represents the offset. f is a sigmoid function, and is used to transform vectors to a value in the range of 0 to 1.

$$\mathrm{Sigmoid}(x) = \frac{1}{1 + e^{-x}} \tag{12}$$

The convolutional layer is followed by a pooling layer, which is used to reduce the dimension of features learned by the convolutional layer, retaining more important features, and can also alleviate the overfitting issue. Assume that the feature map obtained in convolutional layer is $G = \{ g_{n_1}, g_{n_2}, \ldots, g_{n_t}, \ldots, g_{n_{z-s+1}} \}$. We adopt the maximum pooling method, that is, the maximum value in G is taken to represent the pooling result, which is shown as follows.
Fig. 4 Neighbors feature learning in convolutional neural network (layers, in order: input layer (neighbors feature matrix), convolutional layer, pooling layer, output layer (deep feature vector))
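The IIFU weight of Eq. (7) and the user similarity of Eq. (9) in Section 4.2 can be sketched in a few lines of NumPy. This is an illustrative reading of the formulas, not the authors' code: the names `iifu` and `user_similarity`, the use of NaN to mark missing QoS values, the natural logarithm, the zero weight for services nobody invoked, and the choice to return 0 when two users share no service are all assumptions.

```python
import numpy as np

def iifu(R):
    """IIFU weight of each service, Eq. (7): log(|u| / |u_i|).

    R : (M, N) QoS matrix with np.nan for missing invocations.
    Services that nobody invoked get weight 0 (an assumption; Eq. (7) is undefined there).
    """
    n_users = R.shape[0]
    invoked_by = np.sum(~np.isnan(R), axis=0)       # |u_i| for each service i
    lam = np.zeros(R.shape[1])
    mask = invoked_by > 0
    lam[mask] = np.log(n_users / invoked_by[mask])
    return lam

def user_similarity(R, lam, u, v):
    """IIFU-weighted similarity between users u and v, Eq. (9)."""
    co = ~np.isnan(R[u]) & ~np.isnan(R[v])          # co-invoked service set M
    if not co.any():
        return 0.0                                  # assumption: no shared service gives 0
    du = R[u, co] - np.nanmean(R[u])                # q_{u,i} minus the average QoS of user u
    dv = R[v, co] - np.nanmean(R[v])                # q_{v,i} minus the average QoS of user v
    dist = np.sqrt(np.sum((du - dv) ** 2 * lam[co]) / co.sum())
    return 1.0 / (1.0 + dist)

# Toy QoS matrix with hypothetical values: 4 users, 3 services.
R = np.array([[1.0, 2.0, np.nan],
              [1.5, 2.5, np.nan],
              [3.0, 1.0, 2.0],
              [np.nan, np.nan, 0.5]])
lam = iifu(R)
print(lam)                              # rarely invoked services get larger weights
print(user_similarity(R, lam, 0, 1))    # same deviations from the user means
print(user_similarity(R, lam, 0, 2))    # opposite deviations, lower similarity
```

The service similarity of Eq. (10) follows the same pattern with the roles of rows and columns swapped and the IIFS weights of Eq. (8) in place of `lam`.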
$$d = \left[ \max(g_{n_1}), \max(g_{n_2}), \ldots, \max(g_{n_{z-s+1}}) \right] \tag{13}$$

The output layer follows the pooling, and converts the high-order features obtained by the pooling layer into user-feature vectors of fixed dimensions, which are obtained by sigmoid function.

$$P'[u] = f(w_u d_u + b_u) \tag{14}$$

where P'[u] is the feature vector of user u, $d_u$ is the output of pooling, $w_u$ is the weight, and $b_u$ is an offset. Similarly, the feature vector of service i can be computed in the same way as

$$Q'[i] = f(w_i d_i + b_i) \tag{15}$$

5 Experiments and evaluation

5.1 Dataset

WSDream dataset [35] is used in our experiments. WSDream contains 5825 services and 339 users, including two types of QoS, i.e., response time and throughput. In this paper, we conducted the experiments on response time dataset, as this dataset has been widely used in performance evaluation for QoS prediction. Therefore, it is more credible for performance comparison among different methods. Table 1 presents the dataset details.

Table 1 Dataset statistics

| Statistic | Value |
| --- | --- |
| Number of users | 339 |
| Number of services | 5825 |
| Number of service invocations | 1,974,675 |
| Number of countries where users are located | 30 |
| Number of countries where services are located | 73 |
| Average response time | 0.81 s |

5.2 Metric

We evaluate the performance of methods using two metrics, including mean absolute error (MAE) and root mean square error (RMSE). A smaller MAE means higher prediction accuracy. MAE is computed as follows.

$$\mathrm{MAE} = \frac{1}{N} \sum_{(u,i) \in M_t} \left| q_{u,i} - \hat{q}_{u,i} \right| \tag{16}$$

RMSE represents the standard deviation of prediction errors, and is computed as

$$\mathrm{RMSE} = \sqrt{\frac{1}{N} \sum_{(u,i) \in M_t} \left( q_{u,i} - \hat{q}_{u,i} \right)^{2}} \tag{17}$$

where $M_t$ denotes the test set and N is the number of QoS values in test set $M_t$. $q_{u,i}$ represents the real QoS value, while $\hat{q}_{u,i}$ represents the corresponding predicted QoS value. A smaller RMSE means higher prediction accuracy.

5.3 Training set and test set

In real service invocation environment, the QoS records are usually highly sparse. Users typically have invoked only a very small number of services, which leads to the limited overlapping of the same service invocation among users. To evaluate our method in real service invocation environment, a part of QoS values are randomly selected from the whole response time set as training set, and the remaining response time values form the test set. In this paper, 8 different training sets with different densities are formed, and the data densities
Table 2 Prediction accuracy comparison in low data densities (a smaller value means a better performance)

| Model | d = 5% MAE | d = 5% RMSE | d = 10% MAE | d = 10% RMSE | d = 15% MAE | d = 15% RMSE | d = 20% MAE | d = 20% RMSE |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| UserMean | 0.8715 | 1.8586 | 0.8734 | 1.8561 | 0.8758 | 1.8537 | 0.8792 | 1.8561 |
| ItemMean | 0.7426 | 1.5776 | 0.7282 | 1.5485 | 0.7115 | 1.5306 | 0.7005 | 1.5308 |
| UPCC | 0.7695 | 1.6940 | 0.7201 | 1.6101 | 0.6521 | 1.4959 | 0.5921 | 1.3993 |
| IPCC | 0.7326 | 1.6346 | 0.7125 | 1.5439 | 0.6783 | 1.4104 | 0.6503 | 1.3109 |
| WSRec | 0.6794 | 1.4884 | 0.6211 | 1.4265 | 0.6037 | 1.3684 | 0.6020 | 1.3515 |
| LFM | 0.5783 | 1.5010 | 0.5644 | 1.3201 | 0.5438 | 1.2719 | 0.5350 | 1.2269 |
| SN-MF | 0.6318 | 1.3968 | 0.5377 | 1.2695 | 0.5008 | 1.2264 | 0.4871 | 1.2073 |
| LE-MF | 0.5734 | 1.3703 | 0.5136 | 1.2511 | 0.4827 | 1.2041 | 0.4646 | 1.1809 |
| LR-MF | 0.5516 | 1.4151 | 0.4785 | 1.2576 | 0.4460 | 1.2128 | 0.4340 | 1.1422 |
| JCF | 0.5132 | 1.3328 | 0.4665 | 1.2505 | 0.4504 | 1.1854 | 0.4365 | 1.1802 |
Table 3 Prediction accuracy comparison in high data densities (a smaller value means a better performance)

| Model | d = 30% MAE | d = 30% RMSE | d = 50% MAE | d = 50% RMSE | d = 70% MAE | d = 70% RMSE | d = 90% MAE | d = 90% RMSE |
| --- | --- | --- | --- | --- | --- | --- | --- | --- |
| UserMean | 0.8688 | 1.8351 | 0.8775 | 1.8594 | 0.8774 | 1.8660 | 0.8727 | 1.8418 |
| ItemMean | 0.6825 | 1.5290 | 0.6746 | 1.5046 | 0.6749 | 1.5182 | 0.6806 | 1.5262 |
| UPCC | 0.5861 | 1.5028 | 0.6077 | 1.6207 | 0.5628 | 1.4737 | 0.5639 | 1.3791 |
| IPCC | 0.6316 | 1.4724 | 0.5865 | 1.6184 | 0.5439 | 1.4704 | 0.6687 | 1.4173 |
| WSRec | 0.4800 | 1.3424 | 0.4652 | 1.2712 | 0.4312 | 1.2058 | 0.4169 | 1.1324 |
| LFM | 0.4767 | 1.3511 | 0.4212 | 1.2501 | 0.4018 | 1.1514 | 0.3848 | 1.1015 |
| JCF | 0.4295 | 1.1746 | 0.4069 | 1.1488 | 0.3897 | 1.1157 | 0.3783 | 1.0894 |
d are 5%, 10%, 15%, 20%, 30%, 50%, 70% and 90%, respectively. Take the training set with a density of 10% as an example. We randomly select 10% of the QoS values from the whole response time dataset as the training set. The remaining 90% of the data form the test set to test the prediction accuracy of the proposed model. The prediction results obtained after a single round of experiment do not have enough credibility in model comparison. Therefore, we performed 10 rounds of experiments on each density case (5%, 10%, 15%, 20%, 30%, 50%, 70% and 90%), and report the average prediction results.

5.4 Performance comparison

To give a comprehensive performance comparison, we compare our model with the following widely-used or state-of-the-art methods, and conduct a series of comparative experiments on different training sets with different densities. The compared methods are presented as follows, and the results are given in Table 2 and Table 3.

1. UserMean. UserMean predicts missing values by computing the known average QoS value of each user.
Fig. 5 Impact of f under different data densities (panels a to d)
2. ItemMean. ItemMean predicts the missing values by computing the known average QoS value of each service.
3. UPCC (User Pearson Correlation Coefficient). UPCC borrows the idea from user-based collaborative filtering algorithm, which uses Pearson correlation coefficient to compute the similarity between two users [36].
4. IPCC (Item Pearson Correlation Coefficient). IPCC borrows the idea from item-based collaborative filtering algorithm, which also uses Pearson correlation coefficient to compute the similarity between two items [37].
5. WSRec (Web Service Recommendation). WSRec performs a linear combination of prediction results of UPCC and IPCC [35].
6. LFM (Latent Factor Model). LFM performs dimensional reduction of the QoS matrix to learn latent features of users and services, and predicts results based on the learned latent features [9].
7. SN-MF (Service Neighbors-based MF). SN-MF proposes a new method for selecting neighbors of each service, and further combines latent features of service neighbors into matrix factorization model [38].
8. LE-MF (Location-Enhanced MF). LE-MF integrates geographical information and trust mechanism into traditional matrix factorization model to predict QoS values [39].
9. LR-MF (Location and Reputation-aware MF). LR-MF uses reputation information combined with matrix factorization to discover hidden features and make predictions [40].

From Tables 2 and 3, we can have the following observations.

1. Our model (JCM, listed as JCF in Tables 2 and 3) achieves the lowest errors in most cases of data densities, including both low and high data densities. The exceptions are in Table 2, where LR-MF obtains a slightly lower MAE at d = 15% and d = 20% and a lower RMSE at d = 20%. It indicates that JCF model can better handle the high sparsity problem, which is a serious issue in real-world scenario of service invocation. Also, it can be inferred that JCF model has high applicability and can be employed in many cases of service invocation.
2. The proposed JCF model behaves better than the compared neighborhood-based methods, such as UPCC and IPCC, and also behaves better than the compared model-
(a) (b)
(c) (d)
Fig. 6 Impact of K under different data densities
Mobile Netw Appl
based methods. Such an advantage verifies the effective- side. We also proposed an effective way to integrate the
ness of the proposed framework, including similarity learned deep features into MF model, and constructed a com-
computation, built model and features learning technique. prehensive prediction framework. The experimental results on
3. Along with the data densities increasing, the prediction the real-world dataset demonstrated that compared to the tra-
errors become smaller. The reason is that more training ditional and state-of-the-art methods, our model achieved con-
QoS records can promote the accuracy of similarity com- sistently higher accuracy, both in low data densities and high
putation, and also promote the quality of learned features data densities. We also conducted parameter sensitivity
in neighbors feature learning. With promoted similarity experiments.
results and features, JCF model will generate more accu- Although the advantages of CNN have been widely used in
rate prediction results. various fields, there are still some deficiencies requiring fur-
ther research, such as how to integrate time factors into CNN.
As in edge computing, the configurations and running status
5.5 Sensitivity analysis of parameters of devices probably are not stable, and may change along with
environment changing. We will continuously improve our
5.5.1 Impact of f model in future.
In our proposed method, the parameter f determines the num- Acknowledgements This paper is supported by the National Key
Research and Development Program of China (No.2017YFB1400601),
ber of latent features. In this section, we study the impact of f
National Natural Science Foundation of China (No. 61872119, No.
on prediction results, and the value of f changes from 10 to 61702391), Natural Science Foundation of Zhejiang Province (No.
100 with an increment of 10. Other parameters are set accord- LY16F020017) and Shaanxi Province (No.2018JQ6050), and
ing to default parameter setting. The impact of f on MAE and Fundamental Research Funds for Central Universities (JBX171007).
RMSE under different data densities is presented in Fig. 5.
As shown in Fig. 5, as the number of features increases,
both MAE and RMSE gradually decrease. At the case of data References
densities being 5% and 10%, when f exceeds 50, MAE and
RMSE tend to be stable. In contrast, at the case of densities 1. Deng S, Xiang Z, Yin J, Taheri J, Albert Y (2018) Zomaya:
Composition-Driven IoT Service Provisioning in Distributed
being 15% and 20%, when f exceeds 70, MAE and RMSE Edges. IEEE Access 6:54258–54269
tend to be stable. 2. Xia Y, Zhou MC, Luo X, Zhu Q, Li J, Huang Y (2015) Stochastic
Modeling and Quality Evaluation of Infrastructure-as-a-Service
5.5.2 Impact of K Clouds. IEEE Trans Automation Science and Engineering 12(1):
162–170
3. Zhang C, Zhao H, Deng S (2018) A Density-Based Offloading
The parameter K determines the number of neighbors. We Strategy for IoT Devices in Edge Computing Systems. IEEE
conduct the sensitivity analysis of K in the value range of 10 Access 6:73520–73530
to 50 with a constant increment of 10. The data density varies 4. Peng Q, Zhou M, He Q, Xia Y, Wu C, Deng S (2018) Multi-
from 5% to 90%, and a lower density means less available Objective Optimization for Location Prediction of Mobile
Devices in Sensor-Based Applications. IEEE Access 6:77123–
training data. We report the experimental results of MAE and 77132
RMSE, which are present in Fig. 6. 5. Park Y, Park S, Jung W (2015) Reversed CF: A fast collaborative
As it can be seen from Fig. 6, with the increase of K, MAE filtering algorithm using a k-nearest neighbor graph. Expert Syst
and RMSE results slightly decline in the beginning, and rise Appl 42(8):4022–4028
also slightly after K reaches a certain threshold. The reason for 6. Zheng Z, Ma H, Lyu MR (2010) QoS-Aware Web Service
Recommendation by Collaborative Filtering. IEEE Trans Services
errors rising is that as the number of neighbors increases, some Computing 4(2):140–152
unreliable neighbors will be included, lowering the quality of 7. Sharifi Z, Rezghi M, Nasiri M (2014) A new algorithm for solving
neighbors selection. In the whole range of values, it can be data sparsity problem based-on Non-negative matrix factorization
seen that the performance of our model changes slightly, and in recommender systems. Int. Conf. Computer and Knowledge
Engineering, pp. 56–61
is not sensitive to different values of K.
8. Chen X, Zheng Z, Yu Q (2014) Web Service Recommendation via
Exploiting Location and QoS Information. IEEE Trans Parallel and
Distributed Syst 25(7):1913–1924
6 Conclusion and future work 9. Koren Y, Bell R, Volinsky C (2009) Matrix Factorization
Techniques for Recommender Systems. Computer. 42(8):30–37
This paper proposed a QoS prediction method for services 10. Kai SU, Liang-Li MA, Sun YF et al (2015) Non-negative matrix
factorization model for Web service QoS prediction. J Zhejiang
recommendation in edge computing environment, with deep University 49(7):1358–1366
feature learning. Using CNN, the proposed model learned the 11. Zhang S, Yao L, Sun A (2017) Deep learning based recommender
deep latent features of neighbors, both in user side and service system: A survey and new perspectives. In CoRR, abs/1707.07435
Mobile Netw Appl
12. Gao H, Mao S, Huang W, Yang X (2018) Applying Probabilistic Model Checking to Financial Production Risk Evaluation and Control: A Case Study of Alibaba's Yu'e Bao. IEEE Transactions on Computational Social Systems (TCSS) 5(3):785–795
13. Yu F, Zhang Y, Song S (2015) LSUN: construction of a large-scale image dataset using deep learning with humans in the loop. Computer Science
14. Yu J, Kuang Z, Zhang B, Zhang W, Lin D, Fan J (2018) Leveraging Content Sensitiveness and User Trustworthiness to Recommend Fine-Grained Privacy Settings for Social Image Sharing. IEEE Transactions on Information Forensics and Security. https://doi.org/10.1109/TIFS.2017.2787986
15. Yu J, Hong C, Rui Y, Tao D (2018) Multitask Autoencoder Model for Recovering Human Poses. IEEE Trans Ind Electron 65(6):5060–5068
16. Kim D, Park C, Oh J (2016) Convolutional matrix factorization for document context-aware recommendation. Int Conf Recommender Systems:233–240
17. Chen L, Ha W (2018) Reliability prediction and QoS selection for web service composition. Int J Comput Sci Eng 16(2):202
18. Wang S, Zhao Y, Huang L (2017) QoS prediction for service recommendations in mobile edge computing. J. Parallel and Distributed Computing
19. Yin Y, Song A, Min G (2016) QoS Prediction for Web Service Recommendation with Network Location-Aware Neighbor Selection. Int J Softw Eng Knowl Eng 26(4):611–632
20. Zheng Z, Chen J, Lyu MR (2013) Personalized Web Service Recommendation via Normal Recovery Collaborative Filtering. IEEE Trans Services Computing 6(4):573–579
21. Yao L, Sheng QZ, Segev A (2013) Recommending web services via combining collaborative filtering with content-based features. Int Conf Web Services:42–49
22. Xin M, Zhang Y, Li S (2017) A Location-Context Awareness Mobile Services Collaborative Recommendation Algorithm Based on User Behavior Prediction. International Journal of Web Services Research 14(2):45–66
23. Ren L, Wang W (2017) An SVM-based collaborative filtering approach for Top-N web services recommendation. Future Generation Computer Systems
24. Chen Z, Shen L, Li F (2017) Your neighbors alleviate cold-start: On geographical neighborhood influence to collaborative web service QoS prediction. Knowl-Based Syst 138:188–201
25. Xu Y, Yin J, Wei L (2013) Personalized Location-Aware QoS Prediction for Web Services Using Probabilistic Matrix Factorization. Int Conf. Web Information Systems Engineering, pp. 229–242
26. He P, Zhu J, Zheng Z (2014) Location-based hierarchical matrix factorization for web service recommendation. Int Conf. on Web Services, pp. 297–304
27. Li S, Wen J, Luo F (2017) A New QoS-Aware Web Service Recommendation System based on Contextual Feature Recognition at Server-Side. IEEE Trans Network and Service Management PP(99):1–1
28. Zhou L, Song Z, Zhai S (2014) Predicting Web Service QoS via Combining Matrix Factorization with Network Location. International Journal of U and E-Service Science and Technology 7(3):303–317
29. Zhou C, Zhang W, Li B (2014) Web Service Recommendation via Exploiting Temporal QoS Information. In Algorithms and Architectures for Parallel Processing, pp. 15–27
30. Wu C, Qiu W, Zheng Z (2015) QoS prediction of web services based on two-phase K-means clustering. Int Conf. Web Services, pp. 161–168
31. Tian G, Wang J, He K (2017) Integrating implicit feedbacks for time-aware web service recommendations. Inf Syst Front 19(1):1–15
32. Tuan TX, Tu MP (2017) 3D Convolutional networks for session-based recommendation with content features. Int Conf. Recommender Systems, pp. 138–146
33. Xing S, Liu F, Zhao X (2017) Points-of-interest recommendation based on convolution matrix factorization. Appl Intell C:1–12
34. Huan H, Wei Z, Liang L (2017) Collaborative filtering recommendation model based on convolutional denoising auto encoder. In Chinese Conference, pp. 64–71
35. Zheng Z, Ma H, Lyu MR (2009) WSRec: A Collaborative Filtering Based Web Service Recommender System. Int. Conf. Web Services, pp. 437–444
36. Resnick P, Iacovou N, Suchak M (1994) GroupLens: an open architecture for collaborative filtering of netnews. In ACM Conference on Computer Supported Cooperative Work, pp. 175–186
37. Sarwar B, Karypis G, Konstan J (2001) Item-based collaborative filtering recommendation algorithms. Int Conf. World Wide Web, pp. 285–295
38. Yin J, Xu Y (2015) Personalised QoS-based web service recommendation with service neighborhood-enhanced matrix factorization. Int J Web and Grid Services 11(1):39–56
39. Xu Y, Yin J, Deng S (2016) Context-aware QoS prediction for web service recommendation and selection. Expert Syst Appl 53:75–86
40. Li S, Wen J, Luo F, Tian C, Xiong Q (2017) A location and reputation aware matrix factorization approach for personalized quality of service prediction. IEEE International Conference on Web Services, pp. 652–659

Publisher's Note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
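As a rough illustration of the evaluation protocol used in the experiments (random training sets at a given data density, 10 repetitions, averaged MAE and RMSE) and of the two simplest compared methods, UserMean and ItemMean, a minimal Python sketch might look as follows. The function names, the dictionary-based QoS representation, the global-mean fallback for unseen users or services, and the toy data are all assumptions made for illustration; they are not taken from the paper or its code, and the sketch does not implement JCM.

```python
import math
import random


def split_by_density(qos, density, rng):
    """Randomly keep about `density` of the known entries for training; the rest is the test set."""
    entries = list(qos.items())  # ((user, service), value) pairs
    rng.shuffle(entries)
    cut = max(1, int(len(entries) * density))
    return dict(entries[:cut]), dict(entries[cut:])


def mean_predictor(train, axis):
    """axis=0 gives UserMean, axis=1 gives ItemMean.

    Assumption: users or services unseen in training fall back to the global mean.
    """
    sums, counts = {}, {}
    for key, value in train.items():
        k = key[axis]
        sums[k] = sums.get(k, 0.0) + value
        counts[k] = counts.get(k, 0) + 1
    global_mean = sum(train.values()) / len(train)

    def predict(key):
        k = key[axis]
        return sums[k] / counts[k] if k in counts else global_mean

    return predict


def mae_rmse(predict, test):
    """Mean absolute error and root mean squared error over the test entries."""
    if not test:
        raise ValueError("test set is empty; use a density below 1.0")
    errors = [predict(key) - value for key, value in test.items()]
    mae = sum(abs(e) for e in errors) / len(errors)
    rmse = math.sqrt(sum(e * e for e in errors) / len(errors))
    return mae, rmse


def average_over_runs(qos, density, axis, runs=10, seed=0):
    """Repeat the random split `runs` times and average MAE and RMSE."""
    rng = random.Random(seed)
    results = []
    for _ in range(runs):
        train, test = split_by_density(qos, density, rng)
        results.append(mae_rmse(mean_predictor(train, axis), test))
    avg_mae = sum(r[0] for r in results) / runs
    avg_rmse = sum(r[1] for r in results) / runs
    return avg_mae, avg_rmse


# Toy QoS matrix (hypothetical values): 5 users x 6 services.
toy = {(u, s): 1.0 + 0.5 * u + 0.1 * s for u in range(5) for s in range(6)}
user_mean_scores = average_over_runs(toy, density=0.5, axis=0)
item_mean_scores = average_over_runs(toy, density=0.5, axis=1)
```

Averaging over several random splits reduces the influence of any single lucky or unlucky training set, which is the stated motivation for repeating each density case 10 times.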

