4633 - Comprehensive Tempo-Spatial Data Collection in Crowd Sensing Using A Heterogeneous Sensing Vehicle Selection Method PDF

Pers Ubiquit Comput (2016) 20:397–411
DOI 10.1007/s00779-016-0932-x
ORIGINAL ARTICLE
Comprehensive tempo-spatial data collection in crowd sensing

using a heterogeneous sensing vehicle selection method
Yazhi Liu1 • Jianwei Niu2 • Xiting Liu2
Received: 9 October 2015 / Accepted: 30 March 2016 / Published online: 10 May 2016
Springer-Verlag London 2016
Abstract The wide distribution of mobile vehicles instal- simulations show that the HVS algorithm can collect
led with various sensing devices and wireless communi- sensing data with a higher coverage ratio in a more uniform
cation interfaces has made vehicular mobile crowd sensing and continuous manner than existing mobile crowd sensing
possible in practice. However, owing to the heterogeneity methods.
of vehicles in terms of sensing interfaces and mobilities,
collecting comprehensive tempo-spatial sensing data with Keywords Vehicular sensing Mobile crowd sensing
only one sensing vehicle is impossible. Moreover, the Comprehensive tempo-spatial data coverage
sensing data collected may expire in the future; as a result,
sensing vehicles may have to continuously collect sensing
data to ensure the relevance of such data. Although 1 Introduction
including more sensing vehicles can improve the quality of
collected sensing data, this step also requires additional In mobile crowd sensing systems, ordinary citizens are
cost. Thus, how to continuously collect comprehensive recruited to collect and share sensing data from their sur-
tempo-spatial sensing data with a limited number of rounding environments through their mobile devices [1, 2].
heterogeneous sensing vehicles is a critical issue in On the basis of collected sensing data, mobile crowd
vehicular mobile crowd sensing systems. In this work, a sensing systems are able to provide users with various
heterogeneous sensing vehicle selection (HVS) method for novel applications [3], such as real-time air quality reports
the collection of comprehensive tempo-spatial sensing data [4] and road traffic monitoring [5–7]. An increasing num-
is proposed. On the basis of the spatial distribution and ber of vehicles are being installed with various sensing
sensing interfaces of sensing vehicles and the tempo-spatial devices and mobile communication interfaces. Therefore,
coverage of collected sensing data, a utility function is an increasing number of mobile vehicles are able to collect
designed in HVS to estimate the sensing capacity of and share various types of sensing data in urban environ-
sensing vehicles. Then, according to the utilities of sensing ments. The extensive distribution of these vehicles in urban
vehicles and the restriction on the number of recruited areas has made vehicular mobile crowd sensing possible in
sensing vehicles, sensing vehicle selection is modeled as a practice.
knapsack problem. Finally, a greedy optimal sensing A vehicular mobile crowd sensing system is composed
vehicle selection algorithm is designed. Real trace-driven of sensing vehicles, wireless networks, and sensing data
centers. Sensing data are collected by sensing vehicles and
then transmitted to a sensing data center via wireless net-
& Jianwei Niu works. The application of vehicular mobile crowd sensing
niujianwei@buaa.edu.cn
to transportation can generate numerous advantages. First,
1
College of Information Engineering, North China University it can lower the cost of sensing data collection because
of Science and Technology, Tangshan 063000, China vehicles are equipped with several sensors. Second, unlike
2
State Key Laboratory of Software Development in single-vehicle sensing systems, sensing data in vehicular
Environment, Beihang University, Beijing 100191, China crowd sensing systems are collaboratively collected by
123
398 Pers Ubiquit Comput (2016) 20:397–411
multiple vehicles. As a result, vehicular mobile crowd interfaces of sensing vehicles and the tempo-spatial cov-
sensing systems can collect a large amount of compre- erage of collected sensing data, a utility function is
hensive sensing data. Third, owing to the collaboration designed in HVS to estimate the sensing capacity of
between vehicles and sensing data sharing, vehicular crowd sensing vehicles in the future. According to the utilities of
sensing systems can provide many novel applications to the sensing vehicles and the restriction on the number of
field of transportation and other areas. To meet the sensing recruited sensing vehicles, sensing vehicle selection is
data requirements of vehicular mobile crowd sensing modeled as a knapsack problem. Finally, a greedy optimal
applications, a large amount of sensing data must be col- sensing vehicle selection algorithm is designed.
lected. Therefore, vehicular mobile crowd sensing systems The remainder of this paper is organized as follows.
are mainly aimed at collecting comprehensive sensing data Section 2 reviews the related studies on mobile crowd
that encompass all the spatial temporal dimensions of tar- sensing and vehicle sensing. Section 3 presents the vehic-
get sensing areas. However, sensing vehicles are hetero- ular mobile crowd sensing system model and the con-
geneous. First, the sensing interfaces of vehicles and the struction of tempo-spatial sensing data models. Section 4
types of sensing data that can be collected by vehicles are presents the vehicular sensing utility formula, which con-
different because vehicles are manufactured by different siders the spatial and temporal coverages of sensing data.
companies with no uniform hardware and software stan- Section 5 proposes an optimized heterogeneous vehicle
dards. Second, vehicles are driven by different people, and selection algorithm based on the utilities of sensing vehicle
the trajectories of the vehicles also vary. Thus, collecting candidates. Section 6 evaluates the performance of the
all available sensing data with only one sensing vehicle is proposed strategy through real trace-driven simulations.
impossible. Section 7 provides the conclusions.
Sensing data are temporal, and collected sensing data
may expire at some time in the future. For example,
atmospheric temperature changes with time in any given 2 Related work
day; as a result, collected temperature data expire after a
few hours. Moreover, different types of sensing data may Mobile crowd sensing [1] is a promising mode of collect-
have different life cycles. For example, the life cycle of ing comprehensive sensing data in urban areas. In recent
noise data is much shorter than that of temperature data. years, a large number of novel applications [3] based on
Thus, vehicular crowd sensing systems should continu- mobile crowd sensing have been deployed. For example,
ously collect sensing data in target sensing areas to ensure The Common Sense project [9] monitored the contents of
the relevance of each sensing data type to the target sensing pollutants in the air by collecting sensing data from
area. handheld air quality monitors. Yu et al. [10] recovered the
Owing to the heterogeneity of sensing vehicles, one noise situation throughout New York City (NYC) based on
sensing vehicle cannot possibly collect comprehensive mobile crowd sensing and inferred the fine-grained noise
tempo-spatial sensing data. Furthermore, as collected situation at different times of the day for each region of
sensing data may expire, crowd sensing vehicles should NYC with the use of data from the city complaint platform,
continuously collect sensing data to ensure their relevance. together with social media, road network data, and points
Obviously, in vehicular mobile crowd sensing systems, the of interests. The Mahali project [11] used GPS signals that
quality of collected sensing data increases with the use of penetrate the ionosphere for science rather than position-
many sensing vehicles to collect sensing data. However, a ing; a large number of ground-based sensors fed data into a
large number of sensing vehicles equates to high cost [8]. cloud-based processing environment through mobile devi-
Therefore, how to continuously collect comprehensive ces, thus enabling a tomographic analysis of the global
tempo-spatial sensing data with a limited number of ionosphere at an unprecedented resolution and coverage.
heterogeneous sensing vehicles is a critical issue in Zhou et al. [12] presented a novel bus arrival time pre-
vehicular mobile sensing systems. diction system based on crowd sensing and predicted bus
To address the aforementioned problem, a heteroge- arrival times by encouraging passengers to collect gener-
neous sensing vehicle selection (HVS) method for the ally available and energy-efficient sensing resources,
collection of comprehensive tempo-spatial sensing data is including cell tower signals, movement statuses, and audio
proposed in this work. The proposed HVS method con- recordings. Vassilis et al. [13] exploit public transit bus
siders not only the spatial coverage of collected sensing passengers Bluetooth-capable devices to capture and
data but also the temporal coverage of sensing data. A reconstruct micro- and macro-passenger behavior. How-
mobility model based on a continuous-time Markov chain ever, owing to the lack of theoretical research on mobile
is established to forecast the spatial distribution of sensing crowd sensing, existing mobile sensing applications suffer
vehicles. On the basis of the spatial distribution and sensing from poor sensing data qualities; thus, the commercial
123
Pers Ubiquit Comput (2016) 20:397–411 399
deployment of mobile crowd sensing systems is restricted. call logs can reduce energy consumption and result in high
Collecting comprehensive tempo-spatial sensing data is the sensing data coverage ratio [27]. Song et al. [28] proposed
foremost criterion of mobile crowd sensing systems. a multi task oriented Dynamic participant selection (DPS)
Sensing data collected by 85 mobile nodes for two months algorithm, which selects participants under limited bud-
revealed that people from areas with a small population are gets. However, none of the previous studies considered
willing to collect sensing data [14]. The reverse auction- both the heterogeneity of sensing nodes, the limited budget
based dynamic price (RADP) mechanism [15] introduces of sensing systems and the properties of the sensing data.
incentive mechanisms to mobile crowd sensing systems. A vehicle is a special type of node in mobile crowd
The most inexpensive sensing data are utilized to increase sensing systems. Considering that vehicles are widely
the total amount of sensing data collected by participants; distributed in urban areas and equipped with various types
as a result, the accuracy of sensing data is improved. of sensors, vehicles provide a natural means to realize
However, mobile sensing nodes are usually clustered mobile sensing. In early vehicle-based environmental
according to time and space. Consequently, the collected sensing applications, only one vehicle is recruited in a
data are inhomogeneous in time and space. To collect system. A novel application is the monitoring of locations
homogeneous sensing data, the greedy budgeted maximum through a vehicle radar [29]. Air quality can also be
coverage (GBMC) algorithm [16] maximizes not only the monitored with a CO2 sensor installed in a vehicle [30].
amount of collected sensing data but also the coverage of Similarly, rainy weather can be monitored with an onboard
sensing data. The Information Sampling (ISAM) [17] camera [31]. In [32], a laser scanner and an onboard
minimizes the diversity between the collected data and the camera were utilized to update a street view map. How-
forecast model to improve the quality of sensing data. ever, in these applications, no sensing data are shared
Unlike RADP, GBMC and ISAM investigate the homo- among vehicles. This condition constrains the extensive
geneous space distribution of sensing data. deployment of vehicle sensing-based applications. A plat-
In mobile crowd sensing systems, the mobile crowd form was designed in [33] for large-scale vehicle collab-
participants frequently encounter one another and are thus orative sensing. However, the algorithm was designed to
provided an opportunity to collaborate and provide high- plan the trajectories of a robot team to collect sensing data
quality sensing services. Collaborative sensing is one of the within the shortest time. In [34], multiple probe vehicles
essential features of participatory sensing systems. Context were utilized to estimate large-scale traffic in an urban
characteristics are generated from the sensing data col- environment. Vehicle sensing can also be utilized for safe
lected by multiple nearby participants through the use of a driving. A roadway weather information system developed
collaborative learning algorithm [18]. MOSDEN (a mobile in [35] recruited probe vehicles equipped with GPS, an
sensor data engine) [19] is a collaborative mobile sensing anti-lock braking system, and acceleration sensors to
framework that can operate on smartphones to capture and quickly detect the conditions of road surfaces according to
share sensing data among multiple distributed applications the side slip force of the vehicles in certain road segments.
and users. Similarly, a cloud-assisted collaborative sensing
method was proposed in [20] to reduce the energy con-
sumption of mobile phone sensing applications. 3 System model
Sensing node selection is a major challenge in mobile
crowd sensing systems because of the diversity of the A vehicular mobile crowd sensing system is composed of
sensing capabilities of mobile devices and the uncontrol- sensing data servers, wireless networks, and vehicles (i.e.,
lable trajectories of mobile nodes [21]. Chien et al. [22] sensing nodes), as shown in Fig. 1. Vehicular sensing
introduced the online task assignment problem in which nodes are installed in multiple types of sensing devices,
heterogeneous tasks are assigned to workers with different
unknown skill sets. Reddy et al. [23] developed a selection
framework to allow organizers to identify well-suited Sensing Area
sensing nodes for data collection on the basis of their

geographic and temporal availabilities as well as their
habits. Tuncay et al. [24] exploited the stability of user
behavior and selected sensing nodes according to the fit-
ness of the mobility history profiles of the users. Moreover,
Wireless Network Sensing Data
modeling the mobility of sensing nodes and forecasting the Center
traces of sensing nodes can reduce the required amount of Vehicle Nodes
participating sensing nodes [25, 26]. When selecting
sensing nodes, forecasting the traces of the nodes with their Fig. 1 Architecture of vehicular mobile crowd sensing systems
123
which can collect multiple types of sensing data. All the trajectories of vehicles are random. Therefore, col-
vehicles have wireless communication interfaces. When a lecting sensing data that cover an entire tempo-spatial
vehicle is selected as a sensing node, the sensors installed sensing space is difficult for a single vehicle to accom-
in it are turned on to sample the sensing data periodically. plish. The sensing data center is in charge of a vehicular
All the collected sensing data coupled with the corre- sensing system. The collected sensing data are stored in
sponding information of the GPS coordinates are uploaded the sensing data center. According to the collected sens-
and stored in the sensing data server via wireless networks. ing data and the sensing interfaces, as well as to the
Users of the sensing system can then make a query on the trajectories of the vehicular sensing nodes, the sensing
sensing data of interest from the sensing data server. data center selects the proper sensing nodes to collect
sensing data.
3.1 System architecture Different sensing node selection strategies may have
different effects on sensing data collection. An example of
In vehicular mobile crowd sensing systems, a sensing node a sensing node selection scenario is shown in Fig. 2. The
is a vehicle installed with sensing devices and wireless right part of the figure shows the collected sensing data in
communication interfaces. All sensing nodes are supposed the sensing data center, and the left part of the figure shows
to be willing to participate in the collection of sensing data. the sensing capabilities of the vehicles and the future
Vehicular sensing nodes are heterogeneous. mobility traces of all available participants. In this sensing
On the one hand, vehicles are manufactured by dif- scenario, {a, b} is a more preferable set of participants than
ferent companies, and the sensing interfaces embedded in {a, b} when meeting the task requirements. However, in
them are different. For example, vehicle a can collect real-world scenarios with a significantly large number of
temperature, noise, and humidity data, whereas vehicle b sensing data requirements and available heterogeneous
can collect temperature, speed, and vibrations (the nota- sensing vehicles, the selection of appropriate sensing
tions used in this paper are shown in Table 1). On the vehicles becomes an arduous task. The present work pre-
other hand, vehicles are driven by different people, and sents a heterogeneous sensing vehicle selection strategy
Table 1 List of notations

Notation Description
I The set of subareas

i; j 2 I; 1 6 i 6 N One subarea
W ¼ fw1 ; w2 ; . . .; wM g M types of sensing data
wl Life cycle of the lth type of sensing data
X The set of candidate vehicles
a, b Sensing vehicles

Ya ¼ ya1 ; ya2 ; ; yaM Sensing interfaces of vehicle a
nai The moment when vehicle a travels into subarea i
hai The residence time of vehicle a in subarea i
ZðtÞ The valid sensing data in the sensing data center at time t
fi;l ðtÞ The valid function of the sensing data of type l in sub area i at time t
X ¼ fX ðtÞ; t > 0g Mobility procedure of a vehicle
X(t) The state (i.e., location) of the vehicle at time t
Pij ðtÞ The transition probability of a vehicle from sub area i to sub area j after time duration t

Q ¼ qij Q Matrix of vehicle’s mobility procedure
uai Utility of sensing node a in sub area i
Ua The utility of vehicle a
U Total budget of the sensing system within one system duty cycle
ua The cost of sensing vehicle a
S Selected sensing vehicles
Ha ðtÞ The indicate matrix of the sensing data collected by a
WðtÞ Temp indicate matrix of the sensing data in the sensing data center
123
Target Sensing Area Coll ected Sens ing Data
Historical mobility trace Recruited vehicles Vibration sensor Brightness sensor
Predicted mobility trace PM2.5 sensor Temperature sensor
Fig. 2 Data collection by heterogeneous vehicles
that can collect the most comprehensive tempo-spatial where yal ¼ 1 indicates that a can collect the sensing data
sensing data with a limited number of vehicles. of type l and yal ¼ 0 indicates that a cannot collect the data
of type l.
3.2 Sensing data model
Vehicles away collect sensing data of a specific subarea
when it is traveling in the subarea. Let nai denote the
The geographic area where the vehicular crowd sensing
moment when vehicle a travels into subarea i, and hai
system collects sensing data is defined as the target sensing
denotes the duration vehicle a stay in subarea i. Then, the
area or simply the sensing area. The target sensing area is
life cycle function of the sensing data of type wl collected
divided into lattice cells according to the geographical
by a when it traveling in subarea i could be denoted as
location (Fig. 2), and each lattice cell is called a subarea. A
follow.
subarea is a unit area of the sensing system; the assumption
Vehicles collect the sensing data of a specific subarea
is that the sensing data sampled at any point in the subarea
when traveling in such subarea. The moment when vehicle
can indicate the sensing value of the subarea. The entire
a travels into subarea i is denoted as nai , and the time
sensing area is divided into N subareas, as shown in Fig. 2.
duration within which vehicle a stays in subarea i is
The set of subareas in the sensing area is denoted as
denoted as hai . Then, the life cycle function of the sensing
I = {1, 2, …, N}. The ith subarea is denoted as
data of type wl collected by a when traveling in subarea i
i ði 2 I; 1 6 i 6 NÞ:
can be denoted as follows:
Given that sensing data are usually sensitive to time a
(e.g., air temperature data collected an hour ago may no a yl nai 6 t 6 nai þ hai þ wl
hil ðtÞ ¼ ð2Þ
longer be accurate in the succeeding hours), sensing data 0 other
should be sampled periodically in each unit sensing area.
The valid sensing data in the sensing data center at time t is
The M types of sensing interfaces installed in vehicle
denoted as ZðtÞ, which is expressed as
sensing nodes are denoted as W ¼ fw1 ; w2 ; . . .; wM g; and
2 3
the life cycle of the lth type of sensing data is denoted as f1;1 ðtÞ f1;2 ðtÞ ... f1;M ðtÞ

wl. If Ya ¼ ya1 ; ya2 ; . . .; yaM denotes the sensing data types 6 f2;1 ðtÞ f2;2 ðtÞ f2;M ðtÞ 7
6 7
that can be collected by sensing node a, then 6
ZðtÞ ¼ 6 . 7 ð3Þ
. .. 7 ;
4 . . 5
a 0
yl ¼ ð1Þ fN;1 fN;2 ðtÞ ... fN;M ðtÞ
1
123
where fi;l ðtÞ is the valid function of the sensing data of type According to the Kolmogorov backward differential
l in subarea i at time t. equation,

1 the sensing data is effective at t P0 ðtÞ ¼ QPðtÞ ð9Þ
fi;l ðtÞ ð4Þ
0 the sensing data is ineffective at t If Pð0Þ ¼ I, then the transition probability of the Markov
chain X could be derived as follows:
3.3 Mobility model PðtÞ ¼ eQt ð10Þ
A time-continuous life cycle of sensing data is considered

in this paper. As a result, to estimate the precise collection
time of the sensing data, a time-continuous Markov Chain- 4 Utility of sensing vehicles
based mobility model is used. In vehicular crowd sensing
systems, vehicles are traveling in or transferring between 4.1 Utility function
subareas. The mobility process is denoted as
X ¼ fX ðtÞ; t > 0g, where X(t) denotes the state of the The sensing capacity of a sensing vehicle is indicated by
vehicle at time t (i.e., the subarea in which the vehicle stays the utility of the vehicle. The utility of vehicle a in the
at time t). Normally, the state of a vehicle is only related to following time interval tu is denoted as ta. The utility of a
its state at the last moment. Consequently, for any sensing vehicle is the metric for its sensing capacity. The
0 6 t0 \t1 \ \tn \tnþ1 , and ik 2 I, 0 6 k 6 n þ 1, we amount of sensing data collected by a vehicle depends
have largely on the duration within which the sensing vehicle
collects the sensing data. If the vehicle spends a long time
PfX ðtnþ1 Þ ¼ inþ1 jX ðt0 Þ ¼ i0 ; X ðt1 Þ ¼ i1 ; . . .; X ðtn Þ ¼ in g collecting sensing data, its sensing utility will be large.
¼ PfX ðtnþ1 Þ ¼ inþ1 jX ðtn Þ ¼ in g However, the computing cost of the utility will also be high
ð5Þ if the sensing time is considerably long. Therefore, a sys-
tem duty cycle, tu, is set to reduce the cost of utility cal-
Equation (5) means that the mobility process X ¼ culation. At the beginning of a system duty cycle, the
fX ðtÞ; t > 0g is a time-continuous Markov chain. For any utilities of the sensing vehicles are updated. Then, the
s; t > 0, and i; j 2 I, we have proper sensing nodes are selected according to the utilities
PfX ðs þ tÞ ¼ jjX ðsÞ ¼ ig of the sensing vehicle candidates. If uai denotes the utility
¼ PfX ðtÞ ¼ jjX ð0Þ ¼ ig ð6Þ of sensing node a in subarea i, then
XM Z tu
a

¼ Pij ðtÞ a a a
ui ¼ yl hil ðtÞ hil ðtÞ fil ðtÞ dt ð11Þ
0
where Pij ðtÞ denotes the transition probability of a vehicle l¼1
from subarea i to subarea j after time duration t. The matrix Utility uai denotes the amount of sensing data that vehicle a

of transition probability is denoted as PðtÞ ¼ Pij ðtÞ , can collect in the next system duty cycle tu in subarea i;
i; j 2 I. We define the transfer rate matrix of the mobility
hail ðtÞ denotes the effective duration of the sensing data of
process X ¼ fX ðtÞ; t > 0g as Q ¼ qij , which is also type l collected by a in subarea i. If a arrives in subarea i at
called the Q matrix. For any i 2 I, we define time t then stays in subarea i for a duration of s (i.e., nai ¼ t,
hai ¼ s), then
1 Pii ðDtÞ
qii ¼ lim ð7Þ Z tu
Dt!0 Dt
hail ðtÞdt ¼ s þ wl ð12Þ
For any i; j 2 I, j 6¼ i, we also define t¼0
Pij ðDtÞ Thus, for any nai and hai , we have

qij ¼ lim ð8Þ Z tu
Dt!0 Dt
hli ða; tÞdt
where qij denotes the transition intensity of the vehicle t¼0
ZZ
a
from subarea i to subarea j. As the number of the subareas ¼ P ni ¼ t P hai ¼ s ðs þ wl Þ dsdt
in the target sensing area is limited, for 8i 2 I, t2ð0;tu Þ;s2ð0;tu tÞ
P
0\ j6¼i qij ¼ qi \1. The average duration that the ð13Þ
vehicle stays in state X ð0Þ ¼ i is determined by qi.
where P nai ¼ t is the probability density of the time that
Therefore, the value of qi can be estimated by the historical
mobility traces of the vehicles. vehicle a arrives in subarea i and P hai ¼ s is the
123
probability density of the duration within which vehicle a A also indicates the event that the sensing node stays in
stays in subarea i. Equation (13) is to calculate the cumu- subarea i from time 0 to time t. Obviously, B = A.
lative efficient sensing data of type l that can be collected Therefore,
by node a in subarea i in one system duty cycle. If a arrives
Pfhi [ tjX ð0Þ ¼ ig
subarea i at time t, then the effective duration of the col-
lected sensing data in the current system duty cycle is tu t ¼ PfX ðuÞ ¼ i; 0 6 u 6 tjX ð0Þ ¼ ig
. Therefore, in Equation (13), the region of s is ð0; tu tÞ. ¼ PfBjX ð0Þ ¼ ig
Then, the utility of vehicle a can be calculated as follows: ¼ PfAjX ð0Þ ¼ ig
X
N ¼ lim PfAn jX ð0Þ ¼ ig
n!1
Ua ¼ uai ð14Þ

k
i¼1
¼ lim P X n t ¼ i; k ¼ 0; 1; 2; . . .; 2n jX ð0Þ ¼ i
n!1 2
n t o2n
4.2 Distribution of residence time ¼ lim Pii n
n!1 2
n t o
Theorem If hi denotes the residence time of a vehicle in ¼ lim exp 2n ln Pii n
n!1 2
subarea i, then hi obeys the exponential distribution with
ln 1 qi 2tn þ o 2tn
parameter ki , which is the average residence time of the ¼ lim exp ðqi tÞ
vehicle in subarea i.
n!1 qi t=2n
¼ expðqi tÞ
Pfhi 6 tjX ð0Þ ¼ ig ¼ 1 expðqi tÞ ð15Þ
ð21Þ
where qi ¼ ki .
Thus, the residence time of a vehicle in subarea i is subject
Proof If the sensing node arrives in subarea i at time 0, to the exponential distribution with parameter qi . The
that is, X ð0Þ ¼ i, and the residence time hi can be defined average residence time in the subarea is denoted as ki , and
as ki ¼ qi , which means that the parameter of the exponential
hi ¼ inf ft : t [ 0; X ðtÞ 6¼ X ð0Þ; X ð0Þ ¼ ig ð16Þ distribution can be estimated by the average residence time
of the sensing vehicles in the subarea.
then
Pðhi [ tjX ð0Þ ¼ iÞ ¼ P½X ðuÞ ¼ i; 0 6 u 6 tjX ð0Þ ¼ i 4.3 Distribution of arrival time
ð17Þ
The arrival time of a vehicle in subarea j under the con-
First, the uncountable events are translated into count- dition that the vehicle is in subarea i at the initial time (i.e.,
able events, as shown in Equation (18). t = 0) is denoted as nij and defined as
\
B ¼ f! : X ðuÞ ¼ i; 0 6 u 6 tg ¼ f! : X ðuÞ ¼ ig nij ¼ inf ft : t [ 0; X ðtÞ ¼ jjX ð0Þ ¼ ig ð22Þ
06u6t
ð18Þ where nij ¼ 0 if X ð0Þ ¼ j.

Fij ðtÞ is defined as the distribution function of nij and is
where B indicates that the sensing node stay in subarea i in expressed as
time duration [0, t]. Then, the interval [0, t] is uniformly
Fij ðtÞ ¼ P nij 6 t ; i; j 2 I; i 6¼ j; t > 0 ð23Þ
divided into 2n subintervals as follows:

If #1 ¼ inf ft : t [ 0; X ðtÞ 6¼ X ð0Þg denotes the moment
k n
An ¼ ! : X n t ¼ i; k ¼ 0; 1; 2; ; 2 that the vehicle leaves its initial state and
2
n
ð19Þ Gi ð xÞ ¼ Pð#1 6 xjX ð0Þ ¼ iÞ ¼ ð1 expðqi xÞÞ, then
\2
k Z t
¼ !:X nt ¼i X
k¼0
2 Fij ðtÞ ¼ q1
i q G
ij i ð t Þ þ q 1
i q ik Fkj ðt uÞdGi ðuÞ
k6¼i;k6¼j;k2I 0
An is the event that sensing node is in subarea i at time 2kn t,
ð24Þ
where k ¼ 0; 1; 2; . . .; 2n . Given that: Anþ1 An , it could
be denoted as follows: If #0 ¼ inf ft : t [ 0; X ð#1 þ tÞ ¼ jg and X 0 ðtÞ ¼ X ð#1 þ tÞ,
then # ¼ #1 þ #0 . Based on the total probability formula
\
1
A¼ An ¼ lim An ð20Þ and the strong Markov property, #1 and X ð#1 Þ are condi-
n!1
n¼1 tionally independent when X ð0Þ ¼ i; thus, Eq. (24) is true.
123
5 Selection of Heterogeneous Sensing Vehicles

wil ðtÞ wil ðtÞ þ cail ðtÞ wil ðtÞ cail ðtÞ
The total cost of the sensing system within a system duty
cycle is denoted as U, and the cost of the sensing vehicle a . Then, the temp utility function of sensing vehicle a is
is ua in a system duty cycle. If X denotes the set of the defined as Eq. (28).
( )
sensing vehicle candidates, then the total utility should be M X
X N Z tu
maximized under the constraint of the maximum number of Ua0 ¼ cail ðtÞ cail ðtÞ wil ðtÞ dt ua
l¼1 i¼1 t¼0
vehicles for sensing data collection.
X ð28Þ
Max Ua
a2S
X ð25Þ
s:t: ua 6 U
a2S
In Eq. (25), S denotes the selected sensing vehicles. The

target of the optimization problem is to select a set S of
vehicles from X, and the total cost of the selected sensing
vehicles in a system duty cycle is less than U. Furthermore,
the selected vehicles in S are able to collect the most
comprehensive tempo-spatial sensing data (i.e., the sum of
the utilities of the vehicles in S is maximized).
The optimization formula in Eq. (25) is a knapsack
problem. Therefore, a greedy algorithm denoted as Algo-
rithm 1 is designed to obtain the solution S in Eq. (25). The
goal of Algorithm 1 is to select a set of vehicles from the
candidates so as to collect as much sensing data as possible
in the sensing area within the budget at each system duty
cycle.
6 Evaluation
In our algorithm, after a sensing vehicle is selected, the
indicate matrix of its expected sensing data are added into a
Real vehicle GPS traces from the T-Drive trajectory [36,
temp indicate matrix of the sensing data in the sensing data
37] are utilized to simulate and evaluate the proposed HVS
center. Then, the utilities of the left sensing candidates are
strategy. The T-Drive trajectory data set contains one-week
updated according to the new indicate matrix of the sensing
trajectories of 10357 taxis. The total number of GPS points
data. It will circulate until the total budget of the system in
in this data set is approximately 15 million, and the total
the specific duty cycle is exhausted. The indicate matrix of
distance of the trajectories is 9 million kilometers. The time
the sensing data collected by a is denoted as
frequency of data sampling is set to 5 seconds. Taxi drivers
2 3
h1;1 ðtÞ h1;2 ðtÞ ... h1;M ðtÞ can usually find an optimal route to a destination based on
6 h2;1 ðtÞ h2;2 ðtÞ h2;M ðtÞ 7 their experience. Therefore, the T-Drive project attempts to
6 7
6
Ha ðtÞ ¼ 6 . 7 ð26Þ improve the efficiency of the navigation software accord-
.
.. 7
4 .. 5 ing to the experience of taxi drivers. The traces of taxis can
hN;1 hN;2 ðtÞ ... hN;M ðtÞ reflect the characters of passengers. As the mobilities of
passengers are not completely random, the mobilities of
where t 2 ½0; tu . The temp indicate matrix of the sensing
taxis are also predictable. For example, Huang et al. [38]
data in the sensing data center is denoted as WðtÞ, which is
designed a vehicle mobility model on the basis of the
expressed as
regular patterns derived from the traces of 4000 taxis in
2 3
w11 ðtÞ w12 ðtÞ ... w1M ðtÞ Shanghai.
6 w ðtÞ w22 ðtÞ ... w2M ðtÞ 7 In our simulations, a rectangular region around the
6 21 7
W ðt Þ ¼ 6
6 .. .. .. 7
7 ð27Þ Fourth Ring Road of Beijing is adopted as the target
4 . . . 5 sensing area (Fig. 3), whose latitude is between
wN1 ðtÞ wN2 ðtÞ ... wNM ðtÞ (39.840018 N, 39.993971 N), and longitude is between
(116.276206 E, 116.494243 E). The traces in the rect-
At the beginning, WðtÞ ¼ ZðtÞ. After the sensing vehicle a angular area are used in the simulations. Although the
is selected, WðtÞ WðtÞ þ Ha ðtÞ, where traces are not limited in the target sensing area, we abstract
123
Table 2 Simulation parameters

Parameter Default Value
Simulation time 533315 s

Number of selected vehicles 60
Number of vehicle candidates 1000
System duty cycle 1000 s
Sensor types 10
Sensing data life cycle 10 System duty cycles
Area partition 100 (10 10)
system budget constraint; specifically, RS1 considers

sensing data life cycle, whereas RS2 does not.
During the simulations, the collected sensing data are
recorded in a list, and each record is composed as follows:
½node name; time index; location; sensing data
Fig. 3 Target sensing area type and value( A, B, C, ...) ]
The sensing data coverage ratio is the ratio of the effective
the area out of the target sensing area as subarea 0 and the sensing data coverage to the product of the simulation
subareas in the target sensing area as 1; 2; 3; . . .. The 1000 period length and the number of subareas. As the relation
vehicles with the longest traces in the rectangular region between the system budget and the number of selected
are employed as sensing vehicle candidates, and the traces vehicles is linear, the number of selected vehicles is used in
are used to calculate the arrival and residence time distri- the following simulations instead of the system budget in
bution among the subareas. each system duty cycle.
The default parameters of the experiments are set as
follows. The default number of selected sensing vehicles is 6.1 Impact of the number of selected vehicles
60. The total time of the experiment is 533315 seconds on sensing data coverage ratio
(more than 6 days). The rectangular region is uniformly
divided into 100 (10 10) subareas. The system duty cycle In this subsection, the impact of the number of selected
is set to 1000 s. Ten types of sensors are employed to vehicles on the sensing data coverage ratio is evaluated.
collect data for the sensing system, and each vehicle has The number of selected vehicles bears the most significant
50 % possibility to be equipped with each sensor. The impact on the sensing data coverage ratio because it
default upper bound of each type of sensing data life cycle determines the amount of data that can be collected. Given
is 10 system duty cycles. The cost of the sensing system is the number of sensing vehicles that varies from 5 to 100 in
assumed to be directly proportional to the number of increments of 5 for a system duty cycle, we show the
selected vehicles. Then, the number of selected sensing sensing data coverage ratio results in Fig. 4. The collected
vehicles is used to denote the cost of the sensing system. data cannot match all the data requirements even if 100
The simulation parameters and their default values are available vehicles are selected because the traces are not
shown in Table 2. uniformly distributed in spatial and temporal spaces.
We compare the HVS algorithm with three other algo- Figure 4 shows the impact of the number of selected
rithms, as shown in Table 3. Dynamic participant selection vehicles on the coverage ratio of the collected sensing data.
(DPS) [28] is a multitask-oriented mobile crowd sensing The experiment is executed 10 times, and the results in
system. Although the objective of DPS is to collect sensing Fig. 4 show the average values from the 10 executions.
data for all sensing tasks instead of comprehensive tempo- Figure 4 shows that the coverage ratios of the collected
spatial sensing data, it proposes a greedy sensing node sensing data of the four algorithms increase with an
selection algorithm on the basis of utility calculation. DPS increase in the number of selected vehicles because many
does not consider sensing data life cycle when estimating vehicles can collect a large amount of sensing data.
the sensing capacity of a sensing node; instead, it uses a Among the four strategies, the HVS algorithm acquires
probability-based mobility model. Random selection (RS) the highest coverage ratio because it considers tempo-
algorithms randomly select sensing vehicles under the spatial sensing data coverage when selecting sensing
123
Table 3 Evaluated algorithms

Algorithm HVS DPS RS1 RS2
Node selection Greedy Greedy Random Random

Mobility model Time-continuous Probability based No No
Markov chain-based
Sensing data life cycle Yes No Yes No
vehicles and data coverage ratio of the four algorithms are

HVS
0.8
RS1 shown in Fig. 4, only the results of HVS and DPS are
DPS shown in Fig. 5 for clarity. In Fig. 5, the standard deviation
Sensing Data Coverage Ratio
RS2 of DPS increases from 0.015 to 0.17 with an increase in the

0.6
number of selected sensing vehicles from 5 to 100, whereas
the standard deviation fluctuation of the HVS algorithm is
0.4 small (i.e., from 0.02 to 0.09). Figure 5 also reveals that the
standard deviation of the HVS algorithm is lower than that
of DPS. The deviation of vehicle traces is high when
0.2
numerous vehicles are selected, and the high deviation of
the vehicle traces leads to a considerable deviation in the
0.0 coverage ratio of the collected sensing data. The fluctuation
of the coverage ratio of the collected sensing data is also
0 20 40 60 80 100
significantly reduced with the HVS algorithm because of
Number of Selected Vehicles its adoption of a time-continuous Markov chain-based
Fig. 4 Data coverage ratio under different numbers of selected
mobility model and consideration of the uniformity of the
vehicles temporal coverage of the different types of sensing data.
Thus, the sensing data collection of the HVS algorithm is
more sustainable and stable than those of the other
vehicles. Indeed, the life cycle of sensing data is usually algorithms.
much longer than the system duty cycle. Therefore, col-
lecting a certain type of sensing data in each system duty 6.2 Impact of sensing data life cycle on coverage
cycle is unnecessary. Sensing data are better collected just ratio
before they expire in the sensing data server. As a result,
the consideration of the temporal coverage of sensing data The life cycle of sensing data are usually longer than the
allows the HVS algorithm to acquire relatively high sens- system duty cycle. Therefore, collecting a certain type of
ing data coverage ratios. sensing data in each system duty cycle is unnecessary.
In calculating the coverage ratio, both the HVS algo-
rithm and RS1 consider the sensing data temporal cover-
age. As a result, their coverage ratios are higher than those 0.9 HVS
of DPS and RS2. Moreover, the coverage ratio of the HVS DPS
0.8
algorithm is higher than that of RS1 by up to 20 %. This
result may be ascribed to the fact that the HVS algorithm 0.7
selects vehicles according to their historical traces, sensing 0.6

interfaces, and the collected sensing data in the sensing
0.5
data center, whereas RS1 randomly selects vehicles. The
coverage ratio of the HVS algorithm is four times higher 0.4
than that of DPS because it considers the temporal sensing 0.3

data coverage when calculating the expected contributions 0.2
to the existing collected sensing data; furthermore, the
0.1
HVS algorithm predicts the location of sensing vehicles
using a time-continuous Markov chain-based mobility 0.0
0 20 40 60 80 100
model, which is more accurate than that employed in DPS.
Number of Selected Vehicles
Figure 5 shows the impact of the number of selected
vehicles on the standard deviation of the data coverage Fig. 5 Standard deviations of data coverage ratios for different
ratio. Since the relation between the number of selected numbers of selected vehicles
123
Collecting sensing data just before it expires is preferred to When the sensing area is fine-grained partitioned into
keep the sensing data effective along the temporal space. 100 subareas, as a consequence of the system budget lim-
An experiment is conducted in this study to evaluate the itation in each system duty cycle, the sensing data coverage
impact of the upper bound of the sensing data life cycle on ratio decreases from 89 to 72 % with an increase in the
the sensing data coverage ratio. The life cycle of each type partition fineness from (6, 6) to (10, 10). As the mobility
of sensing data is randomly chosen at an interval of 0 to the traces of the vehicles are random, the sensing data cover-
upper bound of the sensing data life cycle. age ratio is low when the number of divided subareas is
Owing to the data lifetime are not considered in DPS small. For the same reason, DPS and RS1 acquire the
and RS2, only the coverage ratio of HVS and RS1 are highest ratios when the area partition is (4, 4). If the
compared in this experiment. In the experiment, the upper number of the divided subareas is small to (2, 2), then the
bound of the sensing data life cycle changes from 2 to 20 in coverage ratio deviation between the HVS algorithm and
increments of 2. As shown in Fig. 6, the coverage ratios of RS1 is also small to 0.02. The reason is twofold. First, the
the HVS algorithm and RS1 increase with an increase of probability for one vehicle to travel into each subarea is
the upper bound of the sensing data life cycle because the sufficiently high, and the efficiencies of the mobility
long temporal coverage of sensing time leads to few models are reduced. Second, the requirements for sensing
requirements for sensing data collection. The coverage data collection are fewer than those when the sensing area
ratio of the HVS algorithm when the life cycle upper bound is fine-grained partitioned. However, when the number of
is 20 is 55 % higher than that when the upper bound of the subareas increases, the mobility model and the estimation
sensing data life cycle is 2 system duty cycles. As the HVS of future sensing capacity show highly significant effects.
algorithm selects sensing vehicles according to their utili-
ties, its coverage ratio is higher than that of RS1 by about 6.4 Cumulative amount of collected sensing data
15 % under the same upper bound of the sensing data.
In this subsection, the variation trend in the cumulative
6.3 Impact of the sensing area partition on coverage amount of the collected sensing data during the simulation
ratio is evaluated. The simulation time is divided into 30 sub-
time intervals. In each subtime interval, the effective
Another experiment is conducted to test the sensing data coverages of the sensing data collected by the vehicles are
coverage ratio for different sensing area partitions. The computed. Figure 8 shows that the HVS algorithm exhibits
default number of sensing vehicles is 60. The area partition a faster data collection speed than the other algorithms. At
is selected from fð2; 2Þ; ð4; 4Þ; ð6; 6Þ; ð8; 8Þ; ð10; 10Þg, as the end of our simulation, sensing data collected by HVS is
shown in Fig. 7. The HVS algorithm has the highest cov- 3 times more than that of DPS. As DSP and RS2 do not
erage ratio in each area partition. When the area partition is count the life cycles of the sensing data, the cumulative
(6, 6), the HVS algorithm obtains the highest sensing data amounts of the sensing data collected with DSP and RS2
coverage ratio (i.e., 89 %). are less than that of the HVS algorithm. Moreover, the
0.9
HVS 1.0
RS1 HVS
0.8
RS1
DPS
0.8
0.7
RS2
0.6
0.6
0.5
0.4 0.4
0.3
0.2
0.2
0 2 4 6 8 10 12 14 16 18 20 22 0.0
Upper Bound of Sensing Data Lifetime (2,2) (4,4) (6,6) (8,8) (10,10)
Sensing Area Partition
Fig. 6 Impact of the upper bound of the sensing data life cycle on
coverage ratio Fig. 7 Impact of sensing area partition on coverage ratio
123
1.480
Cumulative Amount of Collected Sensing Data
4.00E+009 HVS
RS1
3.50E+009 DPS 1.475
RS2
3.00E+009
1.470
Temporal Entropy
2.50E+009
1.465
2.00E+009
1.50E+009 1.460
1.00E+009
1.455
HVS
5.00E+008 RS1
1.450 DPS
0.00E+000
0 5 10 15 20 25 30 RS2
1.445
Time Interval
0 20 40 60 80 100
Fig. 8 Cumulative amount of collected sensing data Number of Selected Vehicles
Fig. 9 Temporal entropy at different numbers of selected vehicles
sensing data collection speeds of DPS and RS2 are much

slower than that of the HVS algorithm. mobilities and heterogeneities and consequently collect
various types of sensing data in all subareas.
6.5 Sustainability of data collection
6.6 Spatial uniformity of collected sensing data
In this subsection, the sustainability of the data collection
of each of the four algorithms is evaluated with the use of In this subsection, the impact of the number of selected
the temporal entropy of the collected sensing data. The sensing vehicles on the spatial entropy of the collected
temporal entropy serves as an indicator of the uniformity of sensing data is evaluated. The spatial entropy of the col-
the collected sensing data at different intervals during the lected sensing data reveals the spatial uniformity of the
simulation process. Therefore, the entropy can evaluate the collected sensing data. The spatial entropies are derived
sustainability of the sensing data collection. A large tem- from the probability distribution of the collected sensing
poral entropy equates to a highly sustainable sensing data data in all the partitioned subareas, as shown in Eq. (30).
collection algorithm. The method for calculating temporal X
N
entropy is shown in Eq. (29). ss ¼ pi log pi ð30Þ
i¼1
Xn
st ¼ pk log pk ð29Þ where
k¼1
data coverage in subarea i
where pi ¼
coverage of all collected data
effective data coverage in time interval k
pk ¼ Figure 10 shows the relationship between the spatial
total effective data coverage
entropy and the number of selected vehicles. As shown in
In Fig. 9, all temporal entropies increase with the number Fig. 10, the HVS amount of vehicles has the highest spatial
of selected vehicles. Thus, the HVS algorithm can acquire entropy. Thus, this algorithm is able to collect uniformly
sensing data sustainably. Furthermore, the HVS algorithm distributed sensing data with the same number of sensing
has the highest temporal entropy among the four algo- vehicles. Moreover, the advantage of the HVS algorithm is
rithms when few vehicles are selected to collect sensing highlighted when the number of selected vehicles is small
data. This result reveals that the HVS algorithm can collect for three reasons. First, the mobility model is able to pre-
sensing data sustainably with few sensing vehicles because dict the locations of sensing vehicles. Second, when cal-
it typically selects vehicles according to their utilities. This culating the utility of each sensing vehicle, both the
feature suggests the capacity of the algorithm to collect expected collected sensing data and the collected sensing
scarce sensing data. When numerous sensing vehicles are data in the sensing data center are considered. Therefore,
selected, the four algorithms can acquire uniformly dis- the selected sensing vehicles are more likely to collect
tributed sensing data because the number of vehicles is scarce sensing data; as a result, the spatial uniformity of the
sufficiently large to generate high deviations among collected sensing data is improved. Third, the life cycle of
123
sensing data is longer than the system duty cycle; conse- 1.4770
quently, the uniformity of sensing data is further enhanced. HVS

1.4768
RS1
Figure 11 reveals the relationship between the sensing
1.4766
area partition and the spatial entropy of the collected
sensing data. Obviously, the spatial uniformity of the col- 1.4764
Spatial Entropy
lected sensing data is high when several other subareas are 1.4762
partitioned. In Fig. 11, the spatial entropies of the four 1.4760
algorithms almost converge when the area partition is
1.4758
(2, 2) because when the number of subareas is small, the
influence of mobility and heterogeneity on sensing capacity 1.4756
is low. Then, the spatial entropy increases with an increase 1.4754
in the number of area partitions. The spatial entropy of the
1.4752
HVS algorithm is slightly higher than those of the other 0 2 4 6 8 10 12 14 16 18 20 22
algorithms when the area partition is (10, 10). Neverthe- Upper Bound of Sensing Data Lifetime
less, the variation tendencies of the four algorithms are
Fig. 12 Impact of the upper bound of the sensing data life cycle on
almost the same. spatial entropy
Figure 12 reveals the impact of the upper bound of the

2.0
sensing data life cycle on the spatial entropy of the col-
lected sensing data. Since DPS and RS2 do not select
participants according to the lifetime of sensing data, the
spatial entropies of HVS and RS1 are evaluated in this
Spatial Entropy
1.9 experiment. The spatial entropies of the HVS algorithm

and RS1 decrease with an increase in the upper bound of
the sensing data life cycle because the non-uniformity of
the life cycles of the different types of sensing data increase
HVS with an increase in the upper bound of the sensing data life
1.8 RS1
DPS
cycle. Then, the heterogeneity of the collected sensing data
RS2 will also increase. As a result, the spatial entropy will be
small because the upper bound of the sensing data life
0 20 40 60 80 100
cycle is high. As the HVS algorithm estimates the types
Number of Selected Sensing Vehicles
and effective duration of the sensing data that could be
Fig. 10 Spatial entropy at different numbers of selected vehicles collected by vehicles in the following system duty cycle,
the spatial entropy of the HVS algorithm is higher and
decreases slower than that of RS1. Therefore, the HVS
algorithm can mitigate the drawbacks arising from the
2.2
HVS
heterogeneity of the sensing data life cycle and spatial
2.0 distribution of the collected sensing data.
RS1
1.8
DPS
RS2
1.6
7 Conclusion
Spatial Entropy
1.4
1.2 The continuous collection of comprehensively covered

tempo-spatial sensing data with limited heterogeneous
1.0
sensing vehicles is a critical issue in vehicular mobile
0.8 sensing systems. In this work, the HVS algorithm for the
0.6
collection of comprehensive tempo-spatial sensing data is
proposed. First, on the basis of the expected spatial dis-
0.4
(2,2) (4,4) (6,6) (8,8) (10,10)
tribution of a vehicle and its sensing interfaces, a utility
Sensing Area Partition
function is established to estimate the ability of the vehicle
to collect sensing data in the next system duty cycle. Then,
Fig. 11 Impact of area partition on spatial entropy according to the utility function and system budget
123
restriction, the sensing vehicle selection problem is for- 15. Lee J-S, Hoh B Sell your experiences: a market mechanism based
mulated as a knapsack problem. Finally, a greedy optimal incentive for participatory sensing, In: IEEE international con-
ference on pervasive computing and communications (Per-
sensing vehicle selection algorithm is designed. Real trace- Com’10), pp 60–68
driven experiments show that the sensing data collected by 16. Jaimes LG, Vergara-Laurens I, Labrador MA A location-based
the HVS algorithm exhibit higher coverage ratio and more incentive mechanism for participatory sensing systems with
uniform tempo-spatial distribution than those collected by budget constraints. In: IEEE international conference on perva-
sive computing and communications (PerCom’12), pp 103–108
other mobile crowd sensing algorithms. 17. Ioannis B, Vana K Mobile stream sampling under time con-
straints, In: IEEE 14th international conference on mobile data
Acknowledgments This work was supported by the National Natural management (MDM’13), pp 227–236
Science Foundation of China (61572060, 61370197, 61170296 and 18. Liu B, Jiang Y, Sha F, Govindan R Cloud-enabled privacy-pre-
61190125), the R&D Program (2013BAH35F01) and key technolo- serving collaborative learning for mobile sensing, In: The 10th
gies R&D program project of Tangshan (15130251a). ACM conference on embedded network sensor aystems (Sen-
Sys’12), pp 57–70
19. Jayaraman PP, Perera C, Georgakopoulos D, Zaslavsky A (2013)
References Efficient opportunistic sensing using mobile collaborative plat-
form mosden, In: The 9th IEEE international conference on
collaborative computing: networking, applications and work-
1. Ganti RK, Ye F, Lei H (2011) Mobile crowdsensing: current state sharing, pp 1–10
and future challenges. IEEE Commun Mag 49:32–39 20. Sheng X, Tang J, Zhang W Energy-efficient collaborative sensing
2. Guo B, Yu Z, Zhou X, Zhang D From participatory sensing to with mobile phones, In: The 31rd annual IEEE international confer-
mobile crowd sensing. In: IEEE international conference on ence on computer communications (INFOCOM’12), pp 1916–1924
pervasive computing and communications workshops (PERCOM 21. Chon Y, Lane ND, Kim Y, Zhao F, Cha H, Understanding the
2014 Workshops), pp 593–598 coverage and scalability of place-centric crowd sensing, In:
3. Khan WZ, Xiang Y, Aalsalem MY, Arshad Q (2013) Mobile Proceedings of the 2013 ACM international joint conference on
phone sensing systems: a survey. IEEE Commun Surv Tutor pervasive and ubiquitous computing (UbiComp 2013), pp 3–12
15:402–427 22. Ho C-J, Vaughan JW Online task assignment in crowdsourcing
4. Devarakonda S, Sevusu P, Liu H, Liu R, Iftode L, Nath B (2013) markets. In: AAAI, vol 12, pp 45–51
Real-time air quality monitoring through mobile sensing in 23. Reddy S, Estrin D, Srivastava M (2010) Recruitment framework
metropolitan areas. In: Proceedings of the 2nd ACM SIGKDD for participatory sensing data collections. Pervasive Comput
international workshop on urban computing 2013, pp 1–8 138–155
5. Zhou P, Cheng Z, Li M Smart traffic monitoring with partici- 24. Tuncay G, Benincasa G, Helmy A Autonomous and distributed
patory sensing. In: Proceedings of the 11th ACM conference on recruitment and data collection framework for opportunistic
embedded networked sensor systems (SenSys’13), pp 1–2 sensing. In: Proceedings of Mobicom’12, pp 407–410
6. Banerjee R, Sinha A, Saha A (2013) Participatory sensing based 25. Ahmed A, Yasumoto K, Yamauchi Y, Ito M Distance and time
traffic condition monitoring using horn detection. In: Proceedings based node selection for probabilistic coverage in people-centric
of the 28th annual ACM symposium on applied computing, sensing. In: Proceedings of the 8th annual IEEE communications
pp 567–569 society conference on sensor, mesh and ad hoc communications
7. Perttunen M, Kostakos V, Riekki J, Ojala T (2015) Urban traffic and networks (SECON 2011), pp 134–142
analysis through multi-modal sensing. Pers Ubiquitous Comput 26. Hachem S, Pathak A, Issarny V Probabilistic registration for
19:709–721 large-scale mobile participatory sensing. In: Proceedings of the
8. Liu Y, Li X (2015) Heterogeneous participant recruitment for IEEE international conference on pervasive computing and
comprehensive vehicle sensing. Plos One 10:1–20 communications (PerCom 2013), pp 132–140
9. Dutta P, Aoki PM, Kumar N, Mainwaring A, Myers C, Willet W, 27. Zhang D, Xiong H, Wang L, Chen G Crowdrecruiter: selecting
Woodruff A Common sense: participatory urban sensing using a participants for piggyback crowdsensing under probabilistic
network of handheld air quality monitors. In: Proceedings of the coverage constraint. In: Proceedings of the ACM international
7th ACM conference on embedded networked sensor systems joint conference on pervasive and ubiquitous computing (Ubi-
(SenSys), pp 349–350 Comp 2014), pp 703–714
10. Zheng Y, Liu T, Wang Y, Zhu Y, Liu Y, Chang E Diagnosing 28. Song Z, Liu CH, Wu J, Ma J, Wang W (2014) Qoi-aware mul-
new york city’s noises with ubiquitous data. In: Proceedings of titask-oriented dynamic participant selection with budget con-
the ACM international joint conference on pervasive and ubiq- straints. IEEE Trans Veh Technol 63:4618–4632
uitous computing (UbiComp 2014), pp 715–725 29. Hull B, Bychkovsky V, Zhang Y, Chen K, Goraczko M, Miu A,
11. Pankratius V, Lind F, Coster A, Erickson P, Semeter J (2014) Shih E, Balakrishnan H, Madden S Cartel: a distributed mobile
Mobile crowd sensing in space weather monitoring: the mahali sensor computing system. In: The 4th ACM conference on
project. IEEE Commun Mag 52:22–28 embedded networked sensor systems (Sensys’06), pp 125–138
12. Zhou P, Zheng Y, Li M (2014) How long to wait? Predicting bus 30. Hu S-C, Wang Y-C, Huang C-Y, Tsengy Y-C, Kuoz L-C, Chen
arrival time with mobile phone based participatory sensing. IEEE C-Y Vehicular sensing system for co2 monitoring applications.
Trans Mob Comput 13:1228–1241 In: IEEE VTS Asia pacific wireless communications symposium
13. Kostakos V, Camacho T, Mantero C (2013) Towards proximity- (APWCS’09), pp 168–171
based passenger sensing on public transport buses. Pers Ubiqui- 31. Görmer S, Park S-B, Egbert P (2009) Vision-based rain sensing
tous Comput 17:1807–1816 with an in-vehicle camera. In: IEEE intelligent vehicles sympo-
14. Chon Y, Lane ND, Kim Y, Zhao F, Cha H, Understanding the sium, pp 279–284
coverage and scalability of place-centric crowd sensing. In: The 32. Hyyppä J, Jaakkola A, Hyyppä H (2009) Map updating and
ACM international joint conference on pervasive and ubiquitous change detection using vehicle-based laser scanning. In: The
computing (UbiComp’13), pp 3–12 IEEE joint urban remote sensing event, pp 1–6
123
33. Cortez RA, Luna J-M, Fierro R, Wood J Multi-vehicle testbed for international conference on knowledge discovery and data mining
decentralized environmental sensing. In: The IEEE international (KDD’11), pp 316–324
conference on robotics and automation (ICRA’10), pp 3052–3058 37. Yuan J, Zheng Y, Zhang C, Xie W, Xie X, Sun G, Huang Y
34. Zhu Y, Li Z, Zhu H, Li M, Zhang Q (2013) A compressive T-drive: driving directions based on taxi trajectories. In: Pro-
sensing approach to urban traffic estimation with probe vehicles. ceedings of the 18th SIGSPATIAL international conference on
IEEE Trans Mob Comput 12:2289–2302 advances in geographic information aystems (GIS’10), pp 99–108
35. Jang J, Baik N (2012) Diagnosing hazardous road surface con- 38. Huang H, Zhang D, Zhu Y, Li M, Wu M-Y (2012) A
ditions through probe vehicle as mobile sensing platform. In: The metropolitan taxi mobility model from real gps traces. J Univ
international conference on winter maintenance and surface Comput Sci 18:1072–1092
transportation weather, pp 54–60
36. Yuan J, Zheng Y, Xie X, Sun G Driving with knowledge from the
physical world. In: Proceedings of the 17th ACM SIGKDD
123

4633 - Comprehensive Tempo-Spatial Data Collection in Crowd Sensing Using A Heterogeneous Sensing Vehicle Selection Method PDF

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

4633 - Comprehensive Tempo-Spatial Data Collection in Crowd Sensing Using A Heterogeneous Sensing Vehicle Selection Method PDF

Uploaded by

Copyright:

Available Formats

Pers Ubiquit Comput (2016) 20:397–411

Comprehensive tempo-spatial data collection in crowd sensing

sensing nodes for data collection on the basis of their

Table 1 List of notations

I The set of subareas

Target Sensing Area Coll ected Sens ing Data

Historical mobility trace Recruited vehicles Vibration sensor Brightness sensor

Predicted mobility trace PM2.5 sensor Temperature sensor

Fig. 2 Data collection by heterogeneous vehicles

A time-continuous life cycle of sensing data is considered

Pij ðDtÞ Thus, for any nai and hai , we have

ð18Þ where nij ¼ 0 if X ð0Þ ¼ j.

5 Selection of Heterogeneous Sensing Vehicles

In Eq. (25), S denotes the selected sensing vehicles. The

Table 2 Simulation parameters

Simulation time 533315 s

system budget constraint; specifically, RS1 considers

Table 3 Evaluated algorithms

Node selection Greedy Greedy Random Random

vehicles and data coverage ratio of the four algorithms are

RS2 of DPS increases from 0.015 to 0.17 with an increase in the

selects vehicles according to their historical traces, sensing 0.6

than that of DPS because it considers the temporal sensing 0.3

Fig. 8 Cumulative amount of collected sensing data Number of Selected Vehicles

Fig. 9 Temporal entropy at different numbers of selected vehicles

sensing data collection speeds of DPS and RS2 are much

quently, the uniformity of sensing data is further enhanced. HVS

Figure 12 reveals the impact of the upper bound of the

1.9 experiment. The spatial entropies of the HVS algorithm

1.2 The continuous collection of comprehensively covered

You might also like