You are on page 1of 8

Vehicle Delay-tolerant Network Routing Algorithm

based on Multi-period Bayesian Network


Hongyu Zhou∗† , Jiagao Wu∗† , Lu Shen∗† , Linfeng Liu ∗†
∗ School
of Computer, Nanjing University of Posts and Telecommunications, Nanjing, Jiangsu 210023, China
† JiangsuKey Laboratory of Big Data Security & Intelligent Processing, Nanjing, Jiangsu 210023, China
Email: zhouhy1010@foxmail.com, jgwu@njupt.edu.cn, shenlu email@163.com, liulf@njupt.edu.cn

Abstract—Delay-tolerant networks (DTNs) are wireless mobile the decision tree and naive Bayesian classifier respectively to
networks where constant end-to-end connections may not exist divide nodes into different categories. However, ANN requires
among nodes. In real-life vehicle DTNs, most nodes have repetitive a lot of data to be repeatedly trained, and both the decision tree
movement patterns. However, due to the change of time and
different activity scenarios, the movement patterns cannot be and the ANN are prone to the over-fit problem. The Bayesian
described consistently with a single model. Considering this classifier is easy to be trained and suitable for incremental
issue, the Multi-period Bayesian Network (MBN) is proposed learning. However, the naive Bayesian classifier is based on
to build multiple prediction models, which intends to predict the assumption of attribute independence. To be more precise, it
the regular movement patterns of nodes in the real world. The ignores the potential relationships among attributes. Moreover,
Bayesian network model is constructed by using several network
parameters (e.g. spatial and temporal information at the time these algorithms failed to take into account the fact that there
of message forwarding) to describe the movement patterns of are differences among the mobility patterns presented by nodes
DTN nodes. Additionally, a novel classification method called at different periods of time. According to [11], we can obtain
Dynamic Multiple-Level Classification (DMLC), is proposed where that the metric of contacts per hour of nodes presents period-
nodes are classified into multiple levels according to the dynamic ical fluctuations in real public transport network environment.
parameters. Followed by that, a routing algorithm based on MBN
is presented, which can make routing decisions based on the Considering these two problems, we propose a multi-period
classification results of DMLC. The simulation results show that Bayesian network routing algorithm where multiple Bayesian
MBN algorithm and DMLC method can improve the delivery network models of different periods are constructed from the
ratio with a minor forwarding overhead. historical data. In addition, in order to capture the dynamic
I. I NTRODUCTION characteristics of the network, we define delivery bonus as an
attribute of each node, which is updated dynamically according
Delay tolerant networks (DTNs) are wireless mobile net-
to the time, encounter frequency and message delivery results.
works, where constant end-to-end connections between source
The delivery bonus has a similar refresh mechanism with the
nodes and destination nodes may not exist at any given time [1].
pheromone of ant colony algorithm [12], which is featured as
Due to intermittent contact and disconnections, the nodes adopt
positive feedback and self-adapting optimization. Based on that,
the store-carry-and-forward mechanism to deliver messages.
nodes are classified into multiple levels which will be used as
Such networks have been applied in many fields such as the In-
the criteria for routing decisions.
terPlaNetary Internet, vehicular ad-hoc networks, opportunistic
mobile social networks and wireless sensor networks [2]–[4]. The main contributions of this paper are as follows:
Vehicular Delay Tolerant Network (VDTN) is a new type
of DTNs based on the vehicular communications [5], where a
vehicle with a mobile router is used as a carrier to carry the 1) Multi-period Bayesian network is proposed to build sev-
data from traffic sources to destinations. In earlier years, some eral prediction models, which intends to predict the node
researchers who took the historical information of network regular movement patterns in the real world.
characteristics into consideration and proposed a number of 2) A new classification method called Dynamic Multiple-
classic algorithms like PRoPHET [6] and Maxprop [7], etc. Level Classification (DMLC), is proposed where nodes
In VDTNs, nodes usually have repetitive movement patterns. are classified into different levels according to their
For example, public transport buses follow the fixed routes and dynamic updating delivery bonus.
schedules. Due to this reason, many routing algorithms based 3) A multi-period Bayesian network routing algorithm is
on the methods of machine learning, such as Artificial Neural presented, which can make routing decisions based on
Networks (ANN) [8], decision tree [9] and naive Bayesian the classification results obtained from MBN models. The
classifier [10], were proposed. In [8], Segundo et al. presented simulation results show that the combinational use of
an approach to predict the next node and the moment of MBN algorithm and DMLC outperforms other algorithms
contact based on ANN. In [9] and [10], the authors applied by improving the delivery ratio with a minor forwarding
978-1-5386-6808-5/18/$31.00 ⃝2018
c IEEE overhead.
II. R ELATED W ORK ment patterns of nodes and use historic datasets to train the
multi-period Bayesian network models. Besides, we propose
In order to overcome the mentioned challenges in DTNs, the DMLC method to optimize the classification of nodes
several routing protocols have been proposed such as Epidemic through dynamic parameters. Combined with the Bayesian
[13], FirstContact [14] and so on. In Epidemic, nodes forward network model, this method can improve the delivery ratio and
the copies of messages as many as possible, which maximizes reduce the network overhead.
the delivery ratio and reduce the message delay. However,
massive network resources are wasted by delivering the redun- III. M ULTI - PERIOD BAYESIAN N ETWORK ROUTING
dant copies of messages. In contrast, FirstContact is a single- A LGORITHM
copy protocol where nodes only forward the message copies
to the first encounter node. Although reducing the excrescent The Bayesian network can obtain an overall representation
message copies and saving the buffer space, the single-copy of the joint probability distribution which combines both graph
routing scheme limits the message success delivery. Besides, and probability theories [16]. It uses the Directed Acyclic
researchers have proposed several routing protocols based on Graph (DAG) and Conditional Probability Table (CPT), which
prior probability. Lindgren et al. defined a transmission pre- describes the dependencies among attributes of nodes and the
diction value according to the number of nodes encounter to joint probability distributions of attributes, respectively. The
measure the probability of successful delivery in PRoPHET [6]. Bayesian network contains a set of variables, a set of probabil-
Burgess et al. calculated the shortest path of nodes movement ities and a graphical structure that connects the variables with a
based on prior probabilities of encounter history information probability on each arc. In this paper, the Bayesian network is
between nodes in MaxProp [7]. However, those protocols applied to improve the routing decisions of nodes and therefore
mentioned above only consider prior information about the overall network performance.
network characteristics. In this paper, we apply the posterior The rest of the section consists of two parts, we first
probability model to the node classification which takes more discuss the multi-period Bayesian network, and then the routing
information into consideration. This means that when routing algorithm is described in details.
decisions are made, the algorithm can select the next hop node
of the message copy more accurately [10]. A. Multi-period Bayesian network
A number of machine learning algorithms have been applied
In most VDTNs, the mobility patterns of the nodes generally
to DTNs. In [8], Segundo et al. presented proposed an ANN
exhibit some level of time periodicity. Frans Ekman et al. [11]
model to predict the next node and the moment of contact.
presented a new movement model, called Working Day Move-
However, the initialized weights and threshold of ANN is diffi-
ment Model, which shows the patterns of people’s schedules
cult to determine, requiring repeated training to determine the
such as traveling by car or bus. In [11], the metric of contacts
network structure and various parameters. In [9], Portugalpoma
per hour is used to measure the activity of nodes at the different
et al. applied the decision tree to reducing network overhead by
time in a day. This metric presents a regular distribution of one-
discriminating the worst intermediate nodes. However, decision
day cycle no matter the trace data is based on synthetic datasets
tree is easy to cause over-fitting problems and can be unstable
[11] or real trace datasets [17]. To be more specific, during the
because small variations in the data might result in a completely
rush hours in the morning and evening, the number of contacts
different tree being generated. Bayesian classifier has stable
between nodes increased continuously, while in other periods,
classification efficiency and is suitable for incremental learning.
it declined significantly. This means that a single model is
Therefore it plays the important role in the area of machine
hard to describe the mobility patterns of the nodes for different
learning, which has been widely used in text classification and
periods. Therefore, it is more reasonable to describe the patterns
weather forecasting and achieved good performance. Shabbir
of nodes by a multi-period model. Therefore, we propose the
Ahmed et al. in [10] introduced a naive Bayesian classifier to
Multi-period Bayesian Network (MBN) to capture the periodic
DTNs and obtained good results. Naive Bayesian is based on
behavior of nodes in VDTNs. As shown in Fig. 1, the time
the assumption of attributes independence. However, it does
in a day is divided into n periods, and we construct multiple
not take into account the dependency among attributes and
models based on the datasets of these different periods. Due
such dependency certainly exists in the real environment. For
to the fact that the MBN takes the multi-period characteristic
instance, scheduled buses usually arrive at the same place at
into consideration, the network performance can be further
the similar time according to their timetable. Jiao Jiao Liang
improved.
et al. in [15] applied the Bayesian network model to choose
an appropriate routing protocol based on different application
requirements. In spite of taking account of the dependencies B. Routing algorithm framework
among the attributes, this method can only be applied in As shown in Fig. 2, our routing algorithm framework in-
selecting routing algorithms and does not take the data forward volves five phases: attribute selection, node classification, struc-
strategy into consideration. ture learning, inference and forwarding strategy. Then, these
In this paper, we focus on the periodical and dynamic move- phases will be discussed in detail in the following subsections.
Period 1 Period 2 … Period n
Time Attribute Selection
Select the appropriate
node attributes

Classification
Classify the nodes

Fig. 1. The diagram of multi-period Bayesian network


Structure Learning
1) Attribute selection: The state of a node can be described Construct the Bayesian
network structures
by a number of attributes which are closely related to the
message delivery. The first step of applying the Bayesian
network to DTN is to select the appropriate attributes of nodes Inference and Forwarding Strategy
and discrete their values. We select a set of attributes as Provide a theoretical basis
X =< x1 , x2 , x3 , x4 >, where for routing decisions
• x1 = Area Id < A1 , A2 , ..., Am1 >, signifying the location
of nodes at the time of forwarding the message. The entire Fig. 2. Routing algorithm framework
physical domain of the network is divided into m1 squares
of equal size. the pheromone of ant colony algorithm, which is featured as
• x2 = Time Slot < T1 , T2 , ..., Tm2 >, identifying the time positive feedback and self-adapting optimization. It has the
slot when the message copy is forwarded. The message property of volatilization and is updated dynamically according
forwarding behavior changes with time since there are to time, encounter frequency and message delivery. The update
different traffic patterns at different hours of a day. In our rule of φni ,nd (i) can be illustrated by Eq. (1).
scenario, the granularity of the time slots is chosen to be
∑v
φr
30min. Therefore, 48 time slots are used to distinguish the φni ,nd (i + 1) = (1 − ρ) φni ,nd (i) + c (i) ∆φc +
different periods. hk (i)
k=1
• x3 = Average Inter-contact Interval < I1 , I2 , ..., Im3 >, (1)
indicating the average interval of the encounter between where ρ denotes the volatility coefficient and ρ ∈ [0, 1],
nodes. The interval is discretized by a certain number (e.g. c(i) indicates the number of encountering with other nodes
5s) and this attribute can reflect the current activity of the at time slot Ti , v represents the number of messages that
node. are successfully delivered, hk (i) denotes the number of nodes
• x4 = Route Id < R1 , R2 , ..., Rm4 >, denoting the routes in each delivery path at time slot Ti and the initial delivery
of vehicle operating. It is a common situation that vehicles bonus value of all nodes is the same, i.e. φni ,nd (0) = 100. The
follow the fixed commuter routes, e.g. bus routes. Different detailed update rule is as follows: φni ,nd (i) decreases with time
vehicles may travel along the same route, so they may have slot in a ratio of ρ and increases by ∆φc if the node carrying
the same route id. the message encounters another node. In addition, when the
2) Classification: The Bayesian classifier can classify an message reaches the destination node at time slot Ti , a fixed
unknown sample based on its own attributes value and the amount of delivery bonus (φr ) is allocated to the delivery path.
information of known samples. In [10], nodes are classified into With hk (i) nodes on the path, the added delivery bonus for each
two categories based on their historic delivery results, which node is φr /hk (i).
are successful delivery and unsuccessful delivery. In this paper, In fact, in order to dynamically allocate φr in the absence
inspired by the idea of ant colony algorithm, we propose a new of an end-to-end path, the destination node will generate and
classification method, Dynamic Multiple-Level Classification forward a small ACK message with the flooding mechanism
(DMLC), which classify nodes into multiple levels according when they receive the message. The other nodes will update
to the dynamic updating parameter. their delivery bonus values according to the unique identifier
In ant colony algorithm, pheromone is a kind of chem- of the arrived messages after receiving the ACK message. If
icals that ants can smell. When walking, ants deposit the node that is not included in the delivery path, it will not be
pheromone, which marks the passing route. The concentration updated.
of pheromone decreases as time changes due to diffusion effects For example, let ∆φc = 20, ρ = 0.2 and φr = 600. As
[18]. Similarly, in our approach, Delivery Bonus φni ,nd (i) shown in Fig. 3(a), the delivery bonus of node n2 decreases by
is defined as a dynamic evolution criterion of node ni that 20% at the end of time slot T1 . As shown in Fig. 3(b), node
measures the ability to deliver messages to the destination node n1 encounters node n2 and transmits the message to n2 at time
nd at time slot Ti . slot T2 , then the delivery bonus value of node n2 increases by
The delivery bonus has the similar updating mechanism with ∆φc = 20. As shown in Fig. 3(c), the destination node n5 has
Period 1 Period 2 Period 3
Time slot:T1 Time slot:T2
-20 (7am - 10am) (10am - 1pm) (1pm - 5pm) Time
+20

n2 n1 n2 L L L

r =0.2 jc =20 x1 x2 x1 x2 x1 x2

(a) (b) x4 x4 x3 x4

Time slot:T3 x3 x3
+200 +200 +200
Ř
n1 n2 n3 Fig. 4. The Bayesian Network Structures of 3-period

+300 variables is. The mutual information is defined as


ř ∑ p (xi , xj )
I(xi , xj ) = p (xi , xj ) log (2)
n4 n5 x ,x
p (xi ) p (xj )
i j

+300
jr =600 where p(xi ) and p(xj ) represent the probability density distri-
bution of attribute xi and attribute xj , and p(xi , xj ) represents
(c) the joint probability density distribution of node attribute xi
Fig. 3. The progress of generating delivery bonus and xj .
The mutual information can effectively describe the depen-
received two messages from node n1 at time slot T3 , where two dencies among the node attributes. We perform statistics on the
messages are delivered through the path 1: n1 → n2 → n3 → datasets generated during the training phase and then determine
n5 and path 2: n1 → n4 → n5 , respectively. Thus, the delivery the dependencies among attributes according to Eq. (2). The
bonus of nodes n1 , n2 and n3 increase by φr /3 = 200, while process of learning Bayesian network structure using MI is as
the values of nodes n1 and n4 will both increase by φr /2 = 300 follows:
when they receive the ACK messages. • Calculate the mutual information value of each pair of
From the above, it can be concluded that the more active the attributes (xi , xj ) in the attributes set. If I(xi , xj ) is larger
node is, the larger the delivery bonus it can obtain. Likewise, than a threshold e1 , the edge < xi , xj > is added and e1
the shorter the path of the successfully delivered message is, usually take a very small positive number. In this way, an
the more the delivery bonus the nodes can obtain. undirected graph is built up.
In order to apply the delivery bonus into the classifier model, • Judge the direction of the side according to the mutual

we divide the value range of the delivery bonus into ml information between the attributes and the class attributes.
intervals and each of them stands for a different level. Based When the dependency between xi and l is stronger than
on this method, the nodes can be divided into <1, 2, 3, ..., ml > that between xj and l, which is denoted as I(xi , l) >
levels. These levels are obtained based on the delivery bonus, I(xj , l), xi is more likely to be the parent attribute of xj .
so we name them the Delivery Level (l). The higher the level Thus, we can get the direction of edge < xi , xj > which
is, the greater the probability of the node succeeds in message is from xi to xj , otherwise from xj to xi .
delivery is. Therefore, the dynamic delivery bonus is utilized Based on the above steps, we can obtain the Bayesian
as the classification criteria in DMLC which can help nodes network structures of various periods. In our scenario, as the
make better routing decisions. Fig. 4 shows, we focus on a 10-hour period from 7 am -
3) Structure learning: In general, there are two methods in 5 pm though we have mobility traces for the entire 24-hour
Bayesian network structure learning. The one is scoring search duration of a weekday. The 10-hour period is divided into 3
learning method, which constructs the Bayesian network struc- periods according to the metric of contacts per hour in which
ture according to certain search strategies and scoring criteria. the Bayesian network structure is constructed corresponding to
The other is the learning algorithm based on dependency, where each period.
statistical or information theories are used to quantitatively After obtaining the network structure, we can estimate the
analyze the dependencies between variables and obtain the best conditional probability table of each attribute by counting the
network structure. Mutual information (MI) [19] is one type corresponding data of the datasets.
of the second method mentioned above and in our approach, 4) Inference: A trained Bayesian network can be used
it is applied to constructing Bayesian network structures. The to estimate the value of a certain attribute(category) by the
concept of mutual information is derived from the information observational values of the other attributes. In this paper, we
theory. It is considered as a measure of the statistical correlation can calculate the probability of the sample belonging to a
of two random variables, in which the greater the value of certain level according to the Bayesian network model. Let
mutual information is, the stronger the correlation between the Xi = < x1i , x2i , ..., xsi > denote an instance of the current
attribute set. Utilizing Bayes theorem and Bayesian network the relay node. Moreover, if there are several encounter nodes,
classifier, the posterior probability P (lq |Xi ) can be calculated the current node will choose the one with the highest level as
from prior probability P (lq ) as the next hop node to deliver the message, and the relay node
continues to forward the message to the destination node or the
P (lq ) ∏
s
P (lq |Xi )= P (xji |πji ) (3) other relay node with a higher level and a greater probability.
P (Xi ) j=1 For example, suppose that node n1 encounters node n2 in the
where πji is the parent attributes of xji in the Bayesian network area of A1 . Node n1 carries a message of m4 whose destination
and s is the number of attributes. is node n3 . The current attributes set of node n1 is X1 = <
P (Xi ) is the probability of any node whose attribute set is A1 , T5 , I1 , R1 > and the attributes set of node n2 is X2 = <
Xi . For a given sample, the value of P (Xi ) is the same for all A1 , T5 , I2 , R3 >. Historic datasets of node n1 and node n2 are
class memberships. Thus, we can obtain stored in Table I and Table II respectively where M id records
the id of the forwarded message, Dest records the destination

s
node of the message, x1 , x2 , x3 , x4 record the value of each
P (lq |Xi ) ∝ P (lq ) P (xji |πji ) (4)
attribute when the message is forwarded and Level is the node’s
j=1
classification result. Suppose the Bayesian network stucture of
Then, the level of node ni can be obtained according to Eq. current period is P eriod 1 in Fig. 4. According to Eqs. (6)-
(4) (8) and table II, we can obtain that the probability of node n1
  belonging to lq =1:
 ∏
s 
lni = arg max P (lq ) P (xji |πji ) (5)
lq ∈(1,...,ml )   Pn1 (lq = 1) = P (x1 = A1 |lq = 1)P (x2 = T5 |lq = 1)
j=1
P (x4 = R1 |x1 = A1 )P (x3 = I1 |x4 = R1 )
For example, suppose the Bayesian network stucture is
shown as P eriod 1 in Fig. 4. Then, we can get the delivery P (lq = 1)
level of node ni according to Fig. 4 and Eq. (5) 2 1 1 1
= × × × 0.01 ×
3 3 3 3
lni = arg max {P (lq ) P (x1 |lq )P (x2 |lq )
lq ∈(1,...,ml ) ≈ 2.47 × 10−4
(6)
P (x4 |x1 )P (x3 |x4 )}
Similarly, the probabilities of lq = 2, lq = 3 and lq = 4 are
Let Slq denote a sample set belonging to level lq in the respectively 3.3 × 10−9 , 10−10 and 10−10 . Thus the level of
historic datasets, Slq ,xji denote a sample set whose correspond- node n1 is ln1 = 1. In the same way, we can get that the level
ing attribute is equal to xji in Slq , and Slq ,xji ,πji denote a of node n2 is ln2 = 2, and the probability is 0.0165. Hence,
sample set whose corresponding parent attribute is equal to πji node n1 should forward the message to node n2 . Then, node
in Slq ,xji , then we can obtain n2 continues to forward the message to the destination node or

Slq ,xji the other relay nodes.
P (lq ) = (7) Actually, we can pre-calculate the distribution probability of
S lq
all attributes in order to reduce the nodes’ requirements for
Slq ,xji ,πji computing energy.
P (xji |πji ) = (8)
Slq ,xji
TABLE I. The dataset of node n1
where Slq , Slq ,xji and Slq ,xji ,πji can be obtained from histori-
cal data sets. It is noteworthy that if the size of any sample set Mid Dest
Area id Time slot Interval Route id Level
is zero, then the corresponding probability will also be zero, (x1 ) (x2 ) (x3 ) (x4 ) (l)
m1 n3 A3 T1 I1 R1 1
eliminating the effect of other attributes. In order to avoid this m2 n3 A2 T2 I1 R1 2
problem, we assume a low value (e.g. 0.01) for P (xji |πji ) m3 n3 A1 T5 I2 R3 1
whenever its value becomes zero. m4 n5 A3 T1 I3 R2 3
5) Forwarding strategy: The forwarding strategy consists
of two policies. The first one that when the node carrying
the message encounters the destination node, the message is TABLE II. The dataset of node n2
delivered to it.
The second policy involved is to classify the current and Area id Time slot Interval Route id Level
Mid Dest
(x1 ) (x2 ) (x3 ) (x4 ) (l)
the encounter nodes into corresponding levels. If the delivery m1 n3 A3 T1 I2 R3 2
level of the encountered node is larger than that of the current m2 n3 A2 T2 I1 R1 1
node, the message will be transmitted from the current node m3 n3 A1 T5 I2 R3 2
m4 n7 A3 T1 I3 R2 1
to the encountered node. If two levels are the same, the node
having the greater probability of current level will be chosen as
IV. S IMULATION -BASED E VALUATIONS 1) Impact of multi-period: According to the multi-period
characteristic of the nodes movement patterns, we construct the
A. Details of Mobility Traces 1-period, 3-period (7am -10am, 10am - 1pm and 1pm - 5pm)
In this paper, we use the simulation software the ONE (The and 6-period (7am - 8:30am, 8:30am - 10am, 10am - 11:30am,
Opportunistic Network Environment simulator) [20] to simu- 11:30am - 1pm, 1pm - 3:30pm and 3:30pm - 5pm) Bayesian
late the proposed algorithm. The data used in the simulation network models according to different hours of the day. Fig.
experiment is derived from the real bus running track recorded 5 shows the performance of three types of Bayesian network
by Seattle Public Transportation System in the King County, models in experiments. In Fig. 5(a), we can see that, compared
Seattle, USA. The data is obtained from CRAWDAD [21], to the 1-period model, a larger delivery ratio can be achieved
which contains a movement trajectory covering 5100 square by the 3-period model which is attributed to the fact that the
kilometers and involving 236 buses. movement pattern reflected in 3-model is more suitable for the
The whole area is divided into different small areas by the node’s real mobile environment. However, too many periods
size of 1km × 1km and then these areas are identified by (e.g. 6-period model) lead to insufficient data for training each
different area id. Theoretically, the smaller the partition is, model and are not conducive to learning the patterns from the
the better the simulation results are expected to be. However, historical datasets.
in practice, the amount of data to be processed will increase Moreover, the overhead ratio of the 3-period model is sig-
largely in that way. Similarly, the time slot is divided by every nificantly lower than those of the 1-period model and 6-period
30 minutes and the average contact interval is discretized in 5 model as illustrated in Fig. 5(b). The reason is that the 3-period
seconds. In order to simplify the model, the nodes are divided model can make better routing decisions and reduce the waste
into four levels at intervals of 200 in the delivery bonus. of resources in message forwarding. In Fig. 5(c), the delay of
the 1-period model is the highest because it uses the largest
datasets in the routing decision process.
B. Simulation Parameters
2) Impact of delivery bonus volatility coefficient: In the
Although the original datasets provide complete 24-hour DMLC method, the delivery level of nodes is divided according
mobile data, we focus on a 10 hour period from 7 am - 5 pm to the dynamic update delivery bonus in which the volatility
in order to make the experimental environment closer to the coefficient is ρ. In order to verify the effect of ρ, we compare
real environment. In addition, the data used in the simulation the MBN algorithms with different ρ in this experiment. In
experiment is narrowed down to an area of 38 km × 65 km, Fig. 6(a), it can be observed that MBN with ρ = 0.25 has the
which includes the central business district of this city. The best performance in delivery ratio because the delivery bonus
experiment involves a total of 34 nodes with enough trajectory is retained mostly and updated dynamically according to the
data and the data message with an interval of 100s and the size nodes state at the same time. When ρ is set as 0 and 0.5, we
of 1KB. can obtain the similar performance. However, when ρ is too big
(such as 0.75), the delivery bonus cannot be effectively reserved
C. Gathering Historical Datasets after updating. As a result, the delivery ratio of MBN with ρ
= 0.75 decreases significantly. In Fig. 6(b), when ρ is 0.75, the
Before conducting this simulation experiment, we have to
overhead ratio increases because only a small portion of the
first train the historical datasets. As a result, the data of 8 days
delivery bonus is retained. In contrast, the other three perform
working days are selected. More specifically, one day’s data is
relatively better in network overhead. In Fig. 6(c), we can
randomly selected as the test set and the rest data is used as the
obtain that MBN algorithm in these four cases exhibits similar
training set. The historic datasets of each node is an empty table
properties on account of the same routing decision process.
by default. In the process of generating the historic datasets, the
nodes adopt the flooding mechanism to deliver the messages, 3) Comparisons of other algorithms: The results of com-
and every time the nodes encounter, the value of the message id, paring the delivery ratio, overhead ratio and average delay are
the destination node, the current area id, time slot, route id and respectively illustrated in Fig. 7(a), Fig. 7(b) and Fig. 7(c).
average inter-contact interval will be recorded into the table. Here, we adopt the 3-period model and ρ = 0.25 in MBN. The
After the experiment is finished, the node level is confirmed results in Fig. 7(a) and Fig. 7(b) indicates that the overhead ratio
according to the delivery bonus value and the datasets are also of Epidemic is the largest, although it maintains a pretty high
stored locally. delivery ratio. The reason is that Epidemic adopts a flooding
mechanism to forward the data messages, and this mechanism
yields a larger delivery ratio, a shorter delivery delay and a
D. Simulation results
larger forwarding overhead. Note that the delivery ratio and
In this section, MBN is evaluated by observing the per- overhead ratio of MBN are better than those of PRoPHET
formance variations about different parameters and by com- and Bayesian, since PRoPHET relies on prior probabilities
paring with other algorithms (naive Bayesian, Epidemic and and cannot effectively utilize more network parameters and
PRoPHET with single copy). Bayesian ignores the potential relationships among attributes.
0.8 15 5500

MBN with 6−period 5000


0.7
MBN with 3−period 4500
MBN with 1−period
0.6 4000

Overhead Ratio(%)
10

Average Delay(s)
Delivery Ratio(%)

0.5 3500
3000
0.4
2500
0.3 2000
5
0.2 1500
MBN with 6−period MBN with 6−period
MBN with 3−period 1000 MBN with 3−period
0.1 MBN with 1−period
MBN with 1−period 500
0 0 0
0 0.5 1 1.5 2 2.5 3 3.5 0 0.5 1 1.5 2 2.5 3 3.5 0.5 1 1.5 2 2.5 3 3.5
Time(s) x 10
4 Time(s) x 10
4 Time(s) x 10
4

(a) Delivery Ratio (b) Overhead Ratio (c) Average Delay

Fig. 5. Impact of multi-period

0.8 15 5500

MBN with r=0.75 5000


0.7
MBN with r=0.5 4500
0.6 MBN with r=0.25
4000
MBN with r=0
Overhead Ratio(%)

10

Average Delay(s)
Delivery Ratio(%)

0.5 3500
3000
0.4
2500
0.3 2000
5 MBN with r=0.75
MBN with r=0.75
0.2 1500
MBN with r=0.5 MBN with r=0.5
MBN with r=0.25 1000 MBN with r=0.25
0.1 MBN with r=0 MBN with r=0
500
0 0 0
0 0.5 1 1.5 2 2.5 3 3.5 0 0.5 1 1.5 2 2.5 3 3.5 0.5 1 1.5 2 2.5 3 3.5
Time(s) x 10
4 Time(s) x 10
4 Time(s) x 10
4

(a) Delivery Ratio (b) Overhead Ratio (c) Average Delay

Fig. 6. Impact of delivery bonus volatility coefficient

1 30 5500

0.9 5000
25 4500
0.8
4000
0.7
Overhead Ratio(%)

20
Average Delay(s)
Delivery Ratio(%)

3500
0.6 Epidemic
MBN 3000
0.5 15 Bayesian
2500
PRoPHET
0.4
2000
10
0.3 Epidemic Epidemic
1500
0.2 MBN MBN
Bayesian 5 1000
Bayesian
0.1 PRoPHET 500 PRoPHET

0 0 0
0 0.5 1 1.5 2 2.5 3 3.5 0 0.5 1 1.5 2 2.5 3 3.5 0 0.5 1 1.5 2 2.5 3 3.5
Time(s) x 10
4 Time(s) x 10
4 Time(s) x 10
4

(a) Delivery Ratio (b) Overhead Ratio (c) Average Delay

Fig. 7. Algorithm Comparisons

Besides, the MBN considers the multi-period characteristic of V. C ONCLUSION


nodes movement patterns, which is able to reduce the waste of
This paper investigates the data forwarding problem for
resources in message forwarding. As it can be seen in Fig. 7(c),
VDTNs, where the delivery ratio is optimized while the over-
according to flooding mechanism of Epidemic, packets can be
head ratio is confined. The multi-period Bayesian network
easily forwarded. Therefore, the average delay of Epidemic can
classifier takes into account the dependencies among node
stay at a low level. However, the average delay of Prophet
attributes and captures the periodic movement patterns of
tends to be slightly higher. Likewise, the delay for the MBN
DTN nodes. Furthermore, the DMLC method is proposed to
algorithm also increased, since the forwarding strategy imposes
classify nodes into multiple levels which takes advantage of the
restrictions on the forwarding behavior of the next hop message.
dynamic delivery bonus. Based on those, MBN is developed to
In conclusion, MBN can improve the delivery ratio with a forward the data packets in the public transport environment.
minor overhead ratio through making routing decisions on the The simulations suggest that MBN can achieve a satisfactory
basis of the multi-period Bayesian network model and DMLC delivery ratio with a minor forwarding overhead. In the future,
method, and thus it is suitable for real situations in VDTNs. we are going to study our protocol under different mobility
models and extend it to multicast routing scenarios. [20] “The opportunistic network environment simulator,” [Online]. Available:
http://www.netlab.tkk.fi/tutkimus/dtn/theone/
ACKNOWLEDGMENT [21] J. G. Jetcheva, Y. C. Hu, S. P. Chaudhuri, A. K. Saha, and D. B. Johnson.
“Design and evaluation of a metropolitan area multitier wireless ad hoc
This research is supported by National Natural Science Foun- network architecture,” In Proceedings of fifth IEEE workshop on Mobile
Computing Systems and Applications, October 2003.
dation of China under Grants Nos. 41571389 and 61872191,
Open Research Fund from Key Laboratory of Computer Net-
work and Information Integration (SEU), Ministry of Edu-
cation, China, under Grant No. K93-9-2014-05B, Scientific
Research Foundation of NUPT under Grant No. NY214063.

R EFERENCES
[1] A. P. Silva, S. Burleigh, C. M. Hirata, and K. Obraczka, “A survey on
congestion control for delay and disruption tolerant networks,” Ad Hoc
Networks, vol. 25, no. PB, pp. 480–494, 2015.
[2] S. E. Alaoui, and B. Ramamurthy, “Routing optimization for DTN-
based space networks using a temporal graph model,” IEEE International
Conference on Communications, pp. 1–6, 2016.
[3] L. Zhang, B. Yu, and J. Pan, “GeoMob: a mobility-aware geocast scheme
in metropolitans via taxicabs and buses,” IEEE INFOCOM, pp. 1279–
1787, 2014.
[4] N. Vastardis, and K. Yang, “Mobile social networks: architectures, social
properties and key research challenges,” IEEE Commun. Surveys and
Tutorials, vol. 15, no. 3, pp. 1355–1371, 2013.
[5] S. M. Tornell, C. T. Calafate, J. C. Cano, and P. Manzoni, “DTN
Protocols for Vehicular Networks: An Application Oriented Overview,”
IEEE Communications Surveys & Tutorials, vol. 17, no. 2, pp. 868–887,
2017.
[6] A. Lindgren, A. Doria, and O. Scheln, “Probabilistic routing in intermit-
tently connected networks,” Sigmobile Mobile Computing & Communi-
cation Review, vol. 7, no. 3, pp. 19–20, 2004.
[7] J. Burgess, B. Gallagher, and B. N. Levine, “MaxProp: Routing for
Vehicle-Based Disruption-Tolerant Networks,” in Proceedings of IEEE
INFOCOM, vol. 6, pp. 1–11, 2007.
[8] F. R. Segundo, J. M. Farines, and E. S. E. Silva, “Quasi-opportunistic
contact prediction in delay/disruption tolerant network,” Global Informa-
tion Infrastructure Symposium, vol. 5, pp. 1–6, 2013.
[9] L. P. Portugalpoma, C. A. C. Marcondes, H. Senger, and L. Arantes,
“Applying Machine Learning to Reduce Overhead in DTN Vehicular
Networks,” IEEE Computer Society, pp. 94–102, 2014.
[10] S. Ahmed, and S. S. Kanhere, “A Bayesian Routing Framework for Delay
Tolerant Networks,” in Proceedings of IEEE WCNC 2010, pp. 1–6, April
2010.
[11] F. Ekman, and J. Karvo, “Working day movement model,” ACM Sigmo-
bile Workshop on Mobility MODELS, pp. 33–40, 2008.
[12] J. Tian, M. Gao, and G. Ge, “Wireless sensor network node optimal
coverage based on improved genetic algorithm and binary ant colony
algorithm,” Eurasip Journal on Wireless Communications & Networking,
vol. 2016, no. 1, pp. 104, 2016.
[13] F. Li and J. Wu, “Epidemic routing for partially connected ad hoc
networks,” Technical Report, CS-200006, Duke University, 2000.
[14] S. Jain, “Routing in delay tolerant network,” in Acm Sigcomm Confer-
ence, 2012.
[15] J. Liang, Y. Bai, C. Bi, Z. Sun, C. Yan, and H. Liang, “Adaptive Routing
Based on Bayesian Network and Fuzzy Decision Algorithm in Delay-
Tolerant Network,” in IEEE HPCC 2013, pp. 690–697, 2014.
[16] J. Cheng, and R. Greiner, “Comparing bayesian network classifiers,” IEEE
Transactions on Vehicular Technology, vol. 63, no. 5, pp. 2002–2012,
2013.
[17] L. Song, and D. F. Kotz, “Evaluating opportunistic routing protocols with
large realistic contact traces,” ACM Workshop on Challenged Networks,
pp. 35–42, 2007.
[18] J. I. Jun-Zhong, H. X. Zhang, H. U. Ren-Bing, and C. N. Liu, “A bayesian
network learning algorithm based on independence test and ant colony
optimization,” Acta Automatica Sinica, vol. 35, no. 3, pp. 281–288, 2009.
[19] R. A. Sutrisnowati, H. Bae, J. Park, and B. H. Ha, “Learning Bayesian
Network from Event Logs Using Mutual Information Test,” IEEE Inter-
national Conference on Service-Oriented Computing and Applications,
pp. 356–360, 2014.

You might also like