Least Fresh First Cache Replacement Policy for NDN-based IoT Networks

Maroua Meddeb (a,b), Amine Dhraief (a), Abdelfettah Belghith (c,*), Thierry Monteil (b,d), Khalil Drira (b), Hassan Mathkour (c)

(a) HANA Lab, University of Manouba, Tunisia
(b) LAAS-CNRS, Université de Toulouse, CNRS, Toulouse, France
(c) College of Computer and Information Sciences, King Saud University, Saudi Arabia
(d) Université de Toulouse, INSA, Toulouse, France

(*) Corresponding author. Tel.: +966 535920540. Email address: abelghith@ksu.edu.sa (Abdelfettah Belghith)

Abstract

In-network caching in Named Data Networking (NDN) based Internet of Things (IoT) plays a central role in efficient data dissemination. Data cached throughout the network may quickly become obsolete, as they are transient and frequently updated by their producers. As such, NDN-based IoT networks impose stringent requirements in terms of data freshness. While various cache replacement policies have been proposed, none has considered the cache freshness requirement. In this paper, we introduce a novel cache replacement policy called Least Fresh First (LFF) integrating the cache freshness requirement. LFF evicts invalid cached contents based on time series forecasting of sensors' future events. Extensive simulations are performed to evaluate the performance of LFF and to compare it to well-known cache replacement policies in ICN-based IoT networks. The obtained results show that LFF significantly improves data freshness compared to other policies, while enhancing the server hit reduction ratio, the hop reduction ratio and the response latency.

Keywords: IoT, NDN, cache replacement, freshness, prediction model

1. Introduction

We are currently experiencing a networking paradigm shift in which surrounding everyday objects are becoming interconnected and smart. The Internet of Things (IoT) is changing our way of living and represents an important disruptive technological revolution. The IoT traffic is increasing every day due to the explosive growth of the number of "things" connected to the Internet. Fearing that the current Internet architecture cannot support this evolution, numerous research efforts have proposed cutting-edge solutions to improve the efficiency of data dissemination.
Information-Centric Networking (ICN) [1] is a new Internet paradigm in which contents are addressed by their unique location-independent names rather than their host addresses. ICN natively provides multicast, in-network caching and name-based routing. Considering the benefits of this innovation, ICN is becoming a key technology for data dissemination in IoT networks.
In-network caching is one of the pillars of ICN [2]. It increases the data availability within the network since consumers' requests are often satisfied by cache nodes rather than by the producers. Thanks to this feature, the traffic load is significantly reduced. Another novel approach is given in [3], where the authors proposed a local storage service at the edge of the network to temporarily buffer generated data. The produced data is then selectively processed and/or synchronized with the cloud only when necessary, depending on the application strategy and requirements. Such an edge storage is intended to better shape the network traffic according to the network conditions. In this paper, we rather concentrate on in-network caching.
Some studies [4, 5, 6] have already identified the Named Data Networking
(NDN) architecture as the most suitable ICN architecture for IoT systems.
On the other hand, in an IoT context, data are transient and frequently
updated by the producer. As a consequence, copies stored in caching nodes
may become out of date after a certain period of time, and need to be evicted.
Caching mechanisms already use cache replacement policies in order to allow the storing of new items once the cache is full. However, existing policies are based on the incoming request frequency or on content popularity, and do not consider data validity.
In this work, we propose the Least Fresh First (LFF) cache replacement
policy. The rationale behind this policy is to predict sensors future events
based on their past behavior. To this end, we rely on the Autoregressive
Moving Average (ARMA) time series model [7]. To the best of our knowl-
edge, none of the studies that addressed the cache replacement policies have
considered the cache freshness requirement. To evaluate our proposal, we
carry out extensive simulations using the ccnSim simulator [8]. We compare
LFF to the different well-known cache replacement policies in an ICN-based IoT environment with regard to various performance metrics. The obtained
results show that our proposal significantly improves data freshness com-
pared to other policies. In addition, it improves the system performance in terms of server hit reduction ratio, hop reduction ratio and response latency.
The remainder of this paper is organized as follows: We give in section 2 an overview of the most cited caching policies in ICN. We detail our proposal in section 3. In section 4, we evaluate the performance of our proposal and analyze the obtained results. We finally conclude the paper in section 5.

2. Caching Policies in ICN

Caching policies address two main issues: where to cache, and which data item to evict once the cache is full.

2.1. In-network Caching Strategies

Relevant caching strategies have been proposed to answer the question


of where to cache. The Leave Copy Everywhere (LCE) strategy [9] leaves
(caches) a copy of the content at each node along the response path. The
Leave Copy Down (LCD) [9] strategy stores a copy in just one node, namely the node one level down on the reverse path towards the consumer. The edge-caching strategy [10] is proposed for hierarchical topologies; with this strategy, cache nodes reside in the leaves of the topology. The consumer-cache strategy [11] is a variant of edge-caching that can be used in any type of topology. Its main objective is firstly to reduce the caching cost while maintaining the system performance in terms of hop reduction and server hit reduction ratios [12], and secondly to enhance the freshness percentage of the requested contents. As reported in the surveys [13] and [14], [12] seems to be the only paper that has investigated the cache coherency issue for ICN-based IoT.
The betweenness centrality strategy [15] depends on the betweenness parameter calculated for each node in the topology. This parameter measures the number of times a node belongs to a path connecting any two nodes in the topology. Nodes with the highest betweenness centrality store a copy of the data. Finally, the ProbCache caching strategy [16] selects the cache node with a probability inversely proportional to the distance between the consumer and the producer; this strategy thus privileges one or more nodes which are close to the consumer who sent the request.
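To make these strategies concrete, the following minimal Python sketch (our own illustration, not code from the cited papers) expresses three of them as caching predicates evaluated at each node of the response path; the inputs path_len (hops from producer to consumer) and hops_from_producer are hypothetical names.

    import random

    def cache_here_lce(path_len, hops_from_producer):
        # Leave Copy Everywhere: every node on the response path caches.
        return True

    def cache_here_consumer_cache(path_len, hops_from_producer):
        # Consumer-cache: only the last hop before the requesting
        # consumer stores a copy.
        return hops_from_producer == path_len - 1

    def cache_here_probcache(path_len, hops_from_producer):
        # ProbCache, simplified: the caching probability grows as the
        # copy gets closer to the consumer, privileging nodes near the
        # consumer who sent the request.
        return random.random() < hops_from_producer / path_len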

2.2. Cache Replacement Policies

Once a cache becomes full, some stored content must be deleted to be replaced by new items. In the First In First Out (FIFO) policy, the oldest data object is replaced by the newly arriving content. The Least Recently Used (LRU) policy is more efficient in the case of high temporal locality of the request streams. LRU has been adopted in many studies addressing caching policies in NDN [17, 11]. This eviction policy replaces the data object that has not been used for the longest time. The Least Frequently Used (LFU) policy proposes to keep popular objects in the cache to satisfy a high number of requests. With LFU, the cache node keeps track of the number of times a data object satisfies a request, and replaces the item with the lowest frequency. The Random Replacement (RR) policy randomly chooses the data item to be evicted. Caches with complex data structures motivate the use of randomized cache replacement policies: the RR policy does not require state information, so both memory and processing power can be saved.
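As an illustration, the victim selection of these classic policies can be sketched in a few lines of Python (a generic sketch, not tied to any particular NDN implementation; the metadata layout is ours):

    import random
    import time

    class CacheMeta:
        """Per-content metadata used to pick an eviction victim."""
        def __init__(self):
            self.meta = {}  # name -> [insert_time, last_use, hit_count]

        def on_insert(self, name):
            now = time.time()
            self.meta[name] = [now, now, 0]

        def on_hit(self, name):
            self.meta[name][1] = time.time()   # recency, used by LRU
            self.meta[name][2] += 1            # frequency, used by LFU

        def victim(self, policy):
            if policy == "FIFO":   # oldest insertion is evicted
                return min(self.meta, key=lambda n: self.meta[n][0])
            if policy == "LRU":    # least recently used is evicted
                return min(self.meta, key=lambda n: self.meta[n][1])
            if policy == "LFU":    # least frequently used is evicted
                return min(self.meta, key=lambda n: self.meta[n][2])
            return random.choice(list(self.meta))  # RR: random victim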
Some other specific strategies have been recently proposed for ICN. In [18], the authors introduced a Universal Caching (UC) strategy where the replacement decision depends on a parameter assigned to each incoming content. This parameter is based on the distance from the source to the current node, the reachability of the router and the frequency of content access. Results show that this policy performs better than LRU and FIFO in ICN networks in terms of cache hits and the average number of hops required to get the requested content. In [19], Al-Turjman et al. introduced the Least-Value First (LVF) cache replacement policy, which takes into account the delay for retrieving a content as well as the popularity and age of the content. LVF was shown to outperform FIFO and LRU in terms of time-to-hit, hit-rate, in-network delay and data publisher load.
In an IoT context, data are transient and frequently updated by the producer. As a consequence, copies stored in caching nodes may become out of date after a certain short period of time. Facing this stringent freshness requirement, it is necessary to consider the data freshness and to privilege the eviction of stale contents from the cache. To the best of our knowledge, existing cache replacement policies are enhanced only by embedding a constant expiration delay (Time To Live) in cached contents. Such an expiration delay can support the data freshness to a certain extent, but since it uses a fixed delay it may delete still valid content or, on the contrary, keep out-of-date content in the cache for a long time.

3. The Least Fresh First Cache Replacement Algorithm

To select the least fresh content, we rely on prediction calculations. A cached content is supposed to be valid if it has not been updated at the source since its last retrieval. The basic idea of our policy is to predict the time of the next event of a sensor in order to estimate the residual life, $T_{fresh}$, of the retrieved content (its remaining validity time). In order to calculate $T_{fresh}$, it is essential to study the sensors' behavior. To this end, we need to analyze and classify the IoT traffic patterns.

3.1. The IoT traffic patterns

IoT traffic is usually classified into four categories: continuous, periodic, OnOff and request-response transmissions [20]. With continuous transmission, the data are transmitted in a continuous manner, as in video streaming. Under periodic transmission, the source sends a data item every fixed period of time T. For example, with a temperature sensor, after the elapse of each period (e.g., 1 hour), a new value is recorded. The OnOff transmission mode stipulates that the content is updated as soon as a new event occurs. We may consider here the example of a presence sensor: the value 0 of the sensor indicates the absence of persons; once someone gets in the room, the value is updated to 1. Finally, in the request-response transmission, as its name indicates, the consumer directly sends a request to get the current value of the sensor. In this latter case, sensors are considered to be passive, do not follow any behavior, and usually there is no need to cache their data. Furthermore, considering continuous transmission as periodic with a tiny period, we may classify IoT events into two classes: periodic and OnOff.
Concerning the periodic transmission, when the period T elapses, the content is no longer valid. However, with the OnOff transmission, the sensor behavior cannot be predicted and the updates of values can be performed at any time. For this reason, the prediction process is used with this mode. We differentiate in Algorithm 1 the two modes of transmission in the calculation of $T_{fresh}$. In the case of a periodic sensor, $T_{fresh}$ is the remaining time of the period since the last update (line 8). With an OnOff sensor, $T_{fresh}$ will be estimated using forecasting tools.

1: Input: Received request
2: Output: T_fresh
3: Data:
4: Sensors will receive requests from different consumers to retrieve the sensor's value
5: Begin
6: for each received request do
7:     if Data.flow = "Periodic" then
8:         T_fresh = (update_time + T) − current_time
9:     else
10:        Estimate T_fresh
11:    end if
12: end for
Algorithm 1: Calculation of T_fresh
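A direct Python transcription of Algorithm 1 could look as follows; estimate_tfresh_arma stands for the ARMA-based estimation developed in the next subsections, and the field names are our own illustration:

    def compute_tfresh(sensor, current_time):
        """Residual validity time of a sensor reading (Algorithm 1)."""
        if sensor.flow == "Periodic":
            # The copy expires when the next periodic update is due.
            return (sensor.update_time + sensor.period) - current_time
        # OnOff sensor: the next event cannot be derived from a period,
        # so T_fresh is forecast from past events (Sections 3.2-3.3).
        return estimate_tfresh_arma(sensor.past_events)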

3.2. Prediction model

The exponential smoothing, the machine learning and the Autoregressive Moving Average (ARMA) models are the most widely-used approaches to time series forecasting [7]. ARMA, which is also called the Box and Jenkins approach, can be regarded as a special case of the more general and more powerful Kalman filter algorithm [21]. Both are applied only to processes which satisfy linear models with a finite number of parameters. The Kalman filter algorithm carries an additional strength compared to the Box and Jenkins approach, especially for handling missing data; in our case, ARMA is sufficient since all the events will be recorded. Machine learning techniques work better when a rather huge amount of data and enough training are available, while ARMA is better for smaller data sets. The exponential smoothing is suitable for forecasting data which does not display any clear trend or seasonality, whereas the ARMA model is designed for stationary time series. In this study, we used real IoT data extracted from the ADREAM smart building [22] at the LAAS-CNRS laboratory. The building hosts our smart apartment equipped with different sensors (temperature, humidity, luminescence, presence, etc.) as well as actuators such as electric plugs attached to different elements: lamps, fans, humidifier, etc. We have noticed a very low variability of the data around their average value, which led us to choose the ARMA model [23] as a forecasting tool.

3.3. Autoregressive Moving Average model

The ARMA model was introduced by Box and Jenkins in [24]. It defines a very large family of stationary processes which has two advantages. On the one hand, these processes are excellent and precise forecasting tools: the forecasting error is proved to be less than or equal to 5% [25]. On the other hand, methods are perfectly developed to estimate their parameters. The prediction operation in ARMA is based not only on the past events but also on some unexpected recent events. In fact, it consists in eliminating obvious trends such as periodicity and growth, and then focusing on the residue; this latter is modeled, and the forecast relies on such a model.

An ARMA(p,q) model is represented by the stochastic process $(X_i)$ measuring the inter-arrival times between successive events (i.e., between $event_{i-1}$ and $event_i$, $i \geq 1$). The process is presented in Eq. 1, where the $\varepsilon_i$ are the errors of the previous estimations and $n$ is the number of past events. An ARMA(p,q) model is characterized by $p + q + 1$ unknown parameters, $\phi = (\phi_1, \dots, \phi_p)$, $\theta = (\theta_1, \dots, \theta_q)$ and $n$, that need to be estimated. $p$ and $q$ are respectively the numbers of $X_i$ and $\varepsilon_i$ used in the calculation. $p$ and $q$ can be constant, and it was shown in [25] that this model needs a maximum of 30 values to obtain a good estimation. However, to optimize the calculation, $p$ and $q$ could be variable and calculated according to the values of $X_i$ and $\varepsilon_i$.

$$X_n + \phi_1 X_{n-1} + \phi_2 X_{n-2} + \cdots + \phi_p X_{n-p} = \varepsilon_n + \theta_1 \varepsilon_{n-1} + \theta_2 \varepsilon_{n-2} + \cdots + \theta_q \varepsilon_{n-q} \qquad (1)$$

Forecasting $T_{fresh}$ amounts to calculating $X_{n+1}$. To this end, it is necessary to start by collecting the data $(X_i)$; then, we calculate the ARMA parameters $(p, q, \phi_i, \theta_i, \varepsilon_i)$. The learning phase is indispensable in the forecasting process: it is an essential step to collect a sufficient set of data and to analyze the observed process in terms of stationarity, trends and seasonality.
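For readers who wish to experiment, a one-step ARMA forecast of the next inter-event interval can be obtained off the shelf; the sketch below uses the statsmodels Python library as an illustration (the gateways in our system implement the estimation directly, as in Algorithm 2):

    import numpy as np
    from statsmodels.tsa.arima.model import ARIMA

    def forecast_tfresh(event_times, p=2, q=1):
        """Forecast the next inter-event interval (X_{n+1}) of an OnOff
        sensor from its past event timestamps; about 30 values suffice
        for a good estimation [25]."""
        x = np.diff(np.asarray(event_times, dtype=float))  # series (X_i)
        # An ARMA(p, q) is an ARIMA(p, 0, q): no differencing is needed
        # for a stationary series.
        fitted = ARIMA(x, order=(p, 0, q)).fit()
        return float(fitted.forecast(steps=1)[0])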

3.3.1. Calculation of ARMA parameters

The ARMA process is the convolution of two processes: it begins with the AR (Autoregressive) process, followed by the MA (Moving Average) process. $X_n$ is an autoregressive process of order $p$, $AR_p$. It is therefore determined by the white-noise variance as well as its canonical polynomial (Eq. 2), where $\phi_1, \dots, \phi_p$ are the parameters of the model.

$$X_n = \varepsilon_n - \phi_1 X_{n-1} - \phi_2 X_{n-2} - \cdots - \phi_p X_{n-p} \qquad (2)$$

The autoregressive model specifies that the future estimated value $X_n$ depends linearly on its own previous values $X_{n-1}, \dots, X_{n-p}$ and on a stochastic term $\varepsilon_n$ which represents the imperfectly predictable part. $\varepsilon_n$ is the white noise, whose variance $\sigma^2$ is given in Eq. 3.

$$\varepsilon_n = \sigma^2 = \frac{1}{n} \sum_{i=1}^{n} X_i^2 - \bar{X}^2 \qquad (3)$$

$\phi = (\phi_1, \dots, \phi_p)$ is the solution of the Yule-Walker equation given by Eq. 4, where $C_{X,p}^{(n)} = (C_X^n(1), \dots, C_X^n(p))$ are the empirical covariances (Eq. 5) and $R_p^{(n)}$ is the Toeplitz matrix of the covariances up to order $p$.

$$R_p^{(n)} \phi^{(n)} = C_{X,p}^{(n)} \iff \phi^{(n)} = (R_p^{(n)})^{-1} C_{X,p}^{(n)} \qquad (4)$$

$$C_X^n(k) = \frac{1}{n} \sum_{j=1}^{n-k} X_{j+k} X_j \qquad (5)$$

$$R_p^{(n)} = \begin{pmatrix}
1 & C_X^n(1) & C_X^n(2) & \cdots & C_X^n(p-1) \\
C_X^n(1) & 1 & C_X^n(1) & \cdots & C_X^n(p-2) \\
C_X^n(2) & C_X^n(1) & 1 & \cdots & C_X^n(p-3) \\
\vdots & \vdots & \vdots & \ddots & \vdots \\
C_X^n(p-1) & C_X^n(p-2) & C_X^n(p-3) & \cdots & 1
\end{pmatrix}$$
We solve the $AR_p$ equation in Algorithm 2. As we can see from lines 11 to 23, $p$ is set according to the calculated $\phi_p$. The goal is to choose the best value of $p$ for the given distribution $(X_i)$ that will be sufficient for a precise estimation. A corollary used in AR provides a tool for finding the right value of $p$ when we want to model a series with an AR process. We can, in fact, calculate one by one the empirical partial correlations $\phi_p$. If we compare them to the quantile of order 0.975 of the Gaussian ($Z_{.975}$) divided by $\sqrt{n}$, we can see from which value of $p$ the absolute value of $\phi_p$ remains smaller (line 15). We can then admit that $p - 1$ is the best value (line 17), since $p$ pushes the process into the rejection region.
1: Input: (X_i)
2: Output: X_{n+1}
3: Data:
4: Sensors with full past event container
5: Sensors with an OnOff transmission mode
6: Begin
7: for each sensor do
8:     Calculate ε_n (Eq. 3)
9:     for each received request do
10:        p = 1
11:        while p ≤ n do
12:            Calculate C_{X,p}^{(n)} (Eq. 5)
13:            Calculate (R_p^{(n)})^{-1}
14:            Calculate φ_p (Eq. 4)
15:            if |φ_p| ≤ Z_{.975}/√n then
16:                if p > 1 then
17:                    p = p − 1
18:                end if
19:                break
20:            else
21:                p = p + 1
22:            end if
23:        end while
24:        Calculate X_{n+1} (Eq. 2)
25:    end for
26: end for
Algorithm 2: Calculation of the AR_p process parameters


to the 0.975 order of the Gaussian ( Z.975 ), divided by n, we can see from
which value of p, the absolute value of φp remains smaller (line 15). We can
then admit that p − 1 is the best value (line 17) since p pushes the process
in the rejection region.
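The order-selection loop of Algorithm 2 (lines 11-23) can be sketched with NumPy/SciPy as follows; this is an illustrative reconstruction using the standard Yule-Walker sign convention, not the simulator code:

    import numpy as np
    from scipy.linalg import toeplitz

    def fit_ar(x, max_p=30):
        """Yule-Walker estimation of the AR part (Eqs. 4-5), stopping at
        the first order p whose coefficient phi_p is no longer significant
        and keeping order p-1, as in Algorithm 2."""
        x = np.asarray(x, dtype=float)
        x = x - x.mean()               # covariances assume a centered series
        n = len(x)
        threshold = 1.96 / np.sqrt(n)  # Z_.975 / sqrt(n)
        # Empirical covariances C_X(k) of Eq. 5, for k = 0..max_p
        c = np.array([x[k:] @ x[:n - k] / n for k in range(max_p + 1)])
        prev_phi = None
        for p in range(1, max_p + 1):
            R = toeplitz(c[:p]) / c[0]                   # Toeplitz matrix R_p
            phi = np.linalg.solve(R, c[1:p + 1] / c[0])  # Eq. 4
            if abs(phi[-1]) <= threshold and prev_phi is not None:
                return prev_phi        # p - 1 was the best order
            prev_phi = phi
        return prev_phi

    def ar_forecast(x, phi):
        """One-step AR prediction on the centered series."""
        x = np.asarray(x, dtype=float)
        mean = x.mean()
        recent = (x - mean)[::-1][:len(phi)]   # X_n, X_{n-1}, ...
        return float(mean + np.dot(phi, recent))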
The $X_{n+1}$ ($T_{fresh}$) obtained with the AR process is already sufficient, especially with perfectly stationary series. However, for better precision, the Moving Average process is used to readjust $T_{fresh}$. We call the MA process of order $q$ (also called $MA_q$) the process defined by Eq. 6, where $\theta_1, \dots, \theta_q$ are the parameters of the model.

$$X_n = \theta_1 \varepsilon_{n-1} + \theta_2 \varepsilon_{n-2} + \cdots + \theta_q \varepsilon_{n-q} \qquad (6)$$

The MA process is combined with the AR process in order to adjust the results by taking into account the errors of previous predictions. In fact, $\varepsilon_i$ is the error between a past predicted value and the real one ($\varepsilon_i = |X_i^{real} - X_i^{forecast}|$). To solve the MA process, we calculate the $\theta_i$ with the innovation algorithm [26].

$$\theta_i = \mathrm{Cov}(X_n, \varepsilon_{t-n}) / \sigma^2 \qquad (7)$$

In the same way as with the AR process, we calculate the MA process according to Eq. 6, using the $\varepsilon_i$ and $\theta_i$. It is worth noticing that $p$ and $q$ can have different values. After the calculation of the MA process, we can finally deduce the adjusted value of $X_{n+1}$. The calculated $T_{fresh}$ is then appended to the requested data and sent back to the consumer. As a consequence, all cached contents will contain the freshness delay information.
To recapitulate, the data collection as well as the forecasting calculations are only performed at the gateways directly connected to the sensors. Each gateway handles the sensors connected to it and maintains a queue for each sensor. While the data collection is done for each event, the prediction process is only invoked upon the arrival of a request. In fact, the data collection is invoked when a new event occurs, and each sensor event is stored in its specific queue. The prediction is only performed when a request arrives at the gateway; the queue used in the forecasting process is the one corresponding to the requested sensor. The estimated lifetime, called $T_{fresh}$, is then appended to the data packet. The authors in [27] showed that the complexity of the forecasting model using ARMA is $O(n)$, where $n$ is the number of collected data, usually set to $n \leq 30$. As such, the cumulative prediction complexity at each gateway is $O(n)$ times the number of received Interests.
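This division of labour at the gateway can be summarized by the following schematic sketch (hypothetical class and method names; it reuses the forecast_tfresh sketch above):

    from collections import defaultdict, deque

    class Gateway:
        def __init__(self, history=30):
            # One bounded queue of event timestamps per attached sensor:
            # n <= 30 past values are enough for the ARMA estimation [25].
            self.events = defaultdict(lambda: deque(maxlen=history))

        def on_sensor_event(self, sensor_id, timestamp):
            # Data collection: invoked for every new event, O(1).
            self.events[sensor_id].append(timestamp)

        def on_interest(self, sensor_id, value, now):
            # Prediction: invoked only when an Interest arrives, O(n);
            # the estimated T_fresh is appended to the Data packet.
            t_fresh = forecast_tfresh(list(self.events[sensor_id]))
            return {"name": sensor_id, "value": value, "T_fresh": t_fresh}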

3.4. LFF algorithm

The approach, presented in Algorithm 3, is invoked each time there is a new Data to cache. As a first step, a lookup is performed to check whether the content already exists in the cache (line 11). If it does, only the version, the cache time and $T_{fresh}$ are updated (lines 18-20). If not, it is necessary to verify whether the cache is full (line 22). If there is room to add the content, the latter is stacked on top of the cache (line 23). Otherwise, one data item $c_i$ has to be evicted: a scan over the cache is performed in order to select the $c_i$ with the least $T_{fresh} + cache\_time$ and evict it (lines 25-28). The LFF cache replacement policy can be used with any caching strategy.

1: Input: A new data item to cache
2: Output: A data item to evict from the cache
3: Data:
4: Data to cache
5: Cache = (c_1, c_2, ..., c_n) s.t. n ≤ cache_size
6: c_i the position of the data item to be evicted, s.t. 1 ≤ i ≤ n
7: Begin
8: i = 1
9: Found = False
10: while NOT Found AND i ≤ n do
11:     if c_i.name = Data.name then
12:         Found = True
13:     else
14:         i = i + 1
15:     end if
16: end while
17: if Found then
18:     c_i.version = Data.version
19:     c_i.cache_time = current_time
20:     c_i.T_fresh = Data.T_fresh
21: else
22:     if n ≠ cache_size then
23:         c_{n+1} = Data
24:     else
25:         for each c_i ∈ c_{1..n} do
26:             c_evict = min(c_i.T_fresh + c_i.cache_time)
27:         end for
28:         Evict(c_evict)
29:     end if
30: end if
Algorithm 3: Least Fresh First
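In Python, the eviction rule of Algorithm 3 reduces to taking the minimum of cache_time + T_fresh over the stored items; the sketch below is a compact transcription, with the cache modeled as a list of dictionaries (our representation, not the simulator's):

    def lff_insert(cache, cache_size, data, current_time):
        """Least Fresh First insertion (Algorithm 3)."""
        for item in cache:                        # lookup (line 11)
            if item["name"] == data["name"]:
                item["version"] = data["version"]  # refresh the copy
                item["cache_time"] = current_time
                item["T_fresh"] = data["T_fresh"]
                return
        if len(cache) < cache_size:               # room left (line 22)
            cache.append(dict(data, cache_time=current_time))
            return
        # Cache full: evict the item that expires (or expired) first,
        # i.e. the one with the least cache_time + T_fresh (lines 25-28).
        victim = min(cache, key=lambda c: c["cache_time"] + c["T_fresh"])
        cache.remove(victim)
        cache.append(dict(data, cache_time=current_time))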

3.5. Example with LFF policy

We detail in Figure 1 the operation of the LFF cache replacement policy. We suppose that the cache size is equal to 5 and we propose to use the consumer-cache caching strategy. Consumer1 is interested in content /Home/room2/pre. Since there is no entry in cache nodes that can satisfy this request, the data is retrieved from the producer. When the response reaches node n3, according to the adopted caching strategy, a copy of the content has to be stored in the cache. However, the cache is full, so one item must be evicted to allow the caching of the new one. The candidate to be ejected is the one with the least value of $T_{fresh} + cache\_time$, which measures the time from which the content is considered invalid. In our scenario, the content /Home/room1/hum is deleted (red line) and the new item is pushed onto the stack (green line). The CS structure maintained by node n1 is not full, so the new item is directly pushed onto the stack. We remark that the cache at n3 filled up faster than the cache at n1. This is because n3 stores the content requested by both Consumer1 and Consumer2, while n1 caches only the content requested by Consumer1.
[Figure 1: An example with the NDN architecture using the LFF policy. The figure shows sensors Sensor1 (/Home/room1/hum) and Sensor2 (/Home/room2/pre), consumers Consumer1 and Consumer2, intermediate nodes n1-n6, and the Content Store (CS) tables of nodes n1 and n3 with columns Id, Data, cache_time and T_fresh.]

4. Performance evaluation

We now present the performance evaluation of our proposed cache replacement policy combined with the different caching strategies presented in section 2. With the purpose of simulating an information-centric IoT network, we use the ccnSim simulator [8] to model NDN nodes, and we add some special nodes to model IoT devices (e.g., sensors).

4.1. Simulation scenario

In the simulation scenario, we need to fix two distribution laws: the distribution governing the generation of Interest requests at each consumer, and the distribution governing the choice of the content to be requested among all available contents. For the former, we assume that each consumer requests contents following a Poisson process with parameter λ = 1, that is, 1 request per second on average for each consumer. The latter distribution concerns the selection of the content among the available contents (the content catalog). Under IoT, contents have close probabilities of being requested. Therefore, Interest packets are assumed to be uniformly distributed, as in [28, 29].
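This workload is easy to reproduce: exponential inter-arrival times yield the Poisson process with λ = 1, and the content identifier is drawn uniformly from the catalog. A minimal Python sketch (for illustration only; the actual experiments use ccnSim):

    import random

    def consumer_requests(catalog_size, duration, lam=1.0, seed=None):
        """Yield (time, content_id) pairs: Poisson request arrivals with
        rate lam, contents chosen uniformly from the catalog."""
        rng = random.Random(seed)
        t = 0.0
        while True:
            t += rng.expovariate(lam)   # exponential inter-arrival time
            if t > duration:
                return
            yield t, rng.randrange(catalog_size)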
The authors in [30] showed that in existing studies the ratio of the cache size $C$ over the catalog size $F$ lies in $\frac{C}{F} \in [10^{-5}, 10^{-1}]$. To be faithful to this constraint, in our simulation we set $\frac{C}{F} = 10^{-3}$: we consider a cache size $C = 4$ chunks and a catalog size $F = 4000$ files. In our simulation, we do not consider file fragmentation, and we suppose that each file is represented as a unique chunk.
Many topologies can be used to evaluate ICN aspects. We choose the Transit-Stub (TS) topology [31], which can model an IoT topology. Our topology is composed of 260 nodes distributed over 2 transit domains with on average 10 transit nodes, each connected to 2 stub domains with on average 6 stub nodes. The 4000 sensors are connected to 40 gateways. We consider 25 consumers already connected at the beginning of the simulation. The producers and their consumers are distributed in such a way that they do not belong to the same transit domain. However, a gateway can be connected to both the consumer and the producer.
As we have already mentioned, our simulations were carried out with real IoT data extracted from ADREAM. We chose periodic sensors with different periods T varying from 1s to 1h; we used smart meters such as temperature, humidity and luminescence sensors. Concerning the OnOff sensors, we used devices having different variances: for example, a presence detection sensor deployed in a corridor does not have the same update frequency as a presence detection sensor deployed in a bedroom. We report that the majority of the considered sensors have a sensing variance between 7s and 13s.

4.2. Performance metrics

We consider the Hop Reduction Ratio, the Server Hit Reduction Ratio and the Response Latency metrics. We then propose the Validity metric to examine the freshness of the requested data.
The Hop Reduction Ratio α measures the reduction of the number of hops traversed to satisfy a request compared to the number of hops required to retrieve the content from the server.

$$\alpha = 1 - \frac{1}{N} \sum_{i=1}^{N} \frac{1}{R} \sum_{r=1}^{R} \frac{h_{ir}}{H_{ir}} \qquad (8)$$

where $N$ is the number of consumers and $R$ is the number of requests created per consumer. The parameter $h_{ir}$ is the number of hops from consumer $i$ to the cache that satisfies request $r$, and $H_{ir}$ is the number of hops from $i$ to the producer.
β represents the Server Hit Reduction Ratio, measuring the reduction of the rate of access to the server/producer.

$$\beta = 1 - \frac{\sum_{i=1}^{N} serverhit_i}{\sum_{i=1}^{N} totalReq_i} \qquad (9)$$

where $serverhit_i$ is the number of requests sent by consumer $i$ and satisfied by the producer, and $totalReq_i$ is the total number of requests sent by $i$, satisfied by either the producer or a cache.
The third metric is the Response Latency, representing the duration between the delivery of a request and its response. In Eq. 10, we calculate γ, the average of the response latencies $T_{ir}$ over the $N$ consumers, each sending $R$ requests.

$$\gamma = \frac{1}{N} \sum_{i=1}^{N} \frac{1}{R} \sum_{r=1}^{R} T_{ir} \ \ (ms) \qquad (10)$$

To measure the data freshness, we use the metric presented in Eq. 11. Validity is the percentage of valid contents received by the $N$ consumers and satisfied by cache nodes, over the total number of received contents, both valid and invalid. In Eq. 11, we denote by $valid_i$ and $invalid_i$ respectively the numbers of valid and invalid contents received by consumer $i$ and satisfied by a cache node.

$$Validity(\%) = \frac{100 \sum_{i=1}^{N} valid_i}{\sum_{i=1}^{N} (valid_i + invalid_i)} \qquad (11)$$
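Given per-request logs, the four metrics follow directly from Eqs. 8-11. A minimal Python sketch (field names are ours; it flattens the per-consumer averages, which coincides with Eqs. 8 and 10 when every consumer sends the same number of requests):

    def compute_metrics(requests):
        """requests: dicts with keys h (hops to the satisfying node),
        H (hops to the producer), latency (ms), from_cache (bool) and
        valid (bool, meaningful when from_cache is True)."""
        n = len(requests)
        alpha = 1 - sum(r["h"] / r["H"] for r in requests) / n       # Eq. 8
        beta = 1 - sum(not r["from_cache"] for r in requests) / n    # Eq. 9
        gamma = sum(r["latency"] for r in requests) / n              # Eq. 10
        cached = [r for r in requests if r["from_cache"]]
        validity = 100 * sum(r["valid"] for r in cached) / len(cached)  # Eq. 11
        return alpha, beta, gamma, validity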

4.3. Simulation Results

ccnSim is distributed with native support for the LRU cache replacement policy and already provides the LCE, LCD and ProbCache caching strategies. We have enhanced the simulator to support the RR, LFU and FIFO cache replacement policies, and we also implemented the Btw, edge-caching and consumer-cache strategies. We refer the reader to [11, 32, 33] for further simulations and results.
We plot the system performance of the caching strategies and replacement policies in terms of the Server Hit Reduction Ratio (Figure 2), the Hop Reduction Ratio (Figure 3) and the Response Latency (Figure 4). We first remark that the cache replacement policies exhibit the same relative behavior under the different caching strategies.

[Figure 2: Server hit reduction ratio. Figure 3: Hop reduction ratio. Both compare the LFF, RR, LRU, LFU and FIFO replacement policies under the LCE, LCD, ProbCache, Btw, Edge and Consumer-cache strategies.]


Regardless of the cache replacement policy, we observe that LCE performs the worst and that consumer-cache outperforms all the other strategies. Second, the impact of the different cache replacement policies is much more noticeable with LCE than with consumer-cache. To explain this behaviour, we plot in Figure 5 the average number of evictions for each caching strategy. This figure shows that the closer the cache nodes are to the producer, the higher the number of evictions. This is because nodes close to the producers belong to many request paths, whereas nodes close to a consumer belong only to the request paths starting from this consumer. LCE has the highest number of evictions, as expected. Notice that the increase in the number of evictions diminishes the cache efficiency. We report from 0.32 to 0.56 of Server Hit Reduction Ratio (Figure 2); these values mean that only 56% of requests at best are satisfied by cache nodes. The Hop Reduction Ratio (Figure 3) for this strategy is between 0.44 and 0.67, which means that paths are reduced by up to 67% in terms of number of hops. Concerning the third metric, the response latency (Figure 4), it ranges from about 120ms to 221ms.
On the other hand, edge-caching and consumer-cache have almost the same number of evictions and yield good results. The minor difference between these two strategies stems from the fact that the consumer-cache strategy brings contents closer to consumers. We report under the consumer-cache strategy from 0.84 to 0.92 of server hit reduction. The hop reduction ratio is about 0.76 to 0.89, implying that requests only cross 11% of the hops on the path towards the producer. Finally, with this strategy, the response latency varies from 93ms to 121ms. LCD, after a certain number of requests, tends towards LCE and all path nodes become caches; for this reason, LCD results are not as bad as those of LCE. Concerning ProbCache and Btw, cache nodes are selected in the middle of the request path, and probably closer to consumers in the case of ProbCache; the simulation results of these two strategies are intermediate compared to the other caching strategies. Results in [10] showed that the edge nodes are the best placement for caches, and our findings confirm this conclusion: edge-caching reports good results. Consumer-cache has the best simulation results because requests are, in most cases, satisfied by the first-hop node.
[Figure 4: Response latency (ms). Figure 5: Number of evictions. Figure 4 compares the replacement policies (LFF, RR, LRU, LFU, FIFO); Figure 5 reports the number of evictions per caching strategy.]

Now, we evaluate the impact of the different cache replacement policies. Figures 2, 3 and 4 show that the LFF and RR policies outperform the LRU, LFU and FIFO policies. Recall that in an IoT environment, requests are uniformly distributed and all sensors have close probabilities of being solicited; in other words, contents are randomly requested. This fact explains why the RR policy outperforms LRU and LFU. The FIFO policy aims to keep each content as long as possible in the cache node, regardless of the frequency with which each content is requested; moreover, the evicted item is not uniformly selected. This policy may be suitable with a closed queue-based request distribution. In our scenario, FIFO presents the worst results.
Concerning our proposed cache replacement policy LFF, the selection of the content to be evicted is not related to the incoming requests but to the data freshness. For this reason, our policy does not contradict a uniform request distribution. This explains why the RR and LFF results are very close to each other. The minor difference between these two policies is due to the fact that the RR policy does not follow any logic: it can delete a content which has just been stored or, on the contrary, keep a content in the cache for a long time. LFF is more coherent with respect to the contents' lifetimes.
Figure 2 reports from 0.65 to 0.93 of server hit reduction under LFF and from 0.56 to 0.92 with the RR policy. Under the LRU policy, from 43% to 89% of requests are satisfied by cache nodes. The same figure portrays between 0.38 and 0.87 of server hit reduction using LFU. Finally, with the FIFO policy, results are almost from 0.32 to 0.84.

[Figure 6: Validity (%) of contents served from caches, comparing the LFF, RR, LRU, LFU and FIFO replacement policies under the LCE, LCD, ProbCache, Btw, Edge and Consumer-cache strategies.]

Figure 3 gives the same performance
results. The LFF policy outperforms the other replacement policies with 0.75 to 0.89 of hop reduction ratio. RR gives close results, about 0.67 to 0.86. This ratio is about 0.58 to 0.83 under LRU and between 0.53 and 0.80 under LFU. With the FIFO policy, requests traverse from 24% to 56% of the path towards the producer. The response latency, depicted in Figure 4, is the lowest with the LFF policy, at 83ms to 115ms. It is about 93ms to 135ms under the RR policy. With LRU and LFU, it varies from 108ms to 170ms and from 113ms to 185ms respectively. The FIFO policy reports the longest response latency, with 121ms to 221ms.
The proposed LFF cache replacement policy maximizes the content validity percentage to meet the IoT freshness requirement. It strives to predict the exact delays of updates in order to eliminate copies supposed to be invalid. Figure 6 depicts the percentage of fresh content with the different cache replacement policies and caching strategies.
The LRU, LFU, FIFO and RR policies do not consider the data freshness in the eviction process. That means an item may remain stored in the cache for
a long time even if it is no longer valid. Figure 6 compares the different cache replacement policies in terms of data freshness. Since, among the compared cache replacement policies, only ours considers the data freshness requirement, we can intuitively deduce that it yields better results in terms of data validity.
As shown in Figure 6, LRU, LFU and FIFO have almost the same percentage of content validity, from 52% with consumer-cache down to 45% with LCE. In fact, the LRU and LFU policies usually keep the most solicited contents in the cache for a long time, until they are evicted. The same holds for FIFO, which follows the logic of keeping all contents in the cache for the longest time. The RR policy has slightly better results compared to LRU, LFU and FIFO, from 61% with consumer-cache down to 52% with LCE; this policy is random and does not follow any law to manage the lifetime of a content in a cache. Concerning our proposed LFF cache replacement policy, it calculates the required lifetime during which the content is supposed to be valid. Then, at eviction time, if all the contents are valid, it selects the one with the least remaining lifetime, and if several contents are already invalid, it selects the one that has been invalid the longest. The LFF policy, combined with any caching strategy, significantly increases the data validity percentage: Figure 6 shows that this percentage can reach 96% with consumer-cache and 81% under LCE.

5. Conclusion

Information-Centric Networking (ICN) is a "clean slate" solution to support Future Internet challenges. Thanks to its relevant benefits, ICN has been nominated as the most suitable solution to deal with the challenges of IoT networks. In-network caching is considered the most important ICN feature, since it improves data dissemination by satisfying requests from cache nodes. However, in an IoT environment, cached contents can rapidly become out of date, which raises the data freshness requirement and challenge. To this end, we addressed the cache coherence issue in ICN/IoT networks. We introduced a novel cache replacement policy named Least Fresh First (LFF) which provides the capability to evict the least fresh contents from the cache. The LFF policy is based on time series analysis as a tool to predict future events, and therefore to estimate the residual lifetime of cached contents. The conducted simulations showed that our proposal exhibits superior results in terms of the freshness percentage of the retrieved contents as well as in terms of system performance.

6. Acknowledgement

The authors extend their appreciation to the Deanship of Scientific Research at King Saud University for funding this work through research group no. RGP-1436-031.

References

[1] V. Jacobson, D. K. Smetters, J. D. Thornton, M. F. Plass, N. H. Briggs, R. L. Braynard, Networking named content, in: CoNEXT '09, ACM, 2009, pp. 1–12.

[2] Y. Zhang et al., ICN based architecture for IoT: Requirements and challenges (2013).

[3] I. Psaras, O. Ascigil, S. Rene, G. Pavlou, A. Afanasyev, L. Zhang, Mobile data repositories at the edge, in: HotEdge'18, USENIX Association, Boston, MA, 2018.

[4] E. Baccelli, C. Mehlis, O. Hahm, T. C. Schmidt, M. Wählisch, Information centric networking in the IoT: Experiments with NDN in the wild, in: ICN'14, ACM, New York, NY, USA, 2014, pp. 77–86.

[5] O. Ascigil, S. Rene, G. Xylomenos, I. Psaras, G. Pavlou, A keyword-based ICN-IoT platform, in: ICN'17, ACM, 2017.

[6] M. Meddeb, A. Dhraief, A. Belghith, T. Monteil, K. Drira, S. AlAhmadi, Named data networking: A promising architecture for the internet of things (IoT), IGI IJSWIS 14 (2).

[7] R. Hyndman, G. Athanasopoulos, Forecasting: principles and practice, OTexts, 2014.

[8] R. Chiocchetti, D. Rossi, G. Rossini, ccnSim: A highly scalable CCN simulator, in: ICC'13, 2013, pp. 2309–2314.

[9] L. Saino, I. Psaras, G. Pavlou, Icarus: a caching simulator for information centric networking (ICN), in: SIMUTOOLS'14, ICST, Brussels, Belgium, 2014.

[10] S. K. Fayazbakhsh, Y. Lin, A. Tootoonchian, A. Ghodsi, T. Koponen, B. Maggs, Less pain, most of the gain: Incrementally deployable ICN, SIGCOMM Comput. Commun. Rev. 43 (4) (2013) 147–158.

[11] M. Meddeb, A. Dhraief, A. Belghith, T. Monteil, K. Drira, Cache coherence in machine-to-machine information centric networks, in: LCN'15, Clearwater Beach, FL, USA, October 26-29, 2015, pp. 430–433.

[12] M. Meddeb, A. Dhraief, A. Belghith, T. Monteil, K. Drira, S. AlAhmadi, Cache freshness in named data networking for the internet of things, The Computer Journal 61 (10) (2018) 1496–1511.

[13] S. Arshad, M. A. Azam, M. H. Rehmani, Information-centric networking based caching and naming schemes for internet of things: A survey and future research directions, IEEE Communications Surveys and Tutorials.

[14] I. U. Din, S. Hassan, M. K. Khan, M. Guizani, O. Ghazali, A. Habbal, Caching in information-centric networking: Strategies, challenges, and future research directions, IEEE Communications Surveys and Tutorials.

[15] W. K. Chai, D. He, I. Psaras, G. Pavlou, Cache "less for more" in information-centric networks, in: Networking, IFIP'12, Springer-Verlag, 2012, pp. 27–40.

[16] I. Psaras, W. K. Chai, G. Pavlou, Probabilistic in-network caching for information-centric networks, in: ICN'12, ACM, 2012, pp. 55–60.

[17] V. Sourlas, P. Flegkas, L. Tassiulas, A novel cache aware routing scheme for information-centric networks, Computer Networks 59 (2014) 44–61.

[18] B. Panigrahi, S. Shailendra, H. K. Rath, A. Simha, Universal caching model and markov-based cache analysis for information centric networks, in: ANTS'14, 2014, pp. 1–6.

[19] F. M. Al-Turjman, A. E. Al-Fagih, H. S. Hassanein, A value-based cache replacement approach for information-centric networks, in: LCN'13, 2013, pp. 874–881.

[20] R. Liu, W. Wu, H. Zhu, D. Yang, M2M-oriented QoS categorization in cellular network, in: WiCOM'11, 2011, pp. 1–5.

[21] P. E. Caines, Relationship between Box-Jenkins-Åström control and Kalman linear regulator, Proc. Institution Elec. Engineers 119 (5) (1972) 615–620.

[22] LAAS-CNRS, Adream (2013). URL http://www.laas.fr/1-32329-Le-batiment-intelligent-Adream-instrumente-et-econome-en-energie.php

[23] S. Makridakis, M. Hibon, ARMA models and the Box-Jenkins methodology, Journal of Forecasting 16 (3) (1997) 147–163.

[24] G. Box, G. M. Jenkins, Time Series Analysis: Forecasting and Control, 1st Edition, Holden-Day Inc., San Francisco, 1970.

[25] P. J. Brockwell, R. A. Davis, Time Series: Theory and Methods, Springer New York, 1991.

[26] J. Navarro-Moreno, ARMA prediction of widely linear systems by using the innovations algorithm, Trans. on Sig. Proc. 56 (7) (2008) 3061–3068.

[27] J. Rissanen, Stochastic complexity and modeling, The Annals of Statistics 14 (3) (1986) 1080–1100.

[28] J. Quevedo, D. Corujo, R. Aguiar, Consumer driven information freshness approach for content centric networking, in: INFOCOMW'14, 2014, pp. 482–487.

[29] M. Hail, M. Amadeo, A. Molinaro, S. Fischer, Caching in named data networking for the wireless internet of things, in: RIOT'15, 2015, pp. 1–6.

[30] D. Rossi, G. Rossini, Caching performance of content centric networks under multi-path routing, Tech. rep., Telecom ParisTech (2011).

[31] K. L. Calvert, M. B. Doar, E. W. Zegura, Modeling internet topology, IEEE Communications Magazine 35 (6) (1997) 160–163.

[32] M. Meddeb, A. Dhraief, A. Belghith, T. Monteil, K. Drira, How to cache in ICN-based IoT environments?, in: AICCSA'17, 2017, pp. 1117–1124.

[33] M. Meddeb, A. Dhraief, A. Belghith, T. Monteil, K. Drira, S. Gannouni, AFIRM: Adaptive forwarding based link recovery for mobility support in NDN/IoT networks, Future Gen. Comp. Systems 87 (2018) 351–363.