You are on page 1of 28

A Survey on Low Latency Towards 5G: RAN, Core

Network and Caching Solutions


Imtiaz Parvez, Student Member, IEEE, Ali Rahmati, Student Member, IEEE, Ismail Guvenc, Senior Member, IEEE,
Arif I. Sarwat, Senior Member, IEEE, and Huaiyu Dai, Fellow, IEEE

Abstract—The fifth generation (5G) wireless network technol- 100 kb/s 384 kb/s-2 Mb/s 150-450 Mb/s 10 Gb/s
ogy is to be standardized by 2020, where main goals are to
improve capacity, reliability, and energy efficiency, while reducing 500-1000 ms 200 ms 100 ms 1 ms
latency and massively increasing connection density. An integral
arXiv:1708.02562v1 [cs.NI] 8 Aug 2017

part of 5G is the capability to transmit touch perception type


real-time communication empowered by applicable robotics and 2G 3G 4G
haptics equipment at the network edge. In this regard, we GSM/GPRS UMTS LTE 5G
need drastic changes in network architecture including core and EDGE/CDMA CDMA2000 LTE-A
radio access network (RAN) for achieving end-to-end latency Fig. 1: Latency for different generations of cellular networks
on the order of 1 ms. In this paper, we present a detailed
survey on the emerging technologies to achieve low latency [2].
communications considering three different solution domains:
RAN, core network, and caching. We also present a general one thousand times throughput improvement, 100 billion con-
overview of 5G cellular networks composed of software defined nections, and close to zero latency [1], [3]. In particular, 5G
network (SDN), network function virtualization (NFV), caching,
and mobile edge computing (MEC) capable of meeting latency
will support enhanced mobile broadband (MBB) with end-user
and other 5G requirements. data rates of 100 Mbps in the uniform spatial distribution with
Index Terms—5G, cloud, caching, haptic communications, peak data rates of 10-20 Gbps. Based on consensus, 5G will
latency, massive connectivity, real-time communication, SDN, not only provide personal mobile service, but also massive
tactile Internet, ultra-high reliability, ultra-low latency. machine type communications (MTC), and latency/reliability
critical services. In mission critical communication (MCC),
I. I NTRODUCTION both the latency and reliability issues need to be addressed
The focus of next generation mobile communication is to [6]. In many cases, the corresponding E2E latency as low as
provide seamless communication for machines and devices 1 ms needs to be met with reliability as high as 99.99% [7].
building the Internet-of-Things (IoT) along with personal com- The evolution of latency requirement for different generations
munication. New applications such as tactile Internet, high- of cellular networks is presented in Fig. 1.
resolution video streaming, tele-medicine, tele-surgery, smart To achieve low latency for MCC, drastic changes in the
transportation, and real-time control dictate new specifications network architecture need to be performed. Since the delay
for throughput, reliability, end-to-end (E2E) latency, and net- is contributed by radio access network (RAN) and core net-
work robustness [1]. Additionally, intermittent or always-on work along with backhaul between RAN and core network,
type connectivity is required for machine-type communication new network topology involving software define network
(MTC) serving diverse applications including sensing and (SDN), network virtualized function (NFV), and mobile edge
monitoring, autonomous cars, smart homes, moving robots and computing (MEC)/caching can be employed to reduce the
manufacturing industries. latency significantly. This can happen due to the capability
Several emerging technologies including wearable devices, of seamless operation and independence from hardware func-
virtual/augmented reality, and full immersive experience (3D) tionality provided by these entities. Moreover, new physical
are shaping the demeanor of human end users, and they have air interface with small time interval transmission, small size
special requirements for user satisfaction. Therefore, these use packets, new waveforms, new modulation and coding schemes
cases of the next generation network push the specifications are the areas of investigation for attaining low latency. In
of 5G in multiple aspects such as data rate, latency, reliabil- addition, optimization of radio resource allocation, massive
ity, device/network energy efficiency, traffic volume density, MIMO, carrier aggregation in millimeter wave, and priority
mobility, and connection density. Current fourth generation of data transmission need to be addressed. All in all, a robust
(4G) networks are not capable of fulfilling all the technical integration with existing LTE is necessary for 5G networks
requirements for these services. that will enable industries to deploy 5G quickly and efficiently
Fifth generation (5G) cellular network is the wireless ac- when it is standardized and available. In summary, 5G wireless
cess solution to fulfill the wireless broadband communication access should be an evolution of LTE complemented with
specifications of 2020 and beyond [3], [4]. In ITU, 5G ITU- revolutionary architecture designs and radio technologies.
R working group is working for development of 5G under In the literature, surveys on 5G network including architec-
the term IMT-2020 [5]. The vision of this work is to achieve ture [3], [4], SDN/NFV/MEC based core network [9], [10],
Virtualized SDN Virtualized
server server

Internet
S-G/GGSN
MME/SGSN

Fog network/
Caching
Enhanced Broadband
Ultra reliable low
latency
communication

Small cell/ Small cell/ Data BS


Data BS Macro/Control BS

Data
Signaling UE UE

Fig. 2: A candidate architecture for 5G cellular network [8].

caching [11], [12], backhaul [13], resource management [14] an exemplary 5G architecture is presented with visions to meet
and data centric network [15], [16] are available. Apart from the requirements of low latency, high reliability, very large
that, surveys on latency reduction approaches in Internet [17], throughput, and massive connectivity. Section III presents the
cloud computing [18] and distributed network applications latency critical services in 5G. The sources of latency in a
[19], [20] are also presented; however, to the best of our cellular network are discussed in Section IV. Section V reviews
knowledge, a comprehensive survey on latency reduction the fundamental constraints and approaches for achieving
approaches in cellular networks towards 5G is not available low latency. Three key low latency solutions in RAN, core
yet. network, and caching are presented in Sections VI, VII, and
In this paper, we present a comprehensive survey of latency VIII, respectively. Section IX presents the field tests, trials and
reduction solutions particularly in the context of 5G wireless experiments of low latency approaches. Finally, concluding
technology. We first present the sources and fundamental remarks are provided in Section X.
constraints for achieving low latency in a cellular network.
We also overview an exemplary 5G network architecture II. A RCHITECTURE OF A N E XEMPLARY 5G N ETWORK
with compliance to IMT-2000 vision. Finally, we provide With an ambition of significant enhancements over 4G cel-
an extensive review of proposed solutions for achieving low lular networks, we need drastic changes in the architecture for
latency towards 5G. The goal of our study is to bring all meeting 5G visions such as 1000 times capacity enhancement,
existing solutions on the same page along with future research 100 billions connections and close to zero latency [3], [4].
directions. We divide the existing solutions into three parts: The requirements of 5G can be divided broadly into 3 groups:
(1) RAN solutions; (2) Core network solutions; (3) Caching enhanced Mobile Broadband (eMBB), massive Machine Type
solutions. However, detailed comparison of these solutions are Communication (mMTC), and ultra Reliable Low Latency
beyond the scope of this work. Communication (uRLLC). For eMBB or throughput enhance-
The rest of the paper is organized as follows. In Section II, ment, new modulation and coding schemes, massive multiple-
TABLE I: D EFAULT L ATENCY VALUES FOR P HYSICAL
Transportation
L AYER [7].
Parameter value (ms)
User equipment (UE) processing time 0.3
eNB processing time 0.3
Minimum time to transmit/transmission
time interval (TTI)
0.2 Virtual reality
Healthcare
Time required for uplink (UL) ACK/NACK
0.06 (short coverage)
transmission
0.25 (medium
coverage)
0.50 (high coverage)
1.00 (extreme Factory Gaming
coverage)
Downlink (DL) resources for bidirectional 0.80 (DL heavy
TDD blocks with 1 ms duration block)
0.40 (balanced block)
0.20 (UL heavy
block) Control and
DL resources for bidirectional TDD blocks robotics Augmented
2.8 (DL heavy block) reality
with 4 ms duration Smart grid
Fig. 3: Latency critical services in 5G.
input multiple-output (MIMO), advanced multiple access tech-
niques, small cell deployment, and millimeter wave (mmWave)
aggregation are proposed. On the other hand, non orthogonal
• Factory Automation: Factory automation includes real-
multiple access, flexible packet structure, overloaded data
time control of machine and system for quick production
transmission, and active user detection are being considered
lines and limited human involvement. In these cases, the
for mMTC. For low latency and reliability enhancement (i.e.
production lines might be numerous and contiguous. This
uRLLC), modification in the physical layer along upper layers
is highly challenging in terms of latency and reliability.
is required. An example candidate for 5G network is presented
Therefore, the latency requirement for factory automation
in Fig. 2 with proposed latency parameters tabulated in Table I.
applications is between 0.25 ms to 10 ms with a packet
Some enhancements including ultra-dense small cell de-
loss rate of 10−9 [22], [23].
ployments and new technologies such as massive MIMO,
• Intelligent Transportation Systems: Autonomous driv-
SDN, NFV, MEC/caching are the fundamental modifications
ing and optimization of road traffic requires ultra reli-
in the proposed 5G network [8]. The dynamic deployment
able low latency communication. According to intelligent
and scaling of functions can be achieved by SDN and NFV
transportation systems (ITS), different cases including
where access stratum (AS) and non access stratum (NAS) are
autonomous driving, road safety, and traffic efficiency ser-
integrated by intelligent protocol. Split of control plane (C-
vices have different requirements [22], [24]. Autonomous
plane) and data plane (D-plane) in the architecture and the
vehicles require coordination among themselves for ac-
carrier frequency could be performed in order to have separate
tions such as platooning and overtaking [25]. Road safety
provision of capacity and coverage [21]. Special network
includes warnings about collisions or dangerous situa-
functionality as a service (XaaS) which can be assumed as
tions. Traffic efficiency services control traffic flow using
the relation between a radio network and the cloud will grant
the information of the status of traffic lights and local
service on request/demand. Resource pooling can be an ex-
traffic situations. For these purposes, latency of 10 ms to
ample of this XaaS. Moreover, intelligent data driven network
100 ms with packet loss rate of 10−3 to 10−5 is required.
structures such as mobile edge computing cloud/fog network
• Robotics and Telepresence: In the near future, remote
will reduce latency and enhance the network throughput. The
controlled robots will have applications in diverse sec-
high data rate and reduced latency in radio interface will also
tors such as construction and maintenance in dangerous
lead to long battery life and energy savings.
areas. A prerequisite for the utilization of robots and
In the following section, latency critical services in 5G
telepresence applications is remote-control with real-time
networks are discussed in details.
synchronous visual-haptic feedback. In this case, system
response times should be less than a few milliseconds
III. L OW L ATENCY S ERVICES IN 5G
including network delays [22], [26], [27]. Communication
Latency is highly critical in some applications such as infrastructure capable of proving this level of real-time
automated industrial production, control/robotics, transporta- capacity, high reliability/availability, and mobility support
tion, health-care, entertainment, virtual realty, education, and is to be addressed in 5G networks.
culture (as illustrated in Fig. 3). In some cases, we need latency • Virtual Reality (VR): Several applications such as
as low as 1 ms with packet loss rate no larger than 10−2 . micro-assembly and tele-surgery require very high levels
Several latency critical services which need to be supported of sensitivity and precision for object manipulations. VR
by 5G are described as follows. technology accommodates such services where several
users interact via physically coupled VR simulations in TABLE II: L ATENCY R EQUIREMENTS FOR D IFFERENT
a shared haptic environment. Current networked commu- M ISSION C RITICAL S ERVICES .
nication does not allow sufficient low latency for stable, Use case Latency (ms)
seamless coordination of users. Typical update rates of Factory Automation 0.25-10
Intelligent Transport Systems 10-100
display for haptic information and physical simulation are Robotics and Telepresence ≤1
in the order of 1000 Hz which allows round trip latency Virtual Reality (VR) ≤1
of 1 ms. Consistent local view of VR can be maintained Health care 1-10
for all users if and only if the latency of around 1 ms is Serious Gaming 1
Smart Grid 1-100
achieved [26], [27]. Education and Culture 5-10
• Augmented Reality (AR): In AR technology, the aug-
mentation of information into the user’s field of view en-
ables applications such as driver-assistance systems, im- of 1 ms to 100 ms. The latency requirements for various 5G
proved maintenance, city/museum guides, telemedicine, services are summarized in Table II.
remote education, and assistive technologies for police In the next section, the major sources of latency in a cellular
and firefighters [26]. However, insufficient computational network are discussed.
capability of mobile devices and latency of current cellu-
IV. S OURCES OF L ATENCY IN A C ELLULAR N ETWORK
lar network hinder the applications. In this case, latency
as low as a few milliseconds is required. In the LTE system, the latency can be divided into two
• Health care: Tele-diagnosis, tele-surgery and tele- major parts: (1) user plane (U-plane) latency and (2) control
rehabilitation are a few notable healthcare applications plane (C-plane) latency. The U-plane latency is measured by
of low latency tactile Internet. These allow for remote one directional transmit time of a packet to become available
physical examination even by palpation, remote surgery in the IP layer between evolved UMTS terrestrial radio access
by robots, and checking of patients’ status remotely. network (E-UTRAN) edge/UE and UE/E-UTRAN node [28].
For these purposes, sophisticated control approaches with On the other hand, C-plane latency can be defined as the
round trip latency of 1-10 ms and high reliability data transition time of a UE to switch from idle state to active
transmission is mandatory [26], [27]. state. At the idle state, an UE is not connected with radio
• Serious Gaming: The purpose of serious gaming is not resource control (RRC). After the RRC connection is being
limited to entertainment. Such games include problem- setup, the UE switches from idle state into connected state
solving challenges, and goal-oriented motivation which and then enters into active state after moving into dedicated
can have applications in different areas such as education, mode. Since the application performance is dependent mainly
training, simulation, and health. Network latency of more on the U-plane latency, U-plane is the main focus of interest
than 30-50 ms results in a significant degrade in game- for low latency communication.
quality and game experience ratings. Ideally, a round In the U-plane, the delay of a packet transmission in a
trip time (RTT) on the order of 1 ms is recommended cellular network can be contributed by the RAN, backhaul,
for perceivable human’s interaction with the high-quality core network, and data center/Internet. As referred in Fig. 5,
visualization [26]. the total one way transmission time [29] of current LTE system
• Smart Grid: The smart grid has strict requirements of can be written as
reliability and latency. The dynamic control allows only
100 ms of end-to-end latency for switching suppliers (PV, T = TRadio + TBackhaul + TCore + TTransport (1)
windmill, etc.) on or off. However, in case of a syn- where
chronous co-phasing of power suppliers (i.e. generators), • TRadio is the packet transmission time between eNB and
an end-to-end delay of not more than 1 ms is needed [3], UEs and is mainly due to physical layer communication;
[26]. Latency more that 1 ms which is equivalent to a • TBackhaul is the time for building connections between
phase shift of about 18◦ (50 Hertz AC network) or 21.6◦ eNB and the core network (i.e. EPC). Generally, the
(60 Hertz AC network), may have serious consequence core network and eNB are connected by copper wires
in smart grid and devices. or microwave or optical fibers;
• Education and Culture: Low latency tactile Internet • TCore is the processing time taken by the core network;
will facilitate remote learning/education by haptic overlay • TTransport is the delay to data communication between the
of teacher and students. For these identical multi-modal core network and Internet/cloud.
human-machine interfaces, round trip latency of 5-10 ms The end-to-end delay, TE2E is then approximately given by
is allowed for perceivable visual, auditory, and haptic 2 × T . The TRadio is the sum of transmit time, propagation
interaction [26], [27]. Besides that, tactile Internet will latency, processing time (channel estimation, encoding and
allow to play musical instruments from remote locations. decoding time for first time), and retransmission time (due
In such scenarios, supporting network latency lower than to packet loss). In particular, the TRadio for a scheduled user
few milliseconds becomes crucial [26]. [30], [31] can be expressed as:
Based on the applications and use case scenarios above,
latency critical services in 5G networks demand an E2E delay TRadio = tQ + tFA + ttx + tbsp + tmpt (2)
where V. C ONSTRAINTS AND A PPROACHES FOR ACHIEVING
• tQ is the queuing delay which depends on the number of L OW L ATENCY
users that will be multiplexed on same resources; There are major fundamental trade-offs between capacity,
• tFA is the delay due to frame alignment which depends on coverage, latency, reliability, and spectral efficiency in a wire-
the frame structure and duplexing modes (i.e., frequency less network. Due to these fundamental limits, if one metric is
division duplexing (FDD) and time division duplexing optimized for improvement, this may results in degradation of
(TDD)); another metric. In the LTE system, the radio frame is 10 ms
• ttx is the time for transmission processing, and payload with the smallest TTI being 1 ms. This fixed frame structure
transmission which uses at least one TTI depending on depends on the modulation and coding schemes for adaptation
radio channel condition, payload size, available resources, of the transmission rate with constant control overhead. Since
transmission errors and retransmission; latency is associated with control overhead (cyclic prefix,
• tbsp is the processing delay at the base station; transmission mode, and pilot symbols) which occupies a major
• tmpt is the processing delay of user terminal. Both portion of transmission time of a packet (approximately 0.3-
the base station and user terminal delay depend on the 0.4 ms per packet transmission), it is not wise to consider
capabilities of base station and user terminal (i.e., UE), a packet with radio transmission time less than 1 ms. If
respectively. we design a packet with time to transmit of 0.5 ms, more
In compliance with ITU, TRadio should not be more than than 60% of the resources will be used by control overhead
0.5 ms for low latency communication [32]. In this regard, [29]. Moreover, retransmission per packet transmission takes
radio transmission time should be designed to be on the order around 8 ms, and removal of retransmission will affect packet
of hundreds of microseconds while the current configuration error significantly. As a result, we need radical modifications
in 4G is 1 ms. For this, enhancement in various areas of and enhancements in packet/frame structure and transmission
RAN such as packet/frame structure, modulation and coding strategy. In this regard:
schemes, new waveform designs, transmission techniques, and • First, a novel radio frame reinforced by limited control
symbol detection need to be carried out. In order to reduce overhead and smaller transmission time is necessary to be
the delay in TBackhaul , approaches such as advanced backhaul designed. For reduction of control overhead, procedures
techniques, caching/fog enabled networks, and intelligent in- for user scheduling, resource allocation, and channel
tegration of AS and NAS can provide potential solutions. For training can be eliminated or merged.
TCore , new core network consists of SDN, NFV, and various • Second, packet error probability for first transmission
intelligent approaches can reduce the delay significantly. For should be reduced with new waveforms and transmission
TTransport , MEC/fog enabled Internet/cloud/caching can provide techniques reducing the retransmission delay.
reduced latency. • Third, since latency critical data needs to be dispatched
In the following section, we discuss the constraints and immediately, techniques for priority of data over normal
approaches for achieving low latency. data need to be identified.

5G low latency
communication

RAN Core network Caching

Frame/Packet
Symbol detection SDN Caching placement
structure
Waveform/
mmWave NFV Content delivery
Multiple Access
Modulation and Location aware
coding communication MEC Centralized caching

Reinforcing
Transmission Fog network Distributed caching
QoS &QoE

Control channel Other aspects General


backhauling
mmWave
backhauling

Fig. 4: Categories of different solutions for achieving low latency in 5G.


• Fourth, synchronization and orthogonality are the indis- most of the proposed solutions are on the physical (PHY) and
pensable aspects of OFDM that are major barriers for medium access control (MAC) layers.
achieving low latency. Even though asynchronous mode In LTE cellular network, the duration of a radio frame is
of communication is more favorable over synchronized 10 ms. Each frame is partitioned into 10 subframes of size
operation in terms of latency, it requires additional spec- 1 ms which is further divided into 0.5 ms units that are
trum and power resources [33]. referred as a resource block (RB). Each RB spans 0.5 ms
• Fifth, since the latency for data transmission also depends (6 or 7 OFDM symbols) in time domain and 180 KHz (12
on the delay between the core network and the BS, consecutive subcarriers, each of which 15 KHz) in frequency
caching networks can be used to reduce latency by storing domain. Based on this, the subcarrier spacing ∆f is 15 KHz,
1
the popular data at the network edge. the OFDM symbol duration TOFDM is ∆f = 66.67µs, the FFT
Researchers proposed various techniques/approaches for size is 2048, the sampling rate fs is ∆f ×NFFT = 33.72 MHz
achieving low latency in 5G. As summarized in Fig. 4, we and the sampling interval Ts is 1/fs .
divided the existing solutions into three major categories: To reduce TTI for achieving low latency, the subcarrier
(1) RAN solutions, (2) core network solutions, and (3) spacing ∆f can be changed to 30 KHz [37]. This results the
caching solutions. The RAN solutions include new/modified corresponding OFDM symbol duration TOFDM to be 33.33 µs
frame or packet structure, waveform designs, multiple ac- and the FFT size NFFT to become 1024 while sampling rate
cess techniques, modulation and coding schemes, transmission fs is kept 30.72 MHz similar to LTE systems. The frame
schemes, control channels enhancements, low latency symbol duration Ts =10 ms can be divided into 40 subframes in which
detection, mmWave aggregation, cloud RAN, reinforcing QoS each subframe duration Tsf is 0.25 ms and contains 6 or 7
and QoE, energy-aware latency minimization, and location symbols. Two types of cyclix prefixs (CPs) can be employed
aware communication techniques. On the other hand, new in this configuration with durations
entities such as SDN, NFV, MEC and fog network along
with new backhaul based solutions have been proposed for the Tcp1 = 5/64 × NIFFT × Ts ≈ 2.604 µs, (3)
core network. The solutions of caching can be subdivided into
caching placement, content delivery, centralized caching, and
Tcp2 = 4/64 × NIFFT × Ts ≈ 2.083 µs. (4)
distributed caching, while backhaul solutions can be divided
into general and mmWave backhaul. In the following sections, An exemplary physical air frame and conventional LTE radio
these solutions are described in further details. frame with equal sized RB are illustrated in Fig. 6(a) and
Fig. 6(b), respectively.
VI. RAN S OLUTIONS FOR L OW L ATENCY In [34], an extensive analysis of the theoretical principles
To achieve low latency, various enhancements in the RAN that regulates the transmission of small-scale packets with
have been proposed. Referring to Table III, RAN solu- low latency and high reliability is presented with metrics
tions/enhancements include frame/packet structure, advanced to assess their performance. The authors emphasize control
multiple access techniques/waveform designs, modulation and overhead optimization for short packet transmission. In [35],
coding scheme, diversity and antenna gain, control channel, small-scale data such as 10 − 20 bytes are proposed for
symbol detection, energy-aware latency minimization, carrier transmission in order to achieve latency as low as 0.5 − 1 ms
aggregation in mmWave, reinforcing QoS and QoE, cloud and reliability (probability of failed packet delivery) of not
RAN and location aware communication. In what follows, the higher than 10−9 . In this regard, various forms of diversity
detailed overview for each of these solutions is presented. could be employed such as spatial diversity, time diversity, and
frequency diversity. The spatial, time, and frequency diversity
A. Frame/packet structure is accomplished by means of several transmit and receive
antennas, time slots of independent coefficients, and multiple
In the RAN solutions, modification in the physical air inter- RBs of independent fading coefficients, respectively.
face has been considered as an attractive choice. In particular, In [45], a flexible 5G radio frame structure is introduced
in which the TTI size is configurable in accordance with the
requirement of specific services. At low offered load, 0.25 ms
Evolved packet
TTI is an attractive choice for achieving low latency due to
core (EPC)
low control overhead. However, for more load, control over-
head increases which affects reliability and packet recovery
eNB Virtualized SDN mechanism resulting in increased latency. This study argues to
User server Internet/ employ user scheduling with different TTI sizes in the future
Cloud 5G networks. In [44], the numerology and subframe structure
SGSN/MME
are defined considering diverse carrier frequencies and band-
TRadio TBackhaul Tcore TTransport widths for low latency 5G networks. Cyclic prefix, FFT size,
subcarrier spacing, and sampling frequency were expressed as
Fig. 5: Latency contribution in E2E delay of a packet a function of the carrier frequency. In [41], software defined
transmission. radio (SDR) platform based 5G system implementation with
TABLE III: OVERVIEW OF T ECHNIQUES IN RAN FOR L OW L ATENCY.
Case/Area Reference Approach Summary
[34], [35], [36] Transmission of small packets Transmission of small scale data is investigated for packet loss rate of 10−9 and
latency as low as 1 ms.
[37] Subcarrier spacing Subcarrier spacing is enlarged to shorten the OFDM symbol duration, and the
number of OFDM symbols is proposed to keep unchanged in each subframe.
[38], [7], [39], Flexible OFDMA based TDD TDD numerology is optimized for dense deployment with smaller cell sizes and
[40]–[42] subframe larger bandwidth in the higher carrier frequencies.
[43] Modification of physical sub- Different control and data part patterns for consecutive subframes, TX and RX
frame control parts are proposed to be separated from each other, and from the data
symbols with a GP, leading to total number of 3 GPs per subframe.
Frame/Packet [36] Short frame design numerology The main focus in designing physical parameter is FFT, frame duration and physical
structure for local area channel for local area based communication.
[44], [45] Numerology and flexible sub Numerology and subframe structure are defined considering diverse carrier fre-
frame quencies and bandwidths to envision 5G including low latency. Cyclic prefix, FFT
size, subcarrier spacing, and sampling frequency were expressed as the function of
carrier frequency.
[46] mmWave MAC layer frame A new mmWave MAC layer frame structure is proposed with several improvements
structure such as adaptable and smaller transmission times, extended messaging, dynamic
locations for control signals, and the capacity to multiplex directional control signals
(dynamic HARQ placement) efficiently.
Advanced [47], [48], [49] Filtered CP-OFDM, UFMC and UFMC outperforms over OFDM by about 10% in case of both large and small
multiple ac- FBMC packets. FBMC demonstrates better performance in case of transmitting long
cess/Waveform sequences; however, it suffers during the transmission of short bursts/frames.
[50], [51] Polar coding Based on simulation and field test, polar coding has been proposed for 5G,
outperforming over turbo coding in case of small packet transmission.
[52] Turbo decoding with combined A highly-parallel architecture for the latency sensitive turbo decoding is proposed
sliding window algorithm and combining two parallel algorithms: the traditional sliding window algorithm and
cross parallel window (CPW) al- cross parallel window (CPW) algorithm.
gorithm
[53] New IFFT design with butterfly Input signal of IFFT processor corresponding to guard band are assigned as null
operation revealing the existence of numerous zeros (i.e., 0). If the sequence of OFDM symbol
data which enter the IFFT is adjusted, the memory depth can be reduced from 1024
to 176.
Modulation [54] Sparse code multiple access A dynamic shrunk square searching (DSSS) algorithm is proposed, which cuts off
and coding (SCMA) unnecessary communication control port (CCP) calculation along with utilization
of both the noise characteristic and state space structure.
[55] Priority to latency critical data A latency reduction approach by introducing TDM of higher priority ultra-low
latency data over other less time critical services is proposed which maps higher
priority user data during the beginning of a subframe followed by the normal data.
[56] Balanced truncation Balanced truncation is applied for the model reduction in the linear systems that
are being coupled over arbitrary graphs under communication latency constraints.
[57] Finite block length bounds and Recent advances in finite-block length information theory are utilized in order to
coding demonstrate optimal design for wireless systems under strict constraints such as
low latency and high reliability.
[58] Asymmetric window Asymmetric window is proposed instead of well-known symmetric windows for
reduction of cyclic prefix by 30%. This technique suppresses OOB emission but
makes the system more susceptible to channel induced ISI and ICI.
[59] Transmission power optimiza- Transmission power is optimized by steepest descent algorithm considering trans-
tion mission delay, error probability and queuing delay.
Transmission [60] Path-switching method and a Low latency packet transport system with a quick path-switching method and a
packet-recovery method packet-recovery method are introduced for a multi-radio-access technology (multi-
RAT) environment.
[35] Diversity Diversity could be employed through various approaches such as spatial diversity,
time diversity, and frequency diversity.
[61] Control channel sparse encoding CCSE is introduced in order to provide the control information using non-
(CCSE) orthogonal spreading sequences.
[62] Scaled control channel design A scaled-LTE frame structure is proposed assuming the scaling factor to be 5 with a
dedicated UL CCHs for all sporadic-traffic users in each transmission time interval
with possible smallest SR size.
[63] Symbol-level frequency hopping A sequence-based sPUCCH (SS-PUCCH) incorporating two SC-FDMA symbols
and sequence-based sPUCCH is introduced in order to meet a strict latency requirement. Symbol-level frequency
hopping technique is employed to achieve frequency diversity gain and reliability
enhancement.
Control chan- [64] Radio bearer and S1 bearer man- Establishment of radio bearer and S1 bearer in parallel are proposed where eNB
nel agement and mobility management element (MME) manages and controls radio bearer and
S1 bearer, respectively. The eNB sends only single control signal in order to
configure radio bearers such as SRB1, SRB2 and DRBs, that decreases the signaling
interaction rounds between the UE and the eNBs.
[65] Outer-loop link adaptation The proposed scheme controls the size of the compensation in the estimated SINR
(OLLA) scheme based on the time elapsed after a UE transits from an idle state to an active state,
which helps to reduce latency for small packet applications.
TABLE III: OVERVIEW OF T ECHNIQUES IN RAN FOR L OW L ATENCY ( CONTINUED ).
Case/Area Reference Approach Summary
[66] SM-MIMO detection scheme A low-complexity and low latency massive SM-MIMO detection scheme is intro-
with ZF and MRC-ZF duced and validated using SDR platforms. The low complexity detection scheme
is proposed with a combination of ZF and MRC-ZF.
[67] Linear MMSE A linear MMSE receiver is presented for low latency wireless communications
using ultra-small packets.
Symbol [68] Space-time encoding and widely Space-time encoding is introduced within a GFDM block for maintaining overall
detection linear estimator low latency in the system. On the other hand, a widely linear estimator is used to
decode the GFDM block at the receiver end, which yields significant improvements
in gain over earlier works.
[69]–[71] Compressed sensing Compressed sensing has been proposed to be effective in reducing latency of
m = Fraction of 1 ms networked control systems if the state vector can be assumed to be sparse in some
representation.
n[72]
= Multiplier of Low
180 complexity
kHz receiver design A low complexity receiver is designed and using this, the performance of an SCMA
system is verified via simulations and real-time prototyping. This approach triples
LLC
the whole system throughput while maintaining low latency similar to flexible
orthogonal transmissions.
mmWave ag- [46], [73]–[75] mmWave based air interface Physical layer air interface is proposed using GB
mmWave aggregation. Large band-
gregation GB
width along with various approaches such LLC as small frame structure, mmWave
LLC backhaul and beamforming canGB help to achieve low latency.
Cloud RAN [76] LLC cloud RAN (CRAN) A CRAN over passive optical
Virtualized LLCnetwork (PON) architecture is introduced called
GB LCC
virtualized-CRAN (V-CRAN) which can dynamically associate any radio unit (RU)
to any digital unit (DU) so that several RUs can be coordinated by the same DU,
LCC
Time

m × 1 ms

and the concept of virtualizedLLC


BS (V-BS) that can jointly transmit common signals
LLC from multiple RUs to a user. LLC
Location [36], [40], [77] Location information Issues and research challenges of 5G are discussed followed by the conclusion
n × 180 kHz that 5G networks can exploit the location information and accomplish performance
gains in terms of throughput and latency.
Frequency

GB GB GB GB GB GB GB
Time

1 ms

180 kHz
Frequency
(a)

m = Fraction of 1 ms
n = Multiplier of 180 kHz
LLC

GB
GB LLC
LLC GB
LLC LLC
GB LCC
LCC
Time

m × 1 ms

LLC
LLC LLC
n × 180 kHz
Frequency
(b)

Fig. 6: Physical air interface (a) Conventional LTE radio frame, (b) Exemplary 5G radio frame with flexible time and
frequency division for low latency [78] (GB: guard band; LLC: low latency communication).

GB GB GB GB GB GB GB
Time

1 ms
strict latency requirement is presented. The scalability of the system by using flexible TDD, and coexistence with overlay
proposed radio frame structure is validated with E2E latency LTE-A network, sleeping modes, interference statistics, con-
less than 1 ms. tention based data channel, and channel quality indicator. Here,
In [36], a 5G flexible TDD is proposed for local area the channel quality and traffic statistics are accumulated from
radio interface (5GETLA) with FFT size of 256 and 512, the small cells which help to boost throughput and to reduce
and short frame structure to achieve latency of lower than latency.
1 ms. In particular, RTT of 0.25 ms can be achieved for the A new mmWave MAC layer frame design with adaptable
packets smaller than 50 Kbits. In [37], the proposed subcarrier and smaller transmission times, extended messaging, dynamic
spacing is enlarged to shorten the OFDM symbol duration, locations for control signals, and the capacity to multiplex
and the number of OFDM symbols in each subframe is kept directional control signals (dynamic HARQ placement) effi-
unchanged in the new frame structure for TDD downlink. ciently is presented in [46]. The granular transmission time,
The subcarrier spacing is changed to 30 KHz resulting the and dynamic and directional control signals ensure low la-
corresponding OFDM symbol duration T = 33.33 µs. The fast tency. PHY and MAC layer solutions [71] including the com-
Fourier transform (FFT) size N is 1024, while the sampling pressed sensing based multi user detection, sparse code mul-
rate fs is kept same as 30.72 MHz. The frame duration Ts is tiple access and continuous phase modulation are developed
still 10 ms with 40 subframes. by the METIS project to address low latency and high node
In [43], in order to have fully flexible allocations of different density challenges. In [79], a scheme that reserves resources
control and data RB in the consecutive subframes, TX and RX for re-transmission for a group of ultra reliable low latency
control RBs are proposed to be separated from each other and communication UEs is presented. The optimum dimensioning
also from the data RB by guard periods (GPs). This leads to of groups and block error rate (BLER) target can reduce
total number of 3 GPs per subframe which separates them. the probability of contention for the shared retransmission
Assuming symmetrical TX and RX control parts with Nctrl s resources. Moreover, the unused resources can be utilized for
symbols in each and defining that same subcarrier spacing non-grouped UEs resulting in overall efficiency enhancement.
is used for control and data planes, with Ndata s being the In [80], fundamental trade-offs among three KPIs (reliabil-
number of data symbols and Tsymbol being the length of an ity, latency, and throughput) in a 4G network is characterized,
OFDM symbol, the subframe length Tsf can be determined as and an analytical framework is derived. In cases where the
Tsf = (2Nctrl s + Ndata s (Tsymbol + TCP )) + 2TGP . (5) theory can not be extended via mathematical formulations
due to complexity of scenario in hand, some guidelines are
In [38], the fundamental limits and enablers for low air
interface latency are discussed with a proposed flexible OFDM
based TDD physical subframe structure optimized for 5G local TABLE IV: PHY AND MAC L AYER BASED R ADIO
area (LA) environment. Furthermore, dense deployment with I NTERFACE S OLUTIONS FOR L OW L ATENCY.
smaller cell sizes and larger bandwidth in the higher carrier References Approach/Area PHY layer MAC layer
frequencies are argued as notable enablers for air interface [45] Short TTI " "
latency reduction. In [7], a new configurable 5G TDD frame [34], [44] Numerology and sub "
design is presented, which allows flexible scheduling (resource frame structure
allocation) for wide area scenarios. The radical trade-offs [37] Subcarrier "
[36], [38] Flexible subframe "
" "
between capacity, coverage, and latency are discussed further
with the goal of deriving a 5G air interface solution capable [41] Flexible subframe
implementation with
of providing low latency, high reliability, massive connectivity, SDR platform
and enhanced throughput. Since achieving low latency comes [43] Allocation of control " "
at cost of lower spectral efficiency, the proposed solution of and data RB
the study includes control mechanisms for user requirement, [40] Radio frame using lo- " "
cal statistics
i.e. whether the link should be optimized for low latency or
high throughput.
[7] Radio frame and " "
scheduling
A 5G flexible frame structure in order to facilitate users with [39] Flexible TTI and mul- " "
highly diversified service requirements is proposed in [39]. tiplexing
Although, in-resource physical layer control signaling is the [46] mmWave based MAC "
layer
basis of this proposed radio frame, it allows the corresponding
data transmission based on individual user requirements. For [79] Reservation of "
resources
this, it incorporates adaptable multiplexing of users on a [71] Radio frame with " "
shared channel with dynamic adjustment of the TTI in accor- compressed-sensing-
dance with the service requirements per link. This facilitates based multi-user
detection
optimization of the fundamental trade-offs between latency,
spectral efficiency, and reliability for each link and service
[80] Calculating the " "
fundamental trade-
flow. In [40], a new radio interface design along with the offs among three
corresponding numerology is presented for future local area KPIs
provided to make the problem tractable. In order to improve
the aforementioned trade-offs between these KPIs in future 5G OFDM SCFDM
systems different candidate techniques are proposed.

N subcarrier

N frequency
The above approaches of frame/packet structure to achieve

samples
low latency at the RAN level are tabulated in Table IV.

B. Advanced Multiple Access Techniques/Waveform


Different kinds of candidate multiple access (MA) tech- N sub-symbols
niques and waveforms including orthogonal, non orthogonal K time samples
and asynchronous access have been proposed for low latency

Frequency
communication [33], [47]–[49]. Since synchronization and GFDM
orthogonality (integral to OFDM) is a hindrance for achieving M frequency

N subcarrier
low latency, asynchronized non orthogonal multiple access samples
techniques have been discussed in [33]. Reduction of symbol Time N=KM
duration to 67 µs is not a promising solution in critical time
budgeting. In this regard, interleave division multiple access
(IDMA) has been introduced in [81], [82] for generating K time samples
signal layers. The IDMA is a variant of the CDMA technique M sub-symbols
which uses specific interleaving for user segregation in lieu of
using a spread sequence to the individual user. Here, channel Fig. 7: Slot placment in GFDM, OFDM and SCFDM.
coding, forward error correction coding and spreading are
combined into a single block by a low rate encoder. The
spreading can not be considered as a distinct and special performance in case of transmitting long sequences; however,
task. Interleaving usually utilizes a simpler iterative multiuser it suffers during the transmission of short bursts/frames. For
identification approach. However, this approach needs further usage of cyclic prefix, wide frequency guards and more
rigorous investigation. required coordination, OFDM may be inefficient in case of low
In order to supplement synchronization and orthogonality, latency communication [84]. Universal filtered multi-carrier
sparse code multiple access (SCMA) and non orthogonal (UFMC) [84], [86] is upgraded version of FBMC which
multiple access (NOMA) have been presented in [83] for offsets the disadvantage of FBMC. It outperforms OFDM by
5G scenarios. In SCMA, symbol mapping and spreading are about 10% in cases of time frequency efficiency, inter carrier
combined together, and the mapping of multi dimensional interference (ICI) and transmissions of long or short packets.
codeword over incoming bits is performed directly from Additionally, UFMC preforms better than FBMC in the case
SCMA codebook. SCMA is comparatively simpler and has of very short packets while demonstrating similar performance
superior performance over low density version of CDMA. for long sequences. These make UFMC as the one of the best
Another modulation technique that aims to reduce latency is choices for next generation low latency communication.
referred as the generalized frequency division multiplexing In case of UFMC, the time domain transmit vector [86] for
(GFDM) is introduced in [68], [84]. The flexibility of covering a user is superposition of sub-band wise filtered components.
both the cyclic prefix OFDM (CP-OFDM) and single carrier The time domain transmit vector for a particular multi-carrier
frequency domain equalization (SC-FDE), and block structure symbol of user k with filter length L and FFT length N is
of GFDM help to achieve low latency. A typical mapping
structure of GFDM, OFDM and SC-FDM is illustrated in B
X
Fig. 7. The overall comparison among IDMA, SCMA and Xk = Fik Vik Sik , (6)
(N +L−1)×1 N ×ni ni ×1
GFDM is presented in Table V. i=1 (N +L−1)×N

Filter bank multi carrier (FBMC) has been a strong candi-


date waveform for 5G [3], [85]. FBMC demonstrates better where

TABLE V: P ROPOSED M ULTIPLE ACCESS T ECHNIQUES FOR 5G.


Cases IDMA [81], [82] SCMA [83] GFDM [68], [84]
• Specific interleaving • Multiple dimensional code • Block frame consists of time
Fundamental • User segregation word slots and subcarriers
concept/features • Iterative multiuser identification • QAM spreading combination • Non-orthogonal
• FFT/IFFT implementation

Low complexity " "


"
Flexibility (in case of cov-
ering CP-OFDM and SC-
FDE)
Low latency " "
TABLE VI: WAVEFORM C ONTENDER FOR 5G.
Cases OFDM FBMC [3], [85] UFMC [84], [86]
Generalized filtering to all subcar- Filtering to each subcarriers Generalized filtering to a group of
Filtering riers of entire band consecutive subcarriers
Requirement of coordination Higher Lower Lower
Time-frequency efficiency
(due to CP and guard band) 0.84 1 1
ICI (in case of lower degree of
synchronization with UEs and Higher Lower Lower
eNBs)
Performance Performs well for large packets Performs well for large packets Performs well for short packets
with well coordination with less coordination with less coordination

• S is the complex QAM symbol vector; terms of small packets while for medium and large packets,
• V is the transformed time domain vector by IDFT matrix; the opposite is true. While small packet is a requirement
In this case, the relevant columns of the inverse Fourier for low latency, other aspects such as implementation com-
matrix are incorporated in accordance with the respective plexity, performance in practical test, and flexibility need to
subband position within entire available band; be investigated. In [51], polar code has been tested in field
• i is the index of each subband of B; for 5G considering various scenarios: air interface, frame
• F is a Toeplitz matrix. It is comprised of filter impulse structure, settings for large and small packets, OFDM, and
response, and performs the linear convolution. filtered OFDM (f-OFDM) waveforms. In all cases, polar code
The main advantage of UMFC is the ability of using performed better than turbo codes which makes it a candidate
different subcarrier spacings or filter times for users in dif- channel coding scheme for 5G. The comparison among the
ferent subbands. If a user uses FFT size N1 and filter length schemes are illustrated in Table VII.
L1 , and another user uses filter length and FFT size of N2
and L2 respectively, then UFMC symbol duration can be In [52], a highly-parallel architecture for the latency sen-
designed such that N1 + L1 − 1 = N2 + L2 − 1. This sitive turbo decoding is proposed by combining two parallel
makes UFMC a remarkable adaptive modulation scheme with algorithms: the traditional sliding window algorithm and cross
capability to be tailored easily under various characteristics parallel window (CPW) algorithm. New IFFT design with
of communications, including delay/Doppler spread variations butterfly operation is proposed in [53], which reduces IFFT
in the radio channel and user QoS needs. The comparative output data delay through the reduction of IFFT memory size
discussion among OFDM, FBMC and UMFC is presented in and butterfly operation (e.g. addition/subtraction). Input signal
Table VI. of the IFFT processor corresponding to guard band is assigned
as zero (i.e. ‘0’) revealing the existence of numerous zeros.
C. Modulation and Channel Coding
If the sequence of OFDM symbol data which enter the IFFT
Although use of small packets is a potential approach for is adjusted, the memory depth can be reduced from 1024 to
achieving low latency, appropriate modulation and coding is 176.
required for small packet transmission for acceptable reliabil-
ity. In the literature, mainly three types of coding schemes are
proposed for 5G. As presented in [50], low-density parity- A dynamic shrunk square searching (DSSS) algorithm is
check (LDPC) and polar codes outperform turbo codes in proposed in [54], which cuts off unnecessary communication
control port (CCP) calculation by utilizing both the noise
characteristic and state space structure. In this way, it can
TABLE VII: C OMPARISON AMONG C HANNEL C ODING maintain close to optimal decoding performance in terms of
S CHEMES FOR L OW L ATENCY [50]. the block error rate (BLER). This results in reduction of
Cases Turbo LDPC- Convolutional Polar delay in communication. In [55], a latency reduction approach
coding PEG coding [50] codes by introducing time division multiplexing (TDM) of higher
[52] [50] [51] priority ultra-low latency data over other less time critical
Algorithm com- 100% 98% 66.7% 1.5%
plexity for cod- services is proposed, which maps higher priority user data
ing 1/3 of 40 bits during the beginning of a subframe followed by the normal
with respect to data. In [56], balanced truncation is applied for the model
turbo codes
Algorithm com- 100% 98% 66.7% 110.7
reduction in the linear systems that are being coupled over
plexity for cod- % arbitrary graphs under communication latency constraints. In
ing 1/3 of 200 [57], recent advances in finite-block length information theory
bits with respect
to turbo codes
are utilized in order to demonstrate optimal design for wireless
Performance in " " systems under strict constraints such as low latency and high
short packets reliability. For a given set of constraints such as bandwidth,
Performance in " " latency, and reliability the bounds for the number of the bits
medium packets that can be transmitted for an OFDM system is derived.
TABLE VIII: OVERVIEW OF S OLUTIONS IN T RANSMISSION TABLE IX: OVERVIEW OF S OLUTIONS IN C ONTROL
SIDE FOR L OW L ATENCY. C HANNEL FOR L OW L ATENCY.
Focus Reference Techniques Focus References Techniques
[58] Asymmetric window [61] Control channel sparse encoding
[59] Transmission power optimization (CCSE)
[60] Path-switching and packet-recovery method [62] Dedicated UL CCHs
Transmission [35] Diversity gain [92] Sub slotted data and control channel
[87] Beam forming using mmWave [63] SS-PUCCH consists of SC-FDMA
[88]– Full duplex communication in same channel symbol
[91] Control [64] Radio bearer and S1 bearer manage-
channel ment
[65] Outer-loop link adaptation
[93] Slotted TTI based radio resource man-
D. Transmission agement
[94] Adaptive radio link control (RLC)
A representative set of approaches for reducing latency [95] SDN based control plane optimization
using transmission side processing are tabulated in Table VIII,
which will be overviewed in the rest of this subsection.
In [58], an asymmetric window is proposed instead of well- targeting the control channel enhancements to achieve low
known symmetric windows for reduction of cyclic prefix by latency are illustrated in Table IX.
30%, and hence reducing latency due to reduced overhead. In [61], control channel sparse encoding (CCSE) is intro-
This technique suppresses out of bound (OOB) emission duced with vision to transmit the control information by means
but makes the system more susceptible to channel induced of non-orthogonal spreading sequences. A scaled-LTE frame
inter symbol interference (ISI) and inter carrier interference structure is proposed in [62] assuming the scaling factor to be
(ICI). Transmission power optimization by the steepest descent 5 with dedicated UL control channels (CCHs) for all sporadic-
algorithm considering transmission delay, error probability traffic users in each TTI with possible smallest scheduling re-
and queuing delay is proposed in [59]. In [60], low latency quest (SR) size. In [92], short TTI based uplink frame has been
packet transport system with a quick path-switching and a proposed for achieving E2E latency no longer than 1 ms. In
packet-recovery method is introduced for a multi-radio-access the proposed scheme, subslot consisting of 2 symbols has been
technology (multi-RAT) environment. In [35], use of diversity proposed for uplink data and control channel. A sequence-
gain is proposed as a solution for capacity enhancement based sPUCCH (SS-PUCCH) incorporating two single carrier-
and latency reduction. Diversity could be achieved through frequency division multiple access (SC-FDMA) symbols is
various approaches such as spatial diversity, time diversity, introduced in [63] in order to meet a strict latency requirement.
and frequency diversity. Symbol-level frequency hopping technique is employed to
In [87], a mmWave based switched architecture system achieve frequency diversity gain and reliability enhancement.
is proposed where control signals use low-resolution digital In the proposed procedure of [64], establishment of radio
beamforming (to enable multiplexing of small control packets) bearer and S1 bearer in parallel are proposed where eNB and
with analog beamforming in the data plane (to enable higher mobility management element (MME) manages and controls
order modulation). This reduces the overhead significantly due radio bearer and S1 bearer, respectively. The eNB sends
to the control signaling which results in more resources for only single control signal in order to configure radio bearers
data transmission. This technique leads to reduction of round such as SRB1, SRB2 and DRBs, that decreases the signaling
trip latency in the physical layer. interaction rounds between the UE and the eNBs. In [65], a
Recent advancements in full duplex (FD) communication new outer-loop link adaptation (OLLA) scheme is proposed.
comes forward with feature of doubling the capacity, im- The scheme controls the size of the compensation in the
proving the feedback, and latency mechanism meanwhile estimated SINR based on the time elapsed after a UE transits
upholding steady physical layer security [88]–[91]. Various from an idle state to an active state, which helps to reduce
proposed techniques of 5G networks such as massive MIMO latency for small packet applications. The study [93] proposed
and beamforming technology providing reduced spatial do- a slotted TTI based radio resource management for LTE-A and
main interference can be contributive for FD realization [89]. 5G in order to achieve low latency. The approach can serve
Besides that intelligent scheduling of throughput/delay critical low latency services utilizing short TTI and enhance download
packets along with proper rate adaption and power assignment control channel (ePDCCH).
can results in capacity gain and reduction of latency. However,
The study [94] proposed a novel mechanism that introduces
this field needs to be extensively investigated for studying
an adaptive radio link control (RLC) mode which dynamically
capacity and latency trade offs.
alternates between unacknowledgment mode (UM) and ac-
knowledgment mode (AM) according to the real-time analysis
E. Control Channel
of radio conditions. This technique reduces system latency
When the packet size is reduced as envisioned in 5G sys- and processing power, and improves throughput using UM.
tems, control overhead takes the major portion of the packet. On the other hand, it improves data reliability by activating
Addressing this, various approaches are proposed in order to AM during the degraded radio conditions. In [95], SDN based
reduce the control channel overhead. The potential solutions control plane optimizing strategy is presented to balance the
Received bits at WRC-15
UE1 scope WRC-19 scope

Symbol Demodulation 5G
estimation below 5G cmWave 5GmmWave
5G mm
Bits for UE1
6 GHz Wave

Modulation Channel
estimation 1 GHz 6 GHz 24 GHz 100 GHz
Precoding BS 5G
LTE
Symbol Demodulation LTE-A Backhauling MIMO
Modulation estimation LAA
Received bits at
Bits for UE2 Propagation
UE2 Fig. 9: mmWave spectrum to be used for 5G
channel Channel
estimation
communications.
eNB Transmitter
UE receiver

Fig. 8: Transmission and symbol detection in cellular improvements in terms of symbol error rate and latency over
network. earlier works.
In [69]–[71], compressed sensing is proposed for latency
reduction in networked control systems if the state vector
latency requirement of vehicular ad hoc network (VANET), can be modeled as sparse in some representation domain. In
and the cost on radio networks. The interaction between [72], a low complexity receiver design is proposed and the
vehicles and controller is formulated and analyzed as a two- superiority of an SCMA system is verified via simulations. In
stage Stackelberg game followed by optimal rebating strategy, addition, it is demonstrated with a real-time prototype that the
which provides reduced latency compared to other control whole system throughput triples while maintaining low latency
plane structures. similar to flexible orthogonal transmissions.

F. Symbol Detection G. mmWave Communications

As illustrated in Fig. 8, symbol detection encompasses Carrier aggregation using the mmWave spectrum (as il-
various processes such as channel estimation and decoding, lustrated in Fig. 9) is widely considered to be a promising
which can all contribute into the overall latency. The related candidate technology for 5G, capable of providing massive
literature in the symbol detection side for latency reduction bandwidth and ultra low latency. The mmWave technology is
are tabulated in Table X. especially critical for VR/AR type of applications which re-
quire high throughput and low latency. The works in mmWave
In [66], a low-complexity and low-latency massive SM-
spectrum for achieving low latency is summarized in Table XI.
MIMO detection scheme is introduced and validated us-
ing SDR platforms. The low complexity detection scheme In [46], a new frame design for mmWave MAC layer is
is proposed with a combination of zero forcing (ZF) and introduced which provides several improvements including
maximum-ratio-combining-zero-forcing (MRC-ZF). In [67], adaptable and smaller transmission intervals, dynamic loca-
a linear minimum mean square error (MMSE) receiver is tions for control signals, and the capability of directional
presented for low latency wireless communications using ultra- multiplexing for control signals (dynamic HARQ placement).
small packets. The estimation of receiver filter using the It addresses ultra low latency along with the multiple users,
received samples is proposed during the data transmission short bursty traffic and beam forming architecture constraints.
period in lieu of interference training period. Additionally, The study [73] focuses on three critical higher-layer design
soft decision-directed channel estimation is argued using the areas: low latency core network architecture, flexible MAC
data symbols for re-estimation of the channels. In [68], space- layer, and congestion control. Possible solutions to achieve
time encoding is introduced within a GFDM block in order to improvements in these critical design areas are short symbol
achieve transmit diversity for overall low latency in the system. periods, flexible TTI, low-power digital beam forming for
On the other hand, a widely linear estimator is used to decode control, and low latency mmWave MAC, which can all be
the GFDM block at the receiver end, which yields significant
TABLE XI: OVERVIEW OF S OLUTIONS IN MM WAVE
C OMMUNICATIONS FOR L OW L ATENCY.
TABLE X: OVERVIEW OF S OLUTIONS IN D ETECTION S IDE
FOR L OW L ATENCY. Focus References Techniques
[46] mmWave based MAC layer frame structure
Focus Reference Technique/Approach [73] Low latency core network architecture, flexi-
[66] SM-MIMO detection with ZF and MRC-ZF ble MAC layer, and congestion control
[67] Linear MMSE receiver mmWave [74] mmWave based physical layer air interface
Symbol [68] Space-time encoding and widely linear es- with basic numerology and logical channel
detection timator arrangement
[69]–[71] Compressed sensing [75] Low latency frame structure with beam track-
[72] Low complexity receiver in SCMA system ing
Throughput Packet loss rate
TABLE XII: L OCATION -AWARE C OMMUNICATIONS FOR
L OW L ATENCY. Throughput Packet loss rate
Focus References Techniques QoS
[77] Location information utilization in
protocol stack
Location aware [36] Physical layer parameters design us-
communications ing FFT, frame duration and local area Network delay QoS Round trip travel
(LA) physical channel time
[40] Utilization of channel quality and
traffic statistics from small cell
Network delay Peak SINR Round trip travel
time

considered for data channel, downlink control channel, and Peak SINR
uplink control channel.
In [74], in order to decrease the latency of the system, (a)
two different physical layer numerologies are proposed. The
Understanding
first approach is applicable for indoor or line of sight (LOS) Aesthetic user
communications, and the second one is suitable for non line experience Manageable
of sight (NLOS) communications. This is justified by some
Understanding
channel measurements experiments in 28 − 73 GHz range.
Aesthetic user
In [75], a 5G mmWPoC system is employed to evaluate the experience Manageable
Learnable
Appropriate
throughput functionality in field tests at up to 20 km/h mobile QoE
speed in an outdoor LOS environment. Additionally, some
improvements for a frame design is obtained which decrease
Appropriate Learnable
the latency in the field tests. In the experiments, it is observed Effective
Usable
QoE
that the new slotted frame design can decrease the RTT to design
3 ms for 70% − 80% of the cases in experiments, alongside Mutable
Need
the observed throughput up to 1 Gbps. Usable
Effective (b)
H. Location-Aware Communications for 5G Networks design
Fig. 10: (a) Attributes of QoS, and (b) Attributes
Mutable
Need of QoE.
Location knowledge (in particular, the communication link
distance) can be considered as a criterion of received power,
interference level, and link quality in a wireless network. accumulated from the small cells which can help to gain high
Therefore, overhead and delays can be reduced with location- throughput and low latency. Especially, in order to reduce the
aware resource allocation techniques because of the possibility latency, the delay due to packets containing critical data for
of channel quality prediction beyond traditional time scales. the higher layer protocols, for instance transmission control
The literature on location-aware communications regarding protocol (TCP) acknowledgment (ACK) packets, must be
low latency are tabulated in Table XII. optimized. To do so, one possible approach is to carry out the
In [77], several approaches are presented for monolithic retransmissions as quick as possible compared to the higher
location aware 5G devices in order to identify corresponding layer timers. Moreover, capability of data transmission to a
signal processing challenges, and describe how location data contention based data channel (CBDCH) can play a key role
should be employed across the protocol stack from a big here. As a result, by introducing CBDCH in small cells that
picture perspective. Moreover, this work also presents sev- are not highly loaded, the average latency of small packets
eral open challenges and research directions that should be transmission can be decreased considerably.
solved before 5G technologies employ mmWave to achieve
the performance gains in terms of latency, connectivity and I. Reinforcing QoS and QoE
throughput. In [36], 5G flexible TDD is proposed for local As shown in Fig. 10(a) and Fig. 10(b), respectively, re-
area (5GETLA) radio interface with FFT size of 256 and inforcing constraints on QoS and QoE can maintain low
512, and short frame structure to achieve latency lower than latency in 5G services including ultra high definition and 3D
1 ms. The packets of size of less than 50 kbits can be video content, real time gaming, and neurosurgery. The related
transmitted with end-to-end latency of 0.25 ms. The main literature on QoS and QoE control for low latency services are
focus in designing physical layer parameters is on FFT, frame tabulated in Table XIII.
duration and physical channel (LA). Abundance of mmWave bandwidth and extensive use of
In [40], a novel numerology and radio interface architecture beamforming techniques in 5G will allow high QoS and
is presented for local area system by flexible TDD, and frame QoE overcoming the resource and sharing constraints [96].
design. The proposed framework ensures coexistence with However, current transmission protocols and technologies
overlay LTE-A network, sleeping modes, contention based cannot be employed simply for addressing technical issues
data channel, and channel quality indicator and interference in 5G. The mapping of diverse services including latency
statistics. Here, the channel quality and traffic statistics are critical service to the optimal frequency, SDN and cloud
TABLE XIII: L ITERATURE OVERVIEW R ELATED TO
Q O S/Q O E R EINFORCEMENT. Core
network
Aspect Reference
[96]
Techniques/Approaches
mmWave utilization with beam tracking
Optical
[97] SDN and cloud technology Wireless
[98] QoS-aware multimedia scheduling Backhaul
QoS [99] Client based QoS monitoring architec-
reinforcing ture
[100] Colored conflict graph
Virtual BTS Virtual BTS Virtual BTS

Cloud-RAN
[101] QoS architecture with heterogeneous

[102]
statistical delay bound
Dynamic energy efficient bandwidth al- pool pool pool
location scheme
[103], [104] Emphasis on users’ perceived satisfac-
tion (i.e. QoE)
QoE [105] Routing using proximity information
reinforcing
[106] Predictive model based on empirical ob-
servations
[107] Predictive scheme of user QoE for Inter-
net streaming

technologies can ensure to achieve the best QoS and QoE, as


discussed in [97]. In [98], a QoS-aware multimedia scheduling
approach is proposed using propagation analysis and proper
countermeasure methods to meet the QoS requirements in the Fig. 11: Cloud-RAN architecture in 5G networks.
mmWave communications. Mean opinion score (MOS) which
is a criterion for user satisfaction can be employed for func-
tionality evaluation of the newly presented QoS approach and J. Other Aspects
well-known distortion driven scheduling in different frequency Cloud radio access network (CRAN) (as illustrated in
ranges. Fig. 11) is introduced for 5G in order to reduce the capital
Client based QoS monitoring architecture is proposed in expenditure (CAPEX) and simplify the network management.
[99] to address the issue of QoS monitoring from server CRANs combine baseband processing units of a group of base
point of view. Different criteria such as bandwidth, error rate stations into a central server retaining radio front end at the
and signal strength are proposed with the well-known RTT cell sides. However, this requires connection links with delay
delay for maintaining desirable QoS. A colored conflict graph of 250 µs to support 5G low latency services. In order to
is introduced in [100] to capture multiple interference and meet strict latency requirements in CRAN, two optimization
QoS aware approaches in order to take the advantage of techniques including (i) fine-tuned real-time kernel for pro-
beamforming antennas. In this case, reduction in call blocking cessing latency and (ii) docker with data plane development kit
and handoff failure helps to have a better QoS for multi class (DPDK) for networking latency have been proposed in [117].
traffic. Each device can be sensitive to time based on its The experimental results clearly demonstrate the effectiveness
application. This can be considered as an issue for QoS provi- of the approaches for latency optimization. In [118], split of
sioning. To address this, a novel QoS architecture is presented PHY and MAC layer in a CRAN with Ethernet fronthaul is
in [101] with heterogeneous statistical delay bound over a proposed, and verified through experimental test followed by
wireless coupling channel. The authors presented the dynamic latency interpretation. It is found that latency for packets of
energy efficient bandwidth allocation schemes in [102], which size 70 and 982 bytes is 107.32 µs and 128.18 µs confirming
improve system quality significantly and maintain QoS. 20% latency increase from small to large packet. The promis-
Previous QoS criteria which consist of packet loss rate, ing results affirm that latency critical services in 5G can be
network latency, peak signal-to-noise ratio (SNR) and RTT are supported by CRAN.
not sufficient for streaming media on Internet, and therefore, The CRAN can utilize backhaul information to redistribute
users’ perceived satisfaction (i.e. QoE) needs to be addressed users for QoE maximization and adaptation of temporal
[103], [104]. Higher QoS may not ensure the satisfactory backhaul constraints. Corresponding to this, authors in [119]
QoE. Different routing approaches of video streams in the proposed a centralized optimization scheme to control the
mobile network operators’ scenario is discussed in [105] for cell range extension offset so as to minimize the average
substantial refinement in QoE considering bit rate streams, network packet delay. In [76], a CRAN on the basis of
low jitter, reduced startup delay and smoother playback. A the optical network (PON) architecture is presented which
predictive model from empirical observations is presented in is called virtualized-CRAN (V-CRAN). The proposed scheme
[106] to address interdependency formulated as a machine can dynamically affiliate each radio unit (RU) to a digital unit
learning problem. Apart from that, a predictive model of user (DU) which results in coordination of multiple RUs with their
QoE for Internet video is proposed in [107]. corresponing DU. Moreover, definition of virtualized BS (V-
App. Plane
OSS/BSS NFVO
APP APP APP APP

Management and Admin Plane


Northbound Interface (NBI) e.g. RESTful, JSON
EMS EMS EMS
Control Plane

VNFM
SDN Controllers (e.g. ODL, ONOS) VNF VNF VNF

Southbound Interface (SBI) e.g. OpenFlow, ForCES Virtual Virtual Virtual


Network Network Network
VIM
Date Plane

Virtualization Layer
DFE DFE DFE
Compute Storage Network

(a) (b)

Fig. 12: Simplified example architectures for core network (a) Architecture of SDN [120], and (b) Architecture of NFV [121]
(APP: application; ODL: opendaylight platform; ONOS: open network operating system; DFE: dyna forming engineering;
OSS: operations support systems; BSS: base station subsystem; EMS: element management system; NFVO: NFV
orchestrator; VNFM: virtual network function manager; VIM: virtualized infrastructure manager).

BS) is brought up which is able to mutually send shared signals system can be improved. The optimal threshold distance is
from several RUs to a user. V-CRAN can reduce latency for derived in order to minimize the average distance between
joint transmission due to the following reasons. First, it can a user and an SBS. Simulation results demonstrate that the
provide more than enough bandwidth for data transmission energy consumption of SBS can be reduced by about 35% if
between RUs and DU. Furthermore, for each DU, a dedicated some of the SBSs are switched off.
hardware/software is assigned that can be utilized by joint In [123], authors introduced an energy efficient, low-
transmission controller in order to provide data and signaling complexity technique for load-based sleep mode optimization
NFVO
for RUs. The last but not least, in order to handle the load in densely deployed 5G small cell networks. By defining a new
distribution between DUs, the virtualized PON can connect a analytic model, the distribution of the traffic load of a small
DU directly via a linecard (LC). cell is characterized using Gamma distribution. It is shown
that the network throughput can be improved significantly
Before starting the packet transmission in data applications,
VNFM while some amount of energy is saved by taking the benefit
a tolerable initial delay can be considered. Thus, this short
of the initial delay. In [124], impact of average sleeping time
delay can be employed to decrease the energy required to
of BS and association radius on the mean delay in an UDN
operate the small cell base stations (SBSs). With the goal
is investigated using a M/G/1/N queuing model. An explicit
of average power consumption reduction of SBSs, when the
equation of delay is derived and the effect of average sleeping
load is low, under-utilized SBSs can be are switched off. As a
VIM time and association radius on the mean delay is analyzed.
result, the users and the network can be able to save energy by
postponing transmissions. By doing so, the users have time to
VII. C ORE N ETWORK S OLUTIONS FOR LOW LATENCY
wait for an SBS with better link quality. It can be observed that
sleeping mode operation for energy efficiency improvements To meet the vision of 5G encompassing ultra low latency in
in SBSs will also introduce a source of latency. In [122], addition to enhancements in the RAN, drastic changes are also
the energy-efficiency versus delay trade-off is investigated and proposed in the core network. The new core network includes
optimality conditions for UE’s transmit power is derived. By some new entities such as SDN, MEC, and NFV as well as
postponing the access of the users, energy efficiency of the new backhaul techniques [131], [132]. These enhancements

TABLE XIV: OVERVIEW OF T ECHNIQUES IN C ORE N ETWORK FOR L OW L ATENCY.


Case/Area Reference Approach Summary
[29], [108]–[110], SDN based architec- The architecture of 5G is proposed based on SDN with 5G vision to
[111], [112] ture meet large throughput, massive connectivity and low latency.
Core Network Architecture [113]–[115], [116] NFV based architec- NFV invalidates the dependency on hardware platform and makes
ture easy deployment of EPC functions as well as the sharing of resources
in the RAN. This can reduce the end to end latency with improved
throughput performance.
[113], [115], [116] MEC/fog based net- MEC/fog provides computation and storage near user end and also
work separates the data plan from control plan. These reduces the latency .
aim to reduce the processing time, bypass several protocol level slicing), and information centric wireless virtualization
layers, and ensure seamless operation. The core network controller have been introduced.
solutions for low latency are reviewed in further detail in the In [114], deployment of EPC support solutions as a Service
rest of this section. (EPCaaS) based on a uniform edge cloud structure is urged
using the concepts of NFV. Furthermore, the authors proposed
A. Core Network Entities an approach to share states over several data centers with
availability of network encapsulation across different data cen-
The existing literature on the core network entities that can ters even in presence of firewall/network encapsulation with
facilitate to achieve low latency are summarized in Table XIV. dynamically selectable state portions and edge nodes. In [115],
Exemplary architectures of SDN and NFV of the 5G core authors presented the optimization problem of composing,
network are illustrated in Fig. 12. computing and networking virtual functions to select those
In [29], authors presented an LTE compliant architecture nodes along the path that minimizes the overall latency (i.e.,
to decrease delay for combination of fog networks, MEC network and processing latency). The optimization problem is
and SDN in which the architecture is supposed to take the formulated as a resource constrained shortest path problem on
advantage of NFV in the evolved packet core (EPC) func- an auxiliary layered graph followed by initial evaluation.
tions. Following this, optimization of general packet radio The proposed core network entities for 5G vision are
service (GPRS) tunneling protocol (GTP) is introduced for summarized in Table XVI.
supporting low latency services. GTP tunnels management is
accomplished by a novel element between the eNB and the B. Backhaul Solutions
mobile network interface with the Internet. In [108], [109], Backhaul between base stations and the core network carries
SDN is proposed along with some changes in the existing the signaling and data from the core and the Internet. Due
4G architecture for moving forward towards 5G. The changes to the enormous number of small cells and macro cell base
include reduction of number of serving gateways (S-GW) and stations supporting 1000x capacity, massive connectivity and
elimination of some protocol layers. In SDN based system, latency critical services in 5G, the capacity of backhaul is
virtualization is possible, and routes can be optimized. This a bottleneck for achieving low latency. At current scenario,
will allow handling of QoS by setting specific rules in the microwave, copper and optical fiber links are used for backhaul
switches along the data path. Network coding integrated with connections based on availability and requirements. 5G back-
SDN is proposed in [110] for low latency and reduced packet- haul requires higher capacity, lower latency, synchronization,
retransmission. Network coding can work as network router security, and resiliency [13]. An exemplary 5G backhaul is
and can be integrated with SDN, which provides seamless illustrated in Fig. 13, in which the small cell base stations are
network operation and reduction in latency. connected by fronthaul whereas the macro cell base stations
The NFV is proposed as a major entity of 5G core network are connected by backhaul.
in [112]–[116], [133]. NFV removes the dependency on the Referring to Table XV, we divide existing backhaul so-
hardware platform and makes flexible deployment of EPC lutions into 2 parts: (1) General backhaul and (2) mmWave
functions as well as sharing of resources in RAN. This can backhaul. The solutions are described as follows.
reduce the E2E latency with improved throughput perfor- 1) General backhaul: General backhaul includes a dynamic
mance. SDN and NFV based 5G architecture with enhanced GPRS tunneling protocol (GTP) termination mechanism that
programmability of the network fabric, decoupled network combines cloud based GTP with a quick GTP tunnel proposed
functionalities from hardware, separated control plane from in [111]. Based on the user request or other factors, the
data plane, and centralized network intelligence in the network system can change its mode from a cloud-based GTP tunnel
controller is presented in [112]. In [113], an information to the quick GTP tunnel. In [29], a 5G vision compliant
centric scheme is presented in order to integrate the wireless architecture is presented to reduce latency combining with
network virtualization with information centric network (ICN). the fog network, MEC and SDN. The optimization of GTP
In this architecture, key components such as wireless network tunnels is accomplished by a novel element acting as an
infrastructure, radio spectrum resource, virtual resources (in- intermediate node between eNB and the mobile network
cluding content-level slicing, network-level slicing, and flow- interface accompanied with the Internet. In [125], modified

TABLE XV: OVERVIEW OF L ITERATURE IN BACKHAUL S OLUTIONS TO ACHIEVE L OW L ATENCY.


Category Reference Approach/Techniques
General [111] A dynamic GTP termination scheme combining cloud based GTP with a quick GTP tunnel with a dedicated hardware.
backhaul
[29] GTP tunnel optimization by a new component in 5G complaint network consists of fog networks, MEC and SDN.
[125] Modified VLC technology to set up an OW link for low-cost back hauling of small cells.
[126] PON-based architecture with a tailored dynamic bandwidth allocation algorithm.
mmWave [127] mmWave for fronthaul and backhaul, and split of control and user plane.
backhaul
[128] A digitally-controlled phase-shifter network based hybrid precoding/combining scheme for mmWave massive MIMO.
[129] A framework supporting of in-band, point-to-multipoint, non-Line-of-sight and mmWave backhaul.
[130] A mmWave based backhaul frame structure in 3 - 10 GHz carrier frequencies.
Wired connection
Optical connection
Microwave/wireless
connection

Virtualized SDN Virtualized


Backhaul server server

Internet
MME/SGSN S-G/
GGSN

Macro base Base station


station controller
Small cell
base station Router

Fig. 13: An example for a backhaul connection scenario in 5G networks where SBSs are connected by fronthaul and macro
BSs are connected to the core network by backhaul.

TABLE XVI: P ROPOSED C ORE N ETWORK E NTITIES FOR dynamic bandwidth allocation algorithm for a fast handover
5G V ISIONS . between eNBs, which are associated to the same or diverse
Reference SDN NFV MEC Fog net- CRAN optical network units.
works
[29] " " " 2) mmWave Backhaul: In addition to the presented solu-
[108]–[111] " tions for backhaul, mmWave employment in backhaul can be
[116] " considered as a promising solution for latency reduction. In
[112]–[114], " order to have the enhanced user experience, the BSs should be
[116], [133] in touch with core network and all other BSs via a low latency
[113] " " backhaul [159]. In [127], the authors proposed a scheme that
[114] "
[112] " " employed mmWave links as backhaul, fronthaul and access in
[115] " " which a new separation method between control and user plane
[76] " is proposed for 5G cellular network. A reasonable split among
control and user plane improve the user QoS by providing
ubiquitous high data rates in mmWave SBS coverage.
In [128], to implement an ultra-dense network (UDN) for
VLC technology is used to set up an optical window (OW) the future 5G network and providing high data rates, the need
link for low-cost backhaul of small cells to achieve a latency of a reliable, gigahertz bandwidth, and economical backhaul
of 10 ms. Moreover, using a next generation baseband chipset, is emphasized. Since mmWave can be easily integrated with
end-to-end latency below 2 ms can be achieved. An efficient massive MIMO to improve link reliability, and can provide
PON-based architecture is proposed in [126] that offers ultra- sufficient data rate for wireless backhaul, it is a promising
short latency for handovers by enhancing connectivity between candidate for such a scenario. Considering a massive MIMO
neighboring cells. Additionally, the authors propose a tailored scenario, a hybrid precoding approach is considered, in which
MBS MBS MBS

SBS SBS SBS


SBS
User 1 User 2 User 1 User 2 User 1 User 1 User 2
User 2
Local caching D2D caching SBS caching MBS caching

Fig. 14: Different types of caching in 5G.

each BS can cover several SBSs with multiple streams for each A. Caching for cellular network
SBS at the same time. In [129], the authors proposed a solution Let us consider, a scenario that a user requests content
framework for supporting an in-band, point to multi-point, from a content library F = {f1 , f2 , ......, fk }, where k is the
NLOS, mmWave backhaul in order to provide a cost-effective number of files. The files are sorted with popularity where f1
and low latency solution for wireless backhaul. It is shown and fk are the most and least popular files, respectively. The
that an in-band wireless backhaul for inter BS coordination popularity of a requested file l can be written [160] as
is feasible while the cell access capacities are not affected
considerably. l−γ
In [130], a frame design for mmWave communications is φl = Pk , (7)
−γ
proposed for 5G SBS network radio interface in 3-10 GHz. i=1 l
For both of LOS and NLOS scenarios different frame designs
are proposed, which have a frame duration of 0.1 ms and where l ∈ {1, 2, ..., k} and γ is the parameter for uneven
0.05 ms, respectively, to achieve low latency. The proposed distribution of popularity in F which follows Zipf distribution.
LOS structure can be assumed as a suitable solution for short For N eNBs B = {BS1 , BS2 , ......., BSN } with each eNB
distance indoor wireless access or in-band backhaul. having capacity C, the probability of caching of file fl by
an eNB can be obtained as
VIII. C ACHING S OLUTIONS FOR L OW L ATENCY
2
In addition to the shortage of the radio spectrum, the PlB = 1 − e−ρσπR , (8)
insufficient capacity of backhaul links can be considered as a
bottleneck for low latency communication. The long delay can
be due to the requests of too many users in peak-traffic hours. where ρ is the spatial density of eNBs following a Poisson
Thus, latency reduction is crucial for users’ QoS and QoE in point process [161], [162], and σ is the probability that file
the 5G networks. Caching and in a more general category, fl is cached within B. Then, the total probability of getting
information centric networking, can be assumed as one of content from the eNB can be written as
the promising candidate technologies to design a paradigm
N
shift for latency reduction in next generation communication B
X
P = φi PiB . (9)
systems [11], [12].
i=1
In this section, referring to Table XVII, we present a detailed
overview of caching concepts for cellular network followed by The probability of getting the content as in (9) is directly
fundamental limits and existing solutions. associated with the latency of downloading it, and hence,

TABLE XVII: OVERVIEW OF L ITERATURE IN C ACHING .


Aspect Reference Summary
Content caching [134]–[153] Filling of appropriate data is investigated by diverse techniques employing time intervals in which
the network is not congested.
Content delivery [135], [138]–[156] Content delivery to requested users is presented by different approaches for reduction of latency.
Centralized caching [134]–[136], [139], [141], Various centralized caching is investigated with assumption that a coordinator with access to almost
[143]–[149], [151]–[155] all the information about the storage capacities of different BSs, the connectivity of the users and
BSs, and etc.
Distributed caching [137], [138], [140], [142], Various aspects of distributed caching has been investigated in order to minimize the communi-
[150], [153], [156] cation overhead among SBSs and the central scheduler.

Latency-Storage trade-off [143]–[148], [157], [158] Fundamental trade off between storage and latency is investigated in radio networks complemented
with cache-enabled nodes.
TABLE XVIII: D IFFERENT T YPES OF C ACHING S CHEMES TABLE XIX: D EFINITION OF THE M ETRICS USED FOR
FOR THE C ELLULAR N ETWORK . L ATENCY E VALUATION IN C ACHING S CHEMES .
Cases Reference Description Cases Reference Defenition
When a UE wants to access a content, it first Defined as asymptotic delivery delay per bit in the
Local [163] checks in itself. Once the content is confirmed NDT [143] high-power, long-blocklength case.
caching in the local caching storage, it is accessed by Defined as the ratio between the duration of trans-
the UE without any delay. DTB [148] mission in channel to the file size in bits for the
If the requested content is not found locally, very large file size regime.
user will seek it within the range of it’s Defined as the worst-case delivery latency for the
D2D [160] D2D communication. If it is found in nearby FDT [147] real load at a rate described by the DoF of the
caching devices, it is delivered to the requester UE by channel.
D2D communication.
If the requested content is available in the
SBS [164] local SBS, the content is delivered to the UE
caching by the local SBS. radio access networks, optimal caching front-haul transmission
If the content is not found in local caching is obtained. In [145], the latency storage trade-off in a 3 × 3
MBS [160] storage, nearby devices or SBS caching, the
caching content is delivered by MBS caching. wireless interference network is investigated while all trans-
mitters and receivers are equipped with caches. Another metric
called (FDT) is proposed in order to characterize the trade-
effective caching strategies can help in significantly reducing off between latency and storage. This information theoretic
latency in 5G networks performance metric is actually a refined version of the metric
The proposed caching schemes for mobile networks can be originally proposed in [143]. The FDT can reflect the load
divided into 4 categories: (1) Local caching, (2) Device to reduction as well. In a similar work [146] the well-known
device (D2D) caching, (3) SBS caching, and (4) Macro base DoF metric is used which does not reflect the load reduction.
station (MBS) caching. Each of these caching types can reduce Moreover, the proposed approach in [146] just considers the
the latency by providing the requested content for the users one-shot linear processing, but interference alignment scheme
using a way other than bringing it from the core network using in [143] may require infinite symbol extension.
backhaul links. In fact, each user starts from the nearest source In [146], an optimization problem is designed to minimize
to look for its desiered content and proceed until finding it in the number of required communication blocks for content
any of the proposed sources. The different types of caching delivery. Then, a lower bound is proposed on the value of the
for cellular network are illustrated in Fig. 14 followed by the objective function. Using the same metric, the authors in [147]
summarized descriptions presented in Table XVIII. investigated the fundamental trade-off for a cache-enabled
MIMO system. Considering a scenario with a 3 × 3 MIMO
B. Fundamental Latency-storage trade-off in Caching system in which the nodes are enabled with several antennas,
There are several fundamental limits for caching in mobile the trade off between storage and latency is investigated. In
networks including latency versus storage, memory versus rate addition to FDT and showing its optimality for some ranges
[149], memory versus CSIT [150], storage versus maximum of cache size, the model can consider the effect of real traffic
link load [151], and caching capacity versus delivery rate load at a rate specified by the DoF of the channel.
[152]. As defined in Table XIX, from an information theoretic In [148], a cellular network is considered with multiple
point of view, authors employed the metrics such as normal- SBSs with limited cache capacity in which there is interference
ized delivery time (NDT), fractional delivery time (FDT), and among them. Here, another information theoretic metric based
delivery time per bit (DTB) for investigation of the latency on delivery latency is defined as well in order to characterize
storage trade-off in caching networks. In most of these works the system performance as a function of SBS cache memory
[137], [138], [140], [142], [150], [153], [156], for a given and capacity of backhaul links connected to SBS. Using this
scenario, an upper bound or lower bound for the considered metric which is called DTB, the trade off between latency and
metric is derived in order to get useful insights of this trade-off. system resources is investigated. In [157], using NDT trade-
The summary of latency storage trade-off works is presented off between storage and latency a distributed caching scenario
in Table XX. in fog radio access networks is characterized. In the presented
The authors in [143] investigated the storage latency trade- approach, a coded delivery scheme is proposed to minimize
off using a new metric called NDT. This metric measures the the latency for delivering user demands for two edge-nodes
worst-case latency that can happen in a cache-aided wireless and arbitrary number of users. It is shown that using decen-
network divided by that of an ideal system with unlimited tralized placement, the presented delivery approach can obtain
caching capability. Considering a general cache-aided wireless a considerable performance improvement in comparison to the
network, the lower bound for NDT is presented in terms of derived lower bound.
the ratio of the existing file memory at the edge node and the In [158], again NDT is employed to characterize the funda-
total size of files for both perfect channel state information mental trade off between delivery latency and system architec-
(CSI) and imperfect CSI. ture. Considering NDT as the criterion for latency evaluation
Authors in [144] employed NDT as well in order to of the system, some bounds on its value are proposed. In the
characterize the trade-off between NDT and fronthaul/caching light of such bounds, useful insights on the latency and storage
resources. Using this information-theoretic analysis of fog trade off are obtained. It is demonstrated that in order to obtain
TABLE XX: S UMMARY OF THE WORKS ON are cache-enabled with limited storage. On the contrary to
LATENCY- STORAGE TRADE - OFF IN CACHING . the existing works that focused on offline caching where
Ref. Used Description both caching phases are considered separately, the proposed
metric method can integrate both of them in each time slot. The
[143] NDT Lower bound for NDT is derived for a general cache-
enabled network for both perfect and imperfect CSI. performance is characterized using NDT and compared to
[144] NDT The trade-off between NDT and front-haul and that of optimal offline caching schemes. It is shown that the
caching resources is characterized and optimal proposed approach outperforms the offline caching.
caching front-haul transmission is obtained.
[145] FDT For a 3×3 wireless interference network the storage-
latency trade-off is investigated while all transmitters C. Existing Caching Solutions for 5G
and receivers are equipped with caches.
[147] FDT For a scenario with a 3 × 3 MIMO system in which
In general, the file delivery service in mobile networks can
the nodes are enabled with several antennas, the trade be classified into two parts: cache placement, and content
off between storage and latency is investigated. delivery [169]. In cache placement, the cached content on the
[148] DTB DTB is used to characterize the system performance BSs is determined, which is usually based on the amounts of
as a function of cache storage and capacity of
backhaul links connected to SBS. requests from users. Cache placement can be done using a
[157] NDT The trade-off between storage and latency for a centralized or distributed approach. In centralized approach,
distributed caching scenario in fog radio access net- a coordinator is assumed with access to almost all the infor-
works is characterized.
[158] NDT Cloud-based compressed precoding and edge-based mation about the memory size of BSs, the connectivity of
interference management introduced as two major the users, and the BSs. However, in some scenarios that there
techniques for optimal performance of cloud and is no central controller, these schemes can not be applicable
caching resources in different cases.
[165] NDT Considering heterogeneous timeliness requests de- and a distributed cache placement is required [137]. In each
pending on application, the fundamental trade-off of the caching phases whether centralized or distributed, the
between the delivery latencies of different users’ design can reduce the delivery time of the requested content
requests is characterized using NDT.
[166] NDT Upper and lower bounds for minimum delivery la- and latency of the system.
tency as a function of cache and fronthaul resources There are some efforts in the literature on investigation
is obtained over fronthaul and wireless link in a F- of different challenges raised up in centralized cache place-
RAN with a wireless multicast fronthaul.
[167] NDT Assuming an F-RAN when pipelined fronthaul edge ment problems. In [134], the cache placement problem was
transmission is used, lower and upper bounds on the investigated in a scenario including SBSs, called helpers with
NDT is presented. weak backhaul links but large memory size. In experimental
[168] NDT Assuming both content placement and delivery evaluation, it is shown that the proposed scheme can achieve
phases in one time slot, it is shown that the proposed a considerable performance improvement for the users at
approach outperforms the offline caching. reasonable QoS levels. In [136], authors aim at minimizing
the average download delay of wireless caching networks with
respect to caching placement matrix. It is demonstrated that the
the lowest delivery latency, cloud-based compressed precoding backhaul propagation delay can affect the caching placement.
and edge-based interference management should be considered In some scenarios, it is more desirable to design the caching
as two major techniques for optimal performance of cloud problems in a distributed approach. In [137], a distributed
and caching resources in different cases. In [165], for a fog cache placement approach is proposed in order to minimize
radio access network, the heterogeneous timeliness requests the average download delay while some constraint for BSs
depending on application is considered while in the existing storage capacities are met. The formulated optimization prob-
works the assumption is that all requests have identical latency lem which is NP-hard is solved using a belief propagation
for all files in the content library. The fundamental trade-off based distributed algorithm with low complexity. This opti-
between the delivery latencies of different users’ requests is mization problem makes sense because there is a trade-off
characterized using NDT. The minimization of the average between latency and storage capacity in caching networks.
delivery latency as a function of the content popularity profile The comparison between the performance of the proposed
remains as a future research direction. distributed scheme with that of the centralized algorithm in
In [166], the total delivery latency over fronthaul and [134] is presented as well. In [138], a decentralized content
wireless link in a fog radio access network with a wireless placement caching scheme is presented. Although there is no
multicast fronthaul is investigated. Again using NDT, the coordination, the proposed scheme can attain a rate as good
optimal delivery latency based on cache storage and fronthaul as the optimal centralized scheme proposed in [134]. In [170],
resources is formulated and upper and lower bounds are two caching and delivery schemes are considered. The first
obtained. It is shown that in contrast to the receiver-side one operates in a centralized manner, while the second one is
caching, coded multi-casting can not help in decreasing the based on decentralized caching. For both cases, the trade-off
NDT when two users and two edge nodes (ENs) are available. between coded multi-casting and spatial reuse is reflected by
In [167], for a F-RAN, the NDT is used to characterize the the code length.
performance of a fog radio access network when pipelined In addition to the aforementioned literature, in [180], con-
fronthaul edge transmission is used. tent caching, and content delivery schemes are proposed con-
In [168], a F-RAN system is considered in which ENs sidering cooperation to address the explosive enhancements
of demand for mobile network applications. Defining the delivery in 5G cellular networks. The proposed scheme is
objective as minimizing the average downloading latency, it carefully designed in order to take into account the benefit
is demonstrated that the proposed content assignment and of multicast and cooperation while in the existing caching
delivery policy scheme has a better performance in comparison schemes, the popular content simply is brought close to the
to the previous known content caching schemes in terms users. The optimization problem is formulated in order to
of average downloading latency. A weighted optimization minimize average latency for all the content requests. It is
problem is formulated in [139] to minimize the traffic of demonstrated that via various trace-driven simulations that
backhaul and downlink while the constraints for cache memory the proposed cooperative multicast-aware caching scheme can
size and bandwidth limitation for D2D communication is provide up to 13% decrease in the average content-access
considered. It is shown in [156] that if latency awareness is latency in comparison to multicast-aware caching scheme with
considered in caching management, it is an effective approach the same total cache capacity.
to reduce the delivery time of latency sensitive applications, In [182], the authors presented a cooperative caching ar-
and the global delivery time of users. In the proposed model, chitecture in which multiple locally cache-enabled nodes of
two main advantages is claimed. First, it not only has a better cloudlets interact cooperatively in a decentralized cloud ser-
performance in term of delivery time at the end-user, but also vice networks. By proposing a content distribution strategy, the
affects the link load reduction. Second, a faster convergence problem is formulated so that the mean total content delivery
with respect to probabilistic caching is achieved. In [140], the delay for all users in the proposed scheme is minimized. It is
effect of joint latency awareness and forwarding is investigated confirmed that the approach can enhance the cache hit rate, and
in a cache-enabled network. Authors proposed a scheme which also reduce the the content delivery latency in comparison to
is based on caching and forwarding strategies in order to existing solutions. In [183], the end-to-end packet transmission
improve E2E experienced latency by the UEs while there is in a cache-enabled network is modeled in which both the
no coordination among them. wired backhaul and the RAN are jointly considered. The
In [141], a cooperative content caching approach between performances of both the on-peak and the off-peak network
BSs in cache-enabled multi-cell network is considered. Due to are investigated while both the wired backhaul and the RAN
the trade-off between storage and latency, cooperative caching are considered. The E2E average packet latency is elaborated
optimization problem is designed in order to minimize the with the change of the request rate. It is shown that the average
average delay while a constraint on the finite cache size packet latency reduces in comparison to that of the system
at BSs is met. It is shown that cooperation among cells without caching ability due to the traffic offloading of the
can considerably reduce delay in comparison to that of non- wired backhaul via caching.
cooperative case. Moreover, the gains of the proposed scheme IX. F IELD T ESTS , T RIALS AND E XPERIMENTS
will be increased in more diverse and heavier load traffics. In In this section, we present some representative field tests,
[142], the aim of the work is to minimize the data transmission trials and experiments for 5G low latency. The related litera-
delay for the P2P caching system while considering the effect ture is summarized in Table XXI, where each of the individual
of cache size, all mobiles in a cell are considered as several references will be described in further detail below.
P2P caching groups. Then, the problem is formulated as a The study [171] presents SDR based hardware platform
stochastic optimization problem and solved using Markov to verify the concept of 5G. This facilitates initial proof-of-
decision process (MDP) to obtain the optimal solution. concepts (PoC) of novel 5G air interface and other concepts
In [181], a cooperative multicast-aware caching strategy is by extending hardware-in-the-loop (HIL) experiments to small
proposed for the BSs to decrease the average latency of content laboratory experiments and finally trials of outdoor tests.

TABLE XXI: P LATFORM AND P ERFORMANCE ON F IELD T ESTS , T RIALS AND E XPERIMENTS .
Reference Evaluation SDR DSP mmWave Conventional/ Proprietary LTE Remarks
methodology
[171] " Trials of 5G concepts along with a novel air interface
[172] " 4 test cases and 15 KPIs
[173] " mmWave aggregation
[174] " Minimum latency µs
[175] " Low latency VANET
[176] " Round-trip latency ≤ 2µs
[66] " 20 times latency reduction in comparison to existing
works
[75] " Minimum latency 3 ms
[177] " Latency ≤ 1 or 2 ms
[72] "
[178] " Latency ≤ 17 ms
[179] " HARQ RTT ≤ 1.5 ms
[42] " RTT latency ≤ 1 ms
Such an SDR based hardware can demonstrate high-capacity, (PoC) system. It was found that the slot interleaving frame
low latency and coverage capabilities of LTE-A solutions. structure can achieve RTT latency of 3 ms in the 70 − 80% of
In [172], evaluation methodology including some novel test the trial course. Additionally, beam tracking and throughput
environments and certain new key performance indicators are performance were evaluated in field tests at a speed up to
discussed in order to evaluate 5G network. Here, four candi- 20 km/h on LOS outdoor environment. It was confirmed that
date test environments such as indoor isolated environment and mmWave system can obtain throughput of 1 Gbps in the 38%
high speed train environment, and fifteen key performance in- of the trial time at 20 km/h speed.
dicators such as latency, throughput, network energy efficiency In [72], a low complexity receiver design was introduced
and device connection density are emphasized for performance followed by verification of superiority of an SCMA system
evaluation. via simulations and real-time prototyping. It can provide up
In [173], 5G system operating at 15 GHz is presented to 300% overloading that triples the whole system throughput
followed by some experimental results. Here 0.2 ms subframe while still enjoying the link performance close to orthogonal
(14 OFDM symbols) is used for throughput, latency and transmissions. In [178], the concept of MEC was introduced
other performance evaluation. The hardware implementation first time for 5G followed by promising field tests. The MEC
results of digital signal processing (DSP) and SDR based was tested and analyzed on various cases including local
5G system for low latency is presented in [174]. In this breakout and network end-to-end latency. It was concluded
study, both the short TTI (sTTI) frame structures and wider that MEC can support low latency services of not lower than
subcarrier spacings are implemented in DSP platform. Based 17 ms. It also urged that stricter requirement of latency needs
on the configurations of the system, RTT latency as low as to be investigated from the new radio technologies or D2D
1 ms can be achieved. However, for achieving latency on communication. In [179], a lab trial is presented to study the
the order of a few µs, optimization in between controllers feasibility of ultra-low latency for 5G. It is shown that 1.5
and processing machines needs to be performed by cross- ms HARQ RTT for TDD downlink in a lab trial is achievable
layer fashion. Additionally, the tail latency is argued to be when using the available test equipment in the literature while
considered in strict latency requirements assessment while a novel frame structure and the associated signaling procedure
maintaining required reliability. is employed. The proposed scheme has 5 times better latency
An SDR based test bed is presented in [175] for co- performance in comparison to the existing LTE-Advanced
operative automated driving with some experimental results standard.
from lab measurements. It implements flexible air inter- In [42], a novel frame structure is tested using a propri-
face consisting of re-configurable frame structure with fast- etary quasi-static system simulator for ultra-dense 5G outdoor
feedback, new pulse shaped OFDM (P-OFDM) waveform, RANs. In this regard, a frame structure is designed in order
low latency multiple-access scheme and robust hybrid syn- to facilitate low latency and multiuser spatial multiplexing
chronization, which ensure low latency high reliable commu- on radio interface along with small-scale packet transmission
nication. Results of the experimental trials are presented in and mobility support. It is found that performance of the
[176], which utilizes DSP techniques for channel aggregation introduced 5G network is better than that of LTE in case of
and de-aggregation, adjacent channel leakage ratio reduction, air interface latency. In particular, considering UL scheduling
frequency-domain windowing, and synchronous transmission requests in the RTT latency, the proposed frame structure can
of I/Q waveforms and code words used in control and man- achieve latency as low as 0.8365 ms which is reduced by a
agement function. In the proposed experiment, transmission factor of 5 in comparison to that of LTE. This satisfies 5G
of 48 chunks of 20 MHz LTE signals using a common public latency requirement (i.e.,1 ms latency).
radio interface of capacity 59 Gb/s can achieve RTT DSP
latency of less than 2 µs and mean error-vector magnitude of X. C ONCLUSION
about 2.5% after fronthaul fiber communication. This mobile
fronthaul technique shows the path towards ultra low latency Along with very large capacity, massive connection density,
integrated fiber/wireless access networks. and ultra high reliability, 5G networks will need to support
In [66], a multi-terminal massive SM-MIMO system is eval- ultra low latency. The low latency will enable new services
uated considering realistic scenarios. The authors developed a such as VR/AR, tele-medicine and tele-surgery; in some
massive SM-MIMO OFDM system prototype utilizing multi- cases, latency not more than 1 ms is critical. To achieve
ple off-the-shelf SDR modules which serve as IoT terminals. this low latency, drastic changes in multiple network domains
Two linear detection schemes with diverse complexity levels need to be addressed. In this paper, an extensive survey on
were tested for detection instead of maximum likelihood de- different approaches in order to achieve low latency in 5G
tection (MLD) schemes. It demonstrates the similar real-time networks is presented. Different approaches are reviewed in
SINR performance of the MLD techniques along with 20 times the domain of RAN, core network and caching for achieving
latency reduction over existing works. The promising results low latency. In the domain of RAN techniques, we have
urge the utilization of massive SM-MIMO systems for latency studied short frame/packets, new waveform designs, multiple
reduction and reliability enhancement in IoT transmissions. In access techniques, modulation and coding schemes, control
[75], the performance of a lower latency frame structure was channel approaches, symbol detection methods, transmission
evaluated in field tests using a 5G mmWave proof-of-concept techniques, mmWave aggregation, cloud RAN, reinforcing
QoS and QoE, and location aware communication as different TABLE XXII: L IST OF ACRONYMS .
aspects of facilitating low latency. AS Access stratum
On the other hand, SDN, NFV and MEC/fog network AR Augmented reality
BC Broadcast channel
architectures along with high speed backhaul are reviewed in BLER Block error rate
the literature for core network with vision to meet the low la- CCP Communication control port
tency requirements of 5G. The new core network will provide CCSE Control channel sparse encoding
diverse advantages such as distributed network functionality, CSIT Channel state information at the transmitter
DTB Delivery time per bit
independence of software platform from hardware platform, D2D Device to device
and separation of data plane from software plane, which DRB Data radio bearer
will all help in latency reduction. In caching, distributed and eMBB Enhanced mobile broadband
centralized caching with various trade-offs, cache placement EPC Evolved packet core
FFT Fast Fourier transform
and content delivery have been proposed for latency reduction FDT Fractional delivery time
in content download. Following this, promising results from FBMC Filter bank multicarrier
field tests, trials and experiments have also been presented GFDM Generalized frequency division multiplexing
GP Guard period
here. However, more practical and efficient techniques in the GTP GPRS tunnel protocol
presence of existing solutions need to be investigated before GGSN Gateway GPRS (General Packet Radio Service) ser-
the standardization of 5G. The authors believe that this survey vice node
will serve as a valuable resource for latency reduction for the HARQ Hybrid Automatic-Repeat-Request
ICI Inter carrier interference
emerging 5G cellular networks and beyond. IoT Internet of Things
ITS Intelligent transportation system
A PPENDIX IFFT Inverse first Fourier transform
Definitions of the acronyms used in this survey article are IUT International telecommunications union
listed in Table XXII. IUT-R International telecommunication union-radio com-
munication sector
ACKNOWLEDGMENT IMT-2020 International mobile telecommunication system with
a target date set for 2020
The work was supported by the National Science Foun- ISI Inter symbol interference
dation entitled “Towards Secure Networked Cyber-Physical MEC Mobile Edge Computing
MTP Machine type communication
Systems: A Theoretic Framework with Bounded Rationality” METIS Mobile and wireless communications enablers for
under the grant number 1446570. twenty-twenty (2020) information society
NGMN Next generation mobile networks
R EFERENCES MCC Mission critical communication
[1] S. Zhang, X. Xu, Y. Wu, and L. Lu, “5G: Towards energy-efficient, low- MRC-ZF Maximum ration combining zero forcing
latency and high-reliable communications networks,” in Proc. IEEE Int. MAC Medium access control
Conf. on Commun. Syst. (ICCS), Nov 2014, pp. 197–201. mMTc Massive machine type communication
[2] 5 Things Worth Knowing About 5G. [Online]. Available: http: MME Mobility management entity
//wi360.blogspot.com/2015/05/5-things-worth-knowing-about-5g.html MISO Multiple input single output
[3] M. Agiwal, A. Roy, and N. Saxena, “Next Generation 5G Wireless MDP Markov decision process
Networks: A Comprehensive Survey,” IEEE Commun. Surv. Tutor., NFV Network virtualized function
vol. 18, no. 3, pp. 1617–1655, thirdquarter 2016. NAS Non access stratum
[4] A. Gupta and R. K. Jha, “A Survey of 5G Network: Architecture and NDT Normalized delivery time
Emerging Technologies,” IEEE Access, vol. 3, pp. 1206–1232, 2015. OFDM Orthogonal frequency division multiplexing
[5] ITU-R, “Framework and Overall Objectives of the Future Development OOB Out of bound
of IMT for 2020 and Beyond,” Feb. 2015. OW Optical window
[6] A. Kumbhar, F. Koohifar, I. Guvenc, and B. Mueller, “A Survey OLLA Outer loop link adaption
on Legacy and Emerging Technologies for Public Safety Commu- PUCCH Physical uplink control channel
nications,” IEEE Commun. Surv. Tutor., vol. 19, no. 1, pp. 97–124, P2P Peer to peer
Firstquarter 2017. RAN Radio access network
[7] K. I. Pedersen, F. Frederiksen, G. Berardinelli, and P. E. Mogensen, SC-FDMA Single carrier frequency division multiple access
“The Coverage-Latency-Capacity Dilemma for TDD Wide Area Op-
SRB Signaling radio bearer
eration and Related 5G Solutions,” in Proc. IEEE Veh. Technol. Conf.
SCMA Sparse code multiple access
(VTC Spring), May 2016, pp. 1–5.
[8] P. K. Agyapong, M. Iwamura, D. Staehle, W. Kiess, and A. Benjebbour, SGW Serving GPRS gateway
“Design considerations for a 5G network architecture,” IEEE Commun. TDD Time division duplex
Maga., vol. 52, no. 11, pp. 65–75, Nov. 2014. RAT Radio access technology
[9] V. G. Nguyen, A. Brunstrom, K. J. Grinnemo, and J. Taheri, uRLLC Ultra reliable low latency communication
“SDN/NFV-based Mobile Packet Core Network Architectures: A Sur- UFMC Universal filtered multi-carrier
vey,” IEEE Commun. Surv. Tutor., vol. PP, no. 99, pp. 1–1, 2017. UDN Ultra dense network
[10] T. Taleb, K. Samdanis, B. Mada, H. Flinck, S. Dutta, and D. Sabella, VR Virtual reality
“On Multi-Access Edge Computing: A Survey of the Emerging 5G VLC Visible light communication
Network Edge Architecture Orchestration,” IEEE Commun. Surv. Tu- ZF Zero forcing
tor., vol. PP, no. 99, pp. 1–1, 2017. 5G Fifth generation mobile network
[11] A. Ioannou and S. Weber, “A Survey of Caching Policies and Forward- 5GETLA 5G flexible TDD based local area
ing Mechanisms in Information-Centric Networking,” IEEE Commun.
Surv. Tutor., vol. 18, no. 4, pp. 2847–2886, Fourthquarter 2016.
[12] M. Zhang, H. Luo, and H. Zhang, “A survey of caching mechanisms in
information-centric networking,” IEEE Commun. Surv. Tutor., vol. 17, [13] M. Jaber, M. A. Imran, R. Tafazolli, and A. Tukmanov, “5G Back-
no. 3, pp. 1473–1499, thirdquarter 2015. haul Challenges and Emerging Research Directions: A Survey,” IEEE
Access, vol. 4, pp. 1743–1766, 2016. mobile applications,” IEEE Commun. Mag., vol. 52, no. 2, pp. 97–105,
[14] T. O. Olwal, K. Djouani, and A. M. Kurien, “A Survey of Resource Feb. 2014.
Management Toward 5G Radio Access Networks,” IEEE Commun. [34] G. Durisi, T. Koch, and P. Popovski, “Toward Massive, Ultrareliable,
Surv. Tutor., vol. 18, no. 3, pp. 1656–1686, thirdquarter 2016. and Low-Latency Wireless Communication With Short Packets,” Proc.
[15] W. Xia, P. Zhao, Y. Wen, and H. Xie, “A Survey on Data Center of the IEEE, vol. 104, no. 9, pp. 1711–1726, Sep. 2016.
Networking (DCN): Infrastructure and Operations,” IEEE Commun. [35] N. A. Johansson, Y. P. E. Wang, E. Eriksson, and M. Hessler, “Radio
Surv. Tutor., vol. 19, no. 1, pp. 640–656, Firstquarter 2017. access for ultra-reliable and low-latency 5G communications,” in Proc.
[16] M. F. Bari, R. Boutaba, R. Esteves, L. Z. Granville, M. Podlesny, IEEE Int. Conf. Commun. Workshop (ICCW), Jun. 2015, pp. 1184–
M. G. Rabbani, Q. Zhang, and M. F. Zhani, “Data Center Network 1189.
Virtualization: A Survey,” IEEE Commun. Surv. Tutor., vol. 15, no. 2, [36] T. Levanen, J. Pirskanen, T. Koskela, J. Talvitie, and M. Valkama,
pp. 909–928, Second 2013. “Low latency radio interface for 5G flexible TDD local area commu-
[17] B. Briscoe, A. Brunstrom, A. Petlund, D. Hayes, D. Ros, I. J. Tsang, nications,” in Proc. IEEE Int. Conf. Commun. Workshop (ICCW), Jun.
S. Gjessing, G. Fairhurst, C. Griwodz, and M. Welzl, “Reducing 2014, pp. 7–13.
Internet Latency: A Survey of Techniques and Their Merits,” IEEE [37] P. Guan, X. Zhang, G. Ren, T. Tian, A. Benjebbour, Y. Saito, and
Commun. Surv. Tutor., vol. 18, no. 3, pp. 2149–2196, thirdquarter 2016. Y. Kishiyama, “Ultra-Low Latency for 5G - A Lab Trial,” CoRR, vol.
[18] S. Srivastava and S. P. Singh, “A Survey on Latency Reduction abs/1610.04362, 2016. [Online]. Available: http://arxiv.org/abs/1610.
Approaches for Performance Optimization in Cloud Computing,” in 04362
Proc. Inter. Conf. on Comp. Intel. Commun. Techn. (CICT), Feb 2016, [38] E. Lähetkangas, K. Pajukoski, J. Vihriälä, G. Berardinelli, M. Laurid-
pp. 111–115. sen, E. Tiirola, and P. Mogensen, “Achieving low latency and energy
[19] D. Delaney, T. Ward, and S. McLoone, “On Consistency and Network consumption by 5G TDD mode optimization,” in Proc. IEEE Int. Conf.
Latency in Distributed Interactive Applications: A Survey Part I,” Commun. Workshop (ICCW), Jun. 2014, pp. 1–6.
Presence, vol. 15, no. 2, pp. 218–234, April 2006. [39] K. I. Pedersen, G. Berardinelli, F. Frederiksen, P. Mogensen, and
[20] ——, “On Consistency and Network Latency in Distributed Interactive A. Szufarska, “A flexible 5G frame structure design for frequency-
Applications: A Survey Part II,” Presence, vol. 15, no. 4, pp. 465–482, division duplex cases,” IEEE Commun. Mag., vol. 54, no. 3, pp. 53–59,
Aug 2006. Mar. 2016.
[21] A. Mohamed, O. Onireti, M. A. Imran, A. Imran, and R. Tafazolli, [40] T. A. Levanen, J. Pirskanen, T. Koskela, J. Talvitie, and M. Valkama,
“Control-Data Separation Architecture for Cellular Radio Access Net- “Radio Interface Evolution Towards 5G and Enhanced Local Area
works: A Survey and Outlook,” IEEE Commun. Surv. Tutor., vol. 18, Communications,” IEEE Access, vol. 2, pp. 1005–1029, 2014.
no. 1, pp. 446–465, Firstquarter 2016. [41] T. Wirth, M. Mehlhose, J. Pilz, B. Holfeld, and D. Wieruch, “5G
[22] P. Schulz, M. Matthe, H. Klessig, M. Simsek, G. Fettweis, J. Ansari, new radio and ultra low latency applications: A PHY implementation
S. A. Ashraf, B. Almeroth, J. Voigt, I. Riedel, A. Puschmann, perspective,” in Proc. Asilomar Conf. Signals, Syst. and Comput., Nov.
A. Mitschele-Thiel, M. Muller, T. Elste, and M. Windisch, “Latency 2016, pp. 1409–1413.
Critical IoT Applications in 5G: Perspective on the Design of Radio [42] P. Kela, M. Costa, J. Salmi, K. Leppanen, J. Turkka, T. Hiltunen, and
Interface and Network Architecture,” IEEE Commun. Mag., vol. 55, M. Hronec, “A novel radio frame structure for 5G dense outdoor radio
no. 2, pp. 70–78, Feb. 2017. access networks,” in Proc. IEEE Veh. Technol. Conf. (VTC Spring),
[23] O. N. C. Yilmaz, Y. P. E. Wang, N. A. Johansson, N. Brahmi, S. A. May 2015, pp. 1–6.
Ashraf, and J. Sachs, “Analysis of ultra-reliable and low-latency 5G
[43] E. Lähetkangas, K. Pajukoski, E. Tiirola, G. Berardinelli, I. Harjula,
communication for a factory automation use case,” in Proc. IEEE Int.
and J. Vihriälä, “On the TDD subframe structure for beyond 4G radio
Conf. Commun. Workshop (ICCW), Jun. 2015, pp. 1190–1195.
access network,” in 2013 Future Network Mobile Summit, Jul. 2013,
[24] A. F. Cattoni, D. Chandramouli, C. Sartori, R. Stademann, and pp. 1–10.
P. Zanier, “Mobile Low Latency Services in 5G,” in Proc. IEEE Veh.
Technol. Conf. (VTC Spring), May 2015, pp. 1–6. [44] J. Vihriälä, A. A. Zaidi, V. Venkatasubramanian, N. He, E. Tiirola,
J. Medbo, E. Lähetkangas, K. Werner, K. Pajukoski, A. Cedergren, and
[25] C. Campolo, A. Molinaro, G. Araniti, and A. O. Berthet, “Better
R. Baldemair, “Numerology and frame structure for 5G radio access,”
Platooning Control Toward Autonomous Driving: An LTE Device-
in Proc. IEEE Symp. Pers. Indoor Mobile Radio Commun. (PIMRC),
to-Device Communications Strategy That Meets Ultralow Latency
Sep. 2016, pp. 1–5.
Requirements,” IEEE Veh. Techn. Maga., vol. 12, no. 1, pp. 30–38,
March 2017. [45] G. Pocovi, K. I. Pedersen, B. Soret, M. Lauridsen, and P. Mogensen,
“On the impact of multi-user traffic dynamics on low latency commu-
[26] ITU-Report, “The Tactile Internet,” ITU-T Technology Watch
nications,” in Proc. IEEE Int. Symp. Wireless Commun. Syst. (ISWCS),
Report, Aug. 2014. [Online]. Available: https:http://www.itu.int/
2016, pp. 204–208.
oth/T2301000023/en
[27] M. Simsek, A. Aijaz, M. Dohler, J. Sachs, and G. Fettweis, “The 5G- [46] S. Dutta, M. Mezzavilla, R. Ford, M. Zhang, S. Rangan, and M. Zorzi,
Enabled Tactile Internet: Applications, requirements, and architecture,” “Frame Structure Design and Analysis for Millimeter Wave Cellular
in Proc. IEEE Wireless. Commun. Netw. Conf. (WCNC), Apr. 2016, pp. Systems,” CoRR, vol. abs/1512.05691, 2015. [Online]. Available:
1–6. http://arxiv.org/abs/1512.05691
[28] Latency Analysis in LTE network. [Online]. [47] F. Schaich, T. Wild, and Y. Chen, “Waveform Contenders for 5G -
Available: http://www.techmahindra.com/Documents/WhitePaper/ Suitability for Short Packet and Low Latency Transmissions,” in Proc.
WhitePaperLatencyAnalysis.pdf Veh. Technol. Conf. (VTC Spring), May 2014, pp. 1–5.
[29] C. A. Garcia-Perez and P. Merino, “Enabling Low Latency Services on [48] M. Kasparick, G. Wunder, Y. Chen, F. Schaich, and T. Wild, “5G
LTE Networks,” in Proc. IEEE Int. Workshop Found. Appl. Self Syst. Waveform Candidate Selection D3.1,” in 5Gnow, Nov. 2013. [Online].
(FASW), Sep. 2016, pp. 248–255. [49] B. Farhang-Boroujeny, “OFDM Versus Filter Bank Multicarrier,” Proc.
[30] G. Pocovi, K. I. Pedersen, B. Soret, M. Lauridsen, and P. Mogensen, IEEE Signal Processing Mag., vol. 28, no. 3, pp. 92–112, May 2011.
“On the impact of multi-user traffic dynamics on low latency commu- [50] M. Sybis, K. Wesolowski, K. Jayasinghe, V. Venkatasubramanian,
nications,” in Proc. Inter. Symp. on Wire. Commun. Sys. (ISWCS), Sep. and V. Vukadinovic, “Channel Coding for Ultra-Reliable Low-Latency
2016, pp. 204–208. Communication in 5G Systems,” in Proc. IEEE Veh. Technol. Conf.
[31] ETSI, “Universal Mobile Telecommunications System (UMTS); Fea- (VTC Fall), Sep. 2016, pp. 1–5.
sibility study for evolved Universal Terrestrial Radio Access (UTRA) [51] B. Zhang, H. Shen, B. Yin, L. Lu, D. Chen, T. Wang, L. Gu, X. Wang,
and Universal Terrestrial Radio Access Network (UTRAN),” ETSI TR X. Hou, H. Jiang, A. Benjebbour, and Y. Kishiyama, “A 5G Trial of
125 912 V7.1.0, Tech. Rep., 09 2006. Polar Code,” in Proc. IEEE Globecom Workshops (GC Wkshps), Dec.
[32] P. K. Agyapong, M. Iwamura, D. Staehle, W. Kiess, and A. Benjebbour, 2016, pp. 1–6.
“Design considerations for a 5G network architecture,” IEEE Commun. [52] D. Liu, C. Zuo, and Z. Wu, “Benefit and cost of cross sliding window
Mag., vol. 52, no. 11, pp. 65–75, Nov. 2014. scheduling for low latency 5G Turbo decoding,” in Proc. IEEE/CIC
[33] G. Wunder, P. Jung, M. Kasparick, T. Wild, F. Schaich, Y. Chen, S. T. Int. Conf. Commun. China (ICCC), Nov. 2015, pp. 1–4.
Brink, I. Gaspar, N. Michailow, A. Festag, L. Mendes, N. Cassiau, [53] I.-G. Jang and G.-D. Jo, “Study on the latency efficient IFFT design
D. Ktenas, M. Dryjanski, S. Pietrzyk, B. Eged, P. Vago, and F. Wied- method for low latency communication systems,” in Proc. Int. Symp.
mann, “5GNOW: non-orthogonal, asynchronous waveforms for future Intell. Signal Process. Commun. Syst. (ISPACS), Oct. 2016, pp. 1–4.
[54] Y. Zhou, H. Luo, R. Li, and J. Wang, “A dynamic states reduction [76] X. Wang, C. Cavdar, L. Wang, M. Tornatore, Y. Zhao, H. S. Chung,
message passing algorithm for sparse code multiple access,” in Proc. H. H. Lee, S. Park, and B. Mukherjee, “Joint Allocation of Radio and
IEEE Wireless Telecommun. Symp. (WTS), Apr. 2016, pp. 1–5. Optical Resources in Virtualized Cloud RAN with CoMP,” in Proc.
[55] K. Ganesan, T. Soni, S. Nunna, and A. R. Ali, “Poster: A TDM IEEE Global Commun. Conf. (GLOBECOM), Dec. 2016, pp. 1–6.
approach for latency reduction of ultra-reliable low-latency data in 5G,” [77] R. D. Taranto, S. Muppirisetty, R. Raulefs, D. Slock, T. Svensson, and
in Proc. IEEE Veh. Netw. Conf. (VNC), Dec. 2016, pp. 1–2. H. Wymeersch, “Location-Aware Communications for 5G Networks:
[56] D. A. Jaoude and M. Farhood, “Balanced truncation of linear systems How location information can improve scalability, latency, and robust-
interconnected over arbitrary graphs with communication latency,” in ness of 5G,” IEEE Signal Process. Mag., vol. 31, no. 6, pp. 102–112,
Proc. IEEE Conf. Decision and Control (CDC), Dec. 2015, pp. 5346– Nov. 2014.
5351. [78] T. Levanen, J. Pirskanen, T. Koskela, J. Talvitie, and M. Valkama,
[57] J. Östman, G. Durisi, E. G. Ström, J. Li, H. Sahlin, and G. Liva, “Low- “Low latency radio interface for 5G flexible TDD local area commu-
latency Ultra Reliable 5G Communications: Finite-Blocklength Bounds nications,” in Proc. IEEE Inter. Conf. on Commun. Works. (ICC), June
and Coding Schemes,” arXiv preprint arXiv:1611.08714, 2016. 2014, pp. 7–13.
[58] T. Taheri, R. Nilsson, and J. van de Beek, “Asymmetric Transmit- [79] R. Abreu, P. Mogensen, and K. I. Pedersen, “Pre-Scheduled Resources
Windowing for Low-Latency and Robust OFDM,” in Proc. IEEE for Retransmissions in Ultra-Reliable and Low Latency Communica-
Globecom Workshop (GC Wkshps), Dec. 2016, pp. 1–6. tions,” in IEEE Wire. Commun. and Netw. Conf. (WCNC), Mar. 2017,
[59] C. She, C. Yang, and T. Q. S. Quek, “Cross-layer Transmission Design pp. 1–5.
for Tactile Internet,” CoRR, vol. abs/1610.02800, 2016. [Online]. [80] B. Soret, P. Mogensen, K. I. Pedersen, and M. C. Aguayo-Torres,
Available: http://arxiv.org/abs/1610.02800 “Fundamental tradeoffs among reliability, latency and throughput in
[60] H. Hirai, T. Tojo, and M. M. N. Takaya, “Low Latency packet transport cellular networks,” in Proc. IEEE Globecom Workshop (GC Wkshps),
methods for remote-controlled devices in multi-RAT environments,” 2014, pp. 1391–1396.
in Proc. IEEE Int. Symp. on Local and Metropolitan Area Netw. [81] P. Niroopan and Y.-h. Chung, “A user-spread interleave division
(LANMAN), Jun. 2016, pp. 1–2. multiple access system,” Int. J. Adv. Resear. Computer and Commun.
[61] H. Ji, S. Park, and B. Shim, “Sparse coding for reliable control channel Eng., pp. 837–841, 2012.
transmission,” to appear on IEEE Commun. Mag., 2017. [82] J. C. Fricke, H. Schoeneich, and P. A. Hoeher, “An interleave-division
[62] S. A. Ashraf, F. Lindqvist, R. Baldemair, and B. Lindoff, “Con- multiple access based system proposal for the 4G uplink,” Proc. IST
trol Channel Design Trade-Offs for Ultra-Reliable and Low-Latency Mobil. Wire. Commun. Summit, 2005.
Communication System,” in Proc. IEEE Globecom Workshops (GC [83] H. Nikopour and H. Baligh, “Sparse code multiple access,” in Proc.
Wkshps), Dec. 2015, pp. 1–6. IEEE Pers. Indoor Mobile Radio Commun. (PIMRC), Sep. 2013, pp.
[63] S. Xia, X. Han, X. Yan, Z. Zuo, and F. Bi, “Uplink control channel 332–336.
design for 5G ultra-low latency communication,” in Proc. IEEE Pers. [84] N. Michailow, I. Gaspar, S. Krone, M. Lentmaier, and G. Fettweis,
Indoor Mobile Radio Commun. (PIMRC), Sep. 2016, pp. 1–6. “Generalized frequency division multiplexing: Analysis of an alterna-
[64] R. Jin, X. Zhong, and S. Zhou, “The Access Procedure Design for Low tive multi-carrier technique for next generation cellular systems,” in
Latency in 5G Cellular Network,” in Proc. IEEE Globecom Workshop Proc. Int. Symp. Wireless Commun. Syst. (ISWCS), Aug. 2012, pp.
(GC Wkshps), Dec. 2016, pp. 1–6. 171–175.
[65] T. Ohseki and Y. Suegara, “Fast outer-loop link adaptation scheme re- [85] A. Sahin, I. Guvenc, and H. Arslan, “A Survey on Multicarrier Com-
alizing low-latency transmission in LTE-Advanced and future wireless munications: Prototype Filters, Lattice Structures, and Implementation
networks,” in Proc. IEEE Radio and Wireless Symp. (RWS), Jan. 2016, Aspects,” IEEE Commun. Surv. Tutor., vol. 16, no. 3, pp. 1312–1338,
pp. 1–3. Third 2014.
[66] H. Tang, W. Zhang, W. Hardjawana, and B. Vucetic, “Improving
[86] F. Schaich and T. Wild, “Waveform contenders for 5G OFDM vs
latency and reliability in 5G Internet-of-Things networks,” in Proc.
FBMC vs UFMC,” in Proc. Int. Symp. on Commun., Control and Signal
IEEE Int. Conf. Smart Grid Commun. (SmartGridComm), Nov. 2016,
Process. (ISCCSP), May 2014, pp. 457–460.
pp. 509–513.
[87] S. Dutta, M. Mezzavilla, R. Ford, M. Zhang, S. Rangan, and M. Zorzi,
[67] B. Lee, S. Park, D. J. Love, H. Ji, and B. Shim, “Packet Structure and
“MAC layer frame design for millimeter wave cellular system,” in Euro.
Receiver Design for Low-Latency Communications with Ultra-Small
Conf. Netw. Commun. (EuCNC), Jun. 2016, pp. 117–121.
Packets,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), Dec.
2016, pp. 1–6. [88] S. Goyal, P. Liu, S. S. Panwar, R. A. Difazio, R. Yang, and E. Bala,
[68] M. Matthé, L. L. Mendes, N. Michailow, D. Zhang, and G. Fettweis, “Full duplex cellular systems: will doubling interference prevent dou-
“Widely Linear Estimation for Space-Time-Coded GFDM in Low- bling capacity?” IEEE Commun. Mag., vol. 53, no. 5, pp. 121–127,
Latency Applications,” IEEE Trans. Commun., vol. 63, no. 11, pp. May 2015.
4501–4509, Nov. 2015. [89] G. Zheng, “Joint Beamforming Optimization and Power Control for
[69] L. D. Gregorio, “A Note on Compressed Sensing for Low Latency,” Full-Duplex MIMO Two-Way Relay Channel,” IEEE Trans. Signal
in Proc Mobile Systems Technologies Workshop (MST), May 2015, pp. Process., vol. 63, no. 3, pp. 555–566, Feb. 2015.
8–11. [90] E. Ahmed, A. M. Eltawil, and A. Sabharwal, “Rate Gain Region and
[70] J. W. Choi, B. Shim, Y. Ding, B. Rao, and D. I. Kim, “Compressed Design Tradeoffs for Full-Duplex Wireless Communications,” IEEE
Sensing for Wireless Communications : Useful Tips and Tricks,” IEEE Trans. Wireless Commun., vol. 12, no. 7, pp. 3556–3565, Jul. 2013.
Commun. Surveys Tuts., vol. PP, no. 99, pp. 1–1, 2017. [91] A. Osseiran, F. Boccardi, V. Braun, K. Kusume, P. Marsch, M. Mater-
[71] C. Bockelmann, N. Pratas, H. Nikopour, K. Au, T. Svensson, C. Ste- nia, O. Queseth, M. Schellmann, H. Schotten, H. Taoka, H. Tullberg,
fanovic, P. Popovski, and A. Dekorsy, “Massive machine-type commu- M. A. Uusitalo, B. Timus, and M. Fallgren, “Scenarios for 5G mobile
nications in 5g: physical and MAC-layer solutions,” IEEE Commun. and wireless communications: the vision of the METIS project,” IEEE
Mag., vol. 54, no. 9, pp. 59–65, Sep. 2016. Commun. Mag., vol. 52, no. 5, pp. 26–35, May 2014.
[72] L. Lu, Y. Chen, W. Guo, H. Yang, Y. Wu, and S. Xing, “Prototype for [92] E. Shin and G. Jo, “Uplink frame structure of short TTI system,” in
5G new air interface technology SCMA and performance evaluation,” Proc. Int. Conf. Advanced Commun. Technol. (ICACT), Feb. 2017, pp.
China Commun., vol. 12, no. Supplement, pp. 38–48, Dec. 2015. 827–830.
[73] R. Ford, M. Zhang, M. Mezzavilla, S. Dutta, S. Rangan, and M. Zorzi, [93] Y. Hwang and J. Shin, “Slot based radio resource management for
“Achieving Ultra-Low Latency in 5G Millimeter Wave Cellular low latency in LTE-Advanced system,” in Proc. Int. Conf. Advanced
Networks,” CoRR, vol. abs/1602.06925, 2016. [Online]. Available: Commun. Technol. (ICACT), Feb. 2017, pp. 244–246.
http://arxiv.org/abs/1602.06925 [94] R. Shreevastav and R. S. Carbajo, “Dynamic RLC mode based upon
[74] T. Levanen, J. Pirskanen, and M. Valkama, “Radio interface design for link adaptation to reduce latency and improve throughput in cellular
ultra-low latency millimeter-wave communications in 5G Era,” in Proc. networks,” in Proc. IEEE Annu. Ubiquitous Computing, Electronics
IEEE Globecom Workshop (GC Wkshps), Dec. 2014, pp. 1420–1426. Mobile Commun. Conf. (UEMCON), Oct. 2016, pp. 1–6.
[75] S. Yoshioka, Y. Inoue, S. Suyama, Y. Kishiyama, Y. Okumura, J. Ke- [95] H. Li, M. Dong, and K. Ota, “Control Plane Optimization in Software-
pler, and M. Cudak, “Field experimental evaluation of beamtracking Defined Vehicular Ad Hoc Networks,” proc. IEEE Tran. Veh. Technol.,
and latency performance for 5G mmWave radio access in outdoor mo- vol. 65, no. 10, pp. 7895–7904, Oct 2016.
bile environment,” in Proc. IEEE Pers. Indoor Mobile Radio Commun. [96] O. Bazan and M. Jaseemuddin, “A Conflict Analysis Framework for
(PIMRC), Sep. 2016, pp. 1–6. QoS-Aware Routing in Contention-Based Wireless Mesh Networks
with Beamforming Antennas,” IEEE Trans. Wireless Commun., vol. 10, [118] G. Mountaser, M. Lema Rosas, T. Mahmoodi, and M. Dohler, On the
no. 10, pp. 3267–3277, Oct. 2011. feasibility of MAC and PHY split in Cloud RAN. IEEE, 5 2017.
[97] Huawei, “5G: A technology vision,” White paper, 2013. [119] G. Zhang, T. Q. S. Quek, A. Huang, M. Kountouris, and H. Shan,
[98] D. Wu, J. Wang, Y. Cai, and M. Guizani, “Millimeter-wave multimedia “Backhaul-aware base station association in two-tier heterogeneous
communications: challenges, methodology, and applications,” IEEE cellular networks,” in Proc. IEEE Inter. Works. on Sig. Proces. Adv.
Commun. Mag., vol. 53, no. 1, pp. 232–238, Jan. 2015. in Wire. Commun. (SPAWC), June 2015, pp. 390–394.
[99] A. Kamel, A. Al-Fuqaha, and M. Guizani, “Exploiting Client-Side [120] SDN architecture overview. [Online]. Available:
Collected Measurements to Perform QoS Assessment of IaaS,” IEEE https://www.opennetworking.org/images/stories/downloads/
Trans. Mobile Comput., vol. 14, no. 9, pp. 1876–1887, Sep. 2015. sdn-resources/technical-reports/SDN-architecture-overview-1.0.
[100] O. Bazan and M. Jaseemuddin, “A Conflict Analysis Framework for pdf,[Online;accessed5-May-2017]
QoS-Aware Routing in Contention-Based Wireless Mesh Networks [121] E. N. ISG, “Network Functions Virtualization, white paper,” in https:
with Beamforming Antennas,” IEEE Trans. Wireless Commun., vol. 10, //portal.etsi.org/nfv/nfv white paper.pdf, 2012. [Online; accessed 6-
no. 10, pp. 3267–3277, Oct. 2011. July-2016], Nov. 2016.
[101] X. Zhang, W. Cheng, and H. Zhang, “Heterogeneous statistical QoS [122] H. Celebi, N. Maxemchuk, Y. Li, and I. Guvenc, “Energy reduction in
provisioning over 5G mobile wireless networks,” IEEE Network, small cell networks by a random on/off strategy,” in Proc. IEEE Glob.
vol. 28, no. 6, pp. 46–53, Nov. 2014. Works. (GC Wkshps), 2013, pp. 176–181.
[102] W. Wang, X. Wang, and A. A. Nilsson, “Energy-efficient bandwidth [123] H. Celebi and I. Guvenc, “Load Analysis and Sleep Mode Optimization
allocation in wireless networks: algorithms, analysis, and simulations,” for Energy-Efficient 5G Small Cell Networks,” in Proc. IEEE Int. Conf.
IEEE Trans. Wireless. Commun., vol. 5, no. 5, pp. 1103–1114, May Commun. (ICC), 2017.
2006. [124] L. Pei, J. Huilin, G. Shen, D. Fei, and P. Zhiwen, “Impact of bs sleeping
[103] A. Balachandran, V. Sekar, A. Akella, S. Seshan, I. Stoica, and user association scheme on delay in ultra-dense networks,” in 2016
and H. Zhang, “Developing a Predictive Model of Quality of 8th Int. Conf. Wireless Commun. Signal Processing (WCSP), Oct 2016,
Experience for Internet Video,” SIGCOMM Comput. Commun. Rev., pp. 1–6.
vol. 43, no. 4, pp. 339–350, Aug. 2013. [Online]. Available: [125] D. Schulz, C. Alexakis, J. Hilt, M. Schlosser, K. Habel, V. Jungnickel,
http://doi.acm.org/10.1145/2534169.2486025 and R. Freund, “Low Latency Mobile Backhauling using Optical
[104] M. Agiwal, A. Roy, and N. Saxena, “Next Generation 5G Wireless Wireless Links,” in Proc. Symp. Broadband Coverage, Apr. 2015, pp.
Networks: A Comprehensive Survey,” IEEE Commun. Surveys Tuts., 1–3.
vol. 18, no. 3, pp. 1617–1655, thirdquarter 2016. [126] J. Chen and J. Li, “Efficient mobile backhaul architecture offering ultra-
[105] A. Roy, P. De, and N. Saxena, “Location-based social video sharing short latency for handovers,” in Proc. Int. Conf. Transparent Opt. Netw.
over next generation cellular networks,” IEEE Commun. Mag., vol. 53, (ICTON), Jul. 2016, pp. 1–1.
no. 10, pp. 136–143, Oct. 2015. [127] R. J. Weiler, M. Peter, W. Keusgen, E. Calvanese-Strinati,
A. De Domenico, I. Filippini, A. Capone, I. Siaud, A.-M. Ulmer-Moll,
[106] A. Balachandran, V. Sekar, A. Akella, S. Seshan, I. Stoica,
A. Maltsev et al., “Enabling 5G backhaul and access with millimeter-
and H. Zhang, “A Quest for an Internet Video Quality-of-
waves,” in Proc. IEEE Euro. Conf. Netw. Commun. (EuCNC), 2014,
experience Metric,” in Proc. ACM Workshop Hot Topics in
pp. 1–5.
Netw., ser. HotNets-XI, 2012, pp. 97–102. [Online]. Available:
[128] Z. Gao, L. Dai, D. Mi, Z. Wang, M. A. Imran, and M. Z. Shakir,
http://doi.acm.org/10.1145/2390231.2390248
“MmWave massive-MIMO-based wireless backhaul for the 5G ultra-
[107] ——, “Developing a Predictive Model of Quality of Experience for
dense network,” IEEE Wireless Commun., vol. 22, no. 5, pp. 13–21,
Internet Video,” SIGCOMM Comput. Commun. Rev., vol. 43, no. 4,
2015.
pp. 339–350, Aug. 2013. [Online]. Available: http://doi.acm.org/10.
[129] R. Taori and A. Sridharan, “Point-to-multipoint in-band mmwave
1145/2534169.2486025
backhaul for 5G networks,” IEEE Commun. Mag., vol. 53, no. 1, pp.
[108] J. Pagé and J. M. Dricot, “Software-defined networking for low-latency 195–201, 2015.
5G core network,” in Proc. Int. Conf. Military Commun. and Inform. [130] T. Levanen, J. Pirskanen, and M. Valkama, “Radio interface design for
Syst. (ICMCIS), May 2016, pp. 1–7. ultra-low latency millimeter-wave communications in 5G era,” in Proc.
[109] R. Trivisonno, R. Guerzoni, I. Vaishnavi, and D. Soldani, “Towards IEEE Globecom Workshop (GC Wkshps), Dec. 2014, pp. 1420–1426.
zero latency Software Defined 5G Networks,” in Proc. IEEE Conf. [131] V. G. Nguyen, A. Brunstrom, K. J. Grinnemo, and J. Taheri,
Commun. Workshop (ICCW), Jun. 2015, pp. 2566–2571. “SDN/NFV-based Mobile Packet Core Network Architectures: A Sur-
[110] D. Szabo, A. Gulyas, F. H. P. Fitzek, and D. E. Lucani, “Towards vey,” IEEE Commun. Surv. Tutor., vol. PP, no. 99, pp. 1–1, 2017.
the Tactile Internet: Decreasing Communication Latency with Network [132] T. Taleb, K. Samdanis, B. Mada, H. Flinck, S. Dutta, and D. Sabella,
Coding and Software Defined Networking,” in Proc. Euro. Conf., May “On Multi-Access Edge Computing: A Survey of the Emerging 5G
2015, pp. 1–6. Network Edge Architecture Orchestration,” IEEE Commun. Surv. Tu-
[111] J. Heinonen, T. Partti, M. Kallio, K. Lappalainen, H. Flinck, and tor., vol. PP, no. 99, pp. 1–1, 2017.
J. Hillo, “Dynamic tunnel switching for SDN-based cellular core [133] B. Martini, F. Paganelli, P. Cappanera, S. Turchi, and P. Castoldi,
networks,” in Proc. ACM 4th workshop All things cellular: operations, “Latency-aware composition of Virtual Functions in 5G,” in Proc. IEEE
applications, and challenges, 2014, pp. 27–32. Conf. on Netw. Soft. (NetSoft), Apr. 2015, pp. 1–6.
[112] J. Zhang, W. Xie, and F. Yang, “An architecture for 5G mobile network [134] K. Shanmugam, N. Golrezaei, A. G. Dimakis, A. F. Molisch, and
based on SDN and NFV,” in Proc. Int. Conf. Wireless, Mobile and G. Caire, “FemtoCaching: Wireless Content Delivery Through Dis-
Multi-Media (ICWMMN), Nov. 2015, pp. 87–92. tributed Caching Helpers,” IEEE Trans. Inf. Theory, vol. 59, no. 12,
[113] C. Liang, F. R. Yu, and X. Zhang, “Information-centric network func- pp. 8402–8413, Dec. 2013.
tion virtualization over 5g mobile wireless networks,” IEEE Network, [135] S. Mosleh, L. Liu, H. Hou, and Y. Yi, “Coordinated Data Assignment:
vol. 29, no. 3, pp. 68–74, May 2015. A Novel Scheme for Big Data over Cached Cloud-RAN,” in Proc.
[114] E. Cau, M. Corici, P. Bellavista, L. Foschini, G. Carella, A. Edmonds, IEEE Global Commun. Conf. (GLOBECOM), Dec. 2016, pp. 1–6.
and T. M. Bohnert, “Efficient exploitation of mobile edge computing [136] X. Peng, J. C. Shen, J. Zhang, and K. B. Letaief, “Backhaul-Aware
for virtualized 5g in epc architectures,” in Proc. IEEE Mobile Cloud Caching Placement for Wireless Networks,” in Proc. IEEE Global
Computing, Services, and Eng. (MobileCloud), Mar. 2016, pp. 100– Commun. Conf. (GLOBECOM), Dec. 2015, pp. 1–6.
109. [137] J. Liu, B. Bai, J. Zhang, and K. B. Letaief, “Content caching at the
[115] B. Martini, F. Paganelli, P. Cappanera, S. Turchi, and P. Castoldi, wireless network edge: A distributed algorithm via belief propagation,”
“Latency-aware composition of Virtual Functions in 5G,” in Proc. IEEE in Proc. IEEE Int. Conf. Commun. (ICC), May 2016, pp. 1–6.
Conf. Netw. Softw. (NetSoft), Apr. 2015, pp. 1–6. [138] M. A. Maddah-Ali and U. Niesen, “Decentralized Coded Caching At-
[116] J. Heinonen, P. Korja, T. Partti, H. Flinck, and P. Pöyhönen, “Mobility tains Order-Optimal Memory-Rate Tradeoff,” IEEE/ACM Trans. Netw.,
management enhancements for 5G low latency services,” in Proc. IEEE vol. 23, no. 4, pp. 1029–1040, Aug. 2015.
Int. Conf. on Commun. Workshop (ICCW), May 2016, pp. 68–73. [139] H. Hsu and K. C. Chen, “A Resource Allocation Perspective on
[117] C. N. Mao, M. H. Huang, S. Padhy, S. T. Wang, W. C. Chung, Y. C. Caching to Achieve Low Latency,” IEEE Commun. Lett., vol. 20, no. 1,
Chung, and C. H. Hsu, “Minimizing Latency of Real-Time Container pp. 145–148, Jan. 2016.
Cloud for Software Radio Access Networks,” in Proc. IEEE Inter. Conf. [140] G. Carofiglio, L. Mekinda, and L. Muscariello, “FOCAL: Forwarding
on Cloud Comp. Tech. and Sci. (CloudCom), Nov 2015, pp. 611–616. and Caching with Latency Awareness in Information-Centric Network-
ing,” in Proc. IEEE Global Workshop (GC Wkshps), Dec. 2015, pp. [163] F. Boccardi, R. W. Heath, A. Lozano, T. L. Marzetta, and P. Popovski,
1–7. “Five disruptive technology directions for 5G,” IEEE Commun. Mag.,
[141] Y. Sun, Z. Chen, and H. Liu, “Delay Analysis and Optimization vol. 52, no. 2, pp. 74–80, Feb. 2014.
in Cache-Enabled Multi-Cell Cooperative Networks,” in Proc. IEEE [164] M. Gregori, J. Gómez-Vilardebó, J. Matamoros, and D. Gündüz,
Global Commun. Conf. (GLOBECOM), Dec. 2016, pp. 1–7. “Wireless Content Caching for Small Cell and D2D Networks,” IEEE
[142] X. Zhang and Q. Zhu, “P2P Caching Schemes for Jointly Minimizing Jour. on Select. Area. in Commun., vol. 34, no. 5, pp. 1222–1234, May
Memory Cost and Transmission Delay over Information-Centric Net- 2016.
works,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), Dec. [165] J. Goseling, O. Simeone, and P. Popovski, “Delivery Latency Regions
2016, pp. 1–6. in Fog-RANs with Edge Caching and Cloud Processing,” arXiv preprint
[143] A. Sengupta, R. Tandon, and O. Simeone, “Cache aided wireless arXiv:1701.06303, 2017.
networks: Tradeoffs between storage and latency,” in Proc. Annu. Conf. [166] J. Koh, O. Simeone, R. Tandon, and J. Kang, “Cloud-aided edge
Inf. Sci. and Syst. (CISS), Mar. 2016, pp. 320–325. caching with wireless multicast fronthauling in fog radio access net-
[144] R. Tandon and O. Simeone, “Cloud-aided wireless networks with works,” in Proc. IEEE Wireless Commun. and Netw. Conf. (WCNC),
edge caching: Fundamental latency trade-offs in fog Radio Access Mar. 2017, pp. 1–6.
Networks,” in Proc. IEEE Int. Symp. Inf. Theory (ISIT), Jul. 2016, [167] A. Sengupta, R. Tandon, and O. Simeone, “Pipelined Fronthaul-Edge
pp. 2029–2033. Content Delivery in Fog Radio Access Networks,” in Proc. IEEE
[145] F. Xu, K. Liu, and M. Tao, “Cooperative Tx/Rx caching in interference Globecom Workshop (GC Wkshps), Dec. 2016, pp. 1–6.
channels: A storage-latency tradeoff study,” in Proc. IEEE Inte. Symp. [168] S. M. Azimi, O. Simeone, A. Sengupta, and R. Tandon, “On-
on Info. Theo. (ISIT), July 2016, pp. 2034–2038. line Edge Caching in Fog-Aided Wireless Network,” arXiv preprint
[146] N. Naderializadeh, M. A. Maddah-Ali, and A. S. Avestimehr, “Funda- arXiv:1701.06188, 2017.
mental Limits of Cache-Aided Interference Management,” IEEE Trans. [169] C. Fang, F. R. Yu, T. Huang, J. Liu, and Y. Liu, “A survey of energy-
Inf. Theory, vol. 63, no. 5, pp. 3092–3107, May 2017. efficient caching in information-centric networking,” IEEE Commun.
[147] Y. Cao, F. Xu, K. Liu, and M. Tao, “A Storage-Latency Tradeoff Study Mag., vol. 52, no. 11, pp. 122–129, 2014.
for Cache-Aided MIMO Interference Networks,” in Proc. IEEE Global [170] M. Ji, G. Caire, and A. F. Molisch, “Fundamental Limits of Caching
Commun. Conf. (GLOBECOM), Dec. 2016, pp. 1–6. in Wireless D2D Networks,” IEEE Trans. Inf. Theory, vol. 62, no. 2,
[148] S. M. Azimi, O. Simeone, and R. Tandon, “Fundamental Limits on pp. 849–869, Feb. 2016.
Latency in Small-Cell Caching Systems: An Information-Theoretic [171] T. Wirth, M. Mehlhose, J. Pilz, R. Lindstedt, D. Wieruch, B. Holfeld,
Analysis,” in Proc. IEEE Global Commun. Conf. (GLOBECOM), Dec. and T. Haustein, “An Advanced Hardware Platform to Verify 5G
2016, pp. 1–6. Wireless Communication Concepts,” in Proc. IEEE Veh. Technol. Conf.
[149] M. A. Maddah-Ali and U. Niesen, “Fundamental Limits of (VTC Spring), May 2015, pp. 1–5.
Caching,” CoRR, vol. abs/1209.5807, 2012. [Online]. Available: [172] X. Meng, J. Li, D. Zhou, and D. Yang, “5G technology requirements
http://arxiv.org/abs/1209.5807 and related test environments for evaluation,” China Commun., vol. 13,
[150] J. Zhang and P. Elia, “Fundamental Limits of Cache-Aided Wireless no. Supplement2, pp. 42–51, 2016.
BC: Interplay of Coded-Caching and CSIT Feedback,” IEEE Trans. [173] S. Parkvall, J. Furuskog, Y. Kishiyama, A. Harada, T. Nakamura,
Inf. Theory, vol. 63, no. 5, pp. 3142–3160, May 2017. P. Naucler, and B. Halvarsson, “5G Wireless Access - Trial Concept
[151] M. Ji, M. F. Wong, A. M. Tulino, J. Llorca, G. Caire, M. Effros, and and Results,” in Proc. IEEE Global Commun. Conf. (GLOBECOM),
M. Langberg, “On the fundamental limits of caching in combination Dec. 2015, pp. 1–6.
networks,” in Proc. IEEE Int. Workshop Signal Process. Advances in [174] T. Wirth, M. Mehlhose, J. Pilz, B. Holfeld, and D. Wieruch, “5G
Wireless Commun. (SPAWC), Jun. 2015, pp. 695–699. new radio and ultra low latency applications: A PHY implementation
[152] M. M. Amiri and D. Gündüz, “Fundamental Limits of Coded Caching: perspective,” in Proc. Asilomar Conf. Signals, Syst. and Comput., Nov.
Improved Delivery Rate-Cache Capacity Tradeoff,” IEEE Trans. Com- 2016, pp. 1409–1413.
mun., vol. 65, no. 2, pp. 806–815, Feb. 2017. [175] H. Cao, S. Gangakhedkar, A. R. Ali, M. Gharba, and J. Eichinger, “A
[153] A. Sengupta, R. Tandon, and T. C. Clancy, “Fundamental Limits of 5G V2X testbed for cooperative automated driving,” in Proc. IEEE
Caching With Secure Delivery,” IEEE Trans. Inf. Forensics Security, Veh. Netw. Conf. (VNC), Dec. 2016, pp. 1–4.
vol. 10, no. 2, pp. 355–370, Feb. 2015. [176] X. Liu, H. Zeng, N. Chand, and F. Effenberger, “Efficient Mobile
[154] X. Peng, J. C. Shen, J. Zhang, and K. B. Letaief, “Joint data assignment Fronthaul via DSP-Based Channel Aggregation,” J. Light. Technol.,
and beamforming for backhaul limited caching networks,” in Proc. vol. 34, no. 6, pp. 1556–1564, Mar. 2016.
IEEE Pers. Indoor Mobile Radio Commun. (PIMRC), Sep. 2014, pp. [177] T. Wirth, M. Mehlhose, J. Pilz, B. Holfeld, and D. Wieruch, “5G
1370–1374. new radio and ultra low latency applications: A PHY implementation
[155] E. Baştuğ, M. Bennis, and M. Debbah, “Cache-enabled small cell perspective,” in Proc. Asilomar Conf. Signals, Syst. and Comput., Nov.
networks: Modeling and tradeoffs,” in Proc. Int. Symp. on Wireless 2016, pp. 1409–1413.
Commun. Syst. (ISWCS), Aug. 2014, pp. 649–653. [178] J. Zhang, W. Xie, F. Yang, and Q. Bi, “Mobile edge computing and
[156] G. Carofiglio, L. Mekinda, and L. Muscariello, “LAC: Introducing field trial results for 5G low latency scenario,” China Commun., vol. 13,
latency-aware caching in Information-Centric Networks,” in Proc. no. Supplement2, pp. 174–182, 2016.
IEEE Conf. Local Comp. Netw. (LCN), Oct. 2015, pp. 422–425. [179] P. Guan, X. Zhang, G. Ren, T. Tian, A. Benjebbour, Y. Saito, and
[157] A. M. Girgis, O. Ercetin, M. Nafie, and T. ElBatt, “Decentralized Y. Kishiyama, “Ultra-Low Latency for 5G-A Lab Trial,” arXiv preprint
Coded Caching in Wireless Networks: Trade-off between Storage and arXiv:1610.04362, 2016.
Latency,” arXiv preprint arXiv:1701.06673, 2017. [180] W. Jiang, G. Feng, and S. Qin, “Optimal cooperative content caching
[158] A. Sengupta, R. Tandon, and O. Simeone, “Cloud RAN and edge and delivery policy for heterogeneous cellular networks,” IEEE Trans.
caching: Fundamental performance trade-offs,” in Proc. IEEE Int. Mobile Comput., vol. 16, no. 5, pp. 1382–1393, 2017.
Workshop Signal Process. Advances in Wireless Commun. (SPAWC), [181] X. Huang, Z. Zhao, and H. Zhang, “Latency Analysis of Cooper-
Jul. 2016, pp. 1–5. ative Caching with Multicast for 5G Wireless Networks,” in Proc.
[159] C. Dehos, J. L. González, A. De Domenico, D. Kténas, and L. Dussopt, IEEE/ACM Int. Conf. Utility and Cloud Computing (UCC), Dec. 2016,
“Millimeter-wave access and backhauling: the solution to the exponen- pp. 316–320.
tial data traffic increase in 5G mobile communications systems?” IEEE [182] L. Jiang, G. Feng, and S. Qin, “Cooperative content distribution for 5G
Commun. Mag., vol. 52, no. 9, pp. 88–95, 2014. systems based on distributed cloud service network,” in Proc. IEEE Int.
[160] M. Chen, Y. Hao, M. Qiu, J. Song, D. Wu, and I. Humar, “Mobility- Conf. Commun. Workshop (ICCW), Jun. 2015, pp. 1125–1130.
aware caching and computation offloading in 5G ultra-dense cellular [183] Y. Liu, C. Yang, Y. Yao, and B. Xia, “Modeling and analysis on end-
networks,” Sensors, vol. 16, no. 7, p. 974, 2016. to-end performance of cache-enabled networks,” in Proc. Int. Conf.
[161] S. H. Chae, J. Y. Ryu, T. Q. S. Quek, and W. Choi, “Cooperative Wireless Commun. Signal Process. (WCSP), Oct. 2016, pp. 1–5.
Transmission via Caching Helpers,” in Proc. IEEE Global Commun.
Conf. (GLOBECOM), Dec. 2015, pp. 1–6.
[162] X. Ge, B. Yang, J. Ye, G. Mao, C. Wang, and T. Han,
“Spatial Spectrum and Energy Efficiency of Random Cellular
Networks,” CoRR, vol. abs/1501.05368, 2015. [Online]. Available:
http://arxiv.org/abs/1501.05368

You might also like