Kaya 2021

,(((:LUHOHVV&RPPXQLFDWLRQVDQG1HWZRUNLQJ&RQIHUHQFH :&1&
Deep Learning-based Predictive Beam Management

for 5G mmWave Systems
Aliye Özge Kaya, Harish Viswanathan
2021 IEEE Wireless Communications and Networking Conference (WCNC) | 978-1-7281-9505-6/20/$31.00 ©2021 IEEE | DOI: 10.1109/WCNC49053.2021.9417452
Nokia Bell Labs, Murray Hill, NJ 07974, USA
Abstract—Periodic measurement reporting based beam man- with increasing number of beams and/or users for beam-based
agement is not sufficiently agile for 5G New Radio (NR) and NR systems [4]. Without sufficient number of measurement
comes with significant overhead that scales with the number reports frequently being fed back by the UE, signal blocking
of beams and users. Furthermore, such an approach to beam
selection is unlikely to be sufficient to avoid signal blocking in may interrupt the service since the base station does not
real world scenarios. We propose a method to accurately predict know which beam of which TRP to switch transmission to
in advance the best serving beams and transmission points as as the signal begins to degrade due to blocking. Between
users move through the network and thereby eliminate the need two measurement reports, best serving beams could become
for frequent measurement reporting. Our prediction approach obsolete and cause a degradation in service quality, which may
applies deep learning techniques similar to that used in Natural
Language Processing (NLP) for translation/sentence completion not be remedied until a handover or beam switch following
tasks to the problem of predicting the best serving beams. The the next measurement report. Thus, even if a user is less likely
proposed solution enables the network to proactively switch users to switch beams, frequent measurement reports are still ex-
to new beams or cells to reduce blockage and handover related changed to monitor the best beam. Exchange of measurement
interruptions especially in high mobility scenarios. We evaluate reports cause both messaging overhead and wastage of the
our scheme in realistic scenarios using a new modeling technique
where computer vision is used to obtain mobility traces of users time-frequency resources, degrading user throughput.
from videos of live environments. We show significant benefits in Our solution to this problem is to enable the gNB to predict
terms of measurement report overhead reduction and signal-to- the next best serving beam for a UE and to proactively switch
noise ratio enhancement through blockage prevention in several beams without waiting for the next measurement report. Users
scenarios. in vehicles and pedestrians follow paths defined by roads and
walkways. They do not move randomly and typically follow
I. I NTRODUCTION
social customs, for example, they do not run into each other,
Next generation base stations (gNB) will be connected to they yield way to others, and often walk together [5]. In
one or more transmission reception points (TRP) that are computer vision, the task of trajectory prediction is viewed
capable of transmitting beamformed signals based on a pre- as a sequence generation task to predict future trajectories
determined set of directional beam patterns called the beamset. of objects based on their past positions. The authors of [5]
A TRP consists of a radio unit and an antenna panel capable propose a long term short term memory (LSTM) deep learning
of forming the beams in the beamset, and TRPs connected model, which can learn general human movement and predict
to a base station may be spatially distributed. Narrow beams their future trajectories. This level of social intelligence has
with significant beamforming gain are required to overcome been mostly ignored in wireless access systems, and the re-
path loss and extend the coverage especially in mm wave, source allocation algorithms are designed largely ignoring the
which implies a large number of beams in the beamset to spatio-temporal aspects of user distributions or their mobility
cover a whole cell serving area. As users move through the patterns. On the other hand, there is a significant amount
network, they frequently switch between beams, TRPs and of research on localization of individual users and use of
gNBs in order to receive the strongest possible signal [1], location and velocity information to provide better service
[2]. To facilitate beam switching, the base station has to [6]. In particular, in mmWave bands, because of blocking and
periodically transmit using each of the beams in the beamset reflections from structures even the UEs close to each other in
reference signals in the form of synchronization signaling terms of physical location may be served by different beams,
blocks (SSB) or channel state information reference signals TRPs or gNBs. Therefore, beam prediction should not rely
(CSI-RS) that a user equipment (UE) can use to measure entirely on explicit localization.
the received signal quality on the different beams and report The main contribution of this paper is a novel method for
the measurements to the base station for beam management. beam management that relies on prediction of best serving
Moreover, conveying information about the allocation of these beams using a LSTM network. Rather than relying on physical
resources requires extra UE-specific radio resource control location, channel state information or angular spread informa-
(RRC) and medium-access control (MAC) signaling [3]. The tion the prediction is performed by treating the sequences of
number of required time-frequency resources for reference beam indices as a sequence of letters or words in a language
signals and the amount of uplink resources required to send thereby enabling the application of NLP techniques. We also
measurement reports from the UE pose an overhead problem describe how in-field training can be performed to train the
978-1-7281-9505-6/21/$31.00 ©2021 IEEE
Authorized licensed use limited to: Carleton University. Downloaded on June 04,2021 at 22:57:35 UTC from IEEE Xplore. Restrictions apply.
predictor. The second major contribution of this work is to Although their approach is beneficial for UE discovery, it
show how the prediction, even when is not highly accurate, would not work to mitigate blockages for specific UEs along
can be combined with measurement reports to operate the a path. The authors in [14] propose a deep learning approach
system efficiently by using the prediction reliability metrics for coordinated beamforming using deep learning. The deep
obtained from the predictor. Hence our proposed method can learning model learns to predict the beamforming vectors at
be applied to all scenarios independent of how predictable cooperating base stations based on the pilot sequences. Our
the mobility patterns are. We evaluate our scheme in realistic approach is applicable for non-cooperating sites as well.
scenarios using a new modeling technique where computer The paper is organized as follows. In Section II we describe
vision is used to obtain mobility traces of users from videos the beam management problem and provide an overview of our
of live environments and video frame by video frame ray prediction-based solution. In Section III the neural network
tracing is used to model the radio environment. We show architecture, training and inference procedures are described
significant benefits of our approach in terms of measurement in detail. Section IV showcases the usage and benefits of our
report overhead reduction and blockage prevention in multiple framework through simulations on two different scenarios. We
use cases. conclude with a summary in Section V.
We rely on a LSTM neural network approach to predict
the next best beams or sites to serve the users [5]. LSTM II. P REDICTIVE BEAM MANAGEMENT
is a recurrent neural network where the network is capable To enable beam management for a large number of beams,
of learning sequences of arbitrary length while limiting the 5G radio access networks support hierarchical beam refine-
error propagation using gates. Sequence-to-sequence learning ment. First Level is SSB beams limited up to 64 within a
is about training models to convert sequences from one cell. These are transmitted periodically and used for initial
domain to another using an encoder and decoder [7]. The access by UE to establish connection to the base station.
encoder-decoder architecture is referred as autoencoder. The At the second level, narrower refinement beams with SSB
encoder encodes and compresses sequence data step by step. beams as root beams are used to improve system performance.
The output of the encoder, which is also an input to the For the purposes of beam refinement and tracking, the gNB
decoder, represents the input sequence as a fixed length vector. can configure CSI-RS resources, where each CSI-RS may
The decoder interprets it for each step of the autoencoder. be sent on a different beam and direct the UE to provide
LSTM auto-encoder architecture could be regarded as a data relevant measurement reports. With multiple TRPs in a rapidly
compression algorithm with loss where the compression and changing environment such as in a city with cars and people
decompression functions are learned from examples rather moving, frequent reports from a large number of beams will
than being engineered with specifically designed algorithms. be required to overcome blocking.
The data driven encoding and decoding functions are designed During periodic reporting of the best beam index, each
using neural networks. UE sends at specified time intervals measurement reports that
There are many publications in the area of beam manage- include reference signal received power (RSRP) and identifiers
ment focusing on tracking beams. The reference [8] provides of the measured beams. As illustrated in Figure 1 the reported
an extensive overview of the recently proposed measurement best beam is used by the gNB until the next measurement
techniques for beam and mobility management in mmWave report. The gNB typically picks the strongest beam reported by
cellular networks and their impact on standards. The authors the UE as the serving beam. That limits the agility of the gNB
in [9] propose an angle domain hybrid precoding and channel to react to changes in signal quality especially, if a beam pair
tracking method for mmWave massive MIMO system where is rapidly blocked and there is no sufficient time for regular
they utilize an ekinematic equation characterize the movement beam adjustment. Each time the measured RSRP is below a
of users and track user’s direction of arrival. In our work, configured value a beam failure instance is declared by the UE.
we use realistic traces extracted from videos for creating test If the number of such instances exceeds a configured value,
scenarios. The authors in [10] employ an analog architecture, the UE declares beams failure and initiates the beam recovery
i.e. the base station uses a small number of radio frequency procedure to find a new beam pair where the connectivity
chains to drive a large number of antenna elements and forms could be restored [15]. However, this process will result in
the tracking beams in the analog domain. [11] uses machine service interruption or increased latency for packet delivery
learning to predict channel states and to derive beamforming and lowered throughput. By predicting the next beams, TRPs
vectors. Our approach is more practical since we assume grid or gNBs to serve a UE we can be avoid beam and radio link
of beams as widely used by the current beam-based wireless failures and save on time and frequency resources needed for
standards. The authors in [12] propose extracting the user recovery.
locations in high speed trains for beam tracking purposes. The essence of our work is to use an LSTM autoencoder
We do not require explicit user localization for predicting network to learn the features of the beam space, the environ-
next likely beams. Our approach could easily be applied to ment and the user mobility pattern. The only data the gNB
the high-speed train use case as well. The authors in [13] needs to accumulate is the serving beam index reports which
propose an algorithm to prioritize the sectors where more users are already available at gNB for each UE being served. The
are expected to be found according to previous experience. beams across gNBs need to be labeled with non-overlapping
Fig. 1. Periodic Measurement Reporting: The best beam is used by the

gNB until the next measurement report. Therefore, the gNB may not react to
blockages between measurement reports with agility.
Fig. 3. Algorithm utilizing predictions while enforcing a minimum signal-to-

noise ratio (SNR) threshold and monitoring reliability of prediction
a given history of size P we predict beams for the next F

Fig. 2. We label beams at different TRPs with non-overlapping beam indexes slots. If the performance (based on traditional short time scale
channel quality indicator feedback) with the predicted beams
indexes. Figure 2 shows a representative scenario. Users are is above a threshold, we append those beams to the past
likely to move on similar paths again and again at varying beam sequence which we refer to as history. Based on the
speeds in different directions. They will go through similar updated history we predict beams for the next F beams. If the
sequences of beams as they move through the network. The probabilities of individual beams are not above the threshold to
crossing in the figure is being served by two TRPs forming be declared reliable, top K best beams are predicted. The gNB
many narrow beams. We label the beams with non-overlapping configures CSI-RS measurement at resources corresponding to
indices across all TRPs. those K beams. For K = 2, namely for Top-2 prediction, at
When the user first becomes active in the cell, it is not each slot the UE measures the resources corresponding to the
possible to immediately start the prediction of the best serving most likely K beams and sends beam selection report. If Top-
beams. There is an initial period when periodic measurement K predictions are unreliable and serving beams are yielding
reporting is utilized to collect the sequence of serving beams. signal levels below a threshold we fall back to conventional
Once a sufficiently long history is generated, the prediction is frequent measurement reporting mode until a new history is
turned on and the reporting mechanism is modified to reduce built.
the overhead. Once the required history length is available, Note that if the beam sequence predicted involves a tran-
the scheme predicts the next best beam at periodic intervals sition to a beam from a different TRP, then the gNB can
at a frequency that is much higher than that of the measure- trigger the change in TRP. The prediction procedure can also
ment reporting. In other words, the measurement reporting be used for handover prediction by treating each neighboring
frequency is reduced once the prediction is triggered. The base station as a single unique beam that UEs transition to.
predicted sequence of beams is matched against the periodic When the prediction result is a transition to one of these
reports to ensure that the prediction is accurate. The predicted beams representing neighboring base stations, then handover
beams themselves are then used as input to the predictor. We procedure may be triggered to the corresponding neighboring
define the best beam prediction accuracy as the proportion of gNB.
time slots in which the predicted beam with highest probability
III. D EEP LEARNING - BASED PREDICTION SYSTEM
for a slot is the best beam to serve the user in that slot. We
assume that only a single beam serves a user in each slot. Best In the following subsections we explain the details of the
beam prediction accuracy depends strongly on user mobility prediction system starting with the structure of the neural
patterns which can change rapidly. The beam management network, followed by the steps for data generation, training
scheme should be robust against sudden changes in prediction and inference.
accuracy. Hence in the proposed beam management scheme,
the output of neural network is not only designed to predict A. LSTM Autoencoder
the beam index of the next best beam but also the probability An LSTM autoencoder architecture is used to learn the
of each possible beam. If highest probability value falls below features of the beamspace, environment and the user mobility
a threshold, then the result of the prediction is a set of K most pattern. Figure 6 depicts the process of performing sequence
likely beams rather than a single best beam, so that the sum to sequence prediction with the autoencoder, which consists
of probabilities across the K beams exceeds the threshold. In of two LSTM cells one for encoding and one for decoding.
such a case, an on-demand measurement report is triggered The first LSTM cell, which is the encoder, processes the input
to obtain from the UE the RSRP value and the indication sequence and returns its own internal state. The state variables
of best beam from among the set of K beams. With this of encoder are fed to the decoder. The neural network is trained
approach, the beam management scheme automatically obtains to predict the next beam in the beam sequence given the
more feedback from the UEs whose next best beams are less sequence of previous beam indexes. The training methodology
predictable resulting in an overall reduction in overhead. that we use where the target sequence is the same sequence
Figure 3 illustrates the above scheme incorporating the as the input except one step into the future is called teacher
prediction, showing a reduction in the reporting. Based on forcing [16]. Effectively, the decoder learns to create beam
Fig. 5. Training
are captured and stored in gNB memory along with their
Fig. 4. The LSTM autoencoder consists of two LSTM cells and a dense time stamps. During the training phase there may be some
layer. Each LSTM layer has 16896 parameters. The dense layer has 3267 non-optimal sequences of beams, e.g., the best beams may
parameters. It takes 2 ms to train a batch of 128 samples with CPU resources
(10 x I9-7900X 3.30GHz@4.30 GHz 128 GB Memory) be blocked soon after switching to that beam and radio link
failures may occur. So, training data can be constructed by
indices starting from n+1 conditioned on indices up to current using the sequence of beam indexes with good signal quality
index n. and suppressing intermediate beam indices with very poor
Once the training is completed the autoencoder is run in signal strength and short dwell times. Therefore, the gNB
may select an actual beam sequence that served the UE for a
inference mode where the beam indices used previously are
given period without any outage or construct a training beam
fed into the autoencoder and next set of beams are predicted.
In the inference mode, the input sequence of size P is encoded sequence from measurement reports ignoring beam switching
triggered with a gain below a threshold or followed by another
into the state vector output of the encoder. The state vectors
beam switch within a very short dwell time.
and the target sequence are fed to the decoder. The decoder
The beam sequences are interpolated to have an index for
predicts the probabilities of each beam to be the next. The
equidistant time intervals e.g every 10 ms which we refer as a
predicted beam sequence is appended to the target sequence.
slot. If the data in the memory does not include a beam index
This process is repeated until F future beams are predicted.
for each 10 ms sampling interval, then the previous beam may
The LSTM autoencoder which is used for predictions con-
be repeated, for example, because the same beam may be used
sists of two LSTM cells and a dense layer. Figure 4 shows the
for a period of time over multiple 10 ms intervals. Through
specific structure and parameters of the LSTM autoencoder.
such interpolation and processing of real beam sequences we
Each input beam index integer value is encoded as a binary
construct for each UE a beam sequence sampled at constant
vector that is all zero values except the beam index. This step
time intervals for training the predictor.
is called one-hot-encoding. The input sequence is the sequence
After a beam sequence of an arbitrary length is constructed,
of one-hot-encoded vectors which are fed into the input layer
we create samples of length P + F consisting of a first portion
labeled by input1 . Since we have N = 99 possible beams,
of P beam indices, and a second portion of future F beam
each one-hot-encoded vector is of size 99. The auto-encoder
indexes as the correct output for training. Consider a past
consists of two LSTM cells of size 32 which are denoted in
sequence of 600 ms. At 10 ms sampling rate, the past sequence
Figure 4 as lstm1 and lstm2 . The output of the decoder dense
would consist of P = 60 slots. If we want to predict 40 ms
layer is a vector where each entry is the probability of each
ahead, the future samples will be of size F = 4 slots. Figure 5
beam being the next determined using a SoftMax function. The
illustrates the neural network training steps. The past samples
index corresponding to the highest probability is deemed as the
collected from all UEs are fed in batches to the neural network.
most likely beam, the index of the second highest probability
The prediction of the neural network is compared with the
is deemed as the second most likely beam and so on.
correct future samples. The parameters of the neural network
B. Generating Beam Sequences are adjusted to minimize the error between predictions and
ground truth. We use categorical cross entropy function as
When gNBs are first deployed, they are operated in training defined in (1) where each unique beam index is treated as a
mode where beam refinement procedures are implemented category. In (1) F is the number of samples to be predicted, C
with traditional CSI-reporting configuration. During this time, is the number of unique beam index categories , 1yi∈Cc is the
each user device is asked to report substantial number of indicator function of the ith prediction belonging to the cth
measurement reports for estimating the quality of different category. pmodel [yi ∈ Cc ] is the probability predicted by the
beams. Training mode could also be triggered if substantial model for the ith observation to belong to the cth category.
changes are made to the antenna orientation or to the infras-
1
F C
tructure such as major new building or road construction in
− 1γ ∈C log Pmodel [γi ∈ Cc ] (1)
the neighborhood. F i=1 c=1 i c
In training mode, gNB configures UEs to measure specific,
likely beams or all available beams. These reports are used The process of tuning the neural network parameters based
directly to determine the best beam to serve the user during on a set of training samples may be repeated for many training
the training phase. The sequence of beams used by the UEs samples for many UEs, in order to train the beam sequence
Fig. 6. Inference with autoencoder Fig. 7. People detected with computer vision algorithms on a single frame
of a video (left) and entire trajectories extracted from a video overlayed on
top of each other (right).
model to provide an accurate prediction. Training with the
entire user population’s past experiences improve prediction
accuracy for current individual users.
C. Algorithm utilizing predictions

The capability of predicting the next beams to serve the
user improves the effective measurement reporting frequency
by relying on predictions as long as reliable. The histories are Fig. 8. 3D ray tracing model for a sample frame. At each frame a new ray
tracing model is generated to model the moving blockages.
updated with predicted beam indexes which may not always
be correct. Predictions based on the imperfect histories will
get less reliable over time and the histories will be needed to A. Pedestrian Mobility
bootstrapped with data from infrequent measurement reports. Figure 7 shows the 635 trajectories extracted from a video
The output of the decoder is the probabilities of each beam. frame by frame overlaid on top of each other collected over
Based on the probability of a beam and percentage of the 12 minutes time frame [17]. Temporal and spatial correlation
correct prediction of beams using inferences based on the in locations under mobility is captured by modeling the envi-
similar probabilities we define the reliability of the beam. As ronment for each frame consecutively by embedding humans
an example, if the predicted probability is 0.6 but only the in every frame into the ray tracing model as shown in Figure
90% of predictions with the probability 0.6 are correct, we 8 [1]. This step is very crucial to capture the beam switching
define the reliability of the beam as 90%. patterns under realistic mobility conditions not at predefined
Let’s define bc as the index of the serving beam, RSRPthr user speeds. Note that our algorithm does not rely on any
the minimum RSRP to be maintained, rmin , smin the required information extracted from cameras. It would only utilize
reliability of the most and second likely beam and K the serving beam index information which is already collected at
prediction window. At time i = 0, prediction starts if the gNB based on measurement reports. We use this simulation
history H has the sufficient length and predict based on this methodology just to extract beam index sequences collected
history the next most likely beam. If the reliability of the on trajectories to test the prediction accuracy of our algorithm.
beam prediction is greater than rmin as determined by the There are two TRPs serving the intersection in Fig. 7. The
probability of the predicted beam by the output layer of the tower in front of TRP 2 blocks the signal for almost 30
predictor, the serving beam bc is set to be the predicted beam percent of users, which causes the users to switch to TRP
bi . 1 as the received signal power decreases. It is main source
If the bc meets the RSRP threshold the history is updated of the attenuation, which may go up 40 dB as the user is
using bc otherwise a full report is requested. If the reliability being blocked. Each TRP has a beamset each with 96 beams
of the most likely beam is not enough but the second most spanning 360 degrees. The beams are of 10 to 60 degrees
likely beam is reliable, gNB requests measurement reports for beamwidths with fixed tilt and azimuth. Only 99 of those
only those two beams and the UE switches to the best one if beams serve the users in this area. For training, validating
the RSRP requirement is met. That requires significantly less and testing we use trajectories of different users. We test the
resources than probing all possible beams. neural network on trajectories of 223 minutes of aggregated
length.
IV. R ESULTS 1) Top K Prediction Accuracy and Reliability: The output
We show the benefits of beam prediction based on the ray- of the neural network is the probability of each available beam
tracing results obtained in two realistic scenarios involving being the next best beam. From this we infer the most likely
pedestrian and vehicular mobility. The results presented in beams by ranking the probabilities. We quantify the prediction
Section IV-A are for a college intersection with pedestrians accuracy as follows:
who are mostly bikers. The results presented in Section IV-B • Best Beam Accuracy (Top-1): Proportion of time slots in
are for vehicular users on a busy road with many lanes and which the predicted beam with highest probability for a
intersections. slot is the best beam to serve the user in that slot.
Fig. 9. Top K Prediction Accuracy: Proportion of time slots in which one out Fig. 10. Top-2 Reliability: Percentage of all the samples where one of the
of the top K likely predicted beams for a slot includes the best beam/TRP to Top-2 beams is a correct prediction versus best beam prediction probability
serve the user in that slot. With Top-2 prediction, we can achieve 96% beam
prediction accuracy and 99.5% TRP prediction accuracy.
• Top K Beam Accuracy: Proportion of time slots in which

one out of the top K likely predicted beams for a slot
includes the best beam to serve the user in that slot.
• Best TRP Accuracy: Proportion of time slots in which
predicted TRP is the best TRP to serve the user
Based on a history of length 1.2s the beams to serve the user
in the next 120 ms sampled at 12 ms intervals are predicted.
Prediction for longer intervals is feasible. We chose 120 ms Fig. 11. Comparison of percentage of blocking instances. A blockage instant
is defined as the slot where the received signal is below noise level or the
since it was a reasonably long time interval to show benefits signal deteriorates at least by 20 dB in comparison to the prior slot.
of predictions in radio access layer. Figure 9 shows the beam
and TRP prediction accuracy for different values of K. With by 20 dB in comparison to the prior slot. Due to directive
Top-2 prediction, 96% beam prediction accuracy and 99.5% antennas used at the transmitter a drop of 20 dB in signal
TRP prediction accuracy is achieved. power does not always cause an outage but would impact the
After extracting the probabilities of each predicted beam quality of service. We determine the percentage of blockage
from the output layer of the network, the relationship between instances along the entire trajectories of users in the test data
the probability value of a predicted beam and its reliability set. Figure 11 compares the percentage of blockage instances
is assessed, i.e. whether a higher beam prediction probability between measurement reporting every 36 ms, every 120 ms
implies that the prediction more likely to be accurate. To and using our algorithm as shown in Fig. 3. We were able
answer this question, the reliability of beams is determined as to reduce the blocking instances by 55% even in comparison
follows. For Top-K reliability, all samples whose best beam to frequent measurement reporting every 36 ms. Reacting to
prediction
p probabilities
p when quantized using the formula blockages proactively can bring significant gains in terms of
0.025 0.025 + 0.5 results in the same value are determined. SNR. We improved 20th SNR percentile from -5 dB to 32 dB
The bin sizes are chosen as small as possible while ensuring at locations where a TRP switching occur by predictions in
enough samples in each bin. Then for those predictions the comparison to measurement reports every 120 ms.
other K-1 most likely beams are determined as well. Reliability 3) Measurement Report Reduction: Frequent measurement
is defined as percentage of all the samples where one of the reporting makes signal quality stable through appropriate beam
Top-K beams is a correct prediction. switching, avoiding sudden signal strength changes because
In the set of 120 ms path segments where both TRP of using the wrong beam. The advantage of beam predictions
switching and beam switchings within the same TRP occur, the is to achieve the same performance by virtually increasing
reliability is between 65-75% for 80% of samples for the best measurement reporting periodicity through the predictions as
beam prediction. With Top-2 prediction the reliability could long as they are determined to be reliable. Moreover, we
be pushed above 90% as shown in Fig. 10. 80% of samples can react to blockages before they happen or initiate beam
in this set had a best beam prediction probability above 0.8 switchings with agility. Since we rely on imperfect histories to
with a reliability metric above 0.8. With Top-2 prediction even predict it may be necessary to switch to conventional frequent
the best beam prediction probability is as low as 0.5 we can measurement reporting approach to build a reliable history.
achieve above 90% reliability by relying on two beams. The Using the algorithm in Figure 3 with a minimum required SNR
reliability could be further increased by increasing K. threshold of 20 dB we were able to reduce measurement re-
2) Blockage Prevention: If gNB can predict when the porting overhead by 80%. If we rely on frequent measurement
blockages or significant drop in the received power are likely reporting instead of predictions, we may improve SNR but
to happen it can react to it before the signal deteriorates. not necessarily throughput because of the associated overhead.
A blockage instant is defined as the slot where the received The overhead associated with frequent measurement reporting
signal is below noise level or the signal deteriorates at least and associated allocation of large number of CSI-RS resources
reduce the benefit achieved in terms of per user throughput

because of sacrificing data resources to control channels.
B. Vehicular Mobility
Fig. 13. Trajectories extracted from Lankershim Boulevard video, overlayed

on top of each other.
Fig. 12. Lankershim Boulevard in Los Angeles, LA. It has four lane
bidirectional arterial segments and three signalized intersections. The traffic
data was collected using video cameras mounted on the roof of a 36-story
building.
Federal Highway Administration Research and Technology
published vehicular videos collected around the Lankershim
Boulevard for investigating driver behavior at microscopic
level [18]. In each video frame, we model all the vehicles
present in that frame with their proper orientation, and map
that within the ray tracing model for the fixed surroundings.
Fig. 14. Top-K prediction accuracy for predicting next 120ms base on a
The extracted trajectories over 10 minutes are depicted in history of 1.2s (Lankershim Boulevard in Los Angeles, LA)
Figure 13. There are four TRPs at locations marked with
squares in Figure 13 at 500-m inter-site distance. 180 beams [3] 3GPP, “ NR; Physical layer procedures for control,” 3rd Generation
across these four sites serve the users. For each vehicle the Partnership Project (3GPP), Technical Specification (TS) 38.213, 04
next beams to serve the user in the next 120 ms based on 2019, version 15.7.0.
[4] W. Zirwas, M. Sternad, and R. Apelfrjd, “Key solutions for a massive
a history of 1.2s is predicted. Figure 14 shows Top-K beam MIMO FDD system,” in 2017 IEEE PIMRC, Oct 2017.
and TRP prediction accuracy. We showed above 98% Top-2 [5] A. Alahi, K. Goel, V. Ramanathan, A. Robicquet, L. Fei-Fei, and
prediction accuracy for both beam prediction and above 99% S. Savarese, “Social LSTM: Human trajectory prediction in crowded
spaces,” in 2016 IEEE CVPR, June 2016.
Top-2 prediction accuracy for TRP prediction. If we predict 6s [6] F. Zafari, A. Gkelias, and K. K. Leung, “A survey of indoor localization
ahead still using a history of 1.2 seconds, we achieve a TRP systems and technologies,” IEEE Communications Surveys Tutorials,
prediction accuracy of 98% and beam prediction accuracy of vol. 21, no. 3, pp. 2568–2599, 2019.
[7] A. Bhattacharyya, M. Fritz, and B. Schiele, “Long-term on-board
70%. prediction of people in traffic scenes under uncertainty,” in IEEE CVPR,
June 2018.
V. C ONCLUSIONS [8] M. Giordani, M. Polese, A. Roy, D. Castor, and M. Zorzi, “A tutorial
We showed the feasibility of predicting the next best beam on beam management for 3gpp nr at mmwave frequencies,” IEEE
Communications Surveys Tutorials, vol. 21, no. 1, pp. 173–196, 2019.
to serve a specific user using the users history and a neural [9] J. Zhao, F. Gao, W. Jia, S. Zhang, S. Jin, and H. Lin, “Angle domain hy-
network trained using the entire user ensemble in a given brid precoding and channel tracking for millimeter wave massive mimo
environment. Especially at mm-waves and high mobility sce- systems,” IEEE Transactions on Wireless Communications, vol. 16,
no. 10, pp. 6868–6880, Oct 2017.
narios, periodic measurement reports may not be sufficient to [10] D. Zhu, J. Choi, Q. Cheng, W. Xiao, and R. W. Heath, “High-
eliminate signal blockage. By predicting the next best beams resolution angle tracking for mobile wideband millimeter-wave systems
to serve the user we can enhance radio performance and with antenna array calibration,” IEEE Transactions on Wireless Com-
munications, vol. 17, no. 11, pp. 7173–7189, Nov 2018.
reduce the signaling overhead by as much as 80% in realistic [11] Y. Guo, Z. Wang, M. Li, and Q. Liu, “Machine learning based mmwave
use cases. Since it is impossible to perfectly predict in all channel tracking in vehicular scenario,” in IEEE ICC Workshops, 2019.
scenarios, our scheme relies on predicting more than a single [12] V. Va, X. Zhang, and R. W. Heath, “Beam switching for millimeter wave
communication to support high speed trains,” in IEEE VTC, Sep. 2015.
best beam when necessary. Top K prediction, with K being [13] R. Parada and M. Zorzi, “Cell discovery based on historical user’s
only 2 or 3, yields almost perfect beam and corresponding location in mmWave 5G,” in European Wireless, May 2017.
TRP prediction accuracy and reliability. Prediction reliability [14] A. Alkhateeb, S. Alex, P. Varkey, Y. Li, Q. Qu, and D. Tujkovic, “Deep
learning coordinated beamforming for highly-mobile millimeter wave
can be used to guide on-demand reporting. Incorporating the systems,” IEEE Access, vol. 6, pp. 37 328–37 348, 1 2018.
Top K prediction into the beam management automatically [15] E. Dahlman, S. Parkwall, and J. Skold, 5G NR: The Next Generation of
allocates measurement reports across the UEs when and where Wireless Access Technology. AP, 2018.
[16] A. Lamb, A. Goyal, Y. Zhang, S. Zhang, and Y. Bengio, “Professor
it is essential. forcing: A new algorithm for training recurrent networks,” 2016.
R EFERENCES [17] A. Robicquet, A. Sadeghian, A. Alahi, and S. Savarese, “Learning social
etiquette: Human trajectory understanding in crowded scenes,” in ECCV,
[1] A. O. Kaya and H. Viswanathan, “Coverage and capacity impact of
mobility and human body blocking at millimeter waves,” in IEEE 2016.
GLOBECOM, Dec 2017, pp. 1–7. [18] Federal Highway Administration Research and Technology,
“Lankershim boulevard dataset,” 2007. [Online]. Available:
[2] A. O. Kaya, D. Calin, and H. Viswanathan, “28 GHz and 3.5 GHz
https://www.fhwa.dot.gov/publications/research/operations/07029/
wireless channels: Fading, delay and angular dispersion,” in 2016 IEEE
Global Communications Conference (GLOBECOM), Dec 2016, pp. 1–7.

Kaya 2021

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Kaya 2021

Uploaded by

Copyright:

Available Formats

,(((:LUHOHVV&RPPXQLFDWLRQVDQG1HWZRUNLQJ&RQIHUHQFH :&1&

Deep Learning-based Predictive Beam Management

Nokia Bell Labs, Murray Hill, NJ 07974, USA

978-1-7281-9505-6/21/$31.00 ©2021 IEEE

Fig. 1. Periodic Measurement Reporting: The best beam is used by the

Fig. 3. Algorithm utilizing predictions while enforcing a minimum signal-to-

a given history of size P we predict beams for the next F

C. Algorithm utilizing predictions

• Top K Beam Accuracy: Proportion of time slots in which

reduce the beneﬁt achieved in terms of per user throughput

Fig. 13. Trajectories extracted from Lankershim Boulevard video, overlayed

You might also like

Kaya 2021

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Kaya 2021

Uploaded by

Copyright:

Available Formats

,(((:LUHOHVV&RPPXQLFDWLRQVDQG1HWZRUNLQJ&RQIHUHQFH :&1&

Deep Learning-based Predictive Beam Management

Nokia Bell Labs, Murray Hill, NJ 07974, USA

978-1-7281-9505-6/21/$31.00 ©2021 IEEE

Fig. 1. Periodic Measurement Reporting: The best beam is used by the

Fig. 3. Algorithm utilizing predictions while enforcing a minimum signal-to-

a given history of size P we predict beams for the next F

C. Algorithm utilizing predictions

• Top K Beam Accuracy: Proportion of time slots in which

reduce the beneﬁt achieved in terms of per user throughput

Fig. 13. Trajectories extracted from Lankershim Boulevard video, overlayed

You might also like

,(((:LUHOHVV&RPPXQLFDWLRQVDQG1HWZRUNLQJ&RQIHUHQFH :&1&