Deep Learning Based Channel Prediction at 226 GHZ Band Using Long Short-Term Memory Network

Deep Learning Based Channel Prediction at 2–26
GHz Band using Long Short-Term Memory Network

Motoharu Sasaki1*, Nobuaki Kuno1, Toshiro Nakahira1, Minoru Inomata1, Wataru Yamada1, Takatsune Moriyama1
1
Access Network Service Systems Laboratories, NTT Corp., Kanagawa, Japan, motoharu.sasaki.cn@hco.ntt.co.jp*
Abstract—We report a method of predicting variations in used for signal processing, predictions are made in the order
path loss using long short-term memory (LSTM) as deep of microseconds to milliseconds. A method of predicting the
learning. The training data and validation data are path loss second order based on video information from a camera or
data measured in Kanagawa, Japan, and the measurement the like has been reported [8–12]. There are studies that
frequencies are in the 2.2 GHz, 4.7 GHz, and 26.4 GHz
frequency bands. The median data of the path loss after 1 predict the received power of 100 to 500 ms ahead by
second was predicted using 100 points of fast fading data combining DNNs based on the video information of the
obtained about every 0.1 seconds. The median data was communication environment [8–10] and studies that predict
derived using fast fading data of 100 points (about 10 seconds). the optimum beam direction of multi-antenna systems [11,
Utilizing the prediction method using LSTM, the root-mean- 12]. Studies of predicting future received power from past
square error (RMSE) for the validation data was about 2.2 dB received power without using video information, so methods
at 2.2 GHz, about 2.1 dB at 4.7 GHz, and about 2.4 dB at 26.4
of predicting received power after 20 ms using LSTM [13],
GHz. The prediction errors were improved by 1 dB or more
than predictions using the latest observed values. predicting up to 2 seconds ahead in the 2.36 GHz band [14],
and predicting received power such as LTE and Wi-Fi using
Index Terms—deep learning, LSTM, path loss prediction, LSTM and a gated recurrent unit (GRU) [15, 16] have been
Sub6, millimeter wave, measurements.
reported. However, all the reports are focused on frequency
bands below 6 GHz.
I. INTRODUCTION Therefore, in this paper, we predict future received power
The use of high frequency bands of 6 GHz and above is from past received power without using video information by
advancing in fifth-generation mobile communication utilizing LSTM for path loss in the 2–26 GHz band including
systems, and the use of higher frequency bands such as the high frequency bands of 6 GHz and above. In Chapter 2, we
THz band is expected in sixth-generation mobile show our deep learning LSTM model. In Chapter 3, we
communication systems (6G) [1, 2]. In addition, 6G will describe the measurement method and parameters for path
have more diverse system requirements and usage patterns, loss data we used for training and validation. In Chapter 4,
including smartphones, tablet PCs, delivery drones, remote we show our model evaluation results. In Chapter 5, we
controls, and high-definition telemedicine videos [2]. summarize.
Therefore, a new network topology cell configuration will
conceivably superimpose various wireless cell services over II. PROPOSED MODEL USING LSTM
each other [2]. Therefore, the connections between base
Fig. 1 shows our model using LSTM, a type of recurrent
stations and terminals must be controlled to switch and/or
neural network (RNN) that is well known for time series
coordinate a large number of cell base stations to satisfy
prediction and has been used to predict wireless
various requirements without deteriorating the
communication quality in recent years [7, 13, 15, 16]. RNNs
communication quality of the terminals in a complicated cell
mainly have a vanishing gradient problem, which LSTM
configuration. For that purpose, technology that predicts the
overcomes to make long-term memory possible. The
communication quality of the connected cell in advance is
structure of the LSTM unit is made by adding the cell state
required for high frequency bands including above 6 GHz,
with a forget gate layer, an input gate layer, and an output
and the received power must by predicted in the order of
gate layer to a hidden state of the RNN unit. Input gate i
seconds [3].
controls the level of cell state updates. Forget gate f controls
As for the prediction technology of wireless
the level of cell state resets. Cell candidate g adds
communication quality, various methods using deep learning
information to a cell state, and output gate o controls the
have been reported. For example, there are methods of
level of cell states added to the hidden state. Each component
predicting channel state information (CSI) using the
has the input weights W, the recurrent weights R, and the
frequency band, time, place, etc. [4], predicting downlink
bias b. They are calculated as follows.
CSI from uplink CSI using sparse complex-valued neural
networks (SCNet) [5], predicting the multi-antenna coding
using a deep neural network (DNN) [6], and predicting
future CSI from past CSI using long short-term memory
(LSTM) [7]. However, since these prediction methods are
This paper's copyright is held by the author(s). It is published in these proceedings and included in any archive
such as IEEE Xplore under the license granted by the "Agreement Granting EurAAP Rights Related to
Publication
Authorized licensedof Scholarly
use Work."
limited to: Florida Institute of Technology. Downloaded on August 02,2021 at 09:41:04 UTC from IEEE Xplore. Restrictions apply.
it = σ g (Wi xt + Ri ht −1 + bi ) value can be predicted accurately even in the presence of
f t = σ g (W f xt + R f ht −1 + b f ) , (1) high-speed fading.
In Chapter 4 of this paper, we adopt a method of
g t = σ c (Wg xt + Rg ht −1 + bg ) prediction that uses the latest observed median values as a
ot = σ g (Wo xt + Ro ht −1 + bo ) conventional method for comparison.
where σg denotes the state activation function that is the

hyperbolic tangent function, and σc is the gate activation III. MEASUREMENT METHOD AND PARAMETERS
function that is the sigmoid function. Therefore, the cell state Fig. 2 shows the measurement environment, and Table 1
ct and the hidden state ht at time t are given by shows the measurement parameters. The measurements were
carried out in an urban area in Yokosuka, Kanagawa, Japan.
ct = f t  ct −1 + it  g t , (2) The average building height in the measurement
ht = ot  σ c (ct ) environment was about 20 m. The buildings mainly consist
of steel-reinforced concrete, reinforced concrete, or timber.
where  denotes the Hadamard product that takes two We used three frequency bands (2.2, 4.7, and 26.4 GHz) and
same-dimensional matrices and generates another matrix in measured the path loss of all the frequencies at the same
which each element i, j is the product of elements i, j of the time.
original two matrices.
Input dimension: [100 timesteps,1]

p t: raw data (fast fading data) of path loss at time t
p t-100 p t-99 … p t-1 pt
…
LST M layer, 50 nodes (tanh) Tx2
(21 m)
…
Fully connected layer, 50 nodes (linear)
… Tx1
100 m
Fully connected layer, 50 nodes (linear) (16 m)
…
Fully connected layer, 50 nodes (linear) Tx3
(25 m)
…
Copyright. Geospatial Information Authority of Japan. ALL RIGHT RESERVED.
Fully connected layer, 1 node (linear)
Fig. 2. Measurement environment.
Output: median value of path loss at time t+1 sec The Tx antennas were installed on building roofs at
heights of about 16, 21, and 25 m (Tx 1, 2, and 3) in the area.
Fig. 1. Proposed model.
The Rx antennas were set on the roof of the measurement
vehicle at a height of 2.5 m. The red lines in Fig. 2 represent
The neural network we utilize consists of five types of the routes taken by the measurement vehicle. The path loss
layers: an LSTM layer with 50 hidden units, three fully was measured while the vehicle was moving. The Tx
connected layers with 50 units, and fully connected layers antennas were directional antennas with a half-power
beamwidth (HPBW) of 60°, and the Rx antennas were
with one unit. Data are standardized before the input layer.
For the optimization algorithm, Adam is applied with a omnidirectional. The Rx antenna was positioned to remain
maximum epoch of 200. within the Tx antennas’ beamwidth by adjusting the Tx
In this paper, we make predictions using 100 fast fading antennas’ direction. The transmitted signal was a continuous
data as input data. The output data is the median data of the wave. The received power was acquired at a sampling
path loss after 1 second. The median value is derived from frequency of 45 kHz, but we did downsampling to 9.4 Hz
before data processing (data samples were obtained about
100 points (about 10 seconds). If the output data are also fast
fading data, it is impossible to predict the future of the every 0.1 sec). The vehicle drove up to about 40 km/h so the
second order because there is no channel correlation between running distance per sample was up to about 1 meter.
input and output, so prediction results converges to the The measurement path loss data of Tx1 and Tx2 are used
average value. In this method, using the data processed in for training, and the data of Tx3 are used to validate our
LSTM model.
advance as the output data facilitates learning so the median
Publication
use Work."
Fig. 3 shows the distance characteristics of the path loss 115
acquired at Tx1 (used for the training data) as the Meas.
measurement results. Here, the path loss is the median value 120
LSTM
of the 10 m intervals. As Fig. 3 shows, the measurement data Conv.
125
Path loss (dB)

was acquired up to 1000 m or more at any frequency, and the
variation characteristics of path loss when moving in urban 130
areas could be widely acquired.
135
Table 1. Measurement parameters. 140
Frequency (GHz) 2.2, 4.7, 26.4 145
17600 17800 18000 18200 18400
Tx antenna height (m) 16 (Tx1), 21 (Tx2), 25 (Tx3) Sample
Rx antenna height (m) 2.5 (a) 2.2 GHz
Sampling rate (Hz) 9.4 90
95
80 100
Path loss (dB)

2.2 GHz 105
100 4.7 GHz
110
26.4 GHz
Path loss (dB)
120 115
120 Meas.
LSTM
140 125 Conv.
130
160 22200 22400 22600 22800 23000
Sample
180 (b) 4.7 GHz

0 500 1000 1500
Tx-Rx distance (m) 110
Fig. 3. Measurement example (Tx1). Meas.

120 LSTM
Conv.
Path loss (dB)
130
IV. EVALUATION OF PROPOSED MODEL
The prediction accuracy evaluation of our model 140
described in Chapter 2 is shown for the training data (Tx1
and Tx2) and validation data (Tx3) in Chapter 3. Fig. 4 150
shows the prediction results for the training data at each
frequency, and Fig. 5 shows the prediction results for the 160
11400 11500 11600 11700 11800 11900 12000 12100 12200
validation data. We compared our model with a conventional Sample
method (Conv.) that uses the latest observed median path (c) 26.4 GHz
loss value for prediction. As can be seen from the figure, Fig. 4. Example predictions for training data.
both the training data and the validation data can be
predicted to follow the measurement results, and the
prediction is basically performed with higher accuracy than
the conventional method. When an increase or decrease of 10
dB or more occurs, the measurement results are tracked and
predicted faster than the conventional method.
Publication
use Work."
80 99.999
99.99
90 99.9
99
100 95
Path loss (dB)
90
CDF (%)
80
70
110 50
30
20 LSTM (train)
120 Meas. 10
5 Conv. (train)
LSTM 1 LSTM (test)
130 Conv. .1 Conv. (test)
.01
140 .001
11600 11800 12000 12200 12400 -20 -15 -10 -5 0 5 10 15 20
Sample Prediction error (dB)
(a) 2.2 GHz (a) 2.2 GHz

110 99.999
99.99
LSTM (train)
Meas. 99.9
115 Conv. (train)
LSTM 99 LSTM (test)
Conv. 95 Conv. (test)
120 90
Path loss (dB)
CDF (%)
80
70
125 50
30
20
10
130 5
1
135 .1
.01
140 .001
-20 -15 -10 -5 0 5 10 15 20
8000 8200 8400 8600 8800 9000 Prediction error (dB)
Sample
(b) 4.7 GHz (b) 4.7 GHz
99.999
100 99.99
99.9
110 99
90
Path loss (dB)
CDF (%)
120 70
50
30
130 10 LSTM (train)
Meas. Conv. (train)
1 LSTM (test)
140 LSTM
.1 Conv. (test)
Conv. .01
.001
150 -20 -15 -10 -5 0 5 10 15 20
6600 6800 7000 7200 7400 Prediction error (dB)
Sample
(c) 26.4 GHz
(c) 26.4 GHz
Fig. 6. CDF of prediction errors.
Fig. 5. Example predictions for test data.
Fig. 7 shows the root-mean-square error (RMSE)
Fig. 6 shows the CDF of the prediction errors for the
comparison results between our method using LSTM and the
training data and the validation data at each frequency. In
conventional method. In both the training data and the
both our method and the conventional method, the median
validation data, the RMSE is 1 dB or smaller with our
error (50% value) is about 0 dB in both the training data and
method than with the conventional method. At 26.4 GHz, the
the validation data in any frequency band. On the other hand,
RMSE of the conventional method is 4 dB or more for both
the variance of our method is smaller than the conventional
training data and validation data, while the RMSE of our
method’s, and the variance of our method is in about ±10 dB
method is about 2.4 dB, and the RMSE is improved by 1.5
for all frequencies, training data, and verification data,
dB or more. This indicates a large path loss variation in the
indicating the training results are well generalized without
high frequency band and a large error with the conventional
overfitting.
method, while our method maintains the same high
prediction accuracy as other frequencies. The above shows
our method using LSTM could perform accurate predictions
in any frequency band measured including high frequency
bands.
Publication
use Work."
Vehicular Technology Conference (VTC2019-Spring), Kuala
Lumpur, Malaysia, pp. 1–5, May 2019.
5 LSTM (train) LSTM (test) [9] Y. Koda, K. Nakashima, K. Yamamoto, T. Nishio, and M. Morikura,
Conv. (train) Conv. (test) "Handover Management for mmWave Networks With Proactive
Performance Prediction Using Camera Images and Deep
4 Reinforcement Learning," IEEE Transactions on Cognitive
Communications and Networking, vol. 6, no. 2, pp. 802–816, June
2020.
RMSE (dB)
3
[10] Y. Koda et al., "Communication-Efficient Multimodal Split Learning
for mmWave Received Power Prediction," IEEE Communications
2 Letters, vol. 24, no. 6, pp. 1284–1288, June 2020.
[11] M. Alrabeiah, A. Hredzak, and A. Alkhateeb, "Millimeter Wave Base
1 Stations with Cameras: Vision-Aided Beam and Blockage
Prediction," 2020 IEEE 91st Vehicular Technology Conference
0 (VTC2020-Spring), Antwerp, Belgium, pp. 1–5, May 2020.
2.2 4.7 26.4 [12] C. Lin, W. Kao, S. Zhan, and T. Lee, "BsNet: A Deep Learning-
Frequency (GHz)
Based Beam Selection Method for mmWave Communications," 2019
Fig. 7. RMSEs of prediction results. IEEE 90th Vehicular Technology Conference (VTC2019-Fall),
Honolulu, HI, USA, pp. 1–6, Sept. 2019.
[13] S. H. A. Shah, M. Sharma, and S. Rangan, "LSTM-Based Multi-Link
Prediction for mmWave and Sub-THz Wireless Systems," ICC 2020 -
V. CONCLUSION 2020 IEEE International Conference on Communications (ICC),
Dublin, Ireland, 2020, pp. 1–6, June 2020.
In this paper, we proposed a method of predicting path
loss using LSTM, a type of RNN used in time series [14] Y. Yang, D. B. Smith, and S. Seneviratne, "Deep Learning Channel
Prediction for Transmit Power Control in Wireless Body Area
prediction. The training data and validation data use the Networks," ICC 2019 - 2019 IEEE International Conference on
measurement results of path loss at 2.2 GHz, 4.7 GHz, and Communications (ICC), Shanghai, China, pp. 1–6, May 2019.
26.4 GHz in an urban environment. The median data [15] J. D. Herath, A. Seetharam, and A. Ramesh, "A Deep Learning Model
(median calculation interval 100 points) was predicted one for Wireless Channel Quality Prediction," ICC 2019 - 2019 IEEE
International Conference on Communications (ICC), Shanghai,
second ahead using the 100 points of the acquired fast fading China, pp. 1–6, May 2019.
data as the input data. With our method, the prediction [16] A. Kulkarni, A. Seetharam, A. Ramesh, and J. D. Herath,
accuracy of RMSE was about 2 dB, more than 1 dB smaller "DeepChannel: Wireless Channel Quality Prediction Using Deep
than the conventional method using the latest observations in Learning," IEEE Transactions on Vehicular Technology, vol. 69, no.
any frequency band including high frequency band of 26 1, pp. 443–456, Jan. 2020.
GHz.
REFERENCES
[1] Recommendation ITU-R M.2083-0, "IMT Vision - Framework and
overall objectives of the future development of IMT for 2020 and
beyond," https://www.itu.int/rec/R-REC-M.2083.
[2] DOCOMO 6G White Paper. [Online]. Available:
https://www.nttdocomo.co.jp/english/binary/pdf/corporate/technology
/whitepaper_6g/DOCOMO_6G_White_PaperEN_20200124.pdf.
[3] A. Seetharam, J. Kurose, and D. Goeckel, "A markovian model for
coarse-timescale channel variation in wireless networks," IEEE
Trans., Vehicular Technology, vol. 65(3), pp. 1701–1710, 2016.
[4] C. Luo, J. Ji, Q. Wang, X. Chen, and P. Li, "Channel State
Information Prediction for 5G Wireless Communications: A Deep
Learning Approach," IEEE Transactions on Network Science and
Engineering, vol. 7, no. 1, pp. 227–236, June 2018.
[5] Y. Yang, F. Gao, G. Y. Li, and M. Jian, "Deep Learning-Based
Downlink Channel Prediction for FDD Massive MIMO System,"
IEEE Communications Letters, vol. 23, no. 11, pp. 1994–1998, Nov.
2019.
[6] M. Mehrabi, M. Mohammadkarimi, M. Ardakani, and Y. Jing,
"Decision Directed Channel Estimation Based on Deep Neural
Network k-Step Predictor for MIMO Communications in 5G," IEEE
Journal on Selected Areas in Communications, vol. 37, no. 11, pp.
2443–2456, Nov. 2019.
[7] Y. Guo, Z. Wang, M. Li, and Q. Liu, "Machine Learning Based
mmWave Channel Tracking in Vehicular Scenario," 2019 IEEE
International Conference on Communications Workshops (ICC
Workshops), Shanghai, China, 2019, pp. 1–6, May 2019.
[8] T. Mikuma, T. Nishio, M. Morikura, K. Yamamoto, Y. Asai, and R.
Miyatake, "Transfer Learning-Based Received Power Prediction
Using RGB-D Camera in mmWave Networks," 2019 IEEE 89th
Publication
use Work."

Deep Learning Based Channel Prediction at 226 GHZ Band Using Long Short-Term Memory Network

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Deep Learning Based Channel Prediction at 226 GHZ Band Using Long Short-Term Memory Network

Uploaded by

Copyright:

Available Formats

Deep Learning Based Channel Prediction at 2–26

GHz Band using Long Short-Term Memory Network

where σg denotes the state activation function that is the

Input dimension: [100 timesteps,1]

Path loss (dB)

Path loss (dB)

180 (b) 4.7 GHz

Fig. 3. Measurement example (Tx1). Meas.

(a) 2.2 GHz (a) 2.2 GHz

You might also like