You are on page 1of 6

Federated Learning Over Cellular-Connected UAV

Networks with Non-IID Datasets


Di-Chun Liang† , Chun-Hung Liu‡ , Rung-Hung Gau† , and Lu Wei∗
Institute of Communications Engineering, National Yang Ming Chiao Tung University, Hsinchu, Taiwan†
Department of Electrical and Computer Engineering, Mississippi State University, USA‡
Department of Computer Science, Texas Tech University, Lubbock TX, USA∗
e-mail: ldc.cm02g@nctu.edu.tw† ; chliu@ece.msstate.edu‡ ; runghunggau@g2.nctu.edu.tw† ; luwei@ttu.edu∗
arXiv:2110.07077v1 [cs.LG] 13 Oct 2021

Abstract—Federated learning (FL) is a promising distributed ing methodology without raw data transportation, such as
learning technique particularly suitable for wireless learning the recently proposed federated learning (FL) [1], [2], [3],
scenarios since it can accomplish a learning task without raw data becomes a viable solution.
transportation so as to preserve data privacy and lower network
resource consumption. However, current works on FL over A number of the existing FL algorithms were developed
wireless communication do not profoundly study the fundamental with uniformly compressible data and shown to achieve con-
performance of FL that suffers from data delivery outage due vergence based on the assumption of error-free and reliable
to network interference and data heterogeneity among mobile data communications between a server and clients. For ex-
clients. To accurately exploit the performance of FL over wireless ample, reference [4] proposed a compression protocol that
communication, this paper proposes a new FL model over
a cellular-connected unmanned aerial vehicle (UAV) network, inherits the compression techniques of top-k sparsification and
which characterizes data delivery outage from UAV clients to quantization for uplink and downlink communication in FL,
their server and data heterogeneity among the datasets of UAV where the distribution of datasets of each client are non-i.i.d.
clients. We devise a simulation-based approach to evaluating In [5], FL-based multi-access edge computing was studied
the convergence performance of the proposed FL model. We with limited network resources, which adopted a gradient
then propose a tractable analytical framework of the uplink
outage probability in the cellular-connected UAV network and descent approach to find the optimal trade-off between local
derive a neat expression of the uplink outage probability, which update at clients and global aggregation at a server. There
reveals how the proposed FL model is impacted by data delivery are also a number of works in the recent years studying the
outage and UAV deployment. Extensive numerical simulations problem of FL over wireless communication, where many
are conducted to show the consistency between the estimated of them approached the problem from the perspective of
and simulated performances.
signal processing. The authors of reference [6], for instance,
devised a compressive sensing approach for FL over single-
I. I NTRODUCTION
antenna communication systems. The authors of reference [7]
In the recent years, we have witnessed that machine learn- proposed a compressive sensing approach for FL over a MIMO
ing (ML) techniques have been dramatically improved and communication system, where the server recursively finds the
successfully applied to tackle many real-world problems. The linear minimum-mean-square-error estimate of the transmitted
remarkable success of ML is mainly attributed to two key signal by exploiting the sparsity of the signal. In [8], the au-
factors – highly powerful computing and extremely efficient thors studied the over-the-air computation (AirComp, proposed
data analytics, yet such a remarkable progress in ML signifi- in [9]) problem with one-bit broadband digital aggregation.
cantly relies on whether or not there are enough data to support Furthermore, FL over UAV networks has been investigated in
ML algorithms so as to make them work desirably, which some prior works, such as [10].
becomes a crucial issue in many ML applications. Due to the In these prior works, a fundamental issue of FL over
proliferation of smart mobile devices, collecting data through wireless communication is far from being fully resolved, that
them becomes much feasible and easier such that a mobile is, data communications between clients and a server may
cellular network has gradually been a huge live database fail due to unreliable wireless transmissions, which leads to
abounding with real-time information, which can be utilized by transmission outage and degrade the convergence performance
ML to optimize network operations and managements. Proper of FL accordingly. Another crucial issue that is not addressed
and efficient utilization of ML techniques from data distributed much in the prior works is the heterogeneity of datasets among
over a massive mobile network becomes an important issue. different mobile clients. Namely, most of the existing works
This is especially true when transporting raw data from all focus on developing wireless FL algorithms by assuming
mobile devices to a server with learning facilities that may that all mobile clients possess independent and identically
cause issues such as network congestion, energy consumption, distributed (i.i.d.) datasets. To tackle these two issues, in this
and security. To avoid the impractical tasks of transporting paper our first contribution is to propose a more realistic
massive distributed data for centralized ML, distributed learn- FL model that can characterize the transmission outages
between a server and mobile clients and be implemented in Algorithm 1 Federated Learning with Intermittent Updates
a cellular-connected UAV networks in which the datasets at 1: inputs: initial model vector wo
all UAV clients are non-i.i.d. Our second contribution is to 2: outputs: improved model vector w
propose a tractable framework of evaluating the convergence 3: initialize: the global model is initialized as w0 ← wo .
performance of the proposed FL model from a network-wide Each client holds non-i.i.d. dataset Di with equal size.
perspective. A three-dimensional (3D) random deployment ∇w0 , ∇wi,0 ← 0.
model is properly devised for modeling the random locations 4: for t = 1, · · · , T do
of UAV clients and access points (APs), which is the third 5: for i ∈ {1, · · · , C} do in parallel1
contribution. Our fourth contribution is to conduct intensive 6: client i does:
numerical simulations in order to validate the correctness of 7: · wi,t ← download
P i←S (wt−1 )
our analytical findings and importantly demonstrate that the 8: · ∇wi,t ← j∈Di SGD(Dj , wi,t ) − wi,t
convergence performance of FL over wireless communication 9: · uploadS←i (∇wi,t )
is fairly sensitive to data delivery outage as well as non-i.i.d. 10: end for
data nature. Moreover, the numerical results also shed light on 11: edge server does:
· ∇wt ← P 1βi,t |Di | i∈D βi,t |Di |∇wi,t
P
the fundamental interplay between the deployment of UAVs 12:
i∈D
and the learning accuracy of the proposed FL model. 13: · wt ← wt−1 + ∇wt
14: · w ← wt
II. F EDERATED L EARNING M ODEL OVER 15: · broadcasti←S (wt )
C ELLULAR -C ONNECTED UAV S 16: end for
In this paper, our main goal is to study the performance 17: return w
of FL over a cellular-connected UAV network consisting of
numerous UAVs, APs, an edge server (ES), and a cloud. All
the APs in the network are governed by an ES connected where Dj denotes data point j in set Di , xi is the input data
to a cloud through a gateway. The ES communicates with with an appropriate dimension, yi is the labeled scalar output
the cloud whenever it needs to send data for further large- corresponding to xi . All the datasets Di ’s are assumed to be
scale data processing and learning. Each UAV associates non-i.i.d. and of the same size in this paper. Note that each
with its strongest AP and conducts FL with the ES. An UAV and the ES initiate their own algorithm with the same
illustration of the proposed model is shown in Fig. 1(a). In the initial model vector, i.e., wi,0 ← wo and w0 ← wo . All the
following, we will first propose an FL model with intermittent UAVs are assumed to synchronously update their own local
updates. Afterwards, some preliminary simulation results will learning models. At the tth round of communication, client i
be provided to illustrate how the proposed FL is affected by updates its local model vector wi,t according to the following
uplink transmission outages. algorithm:
(
A. Proposed Federated Learning with Intermittent Updates wi,t ← downloadi←S (wt−1 )
P , (2)
Now consider the network in Fig. 1(a) that is comprised of ∇wi,t ← j∈Di SGD(Dj , wi,t ) − wi,t
an ES communicating with multiple UAVs. In this network,
where SGD(Dj , wi,t ) stands for the calculation of Stochastic
there are C UAVs that would like to jointly learn a global
Gradient Descent with data point Dj and model vector wi,t .
model with an ES through their nearby AP in T training
After receiving the global model vector wt−1 from the ES
rounds. Since the wireless communications between the AP
broadcasted by the AP, UAV Ui updates its local model vector
and each UAV are not reliable, data delivery outages would
wi,t by SGD so as to find the gradient ∇wi,t . Subsequently,
happen between the ES and each UAV, and they thus impact
UAV Ui uploads the gradient ∇wi,t to the ES through AP.
the performance of FL because the learning updates at the ES
To characterize the data delivery outage from Ui to AP, we
and the UAVs are intermittent. To characterize the impact of
adopt a Bernoulli random variable βi,t ∈ {0, 1} to model the
data delivery outage on FL between the ES and a UAV as well
uplink outage transmission from the UAV Ui to the AP at the
as tractably analyze the performance of FL with intermittent
tth round. Thus, the data aggregation algorithm at the ES can
updates, data delivery outage is assumed to only happen in the
be expressed as
uplink transmission from a UAV to its associated AP, yet the
(
downlink transmission from the AP to a UAV is assumed to 1
P
∇wt ← P Di ∈D βi,t ∇wi,t
Di ∈D βi,t |Di |
be outage-free. As such, we propose the FL with intermittent , (3)
uplink outages as shown in Algorithm 1. wt ← wt−1 + ∇wt
In Algorithm 1, each UAV Ui is assumed to possess a
S
where D = i Di . The ES aggregates the received gradients
dataset Di given by to ∇wt by averaging all the received gradients from the
Di , {Dj = (xj , yj ), j ∈ N+ }, ∀i ∈ N+ , (1) UAVs[2]. As aforementioned, some gradients may be lost due
to uplink outage transmissions so that the server is merely
0 Note that all the UAVs joining the learning process can be in different able to aggregate the gradients of the local model vectors
cells, that is, these C UAVs may not associate with the same AP. successfully transmitted. The global model vector wt at the
Fig. 1. (a) A cellular-connected UAV network consisting of numerous UAVs (users), APs, an edge server, and a cloud. Each UAV associates a nearby AP so
as to perform the proposed FL algorithm (Algorithm 1) with the edge server. (b) An illustration shows that the typical AP located at the origin is associated
with the UAV Ui .

tth round is then updated and subsequently broadcasted to


the UAVs in the next round until the predesignated training
round number T is reached. To illustrate how the proposed
FL model in Algorithm 1 is impacted by uplink transmission
outage, a preliminary simulation is provided in the following
subsection in order to observe the convergence performance
of the proposed FL model with intermittent updates.

B. A Numerical Glance of the Proposed FL Model


In this section, we provide a preliminary numerical illustra-
tion regarding how the convergence of the proposed FL model
is influenced by uplink outages in a predesignated training
round T = 200, as shown in Figs. 2(a) and 2(b). All the UAVs
are assumed to experience the same uplink outage probability
pout , P[βi,t = 0] and have the same capability of assembling Fig. 2. The simulation results of the proposed FL model: (a) Uplink outage
data, i.e., the quality and size of the data collected by each probability versus learning accuracy for different numbers of C UAV clients
UAV are identical. Moreover, we adopt the training data that with i.i.d. datasets; (b) Uplink outage probability versus learning accuracy for
different numbers of C UAV clients with non-i.i.d. datasets; (c) Uplink outage
are hand-written digit images from the famous MNIST dataset probability versus the ratio of the UAV density to the AP density.
and evaluate the learning convergence by using the accuracy
defined as the rate of successfully classifying the images in the UAV increases, the number of clients C participating FL
the entire dataset2 by using the global model w found by the increases so that the rate of convergence increases as well,
proposed FL model. To compare the results in the simulation, yet it could induce the risk of overfitting and increase the
the UAVs perform SGD with the same model architecture interference. According to Figs. 2(a) and 2(b) , the accuracy
that has equal batch sizes and the number of hidden layers degrades as pout increases since the uploaded gradients are
and nodes. Each case terminates the learning process once more likely to undergo transmission outage. Note that the
reaching training round T . For the case of non-i.i.d. datasets, impact of pout is more severe when the UAVs have non-i.i.d.
each UAV collects one specific class of the dataset where the datasets. In contrast, the impact of pout is less severe if the
ratio of data sizes for training and testing is 2:1. For the case UAV possess i.i.d datasets since any one of the UAVs that are
of i.i.d. datasets, all the classes of the dataset are gathered by able to successfully transmit can improve the FL process in
each UAV with the same probability. To make each case fairly each round. On the other hand, properly increasing the number
comparable, the total data of each class for training are of the of UAV clients helps FL and also provide some robustness to
same size in order to make each class of data fairly trained. uplink outage. This simulation provides an overview for the
For example, if 50 UAVs are involved in the training, exactly influences of intermittent updates given uplink outage proba-
5 UAVs out of them gather the images of handwritten “3”, bilities. It is important to point out that the distribution of the
and so forth for other images of hand-written digit. Bernoulli random process βi,t is impacted by the deployment
According to Algorithm 1, the global model vector w is densities of the UAVs and APs in the network in that it
directly affected by the number C of UAV clients and the depends on the uplink interference dominated by the densities
uplink outage indicator βi,t . Specifically, when the density of of the UAVs and APs. To exploit the fundamental relationship
between the UAV and AP densities and the distribution of
2 The entire dataset is the union of the local datasets at the UAVs. βi,t , a cellular-connected UAV deployment model is proposed
in the following section. B. Analysis for Uplink Outage Probability
In this subsection, we focus on the analysis of the uplink
III. S YSTEM M ODEL AND O UTAGE A NALYSIS FOR
outage probability. To facilitate the derivation of the outage
C ELLULAR -C ONNECTED UAV N ETWORKS
probability, we need to first introduce two related theorems in
To demonstrate how the uplink transmission outage is the following. The first theorem is about the distribution of
affected by UAV and AP deployments and channel fading, in the path loss of the link from a UAV to its associating AP,
this section, we propose a UAV deployment model in which which is shown in the following.
tractable analyses of the uplink outage probability can be
conducted. Theorem 1. Let L? and kA? k denote the LoS gain and
distance between each UAV and its associating AP. Define
A. Deployment Model of Cellular-Connected UAVs R? (r) , P[L? kA? k−α ≤ r] and it can be found as
We assume all the UAV can be deployed according to the R? (r) = exp (−πλa Υ(r)) , (8)
following 3D point process3 :
where Υ(·) is defined as
Φu , {Ui ∈ R2 × R : Ui = (Xi , Hi ), i ∈ N+ }, (4)  +
Z ( r1 ) α2 −Ho2
where Ui , denotes UAV i and its 3D location, Xi is the Υ(r) , ρ(ϑy )dy
projection of Ui on the plane of R2 , and Hi is the altitude 0
 +
of Ui , as depicted in Fig. 1(b). Note that the set of projections Z ( r` ) α2 −Ho2
{Xi } form a 2D homogeneous Poisson point process (PPP) of + (1 − ρ(ϑy ))dy. (9)
0
density λu . Similarly, all the APs in the network are assumed

to form an independent 2D homogeneous PPP of density λa [x]+ , max{0, x}, and ϑy , tan−1 (Ho / y).
given by
Proof: See Appendix A.
2 R? in Theorem 1 inherits the properties from the UAV
Φa , {Ai ∈ R : i ∈ N+ }, (5)
association scheme in (7). Note that Υ(r) is a decreasing
where Ai denotes AP i and its location. Note that we consider function wherein the upper limit of the integral is not zero.
the urban scenario that λu ≥ λa since it is more convenient Let Iul be the interference power received by the typical AP4 ,
to deploy UAVs in such an environment. and can be written as
The quality of a wireless link between a UAV and the typical X
AP highly depends on whether or not the wireless link is line- Iul , gi Li kUi k−α , (10)
of-sight (LoS). An LoS link between two spatial points means i:Ui ∈Φ
e u \U?
that the link is not visually blocked from one point to the other.
To model the LoS probability of a wireless link between the where U? is the reference UAV and Φe u is the set of UAVs from
typical AP and a UAV, we adopt the following LoS probability other cells using the same resource block as U? . gi ∼ exp(1)
model for a low-altitude platform proposed in [12]: is the uplink channel fading gain and is assumed to be i.i.d.
and independent to Li and Ui . The following theorem shows
1 the Laplace transform of Iul .
ρ(Θ) , , (6)
1 + c2 exp(−c1 Θ)
Theorem 2. If the Laplace transform of Iul is defined as
where Θ , tan−1 (Hi /kXi k) is the elevation angle of Ui LIul (s) , E[exp(−sIul )] for s > 0, it can be found as
observed from the typical AP, c1 and c2 are the environment-  Z ∞ 
−α

related positive coefficients (for rural, urban, etc.). Suppose LIul (s) = exp −πλa Ig sy 2 , ϑy dy , (11)
each UAV is positioned at the same altitude Ho and associates 0

with an AP according to the following scheme: where Ig (u, w), for u, w > 0, is defined as
−α
A? , arg max Li (Ho2 + kAi k2 ) 2 , (7) Ig (u, w) ,ρ(w)[1 − Lg (u cosα (w))]
i:Ai ∈Φa
+ (1 − ρ(w))[1 − Lg (`u cosα (w))]. (12)
where α > 2 denotes the path loss exponent and Li ∈ {1, `}
is the LoS gain of AP Ai that is a Bernoulli random variable Proof: See Appendix B.
equal to one if the link between Ui and the Ai is LoS and Theorem 2 indicates how the uplink interference is affected
equal to ` ∈ [0, 1] otherwise. Note that in this AP association by the deployments of UAVs and APs and the LoS channel
scheme, the transmit power of the AP is not considered in impact. Note that (11) is a function of the density of APs λa
that all the APs have the same transmit power and the channel since we assume that only one UAV in each cell is allowed
fading effect is averaged out. to transmit in a time slot so that the density of Φ
e u is equal to
λa . Theorem 2 helps derive the uplink outage probability as
3 This 3D point process is a generalization of the 3D point process proposed
in our previous work [11] by considering a general distribution of the altitude 4 According to the Slivnyak theorem [13], the statistical properties evaluated
of each UAV. at any particular point in homogeneous PPPs are the same.
TABLE I uplink transmission outages that depend on the deployment
N ETWORK PARAMETERS FOR S IMULATION densities of the UAVs and the APs are considered in the
Parameter Value simulations here whereas they were not affected by the de-
UAV Density λu (UAVs/m2 ) 1 × 10−5 ployment densities of the UAVs and the APs of the simulations
AP Density λa (APs/m2 ) λu /[10, 300] in Section II-B. Namely, the uplink outages of all the UAVs
UAV Height Ho (m) 100
SIR Threshold η 0.5
clients are now affected by the UAV and AP densities. All the
Path-loss Exponent α 2.75 UAVs associate with their APs using the scheme in (7) and
(c1 , c2 ) in (6) for urban (0.1581, 43.9142) they would like to jointly learn a global model with the ES
Channel Attenuation Gain of NLoS ` 0.25 through the proposed FL model.
Training Data Size for Each UAV |Di | 20
Number of clients C 10, 30, 50 The simulation results and discussions in this section aim
Predesignated Training Round T 200 to achieve two goals. The first goal is to illustrate how the
learning accuracy is affected by the UAV and AP deployments
defined in the following. The signal-to-interference ratio (SIR) and the second one is to see whether learning accuracy
at the typical AP in Fig. 1(b) is defined as can be estimated by Fig. 2. The second goal is to observe
G? L? kA? k−α whether or not λu /λa can be used to estimate the learning
γ, , (13) accuracy through pout by a series of mapping, i.e., λu /λa →
Iul
pout → learning accuracy, and we want to know how precisely
where G? is the channel fading gain from the reference UAV.
the estimation can be made when compared to the opposite
Here we define the uplink outage probability of a UAV as
simulation scenario where the UAVs and APs are deployed
G? L? kA? k−α
 
in a large-scale network5 . As can be seen in Figs. 2(b) and
pout = P[γ ≤ η] = P ≤η , (14)
Iul (c), for example, when λu /λa is 70, its corresponding uplink
and η is the SIR threshold for successful decoding. According outage probability pout is 0.6, and the learning accuracy the
to the previous theorems, we can achieve the uplink outage around 0.68 when number of clients C = 10 with non-i.i.d.
probability as follows. datasets according to the figure.
Fig. 3 shows the simulation results of the proposed FL
Theorem 3. If the uplink outage probabilities of all UAVs model over a cellular-connected UAV network in which the
at any training round are independent, the uplink outage UAVs and APs are deployed in a large area. As illustrated
probability is explicitly found as in the figure, the mapping estimation mentioned previously
  Z ∞  −α
ηy 2
  is fairly precise when it is compared with its corresponding
pout = 1 − ER?0 exp −πλa Ig , ϑy dy . simulated counterparts. Moreover, it shows that the UAV and
0 R?0
AP deployment densities affect the uplink outages, which lead
(15)
to intermittent updates in FL and accuracy degradation as
R?0 (r) = dRdr
? (r)
is the power density function (PDF) of R? pointed out in previous section. Straightforwardly, the uplink
written as outage is more likely to happen as λu /λa increases due to
more interference induced and/or larger transmission distance
R?0 (r) = −πλa Υ0 (r) exp (−πλa Υ(r)) . (16) between an AP and a UAV. Also, the learning accuracy is
Proof: According to the uplink outage probability defined affected by the number of clients, i.e., C. When λu /λa is
in (14), the uplink outage probability can be rewritten as low, C needs to be small in order to achieve a high learning
    accuracy, yet an excessively large C leads to overfitting and
ηIul η yields diminishing returns accordingly. On the other hand,
pout = P G? ≤ = 1 − L Iul .
L? kA? k−α L? kA? k−α when λu /λa is high, C should be large in order to compensate
By applying Theorems 1 and 2, the result (15) is readily the accuracy degradation caused by intermittent updates due
obtained. to uplink transmission outages.
As shown in Fig. 2(c), the uplink outage probability increases V. C ONCLUSION
as λu /λa increases since more interference is induced. In the
following section, we will demonstrate the numerical results In the literature, the studies of FL over wireless com-
and discuss how deployments of the UAVs and APs affect munication were mainly conducted based on two unrealistic
such an FL algorithm that models the outage of data delivery assumptions, that is, no transmission outages between clients
between the UAV and its associating AP proposed in Sec. II. and a server and no data heterogeneity among all wireless
clients. Such studies cannot practically reflect the accurate
IV. N UMERICAL R ESULTS AND D ISCUSSIONS performance of FL when employing FL to accomplish a
In this section, we investigate the proposed FL model over a learning job in wireless networks. To practically understand
cellular-connected UAV network. The MNIST dataset is also the impacts of data delivery outage on FL, we propose and
adopted for the simulation in this section, as the simulation 5 The discussions in this section focus on the case of the UAV clients with
performed in Section II-B. The main difference between the non-i.i.d. datasets, whereas this mapping estimation can also be implemented
simulations in this section and those in Section II-B is that in the case of the UAV clients with i.i.d. datasets.
B. Proof of Theorem 2
0.8
According to the definition of Iul , its Laplace transform is
given by
0.7  
α
 −sI  Y
E e ul = E  exp −sgi Li (kXi k2 + Ho2 )− 2 
0.6
i:Ui ∈Φ
e u \U?
 Z ∞  sgL
 
(a) −
0.5 = exp −πλa 1 − E e ky+Ho2 kα/2 dy
0
 Z ∞   
(b) sgL
0.4 = exp −πλa P Z≤ dy ,
50 100 150 200 250 300 0 ky + Ho2 kα/2
where (a) is obtained by applying the PGFL of a homogeneous
Fig. 3. Simulation results of the learning accuracy of the proposed FL model PPP to the projections of Φe u and (b) is obtained by the CDF
for the case of non-i.i.d. datasets at the UAV clients.
of a random variable Z ∼ exp(1), i.e., P[Z ≤ z] = 1 − e−z .

evaluate an FL model that is able to characterize transmission Let ϑy = tan−1 (Ho / y) and we get
outages in a cellular-connected UAV network consisting UAV 
sgL
 
sg

clients with non-iid datasets. A tractable approach to analyzing P Z≤ = ρ(ϑ y )P Z ≤
ky + Ho2 kα/2 ky + Ho2 kα/2
the uplink outage probability is proposed and the uplink outage  
sg`
probability of the UAV is explicitly derived in a neat form, + (1 − ρ(ϑy ))P Z ≤ .
which paves the way to evaluate the convergence performance ky + Ho2 kα/2
of the proposed FL model. The estimation is fairly accurate Replacing ky + Ho2 kα /2 with y α/2 secα (ϑy ) in the expression
and provides insights into how to properly deploy the UAVs of LIul (s) yields the outcome in (11).
and APs so as to achieve the accuracy requirement of the
R EFERENCES
proposed FL model.
[1] X. Wang, Y. Han, C. Wang, Q. Zhao, X. Chen, and M. Chen, “In-Edge
A PPENDIX AI: Intelligentizing mobile edge computing, caching and communication
by federated learning,” IEEE Network, vol. 33, no. 5, pp. 156–165, Jul.
A. Proof of Theorem 1 2019.
According to the definition of R? , R? (r) can be explicitly [2] Q. Yang, Y. Liu, Y. Cheng, Y. Kang, T. Chen, and H. Yu, Federated
Learning, 1st ed. Morgan and Claypool, 2019.
written as [3] S. Niknam, H. S. Dhillon, and J. H. Reed, “Federated learning for
   
(a) Li wireless communications: Motivation, opportunities, and challenges,”
R? (r) = P max α ≤ r IEEE Commun. Mag., vol. 58, no. 6, pp. 46–51, Jun. 2020.
i:Ai ∈Φa [kAi k2 + Ho2 ] 2 [4] F. Sattler, S. Wiedemann, K. R. Müller, and W. Samek, “Robust and
( )
 communication-efficient federated learning from non-i.i.d. data,” IEEE
(b) Y Li
α ≤ r
=E P Trans. Neural Netw. Learn. Syst., vol. 31, no. 9, pp. 3400–3413, Sep.
i:Ai ∈Φa
[kAi k2 + Ho2 ] 2 2020.
[5] S. Wang, T. Tuor, T. Salonidis, K. K. Leung, C. Makaya, T. He, and
Z ∞ " # !
K. Chan, “Adaptive federated learning in resource constrained edge
(c) L
= exp −2πλa P α > r xdx , computing systems,” IEEE J. Select. Areas Commun., vol. 37, no. 6,
0 [x2 + Ho2 ] 2 pp. 1205–1221, Mar. 2019.
[6] M. M. Amiri and D. Gündüz, “Federated learning over wireless fading
where (a) inherits the property from (7) since signal attenua- channels,” IEEE Trans. Wireless Commun., vol. 19, no. 5, pp. 3546–
tion is equivalent in the downlink and uplink, (b) follows from 3557, Feb. 2020.
[7] Y.-S. Jeon, M. M. Amiri, J. Li, and H. V. Poor, “A compressive sensing
the fact that all Li /(kAi k2 + Ho2 )α/2 ’s are independent, and approach for federated learning over massive MIMO communication
(c) is obtained by applying probability generating functional systems,” IEEE Trans. Wireless Commun., vol. 20, no. 3, pp. 1990–
(PGFL) of a Homogeneous PPP to Φa . The subscript i of 2004, Nov. 2021.
[8] G. Zhu, Y. Du, D. Gündüz, and K. Huang, “One-bit over-the-air
Li is dropped for simplifying notation, and this is applied aggregation for communication-efficient federated edge learning: Design
in the following of this paper. substitute x2 as y and let and convergence analysis,” IEEE Trans. Wireless Commun., vol. 20,
√ α/2
ϑy = tan−1 (Ho / y), the inner term P L/[y + Ho2

> r] no. 3, pp. 2120–2135, Nov. 2021.
[9] G. Zhu, Y. Wang, and K. Huang, “Broadband analog aggregation for
can be rewritten as low-latency federated edge learning,” IEEE Trans. Wireless Commun.,
" #
L

1
 vol. 19, no. 1, pp. 491–506, Oct. 2020.
P α > r = ρ(ϑ y )P α > r [10] Q.-V. Pham, M. Zeng, R. Ruby, T. Huynh-The, and W.-J. Hwang, “UAV
[y + Ho2 ] 2 (y + Ho2 ) 2 communications for sustainable federated learning,” IEEE Trans. Veh.
     α2 Technol., vol. 70, no. 4, pp. 3944–3948, Mar. 2021.
` ` [11] C.-H. Liu, D.-C. Liang, and R.-H. Gau, “A 3D tractable model for UAV-
+ [1 − ρ(ϑy )]P α > r = ρ(ϑy )P y < Enabled cellular networks with multiple antennas,” IEEE Trans. Wireless
(y + Ho2 ) 2 r Commun., pp. 1–17, Early Access 2021.
 "   α2 #
[12] A. Al-Hourani, S. Kandeepan, and S. Lardner, “Optimal LAP altitude
2 ` 2 for maximum coverage,” IEEE Trans. Wireless Commun., vol. 3, no. 6,
− Ho + [1 − ρ(ϑy )]P y < − Ho ,
r pp. 569–572, Dec. 2014.
[13] F. Baccelli and B. Błaszczyszyn, Stochastic Geometry and Wireless
and Υ(r) and R? are subsequently obtained. Networks: Volume I Theory. Now Foundations and Trends, 2010.

You might also like