Professional Documents
Culture Documents
Wei Yang Bryan Lim, Dusit Niyato, and Chunyan Miao are with Nanyang Technological University (NTU); Sahil Garg is with the Université du Québec;
Digital Object Identifier: Zehui Xiong is with Singapore University of Technology and Design, Yang Zhang (corresponding author) is with Nanjing University of
10.1109/MNET.002.2000334 Aeronautics and Astronautics; Cyril Leung is with The University of British Columbia.
2. Transmission 2. Transmission
IoT IoT
FIGURE 1. The FL working process. The model training is conducted locally. Then, only the model parameters
are subsequently exchanged for aggregation.
ket, for example, different in terms of mobility and aggregated through IoT sensing, for example, tem-
communication capabilities. The types are hidden perature readings or images. Note that these private
from the model owner amid information asymmetry, data may not be willingly shared with the model
that is, the DaaS providers may misreport their UAV owner or among other workers. On one hand, the
types to achieve greater rewards for their services. other workers may be competitors, for example, in
As such, to appropriately incentivize the DaaS pro- the case of industrial applications. On the other hand,
viders, we leverage the multi-dimensional contract the model owner may be a third-party developer of
theoretic incentive mechanism design. The incentive AI applications with no affiliations to the data owners.
compatibility of the contract ensures that each DaaS In the FL process, the model owner first trans-
provider only selects the contract corresponding mits a set of model parameters to each worker.
to its actual type, thus ensuring profit maximization This set of parameters may be randomized, or
amid information asymmetry. The contributions of may comprise parameters from a previous train-
this article is as follows: ing round. The FL process consists of a number
• We provide a tutorial of the fundamentals of of iterations, each of which consists of three main
FL. Then, we review the communication relat- steps as shown in Fig. 1:
ed issues and existing solutions proposed to 1. Local Model Training: The worker trains the
improve the communication efficiency. received model using the local sensing data,
• We provide an overview of the issues and for example, through the Stochastic Gradient
applications related to UAV-assisted commu- Descent to minimize a local loss function.
nication in 5G HetNet. Then, we propose 2. Wireless Transmission: The worker transmits
the UAV-assisted communication efficient FL. the updated local parameters to the model
• We develop the multi-dimensional con- owner.
tract-matching incentive mechanism design 3. Global Parameter Update: The local parame-
to reward the DaaS providers amid informa- ter updates from all workers are aggregated
tion asymmetry. and averaged in the case of the conventional
Federated Averaging algorithm [2].
fundAmentAls of federAted leArnIng After Step 3, the model parameters are trans-
mitted back to the worker for the next iteration of
federAted leArnIng training. The process continues until a desired test
The two main entities in the FL system are the data accuracy is achieved.
owners and model owners (Fig. 1). The data own- In fact, Step 3 of each iteration is usually con-
ers, hereinafter workers, each own a private dataset ducted in a synchronous manner, that is, the glob-
Contract
ct Contract Contract
2(a): The UAV assisted FL with mobile relay in the sky 2(b): Multi-dimensional contract matching
thus delaying the FL training process. As such, the Given the involvement of third-party DaaS
third-party DaaS providers, hereinafter UAVs, are providers, an incentive scheme has to be
employed by the model owner to provide net- designed to compensate the UAVs for task
work coverage and relay support from the sky. completion. However, there exist different UAV
The area in which the workers are situated is types with different traveling, computation, and
divided into subregions (Fig. 2b). Each of the sub- communication capabilities. These heteroge-
regions consists of a number of nodes in which a neous types are often the private information
subset of workers are situated. A UAV is deployed of the DaaS providers that are unknown to the
to serve each subregion. Each iteration of the model owner. In addition, the DaaS providers
UAV-assisted FL consists of the following steps may have an incentive to misreport their types
(Fig. 2a): to seek higher rewards. As such, we leverage the
Local Computation: At the beginning of each self-revealing mechanism of the contract theoret-
training round, the model owner transmits a set of ic incentive mechanism design.
parameters to the workers, that is, data owners.
Then, each worker conducts model training using
its local data. tHe multI-dImensIonAl
Edge Aggregation in the Sky: The UAV vis-
its selected workers to collect the updated local
contrAct-mAtcHIng desIgn
model parameters for intermediate aggregation. uAv And model owner utIlItIes
Mobile Relay: The aggregated parameters are UAV types related to the FL task completion are
transmitted from the UAV to the model owner as follows.
for global aggregation across all subregions. To Traveling Cost: UAVs are required to trav-
enable efficient wireless transmission to the model el to the subregion during the task, and visit
owner, the UAV travels to a transmission point the worker nodes for reception of parameter
which has favorable channel conditions for com- updates and then travel to a transmission point
munication with the model owner (Fig. 2a). for wireless relay to the model owner. The ener-
The benefits of the proposed UAV-assisted gy required for this process varies according to
FL are as follows. First, the flexible deployment the propulsion power expended for traveling.
of UAVs implies that model owners are able to The propulsion power [12] is determined by the
obtain the training inputs of workers that are geo- required power to balance the parasitic drag
graphically distant from terrestrial communication caused by skin friction, and drag force of air redi-
infrastructure. Second, UAVs have higher transmit rection. These factors are variable according to
power than individual IoT devices. This increases the reference area, wing aspect ratio, and weight
the speed of model updating. Third, UAVs are able of the UAV.
to conduct mobile relaying, that is, the link distance Computation Cost: The UAVs are required
of reception and transmission of the model param- to perform the intermediate aggregation of local
eters is reduced. As such, UAVs enable a more reli- model parameters. The energy required depends
able connection, thus reducing the worker dropout on the UAV computation capacity and cycles per
rates during the training. Moreover, the privacy bit for computing.
of the workers is preserved since the UAVs and Communication Cost: The varying communi-
model owner only receive the model updates, rath- cation cost incurred by the UAV depends on the
er than the raw data of the workers. bandwidth allocation and transmission power.