You are on page 1of 18

Sustainable Cities and Society 78 (2022) 103643

Contents lists available at ScienceDirect

Sustainable Cities and Society


journal homepage: www.elsevier.com/locate/scs

Prediction and evaluation of spatial distributions of ozone and urban heat


island using a machine learning modified land use regression method
Li Han a, Jingyuan Zhao a, *, Yuejing Gao b, Zhaolin Gu c
a
School of Architecture, Chang’an university, Xi’an, Shaanxi, China
b
School of Architecture and Civil Engineering, Xi’an University of Science and Technology, Xi’an, Shaanxi, China
c
School of Human settlements & Civil Engineering, Xi’an Jiao Tong university, Xi’an, Shaanxi, China

A R T I C L E I N F O A B S T R A C T

Keywords: In summer, Ozone (O3) pollution and urban heat island (UHI) pose serious health risks to humans. To obtain the
Ozone spatial distributions of ozone and urban heat island in Xi’an in summer and develop a simultaneous control
Urban heat island strategy of ozone and urban heat island, the land use regression model is modified and improved using the
Land use regression
machine learning random forest algorithm. The LUR-Kriging-RF integrated prediction model is then established.
Random forest
Spatial distribution
The land use regression and kriging are used to extract the feature variables, while random forest is used to
establish a regression model. The spatial distribution maps of ozone and urban heat island in Xi’an are obtained
by regression mapping of the prediction model, and the spatial relationships between them are analyzed. The
SHapley Additive explanation (SHAP) and partial dependence plot (PDP) are adopted to explain the way feature
variables act on ozone and urban heat island. Based on the spatial distribution and interaction mode, a simul­
taneous control strategy of ozone and urban heat island in Xi’an is put forward. For ozone, the R2 of the inte­
grated prediction model (0.65) is higher than that of land use regression (0.4), while the RMSE (28.18) of the
integrated model is lower than that of land use regression (35.66). For temperature, the R2 of the integrated
model (0.93) is higher than that of land use regression (0.8), while its RMSE (0.92) is lower than that of land use
regression (1.52). The performance of the LUR-Kriging-RF integrated prediction model is better than that of land
use regression. This study reveals the spatial interactions between ozone and urban heat island in the central
urban areas. The suitable strategies for mapping ozone pollution and urban heat island control include reducing
VOCs emissions from industrial sources and agricultural sources, increasing plants with low VOCs emissions, and
spray humidification. This study can be used to evaluate ozone exposure and thermal exposure, provide scientific
support for environmental protection and urban heat island control policies, contribute to reducing public health
threats, promote the sustainability of urban environments, and promote the practical application of machine
learning in this field.

1. Introduction 2019, O3 was the major pollutant in 119 cities in China for more than half of
the year. The rising number of new buildings and roads in cities can further
Ozone (O3) can stimulate the human respiratory system and cardiopul­ increase the urban heat load. In the context of global warming, persistent
monary system. It can also enter human circulation system through blood, high temperature and heat wave in cities would increase the incidence of
and cause damage to the cardiovascular system as a strong oxidant (Car­ heatstroke, intestinal diseases and cardiovascular diseases (Abadie &
valho et al., 2021; Day et al., 2017; Li et al., 2018; Lyu et al., 2021; Wang et al., Polanco-Martinez, 2021; Bongioanni et al., 2021; Jung et al., 2021; Patz,
2022). In particular, long-term ozone exposure could increase the proba­ Campbell-Lendrum, Holloway & Foley, 2005; Wang et al., 2022; Wang, Cao
bility of asthma in children and affect the growth of children’s cardiopul­ & Yu, 2020a; Yang et al., 2021b). Each elderly person in China would endure
monary system (Gabriel, Paciência, Felgueiras, Rufo, Mendes, 2021; Li et al., an average of 13 more days of heat wave in 2019, and the risk of death of the
2019; Qin et al., 2021b; Qiu et al., 2021; To et al., 2021). During 2013–2019, elderly on hot days is 10.4% above that on non-hot days (Khan et al., 2021;
O3 concentration in 74 key cities in China increased by 28.8% on average. In Kollanus, Tiittanen & Lanki, 2021; Urban, Kysely, Plavcova, Hanzlikova &

* Corresponding author.
E-mail address: zjyqtt@chd.edu.cn (J. Zhao).

https://doi.org/10.1016/j.scs.2021.103643
Received 26 November 2021; Received in revised form 24 December 2021; Accepted 24 December 2021
Available online 27 December 2021
2210-6707/© 2021 Elsevier Ltd. All rights reserved.
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 1. The location and jurisdiction of Xi’an, and the monitoring stations with the city.

Stepanek, 2020; Weinberger et al., 2021). The urban heat island (UHI) will association of land use feature with pollutant concentration, which
significantly increase urban energy consumption, and the increase in energy could represent the spatial relationships between the air pollutant and
consumption will further aggravate atmospheric pollution, which is a serious other geographic variables. Following that, the spatial distribution of
obstacle to sustainable urban development (Xi, Ren, Wang, Feng & Cao, regional air pollutants in urban area could be well simulated, and the
2021). At this stage, the public is generally aware of the health hazards of causes of air pollution could be explained to some extent (Bitta,
PM2.5, but little attention has been paid to the health hazards of O3 (Cao et al., Pavlíková, Svozilík & Jančík, 2018; Han et al., 2020; Ji, Wang & Zhuang,
2017; Kumar et al., 2021a, 2021b; Meng, Cao, Kumar, Tang & Feng, 2021). 2019; Lee et al., 2017; Liu, Henderson, Wang, Yang & Peng, 2016; Miri,
The studies of thermal exposure or ozone pollution alone could not resolve Ghassoun, Dovlatabadi, Ebrahimnejad & Lowner, 2019). However, LUR
the complex urban climate issues and health threats, and thermal pollution is a linear regression that cannot sufficiently represent the nonlinear
and ozone are serious impediments to sustainable urban development relationships between variables (Chaturvedi & de Vries, 2021; Gao,
(Kalisa, Fadlallah, Amani, Nahayo & Habiyaremye, 2018; Parida et al., 2021; Wang, Li, Zheng & Peng, 2021, 2019; Liu, Chen, Wei, Wu & Li, 2021a).
Taha, 2015). The simultaneous treatment of O3 pollution and thermal To this end, random forest (RF) an emerging machine learning algorithm
exposure could effectively solve the urban climate issues. with high flexibility could improve the prediction accuracy with no
At present, land surface temperature (LST) inversion from remote significant demand on computing resources. RF is not sensitive to mul­
sensing data is considered a common method to predict temperature spatial tiple collinearities, while the results of the missing data and
distribution (Chen, Zhou, Hu & Zhou, 2020a; Despini et al., 2021; Dugord, non-equilibrium data are stable. Besides, RF is also good at learning
Lauf, Schuster & Kleinschmit, 2014; Ren, Zhou & Wang, 2021). However, complex and highly nonlinear relationships(Lary, Alavi, Gandomi &
huge differences exist between the retrieved land surface temperature and Walker, 2016; Lundberg et al., 2020; Tsanas & Xifara, 2012; Wen et al.,
temperature predicted based on meteorological stations. Compared with the 2022). Introduction of machine learning algorithm in LUR could yield
hourly temperature recorded by meteorological stations, remote sensing promising outcome in the field of pollutant prediction.
images are not continuous, and the inversion results can only represent the The development of machine learning algorithms in recent years has
land surface temperature for each day. Moreover, the retrieved land surface significantly increased their applications in the studies of air pollution
temperature is generally higher than the real temperature (Madanian, Sof­ prediction, including O3, NO2, PM2.5 and PM10 (Chen et al., 2021; Cihan,
fianian, Soltani Koupai, Pourmanafi & Momeni, 2018; Shi, Katzschner & Ng, Ozel & Ozcan, 2021; Hsu et al., 2020; Yang, Freni-Sterrantino, Fuller &
2018; Shi et al., 2019; Yang et al., 2021a). In this study, temperature data of Gulliver, 2020). The temperature prediction based on meteorological
meteorological stations were used to predict the average temperature dis­ monitoring data here can further contribute to the application of machine
tribution of the whole city for 59 consecutive days. Since the predicted learning in this field. Many previous studies confirmed that the prediction
temperature was used to correspond to the average concentration distribu­ accuracy of machine learning algorithm is higher than that of LUR, which
tion of O3 for the same period, the analysis of the underlying relationships resulted in the development of a variety of integrated algorithms for
between the feature variables is more reliable. pollution prediction (Chen et al., 2018; Gobakis et al., 2011; Ren, Mi &
To evaluate the pollutant concentrations in regions without moni­ Georgopoulos, 2020; Sun et al., 2021; Weichenthal et al., 2016). Sun et al.
toring stations, land use regression (LUR) can be used to combine the (2021) established a deep learning model structure based on long
pollutant concentration data from limited monitoring stations with short-term memory layers (LSTM). The model predicts the PM2.5 and O3
some features, (e.g., land use). The main feature of LUR is the effective distribution in Hong Kong, significantly reducing the prediction error.

2
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 2. Variation of monthly average O3 concentration for five monitoring stations in 2019.

Equere, Mirzaei, Riffat and Wang (2021) predicted UHI distribution in feature variables. After that, RF algorithm is used to build a prediction
Illinois, USA based on GIS and artificial neural network (ANN). The model for O3 and UHI. Then, the regression prediction model is used to
model’s prediction accuracy was significantly improved after adding obtain the spatial distributions of O3 and UHI mapping in Xi’an. Next,
topographic factors. Gobakis et al. (2011) predicted UHI in Athens using the analysis method of machine learning was used to explain the in­
finite data series, and neural network prediction methods can be an teractions of different feature variables of O3 and TEM. Finally, the
important tool for heat wave and peak energy load prediction in hot simultaneous control strategy for O3 and UHI is prosed. The analysis on
summers, contributing to energy supply and demand management. Many spatial relationships between O3 and UHI should provide scientific
different feature variables, including land use, land cover, emission support for environmental protection and planning departments to
sources, meteorological parameters and pollutant concentrations were establish policies for managing O3 pollution and UHI, reduce public
considered in previous studies (Di et al., 2019; Stafoggia et al., 2019; health risks, and promote sustainable development of urban climate and
Wang & Akbari, 2016; Wong et al., 2021). These studies mostly focused on environment. It could serve as a reference idea for the practical appli­
predicting the spatial distribution of pollutants and lack interpretation of cation of machine learning in this field.
prediction models and integration studies of O3 and UHI.
This study aims to establish a high-precision LUR-Kriging-RF inte­ 2. Methodology and materials
grated prediction model to obtain the spatial distributions of O3 and UHI
after regression mapping. By analyzing the interaction modes of the 2.1. Region and time period
feature variables of O3 and UHI, a simultaneous control strategy for O3
and UHI could be proposed. First, LUR and Kriging are used to obtain Xi’an an important central city in western China, is located in the

Fig 3. The frequency histograms of O3 and TEM.

3
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 4. An overview of the LUR-Kriging-RF integrated prediction method.

west of the Fenhe-Weihe Plain fault depression basin (as shown in and strong radiation in summer. Moreover, poor diffusion conditions of
Fig. 1). Its total area of Xi’an is 10,752 square kilometers. Its jurisdiction the landscape cause the accumulation of O3 in Xi’an.
is about 204 kms from east to west, and approximately 116 kms from According to the variation of monthly average O3 concentration at
north to south. It also has a permanent population of 10,037 million. The the five air quality monitoring stations in Fig. 2, O3 concentrations in
Fenhe-Weihe Plain was one of the most severe O3 polluted region in June, July and August are significantly higher than those in other
China. From 2014 to 2017, the number of days when summer O3 con­ months. Due to the absence of monitoring data before July 3rd, 2018,
centration exceeded the standard in Xi’an increased from 8 days to 61 the research time period is set between July 3rd, 2018 and August 31st,
days. Xi ’an is prone to ozone generation due to the high temperature 2018 in this study, except July 6 due to the missing data (59 days in

4
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 5. Diagrams of three extraction modes for the feature variables.

total). The distributions of 139 air quality monitoring stations and 134 2.3.4. Road traffic
meteorological stations are shown in Fig. 1. The daily average O3 con­ The road network data used in this work is from OpenStreetMap. The
centration is used in this work, while the hourly TEM is used to calculate road network is divided into five levels, namely motorway, primary,
the average daily TEM. Unlike the traditional UHI being defined as the secondary, tertiary and trunk. According to the road network level, three
urban-rural temperature difference, this study uses urban temperature feature variables of the road network are selected eventually in this
to represent the heat island (Shi et al., 2018). study, including all roads, main roads and highways. The distribution of
the road network is shown in Fig. A2.
2.2. Dependent variables
2.3.5. Geospatial information
Fig. 3 shows the histogram of the evaluation frequency of O3 and The geospatial information includes elevation, distances from
TEM, which is a statistical analysis of the sample data, including the various land uses and elements. The elevation data come from 30mDEM
sample size, mean, standard deviation. By multiplying the number of data provided by the Chinese academy of sciences. Distances from
stations with 59 days (the research period), 8024 dependent variables of various land uses are calculated using Arcgis10.4.
O3 and 7906 dependent variables of TEM are obtained.
2.3.6. Socioeconomic information
2.3. Feature variables The socioeconomic information includes GDP and population. The
GDP data comes from the kilometer grid data set of China’s GDP spatial
2.3.1. Air quality monitoring data distribution produced by the Chinese academy of sciences. The data are
Data recorded by the air quality monitoring stations include O3, in grid data type, while each grid represents the gross GDP output value
PM2.5 and PM10. Several studies have proven that PM2.5 and O3 are within the grid range (1 square kilometer), and the unit is ten thousand
strongly correlated, as shown by spatial distribution and chemical pro­ yuan/square kilometer. The GDP value of the grid where each moni­
cesses (Chen et al., 2019; Qin et al., 2021a; Zhu, Chen, Liao & Dang, toring station locates in GIS is collected. The population data are derived
2019). So PM2.5 and PM10 in summer are used as the feature variables of from the sixth national population census, and the total population of
O3 in summer. the neighborhood or town where the monitoring station locates is
calculated, respectively, as shown in Fig. A3.
2.3.2. Meteorological monitoring data
Data recorded by the meteorological monitoring stations are hourly 2.3.7. Emission sources
temperature, relative humidity, 10 min average wind speed and The air pollution emission information comes from the monitoring
maximum wind speed. The relative humidity, 10 min average wind information release platform of key polluters in Shaanxi Province and
speed and maximum wind speed are used as the feature variables. the directory of key polluters of Xi ’an Ecological and Environmental
Protection Bureau. This study uses 83 air pollution enterprises and their
2.3.3. Land use distribution is illustrated in Fig. A4.
The land use data were derived from the remote sensing monitoring
database of land use status in China (provided by the Chinese academy 2.3.8. Point of interest (POI)
of sciences). The database uses the Landsat TM/ETM/OLI remote All POI is derived from Amap, which mainly includes restaurants,
sensing images as the main data source. After image fusion, geometric parking lots, bus stations, gas stations, factories and printing factories
correction, image enhancement and image splicing processing, China’s (See Fig. A5).
land use type were classified into 6 first-class types, 25 s-class types and
some third-class types based on the land use/cover classification system
2.4. LUR-Kriging-RF integrated prediction method
of China using the interactive visual interpretation method. Five land
types were used in this study, including cultivated land (paddy field +
The modeling process of the LUR-Kriging-RF integrated prediction
dry land), green land (forest land + grassland), water area (river canal +
method is shown in Fig. 4. This study uses LUR to obtain feature vari­
lake + reservoir), industrial and mining land (factory and mine + in­
ables and applies different buffers (100 m, 300 m, 500 m, 1000 m, 2000
dustrial area + airport), and construction land (urban land + rural
m, 3000 m, 4000 m, and 5000 m) to extract the land area, road length,
residential area). Fig. A1 shows the distribution of these 5 land types.
and number of POI (Han et al., 2020; Shi et al., 2019). The distances
between monitoring stations and various nearest elements are calcu­
lated using Arcgis10.4. Notably, these variables would not change over

5
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 6. Spearman coefficients between O3 and the feature variables of different buffers.

time (Lee et al., 2017; Miri et al., 2019). Fig. 5 shows the different comparing the real and predicted values to obtain the accuracy of the
feature variables extraction methods. The Kriging interpolation method verification model (Breiman, Leo & Raymond, 2001; Dietterich, 2000).
is adopted to interpolate meteorological parameters or air pollutants on After establishing the integrated prediction model, a regression
different dates to obtain grid graphs (Hsu et al., 2020). Subsequently, the mapping is performed. In Arcgis10.4, the fishing net creation tool is used
meteorological parameters or air pollutant values at different stations to establish a square grid of 500 m × 500 m within the city area of Xi’an.
are extracted. Note that these feature variables would change over time. The total number of grids is 41,281. The grid centroid is used as the
Taking Software New Town station as an example, we firstly select a center of the circle to create the buffers after extraction of feature var­
3000 m buffer, and then extract the data of industrial land area, road iables. Then, we compute the distance from the grid centroid to different
length and factory number. The input elements represent the station, elements. Meanwhile, the grid centroid is used to extract the corre­
while the adjacent elements represent the POI, road and land use. Next, sponding meteorological data and air pollutant concentrations. The
the distances between the station and adjacent elements are calculated. feature variables of all 41,281 grids are channeled into the model in
After obtaining the feature variables, the bivariate correlation anal­ daily units to obtain all the daily O3 concentrations and TEM in each
ysis function of SPSS version 23.0 is applied to get the Spearman coef­ grid. After that, the mean values of O3 concentration and TEM for 59
ficient between the dependent variable, including O3, TEM and the days are calculated. The obtained data are then assigned in Arcgis 10.4
feature variables extracted through the buffer. For each feature variable, to get the spatial distributions of O3 and UHI.
it would be incorporated into the prediction model when the variable
has the largest coefficient in a corresponded buffer. Since most feature
2.5. Model interpretation
variables do not follow a normal distribution, and Spearman rank cor­
relation coefficients are nonparametric patterns of Pearson’s correlation
2.5.1. SHapley additive explanation (SHAP)
coefficients, Spearman coefficient is used in this work. Moreover, ac­
SHAP is a tree model interpretation method used for machine learning.
cording to the rank of the data, Spearman coefficient (recorded as “r”)
The key idea of SHAP is to first calculate the marginal contribution of all
applies to equal-interval data that do not satisfy the normal distribution
features to the model output, and then interpret the "black box model" from
assumption (Bishara & Hittner, 2012; Rebekić, Lončarić, Petrović &
both global and local levels(Parsa, Movahedi, Taghipour, Derrible &
Marić, 2015). The value of r ranges from − 1 and +1, a larger absolute
Mohammadian, 2020; Rodriguez-Perez & Bajorath, 2020; Vega García &
value of r indicates higher correlations.
Aznarte, 2020). The biggest advantage of SHAP value is that it can represent
After filtering the feature variables, we use scikit-learn 0.24.2 package
the influence of features in each sample. In each predicted sample, the model
in Python 3.6.13 to establish a random forest prediction model. Before
could generate a predicted value. SHAP value is the value assigned to each
modeling, we need to divide the data into training set and test set, the
feature in the sample, and it could also show both positive and negative in­
training set is used for model tuning parameters and model training, and
fluences. The functions of SHAP package include: output the feature
the test set is used for testing the trained model. Parameters need to be
importance ranking and feature density scatter diagram of all samples. In the
adjusted before training the model, including the number of subtrees
diagram of feature importance ranking, X-axis represents SHAP value
(n_estimators), the maximum growth depth of trees (max_depth), the
regardless of positive and negative signs. Each point in the scatter plot of
minimum sample number of leaves (min_samples_leaf), and the minimum
feature density represents a sample, while the X-axis represents the SHAP
sample number of branch nodes (min_samples_split) and the maximum
value. A negative (positive) SHAP value indicates negative (positive) influ­
number of features (max_features). The number of trees is first coarsely
ence. The value of Y-axis represents the magnitude of feature variables.
adjusted, and the learning curve is used to determine the approximate
range of n_estimators before fine-tuning the number of trees, and the best
2.5.2. Partial dependence plot (PDP)
n_estimators are determined within the determined range, with the
PDP is an interpretation method for machine learning model,
highest score being the optimal parameter. After adjusting the parameters
showing how single feature variables can affect the prediction. PDP
and building a random forest regression model with six parameters, the
could be used to observe the marginal effect of a single variable on the
accuracy of the model was then verified using a test set, inputting the
predicted value, assuming other characteristics remain unchanged. Due
feature variables of the test set into the model to predict the dependent
to the nonparametric characteristics of many machine learning algo­
variable, visualizing the real and predicted values of the test set, and
rithms, PDP can reveal linear and nonlinear characteristics which are

6
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Table 1
Descriptive statistics of feature variables of O3.
Category Subcategory Buffer Unit Average value Standard deviation Minimum Maximum
2
Land use Industrial and mining land 5000m km 2.96 3.61 0 13.67
Built-up area 3000m km2 13.28 10.25 0 28.27
Water body 5000m km2 2.03 3.00 0 14.61
Green space 5000m km2 11.15 12.45 0 71.03
Cultivated land 300m km2 0.06 0.09 0 0.28
Road traffic Motorway 5000m km 18.16 20.16 0 81.29
All road 5000m km 193.48 123.21 12.44 417.01
Major road 5000m km 90.94 63.12 0 201.47
Parking lot 5000m 1 1118 1614 0 5080
Bus station 5000m 1 179 146 0 440
Emission source Air-polluting enterprise 5000m 1 2 2 0 9
Restaurant 5000m 1 3421 4673 1 14,313
Factory 5000m 1 43 31 0 108
Gas station 5000m 1 15 11 0 45
Printing factory 5000m 1 13 19 0 71
Social economy Population n.a Person 60,473 45,082 2264 223,840
Gross domestic product n.a 10k yuan/km2 43,226.47 66,697.77 310 240,496
Geographical space Elevation n.a m 459.57 145.64 349 1124
Distance to all roads n.a m 153.32 208.3 1.57 1409.95
Distance to major road n.a m 880.11 1526 2.1 8288.13
Distance to motorway n.a m 4699.64 5076.44 24.69 43,354.11
Distance to ridge line n.a m 13,116.5 10,271.82 − 30,048.78 33,999.11
Distance to parking lot n.a m 1561.37 2968.89 7.02 16,741.65
Distance to bus station n.a m 744.04 2554.46 0.98 25,846.64
Distance to restaurant n.a m 274.24 492.76 1.29 3675.16
Distance to green space n.a m 938.21 1164.16 0 6708.48
Distance to water body n.a m 2356.2 3037.53 0 25,958.74
Distance to cultivated land n.a m 1090.02 1587.71 0 5777.73
Distance to air-polluting enterprise n.a m 5057.44 6553.5 97.98 44,126.2
Distance to industrial and mining land n.a m 3055.57 3235.68 0 26,436.18
Distance to printing factory n.a m 4994.77 6754.8 39.28 43,108.18
Distance to gas station n.a m 1249.07 1266.2 76.61 9043.61
Pollutant Inhalable particulate matter n.a μg/m3 49.67 17.35 2.6 148.15
Fine particulate matter n.a μg/m3 27.83 9.5 6.5 61.87
Meteorological parameters Average temperature n .a ◦
C 28.36 3.29 18.3 35.56
Maximum temperature n .a ◦
C 33.45 4.42 19.57 41.29
Average wind speed n.a m/s 1.17 0.43 0.3 3.23
Maximum wind speed n.a m/s 1.68 0.51 0.4 4.05
Humidity n.a %RH 69.56 11.66 44.26 98.92

easy to comprehend. PDP is simple and intuitive. When PDP curve of a 3. Results
feature is horizontal or irregular jitter, this feature may be useless. When
PDP curve of a feature is very steep, it indicates that the contribution of 3.1. Feature variables
this feature is relatively large (Apley & Zhu, 2020; Scarpone et al., 2020;
Van Nguyen, Zhou, Chong, Li & Pu, 2020; Zhao, Lovreglio & Nilsson, 3.1.1. Selecting of feature variables for O3
2020). The feature variables of O3 are divided into seven classes including
land use, road traffic, emission sources, socioeconomic information,

Fig 7. Spearman coefficient between TEM and the feature variables of different buffers.

7
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Table 2
Descriptive statistics of feature variables of TEM.
Category Subcategory Buffer Unit Average value Standard deviation Minimum Maximum
2
Land use Industrial and mining land 5000m km 2.24 3.55 0 14.378
Built-up area 3000m km2 22.20 22.65 0 78.30
Water body 5000m km2 1.81 23.06 0 14.76
Green space 5000m km2 14.47 17.74 0 76.72
Cultivated land 5000m km2 39.94 20.93 0 68.57
Road traffic Motorway 5000m km 16.76 19.29 0 87.48
All roads 5000m km 138.423 108.59 67.36 407.16
Major road 5000m km 63.91 56.20 0 193.54
Parking lot 5000m 1 549 1210 0 5073
Bus station 5000m 1 109 1259 0 426
Emission source Air-polluting enterprise 5000m 1 1 2 0 8
Restaurant 5000m 1 1700 3467 0 14,335
Factory 3000m 1 10 12 0 56
Gas station 5000m 1 10 9 0 46
Printing factory 5000m 1 6 13 0 61
Social economy Population n.a Person 48,321 46,311 2264 223,840
Gross domestic product n.a 10k yuan/km2 20,309.26 48,332.71 308 221,030
Geographical space Elevation n.a m 509.61 203.2 341 1545
Distance to all roads n.a m 169.22 287.59 0.64 1615.97
Distance to major road n.a m 1386.39 2042.04 11.4 8603.34
Distance to motorway n.a m 5731.33 6134.88 64.88 43,351.15
Distance to ridge line n.a m 9753.42 11,529.83 − 30,063.11 34,558.94
Distance to parking lot n.a m 2991.13 3705.58 11.81 21,239.43
Distance to bus station n.a m 1426.94 3282.55 0.82 25,839.5
Distance to restaurant n.a m 535.76 1032.81 3.92 7058.78
Distance to green space n.a m 1062.83 1406.27 0 7847.73
Distance to water body n.a m 3044.7 4077.36 0 27,170.62
Distance to cultivated land n.a m 496.47 1041.13 0 6146.96
Distance air-polluting enterprise n.a m 7949.51 8166.82 552.55 44,136.03
Distance to industrial and mining land n.a m 4031.1 4104.09 0 26,760.43
Distance to printing factory n.a m 7487.15 7751.17 122.74 43,121.62
Distance to gas station n.a m 1820.72 2368.71 46.82 18,842.52
Pollutant Inhalable particulate matter n.a μg/m3 46.93 16.71 2.6 140.25
Fine particulate matter n.a μg/m3 26.87 9.48 6.5 60.23
Ozone n.a μg/m3 145.27 46.85 9 310
Meteorological parameters Average wind speed n.a m/s 1.24 0.45 0.29 3.45
Maximum wind speed n.a m/s 1.76 0.53 0.4 4.15
Humidity n.a %RH 70.84 11.51 41.42 99.12

geographical information, pollutant and meteorological parameters. also increase. The variables with the largest Spearman coefficient of
Fig. 6 shows the Spearman correlation coefficients between O3 and the each class are selected as this class’s feature variable and entered into
feature variables of different buffers. The X-axis represents the different the prediction model. The Spearman coefficients for the buffer zone of
buffers. For example, we extract the number of polluting enterprises in 5000 m are the largest for all the feature variables except for built-up
each of the eight buffer zones and calculate the Spearman correlation land and factory. A total of 15 feature variables are extracted through
coefficients between polluting enterprises and O3 concentration. Y-axis buffers, and 38 feature variables are finally inserted into the prediction
represents the Spearman correlation coefficients between O3 and the model. Descriptive statistics of feature variables are shown in Table 2,
feature variables in different buffer zones. The overall view of the graph including the mean value, standard deviation, minimum value and
shows that as the buffer increases, the Spearman correlation coefficients maximum value of feature variables.
of each feature variable and O3 also increase. The variables in each class
with the largest Spearman coefficients is selected and inserted into the
prediction model as a feature coefficient. The Spearman coefficients for 3.2. Model prediction
the buffer zone of 5000 m are the largest for all the feature variables
except for built-up land and cultivated land. A total of 15 feature vari­ 3.2.1. Model parameters
ables are chosen based on the buffers. Finally, 39 feature variables are Before parameter adjustment, O3 and TEM data sets are divided into
selected and entered into the prediction model. Descriptive statistics of training set (80% of data) and test set (20% of data). For O3, 6419 data
feature variables are shown in Table 1, including the mean, standard sets are in the training set, while 1605 data sets are in the test set. For
deviation, minimum and maximum values of the feature variables. TEM, 6324 data sets are in the training set, and 1582 data sets are in the
test set. The training set is used for the parameter adjustment, while the
3.1.2. Selecting of feature variables for TEM test set is used for the model testing. Table 3 shows the model adjust­
Fig. 7 shows the Spearman correlation coefficients between TEM and ment results.
the feature variables of different buffers. The X-axis represents the
different buffers. For example, we extract the number of polluting en­ 3.2.2. Model evaluation
terprises in each of the eight buffer zones and calculate the Spearman We use the same feature variables and dependent variables to build
correlation coefficients between the number of polluting enterprises and multivariate linear regression (MLR) model, LUR and RF model. The
TEM concentration. Y-axis represents the Spearman correlation co­ coefficient of determination (R2), mean square error (MSE), root mean
efficients between TEM and the feature variables in different buffer square error (RMSE) and mean absolute error (MAE) of each model is
zones. The overall view of the graph shows that as the buffer increases, calculated and presented in Table 4. The R2 value of RF model is greater
the Spearman correlation coefficients of each feature variable and TEM than those of MLR and LUR, indicating that RF regression feature vari­
ables have the better explanatory capacity and data fitting for O3 and

8
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Table 3 distribution of O3 shows a high concentration in the south and a low


Parameters for O3 and TEM prediction models. concentration in the north (see Fig. 10). O3 concentration in the north is
Parameter O3 TEM higher than that in the southern mountainous area. Except for the high
n_estimators 270 287 O3 concentration regions in the central city, Zhouzhi County and Huyi
max_depth 29 29 District in the west exhibit high O3 concentration regions. The 38 feature
min_samples_leaf 1 1
variables obtained after extraction are entered into the random forest
min_samples_split 2 2
max_features 33 27 regression prediction model to predict the TEM of 41,281 grids for 59
consecutive days. The average TEM of each grid for 59 consecutive days
was calculated as the final TEM value of the grid. When mapping the
spatial variation of UHI, the interval in high TEM region is set to 0.5 ◦ C,
Table 4
Comparisons of the MLR, LUR and RF models for O3 and TEM.
while the interval in low TEM region is set to 1.5 ◦ C (as shown in
Fig. 12). According to the UHI distribution mapping of Xi’an, the high
Dependent variable Evaluating indicator MLR LUR RF
temperature regions in the central and northeastern parts of the city
O3 R2 0.4 0.4 0.65 spread to the northeast, and high TEM region occurs in some areas of
MSE 1361.55 1364.25 794.07 Gaoling, Lintong and Yanliang in the northeastern part of the main city.
RMSE 35.66 36.93 28.18
MAE 28.26 28.34 20.31
TEM R2 0.8 0.8 0.93
MSE 2.3 2.29 0.85 3.4. Contributions of feature variables
RMSE 1.52 1.51 0.92
MAE 1.21 1.21 0.7 According to the ranking of importance on the left of Fig. 13, except
for PM2.5 and PM10, feature variables that vary over time are signifi­
TEM than MLR and LUR. The MSE, RMSE and MAE values of RF are cantly more important than those that do not vary over time. The density
smaller than MLR and LUR, indicating the smaller error of the RF scatters diagram on the right also shows that the sample distributions of
regression model. Figs. 8 and 9 show the real (black dots) and predicted feature variables of meteorological parameters are more dispersed,
(red dots) values of the test set of O3 and TEM in the random forest indicating the greater impact of the feature variables. Among the time-
prediction model. varying feature variables (see their definitions in Table 1), humidity is
the most important feature variable. The blue low humidity sample on
the right side has a significant positive effect on O3. The maximum
3.3. Spatial distributions of O3 and UHI temperature is the second important feature variable, a large number of
red high temperature samples are located on the right, indicating that
The 39 feature variables obtained after extraction are channeled into high temperature has a significant positive effect on O3. The maximum
the random forest regression prediction model to predict the O3 con­ wind speed and average speed wind are the third and fourth important
centrations of 41,281 grids for 59 consecutive days. The average O3 feature variables, respectively. The two feature variables of red sample
concentration of each grid for 59 consecutive days is calculated as the high wind speed on the left side, indicate the obvious negative influence
final concentration value of the grid. The O3 spatial distribution map­ on the O3 concentration. The average temperature is the fifth important
ping is plotted with an interval of 10 μg/m3 (see Fig. 11). The feature variable. Similar to the maximum temperature, high

Fig 8. Predicted values for O3 test sets.

Fig 9. Predicted values for TEM test sets.

9
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 10. Regional distribution map of Xi’an.

Fig 11. Spatial distribution of O3 in Xi’an.

temperature samples located on the right side indicate a significant the density scatters diagram on the right suggests that the sample dis­
positive effect on the O3 concentration. Unlike the maximum tempera­ tributions of meteorological parameters and feature variables of pol­
ture, abnormal high temperature sample is absent on the left side. lutants are more dispersed, indicating huge influence of the feature
Among the variables that do not vary over time, the top five in impor­ variables. Among the time-varying feature variables, humidity is the
tance ranking are elevation, gross domestic product, green space (5000 most important feature variable, which has a significant negative effect
m buffer), distance to cultivated land, and distance to printing factory, on temperature. The maximum wind speed is the second important
which negatively influence the O3 concentration. feature variable, which has a significant positive effect on TEM. O3
From the ranking of importance on the left side of Fig. 14, it can be concentration is the third important feature variable, and its high value
seen that in addition to elevation, feature variables that vary over time samples are distributed on the left and right sides. PM10 is the fourth
are more important than those that do not vary over time. Horizontally, important feature variable, and it has a significant positive effect on

10
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 12. Spatial distribution of UHI in Xi’an.

Fig 13. Importance ranking diagram (left) and density scatter diagram (right) of feature variables for O3.

TEM. PM2.5 is the fifth important feature variable, where most samples also confirmed (Chen et al., 2018; Ren et al., 2020; Weichenthal et al.,
with high values are distributed on the right side, and a small number of 2016). The two models performed well in the test set, except for a few
samples are distributed on the left side. The top five feature variables values (due to overestimation/underestimation), most of the predicted
that do not vary over time are elevation, cultivated land (5000 m buffer), values fitted the real values well. Meanwhile, the integrated prediction
green space (5000 m buffer), distance to ridge line and distance to water model demonstrated high operation efficiency and could further predict
body. the spatial distributions of O3 and UHI at a finer scale. Compared with
linear models that simply explain the relationships between feature
4. Discussions variables and dependent variables, coefficient represents the importance
of the feature. The larger the coefficient is, the more important the
4.1. Advantages of the LUR-Kriging-RF integrated prediction model feature is(Lee et al., 2017; Meng et al., 2016; Miri et al., 2019). SHAP
value, the visual tree model interpretation tool for machine learning, can
The LUR-Kriging-RF integrated prediction model modified by ma­ identify the important feature variables and represent the influence of
chine learning has better performance, higher fitting accuracy R2 and features in each sample (Vega García & Aznarte, 2020). The PDP can
less error than LUR. The integrated prediction model could cover the show how a single feature variable would affect the prediction (Zhao
inability of LUR to fit nonlinear relations, which previous studies have et al., 2020).

11
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 14. Importance ranking diagram (left) and density scatter diagram (right) of feature variables for TEM.

4.2. Spatial relationship between O3 and UHI selected to regulate O3 concentration and TEM. Three feature variables
can be adjusted for TEM control, and PDPs of the 3 feature variables are
At the municipal level, there are two obvious pollution regions of O3 plotted in Fig. 15. The higher the relative humidity is, the lower the TEM
concentration in Xi’an city. One is along the north-south direction of the is. Spraying water in summer is an effective method to reduce TEM
central urban area. The high O3 concentration region in the central urban (Sussman, Dai & Roundy, 2021; Wang et al., 2020c). The larger the
area is consistent with the high TEM area, while the high O3 concentration green area in the 5000 m buffer region, the lower the TEM is. The effect
and high TEM outside the central urban area do not overlap, indicating that of green space on reducing TEM is significant (Gao, Zhao & Han, 2022;
the influence of high TEM on O3 will appear only when there are enough Huang et al., 2021; Ke, Men, Zhou, Li & Zhu, 2021; Min, Lin, Duan, Jin &
emission sources. There is a "lowland" of O3 concentration in the central Zhang, 2019; Pascal et al., 2021; Xiao, Dong, Yan, Yang, & Xiong, 2018).
urban area in the town, i.e., the O3 concentration in the area is much lower Surprisingly, TEM increases with the increase of wind speed, which may
than the surrounding regions, which is caused by the titration of NOx to O3. be attributed to low wind speed in summer do not reduce TEM.
The large traffic in this area can result in higher NOx emissions. The NOx
species can consume atmospheric radicals. Since the radicals are important
precursors of O3 formation, the decrease of precursor substance can reduce 4.4. Feature variables for O3 reduction
the production of O3 (Huang et al., 2018; Tan et al., 2018; Xie et al., 2016).
The other one is located in the north of Zhouzhi County and Huyi District. The A total of 6 feature variables of O3 can be regulated, and PDPs of the
O3 pollution region in north of Zhouzhi and Huyi connects with Xianyang 6 feature variables are plotted in Fig. 16The higher the average wind
city. This area is far away from the central city, industrial source and regional speed is, the lower the O3 concentration is, thus improving urban
transportation in this area could can high concentration of O3 concentration. ventilation can significantly reduce O3 concentration (He et al., 2021;
Controlling industrial source emissions and regional joint management is the Romshoo, Bhat & Beig, 2021). The higher the average TEM is, the higher
key to O3 prevention and control in this region. the concentration of O3 is, indicating that the reduction of TEM can
reduce the O3 concentration, and it is necessary to coordinate the
management of O3, high temperature and heat wave (Fallmann, Forkel
4.3. Feature variables for UHI mitigation & Emeis, 2016; Liu et al., 2021b; Wai, Tan, Morakinyo, Chan & Lai,
2020; Wang et al., 2018; Zhu et al., 2015). The higher the humidity is,
The importance ranking diagram and density scatter diagram indi­ the lower the concentration of O3 is. Since humidification is an effective
cated that some feature variables could affect the dependent variables way to reduce O3, spraying water in summer can reduce TEM and dust,
(see Figs. 13 and 14). Nevertheless, they only have very little effect on and also inhibit O3 pollution (Chen et al., 2020b; Lee, Kahng, Kim &
the dependent variables. These feature variables are of little significance Park, 2018; Li et al., 2021; Niu et al., 2018; Yu, 2018). The larger the
in regulating of O3 and TEM. Therefore, only some feature variables are area of industrial lands in the buffer of 5000 m, the higher the

Fig 15. PDPs for TEM and feature variables.

12
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig 16. PDPs for O3 concentration and feature variables.

concentration of O3. It is known that industrial sources (e.g. surface increased green space, while replacing the high VOCs emission tree
coating industry, furniture manufacturing industry, packaging and species in an orderly manner; thirdly, cities should spray and increase
printing industry, rubber and plastic products industry, non-metallic humidity, which not only can reduce dust and temperature, but also
mineral products industry, architectural coating industry and auto inhibit ozone generation; fourthly, to improve urban ventilation, which
repair industry) are O3 emission sources (He et al., 2019; Li et al., 2017; is an effective way to reduce O3 and UHI. The synergistic management of
Liu, Zhang, Han, Li & Chen, 2019; Wang, Wang & Ying, 2020b). O3 and UHI is an important means to achieve sustainable urban envi­
Therefore, some adjustment of industrial land use is necessary. The ronment and promote sustainable urban development.
closer to the cultivated land, the higher the concentration of O3 is. The
nitrification reaction and partitioning of soil microorganisms in agri­
4.6. Limitations and future work
cultural sources can generate NOx, and the usage of pesticide can
generate VOCs (Lee et al., 2020; Potard et al., 2017). Since NOx and
The main land use data applied here are remote sensing images. Due
VOCs are both precursors of O3, and controlling agricultural sources is
to the lag of data source collection time, there is a certain difference
conducive to reducing O3 concentration. When the green space area in
between land use status during pollutant data collection, which could
the 5000 m buffer is less than 5 km2, the increase of green space area has
lead to errors in correlation analysis between feature variables and
a significant effect on the decline in O3 concentration. When the green
dependent variables. In the research stage, the random forest algorithm
space area is larger than 5 km2, the downtrend becomes small. However,
achieved good results, so the other machine learning algorithms were
previous studies have demonstrated that more green space can signifi­
not considered in this work. We plan to compare different algorithms in
cantly increase O3 concentration, and plants can emit VOCs for defense,
future research. Moreover, this study considered the whole city level,
pollination and communication (Cai, Zhuang & Ren, 2020; Churkina,
which is relatively board scale. In the next stage, we will examine the
Grote, Butler & Lawrence, 2015; Ren et al., 2017b). The reason for this is
simultaneous treatment strategy of O3 and UHI in central urban areas at
that the contribution of VOCs from plant sources to O3 in summer is
the mesoscale level, so as to build a more complete treatment system.
much lower than those of anthropogenic sources. Previous studies also
proved that a large number of VOCs from plant sources in the mountains
in winter play a leading role, and their contribution is greater than those 5. Conclusion
of anthropogenic sources. Therefore, plants with low VOCs emission
should be used to increase green space area (Han, Zhao & Gu, 2021; Ren In this paper, the machine learning random forest algorithm was
et al., 2017a). applied to improve the LUR model, which could overcome the disad­
vantages of LUR in nonlinear fitting. A non-linear regression model with
higher accuracy and smaller error was built and predicted for O3 and
4.5. O3 and UHI synergistic governance strategy UHI. The R2 of O3 model increased from 0.4 to 0.65, and that of TEM
model increased from 0.8 to 0.93. The newly established model suffi­
UHI leads to high local temperatures in cities, and high temperatures ciently predicted the spatial distributions of O3 concentration and UHI.
significantly enhance O3. VOCs and nitrogen oxides formed by vehicle Some machine learning interpretation methods were applied to interpret
exhaust catalyze O3 production under the combined effect of high the influence of part feature variables on O3 concentration and TEM.
temperatures and light radiation, and the higher, the temperature the Eventually, the collaborative governance strategy for O3 and UHI was
faster the production (Abeleira & Farmer, 2017; Berezina et al., 2020; developed under the integrated interpretation framework, which con­
Jaén, Udina & Bech, 2021). This is also evidenced by the increase in O3 tributes to sustainable urban development.
with temperature in Fig. 16, so mitigating UHI is an important means of Whilst LUR is still considered a classical spatial prediction model, the
reducing O3 pollution. The strategy for the synergistic management of introduction of machine learning can push forward this research field.
O3 and UHI is firstly to reduce emission sources, including industrial and The relationships between feature variables and the dependent variables
agricultural sources; secondly, to increase the area of green space, and to (including O3, TEM) can be obtained by interpreting machine learning
choose plants with low VOCs emission and high shading effect for the models based on the SHAP and PDP methods. The demonstration here

13
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

highlights the practical value of applying machine learning models Acknowledgments


rather than just staying at the prediction level. There are spatial in­
teractions between O3 concentration and UHI in the central urban area This study was supported by the Scientific and Technological Inno­
in summer. The collaborative management of the two variables is vation Team in Key Fields of Shaanxi Province (No. 2020TD-029); the
essential. It is important to continuously expand green space, (by Innovation Team Support Project of Central University Fund
selecting tree species with low VOCs emissions), increase humidity and (300102411401); the National Natural Science Foundation of China
reduce emission sources. (No. 11872295); the Projects of International Cooperation and Ex­
changes NSFC (No. 41861144021). The authors would like to express
Declaration of Competing Interest their gratitude to EditSprings (https://www.editsprings.com/) for the
expert linguistic services provided.
The authors report no declarations of interest.
Appendix A

See Figs. A1–A5.

Fig A1. Land use map of Xi ’an.

Fig A2. The road networks of Xi’an.

14
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig A3. Population distribution of Xi’an.

Fig A4. Locations of eighty-three air-polluting enterprises in Xi’an.

15
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Fig A5. Locations of printing factories, gas stations and factories in Xi’an.

References sensing, meteorological and land use information. Science of the Total Environment,
636, 52–60. https://doi.org/10.1016/j.scitotenv.2018.04.251
Chen, J., Shen, H., Li, T., Peng, X., Cheng, H., & Ma, A. C. (2019). Temporal and spatial
Abadie, L. M., & Polanco-Martinez, J. M. (2021). Sensitivities of heat-wave mortality
features of the correlation between PM2.5 and O3 concentrations in China.
projections: Moving towards stochastic model assumptions. Environmental Research,
International Journal of Environmental Research and Public Health, 16. https://doi.org/
204, Article 111895. https://doi.org/10.1016/j.envres.2021.111895
10.3390/ijerph16234824
Abeleira, A. J., & Farmer, D. K. (2017). Summer ozone in the northern Front Range
Chen, M., Zhou, Y., Hu, M., & Zhou, Y. (2020a). Influence of urban scale and urban
metropolitan area: Weekend–weekday effects, temperature dependences, and the
expansion on the urban heat island effect in metropolitan areas: Case study of
impact of drought. Atmospheric Chemistry and Physics, 17, 6517–6529. https://doi.
Beijing–Tianjin–Hebei urban agglomeration. Remote Sensing, 12. https://doi.org/
org/10.5194/acp-17-6517-2017
10.3390/rs12213491
Apley, D. W., & Zhu, J. Y. (2020). Visualizing the effects of predictor variables in black
Chen, Z., Li, R., Chen, D., Zhuang, Y., Gao, B., Yang, L., et al. (2020b). Understanding the
box supervised learning models. Journal of the Royal Statistical Society Series B-
causal influence of major meteorological factors on ground ozone concentrations
Statistical Methodology, 82, 1059–1086. https://doi.org/10.1111/rssb.12377
across China. Journal of Cleaner Production, 242. https://doi.org/10.1016/j.
Berezina, E., Moiseenko, K., Skorokhod, A., Pankratova, N. V., Belikov, I., Belousov, V.,
jclepro.2019.118498
et al. (2020). Impact of VOCs and NOx on ozone formation in Moscow. Atmosphere,
Churkina, G., Grote, R., Butler, T. M., & Lawrence, M. (2015). Natural selection? Picking
11. https://doi.org/10.3390/atmos11111262
the right trees for urban greening. Environmental Science & Policy, 47, 12–17. https://
Bishara, A. J., & Hittner, J. B. (2012). Testing the significance of a correlation with
doi.org/10.1016/j.envsci.2014.10.014
nonnormal data: Comparison of Pearson, Spearman, transformation, and resampling
Cihan, P., Ozel, H., & Ozcan, H. K. (2021). Modeling of atmospheric particulate matters
approaches. Psychological Methods, 17, 399–417. https://doi.org/10.1037/a0028087
via artificial intelligence methods. Environmental Monitoring and Assessment, 193,
Bitta, J., Pavlíková, I., Svozilík, V., & Jančík, P. (2018). Air pollution dispersion modeling
287.. https://doi.org/10.1007/s10661-021-09091-1
using spatial analyzes. ISPRS International Journal of Geo-Information, 7. https://doi.
Day, D. B., Xiang, J., Mo, J., Li, F., Chung, M., Gong, J., et al. (2017). Association of
org/10.3390/ijgi7120489
ozone exposure with cardiorespiratory pathophysiologic mechanisms in healthy
Bongioanni, P., Del Carratore, R., Corbianco, S., Diana, A., Cavallini, G.,
adults. JAMA Internal Medicine, 177, 1344–1353. https://doi.org/10.1001/
Masciandaro, S. M., et al. (2021). Climate change and neurodegenerative diseases.
jamainternmed.2017.2842
Environmental Research, 201, Article 111511. https://doi.org/10.1016/j.
Despini, F., Ferrari, C., Santunione, G., Tommasone, S., Muscio, A., & Teggi, S. (2021).
envres.2021.111511
Urban surfaces analysis with remote sensing data for the evaluation of UHI
Breiman, L. B., Leo, Cutler, & Raymond, A. (2001). Random forests. Machine Learning, 45
mitigation scenarios. Urban Climate, 35. https://doi.org/10.1016/j.
(1), 5–32.
uclim.2020.100761
Cai, L., Zhuang, M., & Ren, Y. (2020). Spatiotemporal characteristics of NO2, PM2.5 and
Di, Q., Amini, H., Shi, L., Kloog, I., Silvern, R., Kelly, J., et al. (2019). An ensemble-based
O3 in a coastal region of southeastern China and their removal by green spaces.
model of PM2.5 concentration across the contiguous United States with high
International Journal of Environmental Health Research, 1–17. https://doi.org/
spatiotemporal resolution. Environment International, 130, Article 104909. https://
10.1080/09603123.2020.1720620
doi.org/10.1016/j.envint.2019.104909
Cao, S. J., Kong, X. R., Li, L., Zhang, W., Ye, Z. P., & Deng, Y. (2017). An investigation of
Dietterich, T. G. (2000). An experimental comparison of three methods for constructing
the PM2.5 and NO2 concentrations and their human health impacts in the metro
ensembles of decision trees: Bagging, boosting, and randomization. Machine
subway system of Suzhou, China. Environmental Science: Processes & Impacts, 19,
Learning, 40(2), 139–157.
666–675. https://doi.org/10.1039/c6em00655h
Dugord, P.-. A., Lauf, S., Schuster, C., & Kleinschmit, B. (2014). Land use patterns,
Carvalho, R. B., Marmett, B., Dorneles, G. P., da Silva, I. M., Romao, P. R. T., da Silva
temperature distribution, and potential heat stress risk – The case study Berlin,
Junior, F. M. R., et al. (2021). O3 concentration and duration of exposure are factors
Germany. Computers, Environment and Urban Systems, 48, 86–98. https://doi.org/
influencing the environmental health risk of exercising in Rio Grande, Brazil.
10.1016/j.compenvurbsys.2014.07.005
Environmental Geochemistry and Health. https://doi.org/10.1007/s10653-021-01060-
Equere, V., Mirzaei, P. A., Riffat, S., & Wang, Y. (2021). Integration of topological aspect
4
of city terrains to predict the spatial distribution of urban heat island using GIS and
Chaturvedi, V., & de Vries, W. T. (2021). Machine learning algorithms for urban land use
ANN. Sustainable Cities and Society, 69. https://doi.org/10.1016/j.scs.2021.102825
planning: A review. Urban Science, 5. https://doi.org/10.3390/urbansci5030068
Fallmann, J., Forkel, R., & Emeis, S. (2016). Secondary effects of urban heat island
Chen, G., Chen, J., Dong, G.h., Yang, B.-y., Liu, Y., Lu, T., et al. (2021). Improving
mitigation measures on air quality. Atmospheric Environment, 125, 199–211. https://
satellite-based estimation of surface ozone across China during 2008–2019 using
doi.org/10.1016/j.atmosenv.2015.10.094
iterative random forest model and high-resolution grid meteorological data.
Gabrie, M. F., Paciência, I., Felgueiras, F., Rufo, C. J., Mendes, F. C., et al. (2021).
Sustainable Cities and Society, 69. https://doi.org/10.1016/j.scs.2021.102807
Environmental quality in primary schools and related health effects in children. An
Chen, G., Li, S., Knibbs, L. D., Hamm, N. A. S., Cao, W., Li, T., et al. (2018). A machine
overview of assessments conducted in the Northern Portugal. Energy and Buildings,
learning method to estimate PM2.5 concentrations across China with remote
250. https://doi.org/10.1016/j.enbuild.2021.111305

16
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Gao, Y., Wang, Z., Li, C. Y., Zheng, T., & Peng, Z.-. R. (2021). Assessing neighborhood Li, G., Bei, N., Cao, J., Wu, J., Long, X., Feng, T., et al. (2017). Widespread and persistent
variations in ozone and PM2.5 concentrations using decision tree method. Building ozone pollution in eastern China during the non-winter season of 2015: Observations
and Environment, 188. https://doi.org/10.1016/j.buildenv.2020.107479 and source attributions. Atmospheric Chemistry and Physics, 17, 2759–2774. https://
Gao, Y., Wang, Z., Liu, C., & Peng, Z. R. (2019). Assessing neighborhood air pollution doi.org/10.5194/acp-17-2759-2017
exposure and its relationship with the urban form. Building and Environment, 155, Li, H., Wu, S., Pan, L., Xu, J., Shan, J., Yang, X., et al. (2018). Short-term effects of
15–24. https://doi.org/10.1016/j.buildenv.2018.12.044 various ozone metrics on cardiopulmonary function in chronic obstructive
Gao, Y., Zhao, J., & Han, L. (2022). Exploring the spatial heterogeneity of urban heat pulmonary disease patients: Results from a panel study in Beijing, China.
island effect and its relationship to block morphology with the geographically Environmental Pollution, 232, 358–366. https://doi.org/10.1016/j.
weighted regression model. Sustainable Cities and Society, 76. https://doi.org/ envpol.2017.09.030
10.1016/j.scs.2021.103431 Li, L., Hart, J. E., Coull, B. A., Cao, S. J., Spengler, J. D., & Adamkiewicz, G. (2019). Effect
Gobakis, K., Kolokotsa, D., Synnefa, A., Saliari, M., Giannopoulou, K., & Santamouris, M. of residential greenness and nearby parks on respiratory and allergic diseases among
(2011). Development of a model for urban heat island prediction using neural middle school adolescents in a Chinese city. International Journal of Environmental
network techniques. Sustainable Cities and Society, 1, 104–115. https://doi.org/ Research and Public Health, 16. https://doi.org/10.3390/ijerph16060991
10.1016/j.scs.2011.05.001 Li, M., Yu, S., Chen, X., Li, Z., Zhang, Y., Wang, L., et al. (2021). Large scale control of
Han, L., Zhao, J., Gao, Y., Gu, Z., Xin, K., & Zhang, J. (2020). Spatial distribution surface ozone by relative humidity observed during warm seasons in China.
characteristics of PM2.5 and PM10 in Xi’an City predicted by land use regression Environmental Chemistry Letters, 19, 3981–3989. https://doi.org/10.1007/s10311-
models. Sustainable Cities and Society, 61, Article 102329. https://doi.org/10.1016/j. 021-01265-0
scs.2020.102329 Liu, C., Henderson, B. H., Wang, D., Yang, X., & Peng, Z. R. (2016). A land use regression
Han, L., Zhao, J., & Gu, Z. (2021). Assessing air quality changes in heavily polluted cities application into assessing spatial variation of intra-urban fine particulate matter
during the COVID-19 pandemic: A case study in Xi’an, China. Sustainable Cities and (PM2.5) and nitrogen dioxide (NO2) concentrations in city of Shanghai, China.
Society, 70. https://doi.org/10.1016/j.scs.2021.102934 Science of the Total Environment, 565, 607–615. https://doi.org/10.1016/j.
He, Y., Wang, H., Wang, H., Xu, X., Li, Y., & Fan, S. (2021). Meteorology and topographic scitotenv.2016.03.189
influences on nocturnal ozone increase during the summertime over Shaoguan, Liu, H., Zhang, M., Han, X., Li, J., & Chen, L. (2019). Episode analysis of regional
China. Atmospheric Environment, 256. https://doi.org/10.1016/j. contributions to tropospheric ozone in Beijing using a regional air quality model.
atmosenv.2021.118459 Atmospheric Environment, 199, 299–312. https://doi.org/10.1016/j.
He, Z., Wang, X., Ling, Z., Zhao, J., Guo, H., Shao, M., et al. (2019). Contributions of atmosenv.2018.11.044
different anthropogenic volatile organic compound sources to ozone formation at a Liu, M., Chen, H., Wei, D., Wu, Y., & Li, C. (2021a). Nonlinear relationship between
receptor site in the Pearl River delta region and its policy implications. Atmospheric urban form and street-level PM2.5 and CO based on mobile measurements and
Chemistry and Physics, 19, 8801–8816. https://doi.org/10.5194/acp-19-8801-2019 gradient boosting decision tree models. Building and Environment, 205. https://doi.
Hsu, C. Y., Zeng, Y. T., Chen, Y. C., Chen, M. J., Lung, S. C., & Wu, C. D. (2020). Kriging- org/10.1016/j.buildenv.2021.108265
based land-use regression models that use machine learning algorithms to estimate Liu, X., Guo, H., Zeng, L., Lyu, X., Wang, Y., Zeren, Y., et al. (2021b). Photochemical
the monthly BTEX concentration. International Journal of Environmental Research and ozone pollution in five Chinese megacities in summer 2018. Science of the Total
Public Health, 17. https://doi.org/10.3390/ijerph17196956 Environment, 801, Article 149603. https://doi.org/10.1016/j.scitotenv.2021.149603
Huang, D., Li, Q., Wang, X., Li, G., Sun, L., He, B., et al. (2018). Characteristics and trends Lundberg, S. M., Erion, G., Chen, H., DeGrave, A., Prutkin, J. M., Nair, B., et al. (2020).
of ambient ozone and nitrogen oxides at urban, suburban, and rural sites from 2011 From local explanations to global understanding with explainable AI for trees. Nature
to 2017 in Shenzhen, China. Sustainability, 10. https://doi.org/10.3390/su10124530 Machine Intelligence, 2, 56–67. https://doi.org/10.1038/s42256-019-0138-9
Huang, H., Yang, H., Chen, Y., Chen, T., Bai, L., & Peng, Z.-. R. (2021). Urban green space Lyu, L., Li, Y., Ou, X., Guo, W., Zhang, Y., Duan, S., et al. (2021). Health effects of
optimization based on a climate health risk appraisal – a case study of Beijing city, occupational exposure to printer emissions on workers in China: Cardiopulmonary
China. Urban Forestry & Urban Greening, 62. https://doi.org/10.1016/j. function change. NanoImpact, 21. https://doi.org/10.1016/j.impact.2020.100289
ufug.2021.127154 Madanian, M., Soffianian, A. R., Soltani Koupai, S., Pourmanafi, S., & Momeni, M.
Jaén, C., Udina, M., & Bech, J. (2021). Analysis of two heat wave driven ozone episodes (2018). The study of thermal pattern changes using landsat-derived land surface
in Barcelona and surrounding region: Meteorological and photochemical modeling. temperature in the central part of Isfahan province. Sustainable Cities and Society, 39,
Atmospheric Environment, 246. https://doi.org/10.1016/j.atmosenv.2020.118037 650–661. https://doi.org/10.1016/j.scs.2018.03.018
Ji, W., Wang, Y., & Zhuang, D. (2019). Spatial distribution differences in PM2.5 Meng, M.-. R., Cao, S.-. J., Kumar, P., Tang, X., & Feng, Z. (2021). Spatial distribution
concentration between heating and non-heating seasons in Beijing, China. characteristics of PM2.5 concentration around residential buildings in urban traffic-
Environmental Pollution, 248, 574–583. https://doi.org/10.1016/j. intensive areas: From the perspectives of health and safety. Safety Science, 141.
envpol.2019.01.002 https://doi.org/10.1016/j.ssci.2021.105318
Jung, J., Uejio, C. K., Adeyeye, T. E., Kintziger, K. W., Duclos, C., Reid, K., et al. (2021). Meng, X., Fu, Q., Ma, Z., Chen, L., Zou, B., Zhang, Y., et al. (2016). Estimating ground-
Using social security number to identify sub-populations vulnerable to the health level PM(10) in a Chinese city by combining satellite data, meteorological
impacts from extreme heat in Florida, U.S. Environmental Research, 202, Article information and a land use regression model. Environmental Pollution, 208, 177–184.
111738. https://doi.org/10.1016/j.envres.2021.111738 https://doi.org/10.1016/j.envpol.2015.09.042
Kalisa, E., Fadlallah, S., Amani, M., Nahayo, L., & Habiyaremye, G. (2018). Temperature Min, M., Lin, C., Duan, X., Jin, Z., & Zhang, L. (2019). Spatial distribution and driving
and air pollution relationship during heatwaves in Birmingham, UK. Sustainable force analysis of urban heat island effect based on raster data: A case study of the
Cities and Society, 43, 111–120. https://doi.org/10.1016/j.scs.2018.08.033 Nanjing metropolitan area, China. Sustainable Cities and Society, 50. https://doi.org/
Ke, X., Men, H., Zhou, T., Li, Z., & Zhu, F. (2021). Variance of the impact of urban green 10.1016/j.scs.2019.101637
space on the urban heat island effect among different urban functional zones: A case Miri, M., Ghassoun, Y., Dovlatabadi, A., Ebrahimnejad, A., & Lowner, M. O. (2019).
study in Wuhan. Urban Forestry & Urban Greening, 62. https://doi.org/10.1016/j. Estimate annual and seasonal PM1, PM2.5 and PM10 concentrations using land use
ufug.2021.127159 regression model. Ecotoxicology and Environmental Safety, 174, 137–145. https://doi.
Khan, A. M., Finlay, J. M., Clarke, P., Sol, K., Melendez, R., Judd, S., et al. (2021). org/10.1016/j.ecoenv.2019.02.070
Association between temperature exposure and cognition: A cross-sectional analysis Niu, Y., Cai, J., Xia, Y., Yu, H., Chen, R., Lin, Z., et al. (2018). Estimation of personal
of 20,687 aging adults in the United States. BMC Public Health, 21, 1484. https://doi. ozone exposure using ambient concentrations and influencing factors. Environment
org/10.1186/s12889-021-11533-x International, 117, 237–242. https://doi.org/10.1016/j.envint.2018.05.017
Kollanus, V., Tiittanen, P., & Lanki, T. (2021). Mortality risk related to heatwaves in Parida, B. R., Bar, S., Kaskaoutis, D., Pandey, A. C., Polade, S. D., & Goswami, S. (2021).
Finland - factors affecting vulnerability. Environmental Research, 201, Article 111503. Impact of COVID-19 induced lockdown on land surface temperature, aerosol, and
https://doi.org/10.1016/j.envres.2021.111503 urban heat in Europe and North America. Sustainable Cities and Society, 75, Article
Kumar, P., Hama, S., Abbass, R. A., Nogueira, T., Brand, V. S., Abhijith, K. V., et al. 103336. https://doi.org/10.1016/j.scs.2021.103336
(2021a). Potential health risks due to in-car aerosol exposure across ten global cities. Parsa, A. B., Movahedi, A., Taghipour, H., Derrible, S., & Mohammadian, A. K. (2020).
Environment International, 155, Article 106688. https://doi.org/10.1016/j. Toward safer highways, application of XGBoost and SHAP for real-time accident
envint.2021.106688 detection and feature analysis. Accident; Analysis and Prevention, 136, Article 105405.
Kumar, P., Hama, S., Nogueira, T., Abbass, R. A., Brand, V. S., Andrade, M. F., et al. https://doi.org/10.1016/j.aap.2019.105405
(2021b). In-car particulate matter exposure across ten global cities. Science of the Pascal, M., Goria, S., Wagner, V., Sabastia, M., Guillet, A., Cordeau, E., et al. (2021).
Total Environment, 750, Article 141395. https://doi.org/10.1016/j. Greening is a promising but likely insufficient adaptation strategy to limit the health
scitotenv.2020.141395 impacts of extreme heat. Environment International, 151, Article 106441. https://doi.
Lary, D. J., Alavi, A. H., Gandomi, A. H., & Walker, A. L. (2016). Machine learning in org/10.1016/j.envint.2021.106441
geosciences and remote sensing. Geoscience Frontiers, 7, 3–10. https://doi.org/ Patz, J. A., Campbell-Lendrum, D., Holloway, T., & Foley, J. A. (2005). Impact of regional
10.1016/j.gsf.2015.07.003 climate change on human health. Nature, 438, 310–317. https://doi.org/10.1038/
Lee, K. J., Kahng, H., Kim, S. B., & Park, S. K. (2018). Improving environmental nature04188
sustainability by characterizing spatial and temporal concentrations of ozone. Potard, K., Monard, C., Le Garrec, J.-. L., Caudal, J.-. P., Le Bris, N., & Binet, F. (2017).
Sustainability, 10. https://doi.org/10.3390/su10124551 Organic amendment practices as possible drivers of biogenic volatile organic
Lee, M., Brauer, M., Wong, P., Tang, R., Tsui, T. H., Choi, C., et al. (2017). Land use compounds emitted by soils in agrosystems. Agriculture, Ecosystems & Environment,
regression modeling of air pollution in high density high rise cities: A case study in 250, 25–36. https://doi.org/10.1016/j.agee.2017.09.007
Hong Kong. Science of the Total Environment, 592, 306–315. https://doi.org/ Qin, M., Hu, A., Mao, J., Li, X., Sheng, L., Sun, J., et al. (2021a). PM2.5 and O3
10.1016/j.scitotenv.2017.03.094 relationships affected by the atmospheric oxidizing capacity in the Yangtze River
Lee, M., Wi, J., Koziel, J. A., Ahn, H., Li, P., Chen, B., et al. (2020). Effects of UV-A light Delta, China. Science of the Total Environment, 810, Article 152268. https://doi.org/
treatment on ammonia, hydrogen sulfide, greenhouse gases, and ozone in simulated 10.1016/j.scitotenv.2021.152268
poultry barn conditions. Atmosphere, 11. https://doi.org/10.3390/atmos11030283

17
L. Han et al. Sustainable Cities and Society 78 (2022) 103643

Qin, P., Luo, X., Zeng, Y., Zhang, Y., Li, Y., Wu, Y., et al. (2021b). Long-term association Wai, K.-. M., Tan, T. Z., Morakinyo, T. E., Chan, T.-. C., & Lai, A. (2020). Reduced
of ambient air pollution and hypertension in adults and in children: A systematic effectiveness of tree planting on micro-climate cooling due to ozone pollution—a
review and meta-analysis. Science of the Total Environment, 796, Article 148620. modeling study. Sustainable Cities and Society, 52. https://doi.org/10.1016/j.
https://doi.org/10.1016/j.scitotenv.2021.148620 scs.2019.101803
Qiu, H., Chuang, K. J., Bai, C. H., Fan, Y. C., Chang, T. P., Yim, S. H., et al. (2021). Wang, C., Lin, J., Niu, Y., Wang, W., Wen, J., Lv, L., et al. (2022). Impact of ozone
Association of ambient ozone with pneumonia hospital admissions in Hong Kong and exposure on heart rate variability and stress hormones: A randomized-crossover
Taipei: A tale of two Southeast Asian cities. Environment International, 156, Article study. Journal of Hazardous Materials, 421, Article 126750. https://doi.org/10.1016/
106634. https://doi.org/10.1016/j.envint.2021.106634 j.jhazmat.2021.126750
Rebekić, A., Lončarić, Z., Petrović, S., & Marić, S. (2015). Pearson’s or spearman’s Wang, J., Cao, S.-. J., & Yu, C. W. (2020a). Development trend and challenges of
correlation coefficient - which one to Use? Poljoprivreda, 21, 47–54. https://doi.org/ sustainable urban design in the digital age. Indoor and Built Environment, 30, 3–6.
10.18047/poljo.21.2.8 https://doi.org/10.1177/1420326x20976058
Ren, T., Zhou, W., & Wang, J. (2021). Beyond intensity of urban heat island effect: A Wang, P., Wang, T., & Ying, Q. (2020b). Regional source apportionment of summertime
continental scale analysis on land surface temperature in major Chinese cities. ozone and its precursors in the megacities of Beijing and Shanghai using a source-
Science of the Total Environment, 791, Article 148334. https://doi.org/10.1016/j. oriented chemical transport model. Atmospheric Environment, 224. https://doi.org/
scitotenv.2021.148334 10.1016/j.atmosenv.2020.117337
Ren, X., Mi, Z., & Georgopoulos, P. G. (2020). Comparison of machine learning and land Wang, X., Liu, H., Miao, S., Wu, Q., Zhang, N., & Qiao, F. (2020c). Effectiveness of urban
use regression for fine scale spatiotemporal estimation of ambient air pollution: hydrological processes in mitigating urban heat island and human thermal stress
Modeling ozone concentrations across the contiguous United States. Environment during a heat wave event in Nanjing, China. Journal of Geophysical Research:
International, 142, Article 105827. https://doi.org/10.1016/j.envint.2020.105827 Atmospheres, 125. https://doi.org/10.1029/2020jd033275
Ren, Y., Ge, Y., Ma, D., Song, X., Shi, Y., Pan, K., et al. (2017a). Enhancing plant diversity Wang, Y., & Akbari, H. (2016). Analysis of urban heat island phenomenon and mitigation
and mitigating BVOC emissions of urban green spaces through the introduction of solutions evaluation for Montreal. Sustainable Cities and Society, 26, 438–446.
ornamental tree species. Urban Forestry & Urban Greening, 27, 305–313. https://doi. https://doi.org/10.1016/j.scs.2016.04.015
org/10.1016/j.ufug.2017.08.011 Wang, Y., Du, H., Xu, Y., Lu, D., Wang, X., & Guo, Z. (2018). Temporal and spatial
Ren, Y., Qu, Z., Du, Y., Xu, R., Ma, D., Yang, G., et al. (2017b). Air quality and health variation relationship and influence factors on surface urban heat island and ozone
effects of biogenic volatile organic compounds emissions from urban green spaces pollution in the Yangtze River Delta, China. Science of the Total Environment, 631-
and the mitigation strategies. Environmental Pollution, 230, 849–861. https://doi. 632, 921–933. https://doi.org/10.1016/j.scitotenv.2018.03.050
org/10.1016/j.envpol.2017.06.049 Weichenthal, S., Ryswyk, K. V., Goldstein, A., Bagg, S., Shekkarizfard, M., &
Rodriguez-Perez, R., & Bajorath, J. (2020). Interpretation of machine learning models Hatzopoulou, M. (2016). A land use regression model for ambient ultrafine particles
using shapley values: Application to compound potency and multi-target activity in Montreal, Canada: A comparison of linear regression and a machine learning
predictions. Journal of Computer Aided Molecular Design, 34, 1013–1026. https://doi. approach. Environmental Research, 146, 65–72. https://doi.org/10.1016/j.
org/10.1007/s10822-020-00314-0 envres.2015.12.016
Romshoo, S. A., Bhat, M. A., & Beig, G. (2021). Particulate pollution over an urban Weinberger, K. R., Wu, X., Sun, S., Spangler, K. R., Nori-Sarma, A., Schwartz, J., et al.
Himalayan site: Temporal variability, impact of meteorology and potential source (2021). Heat warnings, mortality, and hospital admissions among older adults in the
regions. Science of the Total Environment, 799, Article 149364. https://doi.org/ United States. Environment International, 157, Article 106834. https://doi.org/
10.1016/j.scitotenv.2021.149364 10.1016/j.envint.2021.106834
Scarpone, C., Brinkmann, S. T., Grosse, T., Sonnenwald, D., Fuchs, M., & Walker, B. B. Wen, Y., Wu, R., Zhou, Z., Zhang, S., Yang, S., Wallington, T. J., et al. (2022). A data-
(2020). A multimethod approach for county-scale geospatial analysis of emerging driven method of traffic emissions mapping with land use random forest models.
infectious diseases: A cross-sectional case study of COVID-19 incidence in Germany. Applied Energy, 305. https://doi.org/10.1016/j.apenergy.2021.117916
International Journal of Health Geographics, 19, 32. https://doi.org/10.1186/s12942- Wong, P. Y., Lee, H. Y., Chen, Y. C., Zeng, Y. T., Chern, Y. R., Chen, N. T., et al. (2021).
020-00225-1 Using a land use regression model with machine learning to estimate ground level
Shi, Y., Katzschner, L., & Ng, E. (2018). Modeling the fine-scale spatiotemporal pattern of PM2.5. Environmental Pollution, 277, Article 116846. https://doi.org/10.1016/j.
urban heat island effect using land use regression approach in a megacity. Science of envpol.2021.116846
the Total Environment, 618, 891–904. https://doi.org/10.1016/j. Xi, C., Ren, C., Wang, J., Feng, Z., & Cao, S.-. J. (2021). Impacts of urban-scale building
scitotenv.2017.08.252 height diversity on urban climates: A case study of Nanjing, China. Energy and
Shi, Y., Ren, C., Cai, M., Lau, K. K., Lee, T. C., & Wong, W. K. (2019). Assessing spatial Buildings, 251. https://doi.org/10.1016/j.enbuild.2021.111350
variability of extreme hot weather conditions in Hong Kong: A land use regression Xiao, X. D., Dong, L., Yan, H., Yang, N., & Xiong, Y. (2018). The influence of the spatial
approach. Environmental Research, 171, 403–415. https://doi.org/10.1016/j. characteristics of urban green space on the urban heat island effect in Suzhou
envres.2019.01.041 Industrial Park. Sustainable Cities and Society, 40, 428–439. https://doi.org/10.1016/
Stafoggia, M., Bellander, T., Bucci, S., Davoli, M., de Hoogh, K., De’ Donato, F., et al. j.scs.2018.04.002
(2019). Estimation of daily PM10 and PM2.5 concentrations in Italy, 2013–2015, Xie, M., Zhu, K., Wang, T., Chen, P., Han, Y., Li, S., et al. (2016). Temporal
using a spatiotemporal land-use random-forest model. Environment International, characterization and regional contribution to O3 and NOx at an urban and a
124, 170–179. https://doi.org/10.1016/j.envint.2019.01.016 suburban site in Nanjing, China. Science of the Total Environment, 551-552, 533–545.
Sun, H., Fung, J. C. H., Chen, Y., Chen, W., Li, Z., Huang, Y., et al. (2021). Improvement https://doi.org/10.1016/j.scitotenv.2016.02.047
of PM2.5 and O3 forecasting by integration of 3D numerical simulation with deep Yang, C., Zhu, W., Sun, J., Xu, X., Wang, R., Lu, Y., et al. (2021a). Assessing the effects of
learning techniques. Sustainable Cities and Society, 75. https://doi.org/10.1016/j. 2D/3D urban morphology on the 3D urban thermal environment by using multi-
scs.2021.103372 source remote sensing data and UAV measurements: A case study of the snow-
Sussman, H. S., Dai, A., & Roundy, P. E. (2021). The controlling factors of urban heat in climate city of Changchun, China. Journal of Cleaner Production, 321. https://doi.
Bengaluru, India. Urban Climate, 38. https://doi.org/10.1016/j.uclim.2021.100881 org/10.1016/j.jclepro.2021.128956
Taha, H. (2015). Meteorological, air-quality, and emission-equivalence impacts of urban Yang, Z., Freni-Sterrantino, A., Fuller, G. W., & Gulliver, J. (2020). Development and
heat island control in California. Sustainable Cities and Society, 19, 207–221. https:// transferability of ultrafine particle land use regression models in London. Science of
doi.org/10.1016/j.scs.2015.03.009 the Total Environment, 740, Article 140059. https://doi.org/10.1016/j.
Tan, Z., Lu, K., Dong, H., Hu, M., Li, X., Liu, Y., et al. (2018). Explicit diagnosis of the scitotenv.2020.140059
local ozone production rate and the ozone-NOx-VOC sensitivities. Science Bulletin, Yang, Z., Yang, J., Zhou, M., Yin, P., Chen, Z., Zhao, Q., et al. (2021b). Hourly
63, 1067–1076. https://doi.org/10.1016/j.scib.2018.07.001 temperature variability and mortality in 31 major Chinese cities: Effect modification
To, T., Zhu, J., Terebessy, E., Zhang, K., Fong, I., Pinault, L., et al. (2021). Does exposure by individual characteristics, season and temperature zone. Environment
to air pollution increase the risk of acute care in young children with asthma? An International, 156, Article 106746. https://doi.org/10.1016/j.envint.2021.106746
Ontario, Canada study. Environmental Research, 199, Article 111302. https://doi. Yu, S. (2018). Fog geoengineering to abate local ozone pollution at ground level by
org/10.1016/j.envres.2021.111302 enhancing air moisture. Environmental Chemistry Letters, 17, 565–580. https://doi.
Tsanas, A., & Xifara, A. (2012). Accurate quantitative estimation of energy performance org/10.1007/s10311-018-0809-5
of residential buildings using statistical machine learning tools. Energy and Buildings, Zhao, X., Lovreglio, R., & Nilsson, D. (2020). Modeling and interpreting pre-evacuation
49, 560–567. https://doi.org/10.1016/j.enbuild.2012.03.003 decision-making using machine learning. Automation in Construction, 113. https://
Urban, A., Kysely, J., Plavcova, E., Hanzlikova, H., & Stepanek, P. (2020). Temporal doi.org/10.1016/j.autcon.2020.103140
changes in years of life lost associated with heat waves in the Czech Republic. Science Zhu, B., Kang, H., Zhu, T., Su, J., Hou, X., & Gao, J. (2015). Impact of Shanghai urban
of the Total Environment, 716, Article 137093. https://doi.org/10.1016/j. land surface forcing on downstream city ozone chemistry. Journal of Geophysical
scitotenv.2020.137093 Research: Atmospheres, 120, 4340–4351. https://doi.org/10.1002/2014jd022859
Van Nguyen, T., Zhou, L., Chong, A. Y. L., Li, B., & Pu, X. (2020). Predicting customer Zhu, J., Chen, L., Liao, H., & Dang, R. (2019). Correlations between PM2.5 and Ozone
demand for remanufactured products: A data-mining approach. European Journal of over China and associated underlying reasons. Atmosphere, 10. https://doi.org/
Operational Research, 281, 543–558. https://doi.org/10.1016/j.ejor.2019.08.015 10.3390/atmos10070352
Vega García, M., & Aznarte, J. L. (2020). Shapley additive explanations for NO2
forecasting. Ecological Informatics, 56. https://doi.org/10.1016/j.
ecoinf.2019.101039

18

You might also like