You are on page 1of 8

RESEARCH LETTER Uncovering the Large-Scale Meteorology That Drives

10.1029/2021GL096684
Continental, Shallow, Green Cumulus Through Supervised
Key Points:
• W e use machine-learning
Classification
classification based on visual Tom Dror1 , Vered Silverman1 , Orit Altaratz1 , Mickaël D. Chekroun1 , and Ilan Koren1
inspection to identify green cumulus
cloud fields 1
Department of Earth and Planetary Sciences, Weizmann Institute of Science, Rehovot, Israel
• We establish a novel satellite-based
data set of high-resolution green
cumulus images over three areas
worldwide Abstract One of the major sources of uncertainty in climate prediction results from the limitations in
• We show that green cumulus representing shallow cumulus (Cu) in models. Recently, a class of continental shallow convective Cu was
are driven by similar large-scale shown to share distinct morphological properties and to emerge globally mostly over forests and vegetated
meteorology regardless of their
geographical location areas, thus named greenCu. Using machine-learning supervised classification, we identify greenCu fields over
three regions, from the tropics to mid- and higher-latitudes, and establish a novel satellite-based data set called
greenCuDb, consisting of 1° × 1° sized, high-resolution MODIS images. Using greenCuDb in conjunction with
Supporting Information:
ERA5 reanalysis data, we create greenCu composites for different regions and reveal that greenCu are driven by
Supporting Information may be found in
the online version of this article. similar large-scale meteorological conditions, regardless of their geographical locations throughout the world's
continents. These conditions include distinct profiles of temperature, humidity and large-scale vertical velocity.
Correspondence to: The boundary layer is anomalously warm and moderately humid, and is accompanied by a strong large-scale
I. Koren, subsidence in the free troposphere.
ilan.koren@weizmann.ac.il
Plain Language Summary It has been recently shown that a large part of low-altitude clouds over
Citation: the world's continents share universal properties, especially in the way these clouds are organized in time and
Dror, T., Silverman, V., Altaratz, O., space, and their preferred formation over forests and vegetated areas, hence named greenCu. In this study, by
Chekroun, M. D., & Koren, I. (2022). exploiting machine-learning classification tools, we probe over 450,000 high-resolution satellite images, over
Uncovering the large-scale meteorology
that drives continental, shallow,
different land areas and climatic conditions, spanning a period of 10 years, to produce a novel data set of about
green cumulus through supervised 90,000 images of such greenCu. The physical insight gained by this operation is that greenCu form under
classification. Geophysical Research similar meteorological conditions irrespectively of their geographical locations and their climatic conditions,
Letters, 49, e2021GL096684. https://doi.
org/10.1029/2021GL096684
throughout the globe.

Received 21 OCT 2021


Accepted 8 APR 2022 1. Introduction
Shallow convective cumulus (Cu) clouds, often referred to as fair-weather clouds, are ubiquitous over the world's
oceans and continents (Bony et al., 2004; Norris, 1998). These clouds impose a net cooling effect at the surface,
since they reflect part of the incoming solar radiation while having far less influence on the outgoing long-
wave radiation (Boucher et al., 2013). Despite extensive research, shallow Cu still constitute an important factor
in the uncertainty related to climate sensitivity and cloud feedback (Bony, 2005; Webb et al., 2006; Zelinka
et al., 2020), with their space-time organization and environmental conditions playing a potential major role
regarding the latter (Bony et al., 2020; Nuijens & Siebesma, 2019; Vial et al., 2017).

There exists an extensive amount of literature on marine shallow Cu, that is, trade Cu, which are mostly confined
to the tropical and subtropical oceans. In this marine environment, sea surface temperatures are relatively warm
and stable, and moderate trade winds and large-scale subsidence in the free troposphere (FT) prevail (Stevens
et al., 2016). These conditions over the ocean exert a relatively weak and steady forcing which promotes shal-
low convection. In contrast, continental shallow Cu, are by far less studied. These clouds prevail in much more
diverse locations over the world's continents. Continental shallow Cu experience a stronger and time-dependent
© 2022. The Authors. forcing stemming from the more significant diurnal cycle over land. They form at late morning, peak in the
This is an open access article under early afternoon, and dissipate before sunset, coinciding with the diurnal cycle of surface fluxes and convective
the terms of the Creative Commons
Attribution-NonCommercial-NoDerivs boundary layer (CBL) development (Berg & Kassianov, 2008; Lenderink et al., 2004). Recently, a large subset
License, which permits use and of continental shallow Cu was shown to look similar, regardless of the clouds' locations around the world, and to
distribution in any medium, provided the share many cloud field properties, for example, cloud size distribution, cloud fraction (CF), organization patterns,
original work is properly cited, the use is
non-commercial and no modifications or and their tendency to form over forested and vegetated regions, thus termed greenCu (Dror et al. (2020), here-
adaptations are made. after D2020). Despite their small sizes (∼1 km, Lamer and Kollias (2015)) and short lifetimes (∼30 min, Jiang

DROR ET AL. 1 of 8
Geophysical Research Letters 10.1029/2021GL096684

et al. (2006)), greenCu form highly organized mesoscale-sized patterns that sustain throughout the day (Dror,
Chekroun, et al., 2021). Stable on one hand, these clouds show high-sensitivity to local conditions such as land-
cover type and topography (Da Silva et al., 2011; Rabin et al., 1990), and also to mesoscale conditions, as shown
for example, for the presence of smoke over the Amazon (Koren et al., 2004).

Yet, an understanding of the large-scale meteorological conditions that allow greenCu to prevail over a wide vari-
ety of geographical environments, from the tropics through mid- and higher-latitudes, is still missing.

To address this question, by exploiting machine-learning (ML) classification tools, we are able to probe over
450,000 high-resolution satellite images, over different land areas (the Amazon, central USA and Eastern Europe)
and climatic conditions, spanning a period of 10 years. We produce a novel data set of ∼90,000 greenCu images,
named greenCuDb (Dror, Silverman, et al., 2021) extracted during the northern hemisphere (NH) summer
(June–July–August [JJA]), which is the dry season in the tropics, when shallow convection prevails. To train the
neural-network (NN) model, we benefit from the visual identification of greenCu patterns operated in D2020
for about 12,000 labeled cloud field images, and analyzed over a much shorter 2-month time-period than in the
present work. As a main result of this study, the inspection of the meteorological and environmental conditions
from ERA5 reanalysis (Hersbach et al., 2020) of the large greenCuDb data set reveals that the greenCu fields
share very similar characteristics irrespectively of their geographical location.

2. Cloud Fields Classification


The need to obtain large data sets of high-resolution, labeled cloud images is a key initial step in building-up
knowledge about the different clouds' properties. With the recent advances of ML methods for image analysis
and classification (e.g., Ker et al. (2017); Kremer et al. (2017)) and dedicated platforms for the underlying algo-
rithms, such clouds' data sets have been produced either from ground (Dev et al., 2015; J. Zhang et al., 2018)
or satellite observations. Thus, various ML-labeled satellite data sets have been obtained for different purposes:
CUMULO (Zantedeschi et al., 2019) for learning cloud classes; Shallow Cloud (Rasp et al., 2020) to explore
mesoscale organization of shallow clouds in the trades; CloudCast (Nielsen et al., 2021) for clouds' forecasting,
and LSCIDMR (Bai et al., 2021) to infer weather systems from clouds. Each of these data sets differ in the clouds
properties and goals they are after, and thus differ in their spatio-temporal resolutions, the spectral bands of the
images, the labels they annotate, and the area and time periods they span. The present study prolongs these recent
efforts for clouds classification by producing a novel ML satellite data set, greenCuDb, which targets specifically
shallow, organized convection over land, from high-resolution MODIS images.

2.1. Training Data Set


The training stage of a ML model for supervised classification heavily relies on the availability of expertly
labeled data, which in turn plays a key role to the model's performance. However, such labeled data sets are
not always available. Here we benefit from such a data set to train our NN model, namely, the visually-labeled
images presented in D2020, based on high-resolution (500 m) MODIS Aqua true-color (RGB) images. The
Aqua satellite crosses the equator at ∼13:30 local solar time, an ideal time to capture the peak of the evolution
of continental shallow convection, since the surface fluxes and many of the CBL processes are maximal at that
time (Brown et al., 2002; Y. Zhang & Klein, 2013). The images were taken over three 14° × 14° continental
regions, which represent different climatic environments, including the Amazon in the tropics (17°S−3°S and
63°W − 49°W), through the USA in the mid-latitudes (30°N−44°N and 96°W−82°W), to Europe in higher-lati-
tudes (46°N−60°N and 27°E−41°E). All regions feature shallow convection during JJA, and constitute greenCu
hotspots. The 14° × 14° sized images were divided to smaller 1° × 1° domains, each comprised of 228 × 228 × 3
pixels, and visually classified to four different classes determined based on human expertise: (a) sparse (no clouds
to a few greenCu), (b) greenCu (organized shallow Cu), (c) transition from shallow to deep convection (deeper
and more clustered greenCu, hereafter transition), and (d) deep convective (large clouds that cover most or all of
the domain, hereafter deep). Figure 1 shows a classic 1° × 1° sized image of each class. Note that other classes
of clouds, such as shallow stratiform or cirrus, are not common over these regions during this season. In case of
images that contain stratus, visible cirrus or smoke, the model is designed such that they will be labeled as deep
convection. See for example, the low CFs shown for the deep class (especially over the Amazon), stemming from

DROR ET AL. 2 of 8
Geophysical Research Letters 10.1029/2021GL096684

smokey images (Figure 1). The visual classification was performed over two
months, July–August 2008, and resulted into a total of 12,146 labeled cloud
field images (D2020): 2,762 sparse, 2,142 greenCu, 2,446 transition, and
4,796 deep convective images.

2.2. Classification Model

The ResNet-34 model (He et al., 2016b) that uses 34 layers of convolutional
NN, widely used for image classification purposes (He et al., 2016a), is
trained on the data set obtained from human labeling in D2020 to perform
classification over a larger data set of satellite images. The model encoder
applies a set of convolution operations (Conv2D) and nonlinear activation
functions (ReLU) which are adjusted during the training to detect the mean-
ingful features for a correct classification. For this task, the ResNet model
implementation from the fastai python package (Howard & Gugger, 2020) is
used with kernel size of 3 × 3 on images of 228 × 228 × 3 pixels, initialized
with random weights and biases. To balance the data, the training data set is
Figure 1. Model predictions: boxplot of cloud fraction (CF) per class and
given by 1,500 randomly and uniformly selected labeled images within each
region (Amazon, USA, and Eastern Europe). In each box, the circle marks
the median, the bottom and top edges of the box indicate the 25th and 75th class, out of which 20% are kept for validation. The trained model predicts
percentiles, respectively. The whiskers extend to the most extreme data points greenCu with 90.0% accuracy (acc), and the recall (R), precision (P) and
not considered outliers. Dashed horizontal lines indicate the mean 25th and F1-measure (F1) are 90.0%, 73.6% and 81.1%, respectively (see Support-
75th percentiles of greenCu CF used as lower and upper thresholds to filter the ing Information S1 for more details), which is suitable for the purposes of
greenCu post-processing. Bottom row shows, for each class, a representative
this study. For comparison, when the model was trained with the full data
1° × 1° image.
set from D2020: acc = 91.5%, R = 92.4%, P = 68.5% and F1 = 78.7%. To
assess the model's predictions we compare morphological properties of the
model-labeled and the visually-labeled images. The two datasets show good
agreement and the four classes are consistent in both (Figure S1 in Supporting Information S1). The shift between
the different classes is not sharp and can be ambiguous. For example, a sparse field can essentially be a forming
or dissipating greenCu field, and a transition field may be deeper and/or clustered greenCu (D2020). Also, some
images may have a mixed distribution of different classes. Therefore, confusions between (especially) the tran-
sition to greenCu are expected (Figure S2 in Supporting Information S1) and accounted for in the filtering steps
described in Section 2.3.

2.3. The GreenCuDb Data Set

The classification task resulted in a total of 455,252 images, of which 21.65% (98,582) were labeled as sparse,
18.51% (84,282) as greenCu, 20.18% (91,851) as transition and 39.66% (180,537) as deep. Note that even-though
the model was trained with a balanced data set, it is still able to reproduce the imbalance that exists between
the classes of the visual classification (e.g., the deep class contains about twice the amount of images as each
of the other classes, both in the model-labeled and in the visual data sets). To further improve our confidence
in the model's attribution of a given cloud field to a specific class, we calculate the CF for the classified images
using a cloud mask constructed specifically to detect greenCu (D2020). Figure 1 shows a monotonic increase in
CF for the four classes. This increase is expected since the different classes essentially represent a continuous
shift from almost clear sky (sparse) to an overcast state (deep). While there exist a natural spread in the distri-
butions of CF for both classes and regions, the medians of each distribution are similar for all regions within
the same class, and are well-separated from the other classes, proving that the model successfully discriminates
between the classes (see Figure S1 in Supporting Information S1 for other morphological properties).

Focusing on the greenCu, Figures 2a–2c show examples of such fields for each region. To avoid model false-pos-
itive predictions (i.e., images wrongfully labeled as greenCu), we use the mean of the 25th (0.136) and 75th
(0.291) percentiles of greenCu CF as the upper and lower limits to clean the greenCu data set (green shading
and dotted gray lines in Figure 1). While this filtering affects greenCu CF distributions, it has a negligible
effect on the distributions of other morphological and meteorological variables (Figures S1 and S3 in Supporting

DROR ET AL. 3 of 8
Geophysical Research Letters 10.1029/2021GL096684

D  $PD]RQ E 86$ F ((XURSH

G  H I

Figure 2. The greenCu share similar organizational patterns.Examples of 1° × 1° greenCu fields over the (a) Amazon, (b) USA, and (c) Eastern Europe. Observed
Nearest Neighbor Cumulative Density Function (NNCDF) against Poisson NNCDF, and a boxplot of Iorg for the (d) Amazon, (e) USA, and (f) Eastern Europe.

Information S1). Thanks to this strategy, we eliminate approximately 50% of the images but increase the reliabil-
ity of the data set and thus, of the following analyses.

Finally, the new greenCuDb combines high-resolution imagery from three regions worldwide, during 10 JJAs
(2003–2012). It consists 42,128 images, of which 15,727 are from the Amazon, 14,300 from the USA and 12,101
from Eastern Europe, scaling-up the human analysis of D2020 by a factor of ∼20.

As evidenced from the images shown in Figures 2a–2c, greenCu throughout the world appear strikingly similar.
Furthermore, D2020 have shown that beyond this similarity, these fields share many common properties, for
example, their size distributions, the number of clouds in the field and the distances between them, and CF and
cloud top height (CTH). However, it is the distinct and unique organization of greenCu that distinguishes these
fields from the other cloud classes. Indeed, greenCu tend to organize into a regular pattern, taking often the shape
of cloud streets. To quantify and characterize greenCu organization, we use the commonly applied organization
index (Iorg; Weger et al. (1992); Tobin et al. (2012)), which compares the cloud field Nearest Neighbor Cumu-
lative Density Function (NNCDF) to that of a randomly distributed cloud field (Poisson NNCDF, given by the
Weibull distribution). Figures 2d–2f show the mean observed NNCDF against Poisson NNCDF for the greenCu
in the different regions. And a boxplot of Iorg (the area under the NNCDF curve) for each region is shown to the
right (see boxplot of Iorg for all other classes in Figure S1 in Supporting Information S1). The mean NNCDF
curves are similar for all regions and show that greenCu fields deviate from randomness toward a regular (grid-
like) organizational pattern, with medians of Iorg = 0.418, 0.426, 0.424 in the Amazon, USA and Eastern Europe,
respectively. This standard metric to measure clouds' organization against a random field (Weger et al., 1992)
provides further evidence regarding the ability of the trained ML model to predict greenCu patterns. For a more
general discussion regarding probabilistic metrics used in atmosphere science see Bröcker and Smith (2007).

DROR ET AL. 4 of 8
Geophysical Research Letters 10.1029/2021GL096684

Figure 3. Large-scale meteorological conditions associated with the different classes (upper-row), greenCu composites per
region (middle-row), and greenCu normalized anomalies from the June–July–August climatological mean (lower-row) in the
Amazon (black), USA (blue), and Eastern Europe (red). Mean vertical profiles of θ, q, RH, and ω (column-wise, from left to
right, respectively). The standard deviations are shown in Figure S4 in Supporting Information S1.

3. Dependence of GreenCu on the Large-Scale Meteorological Conditions


Next, we use the novel greenCuDb to gain insight on the large-scale meteorological conditions associated with
greenCu formation, and to understand how similar cloud fields are formed in such different geographical envi-
ronments. To do so, we create composites for each class by assigning to each image the corresponding ERA5
reanalysis hourly products (Hersbach et al., 2020). We examine the mean vertical profiles of potential tempera-
ture (θ), specific humidity (q), relative humidity (RH), and the large-scale vertical velocity (ω) of the different
classes, the greenCu composites of each region, and the normalized deviations of the greenCu composites from
the corresponding JJA climatological mean (Figure 3). The standard deviations of the mean profiles, and the
climatologies are shown in Figures S4 and S5 in Supporting Information S1, respectively. The composites of the
different classes show consistent behavior in terms of all inspected atmospheric profiles. The different classes
are controlled by the interplay between the temperature, humidity and the large-scale vertical velocity profiles.
Going from the sparse to the deep class, we smoothly shift from a warmer and drier atmosphere, dominated by
subsidence, to a colder and moister atmosphere at which the subsidence dominates at higher altitudes (greenCu
and transition) or is non-existent (deep).

Focusing on the greenCu composites, we compare their characteristic profiles throughout the different regions
(Figures 3e–3h). Although the θ- and q-values differ between the regions, the greenCu RH profiles are remark-
ably similar. In terms of the large-scale vertical velocity, all greenCu composites feature weak updraft near the
surface, overlaid by a large-scale subsidence, but the absolute values and the heights at which ω changes its sign
differ. In the Amazon, the large-scale updraft as well as the overlaying subsidence share moderate values. In the

DROR ET AL. 5 of 8
Geophysical Research Letters 10.1029/2021GL096684

USA and Eastern Europe, the updraft is weaker and confined into the lower CBL, near the surface, and turns into
a weak to moderate subsidence throughout the CBL, to finally adjust to a strong subsidence in the FT (altitudes
above 800 mb). The CBL depth (seen here as the height at which RH values sharply decrease) is shown to be
greater for the Amazon than for the other regions, in agreement with D2020. The authors of D2020 showed that
greenCu CTH in the Amazon is larger compared to the USA and Eastern Europe (Figure 5 in D2020).

Although the Amazon, USA, and Eastern Europe regions belong to different climate zones (Köppen, 1923),
and generally exhibit different large-scale circulations, the climatologies of the three regions reveal that during
JJA these environments share stable large-scale conditions, such as warm, humid CBLs (Figure S5 in Support-
ing Information S1). These stable conditions result in the Amazon from the prevalence of the South Atlantic
Subtropical High (Nobre et al., 1998) as the Intertropical Convergence Zone reaches its northward displacement
(∼10°) in JJA. In the USA and Eastern Europe, it is related to the weakening of the extratropical storm track over
mid-latitudes and its migration poleward during NH summer (Parker et al., 1989; Whittaker & Horn, 1981).

To gain insight on the key players discriminating each greenCu composite from the local climatology, we calcu-
late the variable's normalized deviations from their corresponding JJA climatologies by subtracting the clima-
tological mean and dividing by the climatological standard-deviation (Figures 3i–3l). Since greenCu are more
frequent over the Amazon during JJA (D2020), larger anomalies are expected in the USA and Eastern Europe.
The anomalies indicate that the large-scale conditions favoring greenCu formation are obtained by (a) a warmer
CBL (near the surface) and FT compared to the climatologies (Figure 3i). (b) A drier profile (negative anom-
alies), especially near the surface and in the FT in the USA and Eastern Europe, and a moister CBL (positive
anomalies), and drier FT in the Amazon (Figures 3j and 3k). And (c) an increase in the magnitude of the large-
scale subsidence relative to the climatologies, mostly pronounced in the FT (δω > 0 in all regions, becoming
larger as going from the Amazon to the USA to Eastern Europe, Figure 3l). The drier FT and stronger subsidence
(compared to the climatology) in all regions inhibit greenCu's development into deeper clouds such as those of
the transition and/or deep class.

4. Summary and Discussion


By means of visual classification, continental, shallow clouds were shown to share many common properties:
their sizes, CF, the patterns they form, and their preferred formation over forests and vegetated area, named
greenCu (D2020). These results allow us to: (a) scale-up the human analysis of D2020 by producing a novel
machine-learned data set of greenCu images, and this way (b) characterize the large-scale meteorological condi-
tions that drive greenCu and their very existence in such diverse environments throughout the world's continents.

We use morphological properties such as CF as metrics to evaluate the model's predictions and show that it is
able to successfully discriminate between four consistent classes: sparse, greenCu, transition to deep convection,
and deep convection (Figures 1 and S1 in Supporting Information S1). We then restrict greenCu CF, and estab-
lish the new greenCuDb. It contains 42,128 high-resolution greenCu images over three regions from the tropics
(Amazon), to mid-latitude (USA), and higher-latitudes (Eastern Europe), over a period of 10 seasons (JJAs,
2003–2012). One of greenCu's most prominent features is their distinct cloud field organization. It is shown here
to be similar throughout the regions and to exhibit a regular (grid-like) structure (Figure 2), in agreement with
Dror et al. (2020), Dror, Chekroun, et al. (2021).

Using greenCuDb together with ERA5 reanalysis we show that greenCu profiles of potential temperature, specific
humidity and RH and large-scale vertical velocity are distinguished from those of the other classes, and that
greenCu form under similar meteorological conditions regardless of their geographical location. These conditions
include a distinct combination of temperature, humidity and large-scale vertical velocity throughout the CBL and
the FT: the CBL is anomalously warm and just moist enough to allow cloud formation. It is capped by large-
scale subsidence throughout the FT which inhibits the shallow clouds from transitioning to deep convection. A
slightly drier profile with stronger subsidence will result in greenCu dissipation (i.e., sparse-like conditions), and
a colder, moister profile with weaker subsidence will result in greenCu clustering and deepening (i.e., transition/
deep-like conditions).

We believe that the findings of this study, by improving the detection and understanding of shallow Cu, can bene-
fit short-term forecasting over land and estimations of Earth's energy balance. Specifically, the insights regarding

DROR ET AL. 6 of 8
Geophysical Research Letters 10.1029/2021GL096684

the large-scale meteorological conditions propitious to greenCu formation are useful for the parameterization
of such clouds in General Circulation Models as well as advance our understanding in low cloud feedback. The
question of how such cloud fields and their feedback effects will respond to changes in the meteorological condi-
tions within a warming climate, remains however open.

Data Availability Statement


The greenCuDb, the visually-labeled images used to train the model, as well as a Python interface of the ResNet-
34 classifier implementation and training for the greenCu labeling, the final trained classifier used to generate the
greenCuDb, the code of the (ERA5 based) meteorological analysis and the data to reproduce the figures of the
manuscript, are publicly available at https://doi.org/10.34933/wis.000393. ERA5 reanalyzes were downloaded
from https://www.ecmwf.int/en/forecasts/datasets/reanalysis-datasets/era5.

Acknowledgments References
This research received funding from
the European Research Council under Bai, C., Zhang, M., Zhang, J., Zheng, J., & Chen, S. (2021). LSCIDMR: Large-Scale Satellite Cloud Image Database for Meteorological Research.
the European Union's Horizon 2020 IEEE Transactions on Cybernetics, 1–13. https://doi.org/10.1109/tcyb.2021.3080121
research and innovation programme Berg, L. K., & Kassianov, E. I. (2008). Temporal variability of fair-weather cumulus statistics at the ACRF SGP site. Journal of Climate, 21(13),
(grant agreement No. 810370). This 3344–3358. https://doi.org/10.1175/2007JCLI2266.1
research was partially supported by the Bony, S. (2005). Marine boundary layer clouds at the heart of tropical cloud feedback uncertainties in climate models. Geophysical Research
Israeli Council for Higher Education via Letters, 32(20), L20806. https://doi.org/10.1029/2005GL023851
the Weizmann Data Science Research Bony, S., Dufresne, J.-L., Le Treut, H., Morcrette, J.-J., & Senior, C. (2004). On dynamic and thermodynamic components of cloud changes.
Center, by a Ben May Center grant for Climate Dynamics, 22(2–3), 71–86. https://doi.org/10.1007/s00382-003-0369-6
theoretical and/or computational research Bony, S., Schulz, H., Vial, J., & Stevens, B. (2020). Sugar, gravel, fish, and flowers: Dependence of mesoscale patterns of trade-wind clouds on
and by a research grant from the Estate environmental conditions. Geophysical Research Letters, 47(7), e2019GL085988. https://doi.org/10.1029/2019gl085988
of Tully and Michele Plesser. The Israel Boucher, O., Randall, D., Artaxo, P., Bretherton, C., Feingold, G., Forster, P., et al. (2013). Clouds and aerosols. In Climate change 2013: The
Meteorological Service and ECMWF are physical science basis. Contribution of working group I to the fifth assessment report of the intergovernmental panel on climate change (pp.
acknowledged for providing access to 571–657). Cambridge University Press.
ERA5 data. Finally, we thank two anon- Bröcker, J., & Smith, L. A. (2007). Scoring probabilistic forecasts: The importance of being proper. Weather and Forecasting, 22(2), 382–388.
ymous reviewers, whose comments have https://doi.org/10.1175/waf966.1
improved the manuscript considerably. Brown, A., Cederwall, R., Chlond, A., Duynkerke, P., Golaz, J.-C., Khairoutdinov, M., et al. (2002). Large-eddy simulation of the diurnal cycle of
shallow cumulus convection over land. Quarterly Journal of the Royal Meteorological Society: A journal of the atmospheric sciences, applied
meteorology and physical oceanography, 128(582), 1075–1093. https://doi.org/10.1256/003590002320373210
Da Silva, R. R., Gandu, A. W., Sá, L. D., & Dias, M. A. S. (2011). Cloud streets and land–water interactions in the Amazon. Biogeochemistry,
105(1–3), 201–211. https://doi.org/10.1007/s10533-011-9580-4
Dev, S., Lee, Y. H., & Winkler, S. (2015). Categorization of cloud image patches using an improved texton-based approach. In 2015 IEEE inter-
national conference on image processing (ICIP) (pp. 422–426). IEEE.
Dror, T., Chekroun, M. D., Altaratz, O., & Koren, I. (2021). Deciphering organization of GOES-16 green cumulus through the empirical orthog-
onal function (EOF) lens. Atmospheric Chemistry and Physics, 21(16), 12261–12272. https://doi.org/10.5194/acp-21-12261-2021
Dror, T., Koren, I., Altaratz, O., & Heiblum, R. H. (2020). On the abundance and common properties of continental, organized shallow (green)
clouds. IEEE Transactions on Geoscience and Remote Sensing, 59(6), 4570–4578. https://doi.org/10.1109/TGRS.2020.3023085
Dror, T., Silverman, V., Altaratz, O., Chekroun, M. D., & Koren, I. (2021). Data from: The large-scale meteorology that drives continental,
shallow, green cumulus (greencu) through supervised classification. The Weizmann Institute of Science.
He, K., Zhang, X., Ren, S., & Sun, J. (2016a). Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer
vision and pattern recognition (pp. 770–778).
He, K., Zhang, X., Ren, S., & Sun, J. (2016b). Identity mappings in deep residual networks. In European conference on computer vision (pp.
630–645).
Hersbach, H., Bell, B., Berrisford, P., Hirahara, S., Horányi, A., Muñoz-Sabater, J., et al. (2020). The ERA5 global reanalysis. Quarterly Journal
of the Royal Meteorological Society, 146(730), 1999–2049. https://doi.org/10.1002/qj.3803
Howard, J., & Gugger, S. (2020). FASTAI: A layered API for deep learning. Information, 11(2), 108. https://doi.org/10.3390/info11020108
Jiang, H., Xue, H., Teller, A., Feingold, G., & Levin, Z. (2006). Aerosol effects on the lifetime of shallow cumulus. Geophysical Research Letters,
33(14), L14806. https://doi.org/10.1029/2006gl026024
Ker, J., Wang, L., Rao, J., & Lim, T. (2017). Deep learning applications in medical image analysis. IEEE Access, 6, 9375–9389. https://doi.
org/10.1109/access.2017.2788044
Köppen, W. P. (1923). Die klimate der erde: Grundriss der klimakunde. Walter de Gruyter.
Koren, I., Kaufman, Y. J., Remer, L. A., & Martins, J. V. (2004). Measurement of the effect of Amazon smoke on inhibition of cloud formation.
Science, 303(5662), 1342–1345. https://doi.org/10.1126/science.1089424
Kremer, J., Stensbo-Smidt, K., Gieseke, F., Pedersen, K. S., & Igel, C. (2017). Big universe, big data: Machine learning and image analysis for
astronomy. IEEE Intelligent Systems, 32(2), 16–22. https://doi.org/10.1109/mis.2017.40
Lamer, K., & Kollias, P. (2015). Observations of fair-weather cumuli over land: Dynamical factors controlling cloud size and cover. Geophysical
Research Letters, 42(20), 8693–8701. https://doi.org/10.1002/2015GL064534
Lenderink, G., Siebesma, A. P., Cheinet, S., Irons, S., Jones, C. G., Marquet, P., et al. (2004). The diurnal cycle of shallow cumulus clouds over
land: A single-column model intercomparison study. Quarterly Journal of the Royal Meteorological Society: A journal of the atmospheric
sciences, applied meteorology and physical oceanography, 130(604), 3339–3364. https://doi.org/10.1256/qj.03.122
Nielsen, A. H., Iosifidis, A., & Karstoft, H. (2021). Cloudcast: A satellite-based dataset and baseline for forecasting clouds. IEEE Journal of
Selected Topics in Applied Earth Observations and Remote Sensing, 14, 3485–3494. https://doi.org/10.1109/jstars.2021.3062936

DROR ET AL. 7 of 8
Geophysical Research Letters 10.1029/2021GL096684

Nobre, C. A., Mattos, L. F., Dereczynski, C. P., Tarasova, T. A., & Trosnikov, I. V. (1998). Overview of atmospheric conditions during the
Smoke, Clouds, and Radiation-Brazil (SCAR-B) field experiment. Journal of Geophysical Research, 103(D24), 31809–31820. https://doi.
org/10.1029/98jd00992
Norris, J. R. (1998). Low cloud type over the ocean from surface observations. Part II: Geographical and seasonal variations. Journal of Climate,
11(3), 383–403. https://doi.org/10.1175/1520-0442(1998)011<0383:lctoto>2.0.co;2
Nuijens, L., & Siebesma, A. P. (2019). Boundary layer clouds and convection over subtropical oceans in our current and in a warmer climate.
Current Climate Change Reports, 5(2), 80–94. https://doi.org/10.1007/s40641-019-00126-x
Parker, S. S., Hawes, J. T., Colucci, S. J., & Hayden, B. P. (1989). Climatology of 500 mb cyclones and anticyclones, 1950–85. Monthly Weather
Review, 117(3), 558–571. https://doi.org/10.1175/1520-0493(1989)117<0558:comcaa>2.0.co;2
Rabin, R. M., Stadler, S., Wetzel, P. J., Stensrud, D. J., & Gregory, M. (1990). Observed effects of landscape variability on convective clouds.
Bulletin of the American Meteorological Society, 71(3), 272–280. https://doi.org/10.1175/1520-0477(1990)071<0272:oeolvo>2.0.co;2
Rasp, S., Schulz, H., Bony, S., & Stevens, B. (2020). Combining crowdsourcing and deep learning to explore the mesoscale organization of shal-
low convection. Bulletin of the American Meteorological Society, 101(11), E1980–E1995. https://doi.org/10.1175/bams-d-19-0324.1
Stevens, B., Farrell, D., Hirsch, L., Jansen, F., Nuijens, L., Serikov, I., et al. (2016). The Barbados Cloud Observatory: Anchoring investigations
of clouds and circulation on the edge of the ITCZ. Bulletin of the American Meteorological Society, 97(5), 787–801. https://doi.org/10.1175/
bams-d-14-00247.1
Tobin, I., Bony, S., & Roca, R. (2012). Observational evidence for relationships between the degree of aggregation of deep convection, water
vapor, surface fluxes, and radiation. Journal of Climate, 25(20), 6885–6904. https://doi.org/10.1175/jcli-d-11-00258.1
Vial, J., Bony, S., Stevens, B., & Vogel, R. (2017). Mechanisms and model diversity of trade-wind shallow cumulus cloud feedbacks: A review.
In Shallow clouds, water vapor, circulation, and climate sensitivity (pp. 159–181). Springer.
Webb, M. J., Senior, C., Sexton, D., Ingram, W., Williams, K., Ringer, M., et al. (2006). On the contribution of local feedback mechanisms to the
range of climate sensitivity in two GCM ensembles. Climate Dynamics, 27(1), 17–38. https://doi.org/10.1007/s00382-006-0111-2
Weger, R., Lee, J., Zhu, T., & Welch, R. (1992). Clustering, randomness and regularity in cloud fields: 1. Theoretical considerations. Journal of
Geophysical Research, 97(D18), 20519–20536. https://doi.org/10.1029/92jd02038
Whittaker, L. M., & Horn, L. H. (1981). Geographical and seasonal distribution of North American cyclogenesis, 1958–1977. Monthly Weather
Review, 109(11), 2312–2322. https://doi.org/10.1175/1520-0493(1981)109<2312:gasdon>2.0.co;2
Zantedeschi, V., Falasca, F., Douglas, A., Strange, R., Kusner, M. J., & Watson-Parris, D. (2019). Cumulo: A dataset for learning cloud classes.
arXiv preprint arXiv:1911.04227.
Zelinka, M. D., Myers, T. A., McCoy, D. T., Po-Chedley, S., Caldwell, P. M., Ceppi, P., et al. (2020). Causes of higher climate sensitivity in
CMIP6 models. Geophysical Research Letters, 47(1), e2019GL085782. https://doi.org/10.1029/2019gl085782
Zhang, J., Liu, P., Zhang, F., & Song, Q. (2018). Cloudnet: Ground-based cloud classification with deep convolutional neural network. Geophys-
ical Research Letters, 45(16), 8665–8672. https://doi.org/10.1029/2018gl077787
Zhang, Y., & Klein, S. A. (2013). Factors controlling the vertical extent of fair-weather shallow cumulus clouds over land: Investigation of diur-
nal-cycle observations collected at the ARM Southern Great Plains site. Journal of the Atmospheric Sciences, 70(4), 1297–1315. https://doi.
org/10.1175/JAS-D-12-0131.1

DROR ET AL. 8 of 8

You might also like