Professional Documents
Culture Documents
https://doi.org/10.1038/s41586-019-1852-5 Liangsheng Zhang1,23,24*, Fei Chen1,2,23,24, Xingtan Zhang1,23, Zhen Li3,4,23, Yiyong Zhao5,6,23,
Rolf Lohaus3,4,23, Xiaojun Chang1,7,23, Wei Dong1, Simon Y. W. Ho8, Xing Liu1, Aixia Song1,
Received: 6 August 2019
Junhao Chen9, Wenlei Guo9, Zhengjia Wang9, Yingyu Zhuang1, Haifeng Wang1, Xuequn Chen1,
Accepted: 31 October 2019 Juan Hu1, Yanhui Liu1, Yuan Qin1, Kai Wang1, Shanshan Dong7, Yang Liu7,10, Shouzhou Zhang7,
Xianxian Yu11, Qian Wu12,13, Liangsheng Wang12,13, Xueqing Yan13,14, Yuannian Jiao13,14,
Published online: 18 December 2019
Hongzhi Kong13,14, Xiaofan Zhou15, Cuiwei Yu16, Yuchu Chen16, Fan Li17, Jihua Wang17,
Open access Wei Chen18, Xinlu Chen19, Qidong Jia20, Chi Zhang19, Yifan Jiang2, Wanbo Zhang2, Guanhua Liu21,
Jianyu Fu21, Feng Chen2,19,20,24, Hong Ma6,24, Yves Van de Peer3,4,22,24 & Haibao Tang1,24
Many water lily species, particularly from Nymphaea (Nymphaeaceae), were anchored onto 14 pseudo-chromosomes (Extended Data Fig. 1
have large and showy flowers and belong to the angiosperms (also and Extended Data Table 1). Genome completeness was estimated to
called flowering plants). Their aesthetic beauty has captivated nota- be 94.4% (Supplementary Note 2). We annotated 31,580 protein-coding
ble artists such as the French impressionist Claude Monet. Water lily genes and predicted repetitive elements with a collective length of
flowers have limited differentiation in perianths (outer floral organs), 160.4 Mb, accounting for 39.2% of the genome (Supplementary Note 3).
but they possess both male and female organs and have diverse scents The N. colorata genome provides an opportunity to resolve the
and colours, similar to many mesangiosperms (core angiosperms, relationships between Amborellales, Nymphaeales and all other extant
including eudicots, monocots, and magnoliids) (Supplementary angiosperms (Fig. 1a). Using six eudicots, six monocots, N. colorata and
Note 1). In addition, some water lilies have short life cycles and enormous Amborella5, and each of three gymnosperm species (Ginkgo biloba,
numbers of seeds4, which increase their potential as a model plant to rep- Picea abies and Pinus taeda) as an outgroup in turn, we identified 2,169,
resent the ANA-grade of angiosperms and to study early evolutionary 1,535 and 1,515 orthologous low-copy nuclear (LCN) genes, respec-
events within the angiosperms. In particular, N. colorata Peter has a tively (Fig. 1b). Among the LCN gene trees inferred from nucleotide
relatively small genome size (2n = 28 and approximately 400 Mb) and sequences using G. biloba as an outgroup, 62% (294 out of 475 trees)
blue petals that make it popular in breeding programs (Supplementary place Amborella as the sister lineage to all other extant angiosperms
Note 1). with bootstrap support greater than 80% (type II, Fig. 1c). Using P. abies
We report here the genome sequence of N. colorata, obtained using or P. taeda as the outgroup, Amborella is placed as the sister lineage
PacBio RSII single-molecule real-time (SMRT) sequencing technol- to the remaining angiosperms in 57% and 54% of the LCN gene trees,
ogy. The genome was assembled into 1,429 contigs (with a contig N50 respectively. LCN gene trees inferred using amino acid sequences show
of 2.1 Mb) and total length of 409 Mb with 804 scaffolds, 770 of which similar phylogenetic patterns (Supplementary Note 4.1).
Caryophyllales
Type I Type I Type I Stellaria media
46 26
Mirabilis jalapa
24 Phytolacca americana
10% Type III Type III Type III
14% 11% Delosperma cooperi
135 56 74 F14
28% Opuntia dillenii
29% 35% Polygonum runcinatum
Tamarix chinensis
Type II Type II Type II Hieracium umbellatum
294 116 Cichorium intybus
108 Lactuca sativa
Eudicots
62% 57% 54%
Sonchus asper
Tagetes erecta
Dahlia pinnata
Cirsium vulgare
BS > 80% Dipsacus laciniatus
Asterids
Lonicera japonica
Ligusticum striatum
1 Daucus carota
Apium graveolens
e Hedera nepalensis
Mentha canadensis
3 Mimulus guttatus
2 Phyla dulcis
F16 Sesamum indicum
Olea europaea
Solanum lycopersicum
4 5 6 Capsicum annuum
6 Vinca major
F7
Aucuba japonica
Philadelphus incanus
8 9 F12 Hydrangea macrophylla
Cornus wilsoniana
Camellia japonica
Gunnera manicata Gunnerales
10 14 11 F4
Buxus sinica
F10
Nelumbo nucifera
Platanus acerifolia Early-diverging
Meliosma parviflora
Aquilegia coerulea Eudicot lineages
12 13 Nandina domestica
F21 Eschscholzia californica
Cinnamomum camphora
Magnoliids
Litsea cubeba Laurales
F8
15 17 16 Chimonanthus praecox
Liriodendron chinense Magnoliales
F9 Piper nigrum
Houttuynia cordata Piperales
18 19 Aristolochia tagala
Oryza sativa
Brachypodium distachyon
F15 Sorghum bicolor
Ananas comosus Commelinids
20 F20
Canna indica
F13
Musa acuminata
Trachycarpus fortunei
Monocots
Asparagus officinalis
Yucca filamentosa Asparagales
Phalaenopsis equestris
Phalaenopsis aphrodite
F19
Lilium brownii Liliales
7 Pandanus utilis Pandanales
7 Dioscorea opposita Dioscoreales
Pinellia ternate
Spirodela polyrhiza Alismatales
Zostera marina
Austrobaileyales Acorus calamus Acorales
Illicium henryi Austrobaileyales
Nymphaea ‘Choolarp’ 1
Nymphaea ‘Paramee’ 2
Nymphaea ‘Thong Garnjana’ 3
ANA-grade angiosperms
Nymphaea ‘Midnight’ 4
Summarized BS values from 5 coalescent trees F18 Nymphaea ‘Woods blue goddess’5
At least 4 trees: BS ≥ 80 Nymphaea caerulea 6
At least 4 trees: BS ≥ 70 Nymphaea colorata 7
Nymphaea gigantea ‘Hybrid I’ 8
Nymphaeales
At least 3 trees: BS ≥ 80 Nymphaea gigantea‘Albert de Lestang’ 9
Low support Nymphaea tetragona 10
Nymphaea mexicana 11
F Fossil calibration point Nymphaea potamophila 12
F17 Nymphaea prolifera 13
Nymphaea rubra 14
F6
Euryale ferox 15
Nymphaeales Victoria cruziana 16
F5 Nuphar lutea 17
F1
Nuphar advena 18
Brasenia schreberi 19
F11
Amborellales Cabomba caroliniana 20
Amborella trichopoda Amborellales
Pinus taeda
F3
Picea abies Gymnosperms
F2 Gnetum montanum
Ginkgo biloba
Carboniferous Permian Triassic Jurassic Cretaceous Palaeogene Neogene
375 350 325 300 275 250 225 200 175 150 125 100 75 50 25 0
Time (Ma)
Fig. 1 | Phylogenomic relationships of angiosperms. a, Three different outgroup. The percentage was calculated by dividing the number of type I, II or
evolutionary relationships among major clades of angiosperms. b, Number of III trees (BS > 80%) by the total number of trees. d, Summary phylogeny and
LCN gene trees with different bootstrap support (BS) values based on timescale of 115 plant species. Blue bars at nodes represent 95% credibility
nucleotide sequences from six eudicots, six monocots, N. colorata, Amborella intervals of the estimated dates. e, The flowers of the 20 sampled water lilies in
and three different gymnosperms. c, Comparison of gene trees supporting the Nymphaeales used in d.
three evolutionary relationships using each gymnosperm in turn as the
To minimize the potential shortcomings of sparse taxon sampling6, ANA-grade, eudicots, magnoliids, monocots and a gymnosperm out-
we also inferred an angiosperm species tree using sequences from group (Gnetum montanum, G. biloba, P. abies and P. taeda) (Methods).
44 genomes and 71 transcriptomes, including representatives of the For further phylogenetic inference of these 115 species, we selected,
Nymphaea caerulea
500 Nuphar advena−N. colorata 2.5
Number of anchor pairs
Orthologue density
400 2.0 9
9 Nymphaea mexicana
0.2569 0.1016
Euryale ferox
300 Cabomba caroliniana−N. colorata 1.5 19 0.0607
19
0.0734
Victoria cruziana
0.0527 0.0864
200 1.0
Illicium henryi−N. colorata Nymphaeales
211(28)
213
Nuphar lutea
0.0203
Amborella trichopoda 0.5132 0.1301
Nuphar advena
100 −N. colorata 0.5 5(1)
5 0.0435
5(2) 0.033
Cabomba caroliniana
0.5284
0 0 6
Austrobaileyales
Illicium henryi
0.5892
0 1 2 3 4 Amborellales
Amborella trichopoda
KS 0.7231
Ginkgo biloba
c d
N2 N2
N. colorata N. colorata
C2 (loss) N1
C. caroliniana N1
C. caroliniana C1
C1
Fig. 2 | A Nymphaealean WGD shared by Nymphaeaceae and possibly pairs in N. colorata coalesced. All mapped duplication events have BS ≥ 80% in
Cabombaceae. a, KS age distributions for paralogues found in collinear regions the gene trees. c, Left, the scenario of a WGD (yellow four-pointed star) before
(anchor pairs) of N. colorata and for orthologues between N. colorata and the divergence between Nymphaeaceae and Cabombaceae. Right, a possible
selected Nymphaealean and angiosperm species. Red and yellow arrows gene tree under this scenario, with loss of one duplicate in C. caroliniana. Two
indicate under- and overestimations of the N. colorata–Nuphar advena and red dots show where the anchor pair of N. colorata would coalesce. d, Left,
N. colorata–C. caroliniana divergence, respectively. b, WGD phylogenomic scenario of a WGD in the stem lineage of Nymphaeaceae involving an
analysis. Numbers in parentheses are the number of gene families with retained allotetraploid (green four-pointed star) that formed between two ancestral
C. caroliniana duplicates supporting the duplication events. Numbers below parents after the divergence of the lineages leading to N. colorata and
branches show branch lengths in KS units. The double-arrowed line denotes C. caroliniana, with one of the parents being more closely related to
total KS from the pointed node to N. colorata. We used G. biloba (dashed branch) C. caroliniana. Right, a gene tree under such a scenario. Red dots are as in c.
as an outgroup. The red dot denotes the branch on which most of the anchor
based on various criteria, five different LCN gene sets including 1,167, occurred before or close to the divergence between Nymphaeaceae
834, 683, 602 and 445 genes. Analyses of these five datasets all yielded and Cabombaceae (Extended Data Fig. 2d, Supplementary Note 5.3),
similar tree topologies with Amborella and Nymphaeales as successive considering the variable substitution rates among Nymphaealean line-
sister lineages to all other extant angiosperms (Fig. 1d, e, Supplemen- ages (Fig. 2a, b, Extended Data Fig. 2c). An alternative interpretation
tary Note 4.2). of the above results could be that the WGD signatures were from an
Molecular dating of angiosperm lineages, using a stringent set of 101 allopolyploidy event that occurred between ancestral Nymphaeaceae
LCN genes and with age calibrations based on 21 fossils7, inferred the and Cabombaceae lineages shortly after their divergence and that
crown age of angiosperms at 234–263 million years ago (Ma) (Fig. 1d). gave rise to the Nymphaeaceae (but not Cabombaceae) stem lineage
The split between monocots and eudicots was estimated at 171–203 Ma (Fig. 2d, Supplementary Note 5.4).
and that between Nymphaeaceae and Cabombaceae at 147–185 Ma. The water lily lineage descended from one of the early divergences
Genomic collinearity unveiled evidence of a whole-genome dupli- among angiosperms, before the radiation of mesangiosperms. Thus,
cation (WGD) event in N. colorata (Extended Data Figs. 1f, 2a and Sup- this group offers a unique window into the early evolution of angio-
plementary Note 5.1). The number of synonymous substitutions per sperms, particularly that of the flower. We identified 70 MADS-box
synonymous site (KS) distributions for N. colorata paralogues further genes, including homologues of the genes for the ABCE model of floral
showed a signature peak at KS of approximately 0.9 (Fig. 2a) and peaks at organ identities: AP1 (and also FUL) and AGL6 (A function for sepals
similar KS values were identified in other Nymphaeaceae species (Sup- and petals), AP3 and PI (B function for petals and stamen), AG (C func-
plementary Note 5.2), which suggests an ancient single WGD event that tion for stamen and carpel), and SEP1 (E function for interacting with
is probably shared among Nymphaeaceae members. Comparison of ABC function proteins). Phylogenetic and collinearity analyses of the
the N. colorata paralogue KS distribution with KS distributions of ortho- MADS-box genes and their genomic neighbourhood indicate that an
logues (representing speciation events) between N. colorata and other ancient tandem duplication before the divergence of seed plants gave
Nymphaeales lineages, Illicium henryi, and Amborella suggests that birth to the ancestors of A function (FUL) and E function genes (SEP)
the WGD occurred just after the divergence between Nymphaeaceae (Extended Data Fig. 3, Supplementary Note 6.1). Also, owing to the Nym-
and Cabombaceae (Fig. 2a). By contrast, phylogenomic analyses of phaealean WGD, N. colorata has two paralogues, AGa and AGb of the
gene families that contained at least one paralogue pair from collinear C-function gene AG (Extended Data Fig. 4). Similarly, the Nymphaealean
regions of N. colorata suggest that the WGD is shared between Nym- WGD-derived duplicates are homologous to other genes associated
phaeaceae and Cabombaceae (Fig. 2b, Supplementary Note 5.4). If true, with development of carpel and stamen8, and to genes that regulate
Cabomba caroliniana seems to have retained few duplicates (Fig. 2b, flowering time9 and auxin-controlled circadian opening and closure of
c), which would explain the absence of a clear peak in the C. caroliniana the flower10 (Extended Data Figs. 4–6, Supplementary Note 6.2–6.4).
paralogue KS distribution (Supplementary Note 5.2). Absolute dating The expression profiles of N. colorata ABCE homologues largely agree
of the paralogues of N. colorata does suggest that the WGD could have with their putative ascribed roles in floral organ patterning (Fig. 3a).
10
a b Eudicots, typical ABCE model
8
6
4
2
0
log2(FPKM+1)
B
FLCb
r1
A C
e
OsMADS32
st
lu
C
FLCa
SOC1
E
SVP Sepal Petal Stamen Carpel
AGa, C function
r2e
st AGb, C function
Nymphaea, ancestral ABCE model
lu
SEP, E function
C
AP3, B function
AGL6, A function A, AGL6
AGL32b
AGL32c B, AP3
AGL32f
AGL12a B, PI
AGL32g
STK B, AGL32d
AGL32a
C, AGa
r3
AGL32d, B function
e
st
ANR1a
lu
C, AGb
C
ANR1b
AGL32e
PI, B function E, SEP
AGL15
FUL Sepal Petal Stamen Carpel
AGL12b
Gymnosperms, BC model
e re e l k
ni ile t
Se l
ve Sta al
le en
ga C er
l
M leaf af
ve en oo
ta
pe
tiv tu ur al
ga a f
al
or le ea
w
le le
ta Ma at st
Pe
ni m
ns fst
Ju Juv R
ns ar
B
flo
C
C
Ju
or
Female cone/
Male cone/
al
megasporangiate
or
microstrobilus
Fl
strobilus
ge
Ve
Fig. 3 | MADS-box genes in N. colorata and proposed floral ABCE model in genes. Expression values were scaled by log 2(FPKM + 1), in which FPKM is
early angiosperms. a, Gene expression patterns of MIKCc from various organs fragments per kilobase of exon per million mapped reads. b, The flowering
of N. colorata. Three clusters of genes were classified according to the ABCE model in N. colorata that specifies floral organs is proposed based on the
expression of type II MADS-box genes. The organ types (vegetative organs and gene expression values (bar heights) from a.
floral organs) were matched to the expression patterns of type II MADS-box
Notably, the N. colorata AGL6 homologue is mainly expressed in sepals sesquiterpene biosynthase in N. colorata. Also, methyl decanoate has
and petals, whereas the FUL homologue is mainly expressed in carpels, not been detected as a volatile compound in monocots and eudicots18
suggesting that AGL6 acts as an A-function gene in N. colorata. The two and is thought to be synthesized in N. colorata by the SABATH family
C-function homologues AGa and AGb are highly expressed in stamens of methyltransferases19. The N. colorata genome contains 13 SABATH
and carpels, respectively, whereas AGb is also expressed in sepals and homologues and 12 of them form a Nymphaeales-specific group (Sup-
petals, suggesting that they might have undergone subfunctionaliza- plementary Fig. 41). Among these 12 members, NC11G0120830 showed
tion and possibly neofunctionalization for flower development after the highest expression in petals (Fig. 4c) and its corresponding recom-
the Nymphaealean WGD. Furthermore, the ABCE homologues in N. binant protein was demonstrated to be a fatty acid methyltransferase
colorata generally exhibit wider ranges of expression in floral organs that had the highest activity with decanoic acid as the substrate (Fig. 4d,
than their counterparts in eudicot model systems (Fig. 3b). This wider Supplementary Note 7.1). These results suggest that the floral scent
expression pattern, in combination with broader expression of at least biosynthesis in N. colorata has been accomplished through enzymatic
some ABCE genes in some eudicots representing an early-diverging functions that have evolved independently from those in mesangio-
lineage11, some monocots12 and magnoliids13, suggest an ancient ABCE sperms (Fig. 4e).
model for flower development, with subsequent canalization of gene Nymphaea colorata is valued for the aesthetically attractive blue
expression and function regulated by the more specialized ABCE genes colour of petals, which is a rare trait in ornamentals. To understand the
during the evolution of mesangiosperms, especially core eudicots8. molecular basis of the blue colour, we identified delphinidin 3′-O-(2″-
This could also account for the limited differentiation between sepals O-galloyl-6″-O-acetyl-β-galactopyranoside) as the main blue anthocya-
and petals in Nymphaeales species, and is consistent with a single type nidin pigment (Extended Data Fig. 8a–c). By comparing the expression
of perianth organ proposed in an ancestral angiosperm flower14. profiles between two N. colorata cultivars with white and blue petals
Floral scent serves as olfactory cues for insect pollinators15. Whereas for genes in a reconstructed anthocyanidin biosynthesis pathway, we
Amborella flowers are scentless16, N. colorata flowers release 11 different found genes for an anthocyanidin synthase and a delphinidin-modi-
volatile compounds, including terpenoids (sesquiterpenes), fatty- fication enzyme, the expression of which was significantly higher in
acid derivatives (methyl decanoate) and benzenoids (Fig. 4a). The N. blue petals than in white petals (Extended Data Fig. 8d, e). These two
colorata genome contains 92 putative terpene synthase (TPS) genes, enzymes catalyse the last two steps of anthocyanidin biosynthesis and
which are ascribed to four previously recognized TPS subfamilies in are therefore key enzymes specialized in blue pigment biosynthesis20,21
angiosperms: TPS-b, TPS-c, TPS-e/f and TPS-g (Fig. 4b), but none was (Supplementary Note 7.2).
found for TPS-a, which is responsible for sesquiterpene biosynthesis Water lilies have a global distribution that includes cold regions
in mesangiosperms17. Notably, TPS-b contains more than 80 genes in (northern China and northern Canada), unlike the other ANA-grade
N. colorata; NC11G0123420 is highly expressed in flowers (Extended angiosperms Amborella (Pacific Islands) and Austrobaileyales (tem-
Data Fig. 7); this result suggests that it may be a candidate gene for perate and tropical regions). We detected marked expansions of genes
4.0
11 fragrance molecules
Tetradecyne
3.0
9
2.0 Germacrene B
(E)-β-
α-Bergamotene Fanesene
Isodaucene Heptadecene
Methyl Pentadecane 10
1.0 Methyl decanoate
3
3 78
octanoate 8 Farnesane
11 6 11
44 5 6 0.6
0
d
15 20 25 1.0 NC11G0120830
Retention time (min)
0.8
Relative activity
c
SABATH gene family 0.6
FPKM
NC11G0120830 0.4
NC7G0246550 1,400
NC2G0007900 0.2
NC11G0240820 1,200
NC11G0240810 0
NC13G0302010 1,000
Butanoic Hexanoic Octanoic Decanoic Dodecanoic Tetradecanoic
NC12G0138630 acid (C4) acid (C6) acid (C8) acid (C10) acid (C12) acid (C14)
NC11G0138610 800
NC11G0138620 Fatty acids
600
NC2G0007880
NC11G0120810 e Floral sesquiterpenes Floral methyl decanoate
400
NC11G0120820 Compound Gene Compound Gene
Petal
Stamen
Sepal
Juvenile leaf
Juvenile leafstalk
Root
Mature leaf
Carpel
Mature leafstalk
Juvenile flower
Eudicots + TPS-a –
Amborellales – –
Fig. 4 | Floral scent and biosynthesis in N. colorata. a, Gas chromatogram of d, Relative activity of Escherichia coli-expressed NC11G0120830 with six fatty
floral volatiles from the flower of N. colorata. The internal standard (IS) is nonyl acids as substrates, with the activity on decanoic acid set at 1.0. Data are
acetate. Methyl esters are in blue; terpenes are in red. Floral scent was mean ± s.d. of three independent measurements. e, The presence (+) and
measured three times independently with similar results. b, Phylogenetic tree absence (−) of sesquiterpenes and methyl decanoate as floral scent compounds
of terpene synthases from N. colorata and representative plants showing the and their respective biosynthetic genes in four major lineages of angiosperms
subfamilies from a–h and x. c, Expression analysis of SABATH genes of when known. DAMT, decanoic acid methyltranferase.
N. colorata showed that NC11G0120830 had the highest expression level in petal.
related to immunity and stress responses in N. colorata, including 1. Byng, J. W. et al. An update of the Angiosperm Phylogeny Group classification
for the orders and families of flowering plants: APG IV. Bot. J. Linn. Soc. 181, 1–20
genes encoding nucleotide-binding leucine-rich repeat (NLR) proteins, (2016).
protein kinases and WRKY transcription factors, compared with those 2. Zeng, L. et al. Resolution of deep angiosperm phylogeny using conserved nuclear genes
in Amborella and some mesangiosperms (Extended Data Fig. 9, Sup- and estimates of early divergence times. Nat. Commun. 5, 4956 (2014).
3. Qiu, Y. L. et al. The earliest angiosperms: evidence from mitochondrial, plastid and
plementary Note 8). It is possible that increased numbers of these genes nuclear genomes. Nature 402, 404–407 (1999).
enabled water lilies to adapt to various ecological habitats globally. 4. Chen, F. et al. Water lilies as emerging models for Darwin’s abominable mystery. Hortic.
In conclusion, the N. colorata genome offers a reference for compara- Res. 4, 17051 (2017).
5. Amborella Genome Project. The Amborella genome and the evolution of flowering
tive genomics and for resolving the deep phylogenetic relationships plants. Science 342, 1241089 (2013).
among the ANA-grade and mesangiosperms. It has also revealed a WGD 6. Wiens, J. J. Missing data, incomplete taxa, and phylogenetic accuracy. Syst. Biol. 52,
specific to Nymphaeales, and provides insights into the early evolution 528–538 (2003).
7. Coiro, M., Doyle, J. A. & Hilton, J. How deep is the conflict between molecular and fossil
of angiosperms on key innovations such as flower development and evidence on the age of angiosperms? New Phytol. 223, 83–99 (2019).
floral scent and colour. 8. Alvarez-Buylla, E. R. et al. Flower development. Arabidopsis Book 8, e0127 (2010).
9. Zhao, N. et al. Identification of flowering regulatory genes in allopolyploid Brassica
juncea. Hortic. Plant J. 5, 109–119 (2019).
10. Ke, M. et al. Auxin controls circadian flower opening and closure in the waterlily. BMC
Online content Plant Biol. 18, 143 (2018).
11. Sharma, B. & Kramer, E. M. Aquilegia B gene homologs promote petaloidy of the sepals
Any methods, additional references, Nature Research reporting sum-
and maintenance of the C domain boundary. Evodevo 8, 22 (2017).
maries, source data, extended data, supplementary information, 12. Dodsworth, S. Petal, sepal, or tepal? B-genes and monocot flowers. Trends Plant Sci. 22,
acknowledgements, peer review information; details of author con- 8–10 (2017).
13. Chanderbali, A. S. et al. Conservation and canalization of gene expression during
tributions and competing interests; and statements of data and code
angiosperm diversification accompany the origin and evolution of the flower. Proc. Natl
availability are available at https://doi.org/10.1038/s41586-019-1852-5. Acad. Sci. USA 107, 22570–22575 (2010).