Professional Documents
Culture Documents
net/publication/345766280
CITATIONS READS
183 1,743
Some of the authors of this publication are also working on these related projects:
Single-cell transcriptomics tracing evo-devo of neural substrates underlying division of labor in social insects View project
All content following this page was uploaded by Josefin Stiller on 12 November 2020.
https://doi.org/10.1038/s41586-020-2873-9 A list of authors and affiliations appears at the end of the paper.
Comparative genomics is rapidly growing, fuelled by the advancement (Pedionomus torquatus) and the Bali myna (Leucopsar rothschildi, which
of sequencing technologies. Many large-scale initiatives have been has fewer than 50 adults remaining in the wild13).
proposed with a core mission of producing genomes for hundreds of Two hundred and sixty-seven of the 363 species represented in our
species, representing the phylogenetic diversity of particular taxa6–8. genome data are newly released, comprising 18.4 trillion base pairs
Although the generation of genomes is now more routine, an immediate (bp) of raw data and 284 billion bp of assemblies. The assemblies are
challenge is how to efficiently compare large numbers of genomes in comparable in quality to previously published bird genomes9,10, but
an evolutionary context. A critical first step is the accurate detection of vary in contiguity (average scaffold N50 = 1.42 megabases (Mb), contig
orthologous sequences. In this study, we release a large-scale dataset of N50 = 42.57 kilobases (kb); see interactive supplementary figure 1,
bird genomes, which we use to establish a framework for comparative hosted at https://genome-b10k.herokuapp.com/main). The sequencing
analysis. We provide insight on how scaling-up genome sampling assists coverage ranged from 35× (blue-throated roller (Eurystomus gularis)
in our understanding of avian genomic diversity and in the detection and yellowhead (Mohoua ochrocephala)) to 368× (song sparrow (Melo-
of signals of natural selection down to individual bases. spiza melodia)) and genomic completeness was high (average 95.8%).
The B10K Project began with the Avian Phylogenomics Consortium We annotated all 363 genomes using a homology-based method with a
(phase I), which analysed 48 genomes from representatives of most uniform gene set that included gene models from chicken, zebra finch
bird orders9,10. Here we report the genome sequencing outcomes from and human, and published transcriptomes (Supplementary Tables 6–
phase II of the project: these outcomes include a total of 363 species in 8), to predict an average of 15,464 protein-coding genes for each species
92.4% (218 out of 23611,12) of avian families (Supplementary Tables 1–5). (Supplementary Table 1). We also assembled mitochondrial genomes
Species were selected to span the overall diversity and to subdivide long for 336 species, with 216 species fully circularized and 228 species
branches, when possible (Fig. 1, Supplementary Data). Our sampling with a complete mitochondrial annotation (Supplementary Table 1).
covers bird species from every continent (Extended Data Fig. 1) and Bird genomes at the ordinal level were previously found to contain a
more than triples the previous taxonomic coverage of avian genome low proportion of transposable elements, except for the downy wood-
sequencing; to our knowledge, 155 bird families are represented here pecker (Picoides pubescens) in Piciformes10. Consistent with these
for the first time. We chose short-read sequencing as our main strategy findings, 96.1% of birds at the family level had a transposable element
for generating data, which enabled us to use older samples (the oldest of content lower than 15%—but we found additional outliers (Extended
which was collected in 1982) and access rare museum specimens—such Data Fig. 2a, Supplementary Table 1). In particular, long interspersed
as one of the few vouchered tissues of the Henderson crake (Zapornia nuclear elements were prevalent in all nine sequenced species in Pici-
atra), which occurs on a single island. We incorporated 68 species of formes, which suggests an ancestral expansion in this lineage (24% on
concern on the International Union for Conservation of Nature (IUCN) average, Welch two-sample t-test, P = 9.98 × 10−5) (Extended Data Fig. 2b,
Red List of Threatened Species (Supplementary Table 1); these include d). The common scimitarbill (Rhinopomastus cyanomelas) and com-
12 endangered and 2 critically endangered species—the plains-wanderer mon hoopoe (Upupa epops) in Bucerotiformes also had exceptionally
Arenaria interpres
Fregetta
grallaria
Aquila chrysaetos
Agapornis
roseicollis Piaya
cayana
Calypte anna
Odontophorus
gujanensis
Onychorhynchus
coronatus
Apteryx owenii
Orthonyx spaldingii
Dyaphorophyia
castanea
**
Agelaius phoeniceus
Fig. 1 | Newly sequenced genomes densely cover the bird tree of life. The per sequenced family (purple branches). The grey arc marks the diverse
10,135 bird species11,12 are shown on a draft phylogeny that synthesizes Passeriformes radiation, with 6,063 species, of which 173 species have genome
taxonomic and phylogenetic information36 (Supplementary Data). In total, assemblies now. Chicken (*) and zebra finch (**) are marked for orientation.
363 species, covering 92.4% of all families, now have at least 1 genome assembly Paintings illustrate examples of sequenced species.
high transposable element content (23% and 18%, respectively) owing Project phase I. Nonetheless, 498 genes remained absent across all
to recent expansions of long interspersed nuclear elements, whereas 363 bird species, which adds to evidence that these genes were truly
two hornbill species in the same order exhibited the typical low propor- lost in the common ancestor.
tions (Extended Data Fig. 2e). We also investigated a number of genes that were previously associ-
Previous studies have suggested that hundreds of genes were lost ated with phenotypes and physiological pathways. For example, we
in the ancestor of birds10,14. Gene-loss inference is complicated by found that rhodopsin (encoded by RH1) and the medium-wavelength
incomplete assemblies and can be unreliable with only a few species15. sensitive opsin (encoded by RH2) were present in all 363 birds, but
We found that 142 genes previously considered to be absent in Aves10 were incomplete or pseudogenized in 5 and 11 species, respectively
were detected in at least one of the newly sequenced bird genomes (Supplementary Table 10). The other three conopsin genes showed a
(Supplementary Table 9), which implies that these genes were either more varied pattern of presence and absence. OPN1sw2 and OPN1lw
lost multiple times or missed in the assemblies of the 48 birds of B10K existed either as partial sequences or were completely absent in 310
APP
Chr. 27
2
GAT
Coliiformes
GH_L GH_L
Zebra finch Trogoniformes
MO
Chr. 27
GH_L Columbiformes
GH_L
Atlantic canary HG009 Pterocliformes
Otidiformes
1
1
251.1
4A
A3
9B
KHM
RDM
GH_S Chr. 1
SCN
C37
GH_S Psittaciformes
CD7
Zebra finch
HG009
PLE
Podicipediformes
LRR
GH_S 281.1
GH_S
Atlantic canary Rheiformes
Eurypygiformes
c Corvidae Musophagiformes
DAC 15-like
Paridae Cuculiformes
L1
Passeriformes
H1
Cariamiformes
JC
44
Scotocercidae
A
F1
T1
5
KLH
BOR
RRP
DNA
KLF
Passeriformes
PIB
MZ
Sylviidae Falconiformes
MRCA of Passeriformes Emberizidae Anseriformes
Apterygiformes
Tachurididae
Tinamiformes
Eurylaimidae Sphenisciformes
Acanthisittidae Galliformes
Psittaciformes Coraciiformes
Accipitriformes Charadriiformes
Strigiformes
Pelecaniformes Piciformes
Charadriiformes Gruiformes
Caprimulgiformes Casuariiformes
Bucerotiformes
Columbiformes
Procellariiformes Passerea
Anseriformes Accipitriformes Columbea
Galliformes Caprimulgiformes Palaeognathae
Tinamiformes Pelecaniformes Galloanseres
Fig. 2 | Improved orthologue distinction and detection of lineage-specific than one sequenced representative, lineage-specific sequences are those
sequences. a, Incorporating synteny in the orthologue assignment pipeline present in the reconstructed ancestral genome but absent in other lineages.
resolves complex cases of orthology. The growth hormone gene (GH) has one Colours denote higher-level taxonomic groupings9. c, A novel gene in
copy in chicken and two copies in Passeriformes (exemplified by zebra finch Passeriformes. Phylogeny based on the B10K Project phase I9 plotted with
and Atlantic canary). On the basis of the conserved synteny of the GH_L in synteny of a putative lineage-specific gene (DNAJC15L) and its surrounding
Passeriformes with GH of chicken, the pipeline recognized GH_L as the genes. DNAJC15L is found in 131 out of 173 sequenced Passeriformes and their
ancestral copy—despite high similarity to the other copy. b, The whole-genome reconstructed ancestral genome, but is not found in non-Passeriformes.
alignment allows detecting lineage-specific sequences. For orders with more MRCA, most-recent common ancestor.
and 308 species, respectively, and OPN1sw1 was functional in more Gene duplications are an important mechanism that shapes genome
than half of the 363 birds—especially in Passeriformes (perching birds) evolution, because duplicated copies often evolve under different
(Extended Data Fig. 3, Supplementary Table 10). selective pressures and evolutionary rates20. We developed an ortho-
Passeriformes also had a notably higher GC content than other birds logue assignment pipeline that incorporates information about the
in coding regions (Welch two-sample t-test, P = 7.59 × 10−43) (Extended genomic context of the gene copies (synteny) with the Cactus alignment
Data Fig. 4a) but not in non-coding regions (Welch two-sample t-test, to permit distinguishing between the ancestral copies, those inherited
P = 0.06). Differences in GC content can result in biased use of particular from a more recent common ancestor and duplicated novel copies
synonymous codons over others, which can affect gene expression (Extended Data Fig. 5a, Supplementary Tables 11, 12). An example is the
and translation efficiency16. Consistent with this hypothesis, relative growth hormone (GH) gene that was previously found to be duplicated
synonymous codon use values for 59 synonymous codons (excluding in 24 Passeriformes (to produce GH_L and GH_S)21. We confirmed that
non-degenerate codons, Met, Trp and three stop codons) showed sub- this gene duplication occurred exclusively in Passeriformes (found in
stantial differences between Passeriformes and other birds, especially 161 out of 173 species; its absence in 12 species is caused by incomplete
in the preference of codons ending in G or C (Extended Data Fig. 4b, c, e). assemblies), resulting in a one-to-many relationship with the single
Passeriformes significantly deviated from random use of synonymous copy in other birds (Extended Data Fig. 6). The synteny with surround-
codons with a smaller average effective number of codons compared ing genes identified the passeriform GH_L as the ancestral copy, and
to other birds (paired-sample t-test, P < 2.2 × 10−16) (Extended Data GH_S as a newly derived copy located in a different genomic context
Fig. 4f). These results indicate that the GC content may have affected (Fig. 2a). Moreover, when the pipeline was applied to both datasets
the gene evolution of the speciose Passeriformes. (of 48 and 363 bird species), the higher taxon sampling allowed the
To gain further evolutionary insight from the genomes, we con- detection of 439 additional orthologues with conserved synteny to
structed a whole-genome alignment of the 363 genomes using a chicken—many of which were lineage-specific gene copies. These addi-
progressive version of the reference-free aligner Cactus17,18. Cactus tional orthologues, improved by the denser representation of species
produced a substantially more-complete alignment than the com- and the Cactus alignment, will drive downstream comparative analyses.
monly used reference-based method MULTIZ19, particularly when the Using the Cactus alignment, we reconstructed an ancestral
aligned species were phylogenetically distant from the chicken refer- genome for each evolutionary node to characterize both shared and
ence17. In comparison to a previous alignment of the 48 bird genomes lineage-specific genomic diversity. Being able to identify sequences
using chicken and zebra finch as references10, our reference-free unique to particular lineages, and not only those shared with a refer-
approach and extended sampling unlocked a far greater proportion ence genome, is a major advantage of a reference-free alignment22. We
of orthologous sequences: 981 Mb across the whole genome (a 149% found that lineage-specific sequences constitute 0.2% to 5.5% of the
increase), 24 Mb of orthologous coding sequence (an 84.4% increase) reconstructed genome of the most-recent common ancestor of each
and 141 Mb of orthologous introns (a 631% increase) that derived order (Fig. 2b, Supplementary Table 13). Among these, we identified
from the common avian ancestors between chicken and any other 154 Passeriformes-specific genes (Supplementary Table 14). The gene
bird species. present in the largest number of passerines (131 out of 173 species) is
Proportion of significantly
Neutral rate
0.6
7.5
conserved single bp
53-way
53-way
0.40
5.0
7.5
77-way
0
Not significant 5.0 Coding UTR lncRNA Whole Introns Ancestral
0.20 exons exons exons genome repeats
Significantly 2.5
conserved d
53-way
(FDR < 0.05): 0 4
13.21% 363-way 10.0
2 FDR < 0.05
−log(q value)
0.10 7.5
0
363-way
5.0
363-way
–2
3.76% 77-way 2.5 4
2.11% 53-way 2 FDR < 0.05
0 0
0 0.05 0.10 0.15 0.20 0 0.25 0.50 0.75 1.00
0
Expected fraction of the genome Rate of column A A A C T G C T G A G T C A T
with a false-positive call (as a proportion of neutral rate) Position on chromosome 2 (MAF::NFE2)
Fig. 3 | Denser phylogenomic sequencing increases the power to detect non-significantly conserved columns. A rate of zero contains a relatively high
selective constraints. Results are shown from 3 alignments for 53 birds, proportion of recent insertions present in only a few species; there is limited
77 vertebrates, and 363 birds. a, Proportion of alignment columns labelled as statistical power to classify such insertions. c, Proportion of various functional
conserved. The cumulative portion of the genome with a conserved call is regions of the chicken genome that contain single-bp conserved elements in
shown, starting from the column with the smallest P value and proceeding to the large alignment compared to alignments with fewer species. UTR,
the columns with the highest P values. The dotted lines show the path after untranslated region. d, An example of a MAF::NFE2 motif overlaid on one of its
hitting the false-discovery rate (FDR) P value cutoff of 0.05, below which calls predicted binding sites demonstrates the high resolution of our conserved site
are significant (marked by arrows). b, Histograms of the rate of alignment predictions and the increased power to predict conservation in the larger
columns evolving slower relative to the neutral rate (labelled 1.00). Coloured alignment.
areas indicate significantly conserved columns, and light grey areas indicate
a paralogue of the heat shock protein gene DNAJC15, which has many genome. We scaled our model of the genome-wide mutation rate to
copies in bird genomes and is thought to be associated with the bio- match the neutral rate observed in microchromosomes, macrochro-
genesis of mitochondria23 and fertilization24. We identified a novel mosomes and sex chromosomes, because each chromosome type
Passeriformes-specific copy (which we named DNAJC15-like (DNA- shows different evolutionary rates in birds29,30. This resulted in one
JC15L)) at a newly derived genomic region between the MZT1 and DACH1 model for each chromosome type, which together were then used to
genes (based on the chicken coordinates), which was reconstructed as evaluate the degree of departure from the neutral rate and to estimate
a duplication in the most-recent common ancestor of Passeriformes the conservation score for each site. With the 363-way data, we found
(Fig. 2c, Extended Data Fig. 7c). The DNAJC15L gene model showed that the neutral rate within sex chromosomes is 16% faster than in mac-
exon fusions compared to its parental gene, which suggests that a ret- rochromosomes, and that the neutral rate within macrochromosomes
rotransposition mechanism was the probable origin of this duplication is 9% faster than in microchromosomes.
(Extended Data Fig. 7d). We compared these results against conservation scores derived
Moreover, we identified lineage-specific losses of genes such as cor- from two smaller alignments: a MULTIZ 77-way alignment including
nulin (CRNN), which encodes a prominent structural protein of the birds and other vertebrate outgroups31, and a 53-way alignment con-
oesophageal and oral epithelium in humans and chicken25. This gene is taining only birds of the 77-way alignment. A previous comparison
disrupted by mutations or is entirely absent in Accipitriformes (eagles of 48 bird genomes found that at least 7.5% of the chicken genome
and related birds of prey), Phalacrocoracidae (cormorants) and Passeri was conserved, with significantly lower substitution rates than the
(songbirds, a group of Passeriformes) (Extended Data Fig. 8a). The background10. This ratio was reached at 10-bp resolution by integrat-
latter use rapid changes in the diameter of the upper oesophagus to ing across multiple adjacent bases, trading off a lower resolution for
tune their vocal tract to the fundamental frequency of their song26. The a necessary increase in statistical power. This is because the statistical
absence of CRNN might correspond to changes in visco-elastic proper- power to detect conserved elements is roughly proportional to the total
ties of the oesophageal epithelium, and the loss of this gene may have branch length between the aligned species32. Our reference-free align-
contributed to the evolution of the diverse pure-tone vocalizations of ment of 363 bird species resulted in a predicted total branch length of
songbirds (Extended Data Fig. 8b). 16.5 expected substitutions per site, compared to 9.9 within the 77-way
We next explored avian conserved sequences, genomic regions that and 4.3 within the 53-way alignments. We transformed the conserva-
evolve at a substantially slower substitution rate than expected under tion scores into calls of significantly conserved single-base-pairs at an
neutral evolution. Conserved sequences are often indicators of puri- expected false-discovery rate33 of 5%. The 363-way alignment provided
fying selection27 and are therefore useful for investigating function ample increases in the number of bases detectable as conserved relative
within the genome28. To identify and measure conserved regions, we to alignments that contain fewer taxa (13.2% of the chicken genome in
created conservation scores for each base pair of the 363-species Cactus the 363-way alignment versus 3.8% in the 77-way and 2.1% in the 53-way)
alignment projected onto the chicken genome. The dense sampling (Fig. 3a, Supplementary Table 15). Such an improvement cannot be
increased our ability to detect purifying selection enormously, and explained by the alignment method, as a Cactus 48-way alignment
allowed us to produce what is—to our knowledge—the first base-by-base of birds showed very similar results to the 53-way MULTIZ alignment
conservation annotation that covers a substantial portion of a bird (Extended Data Fig. 10d). In the Z chromosome (which has a generally
Extended Data Fig. 1 | Sampling and processing of the 363 genomes. are light blue. b, Map63 of geographical origin of the 281 bird samples for
a, Sources of the 363 genomes. Each genome is a square; colour indicates the which geographical coordinates are available. c, Summary of the species
data source. Newly published genomes from the B10K Project phase II are red; confirmation of 236 B10K Project newly sequenced species. The downward
unpublished genomes contributed by external labs are yellow; published arrows are excluded genomes. d, Summary of mitochondrial genome assembly
genomes from phase I are orange; genomes contributed by the community that and annotation for 336 species. The downward arrows are excluded
have since been published are dark blue; and other genomes available on NCBI mitochondrial genomes.
Extended Data Fig. 2 | Distribution of transposable elements. a, Percentage sequenced species. d, S.d. of the per cent LINE content for orders with at least
of the genome that is a transposable element (TE). Box plots are shown for three sequenced species. e, Ancestral state reconstruction of total
groups with at least three sequenced species. b, Per cent base pairs of the transposable elements. The branch colour from blue to red indicates an
genome that are long interspersed nuclear elements (LINEs), grouped by increase in transposable elements. Two orders with noticeable patterns—
orders. Box plots are shown for groups with at least three sequenced species. Piciformes and Bucerotiformes—are labelled on the tree. A zoomable figure
c, S.d. of the transposable element content for orders with at least three with labels for all terminals is available at www.doi.org/10.17632/fnpwzj37gw.
Article
Extended Data Fig. 3 | Patterns of the presence and absence of 5 visual five annotated states of opsin sequences. A zoomable figure with labels for all
opsins in 363 bird species. This figure shows patterns for the visual opsins terminals is available at www.doi.org/10.17632/fnpwzj37gw.
encoded by RH1, RH2, OPN1sw1, OPN1sw2 and OPN1lw. Colours correspond to
Extended Data Fig. 4 | GC content and codon use. a, Principal component over-represented codons (>1.6, orange box plots). e, Pearson correlation
analysis (PCA) of GC content in the coding regions of orthologues with between GC content of the third codon position and the primary axis in
conserved synteny with chicken for 340 bird species, including b, colour-coded to distinguish Passeriformes and non-Passeriformes. The
164 Passeriformes species. b, Correspondence analysis of RSCU for all 363 strong correlation (R 2 = 0.9, P = 4.1 × 10 −184) indicates that the frequencies of
birds. The primary and secondary axes account for 78.18% and 14.82% of the codons ending in G or C is the main driver of the codon bias in Passeriformes.
total variation, respectively. c, The distribution of codons on the same two axes f, Comparison of the mean Nc values between the Passeriformes and other
as shown in b, with each codon coloured according to its ending nucleotide. species for orthologues with conserved synteny with chicken (Supplementary
This showed that the axis-1 score of a species is primarily determined by Table 12). Each dot represents the mean Nc value of an orthologue in the
differences in frequencies of codons ending in G, C, A or T. d, RSCU analysis of Passeriformes and other species, respectively. Orthologues with at least
59 codons across avian genomes (n = 363 biologically independent species for 20 individuals in both the Passeriformes and the non-Passeriformes were
each box plot). The horizontal lines indicate thresholds of under-represented included in this analysis.
codons (<0.6, blue box plots), average representation (1.0, white box plots) and
Article
Extended Data Fig. 5 | Overview of the pipelines for identifying genomic Cactus whole-genome alignments. The credible intron fragments in chicken
regions. a, Assignment of orthologous protein-coding regions. All pairwise were picked out after filtering out regions mapped by RNA sequences, and
relationships between homologous regions obtained from the Cactus chicken-specific or repetitive regions. Orthologous relationships of intron
alignment (4 species shown here in different colours) were used to construct fragments were detected on the basis of the aligned Cactus hits and the
the homologous groups across all 363 birds. Using chicken as the reference, we orthologues with conserved synteny with chicken. The non-intron regions of
further generated a table containing homologues with conserved synteny to each bird in the alignments were masked as gaps.
chicken. b, Annotation of conserved orthologous intron regions on the basis of
Extended Data Fig. 6 | Gene tree for copies of the growth hormone gene GH. Passeriformes, corresponding to the GH_S gene copy (blue) and the GH_L gene
The tree was generated by maximum likelihood phylogenetic analysis64 of copy (orange). Twelve species with only one copy are indicated by green stars. A
avian GH gene copies. Only nodes with >80 bootstrap are annotated as dots; zoomable figure with labels for all terminals and the tree file is available at
the larger the dot, the higher the bootstrap. All Passeriformes sequences are www.doi.org/10.17632/fnpwzj37gw.
clustered in a single clade and there are two sister gene clades within
Article
Extended Data Fig. 8 | The evolution of songbirds was associated with the gene not found (blank). Genes were identified by Exonerate65 using
loss of the cornulin gene. a, Presence and absence of the cornulin gene phylogenetically diverse EDDM, CRNN and S100A11 sequences as queries. A
(CRNN) and its surrounding genes (EDDM and S100A11) in all 363 birds. zoomable figure with labels for all terminals is available at www.doi.
Branches are coloured as oscine Passeriformes (blue), non-oscine org/10.17632/fnpwzj37gw. b, Hypothesis on the evolutionary loss of cornulin
Passeriformes (green) and non-Passeriformes (black). The states of genes are and the appearance of a fine-tuned extensibility of the oesophagus as a vocal
shown in three ways: functional gene (filled box), pseudogene (empty box) and tract filter in songbirds.
Extended Data Fig. 9 | Acceleration and conservation scores. Results are Proportion of chicken functional regions covered by significantly accelerated
shown from 3 alignments for 53 birds, 77 vertebrates, and 363 birds. a, or conserved sites. This panel is similar to Fig. 3c, but includes accelerated
Acceleration (left) and conservation (right) within alignment columns on columns.
chicken. This panel is similar to Fig. 3a, but includes accelerated columns. b,
Article
Extended Data Fig. 10 | Distribution of acceleration and conservation realignment with MAFFT for a random sample of significantly conserved sites.
scores. a, Distribution of conservation and acceleration scores within d, Comparison of the distribution of PhyloP scores across alignments. Scores
different functional region types across alignments. Lines mark quartiles of indicate log-scaled probabilities of conservation (positive values) or
the density estimates. b, Larger histogram of chicken column rates. This panel acceleration (negative values) for each base in the genome. a and d show results
is similar to Fig. 3b, but includes accelerated columns ending at a rate of 10× the from three alignments for 53 birds, 77 vertebrates and 363 birds.
neutral rate. c, Difference in PhyloP scores (compared to original scores) after
12345656762589
5653
17425
!"#$"%&'()*+, f"#$4(A&("=k'$̀m)&"%
-&('!#&(#./&'()*+,n'7op=qpqp
03&('0!&(4)5$$")%1 ' 22& /
($2!6()!#'4$.$7$(/8()59()&(5!'.7$):;)$82!6$# ('4('84"$("4/&"#(&"!&"4/
$"!($"%:<8'()$"82&($""3&('0&4)!7$4$=>'()?08 &"#()@#$($&7A7$4/)497$(:
1(&($($4
<&77(&($($4&7&"&7/=4"8$2()&(()8775$"%$(2 &!"($"()8$%'7%"#=(&.77%"#=2&$"(B(=C()# 4($":
"D& "8$2#
r ;)B&4(&2!7 $E*F+8&4)B!$2"(&7%'!D4"#$($"=%$6"& &#$4("'2.&"#'"$(82&'2"(
r > (&(2"("5)()2&'2"(5(&9"82#$($ "4(&2!7 5)()() &2 &2!75& 2&'#!&(#7/
r; ) (&($($4&7((*+'#>3G5)()()/&"H(5H$##
IFJKLMNOONFLPQRPRLRSNTJULVQLUQRMWXVQULRNJQJKLVKLFYOQZLUQRMWXVQLONWQLMNO[JQ\LPQMSFX]TQRLXFLPSQL^QPSNURLRQMPXNF_
r >#4$!($"8&7746&$&( ((#
r >#4$ !($"8&"/& '2!($" 44($"='4)& ((8"2&7$(/&"#&#`'(2"(82'7($!742!&$"
r >8 '77#4$!($"8() (&($($4&7!&&2($"47'#$"%4"(&7("#"4/*:%:2&"+().&$4($2&( *:%:% $"488$4$"(+
>3G6&$&($"*:%:(&"#&##6$&($"+& 4$&(#($2&( 8'"4(&$"(/*:%:4"8$#"4$"(6&7+
r <"'7 7)/!()$(($"%=()(((&($($4*:%:a=P=W+5$()4"8$#"4$"(6&7=884($E=#% 88#2&"#b6&7'"(#
cXdQLbLdYJTQRLYRLQ\YMPLdYJTQRLeSQFQdQWLRTXPYVJQ_
r <f&/$&"&"&7/$=$"82&($""()4)$48!$&"#C&964)&$"C"(&7 (($"%
r <)$&4)$4&7&"#42!7B#$%"=$#"($8$4&($"8()&!!!$&(7678((&"#8'77!($"%8'(42
r @($2&( 8884($E *:%:)"gU=A&"gW+=$"#$4&($"%)5()/54&74'7&(#
ITWLeQVLMNJJQMPXNFLNFLRPYPXRPXMRLhNWLVXNJNiXRPRLMNFPYXFRLYWPXMJQRLNFLOYFKLNhLPSQL[NXFPRLYVNdQ_
18(5&&"#4#
A7$4/$"82&($"&.'(&6&$7&.$7$(/842!'(4#
G&(&4774($" 3 8(5&5& '#:
G&(&&"&7/$ 22".$$"82&($4&"# (&($($4&7&"&7/$ 8(5&!&49&% 5'#=$"47'#$"%,%%(*6q:q:o+=1s>A#"6*6q:pt=6$"
n'"Hqpou=6$"n'7Hqpoq+=k&!47*6o:oq+=>77!&()H-k*6tvowt=6uppwo=6upovo=6uqoox=6uqtwu=6uqtww=6$"n'7Hqpoy=6$"
G4Hqpoz+=1'!36&*6o:p=6o:o=6o:q+=C&1'0>*6y:o:o=6y:q:o+=A7&(&"' *6:o:q:o=6o:q:t+=C&4'7' *6q:p:t+=1!&# *6y:u:p+=>./
*6:o:v:p+=Afn77/*6ou:w:qt+=l$H0$*6$"n'7/qpou+=f{1s*6y+=f->1;*6q:q:qz+=(.7&("*6q:q:q+=%"f7&(>*6o:p:t+=k"j$
*5$q:t:o+=C{1-@*6y:w:yo+=;!l&(*6q:o:o+='887$"9*6q:q:o+=35.7*6q:v+=;$"$(/*($"$(/"&|}qpotpxox+=4#H)$(*6t:z:z+=
~"(A*6u:qtHzy:p+=15$A(*7&Hqpow}px+=@kk*6wo+=;&"#20!&(<$"#*6t:px.+=0!&(C&9*6t:p:x+=0!f&
*6qpoxpoqx+=0!&(C#7*6oHpHw+=!)/(7*6p:xHqp+=Al-{@*422$(zvxwtv+=2&88(*6x:t=6x:yoy+=;$2>7*6o:t:6ou+=A>{A
*6t&ozt+=@B&C-*6y:p:v+=&4(' *422$(8ww8qy#+=f7&(!*6q:q:qz+=#"j*6o:t:q+=3ssA7&(/*6q:x:q+=C$(m*6q:y+=~H;0@@*6o:z+=
!)/7<$(*6o:u+=!)/7A*6o:u+=Al>1;*6o:u+=l>-(79$(*6q:o+=j$%%7(7*6o:q:y+&"#3s3sG@*6u+:1!4$8$4!&&2('##'$"%
'"H($2&!6$##$"1'!!72"(&/<$7o5)"&!!!$&(:'(2 4$!(&!" '4&"#&6&$7&.7"k$(l'.!&%)((!,DD
%$()'.:42Dfopk"2D&""(&($":
<2&"'4$!('($7$E$"%4'(2&7%$()2 8(5&()&(&4"(&7(()&4).'("(/(#4$.#$"!'.7$)#7$(&('=8(5&2'(.2&#&6&$7&.7(#$(D6$5:
j ("%7/"4'&%4##!$($"$"&422'"$(/!$(/*:%:k$(l'.+:1()3&('0&4)%'$#7$" 8'.2$(($"%4#? 8(5&88'()$"82&($":
0
G&(&
12345656762589
5653
17425
A7$4/$"82&($"&.'(&6&$7&.$7$(/8#&(&
>772&"'4$!(2'($"47'#&#&(&&6&$7&.$7$(/(&(2"(:;)$ (&(2"()'7#!6$#()8775$"%$"82&($"=5)&!!7$4&.7,
H>44 $"4#='"$|'$#"($8$=5.7$"98!'.7$47/&6&$7&.7#&(&(
H>7$(88$%' ()&()&6& 4$&(#&5#&(&
H>#4$!($"8&"/($4($" "#&(&&6&$7&.$7$(/
k"2 |'"4$"%#&(&=()%"2& 2.7$ &"#&""(&($" 8qzx !4$ %"&(#$"()$ ('#/)&6."#!$(#$"()3f~10>&"#k"f&"9'"#
A0n3>utuwzw*)((!,DD555:"4.$:"72:"$):%6D.$!`4(DA0n3>utuwzw+:;)&.6#&(&)&6&7."#!$(#$"()31>83kf#.5$()&44 $""'2.
3Appppupu*)((!,DD#.:4"%.:%D&4)D!`4(D3AppppupuD+:;)2$(4)"#$&7%"2 &"#&""(&($" 8yyz !4$ )&6."#!$(#$"()3f~
k"f&"9'"#A0n3>utuwzw:G&(&%"&(#&"#&"&7/##'$"%()$ ('#/&$"47'##$"() '!!72"(&/$"82&($"8$7 C"#7/G&(&'"#6#
Gs~#$,op:oxzyqD8"!5Eỳx%5:;)5)7%"2&7$%"2"(8()yzy.$# $&6&$7&.7&()((!,DD&7$%"2"(H'(!'(:y:&2&E"&5 :42D.$#H8$"&7:)&7:j)&6
4&(#&{1.5 )'.8&77yzy !4$=&6&$7&.7./!7&4$"%())'.{0-)((!,DD42!&&($6H%"2$4H)'.:yH'H5(Hq:&2&E"&5 :42D.op9})'.:(B(
$"(()C/l'.
(&.8();&49l'.
4($":
->7$78('#$2'4$(#"$447"()(!'#$"/# $% "
(6"5)"()#$47'$"%&($6:
1&2!7 $E >#(2$"&($"8&2!7 $E& !82#8B!$2"(&7('#$ 5& "(&!!7$4&.7$"()$ ('#/:A6#($' &2!7 8%"2
|'"4$"%5 74(#(!"(&77.$#8&2$7$ .&#"()(&B"2$4/(2:s"(&%(.$# !4$ )& "!6#($'
&2!7'#8()G3>B(&4($":<0$&($#&4(/7&*s{;Hppqo+=(5$"#$6$#'&7&2!7 &'#=.4&'& $"%7!6#($'
&2!74&""(2(()&2'"(8G3>|'$#8|'"4$"%:
G&(&B47'$" j B47'##%"2& 2.7$ ()&()K& 2.7/|'&7$(/5!("($&77/4"(&2$"&(#:qyz%"2 2&$"#82&((&7qxq
|'"4# !4$ &8(B47'#$"%()8775$"% &2!7,
Hoy%"2 526##'(!%"2& 2.7/|'&7$(/*4&887#3upop9.&"#D((&7& 2.7/7"%()p:vk.+
Hoy%"2 526#.4&'8!("($&74"(&2$"&($"8() &2!7
Hop%"2 5#'"#&"(5$()&%"28.((|'&7$(/&6&$7&.7"3f~82B("&77&.:
0!7$4&($" 0!7$4&($"5& "(&!!7$4&.7.4&'"B!$2"(&($"5& !82#$"()$ ('#/:
0&"#2$E&($" 0&"#2$E&($"5& "(&!!7$4&.7.4&'"B!$2"(&($"5& !82#$"()$ ('#/:
f7$"#$"% f7$"#$"%5& "(&!!7$4&.7.4&'"B!$2"(&($"5& !82#$"()$ ('#/:
r l'2&"&4)!&($4$!&"(
r 7$"$4&7#&(&
>"$2&7&"#()%&"$2
12345656762589
55653
17
A7$4/$"82&($"&.'(('#$ $"676$"%&"$2&7>00~@%'$#7$" 422"##8!($"%&"$2&7&4)
-&.&(/&"$2&7 1('#/#$#"($"6767&.&(/&"$2&7:
j$7#&"$2&7 1('#/#$#"($"6765$7#&"$2&7:
<$7#H4774(# &2!7 1;()'#/# $#"(4774("5 &2!7 $"(
" )8$7#&"##$#"(!82$
$"6
" $6&"$2&7&4)&
& #8$"#$
$"(
" )>00~@%'$#7$":
('#/'#!6#($' &2!7 '4#822''2 &"#()"&('&7)$(/4774($":;)$"($('($"=()8$7#
4774(=4'&(&"# (&88&(()76&"(2''2 &"#"&('&7)$(/4774($" &7$(#$
$"1
" '!!72"(&/;&.71o:
@()$46$%)( 3()$4&7&!!6&75& |'$#8() ('#/.4&'"
3 "7
$6 &2!7 54774(#&"#""7
$6B!$2"(&($"5&
7425
!82#:C''2 7$(#$ $"1
" '!!72"(&/;&.7o$'#5$(("!2$$"( ( |'"4=&"&7/E=&"#!'.7$)()%"($4
2&($&7!6$##.
./(
/)2( ((
)fop4"($'2:
3(()&(8'77$"82&($"
"(
" )&!!6&78() ('#/!(472'(&7..! 6$##$
$"(
" )2&"'4$!(: