Professional Documents
Culture Documents
List of Papers
This thesis is based on the following papers, which are referred to in the text
by their Roman numerals.
Eriksson J., Larson G., Gunnarsson U., Bed'hom B., TixierBoichard M., Strmstedt L., Wright D., Jungerius A., Vereijken A.,
Randi E., Jensen P., Andersson L. (2008) Identification of the yellow
skin gene reveals a hybrid origin of the domestic chicken. PLoS
genetics 4(2): e1000010.
II
Rubin C.J., Zody M.C., Eriksson J., Meadows J.R., Sherwood E.,
Webster M.T., Jiang L., Ingman M., Sharpe T., Ka S., Hallbk F.,
Besnier F., Carlborg O., Bed'hom B., Tixier-Boichard M., Jensen P.,
Siegel P., Lindblad-Toh K., Andersson L. (2010) Whole-genome
resequencing reveals loci under selection during chicken
domestication. Nature 464: 587-591.
III Eriksson J., Hellstrm A.R., Rubin C.J., Wang C., Shumaila S.,
Kerje S., Gourichon D., Bedhom B., Tixier-Boichard M., Leif Andersson. (2010) A frameshift mutation in COMTD1 specifically
dilutes pheomelanin pigmentation in chicken. Manuscript.
Reprints were made with permission from the respective publishers.
Contents
Introduction ..................................................................................................... 9
Mutations the Source of Genetic Variation ........................................... 10
Functional Categorization of Mutations .............................................. 11
Identification of Sequence Elements Regulating Gene Expression ..... 13
Domestication ........................................................................................... 14
Domestication of the Chicken .............................................................. 16
Pigmentation ............................................................................................. 18
Carotenoid-Based Pigmentation .......................................................... 19
Melanin-Based Pigmentation ............................................................... 20
The Chicken Genome ............................................................................... 21
Methods for Genetic Mapping .................................................................. 22
Introduction .......................................................................................... 22
Linkage Analysis ................................................................................. 23
Identical-By-Descent Mapping ............................................................ 23
Mutation Detection .............................................................................. 25
Aims of this thesis ......................................................................................... 27
Present Investigations .................................................................................... 29
Mapping Monogenic Traits in Chicken .................................................... 29
Background .......................................................................................... 29
Results and Discussion ........................................................................ 31
Signatures of Selection in the Chicken Genome ...................................... 37
Background .......................................................................................... 37
Results and Discussion ........................................................................ 38
General Discussion and Future Perspectives ................................................ 43
Acknowledgements ....................................................................................... 47
References ..................................................................................................... 49
Abbreviations
ASIP
BCDO2
BCO2
bp
cAMP
cM
COMT
COMTD1
DNA
ENCODE
Gb
HPLC
IBD
indel
kg
kb
LD
LOD
Mb
mtDNA
RNA
SNP
TSHR
TYR
TYRP1
TYRP2
QTL
Introduction
Charles R. Darwin combined geological, geographical and biological observations together as support for his theory that all organisms on planet Earth
share a common ancestry and that organisms are constantly evolving through
natural selection (1). Throughout his published work, Darwin used the process of domestication of animals and plants as a proof-of-principle that natural selection exists and acts (1, 2). He argued that if man through selective
breeding (artificial selection) has altered the morphology, physiology and
behavior (phenotypes) of the domesticates compared to their wild ancestors,
then the selective forces of the environment must achieve similar changes in
wild organisms (natural selection), however at a much slower rate. Darwin
was so amazed by the phenotypic variation observed in domestic animals
that he even devoted some of his precious time writing a book about the
subject: The variation of plants and animals under domestication (1868).
Darwin could never explain how the variation in traits arose or how the
variation was inherited from parents to their offspring. The publication Versuche ber Pflanzenhybriden in 1866 by Gregor Mendel gave some answers to the latter question. However, it did take almost more than half a
century until Mendels laws of inheritance were combined with Darwins
theories of natural selection to form the modern theory of evolution, known
as Neo-Darwinism. A number of landmark discoveries, such as the structure
of DNA (3) and the unearthing and deciphering of the genetic code, were
required until we had a clear picture that the variation in phenotypic traits is
caused by mutations in the DNA sequence that makes up our heritable genetic material, the genome.
In the year 2000, the sequence of the human genome was presented by
two separate efforts; private and public consortiums directed by Craig Venter and Francis Collins respectively. The analyses of the genome sequences
were published the year after in the scientific journals Nature and Science (4,
5). This has been followed by a number of generated genome sequences for
different animal and plant species in the last ten years, with an enormous
increase in the recent past as a consequence of the launch of new DNA sequencing technologies. The current sequencing technologiesmassively
parallel sequencing or next-generation sequencing technologiescan read
~250 billion bases in a week, compared to 5 million bases in the same time
period in the year of 2000 (6). The big challenge for contemporary molecular
geneticists is thus not to generate DNA sequences, but to decipher the ge9
10
11
miRNA
Trans-factor
Target site
Cis-regulatory
elements
Promotor
Intron
Coding exon
Coding mutations
Cis-regulatory mutations
*
Deletion
Cis-regulatory
element
*
*
*
Mutations in cis-regulatory elements have been suggested to play an important role in the evolution of phenotypic variation (13). King and Wilson
(1975) observed that human and chimpanzee showed surprisingly few protein sequence differences, and proposed that those alone could not account
for the phenotypic differences evident between the species. With the notion
of transcriptional regulation in minddiscovered by the bacterial geneticists
Jacob and Monod (1961) (14)they suggested that mutations that affect the
regulation of gene expression could account for the majority of the phenotypic differences observed between human and chimpanzee (15). A more
recent argument is that a cis-regulatory mutation is expected to be less pleiotropicaffect fewer phenotypescompared to a coding mutation, due to
the modular organization of the regulatory regions (13). A mutation in an
enhancer can, for instance, alter the expression of a gene in one tissue but
leave expression unaltered in other tissues. In contrast, a functional coding
mutation affects the function of the protein in all tissues in which it is expressed. This could lead to negative pleiotropic effects on many traits and
such mutations would thus be the object of purifying selection and perhaps
eliminated from the gene pool.
12
13
purifying selection (20). Some of these constrained sequences may be important in maintaining a number of traits that are specific for birds.
ENCODE, the Encyclopedia of DNA Elements, is a major research effort
with the aim to identify all functional elements in the human genome
through a combination of comparative genomic and experimental studies
(21). In 2007, only 1% of the human genome had been investigated, but the
effort continues and new data are released continuously (21). The aboveexemplified efforts in annotating genomes for functional elements will greatly simplify the detection of mutations that affect gene expression.
Domestication
Domestication refers to the human-directed process of making wild plants
and animals adapted to new environments and human needs. This adaption
has been achieved through selective breeding, or artificial selection, and has
resulted in an enrichment of mutations that control the phenotypes desired by
man (22).
The early domestication of animals was most likely unintentional, where
individuals carrying genetic variants predisposing them to be less afraid of
human approach were likely rewarded with food and protection (23). This
was later followed by intentional human selection for tame animals and other
desirable phenotypes, such as coat color variants (24, 25). Other typical phenotypic changes associated with animal domestication are as listed in the
review article by Jensen and Andersson (2005):
I
II
III
IV
V
External morphological changes (e.g. altered body size, fur and feather
color and growth pattern)
Internal morphological changes (e.g. reduced brain size)
Physiological changes (e.g. altered reproductive cycle and endocrine
response)
Developmental changes (e.g. earlier sexual maturity)
Behavioral changes (e.g. reduced fear, increased sociability and reduced
antipredator responses)
tions, dropping ears, shorter legs and tail, earlier sexual maturity, longer
mating seasons and alterations in skull morphology (27).
The rapid change in traits observed by Belyaev and colleagues, together
with the catalogue of shared domestication phenotypes among domestic
animals, have led some scientists to speculate that there are so-called domestication genes with large pleiotropic effects and that these created the
domestication phenotypes (28).
It has been proposed that thyroid hormones play an important role in animal domestication (29). Susan J. Crockfordthe main advocate of this theorycame to this conclusion after observing the similarities between domestication phenotypes and those associated with hypothyroidism (29). Crockford suggested that only a few changes of the regulation of thyroid hormones
during development, would be required to achieve many of the phenotypic
changes associated with the process of domestication. In paper II of this
thesis, we present data suggesting that the thyroid stimulating hormone receptor (TSHR)a key player in the regulation of thyroid hormone expressionhas played an important role in shaping the domestic chicken.
As described above, the domestication of animals has led to dramatic
changes in the phenotypic appearance of these individuals compared to their
wild ancestors (30). This long (~10 000 years) and intense selective breeding
has also created an enormous range of phenotypic variation within the domesticated species. The majority of these selected mutations have a favorable phenotypic effect on the selected trait (e.g. egg production or coat color)
but rarely a deleterious effect on other traits (22). Mutations with deleterious
effects are most often eliminated from the domestic animal breeding pool,
and so very few remain in the populations at large. The dog may be considered as an exception to that general rule, since selective breeding for specific
characteristics has resulted in pathological outcomeseither due to pleiotropism or linked deleterious mutationsthat have been accepted by the
breeders (31, 32).
The domestication of plants and animals is considered to be the most important development during the last 10 000 years of human history (33) and
was a prerequisite for the rise of civilization. Maintaining animals and plants
under our provision at the farm instead of hunting and gathering our food,
led to a population explosion due to the increased accessibility to food (33).
Permanent settlements could be established due to the decreased need for
migration to follow the seasonal shift in wild food supplies. It is no coincidence that inventions such as the wheel, metal tools, writing and advanced
social systems arose in the same areas as early domestication (33).
15
Figure 2. Panel A depicts a map of South Asia onto which the ranges of the four
species of jungle fowl are drawn. Panel B depicts a European domestic chicken.
Red, grey, blue and green regions represent the respective ranges of red, grey, Ceylon and green jungle fowls. Images of these birds are presented in panels C through
F respectively, within colored borders that correspond to the colors on the map. The
figure was adapted from Eriksson et al (2008).
16
Early molecular genetic studies of mitochondrial DNA (mtDNA) supported the initial view that the red jungle fowl was the sole ancestor of the
chicken (36, 37). However, the study by Nishibori et al. (2005)which
compared sequences of mtDNA and five autosomal loci from domestic
chickens and all four jungle fowl speciessuggested that two additional
species of jungle fowl (grey and Ceylon jungle fowl) might hybridize with
the domestic chicken (38). Nevertheless, this study did not provide an answer to the question of whether they had contributed to the domestication of
chicken. In paper I of this thesis, we present the first conclusive genetic evidence supporting the theory of a hybrid or polyphyletic origin of the domestic chicken.
The earliest archeological findings of chicken remainsbones larger than
the red jungle fowlwere found in Southeast Asia and were estimated to be
almost 8000 years old (39). Additional archeological findingsbones and
art objects depicting chickens in the Indus Valley dated to about 2500
B.C.suggested that there have been multiple domestication events of the
chicken (34). The multiple geographical origin of the chicken is also supported by molecular genetic data that further implies that the majority of
European and Middle Eastern domestic chickens originate from India (40,
41). Further molecular studies present evidence that American, Oceanic and
African chickens also originated from India, suggesting that India was the
original platform for the worldwide dispersal of chicken (42-45).
Chicken was most likely initially domesticated for mainly cultural reasons
such as religion, art and entertainment and much later as a source of food
(34). This would consequently imply an early selection for increased body
size, an important feature in cockfighting, and for plumage variants, used as
decorative art. Most of the hundreds of chicken breeds currently in existence
were formed during the 1800s, when selection for fancy characterssuch as
plumage and skin color variation, amount of feathering, size of various external organsbecame high fashion among the royalties and the upper classes in Europe and America (34). Many Asian breeds were at that time imported to Europe, and very often hybridized with European breeds to create
new stocks with fascinating characters (34).
It was first during the 20th century when more systematic breeding
schemes focused on production traits were initiated (34). Chicken lines were
created with the sole task of producing either eggs or meatto overcome the
negative genetic and phenotypic relationship between reproduction and
growth (46). This has resulted in extreme differences in reproduction between the modern egg-layer chicken and its main wild ancestor, the red jungle fowl. The modern layer used in commercial egg production produces
more than 300 eggs per year, while a pure red jungle fowl lays around 5 eggs
once per year (47). The same extreme difference is observed in body weight
between the modern commercial broiler and the red jungle fowl, 4-5 kilogram (kg) versus 1 kg respectively (34). This great response to selective
17
breeding for egg and meat production has thus been achieved by sophisticated breeding programs, and is made possible due to the high genetic variability and large effective population sizes within the domestic chicken populations (46).
In paper II we, utilize massively parallel sequencing technology to search
for haplotypes in the chicken gene pool, which are thought to be responsible
for some of the phenotypic changes that have taken place during the domestication process, and the subsequent selection for specialized breeds used for
food production.
Chicken as Model Organism
The chicken has a long and proud history as a model organism. It was the
first animal in which the Mendelian inheritance of traits was demonstrated
(48), and scientific discoveries made with the chicken as an experimental
system in developmental biology (49), virology (50) and immunology (51)
have been awarded with Nobel Prizes.
The chicken possesses a number of features which make it particularly
suited to the study of genotype-phenotype relationships. Firstly, there is a
great diversity in phenotypessuch as plumage and skin color, body composition, production traitsproviding material for dissecting the genetic
mechanisms underlying phenotypic variation. Secondly, the chicken is easy
and cheap to breed, permitting the construction of large pedigree material
which segregates for the trait of interest (46). Third, the chicken genome has
a high recombination rate compared to many mammalian species (46, 52).
Fourth, many mutations underlying traits are shared between chicken breeds
due to gene flow and the recent history of breed creation (22). Fifth, there is
high genetic diversity within the chicken gene pool (53, 54). The last four
points provide excellent conditions for powerful genetic analyses to determine genotype-phenotype relationships.
Identifying the genes underlying some of the enormous variation in
chicken phenotypes is of great importance since it can be used to improve
the future breeding of domestic animals and also as it can provide information about the function of orthologous genes in vertebrates.
Pigmentation
One distinctive feature of all domestic animals is the huge variation in coloration among and within these individuals compared to their wild ancestors
(55). Selective breeding, since the early days of domestication (24, 25), has
achieved this phenomenon. The reason for that phenotypic selection might
have been to reduce camouflage in order to facilitate animal husbandry,
and/or to make the domesticated forms distinct from their wild ancestors, or
just as a human preference for novelty (24). The mutations controlling pig18
mentation variation were either present in the wild species prior to domestication or arose during domestication (55).
Pigmentation serves many different functions in animals, such as camouflage, mimicry, intraspecific communication, and protection against ultraviolet radiation (UVR) (56, 57). The mouse is the prime organism for the identification of genes controlling pigmentation due to the extensive mutagenesis
projects that have generated a tremendous number of pigment mutants. To
date (November, 2011), 171 genes have been identified which influence
pigmentation in the mouse in various ways, and more than 200 loci remain
to be mapped (58). Many of these genes have also been reported to affect
pigmentation in other vertebrate species, suggesting a high degree of conservation for the genes and molecular pathways controlling pigmentation in
vertebrates (55).
The chicken has also proven to be an excellent model organism for understanding the genetics of pigmentationillustrated by the number of pigmentation genes mapped in chicken (59-68). The identification of genes underlying plumage and skin color variants in the domestic chicken will provide
new insight about basic pigmentation biology.
Carotenoid-Based Pigmentation
Carotenoids are pigmentsyellow, orange or red in colorthat are synthesized by plants and microorganisms where they, for instance, facilitate photosynthesis and in plants protect chlorophyll from photo-damage (69). Animals lack the ability to synthesize carotenoids and must therefore acquire
them from their feed (69). Carotenoids serve several important functions in
vertebrate physiology. They act as precursors to vitamin A and retinoic acid
(70); anti-oxidative molecules (71); and photoprotectors of the human retina
(72).
Carotenoids are also important as pigments and add coloration to the tissue/organ where they are deposited, as observed in insects, fish, reptiles,
birds and mammals (69, 73). For instance, carotenoids are responsible for
the color of the egg yolk, lobster shell, salmon flesh and the flamingo plumage (69). In birds carotenoid-based pigmentation is known to act as an honest signal of the condition of a male carrier and functions as a sexually selected trait (74, 75). There is a trade-off between the allocation of carotenoids for colorful ornamentation versus the maintenance of carotenoids for
important physiological processese.g. as antioxidants in the immune system (74, 75)and only males in a good physical condition can afford to
allot carotenoids to ornamentation.
The genetic mechanisms involved in the metabolism and deposition of carotenoid pigments in animals were elusive until recently. In paper I, we present the first genetic mechanism underlying variation in carotenoid-based
pigmentation in animals. We conclude that a decreased expression of the
19
enzyme BCO2 (beta-carotene oxygenase 2) in the skinallowing the depositions of colorful carotenoidscauses the yellow skin and beak phenotype
in the domestic chicken.
Melanin-Based Pigmentation
The melanocyte is the cell type responsible for the synthesis of melanin
pigment in birds and mammals. Two types of melaninthe brown/black
eumelanin and the red/yellow pheomelaninare synthesized in the melanocyte-specific organelle, the melanosome (57). The melanin packed melanosome is later transferred to surrounding keratinocytes in the skin, eye, hair or
feather, where it provides color and photoprotection (57). It is the number,
size, cellular distribution and type of melanosomepheomelanosome or
eumelanonsomethat determines the color of the pigmented organ (76).
Research in the field of melanin-based pigmentation biology is extensive,
and so I will focus my review on the pathway that affects my work, namely
the synthesis of melaninalso know as melanogenesisor more specifically the production of pheomelanin pigment.
Melanogenesis
The production of melanin pigment is primarily initiated by the interaction
of external signaling proteinse.g. -melanocyte stimulating hormone (MSH) or agouti signaling protein (ASIP)to the melanocortin-1 receptor
(MC1R) that is located in the membrane of the melanocyte (77). The interaction between -MSH and MC1R leads to elevated levels of intracellular cyclic AMP (cAMP) that increases the transcription of the genes encoding
tyrosinase (Tyr) and the tyrosinase related protein 1 (Tyrp1) and 2
(Tyrp2/Dct) (78). Tyrosinase (Tyr) catalyzes the conversion of L-tyrosine or
L-dopa to dopaquinone, which is the precursor molecule to both eumelanin
and pheomelanin. The Tyrp1 and Tyrp2 enzymes catalyze reactions in the
biosynthesis of eumelanin (eumelanogenesis) with dopaquinone as the substrateleading to the production of the brown/black eumelanin pigment.
The interaction between ASIP and MC1R leads to lowered intracellular
levels of cAMP resulting in pheomelanin synthesis, which is also known as
pheomelanogenesis. The biosynthesis of pheomelanin requires the amino
acid cysteine in addition to dopaquinone (79). But unlike eumelanogenesis,
no additional enzymes seem to be required for pheomelanogenesis to proceed: pheomelanin is produced in the presence of dopaquinone and cysteine
(80). The potential lack of enzymes in pheomelanogenesis could explain
why only a scarce number of mice mutants (dwarf gray, lethal grey, grizzled,
subtle grey) (58) displaying a specific dilution or removal of pheomelanin
pigmentation have been identified. In contrast, numerous mice with mutations affecting only eumelanin pigmentation have been identified, and of
these 56 are caused by a mutated Tyrp1 gene (81).
20
Dopaquinonethe melanin precursor moleculeand intermediate molecules produced in melanogenesis are known to be toxic to the melanocyte
(57). To cope with those cytotoxic intermediates, melanogenesis takes place
in the closed environment of the melanosome (57). Occasionally some leakage to cytosol from the melanosome occurs, and protective mechanisms have
evolved to incapacitate the cytotoxic intermediates (57). One example of
such a mechanism is the O-methylation of the melanogenic intermediates
mediated by catechol-O-methyltransferase (COMT), leading to a reduced
cytotoxicity of the intermediates (57).
In paper III of this thesis, we propose that catechol-O-methyltransferase
domain containing 1 protein (COMTD1) is affecting pheomelanin pigmentation either; by the detoxification of cytotoxic pheomelanin intermediates
leaked from the melanosome; or by altering the cellular levels of cysteine; or
by acting as a regulator of pheomelanogenesisthis is because a frameshift
mutation in COMTD1 is underlying the Inhibitor of Gold phenotype in
chicken.
positive selection during the domestication process and the subsequent specialization of meat or egg producing chicken lines.
Linkage Analysis
Linkage analysis is a method used to identify a chromosomal region harboring a gene that underlies a phenotypic trait of interest. In order to perform a
linkage analysis three prerequisites need to be fulfilled: (i) a pedigree material segregating for the trait, (ii) genotype data from polymorphic genetic
markers and (iii) phenotype data of the individuals in the pedigree material.
Linkage analysis utilizes the phenomena of genetic linkage to identify
genetic markers that co-segregate with the phenotype of interest in a pedigree material. Genetic linkage refers to the observation that the closer two
genetic loci physically are to each other, the greater chance that they will be
inherited togetheri.e. they will not be separated by recombination during
meiosis (85). The genetic distance between marker pairs is measured in centi-Morgans and can be directly inferred from the recombination fraction: 1%
recombination between markers equals 1 centi-Morgan. The recombination
fraction () between markers is calculated by dividing the number of recombinant gametes with the total number of informative meiosis. Markers
are considered linked if the recombination fraction between them is significantly less than 0.5.
When performing linkage analysis, the phenotype is treated as a marker
with a specific mode of inheritance, and the aim is to detect genetic markers,
of known genomic positions, that are inherited together with the phenotypic
trait. To each marker-phenotype combination, a logarithm of odds (LOD)
score is assigned to statistically test the chance of obtaining a similar outcome as if the markers were unlinked (=0.5) (85). A LOD score of 3 is
generally considered to be significant evidence of linkage and means that
there is a 1 in 1000 chance that the observed linkage would have occurred by
chance (85).
Identical-By-Descent Mapping
Identical-by-descent (IBD) mapping, or linkage-disequilibrium analysis, is a
natural follow-up to linkage mapping and is used to refine the position of a
locus controlling a phenotypic trait. IBD mapping has proven to be especially powerful for detecting genes underlying phenotypic traits in domestic
animals (22).
IBD mapping utilizes the fact that many mutations underlying selected
phenotypes within domestic animals have been inherited in a manner termed
identical-by-descent (IBD) from a common ancestor and are therefore shared
between populations of a domesticated species (30). The intense selection
for such mutations in domestic animals has created so-called selective
23
24
g=0
Ig
ig
g=2
ig
g=4
ig
g=8
ig
g=n
Ig
ig
Ig
Mutation Detection
Mutation detection used to be a daunting task due to the time-consuming
processes of primer-design, PCR amplification and nucleotide sequence
determination of the trait-associated region. Today, using the latest sequencing technologies provided by companies such Illumina and Life Technolo25
26
The overall aim of this thesis was to utilize genetic and genomic methods to
develop our understanding of the genetic mechanisms underlying phenotypic
variation and consequently add function to vertebrate genes.
The specific aims were to:
Identify the casual genes for two pigmentation phenotypes, yellow skin
(W) and Inhibitor of Gold (IG), in the domestic chicken (Paper I and
III).
27
Present Investigations
29
31
Sequence comparisons between the yellow skin and wild type alleles revealed the surprisingly high sequence difference of 0.81%, which was much
higher than the previously reported genome average of 0.5% (53). This observation, combined with the notion that both Ceylon and grey jungle fowl
have yellow/red legs, led us to sequence the 23.8 kb IBD region in the additional three jungle fowl species. The subsequent phylogenetic analysis revealed that the sequences clustered into two distinct clades (Figure 4). The
first comprised of BCO2 sequences from red jungle fowls and white skinned
(W/W) chickens, while the second clade comprised of sequences from yellow
skinned chickens and Ceylon and grey jungle fowls. We concluded that the
most plausible explanation to the observation was that the yellow skin allele
originates from the grey jungle fowlpossibly Ceylon jungle fowland
was introgressed into the chicken genome sometime during early domestication. Resequencing of the yellow skin interval in Chinese Shek-ki chickens
known to express the yellow skin phenotyperevealed that they carried
alternative yellow skin haplotype that was slightly different to yellow skin
haplotype found in Western breeds. The Chinese yellow skin haplotype did
however cluster with the Western yellow skin haplotype and BCO2 sequences from grey and Ceylon jungle fowls. The existence of two different yellow
skin haplotypes suggested that there have been multiple introgression events
of the yellow skin gene into the chicken gene pool.
Phylogenetic analyses of sequences of mtDNA and random nuclear loci
(38) derived from the four species of jungle fowl and yellow and white
skinned chickens, revealed a tight cluster of chicken and red jungle fowls
sequences separated from the other jungle fowl specieswhich suggested a
limited contribution of genetic material from the grey jungle fowl to the genetic makeup of domestic chicken.
32
Domes!c, Shek-ki
Grey jungle fowl
98
Yellow Skin
100
63
54
Domes!c, Friesian
Red jungle fowl
The Inhibitor of Gold locus was mappedby linkage analysis in a pedigree segregating for IGto a 3.58 megabase (Mb) region located between
the nucleotide positions 12,388,39915,970,174 on chicken chromosome 6.
No recombination was observed between genetic markers within the 3.58
Mb region and IG, which was a much larger non-recombinant region than
expected considering the size of the pedigree material (166 informative meiosis) and the high average recombination rate of the chicken genome (52).
However, a low recombination rate in the chromosomal region was also
observed in the consensus map for chicken (102), which made us conclude
that IG*IG allele is not associated with a large inversioninversions are
known to suppress recombination.
Using IBD mappingunder the assumption that most IG*IG chromosomes trace back to one common ancestorwe identified a shared haplotype
among IG birds from 5 different populations that spanned 438,921 bp. The
region harbored three ENSEMBL gene predictions corresponding to the
protein-coding genes COMTD1, C10ORF11 and ZNF503. A forth ENSEMBL gene prediction (ENSGALG00000024010) was also located in the
region but was only supported by a weak sequence similarity to a protein
encoded by the baboon herpes virus (Uniprot ID; Q9Q5K9). An even smaller haplotype associated with Inhibitor of Goldcontaining only
COMTD1was also determined. This IBD region was just supported by one
recombinant IG*IG chromosome and further analyses were performed on the
high confidence 439 kb minimum shared region.
To search for potential causal mutations for the IG phenotype, the whole
genome sequence of a bird homozygous IG*IG was generated, using nextgeneration Illumina sequencing technology. Polymorphic sites, including
single nucleotide polymorphisms, small insertions and deletions and large
structural variants, were identified and intersected with coding and evolutionary conserved sequences within the high confidence IBD regionin
which the causative mutation was supposed to be located. The only mutation
that fell in the coding or highly conserved sequences was a 2-bp-insertion in
the 5th exon of the COMTD1 gene. The insertion was predicted to alter the
reading frame of COMTD1 resulting in a premature stop codon.
Reverse Transcriptase PCR analysis of RNA samples derived from feather follicles representing the three genotypes at the IG locus, revealed the
presence of two different COMTD1 transcripts associated with the IG*IG
allele: one full-length transcript with the 2-bp insertion, and a second with
the 2-bp insertion that was lacking the 6th exon of COMTD1. The proteins
(Chicken_IG1 and Chicken_IG2) translated from the two IG*IG associated
transcripts, were both predicted to be completely or partially non-functional,
as both were lacking C-terminal sequences that were highly conserved
across vertebrate orthologs (Figure 5).
34
Figure 4
Chicken
Chicken_IG1
Chicken_IG2
Zebra_finch
G. Anole
Pufferfish
Dog
Human
Salmon
Mouse
. * :*:*.:*:**** * *
----MPLFSVPKEVAVGTAMLGVAFATGLLAGKRYPS-LLFGTSKPHKSIIGKSSPLWQYILDHSLREHPILKKL
----.................................................
.................................................. .....................
----.................................................. .....................
----........................M....L.....V..SPS---GM.R.NN..R.................
----....GIS..A.....V...VA-A.FF..R.SR.RFV...P.------.SPNL.Q..V......
....S.T..
----------------------------------------MSIL.S--HVGK--D..M..V.S..........G.
MSPPA.RL.V.AAL.L.S.A.GA...A...LGR.G.P---WRSRRERRLLPPEDS...R.L.SR.M......RS.
MTQPV.RL.V.PAL.L.S.A.GA......FLGR.C.P---WRGRREQCLLPPEDSR....L.SR.M......RS.
----MSFH.VSREALI.S.T.G.V.VA.YYIG..RST-FSMDIF.S--HSGA.DN..M..V.N......
.......
MAQPV.RL.I.AAL.L.S.A.GA.......LGK.W.P---W.SRRQERLLPPEDN.....L.SR.M......RS.
1.......10........20........30........40........50........60........70.....
Chicken
Chicken_IG1
Chicken_IG2
Zebra finch
G. Anole
Pufferfish
Dog
Human
Salmon
Mouse
:: * :
*:: :*:*::*** :**:***.:::*. ***.:* :**.:* * :::*::.
.::*:*.*
RLLTAEYPWGKMMVSCDQAQLMANLIKLIKAKKVIEVGVLTGYNALSMALALPDNGRVIACDINEDYAKIGKPLW
...........................................................................
...........................................................................
.....DH.S.R.....E........V..........I..F......N...V........................
KMV..DRRDKR....R.........AR..R.........F....T.N...VI..D.K.V...V..ES.NL...V.
..R.M.DSYNV...A.E.S.F....A..........I..Y....T........ED.ALV....S.E..N....F.
....L.Q.QGD..MT.E....L...AR......ALDL.TF...S..AL.....PA...VT.EV.PGPPEL.R...
....L.Q.QGDS.MT.E....L...AR..Q...ALDL.TF...S..AL.....AD...VT.EVDAQPPEL.R...
..R.MDDS.NFI.VAAE.S......AR......AI.I..Y....T..I..V...D.KMV....KDE.PN....F.
....L.Q.QGDS.MT E....L...AR......ALDL.TF...S..AL.....EA...VT.EVDAEPP.L.R.M.
...80........90.......100.......110.......120.......130.......140.......150
Chicken
Chicken_IG1
Chicken_IG2
Zebra finch
G. Anole
Pufferfish
Dog
Human
Salmon
Mouse
::*
:**:**::** :***:*:: *** ***...:**** .
***:.* *:: **::*: *: * *::*
KEAGVDHKIDLRIKPATQTLDELLAGGEAETFDFAFIDADKESYNEYYEKCLRLIKKGGIIAIDNVLRCGMVLKP
.................LKHLMNCWLVEKQKPLTLLSLMRIKKATMSTMKNVCAS
.................LKHL----- ---------------------------------------..........
.....E..........I......V.N..........................R.............FWN.K....
.....E..........S.......SN...G............G..D.........R...........LS.K..N.
.....EQ......Q..LK......S.........V......QN.DN....S.L.LR...VV......WG.D..N.
RQ.EEE...E..L...LE.......A...G...V.VV.....NCAA...R....LRS..VL.VLS..WR.E..Q.
RQ.EAE......L...LE.......A...G...V.VV.....NCSA...R....LRP...L.VLR..WR.K..Q.
.....AQ......Q..LK...D...N........V......AN.DN....S...VR........N..WS.K.VN.
.Q.E.EQ.....LQ..L........A...G...I.VV.....NCTA...R....LRP..VL.VLR.VWR.E..Q.
.......160.......170.......180.......190.......200.......210.......220.....
Chicken
Chicken_IG1
Chicken_IG2
Zebra finch
G. Anole
Pufferfish
Dog
Human
Salmon
Mouse
35
36
37
region showed the forth-lowest ZHp score (8.2) among all ZHp scores in the
all-domestic comparison, and thus confirmed the capacity of our approach to
detect true selective sweeps.
For the all-domestic comparison, a total of 58 windows representing 21
loci, displayed a ZHp score lower than 6. The putative selective sweep with
the lowest ZHp score (9.2) occurred at the locus encoding the thyroid
stimulating hormone receptor (TSHR) on chromosome 5. This was a noteworthy discovery since TSHR has a well-established role in metabolic regulation and reproduction (113-116), changes in which are associated with the
domestication process (26). To examine the geographical distribution of the
selected TSHR haplotype, genotyping of eight SNP loci at the TSHR locus in
271 birds representing 36 different chicken populations of a worldwide distribution, was performed. This revealed an amazingly high homozygosity at
the TSHR locus: 264 of the 271 birds were homozygous for the TSHR sweep
haplotype and the remaining 7 were heterozygous (Figure 6). This extreme
fixation observed in distantly related chicken populations led us to propose
that TSHR may be a so-called domestication locus, and have played an important role in shaping domestic chicken.
To search for selected mutations, Sanger sequencing was utilized to cover
gaps in the TSHR sweep region. The non-synonymous mutation at nucleotide
position 43,250,347 on chromosome 5resulting in the glycine (Gly) to
arginine (Arg) substitution at residue 558 in the TSHR proteinwas considered a top candidate for the selective sweep at the TSHR locus. The glycine
at that position was found to be completely conserved among all known vertebrate TSHR sequences, which suggested that the substitution most likely
would alter the function of TSHR.
39
A previous experiment aimed at identifying QTL for growth used an intercross between the red jungle fowl and White Leghorn (117). Even though
the TSHR locus was determined to segregate in the intercross, it did not
overlap with any of the QTL regions previously reported, suggesting that the
mutation is not affecting growth. Instead, we speculate that the domestic
TSHR allele potentially acts to reduce the regulation placed on seasonal reproduction. TSHR signaling has an established role in the regulation of photoperiodic control of reproduction (114-116), and the absence of strict reguwww.VectorOpenStock.com
lation of seasonal breeding is in fact a common feature in domestic animals
(118).
Several of the putative sweep regions identified in the commercial broiler
lines occurred at genes with roles in obesity (TBC1D1), growth (IGF1,
INSR), and regulation of appetite (PMCH). This was anticipated since broiler
chickens have primarily been selected for muscle growth. An additional 5
putative selected loci with ZHp scores less than 6 were also identified in the
broiler lines.
We also searched for loss-of-function mutationsnonsense mutations
and deletionsin the sequence data, since these have been proposed as an
important factor in rapid evolution (119). Very little evidence was found that
the selection of loss-of-function mutations had been important during the
domestication process of chicken. Out of 1300 identified deletions, only
seven affected coding sequences, and two of those were considered to be
functional. The first deleted sequence in the growth hormone receptor (GHR)
gene, which was previously described to cause sex-linked dwarfism (120).
40
The second was a deletion in the uncharacterized gene SH3RF2 (SH3 domain containing ring finger 2). This deletion was considered to be especially
interesting since it sat within a QTL region for growth identified in the F2
generation of a cross between the High and Low growth lines. The High
growth line was fixed for the deletion, while it occurred at a low frequency
in the Low Growth line, and expression data from the hypothalamus showed
SH3RF2 expression in the Low growth line but not in the High growth line.
A QTL analysis with genotype data from the SH3RF2 locus in the F8 generation of the High and Low growth line intercross revealed a strong association between the deletion and increased growth. This suggests that the deletion in SH3RF2 is the causal mutation for the QTL on chromosome 20.
41
In papers I and III of this thesis, we present two genetic mechanisms underlying some of the variation in carotenoid and melanin pigmentation observed
in the domestic chicken. The genes identified to control the two monogenic
traits are excellent candidates for deciphering the genetic basis of the variation in carotenoid and pheomelanin pigmentation that is observed in many
other organisms. In paper II, we take one step closer to unraveling the genetic basis for the transformation of the wild red jungle fowl into the highly
productive domestic chicken. In combination, all three papers have advanced
our understanding of the genetic mechanisms underlying phenotypic variation.
The yellow skin study (Paper I) provides the first evidence of a hybrid
origin of the domestic chickenas proposed by Hutt in 1949. Despite that
the amount of genetic introgression from the grey jungle fowl seems to be
limited, it would be worthwhile to explore the extent of introgression from
the grey jungle fowl and the additional two jungle fowl species on a genomewide basis. This may be achieved by generating the genome sequences for
the four jungle fowl species, and a set of chicken breeds, and searching for
genomic regions where the chicken sequences cluster tightly with one of the
jungle fowls, rather than red jungle fowl. Such regions are thought to be
enriched for functional alleles since they have not been removed from the
chicken population through the extensive backcrossing that most likely took
place after the introgression. These alleles would probably also exert large
phenotypic effects on the selected traits, if we assume that the introgression
occurred early during domestication and before the worldwide dispersal of
chicken, when visual selection was the only selection method present. Mutations with small phenotypic effects on traits such as growth and reproduction
have probably first increased in frequency within the last 200 years or so,
when more advanced methods for phenotypic selection were developed.
This study also provides insight into the molecular mechanisms underlying variation in carotenoid pigmentation between two distinct wild species
namely the red and grey jungle fowlsand thus sheds some light on the
evolution of carotenoid-based pigmentation in wild species. Our study did in
fact compare sequences from two different species even though the sequences were segregating in the gene pool of a single species. With the fact that
grey jungle fowl has yellow/red skin and red jungle fowl white skin, we can
be quite confident that the mutation controlling the variation in carotenoid43
pigmentation arose prior to domestication. However, we are not able to determine if it is yellow or white skin that represents the ancestral statethere
is no reliable information of leg color of the closely related green jungle
fowl. It is quite remarkable that a study aimed at to identify the gene responsible for a phenotype observed within a single species, led to the detection of
the genetic mechanism underlying variation in a phenotypic trait, observed
between two distinct species.
The BCO2 gene is hence an excellent candidate for examining the genetic
mechanisms underlying variation in carotenoid-based pigmentation in other
species. We have recently started a projectin collaboration with evolutionary biologiststo test if BCO2 underlies the variation in beak color (yellow
or white/pink) observed in another bird species.
Our study also adds support to the view that cis-regulatory mutations have
played a significant role in the evolution of phenotypic variationthe yellow skin phenotype is caused by a cis-regulatory mutation affecting the expression of BCO2 in skin but not in liver. BCO2 has recently been shown to
be a mitochondrial protein with an important role in carotenoid homeostasis
and the protection of the mitochondria against carotenoid induced dysfunction (16). In addition Bco2 knockout mice display enlarged spleens and reduced testis size compared to wild-type mice after being fed a carotenoidrich diet (121). A complete loss-of-function mutation of BCO2 would most
likely confer deleterious effects on many traits and hence would be removed
from the gene pool. Interestingly, no adverse effects on health or development are associated with the complete loss-of-function (nonsense) mutations
of BCO2 causing increased carotenoid levels in cow milk (100, 101) and
yellow fat in cow (101) and sheep (73). A more thorough investigation of
numerous phenotypic traits for the different genotypes is required to truly
rule out the presence of deleterious effects associated with loss-of-function
mutations in BCO2 in these species.
Regarding the Inhibitor of Gold locus (Paper III), several functional experiments are required to develop our understanding of how the 2-bp insertion in COMTD1 results in reduced pheomelanin pigmentation. Characterization of the morphology and the number of melanocytes in feather follicles
derived from IG and wild type birds, would perhaps reveal if the COMTD1
protein exerts a protective role against the toxic intermediates produced during pheomelanogenesis. Whereas knock-down or over-expression experiments of COMTD1 in melanocyte cell lines, followed by the quantification
of pheomelanin using HPLC, would indicate if COMTD1 possesses a regulatory role in the synthesis of pheomelanin. To assess if COMTD1 is affecting
pheomelanin pigmentation, through regulating the levels of available cysteine for the pheomelanin synthesis, an appropriate experiment would be to
measure the cysteine levels in blood plasma from IG and wild type birds
(122).
44
It is interesting to speculate why COMTD1 has never previously been associated with pheomelanin pigmentation in other specieseven through
extensive mouse mutagenesis screens. One explanation could be that in other
species, a mutant COMTD1 would only have a subtle effect on pheomelanin
pigmentation and is thus not detected in the mouse mutagenesis programs.
Another explanation is that COMTD1 mutations are perhaps lethal due to
negative pleiotropic effects on essential physiological processes. The Inhibitor of Gold bird represents a unique resource in which to study the function
of COMTD1.
In paper II, we identified a number of loci in the chicken genome that
likely have been under selection during the domestication process of chicken. These regions are thus most likely enriched for alleles that can explain
some of the major phenotypic differences observed between the major wild
ancestor, red jungle fowl, and the domestic chicken. We also detected a
number of putative selected loci specific to meat or egg producing chicken
lines. The selected loci, found in broiler and layer chickens, may be utilized
to advance our knowledge of how energy metabolism, appetite and reproduction are regulated in vertebrate species.
First, further investigations in terms of genetic and functional experiments
are required to establish if sweep signals are the result of selection or genetic
drift. This may be followed up with experiments to determine the link between the selected haplotypes and phenotypes. One obvious effort that
would add support to the premise that selection has created the sweep signals, is if the selected mutations responsible for the sweep signatures were
found. For instance, non-synonymous single nucleotide polymorphisms
(nsSNP(s)) are fairly easy to detect and may be examined using SIFT (90) or
PolyPhen software (123), to predict the functional effect of the polymorphism. The nsSNPspredicted to have a functional effectcould be further
investigated using intercrosses between different chicken lines (e.g. egg layer/broiler or White Leghorn/red jungle fowl) if they were found to segregate
within these populations. QTL analyses with the SNP loci genotypes could
reveal if they are associated with any of the phenotypes collected from the
intercross. All putative sweep regionswith or without the presence of candidate mutationsshould be genotyped in intercrosses that are believed to
segregate at the sweep loci, in order to search for associations between the
sweep haplotypes and the recorded traits. In fact, such analyses are now being performed with broiler sweep loci in an intercross between egg layer and
broiler chickens.
The genes within, or in close proximity to, the putative sweep regions
should also be examined to see if they display differential expression between the appropriate populationsi.e. red jungle fowl vs. domestic and
layer vs. broilerthat may unearth if the selected mutations have a cisacting effect on gene expression. The combination of results from the abovedescribed analyses will perhaps provide the evidence required to decide
45
which of the putative selective sweeps that are the result of selection or genetic drift, and also possibly link the true selected loci to their phenotypes.
The supposed domestication locus at the TSHR gene should be followed
up with functional experiments and segregation analysis with data of phenotypic traits that are proposed to be affected by altered TSHR signaling. It is
tempting to speculate that the regulation of thyroid hormone signaling is
altered in other domesticated speciesresulting in many of phenotypes associated with the domestication process. This may be settled when the data
concerning the genetic basis of domestication in other animal species are
examined and made public.
Our study provides evidence that it is possible to detect loci that have
been under strong positive selection during chicken domestication using
next-generation sequencing technology on pools of DNA. The same should
now be possible for mammalian domesticated species with larger genomes
sizes than chicken, since the cost per sequenced base has dropped considerably in the last two years. It would also be interesting to utilize our approach
to detect signatures of selection in wild populations, in order to identify haplotypes controlling phenotypic traits that are important for adaptation to different environments.
46
Acknowledgements
47
mushroom picking, rugby, Silvio Berlusconi, Chinese pop music, breastfeeding, beekeeping, to kitchen blenders.
Jag vill tacka min mentor Christer Harbeck som ftt mig att inse att
mycket spnnande vntar efter disputation. Samt fr att ha ptalat vikten av
planering och frberedelse infr stora prvningar, som att skriva avhandling
och att sedan frsvara den.
Jag vill ocks framfra ett stort tack till min omfattande familj i Upplands
Vsby, Leksand, Solna, Norrtlje, Tby, Julita, Tullinge, o.s.v. Jag vill speciellt tacka pappa, mamma, och mina brder fr all gldje och allt std ni
gett mig under rens gng. Ett gigantiskt tack till min mamma fr alla de
timmar du tragglade plutifikationstabellen och hglsning med decemberpojken Jonas. Tragglandet visade ju sig ge resultat!
Ett stort tack till alla mina vnner som utsttt min extra ptagliga frnvaro
det senaste halvret. Nu kommer det skert finnas mer tid fr ost- och vinkvllar och Halo sessioner.
Sist men absolut inte minst, vill jag tacka min underbara Anna, fr att du
ordnat med allt hemma, kpt mina julklappar, korrekturlst min avhandling
och till och med hunnit ordna en nobelmiddag t mig. Utan din matlagningskonst hade jag nog tappat ytterligare 5 kg i vikt, och utan dina lugnande ord
hade jag varit mycket mer stressad. Det ska bli underbart att nu f mer tid
med dig och jag ser framemot allt spnnande vi tillsammans har framfr oss
i livet.
48
References
1.
2.
3.
4.
5.
6.
7.
8.
9.
10.
11.
12.
13.
14.
15.
16.
17.
18.
19.
49
20.
21.
22.
23.
24.
25.
26.
27.
28.
29.
30.
31.
32.
33.
34.
35.
36.
37.
38.
39.
40.
50
41.
42.
43.
44.
45.
46.
47.
48.
49.
50.
51.
52.
53.
54.
55.
56.
57.
58.
59.
60.
51
61.
62.
63.
64.
65.
66.
67.
68.
69.
70.
71.
72.
73.
74.
75.
76.
77.
78.
79.
52
80.
53
54
55