Professional Documents
Culture Documents
B
we review these recent findings and highlight the
acteria and archaea are constantly threat- Red Queen CRISPR adaptation insights that they offer on the function of different
ened by phage infection and invasion by The ability to keep defenses up to date by ac- CRISPR adaptation mechanisms used by diverse
mobile genetic elements (MGEs) through quiring new spacers is central to the success of CRISPR-Cas systems.
conjugation and transformation. In response, CRISPR-Cas systems. Typically, new spacers are
a defense arsenal has evolved, including inserted at a specific end of the CRISPR array, The molecular basis of
various innate mechanisms and the CRISPR-Cas adjacent to a “leader” region that contains con- CRISPR adaptation
(clustered regularly interspaced short palindromic served sequence motifs (4, 12–14). The leader CRISPR adaptation requires the integration of
repeats and associated proteins) adaptive immune usually also contains the promoter driving CRISPR new spacers into CRISPR loci and duplication
systems (1–3). CRISPR-Cas systems are widely dis- transcription, and it has been demonstrated that of the associated repeat sequences. The Cas1 and
tributed, occurring in 50 and 87% of complete integration of new spacers at the leader end Cas2 proteins, which form a Cas14-Cas22 complex
bacterial and archaeal genomes, respectively. These enhances defense against phages and MGEs (hereafter, Cas1-Cas2) (29, 30), constitute the “work-
systems function as RNA-guided nucleases that encountered recently (15). This “polarized” addi- horse” of spacer integration. Spacers added to
provide sequence-specific defense against invading tion of spacers into CRISPR loci produces a chron- CRISPR arrays must be compatible with the
MGEs (4, 5). The repurposing of sequence-specific ological account of the encounters between diverse range of type-specific effector complex
Cas nucleases, particularly Cas9, has stimulated a phages and bacteria that can provide insights machinery (Fig. 1). Thus, despite being near-
biotechnological revolution in genome editing into phage-host co-occurrences, evolution, and ubiquitous among CRISPR-Cas types, Cas1-Cas2
(6). In native hosts, the advantage conferred by ecology (16, 17). However, phage and MGE var- homologs meet the varied requirements for the
CRISPR-Cas systems over innate defenses lies in iants with genetic mutations can avoid detection acquisition of appropriate spacer sequences in
the ability to update the resistance repertoire in by existing CRISPR spacers; these evaders are different systems. For example, the effector com-
response to infection (a process termed CRISPR termed “escape mutants.” Additionally, spacers plexes of several CRISPR-Cas types only recognize
adaptation). CRISPR adaptation is achieved by can be lost from CRISPR arrays by recombina- targets containing a specific sequence adjacent
incorporating short DNA fragments from MGEs tion between the repeats (16, 18). Thus, main- to where the CRISPR RNA (crRNA) base-pairs
into CRISPR arrays to form memory units called tenance of CRISPR-Cas defense is reliant on the with the target strand of a MGE (Fig. 1) (31). The
spacers. Early bioinformatic studies showed that addition of new spacers into CRISPR arrays (19, 20). crRNA-paired target sequence is termed the pro-
many spacers were of foreign origin, hinting that The continuous competition between host CRISPR tospacer, and the adjacent target-recognition mo-
CRISPR loci may act as a form of memory for a adaptation and MGE escape, akin to Red Queen tif is called a protospacer-adjacent motif (PAM)
prokaryotic immune system (7–10). Subsequent dynamics, has been exposed in several recent meta- (32). PAM-based target discrimination prevents
confirmation of the link between spacers and genome studies (21, 22). Individual cells within the unintentional recognition and self-destruction
resistance to phage and MGEs was gained ex- a prokaryotic community acquire different, and of the CRISPR locus by the crRNA-effector com-
perimentally (4, 5, 11). An overview of CRISPR- often multiple, spacers during CRISPR adapta- plex, yet canonical PAM sequences vary between
Cas–mediated defense and CRISPR adaptation tion (23, 24). The diversity of CRISPR loci within and sometimes within systems.
is provided in Box 1. cell populations optimizes defense by limiting the Much of what we know about the Cas1-Cas2
reproductive success of mutants that escape the molecular structure and function has been gained
CRISPR-Cas defenses of individual cells (25). Fur- from studies in the Escherichia coli type I-E sys-
1
Department of Microbiology and Immunology, University of thermore, the resulting polymorphisms in CRISPR tem. Within the Cas14-Cas22 complex, the Cas1
Otago, Post Office Box 56, Dunedin 9054, New Zealand. loci enable fast and accurate differentiation of spe- subunits form two dimers that are bridged by a
2
Department of Bionanoscience, Kavli Institute of
Nanoscience, Delft University of Technology, Van der
cies subtypes, which may prove to have economic central Cas2 dimer (Fig. 2A) (29, 33, 34). Cas1-
Maasweg 9, 2629 HZ Delft, Netherlands. 3Bio-Protection and clinical benefits—for example, enabling trac- Cas2–mediated spacer integration prefers double-
Research Centre, University of Otago, Post Office Box 56, ing of pathogens during outbreaks (7, 26). stranded DNA (dsDNA) substrates and proceeds
Dunedin 9054, New Zealand. 4Laboratory of Microbiology, through a mechanism resembling retroviral inte-
Wageningen University, Wageningen, Netherlands. Origins of CRISPR adaptation
*These authors contributed equally to this work. †Corresponding
gration (35, 36). In addition to Cas1-Cas2, at least
author. Email: peter.fineran@otago.ac.nz (P.C.F.); stanbrouns@ According to their constituent Cas proteins, CRISPR- one CRISPR repeat, part of the leader sequence
gmail.com (S.J.J.B.) Cas systems are classified into two major classes (12, 13, 15, 37), and several host factors for repair
of the insertion sites (e.g., DNA polymerase) are type I systems, the presence of a canonical PAM stranded ends of the prespacer extend into active
required (38). Spacer acquisition involves three within the prespacer substrate increases the af- subunits of each corresponding Cas1 dimer (Fig.
main processes: substrate capture, recognition finity for Cas1-Cas2 binding but is not requisite 2, A to C) (33, 34). The length of new spacers is
of the CRISPR locus, and integration within the (33). Details of how prespacer substrates are pro- governed by the fixed distances between the two
array. duced from foreign DNA are discussed later. For Cas1 wedges and from the branch points to the
the E. coli type I-E Cas1-Cas2–prespacer complex, integrase sites. Many CRISPR-Cas systems have
Cas1-Cas2 substrate capture the ends of the dsDNA prespacer are splayed by highly consistent yet system-specific spacer lengths,
During substrate capture, Cas1-Cas2 is loaded with tyrosine wedges in each Cas1 dimer, which lock and it is likely that analogous wedge-based Cas1-
an integration-compatible prespacer, which is open the DNA branch points while fixing in place Cas2 “molecular rulers” exist in these systems to
thought to be partially duplexed dsDNA (35). For a core 23–base pair dsDNA region. The 3′ single- control prespacer length (33, 34). However, in some
systems, such as type III, the length of spacers
found within CRISPR arrays appears more var-
iable, and studies of Cas1-Cas2 structure and func-
Box 1. A roadmap of CRISPR-Cas adaptation and defense. In the example illustrated,
tion in these systems are lacking.
a bacterial cell is infected by a bacteriophage. The first stage of CRISPR-Cas defense is CRISPR
adaptation. This involves the incorporation of small fragments of DNA from the invader into the Recognition of the CRISPR array
host CRISPR array. This forms a genetic “memory” of the infection. The memories are stored
Before integration, the substrate-bound Cas1-
as spacers (colored squares) between repeat sequences (R), and new spacers are added at
Cas2 complex must locate the CRISPR leader-
the leader-proximal (L) end of the array. The Cas1 and Cas2 proteins, encoded within the cas
repeat sequence. Specific sequences upstream
gene operon, form a Cas1-Cas2 complex (blue)—the “workhorse” of CRISPR adaptation. In
5' 5'
Ruler
Cas1 ‘wedge’
PA
M
Cas2 dimer 3'
3' 180°
PAM sensing site
Cas1 dimers
PA
3' 5' 3'
PA
M
M
5' 3' 5'
CT
T
ii
PAM
PA
M
16 nt iii
M
GT PA
GT
T CCCCGCGCCAGCGGGGATAAACC
CACAAGGGGCGCGGTCGCCCCTATTTGG PAM-proximal nucleophilic attack PAM-distal nucleophilic attack
8 nt
Anchor sequences 180° 180°
iv
Fig. 2. Cas1-Cas2–mediated spacer acquisition. (A) The Cas1-Cas2 protein (D) Spacer integration proceeds as follows: (i) The Cas1-Cas2–prespacer complex
complex loaded with a prespacer substrate (the E. coli type I-E structure is binds to the leader (green) and first repeat (black). For type I and type II systems,
shown; Protein Data Bank ID, 5DQZ). (B) The Cas1 PAM sensing site, showing Cas1-Cas2 docking to the leader-proximal repeat is assisted by integration host
the canonical type I-E PAM (CTT, yellow) residue-specific interactions (a residue factor (IHF) and recognition of the leader-anchoring site (LAS), respectively. (ii)
from the noncatalytic Cas1 monomer is annotated with an asterisk) and the site The first nucleophilic attack most likely occurs at the leader-repeat junction
of PAM processing (scissors). H, histidine; K, lysine; Q, glutamine; R, arginine; Y, and gives rise to a half-site intermediate. (iii) The second nucleophilic attack
tyrosine. (C) A schematic representation of the substrate-loaded Cas1-Cas2 occurs at the boundary between the repeat and the spacer (orange), re-
protein complex with the active PAM-sensing site highlighted (light purple) sulting in full-site integration. (iv) Host DNA repair enzymes fill the integra-
and a partially duplexed DNA prespacer substrate (strands are purple and tion site. (E) The type I-E repeat is magnified to indicate the inverted repeats
pink). The ruler mechanism determining spacer length for the E. coli type I-E within its sequence and highlight the anchoring sites of the molecular rulers
system uses two conserved tyrosine residues (the “Cas1 wedge;” gray hexagons). that determine the point of integration. nt, nucleotides.
spacers (i.e., their corresponding protospacers) has existing spacers are not strictly necessary (79), Other host proteins may also be necessary for
revealed subtype-specific patterns and has provided suggesting that the PAM interactions of Cas9 prespacer substrate production. For example, RecG
much of our insight into the mechanisms of primed could be sufficient to select appropriate new spac- is required for efficient primed CRISPR adaptation
CRISPR adaptation (23, 43, 44, 60, 74, 76, 77). In ers. Some Cas9 variants can also function with in type I-E and I-F systems, but its precise role re-
type I-E systems, new protospacers typically map to non-CRISPR RNAs and tracrRNA (80). This raises mains speculative (38, 90). Additionally, it is still
the same strand as the priming protospacer (43, 44) the possibility that host or MGE-derived RNAs enigmatic why some CRISPR-Cas systems require
(Fig. 4). For type I-B priming, Cas3 is predicted to might direct promiscuous Cas9 activity, resulting accessory proteins, whereas closely related types
load onto either strand at the priming proto- in DNA breaks or replication fork stalling that do not. For example, type II-C systems lack cas4
spacer, resulting in a bidirectional distribution could potentially result in prespacer generation. and csn2, which assist CRISPR adaptation in type
of new protospacers (76). For type I-F priming, II-A and II-B systems, respectively. These type-
the first new protospacer typically maps to the Roles of accessory Cas proteins in specific differences exemplify the diversity that has
strand opposite the priming protospacer, in a di- CRISPR adaptation arisen during the evolution of CRISPR-Cas systems.
rection consistent with Cas3 loading and 3′ to 5′ Although Cas1 and Cas2 play a central role in
helicase activity on the nontarget strand. Once CRISPR adaptation, type-specific variations in cas The genesis of adaptive
the first spacer is acquired, two protospacer tar- gene clusters occur. In many systems, Cas1-Cas2 is immunity in prokaryotes
gets in the MGE are recognized, and prespacer assisted by accessory Cas proteins, which are often Expanding knowledge of the molecular mech-
production can be driven from both locations mutually exclusive and type-specific (27). For ex- anisms underlying CRISPR adaptation indicates
(23, 74) (Fig. 4). However, priming is stimulated ample, in the S. thermophilus type II-A system, the evolutionary origin of CRISPR-Cas systems (91).
more strongly from the interference-proficient deletion of csn2 impaired the acquisition of spacers Casposons are transposon-like elements typified
is relatively unexplored, but two aspects may be RE FERENCES AND NOTES 21. C. L. Sun, B. C. Thomas, R. Barrangou, J. F. Banfield,
interrelated. First, DNA breaks (52) could stimulate 1. R. L. Dy, C. Richter, G. P. Salmond, P. C. Fineran, Remarkable Metagenomic reconstructions of bacterial CRISPR loci
mechanisms in microbes to resist phage infections. Annu. constrain population histories. ISME J. 10, 858–870 (2016).
the generation of substrates for spacer acquisition doi: 10.1038/ismej.2015.162; pmid: 26394009
Rev. Virol. 1, 307–331 (2014). doi: 10.1146/annurev-virology-
(Fig. 3), and second, the stalling of infection could 031413-085500; pmid: 26958724 22. D. Paez-Espino et al., Uncovering Earth’s virome. Nature 536,
buy time for CRISPR adaptation (78, 102, 103). 2. J. E. Samson, A. H. Magadán, M. Sabri, S. Moineau, Revenge 425–430 (2016). doi: 10.1038/nature19094; pmid: 27533034
Analogously, it remains to be determined whether of the phages: Defeating bacterial defences. Nat. Rev. 23. R. H. Staals et al., Interference-driven spacer acquisition is
Microbiol. 11, 675–687 (2013). doi: 10.1038/nrmicro3096; dominant over naive and primed adaptation in a native
interference by CRISPR-Cas systems other than CRISPR-Cas system. Nat. Commun. 7, 12853 (2016).
pmid: 23979432
type I can also stimulate primed CRISPR adapta- 3. L. A. Marraffini, CRISPR-Cas immunity in prokaryotes. Nature doi: 10.1038/ncomms12853; pmid: 27694798
tion. If not, the benefits of priming might provide 526, 55–61 (2015). doi: 10.1038/nature15386; pmid: 26432244 24. D. Paez-Espino et al., Strong bias in the bacterial CRISPR
an explanation for why type I systems are the 4. R. Barrangou et al., CRISPR provides acquired resistance elements that confer immunity to phage. Nat. Commun. 4, 1430
against viruses in prokaryotes. Science 315, 1709–1712 (2013). doi: 10.1038/ncomms2440; pmid: 23385575
most prevalent and diverse CRISPR-Cas type. 25. S. van Houte et al., The diversity-generating benefits of a
(2007). doi: 10.1126/science.1138140; pmid: 17379808
It is also unclear why many CRISPR-Cas sys- 5. S. J. Brouns et al., Small CRISPR RNAs guide antiviral prokaryotic adaptive immune system. Nature 532, 385–388
tems have more than one CRISPR array that is defense in prokaryotes. Science 321, 960–964 (2008). (2016). doi: 10.1038/nature17436; pmid: 27074511
used by a single set of Cas proteins. Given that doi: 10.1126/science.1159689; pmid: 18703739 26. F. Liu et al., Novel virulence gene and clustered regularly
6. A. V. Wright, J. K. Nuñez, J. A. Doudna, Biology and interspaced short palindromic repeat (CRISPR) multilocus
Cas1-Cas2 is directed to leader-repeat junctions sequence typing scheme for subtyping of the major serovars
applications of CRISPR systems: Harnessing nature’s toolbox
during integration, multiple arrays might provide for genome engineering. Cell 164, 29–44 (2016). of Salmonella enterica subsp. enterica. Appl. Environ.
additional integration sites, increasing CRISPR doi: 10.1016/j.cell.2015.12.035; pmid: 26771484 Microbiol. 77, 1946–1956 (2011). doi: 10.1128/AEM.02625-10;
adaptation efficiency. In addition, parallel CRISPR 7. C. Pourcel, G. Salvignol, G. Vergnaud, CRISPR elements in pmid: 21278266
Yersinia pestis acquire new repeats by preferential uptake of 27. K. S. Makarova et al., An updated evolutionary classification
arrays should increase crRNA production from of CRISPR-Cas systems. Nat. Rev. Microbiol. 13, 722–736
bacteriophage DNA, and provide additional tools for
Nucleic Acids Res. 45, 367–381 (2017). doi: 10.1093/nar/ 62. A. G. Patterson, J. T. Chang, C. Taylor, P. C. Fineran, system. Structure 24, 70–79 (2016). doi: 10.1016/
gkw1151; pmid: 27899566 Regulation of the Type I-F CRISPR-Cas system by CRP-cAMP j.str.2015.10.019; pmid: 26671707
42. S. L. Shipman, J. Nivala, J. D. Macklis, G. M. Church, and GalM controls spacer acquisition and interference. 82. Z. Arslan et al., Double-strand DNA end-binding and sliding of
Molecular recordings by directed CRISPR spacer acquisition. Nucleic Acids Res. 43, 6038–6048 (2015). doi: 10.1093/nar/ the toroidal CRISPR-associated protein Csn2. Nucleic Acids
Science 353, aaf1175 (2016). doi: 10.1126/science.aaf1175; gkv517; pmid: 26007654 Res. 41, 6347–6359 (2013). doi: 10.1093/nar/gkt315;
pmid: 27284167 63. K. Severinov, I. Ispolatov, E. Semenova, The influence of pmid: 23625968
43. S. Shmakov et al., Pervasive generation of oppositely copy-number of targeted extrachromosomal genetic 83. K. H. Lee et al., Identification, structural, and biochemical
oriented spacers during CRISPR adaptation. Nucleic Acids elements on the outcome of CRISPR-Cas defense. Front. Mol. characterization of a group of large Csn2 proteins involved
Res. 42, 5907–5916 (2014). doi: 10.1093/nar/gku226; Biosci. 3, 45 (2016). doi: 10.3389/fmolb.2016.00045; in CRISPR-mediated bacterial immunity. Proteins 80,
pmid: 24728991 pmid: 27630990 2573–2582 (2012). doi: 10.1002/prot.24138;
44. D. C. Swarts, C. Mosterd, M. W. van Passel, S. J. Brouns, 64. T. Künne et al., Cas3-derived target DNA degradation pmid: 22753072
CRISPR interference directs strand specific spacer fragments fuel primed CRISPR adaptation. Mol. Cell 63, 84. A. Plagens, B. Tjaden, A. Hagemann, L. Randau, R. Hensel,
acquisition. PLOS ONE 7, e35888 (2012). doi: 10.1371/journal. 852–864 (2016). doi: 10.1016/j.molcel.2016.07.011; Characterization of the CRISPR/Cas subtype I-A system
pone.0035888; pmid: 22558257 pmid: 27546790 of the hyperthermophilic crenarchaeon Thermoproteus
45. C. Rollie, S. Schneider, A. S. Brinkmann, E. L. Bolt, M. F. White, 65. S. Redding et al., Surveillance and processing of foreign DNA tenax. J. Bacteriol. 194, 2491–2500 (2012). doi: 10.1128/
Intrinsic sequence specificity of the Cas1 integrase directs new by the Escherichia coli CRISPR-Cas system. Cell 163, JB.00206-12; pmid: 22408157
spacer acquisition. eLife 4, 08716 (2015). pmid: 26284603 854–865 (2015). doi: 10.1016/j.cell.2015.10.003; 85. J. Zhang, T. Kasciukovic, M. F. White, The CRISPR associated
46. V. Kunin, R. Sorek, P. Hugenholtz, Evolutionary conservation pmid: 26522594 protein Cas4 is a 5′ to 3′ DNA exonuclease with an iron-sulfur
of sequence and secondary structures in CRISPR repeats. 66. T. R. Blosser et al., Two distinct DNA binding modes guide cluster. PLOS ONE 7, e47232 (2012). doi: 10.1371/journal.
Genome Biol. 8, R61 (2007). doi: 10.1186/gb-2007-8-4-r61; dual roles of a CRISPR-Cas protein complex. Mol. Cell 58, pone.0047232; pmid: 23056615
pmid: 17442114 60–70 (2015). doi: 10.1016/j.molcel.2015.01.028; 86. S. Lemak et al., Toroidal structure and DNA cleavage by the
47. M. G. Goren et al., Repeat size determination by two pmid: 25752578 CRISPR-associated [4Fe-4S] cluster containing Cas4
molecular rulers in the type I-E CRISPR array. Cell Rep. 16, 67. R. P. Hayes et al., Structural basis for promiscuous PAM nuclease SSO0001 from Sulfolobus solfataricus. J. Am. Chem.
2811–2818 (2016). doi: 10.1016/j.celrep.2016.08.043; recognition in type I-E Cascade from E. coli. Nature 530, Soc. 135, 17476–17487 (2013). doi: 10.1021/ja408729b;
Res. 26, 1273–1287 (2016). doi: 10.1038/cr.2016.135; 4, 2087 (2013). doi: 10.1038/ncomms3087; 108. S. D. Perli, C. H. Cui, T. K. Lu, Continuous genetic
pmid: 27857054 pmid: 23820428 recording with self-targeting CRISPR-Cas in human cells.
100. E. R. Westra, A. Buckling, P. C. Fineran, CRISPR-Cas 104. J. Elmore, T. Deighan, J. Westpheling, R. M. Terns, M. P. Terns, Science 353, aag0511 (2016). doi: 10.1126/science.
systems: Beyond adaptive immunity. Nat. Rev. Microbiol. DNA targeting by the type I-G and type I-A CRISPR-Cas aag0511; pmid: 27540006
12, 317–326 (2014). doi: 10.1038/nrmicro3241; systems of Pyrococcus furiosus. Nucleic Acids Res.
pmid: 24704746 43, 10353–10363 (2015). pmid: 26519471 AC KNOWLED GME NTS
101. C. Almendros, N. M. Guzmán, J. García-Martínez, 105. J. Bondy-Denomy, A. Pawluk, K. L. Maxwell, Work in the Fineran laboratory on CRISPR-Cas is supported
F. J. Mojica, Anti-cas spacers in orphan A. R. Davidson, Bacteriophage genes that inactivate by a Rutherford Discovery Fellowship (to P.C.F.) from
CRISPR4 arrays prevent uptake of active the CRISPR/Cas bacterial immune system. Nature 493, the Royal Society of New Zealand (RSNZ), the Marsden Fund
CRISPR-Cas I-F systems. Nat. Microbiol. 1, 429–432 (2013). doi: 10.1038/nature11723; of the RSNZ, the University of Otago, and the Bio-Protection
16081 (2016). doi: 10.1038/nmicrobiol.2016.81; pmid: 23242138 Research Centre (Tertiary Education Commission). The Brouns
pmid: 27573106 106. A. Pawluk et al., Inactivation of CRISPR-Cas systems by anti- laboratory is funded by a European Research Council
102. K. S. Makarova, V. Anantharaman, L. Aravind, CRISPR proteins in diverse bacterial species. Nat. Microbiol. starting grant (639707), a Netherlands Organisation for
E. V. Koonin, Live virus-free or die: Coupling 1, 16085 (2016). doi: 10.1038/nmicrobiol.2016.85; Scientific Research (NWO) Vidi grant (864.11.005), and a
of antivirus immunity and programmed suicide pmid: 27573108 Foundation for Fundamental Research on Matter (FOM)
or dormancy in prokaryotes. Biol. Direct 7, 40 107. D. Vorontsova et al., Foreign DNA acquisition by the projectruimte grant (15PR3188). We thank members
(2012). doi: 10.1186/1745-6150-7-40; pmid: 23151069 I-F CRISPR-Cas system requires all components of the of the Fineran and Brouns groups for useful feedback on
103. M. E. Dupuis, M. Villion, A. H. Magadán, S. Moineau, interference machinery. Nucleic Acids Res. 43, the manuscript.
CRISPR-Cas and restriction-modification systems are 10848–10860 (2015). doi: 10.1093/nar/gkv1261;
compatible and increase phage resistance. Nat. Commun. pmid: 26586803 10.1126/science.aal5056
Editor's Summary
Article Tools Visit the online version of this article to access the personalization and
article tools:
http://science.sciencemag.org/content/356/6333/eaal5056
Science (print ISSN 0036-8075; online ISSN 1095-9203) is published weekly, except the last week
in December, by the American Association for the Advancement of Science, 1200 New York
Avenue NW, Washington, DC 20005. Copyright 2016 by the American Association for the
Advancement of Science; all rights reserved. The title Science is a registered trademark of AAAS.