Professional Documents
Culture Documents
DNA exists primarily as a double helix in which two complementary function as an analogue of GFP, we determined the co-crystal structure
strands hybridize, making the overall B-form helical geometry largely of Lettuce bound to DFHBI-1T at 2.5 Å resolution (Methods, Extended
independent from the nucleotide sequence9–11. Functional DNA mol- Data Fig. 1 and Extended Data Table 1).
ecules, the sophisticated biochemical or biophysical activities of which
arise from their specific nucleotide sequences, have been isolated
through in vitro selection12–14. Analogous to their RNA counterparts Overall structure of a fluorogenic DNA
(including aptamers, ribozymes and riboswitches), it is thought that The functional core of the 53-nucleotide Lettuce DNA adopts a con-
the non-helical elements of functional DNAs adopt non-canonical 3D voluted 3D structure. It is flanked on either end by B-form duplexes
folds that endow them with specific biochemical properties. However, (paired elements P1 and P2, the latter capped by a trinucleotide loop)
and in contrast to the rapidly growing body of structures of functional (Fig. 1c–e and Extended Data Fig. 1). The two formally ssDNA strands
RNAs determined by X-ray crystallography, NMR spectroscopy or cryo- connecting these duplexes on either side fold back on themselves to
genic electron microscopy (cryo-EM)15–19, only limited information is form two stem–loops (with non-canonical base pairs in their stems):
available on the atomistic basis of functional DNA activity. P1.1 and P2.1. This 4WJ, cloverleaf-like secondary structure adopts a
Lettuce is a previously reported4 DNA mimic of GFP. This formally 3D fold comprising two coaxial stacks (P1.1 stacking on P1, and P2.1
single-stranded DNA (ssDNA) induces fluorescence of conditional stacking on P2). The two stacks associate near co-linearly, coming
fluorophores20,21 such as DFHBI-1T, DFHO and DFAME (Fig. 1a,b), which together at the centre of an approximately 66-Å-long DNA fold. To
are small molecules derived from the intrinsic fluorophore of GFP. achieve this, P1.1 packs against the broadened minor groove at the
Numerous fluorescent turn-on aptamer RNAs have been character- base of P2, whereas P2.1 packs against the major groove that follows
ized, and such RNA mimics of GFP have been widely used as molecu- P1. This results in close apposition of four DNA strands at the centre of
lar tags in applications ranging from live imaging of cellular RNAs to Lettuce, where they assemble into an antiparallel G-quadruplex17,25,26
small-molecule sensors5,6,22–24. Structural determination of fluorogenic composed of two canonical G-quartets (Q1 and Q2) (Extended Data
RNAs has revealed complex, diverse 3D structures that enable these Fig. 1j). A single molecule of DFHBI-1T binds between Q2 and P1.1. The
aptamers to recognize their fluorophores and constrain their photo- presence of a G-quadruplex in Lettuce is consistent with previous4
excited states to produce fluorescence5. To elucidate how DNA can also biochemical characterization. The DNA molecule appears to fold
Laboratory of Nucleic Acids, National Heart, Lung, and Blood Institute, National Institutes of Health, Bethesda, MD, USA. 2Department of Pharmacology, Weill-Cornell Medical College, Cornell
1
University, New York, NY, USA. 3Present address: Beijing Institutes of Life Science, Chinese Academy of Sciences, Beijing, China. ✉e-mail: adrian.ferre@nih.gov
2,000
A T 50
P1 T A
1,500
T A
1,000
C G
P1 5′ 3′
P1
500
0 5′
3′ 3′
O
E
T
5′
AM
-1
H
DF
BI
DF
H
DF
Fig. 1 | Overall structure of the Lettuce fluorescence turn-on DNA aptamer. spheres highlight bound DFHBI-1T molecule. This colour scheme is used
a, Structures of the fluorophores DFHBI-1T, DFHO and DFAME. b, Fluorescence throughout. d, Orthogonal view. e, Secondary structure of Lettuce. Lines with
(in arbitrary units (a.u.)) of Lettuce in the presence of different fluorophores arrowheads and Leontis–Westhof symbols 51 denote connectivity and base
(mean ± s.d., n = 3 technical replicates). c, Cartoon representation of the pairs, respectively. f, Cryo-EM envelope of unliganded Lettuce superimposed
Lettuce–DFHBI-1T co-crystal structure. Arrows denote 5′-to-3′ chain direction; on the Lettuce–DFHBI-1T co-crystal structure.
purple and green spheres represent K+ and Mg 2+, respectively. Translucent
independently of its small-molecule ligand, as circular dichroism (CD) may be sterically restrained by the deoxyribose of G26. The trifluo-
spectra of fluorophore-bound and fluorophore-free Lettuce both sup- romethyl substituent of the imidazolone is in van der Waals contact
port the presence of a G-quadruplex (Extended Data Fig. 1k). Moreover, with the base of C20. The A11 base is unpaired, but it appears to be
single-particle cryo-EM analysis of the ligand-free DNA produced a held in place by K+ coordination and sandwiching between Q2 and
Coulomb potential map that indicated that the DNA is monomeric and the T12•C17•G26 base triple. A notable finding is the conformation
overall similar to the liganded structure (Fig. 1f, Methods, Extended of G44, the base of which lies at an angle of about 55° relative to the
Data Fig. 2 and Extended Data Table 3). Size-exclusion chromatography base of G26 (and most of the coaxially stacked nucleotides of Lettuce).
coupled to multi-angle light scattering confirmed that the monomeric This distinct arrangement allows a G44-G26 hydrogen bond while at
aptamer predominates in solution (Extended Data Fig. 1l). the same time bringing the N2 of G44 in proximity (3 Å) to a fluorine
of the fluorophore. A43 is also at the same unusual angle (as it stacks
on the base of G44), which enables it to simultaneously pack against
Fluorophore-binding site structure the T12•C17•G26 and G13•G16•C27 base triples (Figs. 2a,b and 3a,b and
The fluorophore-binding site of Lettuce is formed by the G-quartet Q2, Extended Data Fig. 3). Overall, several atypical interactions form the
the four DNA strands adjacent to it, C20 from P2.1 and a monovalent Lettuce fluorophore-binding site.
cation (Figs. 1e and 2a,b and Extended Data Fig. 3). Two monovalent We further examined the role of Lettuce binding-site nucleotides
cations (Tl+ in the crystals used for de novo structure determination in fluorophore recognition by determining co-crystal structures of
using the single-wavelength anomalous dispersion method; K+ in Lettuce–DFHO and Lettuce–DFAME complexes (Extended Data Fig. 3e–h,
the native crystals used for refinement) (Methods, Extended Data Extended Data Table 1 and Methods) and by performing site-directed
Fig. 3a–c and Extended Data Table 1) bind at the four-fold axis of the mutagenesis. Most of the interactions between the fluorophores
quadruplex. One is octahedrally coordinated between Q1 and Q2, as and the DNA molecule were conserved among the three structures
is typical for G-quadruplexes27. The second coordinates the carbonyl (Fig. 2a–d and Extended Data Fig. 3). The main differences were in
oxygen molecules of the guanine residues of Q2, the N1 imino nitrogen interactions of the divergent exocyclic substituents of the imida-
of A11 and the phenolate oxygen of the bound DFHBI-1T (additional zolones of the three fluorophores. Thus, the DFHO oxime hydrogen
ligands are presumed to be water molecules). Consistent with the bonds with N2 of G26, whereas the methylacrylate of DFAME coordi-
direct cation–fluorophore coordination, the fluorescence spectrum nates a Mg2+ ion. Mutations of A11, C17 and G26, and deletion of C20,
of Lettuce–DFHBI-1T exhibits an 8 nm bathochromic shift in the pres- largely abolished fluorescence, thereby confirming the importance
ence of Tl+ compared with the spectrum in K+ (Extended Data Fig. 3d). of these residues (Fig. 2e). The importance of close packing of the
The phenolate of the fluorophore also hydrogen bonds to the N6 DNA was underlined by the deleterious effect of replacing C17, C18,
amine of A11, and the phenyl ring is sandwiched between G18 and G25 and G26 with the corresponding ribonucleotides. Notably, the
the C17•G26 base pair. The two fluorine substituents of the phenyl C20T mutant activated DFHBI-1T fluorescence nearly as well as the
ring may also participate in weakly polar interactions28 with the N6 wild-type form. Moreover, this mutant slightly increased the activa-
of A11 and the N2 of G44. The imidazolone ring of the fluorophore tion of DFAME and nearly doubled the activation of DFHO (Fig. 2f).
does not make hydrogen bonds with the DNA molecule, although it The importance of residue 20 in conferring fluorophore selectivity
45°
G25
Q2 A11
C20
A11
G18 G18
C20 C20 G18 Q2
Q2
Q1
Q1
d e 1.0
f 2.0
G44
Normalized fluorescence
Normalized fluorescence
1.5
G26
T12
G25 0.5 1.0
0.5
C20 A11
0 0
G18
26 ,
.G 8
G
1C
1T
T• C
r.G C17 8I
T
A
A
de
, r C1
Q2
26
20
20
20
26
26
20
20
20
6
1
20
20
20
1
A1
A1
C G2
A1
G
20
G
C
25 , r.
G
C
C
C
•
C
G
17
17
r.
C
g h i j G44 T12
G26 G44 T12 G44 T12 G26
G26 T12 G44
G26
G25 G25
G25 G25
A11 A11 A11 A11
Q2 Q2 Q2 Q2
G18 G18 G18 G18
G20 T20 T20 T20
Fig. 2 | The Lettuce fluorophore-binding site. a, View of bound DFHBI-1T. f, Fluorescence of C20 mutants of Lettuce in the presence of DFHO (yellow bars)
Grey mesh depicts the |Fo| − |Fc| electron density map before building the or DFAME (red bars), normalized to wild-type bound to each fluorophore
fluorophore, contoured at 3.0 standard deviations above mean peak height (σ). (mean ± s.d., n = 3 technical replicates). g, View of the Lettuce C20G binding site
Grey and orange dashed lines denote hydrogen bonds and metal ion with bound DFHO. h, Lettuce C20T binding site with bound DFHO. i, Lettuce
coordination, respectively. Red spheres are water molecules. b, View rotated C20T binding site with bound DFHBI-1T. j, Lettuce C20T binding site with bound
45°. c, View of bound DFHO. d, View of bound DFAME. e, Fluorescence of DFAME. Grey mesh depicts the |Fo| − |Fc| electron density map before building
site-directed mutants of the binding site in the presence of DFHBI-1T, normalized the fluorophores, contoured at 3.0σ (c,g,h,i) or 2.0σ (d,j).
to wild-type (mean ± s.d., n = 3 technical replicates). r., ribonucleotide.
Circular permutation
T15 T15
A43 P1.1 P1.1
T28
G13
G13 120° Q2 Q2
G16 A43 G16
Q1 Q1
Q2 h 1.5
Normalized fluorescence
c d
nt
nt
53
39
1.0
Q2
Q2
G24 G24 0.5
C20
C20 A21
G9 Q1
G19 150° G9 G22
Q1 G19
0
G8 G46
A21
bp
bp
bp
bp
TC p
l/T p
P1 l/TC p
oo p
l
G46
de
G8
o
o
o
lo
lo
lo
6
1
G22
p
G47
=
cp
cp
cp
cp
cp
P1 de
l/T
l/l
P1
P1
P1
P1
P1
T23
de
A7 G47
de
A7
de
cp
T23
cp
cp
P1
cp
P1
P1
P1
e f i
1.5 1.5 1.5
Normalized fluorescence
nt
Normalized fluorescence
Normalized fluorescence
nt
37
46
1.0 1.0 1.0
0 0 p 0
lo de bp
ly n
4 p
P1 1cp = p
P2 el bp
4C 9C
5C 8C
5G T•T G
A4 l
3G
3C
G 3T
l
C
T
A
5A
2 A
A
de
de
de
on tio
P2 b
= loo
b
44
47
4G 29
5G 28
•T 29
44
44
A7
22
)
29
T1 14 T28
A4 8C
bp
A4
T1
A1 •T2
T1 •T2
A4
3
TC P2 = 2
1
op le
3
44
8
G
G
A1 •T
T1 T
G
G
G
•T
/T
G
=
A •
4T
P2
cp
P1
d
A1
cp
cp
(T
Fig. 3 | The Lettuce core and its structure-guided minimization. a, View of wild-type (mean ± s.d., n = 3 technical replicates). g, Schematic of circular
P1.1. b, Rotated view of a. c, View of the P2.1 element of Lettuce. d, Rotated view permutation (cp). h, Fluorescence of Lettuce constructs with permuted P1
of c. e, Fluorescence of site-directed mutants of P1.1 and P2.1 in the presence of (P1cp) in the presence of DFHBI-1T, normalized to wild-type (mean ± s.d., n = 3
DFHBI-1T, normalized to wild-type (mean ± s.d., n = 3 technical replicates). technical replicates). nt, nucleotides. i, Fluorescence of Lettuce mutant
Darker shaded bar, A14T•T29A, T15G•T28C mutant. f, Fluorescence of Lettuce A14T•T29A, T15G•T28C and shortened P1cp in the presence of DFHBI-1T,
constructs with shortened P1 in the presence of DFHBI-1T, normalized to normalized to wild-type (mean ± s.d., n = 3 technical replicates).
tertiary contacts (both common in folded RNAs), except when medi- and P2 (ref. 4). Those experiments suggested that stability of the two
ated through bound cations. paired elements is important for activity. By progressively shortening
We tested the functional importance of tertiary interactions in the P2, we found that at least three base pairs are needed for optimal func-
Lettuce core, and the possibility of optimizing its function, through tion (Fig. 3f). To shorten P1 while maintaining stability, we circularly
site-directed mutagenesis (Fig. 3e). In the wild-type form, A14 closes the permuted the DNA, such that its 5′ and 3′ termini are located in P2,
apex of the P1.1 turn by pairing through its Hoogsteen face to T29, which and P1 is closed by a loop (Fig. 3g). This rearrangement enabled us to
is at the base of P2 (Fig. 3a). The mutagenesis experiments showed that delete P1 entirely, shortening the DNA to 39 nucleotides (Fig. 3h) while
inversion of this pair to a T•A pair improved activity by 25%. However, retaining 85% of activity. We then improved activity by including the
strengthening the pair by converting it to either a C•G or G•C pair was A14T•T29A and T15G•T28C mutations that enhanced fluorescence
detrimental. This differed from the T15•T28 pair at the centre of the of the wild-type DNA (Fig. 3e) and shortened P2 to 4 nucleotides. All
P1.1 turn, which improved activity when converted to a C•G or G•C pair. these changes produced ‘Bibb Lettuce’, a fluorescent DNA tag about
A43 and A44 were generally intolerant of mutation or deletion, which 30% smaller (37 nucleotides) and brighter than the parental Let-
underscores the functional importance of these two nucleotides with tuce (Fig. 3i). As observed with Lettuce, CD spectra of Bibb Lettuce,
unusual conformation. Similarly, the inclined G22 of P2.1 was sensi- fluorophore-bound and fluorophore-free, supported the presence of
tive to mutation. A7, G8 and G47, which participate in the network of a G-quadruplex and folding independent of the fluorophore (Extended
hydrogen bonding in P2.1, were also sensitive to mutation. Data Fig. 5k). Fluorescence analysis of Bibb Lettuce showed a slightly
weaker dissociation constant compared with Lettuce (Extended Data
Fig. 5l). However, Bibb Lettuce showed a reduced dependency on Mg2+
Structure-guided optimization concentration (in the presence of physiological K+), exhibiting a K1/2
Initial experiments aimed at minimizing the length of Lettuce showed value at about 0.5 mM Mg2+ (Extended Data Fig. 5m). Bibb Lettuce
that the DNA molecule could be split by separating the strands of P1 showed a similar profile of fluorophore activation in the presence of
Normalized fluorescence
3′ 5′
5′ 3′ (100 nt)
Anneal
0.2
0
5′ 3′
3′ 5′ RNA + + + + + + + +
DN le A
ed A
A
A
1T
ds nea RN
al RN
no DN DN
RN
– – – – – –
d
ssDNA + +
BI
ds ds
H
n +
ne +
dsDNA – – + + – – + +
DF
ta A
an A
Annealing + + – + + + – +
DFHO – – – – + + + +
Fig. 4 | Detection of R-loop formation by Lettuce. a, Schematic of the R-loop replicates). c, Non-denaturing PAGE. Arrow denotes an R-loop. d, Fluorescence
assay. RNA is indicated by the red line. b, Fluorescence of DFHBI-1T in the of same gel as in c, after soaking in DFHO. Arrow denotes an R-loop. For
presence of buffer, RNA, dsDNA, dsDNA plus RNA not co-annealed, dsDNA plus uncropped gels, see Supplementary Fig. 1. The detection of R-loop formation
RNA co-annealed, normalized to wild-type Lettuce (mean ± s.d., n = 3 technical by non-denaturing PAGE was replicated three times.
different divalent cations (Extended Data Fig. 5n). In addition, ther- side chains, Phe64 and Tyr145 are at angles of about 45° to the plane of
mal stability comparisons of Lettuce and Bibb Lettuce showed that the fluorophore rather than stacking face-to-face (Fig. 5b). The all-RNA
the fluorophore increased the melting temperature of Lettuce from binding site of Spinach also hydrogen bonds to all available polar atoms
47 °C to 52 °C. By contrast, for Bibb Lettuce, the melting temperatures of the fluorophore, but unlike the protein, stacks face-to-face with the
were higher and basically unchanged in presence of the ligand (57 °C heterocycles of the small molecule, sandwiching it between a G-quartet
compared with 56 °C liganded and free, respectively) (Extended Data and a base triple. Spinach uses ribose 2′-OH groups as well as nucleobase
Fig. 5o). Altogether, our results indicate that Bibb Lettuce is more stable polar groups to construct a dense network of direct and water-mediated
than Lettuce even before fluorophore binding. hydrogen bonds around the fluorophore. Notably, the binding site of
the RNA molecule is organized in three distinct stacked layers, with
the fluorophore and the functionally essential37 unpaired residue G31
A fluorescent reporter for R-loops comprising the middle layer (Fig. 5c). Chili is the only fluorogenic RNA
A number of biological processes, including recombination, repair, thus far known to coordinate directly to its ligand through a buried
replication and transcription30,31, transiently produce ssDNA, often monovalent cation40. The cationic Chili fluorophore induces folding
as R-loops. An R-loop is a DNA duplex whereby one strand is displaced of the RNA and stacks between a G-quartet and the minor groove of a
locally by a complementary RNA, which results in extrusion of a ssDNA G•C pair (Fig. 5d). When compared to GFP, Spinach and Chili, Lettuce is
segment32,33. To test whether Lettuce can report on the formation noteworthy in only hydrogen bonding to the phenolate oxygen of the
of R-loops, we synthesized a 151 bp duplex DNA encoding Bibb Let- fluorophore (Fig. 5e). Similar to Chili, it uses a buried monovalent cation
tuce on one strand. When it is annealed to a 100-nucleotide RNA, an to coordinate directly to this anionic functional group. The Lettuce
R-loop should form (Fig. 4a). In this context, Bibb Lettuce fluoresced binding site also is organized in three layers, with the fluorophore•A11
around 30% as brightly as when in isolation, but at least 15 times more layer sandwiched between a G-quartet and a base triple. However, the
brightly than the free fluorophore (DFHBI-1T; Fig. 4b). Non-denaturing DNA molecule augments this with nucleobases that are not parallel
polyacrylamide gel (PAGE) analysis showed that although R-loop for- to the planes, including the unusual 55°-inclined A43 and G44 and
mation under these conditions was not efficient (Fig. 4c,d; R-loop the functionally important extrahelical C20. The more diverse use of
was confirmed by RNase H and RNase A treatment; Extended Data nucleobases by the DNA when compared to the RNA probably reflects
Fig. 6a), its formation was detectable by soaking the gel in DFHO and the unavailability of the ribose 2′-OH to participate in hydrogen bond
illuminating at 488 nm (Fig. 4d). To verify whether the system sup- networks and tertiary interactions abundant in structured RNAs such
ports longer and potentially more complex R-loops, we tested a 310 bp as ribose zippers42 and A-minor43 motifs.
duplex DNA encoding Bibb Lettuce on one strand. After annealing to Structures formed at the junction of four helical elements (4WJ)
a 260-nucleotide RNA, this system fluoresced about 50% as brightly (Fig. 5f) are widespread in both RNA and DNA. Prototypical 3D folds
as the isolated fluorogenic DNA (Extended Data Fig. 6b,c). Improved adopted by RNA and DNA 4WJs are the L-shaped transfer RNA (tRNA)
fluorescence of the longer complex suggests that strain in the DNA–RNA and the H-shaped Holliday junction, respectively (Fig. 5g,h). Numer-
complex is partially suppressing Bibb Lettuce activity. Overall, we show ous RNAs adopt a fold that is closely related to the latter, including
that extrusion of a single strand from a duplex DNA can be monitored the hairpin ribozyme (Fig. 5i) that differs from the Holliday junction
directly through fluorescence of Bibb Lettuce. in the angle subtended by the helical axes of its two coaxial stacks44.
The near-colinear 4WJ fold of Lettuce (Fig. 5j) is previously unknown8.
We analysed compactness based on the solvent-accessible surface
New principles of DNA 3D organization area of the structures of RNAs of diverse size and complexity as a func-
Our co-crystal structures of fluorophore-bound Lettuce provide the tion of molecular weight45 (Extended Data Fig. 6d and Supplemen-
atomic-level information on how a third category of biomolecule tary Table 1). The results showed that typical structured RNAs such
can activate a conditional fluorophore (Fig. 5a). Comparisons of the as ribozymes, aptamers and fluorogenic RNAs have a compactness of
fluorophore-binding sites of GFP34–36, the RNA aptamer Spinach in com- less than 2.0 Da Å–2 (for example, 1.98 Da Å–2 for tRNAPhe and 1.99 Da Å–2
plex with DFHBI37,38, the RNA aptamer Chili39,40 and Lettuce showed how for the hairpin ribozyme), with many approximately 50-nucleotide
the distinct chemical characteristics of the three biopolymers respond RNAs (comparable in mass to Lettuce) having a compactness of about
to similar selective pressure. GFP uses a combination of aromatic, 1.90 Da Å–2 (Extended Data Fig. 6d and Supplementary Table 1). Sim-
non-polar and polar amino acid side chains to surround its fluorophore, ple DNA structures ranged in compactness from about 1.7 Da Å–2 for a
and make hydrogen bonds with every polar atom in it. Even though the duplex dodecamer to 1.93 Da Å–2 for a stacked Holliday junction. Let-
maximally fluorescent species of the fluorophore is the phenolate41, tuce has a compactness of 2.05 Da Å–2, which is the largest thus far for a
GFP does not use a cation to bind the phenolic oxygen. Two aromatic DNA. This value is also comparable to some of the most compact RNAs
Gln94 G65
G26
Val61 G70
Arg96
His148
Tyr145 Gln183
Phe64 G30
Protein RNA
d G31
e
G14 A43
C17
G15 G26
C40 T12
G13
G41 A11
G32 G44
G18
G37 G10
G9
G25
G10 G45 C20
RNA DNA
f g i j
5′
3′
Fig. 5 | Comparison to RNA and protein counterparts. a, Molecular surfaces g–j, Cartoon representations of the L-shaped tRNA 4WJ (g), the H-shaped DNA
and cartoons of GFP 34, Spinach37, Chili40 and Lettuce, to scale. b, GFP Holliday junction (h), the hairpin ribozyme 4WJ (i) and the near-colinear 4WJ
fluorophore site34. c, Spinach bound to DFHBI37. d, Chili bound to DMHBO+ fold of the Lettuce DNA aptamer ( j), all coloured as in f.
(ref. 40). e, Lettuce bound to DFHBI-1T. f, Schematic representation of a 4WJ.
known, such as the aptamer domain of the guanidine III riboswitch RNA complexes of deoxyribozymes 9DB1 and Dz36 (refs. 49,50), this
(2.07 Da Å–2). Our structural analysis of Lettuce therefore suggests analysis shows that in those molecules, the majority of DNA residues
that relief from the steric bulk of the ribose 2′-OH functional group can are also in the C2′-endo conformation characteristic of deoxyribonu-
allow DNA to pack tightly, even though deoxyribose is less versatile in cleotides (when their extensive DNA–RNA heteroduplexes, which are
hydrogen bonding than ribose. constrained to the C3′-endo pucker by the RNA strand, are excluded).
Our crystallographic and cryo-EM analyses of Lettuce extend previ- Overall, our structural analysis of Lettuce indicates that functional
ous structural studies of smaller DNA aptamers, as well as DNA–RNA DNAs use structural strategies different from those of functional RNAs.
complexes, to a functional, all-DNA molecule. Lettuce is large enough Moreover, new principles of nucleic acid 3D organization will be forth-
to assemble a binding site by spatially juxtaposing residues separated coming from analyses of complex DNA molecules.
by as many as 35 nucleotides. Convergence into spatial proximity of
multiple residues distant in linear sequence (that is, with high contact
order46) is the key structural problem that biopolymers need to solve in Online content
assembling active sites47. Smaller DNA aptamers use nucleotides from Any methods, additional references, Nature Portfolio reporting summa-
the two strands of their often non-canonical duplex stems to assemble ries, source data, extended data, supplementary information, acknowl-
their binding sites (Extended Data Fig. 6e–i). By contrast, by virtue of its edgements, peer review information; details of author contributions
larger size, Lettuce can recruit nucleotides from four separate strands and competing interests; and statements of data and code availability
to construct its fluorophore-binding site. Lettuce has evolved to take are available at https://doi.org/10.1038/s41586-023-06229-8.
advantage of specific properties of DNA. Thus, an RNA composed of the
Lettuce sequence is inactive (Extended Data Fig. 6j). Moreover, analysis 1. Cruz, J. A. & Westhof, E. The dynamic landscapes of RNA architecture. Cell 136, 604–609
(2009).
of pucker angles showed that the majority of Lettuce nucleotides adopt
2. Westhof, E. & Leontis, N. B. An RNA-centric historical narrative around the Protein Data
the C2′-endo conformation that is characteristic of DNA. Only seven Bank. J. Biol. Chem. 296, 100555 (2021).
core deoxyriboses adopt the C3′-endo furanose pucker typical of RNA 3. Assmann, S. M., Chou, H. L. & Bevilacqua, P. C. Rock, scissors, paper: how RNA structure
informs function. Plant Cell https://doi.org/10.1093/plcell/koad026 (2023).
(or the rare O4′-endo form). This is a near-mirror of the distribution of
4. VarnBuhler, B. S., Moon, J., Dey, S. K., Wu, J. & Jaffrey, S. R. Detection of SARS-CoV-2 RNA
pucker angles in the RNA aptamer48 iSpinach (Extended Data Fig. 6k–n using a DNA aptamer mimic of green fluorescent protein. ACS Chem. Biol. 17, 840–853
and Supplementary Table 2). When extended to the structures of the (2022).
Extended Data Fig. 2 | Single-particle cryo-EM analysis of the unliganded 2D class averages of unliganded Lettuce from the final 2D classification.
Lettuce aptamer. a, Image processing workflow for unliganded Lettuce. d, Global resolution assessment by Fourier shell correlation curve with the
Processing steps denoted in red and black represent programs used in either 0.143 gold standard threshold. e, Distribution of orientations over azimuth
cryoSPARC or RELION, respectively. b, Representative of a motion-corrected and elevation angles for particles included in the calculation of the final map.
micrograph from dataset of 3769 dose-weighted micrographs. c, Representative
Extended Data Fig. 3 | See next page for caption.
Article
Extended Data Fig. 3 | Lettuce–DFHBI-1T in complex with thallium (I), green spheres represent K+ and Mg 2+, respectively. The bound DFHO molecule is
overall structures of Lettuce in complex with DFHO and DFAME, ball-and- shown in ball-and-stick representation with translucent spheres. DNA color as
stick representation of the tiers of Lettuce, and wall-eyed stereoviews of the in Fig. 1c. f, Orthogonal view of e. g, Cartoon representation of the Lettuce–
binding site. a, Cartoon representation of the Lettuce–DFHBI-1T Tl+ complex. DFAME complex. Arrows indicate 5′ to 3′ chain direction, purple and green
Arrows denote 5′ to 3′ chain direction, brown and green spheres represent Tl+ spheres represent K+ and Mg 2+, respectively. The bound DFAME molecule is
and Mg 2+, respectively. The bound DFHBI-1T molecule is shown in ball-and-stick shown in ball-and-stick representation with translucent spheres. DNA color as
representation with translucent spheres. DNA color as in Fig. 1c. b, Orthogonal in Fig. 1c. h, Orthogonal view of g. i, Ball-and-stick representation of the nine
view of a. c, Density-modified SAD electron density map for the Lettuce–DFHBI- tiers (numbered 1-9 in bold numerals) in the Lettuce core. Some background or
1T Tl+ complex contoured at 1.0 σ (blue mesh), superimposed on the final interaction nucleotides of different tiers are shown for clarity. Gray and orange
refined model. d, Emission spectra of Lettuce–DFHBI-1T in the presence of K+ dashed lines denote hydrogen bonds and metal coordination, respectively.
(solid green line) or Tl+ (dashed green line). e, Cartoon representation of the Secondary structure representation corresponds to that in Fig. 1e. Wall-eyed
Lettuce–DFHO complex. Arrows indicate 5′ to 3′ chain direction, purple and stereoviews of Figs. 2a–d and are shown on j, k, l, and m, respectively.
Extended Data Fig. 4 | See next page for caption.
Article
Extended Data Fig. 4 | Fluorescence titrations, overall structures and g, Cartoon representation of the C20T Lettuce–DFHO, i, –DFHBI1T, and
spectra of Lettuce C20 specificity mutants complexed with different k, –DFAME complexes. Arrows indicate 5′ to 3′ chain direction, purple spheres
fluorophores, and wall-eyed stereoviews of Lettuce C20 specificity mutants represent K+, and green spheres represent Mg 2+. The bound fluorophore
binding sites. Fluorescence of DFHBI-1T (green), DFHO (yellow), and DFAME molecules are shown in ball-and-stick representation with translucent spheres.
(red) titrated with a, Lettuce, b, Lettuce C20G mutant, and c, Lettuce C20T Asterisk marks the mutated residue. DNA color as in Fig. 1c. h, Orthogonal view
mutant (n = 3 technical replicates). d, Calculated Kd values for a, b, c (mean ± of g. j, Orthogonal view of i. l, Orthogonal view of k. m, Excitation and emission
s.d., n = 3 technical replicates). e, Cartoon representation of the C20G Lettuce– spectra of Lettuce and Lettuce C20T mutant in the presence of DFHBI-1T.
DFHO complex. Arrows indicate 5′-to-3′ chain direction, purple and green n, Excitation and emission spectra of Lettuce, Lettuce C20G mutant, and
spheres represent K+ and Mg 2+, respectively. Asterisk marks the mutated Lettuce C20T mutant in the presence of DFHO. o, Excitation and emission
residue. The bound DFHO molecule is shown in ball-and-stick representation spectra of Lettuce and Lettuce C20T mutant in the presence of DFAME. Wall-
with translucent spheres. DNA color as in Fig. 1c. f, Orthogonal view of e. eyed stereoviews of Figs. 2g–j and are shown on p, q, r, and s, respectively.
Extended Data Fig. 5 | Wall-eyed stereoviews of the Lettuce core, divalent Mg 2+ ion bound in the P2.1 loop. k, Circular dichroism spectra of Bibb Lettuce–
cation dependence of Lettuce fluorescence, and Bibb Lettuce DFHBI1T (green) and unliganded Bibb Lettuce (gray) recorded at 21 °C.
characterization. Wall-eyed stereoviews of Figs. 3a–d are shown on a, b, c, and l, Fluorescence of DFHBI-1T (green), DFHO (yellow), and DFAME (red) titrated
d, respectively. e, Fluorescence of Lettuce-DFHBI-1T as a function of Mg 2+ with Bibb Lettuce. Calculated Kd values are shown (mean ± s.d., n = 3 technical
concentration (in the presence of 150 mM K+). The half-maximal activity (K 1/2) is replicates). m, Fluorescence of Bibb Lettuce-DFHBI-1T as a function of Mg 2+
at 1.41 ± 0.14 mM Mg 2+ (mean ± s.d.; n = 3 technical replicates). f, Fluorescence concentration (in the presence of 150 mM K+). The half-maximal activity (K 1/2) is
activation of Lettuce–DFHBI-1T in 150 mM K+ alone, or supplemented with at 0.52 ± 0.06 mM Mg 2+ (mean ± s.d.; n = 3 technical replicates). n, Fluorescence
10 mM Mg 2+, Ca2+, or Mn2+ (mean ± s.d., n = 3 technical replicates). * denotes activation of Lettuce–DFHBI-1T in 150 mM K+ alone, or supplemented with
P= 0.0002 (two-sided t-test). No significancy (P= 0.251) between K+ + Mg 2+ and 10 mM Mg 2+, Ca2+, or Mn2+ (mean ± s.d., n = 3 technical replicates). * denotes
K+ + Mn2+ (two-sided t-test). g, Electrostatic potential of Lettuce mapped on its P = 0.012 (two-sided t-test). No significancy (P = 0.395) between K+ + Mg 2+ and
molecular surface. Color ramp from 0 to +20 k BT (white to red). h, Green sphere K+ + Mn2+ (two-sided t-test). o, Circular dichroism thermal analysis of Lettuce
shows Mg 2+ ion bound in the P1.1 loop. i, Molecular surface of Lettuce (same and Bibb Lettuce in the presence and absence of DFHBI1T at 290 nm.
color scheme as g) showing the P2.1 loop at its center. j, Green sphere shows
Article
Extended Data Fig. 6 | R-loop characterization, compactness study of RNA Lettuce used in the crystallographic studies, Split Lettuce, Lettuce with shorter
and DNA, and structure and functions of DNA and RNA aptamers. P1 used in fluorescence studies (Supplementary Table 3), RNA comprised of the
a, Autoradiogram of 6% non-denaturing PAGE demonstrating R-loop formation Lettuce sequence, and split Lettuce comprised by DNA (34 nts) and RNA (16 nts of
(arrow) showed on Fig. 4. For uncropped gel, see Supplementary Fig. 1. The 3' terminus) in the presence of DFHBI-1T (mean ± s.d., n = 3 technical replicates).
detection of R-loop formation experiment by non-denaturing PAGE was Asterisk denotes P = 0.002 (two-sided t-test). No significant difference (P= 0.407)
replicated three times. b, Schematic of 260-nt R-loop assay. c, Fluorescence between Lettuce w.t. and Lettuce with shorter P1 (two-sided t-test). k, Pucker
analysis of longer R-loop formation as depicted on b. Fluorescence of DFHBI-1T in angles of each deoxynucleotide of Lettuce–fluorophore complexes (mean of 8
the presence of buffer, RNA, 310-bp dsDNA, dsDNA + RNA not co-annealed, structures). l, Pucker angles of each deoxynucleotide of the RNA-ligating
dsDNA + RNA co-annealed, normalized to wild-type Lettuce (mean ± s.d., n = 3 deoxyribozyme 9DB1 (ref. 49; PDB:5CKK). Puckers of core deoxynucleotides
technical replicates). d, RNA and DNA compactness study plotted graph using are in magenta; puckers of deoxynucleotides that hybridize to the RNA product
data from Supplementary Table 1. e, Lettuce–DFHBI-1T co-crystal structure (this strand are in cyan. m, Pucker angles of each deoxynucleotide of the RNA-cleaving
study). f, NMR structure of the AMP-binding DNA aptamer in complex with two deoxyribozyme Dz36 (ref. 50; PDB:5XM8). Puckers of core deoxynucleotides are
AMP molecules (ref. 71; PDB:1AW4). g, NMR structure of a DNA aptamer bound to in orange; puckers of deoxynucleotides that hybridize to the RNA substrate
argininamide (ref. 72; PDB:1DB6). h, NMR structure of the OBA3 DNA aptamer strand are in gray. n, Pucker angles for each ribose of the RNA fluorogenic
bound to ochratoxin A (ref. 73; PDB:6J2W). i, NMR structure of the OBA36 DNA aptamer iSpinach in complex with DFHBI (ref. 48; PDB:5OB3). Individual puckers
aptamer bound to ochratoxin A (ref. 74; PDB:7W9N). j, Fluorescence of wild-type for all four aptamers are in Supplementary Table 2.
Extended Data Table 1 | Crystallographic statistics
Crystallographic statistics of Lettuce. One crystal was used for each data set. Values in parenthesis are for the highest resolution shell.
Article
Extended Data Table 2 | Crystallographic statistics
Crystallographic statistics of Lettuce mutants. One crystal was used for each dataset. Values in parenthesis are for the highest resolution shell.
Extended Data Table 3 | Cryo-EM data collection and processing parameters for unliganded Lettuce
Total extracted particles are defined as the particles obtained after Topaz picking. No model refinement was performed on the reconstructed map. Cryo-EM envelope of unliganded Lettuce was
superimposed on the Lettuce–DFHBI-1T co-crystal structure (see Methods section).