Professional Documents
Culture Documents
Hu W Et Al PNAS-2014 PDF
Hu W Et Al PNAS-2014 PDF
Edited by Anthony S. Fauci, National Institute of Allergy and Infectious Diseases, Bethesda, MD, and approved June 19, 2014 (received for review
March 19, 2014)
AIDS remains incurable due to the permanent integration of HIV-1 receptor 4 (CXCR4) and proviral DNA-encoding viral pro-
into the host genome, imparting risk of viral reactivation even after teins (8, 9). CCR5 gene-targeting ZFNs are in phase II clinical
antiretroviral therapy. New strategies are needed to ablate the trials for HIV-1/AIDS treatment (11). Also, various gene editing
viral genome from latently infected cells, because current methods technologies have recently been shown to remove the proviral
are too inefficient and prone to adverse off-target effects. To HIV-1 DNA from the host cell genome by targeting its highly
eliminate the integrated HIV-1 genome, we used the Cas9/guide conserved 5′ and 3′ long terminal repeats (LTRs) (12, 13). How-
RNA (gRNA) system, in single and multiplex configurations. We ever, introduction of nucleases into cells via these nuclease-based
identified highly specific targets within the HIV-1 LTR U3 region genomic editing approaches remains inefficient and partially se-
MEDICAL SCIENCES
that were efficiently edited by Cas9/gRNA, inactivating viral gene lective to remove the entire HIV-1 genome. Thus, the key barrier
expression and replication in latently infected microglial, promon- to their clinical translation is insufficient gene specificity to prevent
ocytic, and T cells. Cas9/gRNAs caused neither genotoxicity nor off- potential off-target effects (toxicities). To achieve highly specific
target editing to the host cells, and completely excised a 9,709-bp HIV-1 genome editing, we combined approaches to identify HIV-1
fragment of integrated proviral DNA that spanned from its 5′ to 3′ targets while circumventing host off-target effects. The resulting
LTRs. Furthermore, the presence of multiplex gRNAs within Cas9- highly specific Cas9-based method proved capable of eradicating
expressing cells prevented HIV-1 infection. Our results suggest that integrated HIV-1 DNA with high efficiency from latently infected
Cas9/gRNA can be engineered to provide a specific, efficacious pro-
human “reservoir” cell types, and prevented their infection
phylactic and therapeutic approach against AIDS.
by HIV-1.
CRISPR/Cas9 | genome editing | latency | retrovirus | reservoir Results
We assessed the ability of HIV-1–directed guide RNAs (gRNAs)
Fig. 1. Cas9/LTR-gRNA suppresses HIV-1 reporter virus production in CHME5 microglial cells latently infected with HIV-1. (A) Representative gating diagram
of EGFP flow cytometry shows a dramatic reduction in TSA-induced reactivation of latent pNL4-3-ΔGag-d2EGFP reporter virus by stably expressed Cas9 plus
LTR-A or -B, vs. empty U6-driven gRNA expression vector (U6-CAG). (B) SURVEYOR Cel-I nuclease assay of PCR product (−453 to +43 within LTR) from selected
LTR-A- or -B-expressing stable clones shows dramatic indel mutation patterns (arrows). (C and D) PCR fragment analysis shows a precise deletion of 190-bp
region between LTR-A and -B cutting sites (red arrowhead and arrow), leaving 306-bp fragment (black arrow) validated by TA-cloning and sequencing results.
(E−G) Subcloning of LTR-A/B stable clones reveals complete loss of reporter reactivation determined by EGFP flow cytometry (E) and elimination of pNL4-3-
ΔGag-d2EGFP proviral genome detected by standard (F) and real-time (G) PCR amplification of genomic DNA for EGFP and HIV-1 Rev response element (RRE);
β-actin is a DNA purification and loading control. (H) PCR genotyping of LTR-A/B subclones (#8, #13) using primers to amplify DNA fragment covering HIV-1
LTR U3/R/U5 regions (−411 to +129) shows indels (a, deletion; c, insertion) and “intact” or combined LTR (b).
2 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1405186111 Hu et al.
fragment via a 190-bp excision between cleavage sites A and B from cell population heterogeneity and/or incomplete genome
(Fig. 1H and Fig. S4), and a 682-bp fragment with a 175-bp in- editing. We also validated the ablation of HIV-1 genome by Cas9/
sertion and a 27-bp deletion at the LTR-A and -B sites, respectively LTR-A/B gRNAs in latently infected J-Lat T cells harboring
(Fig. S4C). The residual HIV-1 genome (Fig. 1 F−H) may reflect integrated HIV-R7/E-/EGFP (21) using flow cytometry analysis,
the presence of trace Cas9/gRNA-negative cells. These results SURVEYOR assay, and PCR genotyping (Fig. S6), supporting
indicate that LTR-targeting Cas9/gRNAs A/B eradicates the the results of previous reports on HIV-1 proviral deletion in
HIV-1 genome and blocks its reactivation in latently infected Jurkat T cells by Cas9/gRNA (12) and ZFN (13). Taken together,
microglial cells. our results suggest that the multiplex LTR-gRNAs/Cas9 system
The promonocytic U-937 cell subclone U1, an HIV-1 latency efficiently suppresses HIV-1 replication and reactivation in la-
model for infected perivascular macrophages and monocytes, is tently HIV-1–infected “reservoir” (microglial, monocytic, and T)
chronically HIV-1 infected and exhibits low-level constitutive cells typical of human latent HIV-1 infection, and in TZM-bI
viral gene expression and replication (19). GenomeWalker map- cells highly sensitive for detecting HIV-1 transcription and reac-
ping detected two integrated proviral DNA copies at chromo- tivation. Single or multiplex gRNAs targeting 5′ and 3′ LTRs ef-
somes Xp11-4 (Fig. 2A) and 2p21 (Fig. S5A) in U1 cells. A 9,935- fectively eradicated the entire HIV-1 genome.
bp DNA fragment representing the entire 9,709-bp proviral HIV-1 We next tested whether combined Cas9/LTR gRNAs can im-
DNA plus a flanking 226-bp X-chromosome-derived sequence munize cells against HIV-1 infection using stable Cas9/gRNAs-A
(Fig. 2A), and a 10,176-bp fragment containing 9,709-bp HIV-1 and -B-expressing TZM-bI-based clones (Fig. 3A). Two of seven
genome plus its flanking 2-chromosome-derived 467 bp (Fig. S5 A puromycin-selected subclones exhibited efficient excision of the
and B) were identified by the long-range PCR analysis of the 190-bp LTR-A/B site-spanning DNA fragment (Fig. 3B). How-
parental control or empty-vector (U6-CAG) U1 cells. The 226-bp ever, the remaining five subclones exhibited no excision (Fig. 3B)
and 467-bp fragments represent the predicted segment from the and no indel mutations as verified by Sanger sequencing. PCR
other copy of chromosome X and 2, respectively, which lacked the genotyping using primers targeting Cas9 and U6-LTR showed that
integrated proviral DNA. In U1 cells expressing LTR-A/B gRNAs none of these ineffective subclones retained the integrated copies
MEDICAL SCIENCES
and Cas9, we found two additional DNA fragments of 833 and 670 of Cas9/LTR-A/B gRNA expression cassettes. (Fig. S7 A and B).
bp in chromosome X and one additional 1,102-bp fragment in As a result, no expression of full-length Cas9 was detected (Fig. S7
chromosome 2. Thus, gRNAs A/B enabled Cas9 to excise the C and D). The long-term expression of Cas9/LTR-A/B gRNAs did
HIV-1 5′−3′ LTR-spanning viral genome segment in both chro- not adversely affect cell growth or viability, suggesting a low
mosomes. The 833-bp fragment includes the expected 226-bp occurrence of off-target interference with the host genome or
from the host genome and a 607-bp viral LTR sequence with a 27- Cas9-induced toxicity in this model. We assessed de novo HIV-1
bp deletion around the LTR-A site (Fig. 2 A and B). The 670-bp replication by infecting cells with the VSV-G-pseudotyped pNL4-3-
fragment encompassed a 226-bp host sequence and residual 444- ΔE-EGFP reporter virus (22), with EGFP positivity by flow
bp viral LTR sequence after 190-bp fragment excision (Fig. 1D), cytometry indicating HIV-1 replication. Unlike the control U6-
caused by gRNAs-A/B-guided cleavage at both LTRs (Fig. 2A). CAG cells, the cells stably expressing Cas9/LTR-A/B gRNAs failed
The additional fragments did not emerge via circular LTR in- to support HIV-1 replication at 2 d postinfection, indicating that
tegration, because it was absent in the parental U1 cells, and such they were immunized effectively against new HIV-1 infection (Fig. 3
circular LTR viral genome configuration occurs immediately after C and D). A similar immunity against HIV-1 was observed in Cas/
HIV-1 infection but is short lived and intolerant to repeated LTR-A/B gRNA expressing cells infected with native T-tropic
passaging (20). These cells exhibited substantially decreased HIV- X4 strain pNL4-3-ΔE-EGFP reporter virus (Fig. S8A) or native
1 viral load, shown by the functional p24 ELISA replication assay M-tropic R5 strains such as SF162 and JRFL (Fig. S8 B−D).
(Fig. 2C) and real-time PCR analysis (Fig. S5 C and D). The The appeal of Cas9/gRNA as an interventional approach rests
detectable but low residual viral load and reactivation may result on its highly specific on-target indel-producing cleavage (15, 16),
4 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1405186111 Hu et al.
Fig. 4. Off-target effects of Cas9/LTR-A/B on hu-
man genome. (A) SURVEYOR assay shows no indel
mutations in predicted/potential off-target regions
in human TZM-bI and U1 cells. LTR-A on-target re-
gion (A) was used as a positive control and empty
U6-CAG vector (U6) as a negative control. (B−D)
Whole-genome sequencing of LTR-A/B stable TZM-
bI subclone showing the numbers of called indels in
the U6-CAG control and LTR-A/B samples (B), de-
tailed information on 10 called indels near gRNA
MEDICAL SCIENCES
target sites in both samples (C), and distribution of
off-target called indels (D).
use of newly developed Cas9 double-nicking (23) and RNA- presents an important next step to assess the ability of Cas9 to
guided FokI nuclease (31, 32) may further assist identification of eradicate viral reservoirs in vivo. Moreover, in light of recent
new targets within the various conserved regions of HIV-1 with data illustrating efficient in vitro genome editing using a mixture
reduced off-target effects. of Cas9/gRNA and DNA (39–42), one may explore various sys-
More recently, a clinical trial using the ZFN gene editing tems for delivery of Cas9/LTR-gRNA via various routes for
strategy was launched to disrupt the gene encoding the HIV-1 immunizing high-risk subjects. Once advanced, one may use gene
coreceptor, CCR5 (8, 9, 11). Functional knockout of CCR5 in therapies (viral vector and nanoparticle) and transplantation of
autologous CD4 T cells of a small cohort of patients revealed autologous Cas9/gRNA-modified bone marrow stem/progenitor
that in one out of four enrolled subjects, the viral load remained cells (43, 44) or inducible pluripotent stem cells for eradicating
undetectable at the time of treatment (33). Similarly, TALEN HIV-1 infection.
and Cas9 have been tested experimentally for efficient disruption Here, we demonstrated the high specificity of Cas9/gRNAs
of CCR5 and CXCR4 (9, 28, 34–37); therefore, taking them into in editing HIV-1 target genome. Results from subclone data
consideration for clinical trials is anticipated. Whether or not the revealed the strict dependence of genome editing on the pres-
strategies targeting HIV-1 entry can reach the “sterile” cure of
ence of both Cas9 and gRNA. Moreover, only one nucleotide
AIDS remains to be seen. Our results show that the HIV-1 Cas9/
mismatch in the designed gRNA target will disable the editing
gRNA system has the ability to target more than one copy of the
potency. In addition, all four of our designed LTR gRNAs
LTR, which are positioned on different chromosomes, suggest-
ing that this genome-editing system can alter the DNA sequence worked well with different cell lines, indicating that the editing is
of HIV-1 in latently infected patient’s cells harboring multiple more efficient in the HIV-1 genome than the host cellular ge-
proviral DNAs. To further ensure high editing efficacy and consis- nome, wherein not all designed gRNAs are functional, which
tency of our technology, one may consider the most stable region of may be due to different epigenetic regulation, variable genome
HIV-1 genome as a target to eradicate HIV-1 in patient samples, accessibility, or other reasons. Given the ease and rapidity of
which may not harbor only one strain of HIV-1. Alternatively, one Cas9/gRNA development, even if HIV-1 mutations confer re-
may develop personalized treatment modalities based on the data sistance to one Cas9/gRNA-based therapy, as described above,
from deep sequencing of the patient-derived viral genome be- HIV-1 variants can be genotyped to enable another personalized
fore engineering therapeutic Cas9/gRNA molecules. therapy for individual patients (10).
Our results also demonstrate, for the first time to our knowl-
edge, that Cas9/gRNA genome editing can be used to immunize Materials and Methods
cells against HIV-1 infection. The preventative vaccination is Plasmid Preparation. Vectors containing human Cas9 and gRNA expression
independent of HIV-1 strain’s diversity because the system tar- cassette, pX260, and pX330 (Addgene) were used to create various constructs,
LTR-A, -B, -C, and -D (for details, see SI Materials and Methods).
gets genomic sequences regardless of how the viruses enter the
infected cells. Interestingly, the preexistence of the Cas9/gRNA
Cell Culture and Stable Cell Lines. TZM-bI reporter and U1 cell lines were
system in cells leads to a rapid elimination of the new HIV-1
obtained from the National Institutes of Health (NIH) AIDS Reagent Program,
before it integrates into the host genome, just like the way by and CHME5 microglial cells were described previously (17). The detailed
which the bacteria defense system evolved to combat phage in- procedure for cell growth or preparation of stable cell lines is described in SI
fection (38). Similarly, a gene-editing-based vaccine strategy Materials and Methods.
may be effective in eradicating postintegrated HIV-1 genome
and newly packaged proviruses in cells. Therefore, investigation Immunocytochemistry and Western Blot. Standard methods for immunocy-
of such HIV-1 vaccination in various latent reservoir cells and tochemical observation of the cells and evaluation of protein expression by
animal models with stable expression of Cas9/LTR-gRNAs Western blot were used as described in detail in SI Materials and Methods.
1. UNAIDS (2012) Global Report: UNAIDS Report on the Global AIDS Epidemic 2012 23. Ran FA, et al. (2013) Double nicking by RNA-guided CRISPR Cas9 for enhanced ge-
(Joint United Nations Programme on HIV/AIDS, Geneva). nome editing specificity. Cell 154(6):1380–1389.
2. Taylor BS, Wilkin TJ, Shalev N, & Hammer SM (2013) CROI 2013: Advances in anti- 24. Cho SW, et al. (2014) Analysis of off-target effects of CRISPR/Cas-derived RNA-guided
retroviral therapy. Top Antiviral Med 21(2):75–89. endonucleases and nickases. Genome Res 24(1):132–141.
3. Eisele E, Siliciano RF (2012) Redefining the viral reservoirs that prevent HIV-1 eradi- 25. Fu Y, et al. (2013) High-frequency off-target mutagenesis induced by CRISPR-Cas
cation. Immunity 37(3):377–388. nucleases in human cells. Nat Biotechnol 31(9):822–826.
4. Chun TW, et al. (1997) Quantification of latent tissue reservoirs and total body viral 26. Gabriel R, et al. (2011) An unbiased genome-wide analysis of zinc-finger nuclease
load in HIV-1 infection. Nature 387(6629):183–188. specificity. Nat Biotechnol 29(9):816–823.
5. Chun TW, Fauci AS (2012) HIV reservoirs: Pathogenesis and obstacles to viral eradi- 27. Pattanayak V, et al. (2013) High-throughput profiling of off-target DNA cleavage
cation and cure. AIDS 26(10):1261–1268.
reveals RNA-programmed Cas9 nuclease specificity. Nat Biotechnol 31(9):839–843.
6. Johnson TP, et al. (2013) Induction of IL-17 and nonclassical T-cell activation by HIV-
28. Cho SW, Kim S, Kim JM, Kim JS (2013) Targeted genome engineering in human cells
Tat protein. Proc Natl Acad Sci USA 110(33):13588–13593.
with the Cas9 RNA-guided endonuclease. Nat Biotechnol 31(3):230–232.
7. Zhang J, Crumpacker C (2013) Eradication of HIV and cure of AIDS, now and how?
29. Zhang F, Wen Y, Guo X (2014) CRISPR/Cas9 for genome editing: progress, implications
Front Immunol 4:337.
and challenges. Hum Mol Genet, 10.1093/hmg/ddu125.
8. Stone D, Kiem HP, Jerome KR (2013) Targeted gene disruption to cure HIV. Curr Opin
30. Jinek M, et al. (2012) A programmable dual-RNA-guided DNA endonuclease in
HIV AIDS 8(3):217–223.
9. Manjunath N, Yi G, Dang Y, Shankar P (2013) Newer gene editing technologies to- adaptive bacterial immunity. Science 337(6096):816–821.
ward HIV gene therapy. Viruses 5(11):2748–2766. 31. Tsai SQ, et al. (2014) Dimeric CRISPR RNA-guided FokI nucleases for highly specific
10. Mali P, Esvelt KM, Church GM (2013) Cas9 as a versatile tool for engineering biology. genome editing. Nat Biotechnol 32(6):569–576.
Nat Methods 10(10):957–963. 32. Guilinger JP, Thompson DB, Liu DR (2014) Fusion of catalytically inactive Cas9 to FokI
11. Hofer U, et al. (2013) Pre-clinical modeling of CCR5 knockout in human hematopoietic stem nuclease improves the specificity of genome modification. Nat Biotechnol 32(6):577–582.
cells by zinc finger nucleases using humanized mice. J Infect Dis 208(Suppl 2):S160–S164. 33. Tebas P, et al. (2014) Gene editing of CCR5 in autologous CD4 T cells of persons in-
12. Ebina H, Misawa N, Kanemura Y, Koyanagi Y (2013) Harnessing the CRISPR/Cas9 fected with HIV. N Engl J Med 370(10):901–910.
system to disrupt latent HIV-1 provirus. Sci Rep 3:2510. 34. Cradick TJ, Fine EJ, Antico CJ, Bao G (2013) CRISPR/Cas9 systems targeting β-globin and
13. Qu X, et al. (2013) Zinc-finger-nucleases mediate specific and efficient excision of HIV-1 CCR5 genes have substantial off-target activity. Nucleic Acids Res 41(20):9584–9592.
proviral DNA from infected and latently infected human T cells. Nucleic Acids Res 41(16): 35. Yang L, et al. (2013) Optimization of scarless human stem cell genome editing. Nucleic
7771–7782. Acids Res 41(19):9049–9061.
14. Hsu PD, et al. (2013) DNA targeting specificity of RNA-guided Cas9 nucleases. Nat 36. Mussolino C, et al. (2014) TALENs facilitate targeted genome editing in human cells
Biotechnol 31(9):827–832. with high specificity and low cytotoxicity. Nucleic Acids Res 42(10):6762–6773.
15. Mali P, et al. (2013) RNA-guided human genome engineering via Cas9. Science 37. Liu J, Gaj T, Patterson JT, Sirk SJ, Barbas CF, 3rd (2014) Cell-penetrating peptide-
339(6121):823–826. mediated delivery of TALEN proteins via bioconjugation for genome engineering.
16. Cong L, et al. (2013) Multiplex genome engineering using CRISPR/Cas systems. Science PLoS ONE 9(1):e85755.
339(6121):819–823. 38. Horvath P, Barrangou R (2010) CRISPR/Cas, the immune system of bacteria and ar-
17. Wires ES, et al. (2012) Methamphetamine activates nuclear factor kappa-light-chain-
chaea. Science 327(5962):167–170.
enhancer of activated B cells (NF-κB) and induces human immunodeficiency virus
39. Chen H, Choi J, Bailey S (2014) Cut site selection by the two nuclease domains of the
(HIV) transcription in human microglial cells. J Neurovirol 18(5):400–410.
Cas9 RNA-guided endonuclease. J Biol Chem 289(19):13284–13294.
18. Derdeyn CA, et al. (2000) Sensitivity of human immunodeficiency virus type 1 to the
40. Kim JM, Kim D, Kim S, Kim JS (2014) Genotyping with CRISPR-Cas-derived RNA-guided
fusion inhibitor T-20 is modulated by coreceptor specificity defined by the V3 loop of
endonucleases. Nat Commun 5:3157.
gp120. J Virol 74(18):8358–8367.
41. Jinek M, et al. (2013) RNA-programmed genome editing in human cells. eLife
19. Folks TM, Justement J, Kinter A, Dinarello CA, Fauci AS (1987) Cytokine-induced expres-
sion of HIV-1 in a chronically infected promonocyte cell line. Science 238(4828):800–802. 2:e00471.
20. Pace MJ, Graf EH, O’Doherty U (2013) HIV 2-long terminal repeat circular DNA is 42. Karvelis T, Gasiunas G, Siksnys V (2013) Programmable DNA cleavage in vitro by Cas9.
stable in primary CD4+T Cells. Virology 441(1):18–21. Biochem Soc Trans 41(6):1401–1406.
21. Jordan A, Bisgrove D, Verdin E (2003) HIV reproducibly establishes a latent infection 43. Li L, et al. (2013) Genomic editing of the HIV-1 coreceptor CCR5 in adult hematopoietic
after acute infection of T cells in vitro. EMBO J 22(8):1868–1877. stem and progenitor cells using zinc finger nucleases. Mol Ther 21(6):1259–1269.
22. Zhang H, et al. (2004) Novel single-cell-level phenotypic assay for residual drug sus- 44. Younan P, Kowalski J, Kiem HP (2014) Genetically modified hematopoietic stem cell
ceptibility and reduced replication capacity of drug-resistant human immunodefi- transplantation for HIV-1-infected patients: Can we achieve a cure? Mol Ther 22(2):
ciency virus type 1. J Virol 78(4):1718–1729. 257–264.
6 of 6 | www.pnas.org/cgi/doi/10.1073/pnas.1405186111 Hu et al.
Supporting Information
Hu et al. 10.1073/pnas.1405186111
SI Materials and Methods TZM-bI cells cultured in six-well plate were solubilized in 200 μL
Plasmid Preparation. DNA segment expressing long-term repeats of Triton X-100-based lysis buffer containing 20 mM Tris·HCl
(LTR)-A or LTR-B for precrRNA was cloned into the pX260 (pH 7.4), 1% Triton X-100, 5 mM EDTA, 5 mM DTT, 150 mM
vector that contains the puromycin selection gene (Addgene, NaCl, 1 mM phenylmethylsulfonyl fluoride, 1× nuclear extraction
plasmid #42229). DNA segment expressing LTR-C or LTR-D proteinase inhibitor mixture (Cayman Chemical), 1 mM sodium
for the chimeric crRNA-tracrRNA was cloned into the pX330 orthovanadate, and 30 mM NaF. Cell lysates were rotated at 4 °C
vector (Addgene, plasmid #42230). Both vectors contain a hu- for 30 min. Nuclear and cellular debris was cleared by centrifu-
manized Cas9 coding sequence driven by a CAG promoter and gation at 20,000 × g for 20 min at 4 °C. Equal amounts of lysate
a gRNA expression cassette driven by a human U6 promoter (1). proteins (20 μg) were denatured by boiling for 5 min in sodium
The vectors were digested with BbsI and treated with Antarctic dodecyl sulfate (SDS) sample buffer, fractionated by SDS-poly-
Phosphatase, and the linearized vector was purified with a Quick acrylamide gel electrophoresis in Tris-glycine buffer, and trans-
ferred to nitrocellulose membrane (BioRad). The SeeBlue
nucleotide removal kit (Qiagen). A pair of oligonucleotides for
prestained standards (Invitrogen) were used as a molecular
each targeting site (Table S2, AlphaDNA) was annealed, phos-
weight reference. Blots were blocked in 5% (wt/vol) BSA/Tris-
phorylated, and ligated to the linearized vector. The gRNA ex- buffered saline (pH 7.6) plus 0.1% Tween-20 (TBS-T) for 1 h
pression cassette was sequenced with U6 sequencing primer (Table and then incubated overnight at 4 °C with mouse anti-Flag M2
S2) in GENEWIZ. For pX330 vectors, we designed a pair of monoclonal antibody (1:1,000, Sigma) or mouse anti-GAPDH
universal PCR primers with overhang digestion sites (Table S2) monoclonal antibody (1:3,000, Santa Cruz Biotechnology). After
that can tease out the gRNA expression cassette (U6-gRNA- washing with TBS-T, the blots were incubated with IRDye 680LT-
crRNA-stem-tracrRNA) for direct transfection or subcloning to conjugated antimouse antibody for 1 h at room temperature.
other vectors. Membranes were scanned and analyzed using an Odyssey In-
frared Imaging System (LI-COR Biosciences).
Cell Culture. TZM-bI reporter cell line from John C. Kappes,
Xiaoyun Wu, and Tranzyme Inc., U1/HIV-1 cell line from Firefly Luciferase Assay. Cells were lysed 24 h posttreatment using
Thomas Folks, and J-Lat full length clone from Eric Verdin were Passive Lysis Buffer (Promega) and assayed with a Luciferase
obtained through the National Institute of Health (NIH) AIDS Reporter Gene Assay kit (Promega) according to the protocol of
Reagent Program, Division of AIDS, National Institute of Allergy the manufacturer. Luciferase activity was normalized to the
and Infectious Diseases, NIH. CHME5/HIV fetal microglia cell number of cells determined by parallel MTT assay (Vybrant;
lines were generated as previously described (2). TZM-bI and Invitrogen).
CHME5 cells were cultured in DMEM high glucose supple-
mented with 10% (vol/vol) heat-inactivated FBS (FBS) and 1% p24 ELISA. After infection or reactivation, the HIV-1 viral load
penicillin/streptomycin. U1 and J-Lat cells were cultured in levels in the supernatants were quantified by p24 Gag ELISA
RPMI 1640 containing 2.0 mM L-glutamine, 10% FBS, and 1% (Advanced BioScience Laboratories, Inc.) following the manu-
penicillin/streptomycin. facturer’s protocol. To assess the cell viability upon treatments,
MTT assay was performed in parallel according to the manu-
Stable Cell Lines and Subcloning. TZM-bl or CHME5/HIV cells facturer’s protocol (Vybrant; Invitrogen).
were seeded in six-well plates at 1.5 × 105 cells/well and trans-
fected using Lipofectamine 2000 reagent (Invitrogen) with 1 μg EGFP Flow Cytometry. Cells were trypsinized, washed with PBS,
of pX260 (for LTR-A and -B) or 1 μg/0.1 μg of pX330/pX260 and fixed in 2% paraformaldehyde for 10 min at room temper-
(for LTR-C and -D) plasmids. Next day, cells were transferred ature, then washed twice with PBS and analyzed using a Guava
into 100-mm dishes and incubated with growth medium con- EasyCyte Mini flow cytometer (Guava Technologies).
taining 1 μg/mL of puromycin (Sigma). Two weeks later, surviving
HIV-1 Reporter Virus Preparation and Infections. HEK293T cells
cell colonies were isolated using cloning cylinders (Corning). U1
were transfected using Lipofectamine 2000 reagent (Invitrogen)
and J-Lat cells (1.5 × 105) were electroporated with 1 μg of DNA
with pNL4-3-ΔE-EGFP (NIH AIDS Research and Reference
using 10-μL tip, 3 × 10 ms 1,400-V impulses at The Neon Trans-
Reagent Program). For pseudotyped pNL4-3-ΔE-EGFP, the
fection System (Invitrogen). Cells were selected with 0.5 μg/mL of VSV-G vector was cotransfected. After 48 h, the supernatant
puromycin for two weeks. The stable clones were subcultured using was collected, 0.45-μm filtered, and titered in HeLa cells using
a limited dilution method in 96-well plates, and single cell-derived expressed EGFP as an infection marker. HIV-1 SF162 and JRFL
subclones were maintained for further studies. were obtained through the NIH AIDS Research and Reference
Reagent Program: HIV-1 SF162 from Dr. Jay Levy and HIV-1
Immunocytochemistry and Western Blot. The Cas9/gRNA stable
JRFL from Dr. Irvin Chen. Infectious virus was prepared from
expression TZM-bI cells were cultured in eight-well chamber slides phytohaemagglutinin-stimulated peripheral blood mononuclear
for 2 d and fixed for 10 min in 4% (wt/vol) paraformaldehyde/PBS. cells (5 μg/ml, 48 h) and titrated by 50% Tissue Culture Infective
After three rinses, the cells were treated with 0.5% Triton X-100/ Dose (TCID50) assay in TZM-bl cells. For viral infection, stable
PBS for 20 min and blocked in 10% (vol/vol) donkey serum for Cas9/gRNA TZM-bI cells were incubated 2 h with a diluted
1 h. Cells were incubated overnight at 4 °C with mouse anti-Flag viral stock, and washed twice with PBS. At 2 and 4 d post-
M2 primary antibody (1:500, Sigma). After rinsing three times, cells infection, cells were collected, fixed, and analyzed by flow cy-
were incubated for 1 h with donkey antimouse Alexa-Fluor-594 tometry for EGFP expression, or genomic DNA purification was
secondary antibodies, and incubated with Hoechst 33258 for 5 min. performed for PCR and whole-genome sequencing.
After three rinses with PBS, the cells were coverslipped with anti-
fading aqueous mounting media (Biomeda) and analyzed under Genomic DNA Purification, PCR, TA Cloning, and Sanger Sequencing.
a Leica DMI6000B fluorescence microscope. Genomic DNA was isolated from cells using an ArchivePure DNA
Hu et al. www.pnas.org/cgi/content/short/1405186111 1 of 13
cell/tissue purification kit (5PRIME) according to the protocol Phusion High-Fidelity PCR Kit (New England Biolabs) following
recommended by the manufacturer. One hundred nanograms of the manufacturer’s protocol. The PCR products were visualized
extracted DNA were subjected to PCR using a high-fidelity FailSafe on 1% agarose gel and validated by Sanger sequencing.
PCR kit (Epicentre) using primers listed in Table S2. Three steps
of standard PCR were carried out for 30 cycles with 55 °C annealing SURVEYOR Assay. The presence of mutations in PCR products was
and 72 °C extension. The products were resolved in 2% (wt/vol) tested using a SURVEYOR Mutation Detection Kit (Transgenomic)
agarose gel. The bands of interest were gel-purified and cloned according to the protocol of the manufacturer. Briefly, heteroge-
into pCRII T-A vector (Invitrogen), and the nucleotide sequence neous PCR products were denatured for 10 min in 95 °C and hy-
of individual clones was determined by sequencing at Genewiz bridized by gradual cooling using a thermocycler. Next, 300 ng of
using universal T7 and/or SP6 primers. hybridized DNA (9 μL) was subjected to digestion with 0.25 μL of
SURVEYOR Nuclease in the presence of 0.25 μL SURVEYOR
Conventional and Real-Time Reverse Transcription-PCR. For total Enhancer S and 15 mM MgCl2 for 4 h at 42 °C. Then, Stop
RNA extraction, cells were processed with an RNeasy Mini kit Solution was added and samples were resolved in 2% agarose gel
(Qiagen) as per manufacturer’s instructions. The potentially re- together with equal amounts of undigested PCR products.
sidual genomic DNA was removed through on-column DNase Some PCR products were used for restriction fragment length
digestion with an RNase-Free DNase Set (Qiagen). One micro- polymorphism analysis. Equal amounts of PCR products were
gram of RNA for each sample was reversely transcribed into digested with BsaJI. Digested DNA was separated on an ethi-
cDNAs using random hexanucleotide primers with a High-Capacity dium bromide-contained agarose gel (2%). For sequencing, PCR
cDNA Reverse Transcription Kit (Invitrogen). Conventional PCR products were cloned using a TA Cloning Kit Dual Promoter
was performed using a standard protocol. with pCRII vector (Invitrogen). The insert was confirmed by
Quantitative PCR (qPCR) analyses were carried out in a digestion with EcoRI, and positive clones were sent to Genwiz
LightCycler480 (Roche) using an SYBR Green PCR Master Mix for Sanger sequencing.
Kit (Applied Biosystems) as described previously (3). The RT
reactions were diluted to 5 ng of total RNA per microliter of Selection of LTR Target Sites and Prediction of Potential Off-Target
reactions, and 2 μL was used in a 20-μl PCR. For qPCR analysis Sites. For initial studies, we obtained the LTR promoter sequence
of HIV-1 proviruses, 50 ng of genomic DNA were used. The (-411 to -10) of the integrated lentiviral LTR-luciferase reporter
primers were synthesized in AlphaDNA and shown in Table S2. by TA-cloning sequencing of PCR products from the genome of
The primers for human housekeeping genes GAPDH and RPL13A human TZM-bI cells because of potential mutation of LTR
were obtained from RealTimePrimers. Each sample was tested in during passaging. This promoter sequence has 100% match to the
triplicate. Cycle threshold (Ct) values were obtained graphically 5′-LTR of pHR’-CMV-LacZ lentiviral vector (AF105229). Thus,
for the target genes and housekeeping genes. The difference in sense and antisense sequences of the full-length pHR’ 5′-LTR
Ct values between the housekeeping gene and target gene was (634 bp) were used to search for Cas9/gRNA target sites con-
represented as ΔCt values. The ΔΔCt values were obtained by taining 20 bp gRNA targeting sequence plus the PAM sequence
subtracting the ΔCt values of control samples from those of (NRG) using Jack Lin’s CRISPR/Cas9 gRNA finder tool (http://
experimental samples. Relative fold or percentage change was spot.colorado.edu/∼slin/cas9.html). The number of potential off-
calculated as 2-ΔΔCt. In some cases, absolute quantification was targets with exact match was predicted by blasting each gRNA
performed using the pNL4-3-ΔE-EGFP plasmid spiked in hu- targeting sequence plus NRG (AGG, TGG, GGG and CGG;
man genomic DNA as a standard. The number of HIV-1 viral AAG, TAG, GAG, CAG) against all available human genomic
copies was calculated based on standard curve after normaliza- and transcript sequences using the NCBI/blastn suite with E-value
tion with housekeeping gene. cutoff 1,000 and word size 7. After pressing Control + F, copy/paste
the target sequence (1-23 through 9–23 nucleotides) and find the
GenomeWalker Link PCR and Long-Range PCR. The integration sites number of genomic targets with 100% match to the target se-
of HIV-1 in host cells were identified using a Lenti-X Integration quence. The number of off-targets for each search was divided
Site Analysis Kit (Clontech) following the manufacturer’s in- by 3 because of repeated genome library.
struction. Briefly, high-quality genomic DNAs were extracted
from U1 cells using a NucleoSpin Tissue kit (Clontech). To con- Whole-Genome Sequencing and Bioinformatics Analysis. The control
struct the viral integration libraries, each genomic DNA sample was subclone C1 and experimental subclone AB7 of TZM-bI cells
digested with blunt-end-generating digestion enzymes DraI, SspI, or were validated for target cut efficiency and functional suppression
Hpal separately overnight at 37 °C. The digestion efficiency was of the LTR-luciferase reporter. The genomic DNA was isolated
verified by electrophoresis on 0.6% agarose. The digested DNA was with NucleoSpin Tissue kit (Clontech). The DNA samples were
purified using a NucleoSpin Gel and PCR Clean-Up kit fol- submitted to the NextGen sequencing facility at Temple Uni-
lowed by ligation of the digested genomic DNA fragments to versity Fox Chase Cancer Center. Duplicated genomic DNA li-
GenomeWalker Adaptor at 16 °C overnight. The ligation reaction braries were prepared from each subclone using a NEBNext Ultra
was stopped by incubation at 70 °C for 5 min and diluted 5 times DNA Library Prep Kit for Illumina (New England Biolab) fol-
with TE buffer. The primary PCR was performed on the DNA lowing the manufacturer’s instruction. All libraries were sequenced
segments with adaptor primer 1 (AP1) and LTR-specific primer 1 with paired-end 141-bp reads in two Illumina Rapid Run flowcells
(LSP1) using Advantage 2 Polymerase Mix followed by a second- on HiSeq 2500 instrument (Illumina). Demultiplexed read data
ary (nested) PCR using AP2 and LSP2 primers (Table S2). The from the sequenced libraries were sent to AccuraScience, LLC
secondary PCR products were separated on 1.5% ethidium bro- (http://www.accurascience.com) for professional bioinformatics
mide-containing agarose gel. The major bands were gel-purified analysis. Briefly, the raw reads were mapped against human ge-
and cloned into pCRII T-A vector (Invitrogen), and the nucleo- nome (hg19) and HIV-1 genome by using Bowtie2 (4). A genomic
tide sequence of individual clones was determined by sequencing analysis toolkit (GATK, version 2.8.1) was used for the duplicated
at Genewiz using universal T7 and SP6 primers. read removal, local alignment, base quality recalibration, and
The sequence reads were analyzed by NCBI BLAST searching. indel calling. The confidence scores 10 and 30 were the thresholds
Two integration sites of HIV-1 in U1 cells were identified in for low quality (LowQual) and high confidence calling (PASS).
chromosomes X and 2. A pair of primers covering each in- The potential off-target sites of LTR-A and LTR-B with various
tegration site (Table S2) was synthesized in AlphaDNA. Long- mismatches were predicted by NCBI/blastn suite as described
range PCR using the U1 genomic DNA was performed with a above and by a CRISPR Design Tool (http://crispr.mit.edu/).
Hu et al. www.pnas.org/cgi/content/short/1405186111 2 of 13
All of the potential gRNA target sites (Table S3) were used to Statistical Analysis. The quantitative data represented mean ± SD
map the ±300-bp regions around each indel identified by GATK. from three to five independent experiments, and were evaluated
The locations of the overlapped regions in the human genome by Student t test or ANOVA and Newman-Keuls multiple
and HIV-1 genome were compared between the control C1 and comparison test. A P value < 0.05 or 0.01 was considered as a
experimental AB7. statistically significant difference.
1. Cong L, et al. (2013) Multiplex genome engineering using CRISPR/Cas systems. Science 3. Hu W, Mahavadi S, Li F, Murthy KS (2007) Upregulation of RGS4 and downregulation
339(6121):819–823. of CPI-17 mediate inhibition of colonic muscle contraction by interleukin-1beta. Am J
2. Wires ES, et al. (2012) Methamphetamine activates nuclear factor kappa-light-chain- Physiol Cell Physiol 293(6):C1991–C2000.
enhancer of activated B cells (NF-κB) and induces human immunodeficiency virus (HIV) 4. Langmead B, Trapnell C, Pop M, Salzberg SL (2009) Ultrafast and memory-efficient
transcription in human microglial cells. J Neurovirol 18(5):400–410. alignment of short DNA sequences to the human genome. Genome Biol 10(3):R25.
Fig. S1. LTR U3 sequence of the integrated lentiviral LTR-firefly luciferase reporter identified by TA cloning and sequencing of PCR product (−411 to −10) from
the genomic DNA of human TZM-bI cells. The protospacer and PAM (NGG) sequences of four gRNAs (LTRs A−D) and the predicted binding sites of indicated
transcription factors are highlighted. The precise cleavage sites are marked with scissors; +1 indicates the transcriptional start site.
Hu et al. www.pnas.org/cgi/content/short/1405186111 3 of 13
Fig. S2. LTR-C and LTR-D remarkably suppress TSA-induced reactivation of latent pNL4-3-ΔGag-d2EGFP virus in CHME5 microglia cells. (A) Diagram sche-
matically showing pNL4-3-ΔGag-d2EGFP vector containing transactivator of transcription (Tat), Rev, Env, Vpu, and Nef with the reporter gene d2EGFP. (B) The
SURVEYOR assay showing indel mutations in the on-target LTR genome of Cas9/LTR-D but not Cas9/LTR-C transfected cells. (C) Representative gating diagram
of EGFP flow cytometry showing a dramatic reduction in TSA-induced reactivation of latent pNL4-3-ΔGag-d2EGFP reporter viruses by stable expression of Cas9/
LTR-C or LTR-D compared with empty U6-driven gRNA expression vector (U6-CAG).
Hu et al. www.pnas.org/cgi/content/short/1405186111 4 of 13
Fig. S3. Both LTR-C and LTR-D induced indel mutations and significantly decreased constitutive and TSA/PMA-induced luciferase activity in TZM-bI cells stably
incorporated with HIV-1 LTR-firefly luciferase reporter gene. (A) Functional luciferase reporter assay revealing a significant reduction of LTR reactivation by
LTR-C, LTR-D, or both. (B) SURVEYOR assay showing indel mutation in LTR DNA (−453 to +43) induced by LTR-C and LTR-D (red arrow). A combination of LTR-C
and LTR-D generates a 194-bp fragment (black arrow) resulting from the deletion of 302-bp region between LTR-C and LTR-D. (C and D) Sanger sequencing of
30 clones validating the indel efficiency at 23% for LTR-C and 13% for LTR-D and example chromatograms showing inertion/deletion. (E) PCR-restriction
fragment length polymorphism analysis using BsaJ I to cut five sites (96, 102, 372, 386, 482) of the PCR product covering −453 to +43 of LTR showing two major
bands (96 bp and 270 bp) in the U6-CAG control sample, but an additional 372-bp band (red arrow) after LTR-C-induced indel mutation at the 96/102 sites,
a 290-bp band (red arrow) after LTR-D-induced mutations at the 372 site, and a 180-bp fragment (black arrow) after LTR-C/D-induced excision. (F) Example
chromatograms showing the deletion of a 302-bp fragment between LTR-C and LTR-D (Upper) and an additional 17-bp deletion (Lower). Red arrows indicate
the junction sites. *P < 0.05 indicates a significant decrease in LTR-C or LTR-D-mediated luciferase activation compared with U6-CAG control.
Hu et al. www.pnas.org/cgi/content/short/1405186111 5 of 13
Fig. S4. TA cloning and Sanger sequencing of PCR products from CHME5 subclones of LTR-A/B and empty U6-CAG control using primers covering HIV-1 LTR
U3/R/U5 regions (−411 to + 129). (A) Possible combination of LTR-A and LTR-B cuts on both 5′ and 3′ LTRs generating potential fragments a−c as indicated. (B)
Blasting of fragment a (351 bp) showing 190-bp deletion between LTR-A and LTR-B cut sites. (C) Blast of fragment c (682 bp) showing a 175-bp insertion at the
LTR-A cleavage site and a 27-bp deletion at the LTR-B cleavage site.
Fig. S5. Cas9/LTR-gRNA efficiently eradicates latent HIV-1 virus from U1 monocytic cells. (A) Sanger sequencing of a 1.1-kb fragment from long-range PCR
using a primer pair (T492/T493) targeting a chromosome 2 integration site-flanking sequence (small letters, 467 bp) reveals elimination of the entire HIV-1
genome (9709 bp), leaving combined 5′ LTR (green) and 3′ LTR with a 6-bp insertion (red) precisely at the third nucleotide from PAM (highlighted green on red
text) LTR-A targeting site (underlined) and a 4-bp deletion (highlighted pink). (B) The representative DNA gel picture shows specific eradication of the HIV-1
genome. NS, nonspecific band. (C and D) Quantitative PCR analysis using the primer pair targeting the Gag gene (T457/T458) shows 85% efficiency of entire
HIV-1 genome eradication in Cas9/LTR-A/B-expressing U1 cells. U1 cells were transfected with pX260 empty vector (U6-CAG) or LTRs-A/B-encoding vectors. After
2-wk puromycin selection, the cellular genomic DNAs were used for absolute quantitative qPCR analysis using spiked pNL4-3-ΔE-EGFP human genomic DNA as
a standard. **P < 0.01 indicates a significant decrease compared with the U6-CAG control.
Hu et al. www.pnas.org/cgi/content/short/1405186111 6 of 13
Fig. S6. Cas9/LTR gRNAs effectively eradicates HIV-1 provirus in J-Lat latently infected T cells. (A) Functional analysis by EGFP flow cytometry reveals ∼50%
reduction of PMA and TNFα-induced reactivation of EGFP reporter viruses. (B) The SURVEYOR assay shows indel mutations (red arrow) in the on-target LTR
genome of Cas9/LTR-A/B transfected cells. J-Lat cells were transfected with pX260 empty vector or LTR-A and -B. After 2-wk puromycin selection, cells were
treated with PMA or TNFα for 24 h. The genomic DNAs were subject to PCR using primers covering HIV-1 LTR U3/R/U5 regions (−411 to +129), and the
SURVEYOR assay was performed. **P < 0.01 indicates a significant decrease compared with the U6-CAG control. (C) PCR fragment analysis using primers
covering HIV-1 LTR (−374 to +43) shows a precise deletion of 190-bp region between LTR-A and -B cutting sites, leaving 227-bp fragment (red arrow).
Housekeeping gene β-actin serves as a DNA purification and loading control.
Fig. S7. Genome-editing efficiency depends upon the presence of Cas9 and gRNAs. (A and B) PCR genotyping reveals the absence of a U6-driven LTR-A or LTR-B
expression cassette (A) and absence/reduction of CMV-driven Cas9 DNA (B) in puromycin-selected TZM-bI subclones without any indication of genomic editing.
Genomic DNAs from indicated subclones were subject to conventional (A) or real-time (B) PCR analyses using a primer pair covering U6 promoter (T351) and LTR-A
(T354) or LTR-B (T356), and targeting Cas9 (T477/T491). (C and D) Cas9 protein expression is absent in ineffective TZM-bI subclones. The Flag-tagged Cas9 fusion
protein was detected by Western blot (WB) and immunocytochemistry (ICC) with anti-Flag monoclonal antibody. HEK293T cell line stably expressing Flag-Cas9 was
used as a positive control for WB (C). GAPDH serves as a protein loading control. Clone c6 contains Cas9 DNA but no Cas9 protein expression, suggesting a potential
mechanism of epigenetic repression after puromycin selection. Clones c5 and c3 may represent a truncated Flag-Cas9 (tCas9). Nucleus was stained with Hoechst
33258 (D).
Hu et al. www.pnas.org/cgi/content/short/1405186111 7 of 13
Fig. S8. Stable expression of Cas9/LTR-A/B gRNAs in TZM-bI cells vaccinates against pseudotyped or native HIV-1 viruses. (A) Flow cytometry shows a significant
reduction of native pNL4-3-ΔE-EGFP reporter virus infection efficiency in Cas9/LTR-A/B expressing TZM-bI subclones. (B and C) Real-time PCR analysis reveals
suppression or elimination of viral RNA (B) and DNA (C) by Cas9/LTR-A/B gRNAs. (D) The firefly luciferase luminescent assay demonstrates dramatic inhibition of
virus infection-stimulated LTR promoter activity by Cas9/LTR-A/B gRNAs. The stable Cas9/LTR-A/B gRNA-expressing TZM-bI cells were infected for 2 h with
indicated native HIV-1 viruses, and washed twice with PBS. At 2 d postinfection, cells were collected, fixed, and analyzed by flow cytometry for EGFP expression
(A), or lysed for total RNA extraction and RT-qPCR (B), genomic DNA purification for qPCR (C), and luminescence measurement (D). *P < 0.05 and **P < 0.01
indicate significant decreases compared with the U6-CAG control.
Hu et al. www.pnas.org/cgi/content/short/1405186111 8 of 13
Table S1. Predicted LTR gRNAs and their off-target numbers (100% match)
Name No. gRNA sequence 20 19 18 17 16 15 14 13 12
Sense
1 TCAGACCCTTTTAGTCAGTGTGG 0 0 0 0 0 0 0 2 25
2 TTGCTTGTACTGGGTCTCTCTGG 0 0 0 0 0 0 0 3 10
3 CAGCTGCTTTTTGCTTGTACTGG 0 0 0 0 0 0 2 4 11
4 CTGACATCGAGCTTGCTACAAGG 0 0 0 0 0 0 0 0 17
5 CCGCCTAGCATTTCATCACATGG 0 0 0 0 0 1 2 9 55
6 CGGAGAGAGAAGTATTAGAGTGG 0 0 0 0 0 0 0 3 18
7 AGTACCAGTTGAGCAAGAGAAGG 0 0 0 0 0 0 0 1 18
8 GATATCCACTGACCTTTGGATGG 0 0 0 0 0 1 3 22 211
LTR-C 9 GATTGGCAGAACTACACACCAGG 0 0 0 0 0 0 0 2 12
10 CACAAGGCTACTTCCCTGATTGG 0 0 0 0 0 0 0 2 11
11 CTGTGGATCTACCACACACAAGG 0 0 0 0 0 2 5 12 37
12 TGGGAGCTCTCTGGCTAACTAGG 0 0 0 0 0 0 0 4 14
13 GGTTAGACCAGATCTGAGCCTGG 0 0 0 0 0 0 0 9 33
14 TGCTACAAGGGACTTTCCGCTGG 0 0 0 0 0 0 0 2 5
15 AGAGAGAAGTATTAGAGTGGAGG 0 0 0 0 0 0 5 7 16
16 TTACACCCTGTGAGCCTGCATGG 0 0 0 0 0 0 3 14 35
17 AAGGTAGAAGAAGCCAATGAAGG 0 0 0 0 0 0 4 36 75
LTR-A 18 ATCAGATATCCACTGACCTTTGG 0 0 0 0 0 0 1 6 16
19 GACAAGATATCCTTGATCTGTGG 0 0 0 0 0 0 0 0 10
20 GCCCGTCTGTTGTGTGACTCTGG 0 0 0 0 0 0 3 7 35
21 ATCTGAGCCTGGGAGCTCTCTGG 0 0 0 0 1 2 9 32 78
22 CTTTCCGCTGGGGACTTTCCAGG 0 0 0 0 0 0 0 0 4
23 CAGAACTACACACCAGGGCCAGG 0 0 0 0 0 0 0 3 20
24 CCTGCATGGGATGGATGACCCGG 0 0 0 0 2 2 2 5 21
25 CCCTGTGAGCCTGCATGGGATGG 0 0 0 0 1 1 2 9 30
26 CTTTCCAGGGAGGCGTGGCCTGG 0 0 0 0 0 8 15 32 75
27 GGGGACTTTCCAGGGAGGCGTGG 0 0 0 0 0 0 0 5 24
28 CCGCTGGGGACTTTCCAGGGAGG 0 0 0 0 0 1 3 9 25
29 CATGGCCCGAGAGCTGCATCCGG 0 0 0 0 0 0 0 0 16
30 GCCTGGGCGGGACTGGGGAGTGG 0 0 0 0 1 2 7 18 250
31 AGGCGTGGCCTGGGCGGGACTGG 0 0 0 0 0 0 0 2 6
LTR-D 32 GCGTGGCCTGGGCGGGACTGGGG 0 0 0 0 0 0 6 9 29
33 CCAGGGAGGCGTGGCCTGGGCGG 0 0 0 0 2 2 2 11 22
Antisense
1 TGTGGTAGATCCACAGATCAAGG 0 0 0 0 0 0 0 3 13
2 GGTGTGTAGTTCTGCCAATCAGG 0 0 0 0 0 0 0 8 11
3 GTCAGTGGATATCTGATCCCTGG 0 0 0 0 0 0 0 2 10
4 TAGCACCATCCAAAGGTCAGTGG 0 0 0 0 0 0 2 3 10
5 TAGCTTGTAGCACCATCCAAAGG 0 0 0 0 0 1 1 6 12
6 TCTACCTTCTCTTGCTCAACTGG 0 0 0 0 0 0 1 1 6
7 CACTCTAATACTTCTCTCTCCGG 0 0 0 0 0 0 0 1 26
8 CCATGTGATGAAATGCTAGGCGG 0 0 0 0 0 1 1 5 17
9 GGGCCATGTGATGAAATGCTAGG 0 0 0 0 0 0 0 3 48
LTR-B 10 CAGCAGTTCTTGAAGTACTCCGG 0 0 0 0 0 0 0 0 5
11 CTGCTTATATGCAGCATCTGAGG 0 0 0 0 0 0 0 4 39
12 CACACTACTTGAAGCACTCAAGG 0 0 0 0 0 0 0 3 19
13 TACCAGAGTCACACAACAGACGG 0 0 0 0 0 0 0 2 12
14 ACACTGACTAAAAGGGTCTGAGG 0 0 0 0 1 2 3 4 15
15 CAAGGATATCTTGTCTTCGTTGG 0 0 0 0 0 0 0 1 5
16 CAGGGAAGTAGCCTTGTGTGTGG 0 0 0 0 0 1 2 4 17
17 GCGGGTGTTCTCTCCTTCATTGG 0 0 0 0 0 2 2 15 49
18 TAGTTAGCCAGAGAGCTCCCAGG 0 0 0 0 0 2 4 24 93
19 CTTTATTGAGGCTTAAGCAGTGG 0 0 0 0 0 0 0 4 25
20 ACTCAAGGCAAGCTTTATTGAGG 0 0 0 0 0 0 0 1 28
21 GGATATCTGATCCCTGGCCCTGG 0 0 0 0 0 1 2 8 43
22 GGCTCACAGGGTGTAACAAGCGG 0 0 0 0 0 0 0 2 5
23 TCCATCCCATGCAGGCTCACAGG 0 0 0 0 0 3 3 8 20
Hu et al. www.pnas.org/cgi/content/short/1405186111 9 of 13
Table S1. Cont.
Name No. gRNA sequence 20 19 18 17 16 15 14 13 12
24 AGTACTCCGGATGCAGCTCTCGG 0 0 0 0 0 0 1 5 48
25 AGAGCTCCCAGGCTCAGATCTGG 0 0 0 0 0 0 4 15 38
26 GATTTTCCACACTGACTAAAAGG 0 0 0 0 0 0 0 8 21
27 CCGGGTCATCCATCCCATGCAGG 0 0 0 0 0 0 0 4 36
28 CCTCCCTGGAAAGTCCCCAGCGG 0 0 0 0 0 0 3 14 37
29 GCCACTCCCCAGTCCCGCCCAGG 0 0 0 0 0 0 1 4 14
30 CCGCCCAGGCCACGCCTCCCTGG 0 0 0 0 0 1 1 6 19
The 5′ LTR sense and antisense sequences (634 bp) of pHR’-CMV-LacZ lentiviral vector (AF105229) were
used to search for Cas9/gRNA target sites containing a 20-bp guide sequence (protospacer) plus the proto-
spacer adjacent motif sequence (NGG) using Jack Lin’s CRISPR/Cas9 gRNA finder tool (http://spot.colorado.
edu/~slin/cas9.html). Each gRNA plus NGG (AGG, TGG, GGG, CGG) was blasted against available human
genomic and transcript sequences with 1,000 aligned sequences being displayed. After pressing Control + F,
copy/paste the target sequence (1-23 through 9-23 nucleotides) and find the number of genomic targets
with 100% match. The number of off-targets for each searching was divided by 3 because of repeated
genome library. The number shown indicates the sum of four searches (NGG). The red number indicates the
gRNA target sequences farthest from NGG. The sequence and off-target numbers for the selected LTR-A/B
and LTR-C/D are highlighted red and green, respectively.
Hu et al. www.pnas.org/cgi/content/short/1405186111 10 of 13
Table S2. Oligonucleotides for gRNA targeting sites and primers used for PCR and sequencing
Target name Direction Sequences (5′ to 3′)
Hu et al. www.pnas.org/cgi/content/short/1405186111 11 of 13
Table S3. Locations of predicted gRNA targeting sites of LTR-A and LTR-B
Mismatch
Subject Identity, (12-bp
Name Query seq Id % E-value Start End Strand Ref. seq seed)
Hu et al. www.pnas.org/cgi/content/short/1405186111 12 of 13
Table S3. Cont.
Mismatch
Subject Identity, (12-bp
Name Query seq Id % E-value Start End Strand Ref. seq seed)
Hu et al. www.pnas.org/cgi/content/short/1405186111 13 of 13