You are on page 1of 4

news

There are amendments to doi:10.1038/s41587-020-0449-8

Go-ahead for first peanut


allergy drug
Single-cell RNA-seq analysis
The US Food and Drug Administration
has approved a first food allergy drug, software providers scramble to
offer solutions
Aimmune Therapeutics’ Palforzia, for
children 4–17 years old with peanut
allergies. Palforzia (AR101) consists
of peanut (Arachis hypogaea) powder
capsules to help patients build up A raft of tools have sprung up to help biologists work through the single-cell
tolerance to accidental peanut exposure. transcriptomic bottleneck, but integration remains elusive.
Patients start the oral desensitization

T
course with 3-mg daily dose of peanut
he market for single-cell analytical whole tissue samples. This technology is
protein and gradually build up to
technologies is booming: from a allowing researchers to tackle complex
a 300-mg daily maintenance dose.
value of $1.83 billion in 2018, it biological systems and phenomena — for
In a 496-patient pivotal trial, 67%
could triple by 2025, according to a report example, profiling rare cell types within a
of Palforzia recipients tolerated a
from ResearchAndMarkets.com published tissue, analyzing tumor heterogeneity, or
600-mg peanut protein challenge,
last year. The research community’s tracing lineages in differentiating cells —
after 6 months on maintenance
enthusiastic embrace of single-cell RNA- at single-cell resolution. Achieving such
treatment, with only mild allergic
seq has already transformed the fortunes insights requires specialized algorithms
reactions. Only 4% of placebo recipients
of some instrument and kit manufacturers. that can account for the distinctive artifacts
tolerated this challenge.
10X Genomics CEO Serge Saxonov, in a and biases associated with data arising
Peanut allergy affects around
Q3 earnings call from November 2019, from these experiments. “Single-cell data
1 million children in the United States,
announced that the near-term market for is much noisier than bulk RNA-seq,” says
and accidental exposure can provoke
his company’s technologies for single-cell biostatistician Stephanie Hicks, of the Johns
life-threatening anaphylactic shock in
analysis could reach $13 billion. But this Hopkins Bloomberg School of Public Health.
some patients, leading Aimmune to
rapid uptake could also create headaches for Although bioinformaticians have devised
predict the drug could exceed $1 billion
new users as they learn how to come to grips computational tools to overcome that noise,
in global annual sales. But others have
with the complexities of single-cell data. as the scale of analysis expands, other
their doubts. At over $10,000 per year, the
For over a decade, researchers have challenges will grow too.
drug’s high cost could deter insurers, and
been developing RNA sequencing (RNA- The scientific community has eagerly
its side effect profile — which mirrors
seq) techniques to analyze changes in gene embraced single-cell RNA-seq (scRNA-seq).
the effects of peanut exposure — could
expression in individual cells rather than This interest has given impetus to academics’
put patients off, say some. While some
patients might be tempted by a do-it-
yourself peanut desensitization program,
clinicians caution against it because
of the increased need for epinephrine
while on treatment. For Palforzia,
treatment initiation and dose escalation
must take place in a supervised
medical setting.
Aimmune’s closest competitor
is DBV Technologies. The biotech,
headquartered in Paris, first filed its
transdermal peanut tolerance patch
Viaskin Peanut for approval in 2018,
but withdrew this submission months
later pending more manufacturing and
quality control data. It resubmitted
the patch in 2019 and anticipates a
decision by August. Aimmune and DBV
Technologies are also working on other
food allergy desensitization products,
including products for egg allergy and
milk allergy.

Published online: 9 March 2020


https://doi.org/10.1038/s41587-020-0458-7

Making sense of single-cell RNA-seq data can be problematic for labs without in-house bioinformatics
capabilities. Credit: Nobeastsofierce Science / Alamy Stock Photo

254 Nature Biotechnology | VOL 38 | March 2020 | 251–257 | www.nature.com/naturebiotechnology


news

Table 1 | Selected software providers for scRNA-seq analysis


Software name Developer Price structure Platform-specific Relevant stages of experiment
Cell Ranger 10X Genomics Free download 10X Chromium Raw read alignment, QC and matrix generation for
scRNA-seq and ATAC-seq; data normalization;
dimensionality reduction and clustering
Loupe Cell Browser 10X Genomics Free download 10X Chromium Visualization and analysis
Partek Flow Partek License No Complete data analysis and visualization pipeline for
scRNA-seq data
Qlucore Omics Explorer Qlucore License No scRNA-seq data filtering, dimensionality reduction and
clustering, visualization
mappa Analysis Pipeline Takara Bio Free download Takara ICell8 Raw read alignment and matrix generation for scRNA-seq
hanta R kit Takara Bio Free download Takara ICell8 Clustering and analysis of mappa data
Singular Analysis Toolset Fluidigm Free download Fluidigm C1 or Biomark Analysis and visualization of differential gene expression
data for scRNA-seq
SeqGeq FlowJo/BD License No Data normalization and QC, dimensionality reduction and
Biosciences clustering, analysis and visualization
Seven Bridges Seven Bridges/ License BD Rhapsody and Cloud-based raw read alignment, QC and matrix
BD Biosciences Precise generation
Tapestri Pipeline/Insights Mission Bio Free download Mission Bio Tapestri Analysis of single-cell genomics data
BaseSpace SureCell Illumina License Illumina SureCell Raw read alignment and matrix generation
libraries
OmicSoft Array Studio Qiagen License No Raw read alignment, QC and matrix generation,
dimensionality reduction and clustering
QC, quality control; ATAC-seq, assay for transposase-accessible chromatin using sequencing.

in-house algorithms, as well as commercial make these pipelines more user-friendly; mapped to their gene of origin, and
development of both instruments and concurrently, several commercial providers the resulting count matrix depicts the
analytical software from startups like 10X, are also offering their own solutions for number of transcripts per gene. Single-cell
1CellBio and Dolomite Bio alongside typical scRNA-seq experiments (Table 1). experiments complicate this process. For
biotech giants like Illumina, Bio-Rad and Software developers include instrument example, many scRNA-seq protocols —
Takara Bio. “The field has moved from makers themselves (e.g., 10X and Fluidigm) including the workflows used by 10X
users that do very cutting-edge methods that have developed software specifically and Dolomite instruments — capture
on homemade systems to users that require for their platforms, as well as companies individual cells in tiny droplets. All the
complete, easy to use systems,” says Juliane that have an established track record in RNAs captured in each droplet are tagged
Fischer, senior post sales, applications and providing bioinformatics solutions. As an with a distinct barcode sequence indicating
support specialist at Blacktrace Holdings, example of the latter, Partek has extended that they came from the same cell, and
parent company of Dolomite Bio. Dolomite its Partek Flow software for end-to-end each transcript is also reverse-transcribed
manufactures the Nadia scRNA-seq scRNA-seq analysis, taking users from to include a unique molecular identifier
platform. And these turnkey systems are raw reads to visualization. The SeqGeq (UMI) sequence to enable reliable counting
becoming popular — this January, Saxonov pipeline, developed jointly by Illumina of individual RNAs. Reads must therefore
noted that 10X had placed its Chromium and FlowJo (now part of BD Biosciences), undergo careful quality control to ensure
instrument for single-cell analysis at 96 of has also evolved from a general tool for that both sequences are present before
the world’s top 100 research institutes. next-generation sequencing analytics into a generating the matrix. This process must
With scRNA-seq taking academic labs platform for scRNA-seq experiments. “I have also account for droplets with multiple
by storm, it means many biologists are been very encouraged by the development of cells or no cells, as well as factors like
getting their first exposure to scRNA- tools that are designed to expose users to the innate differences in RNA content between
seq data. Those labs with experienced standard workflows from the field without cell types.
bioinformaticians have developed open- forcing them to write a single line of code,” Furthermore, single-cell data can be
source analysis tools in popular languages says Cole Trapnell, a genomics researcher at sparse. If only a small fraction of a cell’s
like R and Python. One of these, from the University of Washington. Nevertheless, RNA is captured, this means that genes that
Rahul Satija’s group at the New York successful analysis still requires broad appear to be non-expressed may simply
Genome Center, is Seurat; another is expertise and careful oversight. “We are have eluded detection. Ian Taylor, Director
Scanpy, from Fabian Theis and colleagues; not at the point where everything comes of Product Innovation at BD Biosciences
and contributors to the Bioconductor out and the results are just waiting for you,” Informatics, notes that users expecting clean
community have also generated an extensive says Mirko Corselli, scientific marketing two-dimensional plots that clearly depict
toolbox for single-cell analysis. These manager at BD Biosciences. patterns of gene expression in their sample
algorithms may be intimidating to non- In a standard RNA-seq experiment, are “going to be somewhat shocked by the
coders, but there is an ongoing effort to sequencing reads are computationally sparse data in front of them.”

Nature Biotechnology | VOL 38 | March 2020 | 251–257 | www.nature.com/naturebiotechnology 255


news

expression profiles into different cell types you can move a slider or click a check
BIO’s bias report based on their similarity or dissimilarity to box and the visualizations are updated
one another. instantly” to reflect the user’s changes.
A survey by the Biotechnology
The initial stages of data processing are Customer service and company onsite
Innovation Organization (BIO) of its
relatively straightforward. For example, 10X training programs often add to the appeal
member companies highlights a lack of
has developed the Cell Ranger pipeline, of commercial tools. “We try to put together
both gender and racial diversity at the
which is designed to perform efficient workflows in presentations and webinars
higher echelons of biotech. At the
quality control — including barcode and and education materials that guide people
98 companies who responded to the
UMI detection — and matrix generation down a happy path,” says Taylor of the
survey, women make up 45% of total
on raw data from Chromium experiments. Illumina/BD SeqGeq toolbox.
employees, 30% of an executive team and
“They have been quite good about providing Enhancing the user-friendliness
18% of the board, while people of color
tools for low-level processing and viewing,” and accessibility of analytical tools is a
make up 32% of total employees, 15% of
says Harvard Medical School computational natural sweet spot for companies, says
the executive team and 14% of the board.
biologist Peter Kharchenko. “If you have Kharchenko. “[In academia] we’re very
BIO’s report is the first to look at
a 10X machine and rely on their tools, good at developing algorithms, but we’re not
racial diversity in biotech. The findings
you’ll probably get a pretty good expression very good at polishing things and making
have prompted the trade organization
matrix.” Ines Hellmann of the Ludwig- them convenient,” he says. This may be
to recommend that companies should
Maximilians University Munich notes that changing, however. For example, Satija’s
put a disproportionately high focus on
10X software is designed to work with the toolbox Seurat has won widespread praise
recruiting diverse board members —
specific idiosyncrasies of 10X data, whereas as a powerful and relatively easy to use
by using, for example, targeted talent
other pipelines might have to be customized software and for its guided tutorials. And
networks rather than word of mouth and
to process the data appropriately. Takara Hicks, who is part of the technical advisory
personal connections.
Bio and Fluidigm have likewise opted to board for Bioconductor, has worked hard
BIO’s gender-imbalance findings,
produce their own software suites for data to inform new users about how to use this
however, aren't new to the industry.
processing, which are free to download but initiative’s software tools for scRNA-seq,
A 2017 report showed that women
also platform-specific. including a recently published guide and an
held only around one in ten of
Some instrument makers, such as online e-book. “This is a way for people to
board positions. Another analysis by
1CellBio, made their analytical software sift through the enormous amount of rich
Massachusetts Institute of Technology
freely available to researchers at the outset. resources,” she says.
(MIT) entrepreneur Sangeeta Bhatia
But customers found the open-source But there are dangers in too much
and colleagues forming the Boston
software too complicated to navigate, says simplification or blindly trusting the tools.
Biotech Working Group investigated
CEO Colin Brenan. 1CellBio therefore BD’s Corselli says, “The challenge is always:
why fewer female faculty at MIT set up
partnered with Partek to develop a pipeline how do I know that what I’m seeing is
companies as compared with their male
that combines their inDrop platform with true?” Many steps require careful tweaking
counterparts. Their findings suggest that
Partek Flow, which uses a far simpler, depending on the experiment and the
40 to 50 more biotechs would exist if
point-and-click user interface. This type of samples. For example, Hellmann
female MIT faculty had begun startups
combination frees researchers from having recently embarked on a broad comparison
at an equal rate to that of their male
to know coding, says Partek president of different analytical workflows to identify
colleagues. In response, several Boston
Tom Downey, and allows them to process factors that can shape the success or failure
venture capital firms are pledging that
data generated from any single-cell RNA of a given experiment. Normalization came
the boards of their portfolio companies
sequencing system. Dolomite made a out as a top factor. “In many big papers,
will be 25% female by the end of 2022.
similar decision to collaborate with Partek normalization is not taken very seriously, but
The premise is that board participation
for its platform, although it also promotes that can actually be very important,” she says.
gives women access to a network of
the open-source dropSeqPipe software — Before embarking on a deep analysis,
investors, scientists and other contacts
developed by Patrick Roelli and colleagues scientists should also pay close attention to
needed to start a business. BIO hopes to
at the Swiss Institute of Bioinformatics — the dimensionality-reduction and clustering
run the survey annually with increased
for more expert users. steps. One of the most popular mathematical
participation to track progress.
For novice scRNA-seq users, commercial approaches for dimensionality reduction
Published online: 9 March 2020 software is easiest to use, and this is a major is principal component analysis, but many
https://doi.org/10.1038/s41587-020-0460-0 selling point. How much help researchers groups have also been moving to implement
will need depends on the complexity of newer, non-linear dimensionality reduction
the analysis. “Finding new cell clusters techniques such as t-distributed stochastic
is something most people could easily neighbor embedding (t-SNE) or the more
To make sense of the data, researchers do,” Dolomite’s Fischer says of the Partek recent uniform manifold approximation
must ‘normalize’ it first, a process that Flow software offered with her company’s and projection (UMAP). However, all
ensures they are comparing apples to instruments. User-friendly software can also of these methods can potentially be
apples when they attempt to identify gene help in visualizing the data — a major focus confounded by the sparsity of scRNA-seq
expression differences between cells. of Qlucore’s Omics Explorer software, and a data, and at present there is still no clear
Finally, these results must be subjected step that could help or hinder interpretation. consensus on which is the most robust
to dimensionality reduction, which “We have an API [application programming ‘gold standard’ solution for simplifying and
simplifies the data in a way that enables interface] that brings in data from off of interpreting scRNA-seq data to identify
visual representation and straightforward the Cell Ranger pipeline,” says company biologically accurate cell clusters — or even
mathematical analysis, and clustering president Carl-Johan Ivarsson, referring to whether such a single solution exists for
algorithms that can group the individual the widely used 10X software tool. “Then all experiments.

256 Nature Biotechnology | VOL 38 | March 2020 | 251–257 | www.nature.com/naturebiotechnology


news

As a consequence, scRNA-seq still edge applications remain works in progress studies using different samples — and, most
requires a marriage of bioinformatics and and will almost certainly require open- likely, different experimental workflows
wet-lab expertise. With this in mind, many source algorithms. “We’re quickly moving — is still very much an unsolved problem.
academic centers set up hybrid workflows past ‘we did an scRNA-seq experiment and “Right now, it’s sort of like a magic trick,”
that incorporate user-friendly commercial here are the cell types we see’,” says Trapnell, says Kharchenko. “You put all your data in a
algorithms and open-source software, which whose group has developed computational pot and then say ‘integrate!’” This problem
can be customized. Ivarsson notes that tools for mapping the developmental becomes even more daunting when one
commercial package Qlucore aspires to offer trajectories of millions of individual cells in begins to contemplate combining scRNA-
a software for single-cell platforms that can parallel. New tools are also enabling physical seq data with other -omic data layers, such
adapt to diverse workflows. “Our vision is mapping of RNA-seq data in multicellular as chromatin accessibility, DNA methylation
not to lock in users — rather the opposite, samples. For example, 10x Genomics’ or protein expression.
to make it easier to interact,” he says. This Visium Spatial Gene Expression Solution The perfect software tool to deliver all
is also true for commercially produced maps gene activity within tissue specimens, answers is unlikely to emerge. “You can
SeqGeq, which is designed to work with and although the company has developed provide tools to make sure that you make no
externally developed or custom-written R software called Space Ranger to facilitate mistakes,” says Corselli. “But ultimately it’s
software packages. Dolomite’s Fischer says analysis, other algorithms will surely be still the responsibility of the scientist to look
that, in her experience, most researchers needed to make the most of this still-novel at the data and be rigorous.” ❐
don’t even think about the software aspect experimental approach.
of the workflow until they begin wrestling The ultimate challenge is integration. Michael Eisenstein
with data — and at that point, flexibility “The bottleneck for many researchers isn’t Philadelphia, PA, USA
and availability of support are the critical necessarily the early stages of processing
considerations. data, but the later, more in-depth analysis to Published online: 9 March 2020
Such interoperability is especially integrate results,” says Theis, who is Director https://doi.org/10.1038/s41587-020-0449-8
important for users looking to venture of the Institute of Computational Biology
beyond routine applications like assessing at the Helmholtz Zentrum Munich. How Acknowledgements
differential gene expression. These cutting- to best combine data from many different Additional reporting by Chris Lieu, Redwood City, CA, USA.

Around the world in a month


CHINA
In a new trade deal with the United States,
SWITZERLAND China agrees to implement patent linkage
Merck KGaA will build a 250 ($273) million and patent term extension. This provision will ensure
facility in Corsier-sur-Vevey to produce clinical generic rivals cannot be approved before patents have
trial material for its R&D pipeline, incorporating two fully expired. The decision is a major victory for multination-
integrated drug substance manufacturing lines. The als and Chinese manufacturers of innovative medicines.
biotech facility will open by the end of 2022 with
approximately 250 employees.

BRAZIL
An international consortium launches aimed at
discovering new drugs to treat malaria, visceral
leishmaniasis, and Chagas disease. The participants,
which include the University of Campinas, University of
São Paulo, Medicines for Malaria Venture and Drugs for
Neglected Diseases Initiative, will invest $10.7 million over
the next five years to produce compounds that can be
developed into drugs to combat the three tropical diseases.

SINGAPORE
Academic spinout Enleofen partners with
German drugmaker Boehringer Ingelheim to
develop first-in-class anti-interleukin-11 therapies for a
range of fibrotic diseases. Enleofen could receive $1
billion per product in up-front and success-based
development and commercialization milestones.

Credit: Map: © iStockphoto; Flags: pop_jop / DigitalVision Vectors / Getty

Published online: 9 March 2020


https://doi.org/10.1038/s41587-020-0451-1

Nature Biotechnology | VOL 38 | March 2020 | 251–257 | www.nature.com/naturebiotechnology 257

You might also like