You are on page 1of 15

Perspective

https://doi.org/10.1038/s41592-019-0686-2

There are amendments to this paper

SciPy 1.0: fundamental algorithms for scientific


computing in Python
Pauli Virtanen1, Ralf Gommers 2*, Travis E. Oliphant2,3,4,5,6, Matt Haberland 7,8*, Tyler Reddy 9*,
David Cournapeau10, Evgeni Burovski11, Pearu Peterson12,13, Warren Weckesser14, Jonathan Bright15,
Stéfan J. van der Walt 14, Matthew Brett16, Joshua Wilson17, K. Jarrod Millman 14,18,
Nikolay Mayorov19, Andrew R. J. Nelson 20, Eric Jones5, Robert Kern5, Eric Larson21, C J Carey22,
İlhan Polat23, Yu Feng24, Eric W. Moore25, Jake VanderPlas26, Denis Laxalde 27, Josef Perktold28,
Robert Cimrman29, Ian Henriksen6,30,31, E. A. Quintero32, Charles R. Harris33,34, Anne M. Archibald35,
Antônio H. Ribeiro 36, Fabian Pedregosa37, Paul van Mulbregt 38 and SciPy 1.0 Contributors39

SciPy is an open-source scientific computing library for the Python programming language. Since its initial release in 2001, SciPy
has become a de facto standard for leveraging scientific algorithms in Python, with over 600 unique code contributors, thou-
sands of dependent packages, over 100,000 dependent repositories and millions of downloads per year. In this work, we provide
an overview of the capabilities and development practices of SciPy 1.0 and highlight some recent technical developments.

S
ciPy is a library of numerical routines for the Python program- has become the standard others follow and has seen extensive adop-
ming language that provides fundamental building blocks tion in research and industry.
for modeling and solving scientific problems. SciPy includes SciPy’s arrival at this point is surprising and somewhat anoma-
algorithms for optimization, integration, interpolation, eigenvalue lous. When started in 2001, the library had little funding and was
problems, algebraic equations, differential equations and many written mainly by graduate students—many of them without a
other classes of problems; it also provides specialized data struc- computer science education and often without the blessing of their
tures, such as sparse matrices and k-dimensional trees. SciPy is advisors. To even imagine that a small group of ‘rogue’ student pro-
built on top of NumPy1,2, which provides array data structures and grammers could upend the already well-established ecosystem of
related fast numerical routines, and SciPy is itself the foundation research software—backed by millions in funding and many hun-
upon which higher level scientific libraries, including scikit-learn3 dreds of highly qualified engineers10–12—was preposterous.
and scikit-image4, are built. Scientists, engineers and others around Yet the philosophical motivations behind a fully open tool
the world rely on SciPy. For example, published scripts5,6 used in stack, combined with an excited, friendly community with a sin-
the analysis of gravitational waves7,8 import several subpackages of gular focus, have proven auspicious in the long run. They led not
SciPy, and the M87 black hole imaging project cites SciPy9. only to the library described in this paper, but also to an entire
Recently, SciPy released version 1.0, a milestone that traditionally ecosystem of related packages (https://wiki.python.org/moin/
signals a library’s API (application programming interface) being NumericAndScientific) and a variety of social activities centered
mature enough to be trusted in production pipelines. This version around them (https://wiki.python.org/moin/PythonConferences).
numbering convention, however, belies the history of a project that The packages in the SciPy ecosystem share high standards of

1
University of Jyväskylä, Jyväskylä, Finland. 2Quansight LLC, Austin, TX, USA. 3Ultrasound Imaging, Mayo Clinic, Rochester, MN, USA. 4Electrical
Engineering, Brigham Young University, Provo, UT, USA. 5Enthought, Inc., Austin, TX, USA. 6Anaconda Inc., Austin, TX, USA. 7BioResource and Agricultural
Engineering Department, California Polytechnic State University, San Luis Obispo, CA, USA. 8Department of Mathematics, University of California Los
Angeles, Los Angeles, CA, USA. 9Los Alamos National Laboratory, Los Alamos, NM, USA. 10Independent researcher, Tokyo, Japan. 11National Research
University Higher School of Economics, Moscow, Russia. 12Independent researcher, Saue, Estonia. 13Department of Mechanics and Applied Mathematics,
Institute of Cybernetics at Tallinn Technical University, Tallinn, Estonia. 14Berkeley Institute for Data Science, University of California Berkeley, Berkeley,
CA, USA. 15Independent researcher, New York, NY, USA. 16School of Psychology, University of Birmingham, Edgbaston, Birmingham, UK. 17Independent
researcher, San Francisco, CA, USA. 18Division of Biostatistics, University of California Berkeley, Berkeley, CA, USA. 19WayRay LLC, Skolkovo Innovation
Center, Moscow, Russia. 20Australian Nuclear Science and Technology Organisation, Lucas Heights, NSW, Australia. 21Institute for Learning and Brain
Sciences, University of Washington, Seattle, WA, USA. 22College of Information and Computing Sciences, University of Massachusetts Amherst,
Amherst, MA, USA. 23Independent researcher, Amsterdam, the Netherlands. 24Berkeley Center for Cosmological Physics, University of California Berkeley,
Berkeley, CA, USA. 25Bruker Biospin Corp., Billerica, MA, USA. 26University of Washington, Seattle, WA, USA. 27Independent researcher, Toulouse,
France. 28Independent researcher, Montreal, Quebec, Canada. 29New Technologies Research Centre, University of West Bohemia, Plzeň, Czech Republic.
30
Department of Mathematics, Brigham Young University, Provo, UT, USA. 31Oden Institute for Computational Engineering and Sciences, The University
of Texas at Austin, Austin, TX, USA. 32Independent researcher, Belmont, Massachusetts, USA. 33Space Dynamics Laboratory, North Logan, UT, USA.
34
Independent researcher, Logan, Utah, USA. 35Anton Pannekoek Institute, Amsterdam, The Netherlands. 36Graduate Program in Electrical Engineering,
Universidade Federal de Minas Gerais, Belo Horizonte, Brazil. 37Google LLC, Montreal, Quebec, Canada. 38Google LLC, Cambridge, MA, USA. 39A list of
members and affiliations appears at the end of the paper. *e-mail: scipy.articles@gmail.com

Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods 261


Perspective NaTure MeThodS

implementation, documentation and testing, and a culture eager to Box 1 | Excerpt from the SciPy 0.1 release announcement
learn and adopt better practices—both for community management (typos corrected), posted 20 August 2001 on the Python-list
and software development. mailing list
Background
Here we capture a selective history of some milestones and impor- SciPy is an open-source package that builds on the strengths of
tant events in the growth of SciPy. Despite what we highlight here, it Python and Numeric, providing a wide range of fast scientific
is important to understand that a project like SciPy is only possible and numeric functionality. SciPy’s current module set includes
because of the contributions of very many contributors—too many the following:
to mention individually, but each bringing an important piece to • Special functions (Bessel, Hankel, Airy and others)
the puzzle. • Signal/image processing
Python is an interpreted, high-level, general-purpose computer • 2D plotting capabilities
programming language, designed by Guido van Rossum in the late • Integration
1980s, with a dynamic type system and an emphasis on readability • ODE solvers
and rapid prototyping13 (https://github.com/python/cpython). As • Optimization (simplex, BFGS, Newton-CG and others)
a general-purpose programming language, it had no special sup- • Genetic algorithms
port for scientific data structures or algorithms, unlike many of the • Numeric to C++ expression compiler
other established computation platforms of the time. Yet scientists • Parallel programming tools
soon discovered the language’s virtues, such as its ability to wrap C • Splines and interpolation
and Fortran libraries, and to then drive those libraries interactively. • Other items
Scientists could thereby gain access to a wide variety of existing
computational libraries without concerning themselves with low-
level programming concepts such as memory management. (and currently a SciPy core developer), provided compilation
In 1995, Jim Hugunin, a graduate student at the Massachusetts instructions under Windows. Around the same time, Pearu
Institute of Technology, wrote the first message in a new Python Peterson, a PhD student from Estonia, released F2PY19, a command
Matrix Special Interest Group (Matrix-SIG) mailing list14: line tool for binding Python and Fortran codes, and wrote modules
for linear algebra and interpolation. Eric Jones, while a graduate
“There seems to be a fair amount of interest in the Python com-
student at Duke University, wrote packages to support his disserta-
munity concerning the addition of numeric operations to Python.
My own desire is to have as large a library of matrix based func- tion, including a parallel job scheduler and genetic optimizer. Gary
tions available as possible (linear algebra, eigenfunctions, signal Strangman, a postdoctoral fellow at Harvard Medical School, pub-
processing, statistics, etc.). In order to ensure that all of these lished several descriptive and inferential statistical routines20.
libraries interoperate, there needs to be agreement on a basic ma- With a rich programming environment and a numerical array
trix object that can be used to represent arrays of numbers.” object in place, the time was ripe for the development of a full scien-
tific software stack. In 2001, Eric Jones and Travis Vaught founded
Over the next several months, conversations on that mailing list Enthought Scientific Computing Solutions (now Enthought, Inc.)
by, among others, Jim Fulton, Jim Hugunin, Paul Dubois, Konrad in Austin, Texas, USA. To simplify the tool stack, they created the
Hinsen and Guido van Rossum led to the creation of a package SciPy project, centered around the SciPy library, which would sub-
called Numeric with an array object that supported a high num- sume all the above-mentioned packages. The new project quickly
ber of dimensions. Jim Hugunin explained the utility of Python for gained momentum, with a website and code repository21 appear-
numerical computation15: ing in February, and a mailing list announced22 in June 2001. By
August 2001, a first release was announced23, an excerpt of which
“I’ve used almost all of the available numerical languages at one
is shown in Box 1. In September, the first documentation was pub-
time or another over the past 8 years. One thing I’ve noticed is
that over time, the designers of these languages are steadily adding lished24. The first SciPy workshop25 was held in September 2002 at
more of the features that one would expect to find in a general- Caltech—a single track, two-day event with 50 participants, many of
purpose programming language.” them developers of SciPy and surrounding libraries.
At this point, scientific Python started attracting more serious
This remains a distinguishing feature of Python for science and attention; code that started as side projects by graduate students
one of the reasons why it has been so successful in the realm of data had grown into essential infrastructure at national laboratories
science: instead of adding general features to a language designed and research institutes. For example, Paul Dubois at Lawrence
for numerical and scientific computing, here scientific features are Livermore National Laboratory (LLNL) took over the maintenance
added to a general-purpose language. This broadens the scope of of Numeric and funded the writing of its manual26, and the Space
problems that can be addressed easily, expands the sources of data Telescope Science Institute (STScI), which was in charge of Hubble
that are readily accessible and increases the size of the community Space Telescope science operations, decided to replace their custom
that develops code for the platform. scripting language and analysis pipeline with Python27. As STScI
continued to use Python for an increasingly large portion of the
SciPy begins. By the late 1990s, discussions appeared on Matrix-SIG Hubble Space Telescope data analysis pipeline, they encountered
expressing a desire for a complete scientific data analysis environ- problems with the Python numerical array container. Numeric, the
ment16. Travis Oliphant, a PhD student at the Mayo Clinic, released original array package, was suitable for small arrays, but not for the
a number of packages17,18 that built on top of the Numeric array large images processed by STScI. With the Numeric maintainer’s
package, and provided algorithms for signal processing, special blessing, the decision was made to write NumArray28, a library
functions, sparse matrices, quadrature, optimization, fast Fourier that could handle data on a larger scale. Unfortunately, NumArray
transforms and more. One of these packages, Multipack (http:// proved inefficient for small arrays, presenting the community with
pylab.sourceforge.net/multipack.html), was a set of extension a rather unfortunate choice. In 2005, Travis Oliphant combined
modules that wrapped Fortran and C libraries to solve nonlinear the best elements of Numeric and NumArray, thereby solving the
equations and least-squares problems, integrate differential equa- dilemma. NumPy 1.0 was released29 in October 2006, paving the
tions and fit splines. Robert Kern, then an undergraduate student way for the reunified scientific Python community to mature.

262 Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods


NaTure MeThodS Perspective
Box 2 | Package organization

The SciPy library is organized as a collection of subpackages. The misc


16 subpackages include mathematical building blocks (for exam- A collection of functions that did not fit into the other subpackages.
ple, linear algebra, Fourier transforms, special functions), data Although this subpackage still exists in SciPy 1.0, an effort is
structures (for example, sparse matrices, k-D trees), algorithms underway to deprecate or relocate the contents of this subpackage
(for example, numerical optimization and integration, clustering, and remove it.
interpolation, graph algorithms, computational geometry) and odr
higher-level data analysis functionality (for example, signal and
image processing, statistical methods). Orthogonal distance regression, including Python wrappers for
Here we summarize the scope and capabilities of each subpackage. the Fortran library ODRPACK54.
Additional information is available in the SciPy tutorial (https:// optimize
docs.scipy.org/doc/scipy/reference/tutorial/) and API reference This subpackage includes simplex and interior-point linear
(https://docs.scipy.org/doc/scipy/reference/index.html#api- programming solvers, implementations of many nonlinear
reference). minimization algorithms, a routine for least-squares curve fitting,
cluster and a collection of general nonlinear solvers for root-finding.
The cluster subpackage contains cluster.vq, which signal
provides vector quantization and k-means algorithms, and The signal subpackage focuses on signal processing and basic
cluster.hierarchy, which provides functions for linear systems theory. Functionality includes convolution and
hierarchical and agglomerative clustering. correlation, splines, filtering and filter design, continuous and
constants discrete time linear systems, waveform generation, window
Physical and mathematical constants, including the CODATA functions, wavelet computations, peak finding and spectral analysis.
recommended values of the fundamental physical constants119. sparse
fftpack This subpackage includes implementations of several
Fast Fourier Transform routines. In addition to the FFT itself, the representations of sparse matrices. scipy.sparse.
subpackage includes functions for the discrete sine and cosine linalg provides a collection of linear algebra routines that
transforms and for pseudo-differential operators. work with sparse matrices, including linear equation solvers,
eigenvalue decomposition, singular value decomposition and LU
integrate factorization. scipy.sparse.csgraph provides a collections
The integrate subpackage provides tools for the numerical of graph algorithms for which the graph is represented using
computation of single and multiple definite integrals and for the a sparse matrix. Algorithms include connected components,
solution of ordinary differential equations, including initial value shortest path, minimum spanning tree and more.
problems and two-point boundary value problems. spatial
interpolate This subpackage provides spatial data structures and algorithms,
The interpolate subpackage contains spline functions and including the k-d tree, Delaunay triangulation, convex hulls and
classes, one-dimensional and multi-dimensional (univariate Voronoi diagrams. scipy.spatial.distance provides a large
and multivariate) interpolation classes, Lagrange and Taylor collection of distance functions, along with functions for computing
polynomial interpolators, and wrappers for FITPACK53 and the distance between all pairs of vectors in a given collection of
DFITPACK functions. points or between all pairs from two collections of points.
io special
A collection of functions and classes for reading and writing The name comes from the class of functions traditionally known
Matlab (https://www.mathworks.com/products/matlab.html), as special functions, but over time, the subpackage has grown to
IDL, Matrix Market120, Fortran, NetCDF121, Harwell-Boeing122, include functions beyond the classical special functions. A more
WAV and ARFF data files. appropriate characterization of this subpackage is simply useful
linalg functions. It includes a large collection of the classical special
Linear algebra functions, including elementary functions of functions such as Airy, Bessel and others; families of orthogonal
a matrix, such as the trace, determinant, norm and condition polynomials; the Gamma function, and functions related to it;
number; basic solver for Ax = b; specialized solvers for Toeplitz functions for computing the PDF, CDF and quantile function for
matrices, circulant matrices, triangular matrices and other several probability distributions; information theory functions;
structured matrices; least-squares solver and pseudo-inverse combinatorial functions comb and factorial; and more.
calculations; eigenvalue and eigenvector calculations (basic and stats
generalized); matrix decompositions, including Cholesky, Schur, The stats subpackage provides a large collection of continuous
Hessenberg, LU, LDLT, QR, QZ, singular value and polar; and and discrete probability distributions, each with methods to
functions to create specialized matrices, such as diagonal, Toeplitz, compute the PDF or PMF, CDF, moments and other statistics,
Hankel, companion, Hilbert and more. generation of random variates and more; statistical tests, including
ndimage tests on equality of means/medians/variance (such as the t-test) and
This subpackage contains various functions for multi-dimensional tests whether a sample is drawn from a certain distribution (such as
image processing, including convolution and assorted linear the Kolmogorov-Smirnov test); measures of correlation, including
and nonlinear filters (Gaussian filter, median filter, Sobel filter Pearson’s r, Kendall’s τ, and Spearman’s ρ coefficients; descriptive
and others); interpolation; region labeling and processing; and statistics including trimmed values; kernel density estimation; and
mathematical morphology functions. transformations of data such as the Box-Cox power transformation.

Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods 263


Perspective NaTure MeThodS

SciPy matures. By the middle to late 2000s, SciPy was starting to are discussed in the “Key technical improvements” section below,
mature after a long phase of significant growth and adoption. The and milestones in its history are highlighted in Fig. 1.
scope of the SciPy library narrowed, while the breadth of the eco-
system grew through a new type of auxiliary package: the scikit Architecture and implementation choices
(https://www.scipy.org/scikits.html), a complementary library Project scope. SciPy provides fundamental algorithms for scientific
developed outside SciPy, allowing for more rapid exploration of computing. The breadth of its scope was derived from the guide to
experimental ideas while maintaining familiar style and develop- available mathematical software (GAMS) classification system44. In
ment methodology. In SciPy itself, tooling, development, documen- areas that move relatively slowly, for example, linear algebra, SciPy
tation and release processes became more professional. The library aims to provide complete coverage. In other areas it aims to pro-
was expanded carefully, with the patience affordable in open-source vide fundamental building blocks while interacting well with other
projects and via best practices, which are increasingly common in packages specialized in that area. For example, SciPy provides what
the scientific Python ecosystem and industry30. one expects to find in a statistics textbook (probability distribu-
Early versions of SciPy had minimal documentation, but this tions, hypothesis tests, frequency statistics, correlation functions,
began to change with the 2006 release of a Guide to NumPy1. In 2007, and more), whereas Statsmodels45 provides more advanced statisti-
Sphinx31 made it possible to render hypertext and PDF documents cal estimators and inference methods, scikit-learn3 covers machine
automatically from plain text (docstrings) interspersed with Python learning, and PyMC346, emcee47 and PyStan (http://mc-stan.org)
code, and in 2008, pydocweb32 enabled collaborative documenta- cover Bayesian statistics and probabilistic modeling. scikit-image4
tion development in a wiki-like fashion. The SciPy Documentation provides image processing capabilities beyond SciPy’s ndimage,
Project33,34 used these tools to complete documentation of SciPy’s SymPy48 provides a Python interface for symbolic computation, and
user-facing functionality: offering t-shirts to contributors from sparse.csgraph and spatial offer basic tools for working
around the world in exchange for high-quality text, it collected con- with graphs and networks compared to specialized libraries such as
tributions from over 75 people to produce an 884-page manual35. NetworkX49.
Since then, SciPy has remained committed to maintaining high- We use the following criteria to determine whether to include
quality documentation as part of the normal development cycle. new functionality in SciPy:
In the early SciPy workshops, recurrent topics reflected the state
of development, with emphasis being placed on the underlying • The algorithm is of relevance to multiple fields of science.
array package, plotting, parallel processing, acceleration/wrapping • The algorithm is demonstrably important. For example, it is
and user interfaces. By 2004, presentations about the application of classic enough to be included in textbooks, or it is based on a
SciPy to scientific problems began to appear. The event also started peer-reviewed article that has a substantial number of citations.
to draw in more keynote speakers from outside the community,
such as Guido van Rossum (creator of Python, 2006), Ivan Krstić In terms of software systems and architecture, SciPy’s scope
(One Laptop per Child, 2007), Alex Martelli (Google, 2008) and matches NumPy’s: algorithms for in-memory computing on single
Peter Norvig (Google Research, 2009). The informal workshop grew machines, with support for a wide range of data types and process
from a small gathering of core developers into an international con- architectures. Distributed computing and support for graphics pro-
ference with hundreds of attendees, increased funding, a published cessing units (GPUs) were explicitly out of scope at the 1.0 release
proceedings and scholarships for attending students. By 2010, the point, but this has been revised in our roadmap (see Discussion).
US SciPy conference had multiple tracks, and satellite conferences
were being organized by volunteers elsewhere, such as EuroSciPy Language choices. According to analysis using the linguist
(since 2008) and SciPy India (since 2009). Special sessions and library (https://github.com/github/linguist), SciPy is approximately
minisymposia dedicated to scientific Python began appearing at 50% Python, 25% Fortran, 20% C, 3% Cython and 2% C++, with a
many other events. For example, a three-part minisymposium orga- dash of TeX, Matlab, shell script and Make. The distribution of sec-
nized for International Conferences on Computational Science and ondary programming languages in SciPy is a compromise between
Engineering (CSE) 2009 was featured in SIAM News36. a powerful, performance-enhancing language that interacts well
In 2007, Python had a strong enough presence in science and with Python (that is, Cython) and the usage of languages (and
engineering that the editors of IEEE Computing in Science and their libraries) that have proven reliable and performant over many
Engineering solicited a special issue about Python in science37, decades.
edited by Paul Dubois. However, Python was still sufficiently niche Fortran, despite its age, is still a high-performance scientific pro-
that the average reader would need additional information to gramming language with continued contemporary usage50. Thus,
decide whether it would be useful in their own work. The follow- we wrap the following excellent, field-tested Fortran libraries to
up March/April 2011 Python for Scientists and Engineers special provide Python convenience while benefiting from their perfor-
issue38 focused more on the core parts of the scientific Python eco- mance: QUADPACK51 and ODEPACK52 for numerical integration
system39 including NumPy2, Cython40 and Mayavi41. Python became and solution of initial value problems; FITPACK53, ODRPACK54
so pervasive that journals began publishing domain-specific special and MINPACK55 for curve-fitting and least-squares minimization;
issues. For example, in 2015, Frontiers in Neuroinformatics pub- FFTPACK56,57 for performing Fourier transforms; ARPACK58 for
lished a collection of 25 articles—covering topics including model- solving eigenvalue problems; ALGORITHM 644 (ref. 59) for com-
ing and simulation, data collection, electrophysiology, visualization puting Bessel functions; and CDFLIB60 for evaluating cumulative
as well as stimulus generation and presentation—called Python in density functions.
Neuroscience42. Rounding out the top three languages in SciPy is C, which is
also extremely well-established over several decades61 of scientific
SciPy today. As of February 2019, the SciPy library consists of computing. The C libraries that we wrap in SciPy include trlib62 for
nearly 600,000 lines of open-source code organized in 16 subpack- optimization, SuperLU63,64 for solving sparse linear systems, Qhull65
ages summarized in Box 2. The development team and community for computational geometry and Cephes (http://www.netlib.org/
currently interact and operate primarily on GitHub, an online ver- cephes/) for special functions.
sion control and task management platform. Over 110,000 GitHub Cython has been described as a creole language that mixes the
repositories and 6,500 packages depend on SciPy43. Some of the best parts of Python and lower-level C/C++ paradigms40. We often
major feature highlights from the three years preceding SciPy 1.0 use Cython as a glue between well-established, low-level scientific

264 Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods


NaTure MeThodS Perspective
scipy.sparse.csgraph
Development moves scipy.optimize.minimize
SciPy 0.1 released Creation of SciKits to GitHub SciPy 1.0 released

2005 2008 2010 2013 2015


2001 2007 2011 2012 2017
Transition to NumPy Move to 6-month Continuous integration Cython interface
First Cython code, release cycles with TravisCI for BLAS/LAPACK
scipy.spatial

Fig. 1 | Major milestones from SciPy’s initial release in 2001 to the release of SciPy 1.0 in 2017. Note that SciKits and GitHub have been introduced in
the Background section; more information about Cython and SciPy subpackages (for example, scipy.sparse) is available in the ‘Architecture and
implementation choices’ section, BLAS/LAPACK support is detailed in the ‘Key technical improvements’ section, and continuous integration is discussed
in the ‘Test and benchmark suite’ section.

computing libraries written in C/C++ and the Python inter- 5.2.1, enhancing the low-level implementations leveraged by a sub-
face offered by SciPy. We also use Cython to enable performance set of our sparse offerings.
enhancements in Python code, especially for cases where heavily From a new features standpoint, scipy.sparse matrices
used inner loops benefit from a compiled code with static typing. and linear operators now support the Python matrix multiplication
For implementing new functionality, Python is the still the lan- (@) operator. We added scipy.sparse.norm and scipy.
guage of choice. If Python performance is an issue, then we prefer sparse.random for computing sparse matrix norms and draw-
the use of Cython followed by C, C++ or Fortran (in that order). The ing random variates from arbitrary distributions, respectively. Also,
main motivation for this is maintainability: Cython has the highest we made a concerted effort to bring the scipy.sparse API into
abstraction level, and most Python developers will understand it. C line with the equivalent NumPy API where possible.
is also widely known, and easier for the current core development
team to manage than C++ and especially Fortran. cKDTree. The scipy.spatial.ckdtree module, which
The position that SciPy occupies near the foundation of the sci- implements a space-partitioning data structure that organizes
entific Python ecosystem is such that adoption of new languages or points in k-dimensional space, was rewritten in C++ with tem-
major dependencies is generally unlikely; our choices are strongly plated classes. Support was added for periodic boundary conditions,
driven by long-term stability. GPU acceleration, new transpiling which are often used in simulations of physical processes.
libraries and the latest JIT compilation approaches (for example, In 2013, the time complexity of the k-nearest-neighbor search
Numba66) are very powerful but have traditionally fallen outside from cKDTree.query was approximately loglinear68, con-
the remit of the main SciPy library. That said, we have recently sistent with its formal description69. Since then, we enhanced
increased our efforts to support compatibility with some of these cKDTree.query by reimplementing it in C++, removing
options, and our full test suite passed with the PyPy JIT compiler67 memory leaks and allowing release of the global interpreter lock
at the 1.0 release point. (GIL) so that multiple threads may be used70. This generally
improved performance on any given problem while preserving the
API and ABI evolution. The API for SciPy consists of approxi- asymptotic complexity.
mately 1,500 functions and classes. Our policy for evolving the API In 2015, SciPy added the sparse_distance_matrix rou-
over time is that new functionality can be added, while removing tine for generating approximate sparse distance matrices between
or changing existing functionality can only be done if the benefits KDTree objects by ignoring all distances that exceed a user-
exceed the (often significant) costs to users and only after giving provided value. This routine is not limited to the conventional L2
clear deprecation warnings to those users for at least one year. (Euclidean) norm but supports any Minkowski p-norm between
In general, we encourage changes that improve clarity in the API 1 and infinity. By default, the returned data structure is a diction-
of the library but strongly discourage breaking backward compat- ary of keys (DOK)-based sparse matrix, which is very efficient
ibility, given our position near the base of the scientific Python for matrix construction. This hashing approach to sparse matrix
computing stack. assembly can be seven times faster than constructing with CSR
In addition to the Python API, SciPy has C and Cython inter- format71, and the C++ level sparse matrix construction releases
faces. Therefore, we also have to consider the application binary the Python GIL for increased performance. Once the matrix is
interface (ABI). This ABI has been stable for a long time, and we constructed, distance value retrieval has an amortized constant
aim to evolve it only in a backward-compatible way. time complexity72, and the DOK structure can be efficiently con-
verted to a CSR, CSC or COO matrix to allow for speedy arith­
Key technical improvements metic operations.
Here we describe key technical improvements made in the last In 2015, the cKDTree dual tree counting algorithm73 was
three years. enhanced to support weights74, which are essential in many scien-
tific applications, for example, computing correlation functions of
Data structures. Sparse matrices. scipy.sparse offers seven sparse galaxies75.
matrix data structures, also known as sparse formats. The most
important ones are the row- and column-compressed formats (CSR Unified bindings to compiled code. LowLevelCallable. As of SciPy
and CSC, respectively). These offer fast major-axis indexing and version 0.19, it is possible for users to wrap low-level functions in
fast matrix-vector multiplication, and are used heavily throughout a scipy.LowLevelCallable object that reduces the over-
SciPy and dependent packages. head of calling compiled C functions, such as those generated using
Over the last three years, our sparse matrix handling internals Numba or Cython, directly from Python. Supported low-level func-
were rewritten and performance was improved. Iterating over and tions include PyCapsule objects, ctypes function pointers and
slicing of CSC and CSR matrices is now up to 35% faster, and the cffi function pointers. Furthermore, it is possible to generate a
speed of coordinate (COO)/diagonal (DIA) to CSR/CSC matrix for- low-level callback function automatically from a Cython module
mat conversions has increased. SuperLU63 was updated to version using scipy.LowLevelCallable.from_cython.

Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods 265


Perspective NaTure MeThodS

Table 1 | Optimization methods from minimize


Nelder- Powell COBYLA CG BFGS L-BFGS-G SLSQP TNC Newton- dogleg trust- trust- trust-
Mead CG ncg exact Krylov
Version added 0.6* 0.6* 0.6* 0.6* 0.6* 0.6* 0.9 0.6* 0.6* 0.13 0.13 0.19 1.0
Wrapper ✓ ✓ ✓ ✓ ✓
First derivatives ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
Second derivatives ~ ~ ~ ✓ ✓ ✓ ✓ ✓ ✓
Iterative Hessian ✓ ✓ ✓ ✓
factorization
Local convergence L S L S S* S* Q S* Q S*
Global convergence ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓ ✓
Trust region Neither LS TR LS LS LS LS LS LS TR TR TR TR
Bound constraints ✓ ✓ ✓ ✓
Equality constraints ✓
Inequality constraints ✓ ✓
References 98,99 100 101–103 104,105 105 106,107 108–111 112 105 105,113 105,114 115,116 62,117

n n
Optimization methods from minimize, which solves problems of the form minx f (x) , where x ∈ R and f : R → R. ‘Version added’ specifies the algorithm’s first appearance in SciPy. Algorithms with
version added “0.6*” were added in version 0.6 or before. ‘Wrapper’ indicates whether the implementation available in SciPy wraps a function written in a compiled language (for example, C or FORTRAN).
‘First derivatives’ and ‘second derivatives’ indicate whether first or second order derivatives are required. When ‘second derivatives’ is flagged with ‘~’, the algorithm accepts but does not require second-
order derivatives from the user; it computes an approximation internally and uses it to accelerate method convergence. ‘Iterative Hessian factorization’ denotes algorithms that factorize the Hessian in an
iterative way, which does not require explicit matrix factorization or storage of the Hessian. ‘Local convergence’ gives a lower bound on the rate of convergence of the iteration sequence once the iterate
is sufficiently close to the solution: linear (L), superlinear (S) and quadratic (Q). Convergence rates denoted S* indicate that the algorithm has a superlinear rate for the parameters used in SciPy, but can
achieve a quadratic convergence rate with other parameter choices. ‘Global convergence’ is marked for the algorithms with guarantees of convergence to a stationary point (that is, a point x* for which
∇f (x *) = 0); this is not a guarantee of convergence to a global minimum. ‘Lines-search’ (LS) or ‘trust-region’ (TR) indicates which of the two globalization approaches is used by the algorithm. The table
also indicates which algorithms can deal with constraints on the variables. We distinguish among bound constraints ( x l ≤ x ≤ x u), equality constraints (ceq (x) = 0) and inequality constraints (cineq (x) ≥ 0).

Cython bindings for BLAS, LAPACK and special. SciPy has interface across all supported LAPACK versions. The standard
provided special functions and leveraged basic linear algebra subpro- BLAS interface provided by the various existing BLAS libraries is
grams (BLAS) and linear algebra package (LAPACK)76 routines for not currently changing, so changes are not generally needed in the
many years. SciPy now additionally includes Cython40 wrappers for wrappers provided by SciPy. Changes to the Cython wrappers for
many BLAS and LAPACK routines (added in 2015) and the special the functions in scipy.special follow corresponding changes
functions provided in the scipy.special subpackage (added in to the interface of that subpackage.
2016), which are available in scipy.linalg.cython_blas,
scipy.linalg.cython_lapack and scipy.special. Numerical optimization. The scipy.optimize subpackage
cython_special, respectively. When writing algorithms in provides functions for the numerical solution of several classes of
Cython, it is typically more efficient to call directly into the librar- root finding and optimization problems. Here we highlight recent
ies SciPy wraps rather than indirectly, using SciPy’s Python APIs. additions through SciPy 1.0.
These low-level interfaces for Cython can also be used outside of
the SciPy codebase to gain access to the functions in the wrapped Linear optimization. A new interior-point optimizer for continu-
libraries while avoiding the overhead of Python function calls. This ous linear programming problems, linprog with method =
can give performance gains of one or two orders of magnitude for ’interior-point’, was released with SciPy 1.0. Implementing
many use cases. the core algorithm of the commercial solver MOSEK78, it solves all
Developers can also use the low-level Cython interfaces without of the 90+ NETLIB LP benchmark problems79 tested. Unlike some
linking against the wrapped libraries77. This lets other extensions interior point methods, this homogeneous self-dual formulation pro-
avoid the complexity of finding and using the correct libraries. vides certificates of infeasibility or unboundedness as appropriate.
Avoiding this complexity is especially important when wrapping A presolve routine80 solves trivial problems and otherwise per-
libraries written in Fortran. Not only can these low-level wrappers forms problem simplifications, such as bound tightening and
be used without a Fortran compiler, they can also be used without removal of fixed variables, and one of several routines for elimi-
having to handle all the different Fortran compiler ABIs and name nating redundant equality constraints is automatically chosen
mangling schemes. to reduce the chance of numerical difficulties caused by singular
Most of these low-level Cython wrappers are generated automat- matrices. Although the main solver implementation is pure Python,
ically to help with both correctness and ease of maintenance. The end-to-end sparse matrix support and heavy use of SciPy’s compiled
wrappers for BLAS and LAPACK are primarily generated using type linear system solvers—often for the same system with multiple right
information that is parsed from the BLAS and LAPACK source files hand sides owing to the predictor-corrector approach—provide
using F2PY19, though a small number of routines use hand-written speed sufficient for problems with tens of thousands of variables
type signatures instead. The input and output types of each routine and constraints.
are saved in a data file that is read at build time and used to generate
the corresponding Cython wrapper files. The wrappers in scipy. Nonlinear optimization: local minimization. The minimize func-
special.cython_special are also generated from a data file tion provides a unified interface for finding local minima of nonlin-
containing type information for the wrapped routines. ear optimization problems. Four new methods for unconstrained
Since SciPy can be built with LAPACK 3.4.0 or later, Cython wrap- optimization were added to minimize in recent versions of SciPy:
pers are only provided for the routines that maintain a consistent dogleg, trust-ncg, trust-exact and trust-krylov.

266 Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods


NaTure MeThodS Perspective
600,000 stats.multinomial (used, for example, in natural language
processing83).

Polynomial interpolators. Historically, SciPy relied heavily on the


venerable FITPACK Fortran library by P. Dierckx53,84 for univariate
400,000
interpolation and approximation of data, but the original mono-
Lines of code

lithic design and API for interaction between SciPy and FITPACK
was limiting for both users and developers.
Implementing a new, modular design of polynomial interpola-
200,000 tors was spread over several releases. The goals of this effort were
to have a set of basic objects representing piecewise polynomials, to
implement a collection of algorithms for constructing various inter-
polators, and to provide users with building blocks for constructing
additional interpolators.
At the lowest level of the new design are classes that represent
.1

.3

.1

.1

.1

.1

.1

.1

.0
univariate piecewise polynomials: PPoly (SciPy 0.13)85, BPoly
2

.0
.1

.1

.1

.1

.1

.1

.1

.1

v1
v0

v0

v0

v0

v0

v0

v0

v0
(SciPy 0.13) and BSpline (SciPy 0.19)86, which allow efficient
Compiled (covered) Compiled (uncovered) Python (covered) Python (uncovered) vectorized evaluations, differentiation, integration and root-find-
ing. PPoly represents piecewise polynomials in the power basis
Fig. 2 | Python and compiled code volume in SciPy over time. in terms of breakpoints and coefficients at each interval. BPoly is
similar and represents piecewise polynomials in the Bernstein basis
(which is suitable, for example, for constructing Bézier curves).
All are trust-region methods that build a local model of the objec- BSpline represents spline curves, that is, linear combinations of
tive function based on first and second derivative information, B-spline basis elements87.
approximate the best point within a local ‘trust region’ and iterate In the next layer, these polynomial classes are used to construct
until a local minimum of the original objective function is reached, several common ways of interpolating data: CubicSpline (SciPy
but each has unique characteristics that make it appropriate for 0.18)88 constructs a twice differentiable piecewise cubic function,
certain types of problems. For instance, trust-exact achieves Akima1DInterpolator and PCHIPInterpolator imple-
fast convergence by solving the trust-region subproblem almost ment two classic prescriptions for constructing a C1 continuous
exactly, but it requires the second derivative Hessian matrix to be monotone shape-preserving interpolator89,90.
stored and factored every iteration, which may preclude the solu-
tion of large problems (≥1,000 variables). In contrast, trust-ncg Test and benchmark suite. Test suite. Test-driven development has
and trust-krylov are well suited to large-scale optimization been described as a way to manage fear and uncertainty when mak-
problems because they do not need to store and factor the Hessian ing code changes91. For each component of SciPy, we write multiple
explicitly, instead using second derivative information in a faster, small executable tests that verify its intended behavior. The collec-
approximate way. We compare the characteristics of all minimize tion of these, known as a ‘test suite’, increases confidence in the cor-
methods in detail in Table 1, which illustrates the level of complete- rectness and accuracy of the library, and allows us to make code
ness that SciPy aims for when covering a numerical method or topic. modifications known not to alter desired behavior. According to
the practice of continuous integration92, all proposed contributions
Nonlinear optimization: global minimization. As minimize may to SciPy are temporarily integrated with the master branch of the
return any local minimum, some problems require the use of a library before the test suite is run, and all tests must be passed before
global optimization routine. The new scipy.optimize.dif- the contribution is permanently merged. Continuously monitoring
ferential_evolution function81,82 is a stochastic global opti- the number of lines of code in SciPy covered by unit tests is one way
mizer that works by evolving a population of candidate solutions. we maintain some certainty that changes and new features are cor-
In each iteration, trial candidates are generated by combination rectly implemented.
of candidates from the existing population. If the trial candidates The SciPy test suite is orchestrated by a continuous integration
represent an improvement, then the population is updated. Most matrix that includes POSIX and Windows (32/64-bit) platforms
recently, the SciPy benchmark suite gained a comprehensive set of managed by Travis CI and AppVeyor, respectively. Our tests cover
196 global optimization problems for tracking the performance of Python versions 2.7, 3.4, 3.5, 3.6, and include code linting with
existing solvers over time and for evaluating whether the perfor- pyflakes and pycodestyle. There are more than 13,000 unit
mance of new solvers merits their inclusion in the package. tests in the test suite, which is written for usage with the pytest
(https://docs.pytest.org/en/latest) framework. In Fig. 2, we show
Statistical distributions. The scipy.stats subpackage con- historical test coverage data generated using a Docker-based
tains more than 100 probability distributions: 96 continuous and 13 approach (https://github.com/tylerjereddy/scipy-cov-track). With
discrete univariate distributions, and 10 multivariate distributions. the exception of the removal of ∼61,000 lines of compiled code for
The implementation relies on a consistent framework that provides SciPy v0.14, the volume of both compiled (C, C++ and Fortran)
methods to sample random variates, to evaluate the cumulative and Python code has increased between releases, as have the num-
distribution function (CDF) and the probability density function ber of lines covered by unit tests. Test coverage at the SciPy 1.0
(PDF), and to fit parameters for every distribution. Generally, the release point was at 87% for Python code according to pytest-
methods rely on specific implementations for each distribution, cov (https://pypi.org/project/pytest-cov/). Coverage of compiled
such as a closed-form expression of the CDF or a sampling algo- (C, C++ and Fortran) code was only 45% according to gcov
rithm, if available. Otherwise, default methods are used based on (https://gcc.gnu.org/onlinedocs/gcc/Gcov.html), but the compiled
generic code, for example, numerical integration of the PDF to codebase is much more robust than this figure would suggest as
obtain the CDF. Key recent distributions added to scipy.stats the figure does not correct for the inclusion of reputable vendor
include the histogram-based distribution in scipy.stats. code, the original library of which is well-tested; generated code, for
rv_histogram and the multinomial distribution in scipy. which full coverage is impractical; and deprecated code, which does

Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods 267


Perspective NaTure MeThodS

cKDTree introduced
1.0 cKDTree fully cythonized

cKDTree rewritten in C++

SciPy 1.0 released

0.1
Execution time (s)

m = 16
m=8
m=3

0.01

0.001
2009 2010 2011 2012 2013 2014 2015 2016 2017 2018

Commit date

Fig. 3 | Results of the scipy.spatial.cKDTree.query benchmark from the introduction of cKDTree to the release of SciPy 1.0. The benchmark
generates a k-d tree from uniformly distributed points in an m-dimensional unit hypercube, then finds the nearest (Euclidean) neighbor in the tree for each
of 1,000 query points. Each marker in the figure indicates the execution time of the benchmark for a commit in the master branch of SciPy.

not require unit tests. Documentation for the code is automatically a code of conduct committee and outlines procedures for the com-
built and published by the CircleCI service to facilitate evaluation of mittee’s response. Our diversity statement “welcomes and encour-
documentation changes/integrity. ages participation by everyone.”

Benchmark suite. In addition to ensuring that unit tests are pass- Maintainers and contributors. The SciPy project has ~100 unique
ing, it is important to confirm that the performance of the SciPy contributors for every 6-month release cycle. Anyone with the inter-
codebase improves over time. Since February 2015, the perfor- est and skills can become a contributor; the SciPy contributor guide
mance of SciPy has been monitored with Airspeed Velocity (asv (https://scipy.github.io/devdocs/dev/contributor/contributor_toc.
https://github.com/airspeed-velocity/asv). SciPy’s run.py script html) provides guidance on how to do that. In addition, the project
conveniently wraps asv features such that benchmark results over currently has 15 active (volunteer) maintainers: people who review
time can be generated with a single console command. For exam- the contributions of others and do everything else needed to ensure
ple, in Fig. 3 we illustrate the improvement of scipy.spatial. that the software and the project move forward. Maintainers are
cKDTree.query over roughly nine years of project history. The critical to the health of the project93; their skills and efforts largely
tree used in the benchmark was generated without application determine how fast the project progresses, and they enable input
of toroidal topology (boxsize = None), and tests were per- from the much larger group of contributors. Anyone can become
formed by Airspeed Velocity 0.4 using Python 2.7, NumPy 1.8.2 and a maintainer, too, as they are selected on a rolling basis from con-
Cython versions 0.27.3, 0.21.1 and 0.18 (for improved backward tributors with a substantial history of high-quality contributions.
compatibility). Substantial performance improvements were real-
ized when cKDTree was fully Cythonized and again when it was Funding. The development cost of SciPy is estimated in excess
rewritten in C++. of 10 million dollars by Open Hub (https://www.openhub.net/p/
scipy/estimated_cost). Yet the project is largely unfunded, having
Project organization and community been developed predominantly by graduate students, faculty and
Governance. SciPy adopted an official governance document members of industry in their free time. Small amounts of funding
(https://docs.scipy.org/doc/scipy/reference/dev/governance/gover- have been applied with success: some meetings were sponsored by
nance.html) on August 3, 2017. A steering council, currently com- universities and industry, Google’s Summer of Code program sup-
posed of 18 members, oversees daily development of the project by ported infrastructure and algorithm work, and teaching grant funds
contributing code and reviewing contributions from the commu- were used early on to develop documentation. However, funding
nity. Council members have commit rights to the project reposi- from national agencies, foundations and industry has not been
tory, but they are expected to merge changes only when there are no commensurate with the enormous stack of important software that
substantive community objections. The chair of the steering coun- relies on SciPy. More diverse spending to support planning, devel-
cil, Ralf Gommers, is responsible for initiating biannual technical opment, management and infrastructure would help SciPy remain
reviews of project direction and summarizing any private council a healthy underpinning of international scientific and industrial
activities to the broader community. The project’s benevolent dicta- endeavors.
tor for life, Pauli Virtanen, has overruling authority on any matter,
but is expected to act in good faith and only exercise this authority Downstream projects. The scientific Python ecosystem includes
when the steering council cannot reach agreement. many examples of domain-specific software libraries building on
SciPy’s official code of conduct was approved on October 24, top of SciPy features and then returning to the base SciPy library
2017. In summary, there are five specific guidelines: be open to to suggest and even implement improvements. For example, there
everyone participating in our community; be empathetic and are common contributors to the SciPy and Astropy core libraries94,
patient in resolving conflicts; be collaborative, as we depend on each and what works well for one of the codebases, infrastructures or
other to build the library; be inquisitive, as early identification of communities is often transferred in some form to the other. At
issues can prevent serious consequences; and be careful with word- the codebase level, the binned_statistic functionality is
ing. The code of conduct specifies how breaches can be reported to one such cross-project contribution: it was initially developed in

268 Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods


NaTure MeThodS Perspective
of SciPy usage tutorials beyond our current 15 section offering.
Table 2 | Summary of SciPy Roadmap items following 1.0
Also, the low-level Cython code in our library (which interacts with
release
C-level code and exposes it for Python usage) could use some mea-
SciPy subpackage Summary of change sure of modernization, including migration to typed memoryviews
optimize A few more high-quality global optimizers
to handle NumPy arrays.
A problem faced by many open-source projects is attracting and
fftpack Reduce overlap with NumPy equivalent retaining developers. Although it is normal for some individuals
linalg Reduce overlap with NumPy equivalent to contribute to a project for a while and then move on, too much
interpolate New spline fitting and arithmetic routines turnover can result in the loss of institutional memory, leading to
mistakes of the past being repeated, APIs of new code becoming
interpolate New transparent tensor-product splines
inconsistent with the old code and a drifting project scope. We are
interpolate New non-uniform rational B-splines fortunate that the SciPy project continues to attract enthusiastic and
interpolate Mesh refinement and coarsening of B-splines and competent new developers while maintaining the involvement of
tensor products a small but dedicated old guard. There are contributors who were
signal Migrate spline functionality to interpolate present in the early years of the project who still contribute to dis-
cussions of bug reports and reviews of new code contributions. Our
signal Second order sections update to match
capabilities in other routines
benevolent dictator for life has been with the project for more than
10 years and is still actively contributing code, and the head of our
linalg Support a more recent version of LAPACK steering council, who also acts as a general manager, is approach-
ndimage Clarify usage of the ‘data point’ coordinate model, ing his eleventh anniversary. An additional half dozen or so active
and add additional wrapping modes developers have been contributing steadily for five or more years.
sparse Incorporate sparse arrays from Sparse package118 The combination of a committed old guard and a host of new con-
sparse.linalg Add PROPACK wrappers for faster SVD
tributors ensures that SciPy will continue to grow while maintaining
a high level of quality.
spatial Add support for (quaternion) rotation matrices A final important challenge to address is the accommodation of
special Precision improvements for hypergeometric, GPU and distributed computing without disrupting our conven-
parabolic cylinder and spheroidal wave functions tional and heavily used algorithm/API infrastructure. Although the
exact approach we will adopt across the entire library to leverage
these emerging technologies remains unclear, and was not a priority
an Astropy-affiliated package and then placed in SciPy afterward. at the 1.0 release point, we now have a concrete implementation of
In this perspective, SciPy serves as a catalyst for cross-fertilization a subpackage that allows for the experimental use of multiple back-
throughout the Python scientific computing community. ends, such as GPU-tractable data structures, in the new scipy.
fft. This will be described in detail in a future report.
Discussion
SciPy has a strong developer community and a massive user base. Reporting Summary. Further information on research design
GitHub traffic metrics report roughly 20,000 unique visitors to the is available in the Nature Research Reporting Summary linked to
source website between 14 May 2018 and 27 May 2018 (near the this article.
time of writing), with 721 unique copies (‘clones’) of the codebase
over that time period. The developer community at that time con- Data availability
sisted of 610 unique contributors of source code, with more than Raw data for Fig. 2 are available at https://github.com/tylerjereddy/
19,000 commits accepted into the codebase (GitHub page data). scipy-cov-track, and raw data for Fig. 3 are available at https://
From the user side, there were 13,096,468 downloads of SciPy github.com/scipy/scipy-articles/tree/master/scipy-1.0/supporting_
from the Python Packaging Index (PyPI)95 and 5,776,017 via the info/asv_bench/cKDTree.
default channel of the conda (https://github.com/ContinuumIO/
anaconda-package-data) package manager during the year 2017. Code availability
These numbers establish a lower bound on the total number of All SciPy library source code is available in the SciPy GitHub repos-
downloads by users given that PyPI and conda are only two of itory, https://github.com/scipy/scipy.
several popular methods for installing SciPy. The SciPy website
(http://www.scipy.org/), which has been the default citation in the Received: 28 July 2019; Accepted: 14 November 2019;
absence of a peer-reviewed paper, has been cited over 3,000 times Published online: 3 February 2020
(https://scholar.google.com/scholar?cites=2086009121748039507).
Some of the most prominent uses of or demonstrations of cred- References
ibility for SciPy include the LIGO-Virgo scientific collaboration 1. Oliphant, T.E. Guide to NumPy 1st edn (Trelgol Publishing USA, 2006).
2. van derWalt, S., Colbert, S. C. & Varoquaux, G.The NumPy array: a
that lead to the observation of gravitational waves96, the fact that structure for efficient numerical computation. Comput. Sci. Eng. 13,
SciPy is shipped directly with macOS and in the Intel distribution 22–30 (2011).
for Python97, and that SciPy is used by 47% of all machine learning 3. Pedregosa, F.et al.Scikit-learn: machine learning in Python. J. Mach. Learn.
projects on GitHub (https://github.blog/2019-01-24-the-state-of- Res. 12, 2825–2830 (2011).
the-octoverse-machine-learning/). 4. van derWalt, S.et al.scikit-image: image processing in Python. Peer J. 2,
e453 (2014).
Nevertheless, SciPy continually strives to improve. The SciPy 5. Nitz, A. et al. gwastro/pycbc: PyCBC v1.13.2 release, https://doi.
Roadmap (https://docs.scipy.org/doc/scipy-1.0.0/reference/road- org/10.5281/zenodo.1596771 (27 November 2018).
map.html, https://scipy.github.io/devdocs/roadmap.html), sum- 6. Vallisneri, M., Kanner, J., Williams, R., Weinstein, A. & Stephens, B.The
marized in Table 2, is a continually updated document maintained LIGO Open Science Center. J. Phys. Conf. Ser. 610, 012021 (2015).
7. Abbott, B. P.et al.GW150914: First results from the search for binary
by the community that describes some of the major directions for
black hole coalescence with Advanced LIGO. Phys. Rev. D. 93,
improvement for the project, as well as specific limitations and 122003 (2016).
matters that require assistance moving forward. In addition to the 8. Abbott, B. P.et al.GW170817: observation of gravitational waves from a
items on the roadmap, we are still working to increase the number binary neutron star inspiral. Phys. Rev. Lett. 119, 161101 (2017).

Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods 269


Perspective NaTure MeThodS
9. The Event Horizon Telescope Collaboration.et al.First M87 event horizon 43. GitHub. Network dependents - scipy/scipy, https://github.com/scipy/scipy/
telescope results. III. Data processing and calibration. Astrophys. J. Lett. 875, network/dependents (2019).
L3 (2019). 44. Boisvert, R. F., Howe, S. E. & Kahaner, D. K.The guide to available
10. Blanton, K. At Mathworks, support + fun = success: CEO Jack Little mathematical software problem classification system. Commun. Stat. Simul.
believes in power of his workers–and their ideas. The Boston Globe, J5 Comput 20, 811–842 (1991).
(20 April 1997). 45. Seabold, S. & Perktold, J. Statsmodels: econometric and statistical
11. Howell, D. Jack Dangermond’s digital mapping lays it all out. Investor’s modeling with Python. In Proceedings of the 9th Python in Science
Business Daily (14 August 2009). Conference 57–61 (2010).
12. Port, O. Simple solutions. BusinessWeek, 24–24 (3 October 2005). 46. Salvatier, J., Wiecki, T. V. & Fonnesbeck, C.Probabilistic programming in
13. van Rossum, G. Python/C API Reference Manual, http://citeseerx.ist.psu. Python using PyMC3. PeerJ Comput. Sci. 2, e55 (2016).
edu/viewdoc/download?doi=10.1.1.211.6702&rep=rep1&type=pdf (2001). 47. Foreman-Mackey, D., Hogg, D. W., Lang, D. & Goodman, J. emcee: the
14. Hugunin, J. The matrix object proposal (very long), https://mail.python.org/ MCMC hammer. Publ. Astron. Soc. Pac.125, 306–312 (2013).
pipermail/matrix-sig/1995-August/000002.html (18 August 1995). 48. Meurer, A.et al.SymPy: symbolic computing in Python. PeerJ Comput. Sci.
15. Hugunin, J. Extending Python for numerical computation, http://hugunin. 3, e103 (2017).
net/papers/hugunin95numpy.html (1995). 49. Hagberg, A. A., Schult, D. A. & Swart, P. J. Exploring network structure,
16. Oliphant, T. E. Moving forward from the last decade of SciPy. Presentation dynamics, and function using NetworkX. In Proceedings of the 7th Python
slides, https://conference.scipy.org/scipy2010/slides/travis_oliphant_keynote. in Science Conference. (eds G. Varoquaux, G., Vaught, T. & Millman, K. J.)
pdf (1 July 2010). 11–15 (2008).
17. Oliphant, T. E. Some Python modules. Web Archive, https://web.archive.org/ 50. Koelbel, C.H. & Zosel, M.E. The High Performance FORTRAN Handbook
web/19990125091242/http://oliphant.netpedia.net:80/ (25 January 1999). (MIT Press, 1993).
18. Oliphant, T. E. Modules to enhance Numerical Python. Web Archive, 51. Piessens, R., de Doncker-Kapenga, E., Uberhuber, C.W. & Kahaner, D.K.
https://web.archive.org/web/20001206213500/http://oliphant.netpedia. QUADPACK: A Subroutine Package for Automatic Integration
net:80/ (6 December 2000). (Springer, 1983).
19. Peterson, P.F2PY: a tool for connecting Fortran and Python programs. 52. Hindmarsh, A.C. ODEPACK, a systematized collection of ODE solvers.
Int. J. Comput. Sci. Eng. 4, 296–305 (2009). Scientific Computing 55–64 (1983).
20. Strangman, G. Python modules. Web Archive, https://web.archive.org/ 53. Dierckx, P. Curve and Surface Fitting with Splines (Oxford Univ. Press, 1993).
web/20001022231108/http://www.nmr.mgh.harvard.edu/Neural_Systems_ 54. Boggs, P.T., Byrd, R.H., Rogers, J.E. & Schnabel, R.B. User’s Reference Guide
Group/gary/python.html (2000). for ODRPACK Version 2.01: Software for Weight Orthogonal Distance
21. SciPy Developers. SciPy.org. Web Archive, https://web.archive.org/ Regression (U.S. Department of Commerce, National Institute of Standards
web/20010309040805/http://scipy.org:80/ (2001). and Technology, 1992).
22. Vaught, T. N. SciPy Developer mailing list now online, https://mail.python. 55. Moré. Jorge J., Garbow, B. S. & Hillstrom, K. E. User guide for
org/pipermail/scipy-dev/2001-June/000000.html (2001). MINPACK-1. Report ANL-80–74 (Argonne National Laboratory, 1980).
23. Jones, E. ANN: SciPy 0.10–scientific computing with Python, https://mail. 56. Swarztrauber, P. N. Vectorizing the FFTs. In Parallel Computations
python.org/pipermail/python-list/2001-August/106419.html (2001). (ed. Rodrigue, G.) 51–83 (Academic, 1982).
24. Vaught, T.N. Reference documentation and Tutorial documentation are now 57. Swarztrauber, P. N.FFT algorithms for vector computers. Parallel Comput. 1,
available for download as tarballs. Web Archivehttps://web.archive.org/ 45–63 (1984).
web/20021013204556/http://www.scipy.org:80/scipy/site_content/site_news/ 58. Lehoucq, R.B., Sorensen, D.C. & Yang, C. ARPACK users’ guide: solution of
docs_released1 (2002). large scale eigenvalue problems with implicitly restarted Arnoldi methods.
25. Vaught, T. N. [ANN] SciPy ‘02 - Python for Scientific Computing (Rice University, 1997).
Workshop, https://mail.python.org/pipermail/numpy-discussion/2002- 59. Amos, D. E.Algorithm 644: A portable package for Bessel functions of a
June/001511.html (2002). complex argument and nonnegative order. ACM Trans. Math. Softw. 12,
26. Ascher, D., Dubois, P. F., Hinsen, K., Hugunin, J. & Oliphant, T. E. 265–273 (1986).
An open source project: Numerical Python, https://doi.org/10.5281/ 60. Brown, B., Lovato, J. & Russell, K. CDFLIB, https://people.sc.fsu.
zenodo.3599566 (2001). edu/~jburkardt/f_src/cdflib/cdflib.html (accessed 6 July 2018).
27. Greenfield, P. How Python slithered Into astronomy. Presentation, 61. Kernighan, B. W. & Ritchie, D. M. The C Programming Language 2nd edn
https://conference.scipy.org/scipy2011/slides/greenfield_keynote_astronomy. (Prentice Hall Professional Technical Reference, 1988).
pdf (2011). 62. Lenders, F., Kirches, C. & Potschka, A.trlib: a vector-free implementation
28. Greenfield, P., Miller, J.T., Hsu, J.T. & White, R.L. numarray: a new scientific of the GLTR method for iterative solution of the trust region problem.
array package for Python. PyCon DC (2003). Optim. Methods Softw. 33, 420–449 (2018).
29. NumPy Developers. v1.0, https://github.com/numpy/numpy/releases/tag/ 63. Li, X.S. et al. SuperLU Users’ Guide. Report LBNL-44289 (Lawrence
v1.0 (25 October 2006). Berkeley National Laboratory, 1999).
30. Millman, K. J. & Pérez, F. Developing open-source scientific practice. in 64. Li, X. S.An overview of SuperLU: algorithms, implementation, and user
Implementing Reproducible Research (CRC Press) 149–183 (2014). interface. ACM Trans. Math. Softw. 31, 302–325 (2005).
31. Brandl, G. & the Sphinx team. Sphinx - Python Documentation Generator, 65. Barber, C. B., Dobkin, D. P. & Huhdanpaa, H.The Quickhull algorithm for
http://www.sphinx-doc.org/en/master/ (2007). convex hulls. ACM Trans. Math. Softw. 22, 469–483 (1996).
32. Virtanen, P. et al. pydocweb: a tool for collaboratively documenting Python 66. Lam, S. K., Pitrou, A. & Seibert, S. Numba: A LLVM-based Python JIT
modules via the web. Web Archive, https://code.google.com/archive/p/ compiler. In Proceedings of the Second Workshop on the LLVM Compiler
pydocweb/ (2008). Infrastructure in HPC 7:1–7:6 (ACM, 2015).
33. Harrington, J. The SciPy documentation project. In Proceedings of the 7th 67. Bolz, C. F., Cuni, A., Fijalkowski, M. & Rigo, A. Tracing the meta-level:
Python in Science Conference (eds G. Varoquaux, G., Vaught, T. & Millman, PyPy’s tracing JIT compiler. In Proceedings of the 4th Workshop on the
K. J.) 33–35 (2008). Implementation, Compilation, Optimization of Object-Oriented Languages
34. van der Walt, S. The SciPy documentation project (technical overview). In and Programming Systems 18–25 (ACM, 2009).
Proceedings of the 7th Python in Science Conference (eds G. Varoquaux, G., 68. VanderPlas, J. Benchmarking nearest neighbor searches in Python,
Vaught, T. & Millman, K. J.) 27–28 (2008). https://jakevdp.github.io/blog/2013/04/29/benchmarking-nearest-neighbor-
35. Harrington, J. & Goldsmith, D. Progress report: NumPy and SciPy searches-in-python/ (19 April 2013).
documentation in 2009. In Proceedings of the 8th Python in Science Conference 69. Maneewongvatana, S. & Mount, D. M. Analysis of approximate nearest
(eds Varoquaux, G., van der Walt, S. & Millman, K. J.) 84–87 (2009). neighbor searching with clustered point sets. Preprint at https://arxiv.org/
36. Pérez, F., Langtangen, H. P. & LeVeque, R. Python for scientific computing. pdf/cs/9901013.pdf (1999).
In SIAM Conference on Computational Science and Engineering, 42 (5) (2009). 70. Molden, S. ENH: Enhancements to spatial.cKDTree, https://github.com/
37. Dubois, P. F.Python: batteries included. Comput. Sci. Eng.9, 7–9 (2007). scipy/scipy/pull/4374/ (7 January 2015).
38. Millman, K. J. & Aivazis, M.Python for scientists and engineers. Comput. 71. Aspnas, M., Signell, A. & Westerholm, J. Efficient assembly of sparse
Sci. Eng. 13, 9–12 (2011). matrices using hashing. In Applied Parallel Computing. State of the Art in
39. Pérez, F., Granger, B. E. & Hunter, J. D.Python: an ecosystem for scientific Scientific Computing (eds Kagstrom, B. et al.) 900–907 (Springer, 2007).
computing. Comput. Sci. Eng. 13, 13–21 (2011). 72. Cormen, T. H., Stein, C., Rivest, R. L. & Leiserson, C. E. Introduction to
40. Behnel, S.et al.Cython: the best of both worlds. Comput. Sci. Eng.13, Algorithms 2nd edn (McGraw-Hill Higher Education, 2001).
31–39 (2011). 73. Moore A. W. et al. Fast algorithms and efficient statistics: N-point
41. Ramachandran, P. & Varoquaux, G.Mayavi: 3D visualization of scientific correlation functions. In Mining the Sky. ESO Astrophysics Symposia
data. Comput. Sci. Eng. 13, 40–51 (2011). (European Southern Observatory) (eds Banday, A. J., Zaroubi, S. &
42. Muller, E.et al. Python in neuroscience. Front. Neuroinform. 9, 11 (2015). Bartelmann, M.) 71–82 (Springer, 2001).

270 Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods


NaTure MeThodS Perspective
74. Feng, Y. ENH: Faster count_neighour in cKDTree / + weighted input data 109. Schittkowski, K.The nonlinear programming method of Wilson, Han, and
https://github.com/scipy/scipy/pull/5647 (2015). Powell with an augmented Lagrangian type line search function. Part 2: an
75. Martin, A. M., Giovanelli, R., Haynes, M. P. & Guzzo, L. The clustering efficient implementation with linear least squares subproblems. Numer. Math.
characteristics of HI-selected galaxies from the 40% ALFALFA survey. 38, 115–127 (1982).
Astrophys. J.750, 38 (2012). 110. Schittkowski, K.The nonlinear programming method of Wilson, Han, and
76. Anderson, E. et al. LAPACK Users’ Guide 3rd edn (Society for Industrial Powell with an augmented Lagrangian type line search function. Part 1:
and Applied Mathematics, 1999). convergence analysis. Numer. Math. 38, 83–114 (1982).
77. Henriksen, I. Circumventing the linker: using SciPy’s BLAS and LAPACK 111. Kraft, D. A software package for sequential quadratic programming. Report
within Cython. In Proceedings of the 14th Python in Science Conference DFVLR-FR 88–28 (Deutsche Forschungs- und Versuchsanstalt für Luft-
(SciPy 2015) (eds Huff, K. & Bergstra, J.) 49–52 (2015). und Raumfahrt, 1988).
78. Andersen, E. D. & Andersen, K. D. (2000) The Mosek interior point 112. Nash, S. G.Newton-type minimization via the Lanczos method. SIAM J.
optimizer for linear programming: an implementation of the homogeneous Numer. Anal. 21, 770–788 (1984).
algorithm. In High Performance Optimization 197–232 (Springer, 2000). 113. Powell, M. J. D. A new algorithm for unconstrained optimization. Nonlinear
79. The NETLIB LP test problem set, http://www.numerical.rl.ac.uk/cute/netlib. Programming 31–65 (1970).
html (2019). 114. Steihaug, T.The conjugate gradient method and trust regions in large scale
80. Andersen, E. D. & Andersen, K. D.Presolving in linear programming. Math. optimization. SIAM J. Numer. Anal. 20, 626–637 (1983).
Program. 71, 221–245 (1995). 115. Conn, A.R., Gould, N.I.M. & Toint, P.L. Trust Region Methods (SIAM, 2000).
81. Wormington, M., Panaccione, C., Matney Kevin, M. & Bowen, D. 116. Moré, J. J. & Sorensen, D. C. Computing a trust region step. SIAM J. Sci.
K.Characterization of structures from X-ray scattering data using genetic Statist. Comput. 4, 553–572 (1983).
algorithms. Philos. Trans. R. Soc. Lond. A 357, 2827–2848 (1999). 117. Gould, N. I. M., Lucidi, S., Roma, M. & Toint, P. L.Solving the trust-region
82. Storn, R. & Price, K.Differential evolution — a simple and efficient subproblem using the Lanczos method. SIAM J. Optim. 9, 504–525 (1999).
heuristic for global optimization over continuous spaces. J. Glob. Optim. 11, 118. Abbasi, H. Sparse: a more modern sparse array library. In Proceedings of the
341–359 (1997). 17th Python in Science Conference (eds Akici, F. et al.) 27–30 (2018).
83. Griffiths, T. L. & Steyvers, M.Finding scientific topics. Proc. Natl Acad. Sci. 119. Mohr, P. J., Newell, D. B. & Taylor, B. N.CODATA recommended values of
USA 101(Suppl. 1), 5228–5235 (2004). the fundamental physical constants: 2014. J. Phys. Chem. Ref. Data 45,
84. Dierckx, P. Curve and Surface Fitting with Splines (Oxford Univ. Press, 1993). 043102 (2016).
85. Virtanen, P. ENH: interpolate: rewrite ppform evaluation in Cython, 120. Boisvert, R. F., Pozo, R., Remington, K., Barrett, R. F. & Dongarra, J. J.
https://github.com/scipy/scipy/pull/2885 (2013). Matrix Market: a web resource for test matrix collections. In Quality of
86. Burovski, E. add b-splines, https://github.com/scipy/scipy/pull/3174 Numerical Software 125–137 (Springer, 1997).
(27 December 2013). 121. Rew, R. & Davis, G.NetCDF: an interface for scientific data access.
87. de Boor, C. A Practical Guide to Splines (Springer, 1978). IEEE Comput. Graph. Appl. 10, 76–82 (1990).
88. Mayorov, N. ENH: CubicSpline interpolator, https://github.com/scipy/scipy/ 122. Duff, I.S., Grimes, R.G. & Lewis, J.G. Users’ guide for the Harwell-Boeing
pull/5653 (2 January 2016). sparse matrix collection (release I), http://citeseerx.ist.psu.edu/viewdoc/
89. Fritsch, F. N. & Carlson, R. E.Monotone piecewise cubic interpolation. summary?doi=10.1.1.41.8922 (1992).
SIAM J. Numer. Anal. 17, 238–246 (1980).
90. Akima, H.A new method of interpolation and smooth curve fitting based
on local procedures. J. Assoc. Comput. Mach. 17, 589–602 (1970). Acknowledgements
We thank everyone who has contributed to SciPy 1.0, from those who have posted one
91. Beck, K. Test-driven Development: By Example (Addison-Wesley, 2003).
comment about an issue through those who have made several small patches and beyond.
92. Silver, A.Collaborative software development made easy. Nature 550,
143–144 (2017).
93. Eghbal, N. Roads and Bridges: The Unseen Labor Behind Our Digital Author contributions
Infrastructure (Ford Foundation, 2016). P.V. and R.G. led the development of SciPy for over 10 years. T.E.O., E.J. and P.P. created
94. The Astropy Collaboration.et al.The Astropy Project: building an open- SciPy. T.R. and M.H. composed the manuscript with input from others. Other named
science project and status of the v2.0 core package. Astron. J. 156, 123 (2018). authors are SciPy core developers. All authors have contributed significant code,
95. Lev, O., Dufresne, J., Kasim, R., Skinn, B. & Wilk, J. pypinfo: view PyPI documentation and/or expertise to the SciPy project. All authors reviewed the manuscript.
download statistics with ease, https://github.com/ofek/pypinfo (2018).
96. Abbott, B. P.et al.Observation of gravitational waves from a binary black
hole merger. Phys. Rev. Lett. 116, 061102 (2016). Competing interests
The following statements indicate industry affiliations for authors in the main author list,
97. David Liu. The Intel distribution for Python, https://software.intel.com/
but not for authors in the SciPy 1.0 Contributor group beyond that. These affiliations
en-us/articles/intel-optimized-packages-for-the-intel-distribution-for-python
may have since changed. R.G. was employed by Quansight LLC. T.E.O., E.J. and R.K.
(25 August 2017, updated 30 October 2017, accessed 25 July 2018).
were employed by Enthought, Inc. T.E.O. and I.H. were employed by Anaconda Inc.
98. Nelder, J. A. & Mead, R.A simplex method for function minimization.
N.M. was employed by WayRay LLC. E.W.M. was employed by Bruker Biospin Corp. F.P.
Comput. J. 7, 308–313 (1965).
and P.v.M. were employed by Google LLC.
99. Wright, M. H. Direct search methods: once scorned, now respectable.
Pitman Research Notes in Mathematics Series 191–208 (1996).
100. Powell, M. J. D.An efficient method for finding the minimum of a Additional information
function of several variables without calculating derivatives. Comput. J. 7, Supplementary information is available for this paper at https://doi.org/10.1038/
155–162 (1964). s41592-019-0686-2.
101. Powell, M. J. D. A direct search optimization method that models the
objective and constraint functions by linear interpolation. In Advances in Correspondence should be addressed to R.G., M.H. or T.R.
Optimization and Numerical Analysis (eds Gomez, S. & Hennart, J. P.) Peer review information Rita Strack was the primary editor on this article and managed
51–67 (Springer, 1994). its editorial process and peer review in collaboration with the rest of the editorial team.
102. Powell, M. J. D.Direct search algorithms for optimization calculations. Reprints and permissions information is available at www.nature.com/reprints.
Acta Numerica 7, 287–336 (1998).
103. Powell, M. J. D.A view of algorithms for optimization without derivatives.
Math. Today Bull. Inst. Math. Appl. 43, 170–174 (2007). Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in
104. Polak, E. & Ribiere, G.Note sur la convergence de methodes de directions published maps and institutional affiliations.
conjuguees. Rev. française d’informatique et. de. Rech. op.érationnelle 3, Open Access This article is licensed under a Creative Commons
35–43 (1969). Attribution 4.0 International License, which permits use, sharing, adap-
105. Nocedal, J. & Wright, S. Numerical Optimization 2nd edn (Springer Science tation, distribution and reproduction in any medium or format, as long
& Business Media, 2006). as you give appropriate credit to the original author(s) and the source, provide a link to
106. Byrd, R. H., Lu, P., Nocedal, J. & Zhu, C.A limited memory algorithm for the Creative Commons license, and indicate if changes were made. The images or other
bound constrained optimization. SIAM J. Sci. Comput. 16, 1190–1208 (1995). third party material in this article are included in the article’s Creative Commons license,
107. Zhu, C., Byrd, R. H., Lu, P. & Nocedal, J.Algorithm 778: L-BFGS-B: Fortran unless indicated otherwise in a credit line to the material. If material is not included in
subroutines for large-scale bound-constrained optimization. ACM Trans. the article’s Creative Commons license and your intended use is not permitted by statu-
Math. Softw. 23, 550–560 (1997). tory regulation or exceeds the permitted use, you will need to obtain permission directly
108. Schittkowski, K.On the convergence of a sequential quadratic programming from the copyright holder. To view a copy of this license, visit http://creativecommons.
method with an augmented Lagrangian line search function. Mathematische org/licenses/by/4.0/.
Operationsforschung und Statistik. Ser. Optim. 14, 197–216 (1983). © The Author(s) 2020

Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods 271


Perspective NaTure MeThodS

SciPy 1.0 Contributors

Aditya Vijaykumar40,41, Alessandro Pietro Bardelli42, Alex Rothberg15, Andreas Hilboll43,


Andreas Kloeckner44, Anthony Scopatz2, Antony Lee45, Ariel Rokem46, C. Nathan Woods9,
Chad Fulton47, Charles Masson48, Christian Häggström49, Clark Fitzgerald50, David A. Nicholson51,
David R. Hagen52, Dmitrii V. Pasechnik53, Emanuele Olivetti54, Eric Martin55, Eric Wieser56,
Fabrice Silva57, Felix Lenders58,59,60, Florian Wilhelm61, G. Young17, Gavin A. Price62, Gert-Ludwig Ingold63,
Gregory E. Allen64, Gregory R. Lee65,66, Hervé Audren67, Irvin Probst68, Jörg P. Dietrich69,70,
Jacob Silterra71, James T Webber72, Janko Slavič73, Joel Nothman74, Johannes Buchner75,76,
Johannes Kulick77, Johannes L. Schönberger15, José Vinícius de Miranda Cardoso78, Joscha Reimer79,
Joseph Harrington80, Juan Luis Cano Rodríguez81, Juan Nunez-Iglesias82, Justin Kuczynski83,
Kevin Tritz84, Martin Thoma85, Matthew Newville86, Matthias Kümmerer87, Maximilian Bolingbroke88,
Michael Tartre89, Mikhail Pak90, Nathaniel J. Smith91, Nikolai Nowaczyk92, Nikolay Shebanov93,
Oleksandr Pavlyk94, Per A. Brodtkorb95, Perry Lee96, Robert T. McGibbon97, Roman Feldbauer98,
Sam Lewis99, Sam Tygier100, Scott Sievert101, Sebastiano Vigna102, Stefan Peterson15, Surhud More103,104,
Tadeusz Pudlik105, Takuya Oshima106, Thomas J. Pingel107, Thomas P. Robitaille108, Thomas Spura109,
Thouis R. Jones110, Tim Cera111, Tim Leslie15, Tiziano Zito112, Tom Krauss113, Utkarsh Upadhyay114,
Yaroslav O. Halchenko115 and Yoshiki Vázquez-Baeza116
40
International Centre for Theoretical Sciences, Tata Institute of Fundamental Research, Bengaluru, India. 41Department of Physics, Birla Institute
of Technology and Science, Pilani, India. 42Independent researcher, Milan, Italy. 43Institute of Environmental Physics, University of Bremen, Bremen,
Germany. 44Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, USA. 45Laboratoire Photonique, Numérique et
Nanosciences UMR 5298, Université de Bordeaux, Institut d’Optique Graduate School, CNRS, Talence, France. 46The University of Washington eScience
Institute, The University of Washington, Seattle, WA, USA. 47Federal Reserve Board of Governors, Washington, DC, USA. 48Datadog Inc., New York, NY,
USA. 49HQ, Orexplore, Stockholm, Sweden. 50Statistics Department, University of California - Davis, Davis, CA, USA. 51Emory University, Atlanta, GA,
USA. 52Applied BioMath, Concord, MA, USA. 53Department of Computer Science, University of Oxford, Oxford, UK. 54NeuroInformatics Laboratory,
Bruno Kessler Foundation, Trento, Italy. 55Independent researcher, Chicago, IL, USA. 56Department of Engineering, University of Cambridge, Cambridge,
UK. 57Aix Marseille Univ, CNRS, Centrale Marseille, LMA, Marseille, France. 58Interdisciplinary Center for Scientific Computing (IWR), Heidelberg
University, Heidelberg, Germany. 59ABB Corporate Research, ABB AG, Ladenburg, Germany. 60Institut für Mathematische Optimierung, Technische
Universität Carolo-Wilhelmina zu Braunschweig, Braunschweig, Germany. 61Independent researcher, Cologne, Germany. 62Lawrence Berkeley National
Laboratory, Berkeley, CA, USA. 63Institut für Physik, Universität Augsburg, Augsburg, Germany. 64Applied Research Laboratories, The University of Texas
at Austin, Austin, TX, USA. 65Department of Radiology, School of Medicine, University of Cincinnati, Cincinnati, OH, USA. 66Department of Radiology,
Cincinnati Children’s Hospital Medical Center, Cincinnati, OH, USA. 67Ascent Robotics Inc., Tokyo, Japan. 68ENSTA Bretagne, Brest, France. 69Faculty of
Physics, Ludwig-Maximilians-Universität, München, Germany. 70Excellence Cluster Universe, München, Germany. 71Independent researcher, Malden,
Massachusetts, USA. 72Data Sciences, Chan Zuckerberg Biohub, San Francisco, CA, USA. 73Faculty of Mechanical Engineering, University of Ljubljana,
Ljubljana, Slovenia. 74Sydney Informatics Hub, The University of Sydney, Camperdown, NSW, Australia. 75Instituto de Astrofísica, Pontificia Universidad
Católica de Chile, Santiago, Chile. 76Max Planck Institute for Extraterrestrial Physics, Garching, Germany. 77University of Stuttgart, Machine Learning
and Robotics Lab, Stuttgart, Germany. 78Department of Electrical Engineering, Universidade Federal de Campina Grande, Campina Grande, Brazil.
79
Department of Computer Science, Kiel University, Kiel, Germany. 80Planetary Sciences Group and Florida Space Institute and Department of Physics,
University of Central Florida, Orlando, FL, USA. 81Independent researcher, Madrid, Spain. 82Monash Micro Imaging, Monash University, Clayton, VIC,
Australia. 83Department of Molecular, Cellular, and Developmental Biology, University of Colorado, Boulder, Boulder, CO, USA. 84Department of Physics
and Astronomy, Johns Hopkins University, Baltimore, MD, USA. 85Independent researcher, Munich, Germany. 86Center for Advanced Radiation Sources,
The University of Chicago, Chicago, IL, USA. 87University of Tübingen, Tübingen, Germany. 88Independent researcher, Rugby, UK. 89Two Sigma Investments,
New York, NY, USA. 90Department of Mechanical Engineering, Technical University of Munich, Garching, Germany. 91Independent researcher, Berkeley,
CA, USA. 92Independent researcher, London, UK. 93Independent researcher, Berlin, Germany. 94Intel Corp., Austin, TX, USA. 95Independent Researcher,
Horten, Norway. 96Independent researcher, Daly City, CA, USA. 97D. E. Shaw Research, New York, NY, USA. 98Division of Computational Systems Biology,
Department of Microbiology and Ecosystem Science, University of Vienna, Vienna, Austria. 99Independent researcher, Melbourne, Australia. 100School of
Physics and Astronomy, University of Manchester, Manchester, UK. 101Electrical and Computer Engineering, University of Wisconsin–Madison, Madison,
WI, USA. 102Dipartimento di Informatica, Università degli Studi di Milano, Milan, Italy. 103Inter-University Centre for Astronomy and Astrophysics,
Ganeshkhind, Pune, India. 104Kavli Institute for the Physics and Mathematics of the Universe, Kashiwa-shi, Japan. 105Waymo LLC, Mountain View, CA,
USA. 106Faculty of Engineering, Niigata University, Nishi-ku, Niigata, Japan. 107Virginia Polytechnic Institute and State University, Blacksburg, VA, USA.
108
Aperio Software, Headingley Enterprise and Arts Centre, Leeds, UK. 109Independent researcher, Duisburg, Germany. 110Broad Institute, Cambridge,
MA, USA. 111Independent researcher, Gainesville, FL, USA. 112Department of Psychology, Humboldt University of Berlin, Berlin, Germany. 113Epiq Solutions,
Schaumburg, IL, USA. 114Max Planck Institute for Software Systems, Kaiserslautern, Germany. 115Department of Psychology and Brain Sciences, Dartmouth
College, Hanover, NH, USA. 116Jacobs School of Engineering, University of California San Diego, La Jolla, CA, USA.

272 Nature Methods | VOL 17 | March 2020 | 261–272 | www.nature.com/naturemethods


12345656762589 56 5317425
!"#$"%&'()*+, 0&78k22=C&((l&.7&"#=;/70##/
-&('!#&(#./&'()*+,noDpDqonr
03&('0!&(4)5$$")%1 ' 22&  /
 ($2!6()!#'4$.$7$(/8()59()&(5!'.7$):;)$82!6$# ('4('84"$("4/&"#(&"!&"4/
$"!($"%:<8'()$"82&($""3&('0&4)!7$4$=>'()?08 &"#()@#$($&7A7$4/)497$(:

1(&($($4
<&77(&($($4&7&"&7/=4"8$2()&(()8775$"%$(2 &!"($"()8$%'7%"#=(&.77%"#=2&$"(B(=C()# 4($":
"D& "8$2#
s ;)B&4(&2!7 $E*F+8&4)B!$2"(&7%'!D4"#$($"=%$6"& &#$4("'2.&"#'"$(82&'2"(
s > (&(2"("5)()2&'2"(5(&9"82#$($"4(&2!7 5)()() &2 &2!75& 2&'#!&(#7/
s ;) (&($($4&7((*+'#>3G5)()()/&"H(5H$##
IFJKLMNOONFLPQRPRLRSNTJULVQLUQRMWXVQULRNJQJKLVKLFYOQZLUQRMWXVQLONWQLMNO[JQ\LPQMSFX]TQRLXFLPSQL^QPSNURLRQMPXNF_
s >#4$!($"8&7746&$&( ((#
s >#4$!($"8&"/& '2!($" 44($"='4)& ((8"2&7$(/&"#&#`'(2"(82'7($!742!&$"
s >8'77#4$!($"8() (&($($4&7!&&2($"47'#$"%4"(&7("#"4/*:%:2&"+().&$4($2&( *:%:% $"488$4$"(+
>3G6&$&($"*:%:(&"#&##6$&($"+& 4$&(#($2&( 8'"4(&$"(/*:%:4"8$#"4$"(6&7+
s <"'77)/!()$(($"%=()(((&($($4*:%:a=P=W+5$()4"8$#"4$"(6&7=884($E=#% 88#2&"#b6&7'"(#
cXdQLbLdYJTQRLYRLQ\YMPLdYJTQRLeSQFQdQWLRTXPYVJQ_
s <f&/$&"&"&7/$=$"82&($""()4)$48!$&"#C&964)&$"C"(&7 (($"%
s <)$&4)$4&7&"#42!7B#$%"=$#"($8$4&($"8()&!!!$&(7678((&"#8'77!($"%8'(42
s @($2&( 8884($E *:%:)"gU=A&"gW+=$"#$4&($"%)5()/54&74'7&(#
ITWLeQVLMNJJQMPXNFLNFLRPYPXRPXMRLhNWLVXNJNiXRPRLMNFPYXFRLYWPXMJQRLNFLOYFKLNhLPSQL[NXFPRLYVNdQ_
18(5&&"#4#
A7$4/$"82&($"&.'(&6&$7&.$7$(/842!'(4#
G&(&4774($" &-&6"&$%7&'.&7%4)$4#&(&5& 4774(#'$"%()7$"%'$(7$.&/=&"#$"!&($4'7&'$"%& !4$&78&('.&"4)8()$!`4(()&($!"7/
,)((!,DD%$()'.:42D%$()'.D7$"%'$(D!'77Dtnun
G&(&8<$%'q5& 4774(#'$"%()!/((H46&"#%46!&49&%vG495& '#(2&"&%4"(&$"?()B&4('44#
'#$!"7/&6&$7&.7,)((!,DD%$()'.:42D(/7`##/D4$!/H46H(&49
G&(&88$%'w5& 4774(#'$"%()>$!#x74$(/*&66$"o:t+!&49&%HH()$!"7/&6&$7&.7!$(/,)((!,DD
%$()'.:42D&$!#H674$(/D&6
G&(&&"&7/$ 3>
<2&"'4$!('($7$E$"%4'(2&7%$()2 8(5&()&(&4"(&7(()&4).'("(/(#4$.#$"!'.7$)#7$(&('=8(5&2'(.2&#&6&$7&.7(#$(D6$5:
j ("%7/"4'&%4##!$($"$"&422'"$(/!$(/*:%:k$(l'.+:1()3&('0&4)%'$#7$" 8'.2$(($"%4#? 8(5&88'()$"82&($":
G&(&
A7$4/$"82&($"&.'(&6&$7&.$7$(/8#&(&
>772&"'4$!(2'($"47'#&#&(&&6&$7&.$7$(/(&(2"(:;)$ (&(2"()'7#!6$#()8775$"%$"82&($"=5)&!!7$4&.7,
H>44 $"4#='"$m'$#"($8$=5.7$"98!'.7$47/&6&$7&.7#&(&(


H>7$(88$%' ()&()&6& 4$&(#&5#&(&


H>#4$!($"8&"/($4($" "#&(&&6&$7&.$7$(/
>7714$A/7$.&/'44#&"#2(#&(&%"&(#8()4'"(('#/&&6&$7&.7$"()14$A/k$(l'.!$(/=)((!,DD%$()'.:42D4$!/:12 '!!($"%
4#&"##&(&)&6&7." (#$"()!'.7$4!$($ 4$(#./()$2&"'4$!(:

0
<A7$&7#7H4((!)"4.$87$5(4  !  ($ " %

12345656762589 56 5317425
)&($().(8$(8/'&4):z8/'&"('=&#()&!!!$&( 4($" .82&9$"%/'74($":
-$8 4$"4 f)&6$'&7? 4$&74$"4 @47%$4&7=67'($"&/?"6$"2"(&74$"4
<&8"44!/8()#4'2"(5$()&774($"="&(':42D#4'2"(D"H!($"%H'22&/H87&(:!#8

->7$78('#$2'4$(#"$447"()(!'#$"/#  $% "
(6"5)"()#$47'$"%&($6:
1&2!7 $E {QRMWXVQLSNeLRYO[JQLRX|QLeYRLUQPQWOXFQU}LUQPYXJXFiLYFKLRPYPXRPXMYJLOQPSNURLTRQULPNL[WQUQPQWOXFQLRYO[JQLRX|QLI~LXhLFNLRYO[JQRX|QLMYJMTJYPXNFL
eYRL[QWhNWOQU}LUQRMWXVQLSNeLRYO[JQLRX|QRLeQWQLMSNRQFLYFUL[WNdXUQLYLWYPXNFYJQLhNWLeSKLPSQRQLRYO[JQLRX|QRLYWQLRThhXMXQFP_
G&(&B47'$" W{Q RMWXVQLYFKLUYPYLQ\MJTRXNFR_L€hLFNLUYPYLeQWQLQ\MJTUQULhWNOLPSQLYFYJKRQR}LRPYPQLRNLI~LXhLUYPYLeQWQLQ\MJTUQU}LUQRMWXVQLPSQLQ\MJTRXNFRLYFULPSQL
YPXNFYJQLVQSXFULPSQO}LXFUXMYPXFiLeSQPSQWLQ\MJTRXNFLMWXPQWXYLeQWQL[WQQRPYVJXRSQU_L
0!7$4&($" I~{QRMWXVQLPSQLOQYRTWQRLPYQFLPNLdQWXhKLPSQLWQ[WNUTMXVXJXPKLNhLPSQLQ\[QWXOQFPYJLhXFUXFiR_L€hLYJJLYPPQO[PRLYPLWQ[JXMYPXNFLeQWQLRTMMQRRhTJ}LMNFhXWOLPSXRL
LXhLPSQWQLYWQLYFKLhXFUXFiRLPSYPLeQWQLFNPLWQ[JXMYPQULNWLMYFFNPLVQLWQ[WNUTMQU}LFNPQLPSXRLYFULUQRMWXVQLeSK_
0&"#2$E&($" eQ
{QRMWXVQLSNeLRYO[JQR‚NWiYFXROR‚[YWPXMX[YFPRLeQWQLYJJNMYPQULXFPNLQ\[QWXOQFPYJLiWNT[R_L€hLYJJNMYPXNFLeYRLFNPLWYFUNO}LUQRMWXVQLSNeLMNdYWXYPQRL
WQLMNFPWNJJQULI~LXhLPSXRLXRLFNPLWQJQdYFPLPNLKNTWLRPTUK}LQ\[JYXFLeSK_
f7$"#$"% {QRMWXVQLeSQPSQWLPSQLXFdQRPXiYPNWRLeQWQLVJXFUQULPNLiWNT[LYJJNMYPXNFLUTWXFiLUYPYLMNJJQMPXNFLYFU‚NWLYFYJKRXR_L€hLVJXFUXFiLeYRLFNPL[NRRXVJQ}L
UQRMWXVQLeSKLI~LQ\[JYXFLeSKLVJXFUXFiLeYRLFNPLWQJQdYFPLPNLKNTWLRPTUK_

f>77(')#$&2'6$(#'$47&7"(?)!$4"($&67"5)4$"()"#4$47'$("'%#&($6/#
:
$%
"
1('#/#4$!($" ]ƒTWXYQFhJPKXLPUYQPRXdMQWXLVQ\Q[LPQSWQXOQ
LRPTUKLPK[QLXFMJTUXFiLeSQPSQWLUYPYLYWQL]TYFPXPYPXdQ}L]TYJXPYPXdQ}LNWLOX\QUOQPSNURL„Q_i_L]TYJXPYPXdQLMWNRRRQMPXNFYJ}L
FPYJ}LOX\QUOQPSNURLMYRQLRPTUK_L
0&4) &2!7 „†QP_YiP_QLYLPiSQQ}LLWRQQR\QYLYWFMUSLLXRFYUO[ JQL„Q_i_L‡YWdYWULTFXdQWRXPKLTFUQWiWYUTYPQR}LdXJJYiQWRLXFLWTWYJL€FUXYLYFUL[WNdXUQLWQJQdYFPLUQONiWY[SXMLXFhNWOYPXNFL
XMYPQLeSQPSQWLPSQLRYO[JQLXRLWQ[WQRQFPYPXdQ_LbWNdXUQLYLWYPXNFYJQLhNWLPSQLRPTUKLRYO[JQLMSNRQF_LaNWLRPTUXQRLXFdNJdXFiL
Q\XRPXFiLUYPYRQPR}L[JQYRQLUQRMWXVQLPSQLUYPYRQPLYFULRNTWMQ_
1&2!7$"% (&(%/ [{QWQRUMQWXPVQQWLOX
PSQLRYO[JXFiL[WNMQUTWQL„Q_i_LWYFUNO}LRFNeVYJJ}LRPWYPXhXQU}LMNFdQFXQFMQ_L{QRMWXVQLPSQLRPYPXRPXMYJLOQPSNURLPSYPLeQWQLTRQULPNL
FQLRYO[JQLRX|QLI~LXhLFNLRYO[JQRX|QLMYJMTJYPXNFLeYRL[QWhNWOQU}LUQRMWXVQLSNeLRYO[JQLRX|QRLeQWQLMSNRQFLYFUL[WNdXUQLYLWYPXNFYJQL
hNWLeSKLPSQRQLRYO[JQLRX|QRLYWQLRThhXMXQFP_LaNWL]TYJXPYPXdQLUYPY}L[JQYRQLXFUXMYPQLeSQPSQWLUYPYLRYPTWYPXNFLeYRLMNFRXUQWQU}LYFULeSYPLMWXPQWXYL
eQWQLTRQULPNLUQMXUQLPSYPLFNLhTWPSQWLRYO[JXFiLeYRLFQQUQU_
G&(&4774($" MbNWNO[ dXUQLUQPYXJRLYVNTPLPSQLUYPYLMNJJQMPXNFL[WNMQUTWQ}LXFMJTUXFiLPSQLXFRPWTOQFPRLNWLUQdXMQRLTRQULPNLWQMNWULPSQLUYPYL„Q_i_L[QFLYFUL[Y[QW}L
TPQW}LQKQLPWYMQW}LdXUQNLNWLYTUXNLQ]TX[OQFPLeSQPSQWLYFKNFQLeYRL[WQRQFPLVQRXUQRLPSQL[YWPXMX[YFP„RLYFULPSQLWQRQYWMSQW}LYFULeSQPSQWL
PSQLWQRQYWMSQWLeYRLVJXFULPNLQ\[QWXOQFPYJLMNFUXPXNFLYFU‚NWLPSQLRPTUKLSK[NPSQRXRLUTWXFiLUYPYLMNJJQMPXNF_
;$2$"% €FUXMYPQLPSQLRPYWPLYFULRPN[LUYPQRLNhLUYPYLMNJJQMPXNF_L€hLPSQWQLXRLYLiY[LVQPeQQFLMNJJQMPXNFL[QWXNUR}LRPYPQLPSQLUYPQRLhNWLQYMSLRYO[JQLMNSNWP_
G&(&B47'$" V€hQLFSNXFLUUYLPPSYQLeQ WQLQ\MJTUQULhWNOLPSQLYFYJKRQR}LRPYPQLRNLI~LXhLUYPYLeQWQLQ\MJTUQU}L[WNdXUQLPSQLQ\YMPLFTOVQWLNhLQ\MJTRXNFRLYFULPSQLWYPXNFYJQL
O}LXFUXMYPXFiLeSQPSQWLQ\MJTRXNFLMWXPQWXYLeQWQL[WQQRPYVJXRSQU_
3"H!&($4$!&($" [†PYYWPPQXMLSX[NYeLOYFKL[YWPXMX[YFPRLUWN[[QULNTP‚UQMJXFQUL[YWPXMX[YPXNFLYFULPSQLWQYRNF„RLiXdQFLI~L[WNdXUQLWQR[NFRQLWYPQLI~LRPYPQLPSYPLFNL
FPRLUWN[[QULNTP‚UQMJXFQUL[YWPXMX[YPXNF_
0&"#2$E&($" Y€hJLJ[NYMWYPPXMXNX[FYLeY
FPRLeQWQLFNPLYJJNMYPQULXFPNLQ\[QWXOQFPYJLiWNT[R}LRPYPQLRNLI~LUQRMWXVQLSNeL[YWPXMX[YFPRLeQWQLYJJNMYPQULPNLiWNT[R}LYFULXhL
RLFNPLWYFUNO}LUQRMWXVQLSNeLMNdYWXYPQRLeQWQLMNFPWNJJQU_

@>774('#$72'%$(4#$&477="(6)!7'($"$(6""5)&/? " 6 $
 " 2 "(
&74$
"4
 (
'#/#$%
"


"()#$47'$"%&($6:
1('#/#4$!($" ƒWXQhJKLUQRMWXVQLPSQLRPTUK_LaNWL]TYFPXPYPXdQLUYPYLXFMJTUQLPWQYPOQFPLhYMPNWRLYFULXFPQWYMPXNFR}LUQRXiFLRPWTMPTWQL„Q_i_LhYMPNWXYJ}LFQRPQU}L
SXQWYWMSXMYJ}LFYPTWQLYFULFTOVQWLNhLQ\[QWXOQFPYJLTFXPRLYFULWQ[JXMYPQR_

y
0&4) &2!7 {QRMWXVQLPSQLWQRQYWMSLRYO[JQL„Q_i_LYLiWNT[LNhLPYiiQULbYRRQWLUNOQRPXMTR}LYJJL†PQFNMQWQTRLPSTWVQWXLeXPSXFLIWiYFLbX[QL‹YMPTRLŒYPXNFYJL
^NFTOQFP}LYFUL[WNdXUQLYLWYPXNFYJQLhNWLPSQLRYO[JQLMSNXMQ_LSQFLWQJQdYFP}LUQRMWXVQLPSQLNWiYFXROLPY\Y}LRNTWMQ}LRQ\}LYiQLWYFiQLYFUL

12345656762589 56 5317
YFKLOYFX[TJYPXNFR_L†PYPQLeSYPL[N[TJYPXNFLPSQLRYO[JQLXRLOQYFPLPNLWQ[WQRQFPLeSQFLY[[JXMYVJQ_LaNWLRPTUXQRLXFdNJdXFiLQ\XRPXFiLUYPYRQPR}L
UQRMWXVQLPSQLUYPYLYFULXPRLRNTWMQ_
1&2!7$"% (&(%/ ŒN PQLPSQLRYO[JXFiL[WNMQUTWQ_L{QRMWXVQLPSQLRPYPXRPXMYJLOQPSNURLPSYPLeQWQLTRQULPNL[WQUQPQWOXFQLRYO[JQLRX|QLI~LXhLFNLRYO[JQRX|QL
MYJMTJYPXNFLeYRL[QWhNWOQU}LUQRMWXVQLSNeLRYO[JQLRX|QRLeQWQLMSNRQFLYFUL[WNdXUQLYLWYPXNFYJQLhNWLeSKLPSQRQLRYO[JQLRX|QRLYWQLRThhXMXQFP_
G&(&4774($" {QRMWXVQLPSQLUYPYLMNJJQMPXNFL[WNMQUTWQ}LXFMJTUXFiLeSNLWQMNWUQULPSQLUYPYLYFULSNe_
;$2$"%&"# !&($&74&7 €PFSUQXRMQYLMPQSLNPSXMQQLRR_PLY€hWLPPLSYQFWUQLLRXRPLNY[LiLUYY[PLQVRQLPNeQ
hLUYPYLMNJJQMPXNF}LFNPXFiLPSQLhWQ]TQFMKLYFUL[QWXNUXMXPKLNhLRYO[JXFiLYFUL[WNdXUXFiLYLWYPXNFYJQLhNWL
QFLMNJJQMPXNFL[QWXNUR}LRPYPQLPSQLUYPQRLhNWLQYMSLRYO[JQLMNSNWP_L†[QMXhKLPSQLR[YPXYJLRMYJQLhWNOLeSXMSL
PSQLUYPYLYWQLPYQF

7425
G&(&B47'$" €hLFNLUYPYLeQWQLQ\MJTUQULhWNOLPSQLYFYJKRQR}LRPYPQLRNLI~LXhLUYPYLeQWQLQ\MJTUQU}LUQRMWXVQLPSQLQ\MJTRXNFRLYFULPSQLWYPXNFYJQLVQSXFULPSQO}L
XFUXMYPXFiLeSQPSQWLQ\MJTRXNFLMWXPQWXYLeQWQL[WQQRPYVJXRSQU_
0!#'4$.$7$(/ {QRMWXVQLPSQLOQYRTWQRLPYQFLPNLdQWXhKLPSQLWQ[WNUTMXVXJXPKLNhLQ\[QWXOQFPYJLhXFUXFiR_LaNWLQYMSLQ\[QWXOQFP}LFNPQLeSQPSQWLYFKLYPPQO[PRLPNL
WQ[QYPLPSQLQ\[QWXOQFPLhYXJQULI~LRPYPQLPSYPLYJJLYPPQO[PRLPNLWQ[QYPLPSQLQ\[QWXOQFPLeQWQLRTMMQRRhTJ_
0&"#2$E&($" {QRMWXVQLSNeLRYO[JQR‚NWiYFXROR‚[YWPXMX[YFPRLeQWQLYJJNMYPQULXFPNLiWNT[R_L€hLYJJNMYPXNFLeYRLFNPLWYFUNO}LUQRMWXVQLSNeLMNdYWXYPQRLeQWQL
MNFPWNJJQU_L€hLPSXRLXRLFNPLWQJQdYFPLPNLKNTWLRPTUK}LQ\[JYXFLeSK_
f7$"#$"% {QRMWXVQLPSQLQ\PQFPLNhLVJXFUXFiLTRQULUTWXFiLUYPYLYM]TXRXPXNFLYFULYFYJKRXR_L€hLVJXFUXFiLeYRLFNPL[NRRXVJQ}LUQRMWXVQLeSKLI~LQ\[JYXFLeSKL
VJXFUXFiLeYRLFNPLWQJQdYFPLPNLKNTWLRPTUK_
G$#() ('#/$"6768$7#59‰ Š s 3

0jj!m'$$"8(2&$"($%8   !  4$8$42& (   $& 7 = / ( 2 & " #2


"82&'()&.'(2(/! 82&($&7=B!$2"(&7/(2 &"#2()# '#$
$"2&
( )  #
" "/('#$:l=$"#$4&(5)()&4)2&($&7=
/(22()#7$(#$
$76&"((/
 '('#/:z8/'&"('$
$8&7$($(2&!!7$ (/
 '&4)=&#()&!!!$&( 4($".8 74($"%&!":
C&($&7?B!$2"(&7/(2 C()#
"D& z"676#$
$"(
" ) ('#/ "D& z"676#$
$"(
" ) ('#/
s >"($.#$ s )zAHm
s @'9&/($44777$" s <754/(2(/
s A&7&"(7%/ s C0zH.&#"'$2&%$"%
s >"$2&7&"#()%&"$2
s l'2&"&4)!&($4$!&"(
s 7$"$4&7#&(&



You might also like