You are on page 1of 1

EDITORIAL

Defining the scientific method


The rise of ‘omics’ methods and data-driven research presents new possibilities for discovery
but also stimulates disagreement over how science should be conducted and even how it
should be defined.
how Google Translate relies on only correlative analyses of
“Hypotheses Modern biological research methods are powerful tools in
documents on the internet.
biologists’ arsenal for investigating biology. But is the abil-
aren’t simply ity of these methods to amass extraordinary amounts of This generated quite a response from the scientific com-
data altering the nature of scientific inquiry? munity with California Institute of Technology physicist
useful tools in As schoolchildren we are taught that the scientific Sean Carroll arguing in Edge that “hypotheses aren’t simply
© 2009 Nature America, Inc. All rights reserved.

some potentially method involves a question and suggested explanation useful tools in some potentially outmoded vision of sci-
(hypothesis) based on observation, followed by the care- ence; they are the whole point. Theory is understanding,
outmoded vision ful design and execution of controlled experiments, and and understanding our world is what science is all about.”
finally validation, refinement or rejection of this hypoth- Is the generation of parts lists and correlations in the
of science; they esis. Developed by thinkers including Bacon, Descartes and absence of functional models science? Based on the often
are the whole Pierce, this methodology has been credited with much of accepted definition of the scientific method, the answer
science’s success. Modern philosophers such as Feyerabend would be a qualified no. But not everyone would agree.
point.” have argued that this is not how most science is conducted, Carroll’s colleague, David Goodstein, previously stated in
but up to now most modern scientists have subscribed to a Thesis article in Nature Physics that “science, it turns out,
Sean Carroll
the hypothesis-centric scientific method. is whatever scientists do.” A philosopher would find this
Scientists’ defense of this methodology has often been to be a circular and unfulfilling argument, but it is likely
vigorous, likely owing to the historic success of predictive that many biologists who are more interested in the prac-
“Science, it hypothesis-driven mechanistic theories in physics, the tical outcomes of their methods than their philosophical
underpinnings would agree with this sentiment.
turns out, dangers inherent in ‘fishing expeditions’ and the likeli-
But the rise of methodologies that generate massive
hood of false correlations based on data from improperly
is whatever designed experiments. For example, The Human Genome amounts of data does not dictate that biology should be data-
Project was considered by many at the time to be a serious driven. In a return to hypothesis-driven research, systems
scientists do.” break with the notion that proper biological research must biologists are attempting to use the same ‘omics’ methods
be hypothesis-driven. But the project proceeded because to generate data for use in quantitative biological mod-
David Goodstein
others successfully argued that it would yield information els. Hypotheses are needed before data collection because
vital for understanding human biology. model-driven quantitative analyses require rich dynamic
Methodological developments are now making it pos- data collected under defined conditions and stimuli.
sible to obtain massive amounts of ‘omics’ data on a variety So where does this leave us? It is likely that the high com-
of biological constituents. These immense datasets allow plexity of biology will actually make full biological under-
biologists to generate useful predictions (for example, standing by purely correlative analysis impossible. This
gene-finding and function or protein structure and func- method works for Google because language has simple
tion) using machine learning and statistics that do not rules and low complexity. Biology has neither constraint.
take into account the underlying mechanisms that dictate Correlations in large datasets may be able to provide some
design and function}considerations that would form the useful answers, but not all of them.
basis of a traditional hypothesis. But ‘omics’ data can provide information on the size
Now that the bias against data-driven investigation has and composition of biological entities and thus deter-
weakened, the desire to simplify ‘omics’ data reuse has led mine the boundaries of the problem at hand. Biologists
to the establishment of minimal information requirements can then proceed to investigate function using classical
for different types of primary data. The hope is that this hypothesis-driven experiments. It is still unclear whether
will allow new analyses and predictions using aggregated even this marriage of the two methods will deliver a com-
data from disparate experiments. plete understanding of biology, but it arguably has a better
Last summer, the editor-in-chief of Wired, Chris chance than either method on its own.
Anderson, went so far as to argue that biology is too com- Philosophers are free to argue whether one method is
plex for hypotheses and models, and that the classical sci- science and the other is not. Ultimately the public who
entific method is dead. Instead, he called for these methods funds the work and the biologists who conduct it want
to be replaced by powerful correlative analyses of massive results that will materially impact the quality of life regard-
amounts of data gathered by new technologies similar to less of what the method is called.

NATURE METHODS | VOL.6 NO.4 | APRIL 2009 | 237

You might also like