Professional Documents
Culture Documents
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
Oxford University Press and The British Society for the Philosophy of Science are collaborating with JSTOR to
digitize, preserve and extend access to The British Journal for the Philosophy of Science.
http://www.jstor.org
by ELIE ZAHAR
i Introduction
2 Adaptation of Theories to Facts
3 The Correspondence Principle
4 The Metaphysical Character of the Hard Core
5 Logic of Discovery
6 Examples from the History of Science
7 Conclusion
I INTRODUCTION
This paper can be viewed as an attempt to answer the following two
questions: is there in physical science such a thing as rational heuristics, and,
if so, how does it operate?
Until recently, it was very much part of the orthodoxy in philosophy of
science that there is no room for what might be called rational heuristics.
Popper, Reichenbach and other members of the Vienna Circle all agreed
that there is a sharp distinction between the 'context of discovery' and the
'context of justification'. Only the latter lies within the domain of meth-
odology, whose proper task is to evaluate theories supposed to be laid on the
table, i.e. supposed to have already been constructed. As for the context of
discovery, it belongs to the psychology of invention. Neither calling for, nor
even being susceptible of, any rational reconstruction, the process of
discovery allegedly rests on what Einstein called 'empathy with nature'.
Most philosophers have said little about how theories arrive 'on the table'
in the first place. An exception was Popper in a book entitled Die beiden
Grundprobleme der Erkenntnistheorie, which was written in 1930-1 but has
only recently been published ([ 1979]). Popper's account of the emergence of
new theories is linked to an overall Darwinian world-view: if new theories
turn out to explain certain facts, this is not the result of rational and
concerted efforts on the part of the scientists; it is an effect of the
superabundance of available theories. New hypotheses emerge like spon-
* This title is a direct allusion to Kuhn's
[1970o] paper. Though the solution offered in the
present article fundamentally differs from Kuhn's, there is no denying that his views,
particularly those expressed in his [1962], made a deep impression on Lakatos and on
Lakatos's students.
process of discovery is much more rational than it appears at first sight; that
it is neither inductive nor largely intuitive; that it does not belong to any kind
of limbo, but rests largely on deductive arguments from principles which
underlie not only science and deductive metaphysics but also everyday
decisions. The choice of consistent sets of such principles constitutes the
positive heuristics of research programmes. Thus I hope both to underline
the continuity between science and common sense knowledge and to remove
some of the mystery surrounding the notion of heuristics by showing that, at
any rate in some programmes, they operate purely deductively (at the meta-
level). Rather than try to provide a general a priori definition of heuristics, I
shall begin by examining some specific methods used in the construction of
theories with a view to extracting from them a more general structure. At the
same time I shall pursue a policy of attrition, gradually reducing the scope
and importance of the allegedly irrational aspects of the process of
discovery.
Arguing against strict positivism, Popper adds on his own behalf (p. 50):
Other observations are decisive, those which have not been used in setting up the
law. Natural laws are thus more than (or at any rate something different from) a mere
report of what one already knows.
Yet Popper is perfectly right in distinguishing between the context of
discovery and that of justification. In the methodological appraisal of
theories it is not the heuristic devices as such, but certain logical consider-
ations which play a significant role. In the considered examples it can be
mathematically proved that the presence of sufficiently many parameters
guarantees that a finite number of facts will necessarily be accommodated;
for there are at least as many unknowns as there are equations. Since a
mathematical theorem tells us nothing about physical reality, it follows that
the facts which help us towards constructing a theory cannot simultaneously
support it.
d
f =-(m)
dt
in order to derive
d
m-v
d my
dt I
--v2/C2
Thus he overthrew Newton's law of motion
d
= (mV),
dt
which had hitherto been regarded as the foundation stone of the whole of
physics, by a proof involving the Correspondence Principle applied to that
same law. The relation of Kepler to Newton, which is more complicated
than that of Newton to Planck, will be examined in detail below.
Through the Correspondence Principle and the method of directly
adapting theories to certain facts the quasi-inductive path of science loses its
miraculous character; it becomes an intended effect.
5 LOGIC OF DISCOVERY
Let me start with the simplest examples illustrating the principles men-
tioned in 5 above.
In Ptolemaic Astronomy each of the five planets performs a complex
movement, one of whose components simulates the revolution of the sun
about the earth. This is a huge cosmic accident, a series of coincidences
which are accepted without explanation. It can be mathematically proved
that relatively to the sun all five components would disappear at one stroke.
This, according to Dreyer ([90o6], ch. XIII), was a main starting point of
Copernican Astronomy. Whatever the case may be, it is true that
Copernican theory explains these coincidences by deriving them from a
single source, namely from the wrong, but perfectly natural, choice of the
earth as our primary frame of reference.
In classical mechanics inertia and weight are two fundamentally different
qualities; the first is the resistance of a given body to impressed force and the
second, its receptiveness to any gravitational field. Yet, as if by accident,
these two qualities are measured by the same scalar, namely the mass m.
Thus:
m = mi = m,; where mi = inertial mass and m. = gravitational mass.
Refusing to believe in a simple coincidence, Einstein proposed ([1916]) to
construct a theory which would explain the identity of the gravitational and
inertial masses, and which would thus unify gravitation and kinematics. In
view of the Correspondence Principle, he had to take account of the
empirical consequences of mi = m,, in particular of the law that in a
gravitational field all bodies experience the same acceleration irrespective of
their internal constitution. This law had been noted by Newton and had
subsequently been severely tested and corroborated.1 Hence, at the
observational level, the field is determined by the acceleration it generates.
Since acceleration is a kinematical, i.e. a geometrical, quantity, the simplest
solution consists in letting geometry annex gravity. I have already said in 4
above that annexation is the simplest form of unification. It is also clear that,
if annexation is to take place at all, then kinematics should annex gravitation
and not the other way round; for kinematics i.e. our space-time system, is the
most fundamental structure within physics. Einstein was also aware that, as
shown by Newton, the field must depend on the distribution of masses in the
Cf. Westfall [19711], p. 463.
modifying mechanics. Once again this was not the outcome of a sudden
inspiration. Einstein knew that all attempts at explaining electromagnetic
phenomena in mechanistic terms had failed; this was why Lorentz had tried
to carry out a reduction in the opposite direction. Thus if one of the two
theories, Newton's or Maxwell's, was to give way to the other, it would have
to be the former. Einstein consequently extended the Relativity Principle to
the whole of physics, whilst taking Maxwell's equations as his fixed point. In
this sense electromagnetism was annexing mechanics; annexation proved
once more to be the simplest means of unification.
To sum up: the invariance of c was the last step in a long deductive
argument involving philosophical principles, or meta-principles. The
argument was about the structure of theories as well as about certain empirical
facts; it is a second-order form of reasoning. I do not claim to have identified
all the premises used by Einstein. There are probably gaps in the argument,
which may have been bridged by considerations of simplicity or con-
venience. For example: Einstein could have modified both Newton's and
Maxwell's theories; however it was simpler to adhere to at least one of them.
All the same, Einstein did not experience a flash of intuition, but worked
through a series of deductive steps, making use of the principle of unity and
of the exclusion of all conspiracy theories.
I should finally like to consider one of the most controversial episodes in
the history of science: Newton's discovery of universal gravitation.
According to Max Born ([1964], Appendix i), Newton deduced his theory
from Kepler's laws. Duhem ([1906], ch. i i) and Popper ([1972], ch. 5)
correctly pointed out that these laws contradict universal gravitation, hence
cannot validly imply Newtonian Theory. By describing Newton's path to
discovery as comprising two distinct stages, I propose to show that Born's
views can be reconciled with Popper's.
The first stage is deductive in the ordinary sense of the word. It consists
on the one hand in deriving the second law of motion f = ma from the
Cartesian principle of inertia and, on the other hand, in extracting certain
logical consequences from Kepler's theory. One of Newton's most import-
ant discoveries is in fact a theorem of kinematics, i.e. of pure mathematics,
which naturally involves the differential calculus. Newton proved that
Kepler's laws entail every planet is subject to a central acceleration whose
magnitude is inversely proportional to the square of the distance between
the planet and the sun, where the proportionality factor is the same for all
planets. Conversely: every mobile P whose acceleration is directed towards
some fixed point S and is inversely proportional to SP2 describes a conic
section with one focus at S; moreover the radius vector connecting S to P
sweeps out at equal areas in equal intervals of time (Newton [i686], pp.
40-3, 56-61). Gravitational theory is wholly contained in these two
propositions without being logically implied by them. Let us note that
Newton formulates these two propositions in terms, not of central acceler-
ations, but of centripetal forces. This is because he makes use of the equation
f = mai, which I have so far not had to postulate. Mach wondered ([1883],
ch. II) why Newton put forward two laws, namely the principle of inertia
and the equation f = ma, of which the first is a trivial corollary of the
second. Yet Newton had good reasons to proceed as he did: he had borrowed
the law of inertia from Descartes and then used the principle of the
proportionality of cause to effect in order to derive, his second law of
motion.1 Physicists had always looked upon force as a dynamical cause; the
problem was to determine its precise mechanical effect. Aristotle had
directly connected force with velocity; but the law of inertia, which was used
by Galileo and then given its classical formulation by Descartes, asserts that
motion is a state of the mobile, hence requires no cause in order to persist.
Force was thus needed only in order to alter this state. Since force is an
instantaneous cause, such an alteration will be proportional to the cause
acting throughout an interval of time, i.e. to the product of this interval by
the force. By a convenient choice of units, the force will therefore be equal to
the rate of change of the quantity of motion, i.e. to the derivative of the
momentum. Thus:
- d
f dt
<(m)= ma.
ma
This explains why Newton chose to formulate two laws of motion, namely
the principle of inertia and the force law, of which the second is not
postulated, but derived from the first. In this way we obtain two schemes:
(c) [(Law of Inertia) & (Effect oc Cause)] - [f = -+ [(Kepler's Laws)
-+ (Centripetal force inversely proportional to themrn]
square of the distance
from the sun and directly proportional to the mass of the planet)]:
- [f = mi] -+ [(Centripetal force
(fi) [(Law of Inertia) & (Effect oc Cause)]
inversely proportional to the square of the distance from a fixed point S)
-+((Path is a conic section with one focus at S) & (Equal areas law holds
relatively to S))].
At this point we have to bring in Newton's third law which says that every
action provokes an equal and opposite reaction (where, in this context, both
'action' and 'reaction' mean 'force'). This principle has a double origin,
empirical and purely theoretical. On the one hand Newton conceived of
force in terms of pressure, tension, or muscular effort; i.e. in anthropo-
morphic terms. In statics every pressure is neutralised by an equal
counter-pressure. As for gravitational action, we know that Newton assimi-
lated it to a tension in a rope connecting two material bodies; since the
tension is the same all along the rope, the two bodies must attract each other
with equal and opposite forces. Gravity thus conforms to Newton's third
law. On the other hand, Newton was well aware that the third law flows
For a brilliant account of Newton's path to discovery, cf. Westfall [1971], chapters VII and
VIII.
P(m)
S(M)
Figure I
From (a) above Newton inferred that the force acting on P is constantly
directed towards G, hence also towards S, and that its magnitude is
proportional to I/GP2. Since
m+M
where M and m denote the masses of S and P respectively, the force to which
P is subject is inversely proportional to SP2 as well as to GP2. Similar
considerations apply to S because of the obvious symmetry between P
and S.
If, in accordance with (a) above, we suppose that the force at P is
proportional to m, then, by symmetry, i.e. by the principle of sufficient
reason, the force at S must be proportional to M; but, by virtue of Newton's
third law, these two forces have equal magnitudes; hence each one of them
must be proportional to mM/SP2. We have thus arrived at the law of
universal gravitation.
To sum up: the path leading to universal gravitation consists of two stages
of which the first is the more arduous and more important. In this first stage
Newton derived his laws of motion from the Cartesian law of inertia
together with the Principle of the Proportionality of cause to effect; he also
deduced an inverse square law for central accelerations from Kepler's laws.
It is important to note that from Galileo's or Kepler's theories we do not obtain even
the slightest hint of how these theories would have to be adjusted-what false
premises would have to be dropped, or what conditions stipulated--should we try to
proceed from these theories to another and more generally valid one such as
Newton's. Only after we are in possession of Newton's theory can we find out
whether, and in what sense, the older theories can be said to be approximations to it.
This passage, which underlines the continuity in Popper's thought from
1930 to at least 1972, brings out the stark conflict between his views on the
logic of discovery and Lakatos's.
(It should be conceded that Newton could have arrived at his inverse
square law by a different route from the one described in the Principia. It
was not unusual in the seventeenth century to assimilate the intensity of
gravity to that of light, which was known to obey an inverse square law.' As
was later 'proved' by Kant, one can use the Principle of Sufficient Reason in
order to show that a gravitational action which emanates from a unique body
in space and which propagates without leaving any remnants behind must
be uniformly distributed over the surface of a sphere; since the latter has an
area equal to 4rR2', the inverse square law follows. However, the symmetry
which is essential if this argument is to go through breaks down in the case of
a two-body system; unless of course one assumes that each of the two bodies
behaves as if the other were absent. Anyway, this second proof does not
invalidate the one presented by Newton in the Principia; he may well have
reached the same goal by two different routes.)
7 CONCLUSION
Science can adapt its theories to certain facts selected in advance. Such facts
may have refuted extant theories which scientists knowingly set out to
REFERENCES
BORN, M. [1964]: Natural Philosophy of Cause and Chance. New York: Dover Publications.
DREYER, J. L. E. [1960]: A History of Astronomy from Thales to Kepler. New York: Dover
Publications (2nd edition).
DUHEM, P. [ 1906]: La Theorie Physique: Son Object, sa Structure. Paris: Marcel Riviere.
EINSTEIN, A. [1907]: Uiber die vom Relativitditsprinzip geforderte Trigheit der Energie',
Annalen der Physik, 23, pp. 371-84.
EINSTEIN, A. [1916]: 'Grundlagen der allgemeinen Relativitditstheorie', Annalen der Physik,
49, PP- 768-822.
EINSTEIN, A. [1949]: 'Autobiographical Notes' in P. A. Schilpp (ed.): Albert Einstein:
Philosopher-Scientist, pp. 1-95.
HANSON, N. R. [1958]: Patterns of Discovery. Cambridge: Cambridge University Press.
HESSE, M. B. [1963]: Models and Analogies in Science. London and New York: Sheed and
Ward.
HOLTON, G. [1969]: 'Einstein, Michelson and the "Crucial" Experiment', Isis, 6o, pp. 133-97.
KUHN, T. S. [1962]: The Structure of Scientific Revolutions. Chicago: University of Chicago
Press.
KUHN, T. S. [1970]: 'Logic of Discovery or Psychology of Research?', in Lakatos and
Musgrave (eds.), Criticism and the Growth of Knowledge. Cambridge: Cambridge
University Press.
LAKATOS, I. [ 1978]: The Methodology of Scientific Research Programmes, Philosophical Papers,
Volume I (John Worrall and Gregory Currie (eds.)). Cambridge University Press.
LAKATOS, I. and MUSGRAVE, A. (eds.) [1970]: Criticism and the Growth of Knowledge.
Cambridge University Press.
LORENTZ, H. A. [1904]: 'Electromagnetic Phenomena in a System Moving with any Velocity
Smaller than that of Light', Proceedings of the Royal Academy of Amsterdam, 6, pp. 809-34.
LORENTZ, H. A. [1909]: The Theory of Electrons. New York: Dover Publications (2nd edition,
I952).
MACH, E. [1883]: The Science of Mechanics. Evanston: Illinois. Open Court (6th edition, 1960).
MEYERSON, E. [1907]: Identite et Realite. Paris: J. Vrin (5th edition, 1951).
NEWTON, I. [1686]: Principia, Volume I. University of California Press (English translation by
A. Motte, 1962).
PLANCK, M. [I906]: 'Das Prinzip der Relativitit und die Grundgleichungen der Mechanik',
Verhandlungen der deutschen physikalischen Gesellschaft, pp. 136-41.
POPPER, K. R. [1934]: Logik der Forschung. Tiibingen: Mohr (Siebeck).
POPPER, K. R. [1963]: Conjectures and Refutations. Henley, Oxon: Routledge and Kegan Paul.
POPPER, K. R. [1972]: Objective Knowledge. Oxford University Press.
POPPER, K. R. [1979]: Die beiden Grundprobleme der Erkenntnistheorie. Tiibingen: Mohr
(Siebeck).
REICHENBACH, H. [1958]: The Philosophy of Space and Time. New York: Dover Publications.
WESTFALL, R. S. [1971]: Force in Newton's Physics. London: Macdonald.