Professional Documents
Culture Documents
‘P explains Q’ , P ⊆ E, Q ∈ E,
or ‘P is contradictory’ , P ⊆ E,
or ‘E is a fact’ , E ∈ E.
A FUZZY MEASURE FOR EXPLANATORY COHERENCE 293
• An algorithm which translates the set of rules R to the weights ar1 ,...,rn
or wij and vi respectively.
A reason for choosing the form of V will be given later, because the
motivation is different from that of Thagard. He appeals to some special
intuitions about the coherence of propositions. Let us consider the special
case of relations between two propositions, in which I share his intuitions
about the coefficients in the function VT . If hypothesis H explains evid-
ence E, then H and E cohere. If H contradicts E, they incohere. The
weight wij between proposition i and j is set to a positive value if they co-
here, and to a negative value if they incohere. The coefficient vi represents
the empirical confidence in proposition i. If the third type of rule applies
to proposition i, i.e., if i is a fact, then vi > 0.
The extremal values of I represent the classical truth values ‘true’ and
‘false’. The following lemma states that we can restrict ourselves to them
without loss of generality. This means a clear-cut distinction can always be
made between the set of accepted and rejected propositions.
LEMMA 1.1. The function V (x) has at least one global maximum x =
(x1 , . . . , xn ) with xi ∈ {sup I, inf I }.
Proof. Let I = [a, b]. Since I n is compact and V is continuous, s =
supx∈I n V (x) exists and there is an x ∈ I n with V (x) = s. Let y 0 = x. For
i = 1, . . . , n let yji = yji−1 for j 6 = i. If yii−1 ∈ {a, b}, we let yii = yii−1 .
Since for each k we can write V (z) = zk · fk (z) + gk (z) for any z, where
fk and gk do not depend on zk , we set
a, fi (y i−1 ) < 0
yi =
i
b, else.
294 DANIEL SCHOCH
V (x) = V (y 0 ) ≤ V (y 1 ) ≤ · · · ≤ V (y n ).
where
wij , if i, j ≤ n,
wij0 = vi , if i ≤ n, j = n + 1,
0, otherwise.
The basic mechanism of a neural net is its update rule. The net starts
in an arbitrary state x(0). Then, in a fixed or randomly chosen sequence,
each neuron k is updated in accordance with
where
X
0
netk (t) = wkj · xj (t).
1≤j ≤n+1
The function netk (t) represents the input of neuron k, which leads to
a new activation potential xk (t + 1). There are several possible activation
functions fact ; they increase or decrease the activation of neuron k if the
sign of netk is positive or negative respectively. Effectively, the update
rule increases the value of (1.3) until a local maximum is reached. This
procedure often (but not necessarily) attains a global maximum.
The relation between two nodes of a neural net could always be con-
sidered symmetrical. There is no need for an extra symmetry principle as
stated by Thagard. This follows from the simple identity
X X symm symm 1
wij · xi xj = wij · xi xj , wij = wij + wj i .
1≤i6 =j ≤n 1≤i6 =j ≤n
2
under any permutation of the propositions, even if the roles of Q and P1 are
rather asymmetrical. Moreover, if one proposition is false (−1) while the
others were true (+1), we obtain the same amount of negative coherence
as when two propositions are false and the third is true in either case. But
for R2 , one would expect positive coherence if P2 and Q are true, since
Q is successfully explained by P2 . However, what beats everything is that
there is positive coherence if a false value has been assigned to all three
propositions!
There is no rule for dealing with contradictions between more than two
sentences in Thagard’s approach. It is too obvious that the inconsistence of
a set of propositions cannot be reduced to properties of pairs. There is no
way to explicitly deal with negations of propositions. If P is a proposition,
its negation must be implemented by a new proposition, say NP , and an
inhibitory link be established to P . For the system of rules R3 = {’{P , Q}
explains R’, ‘{P , NQ} explains NR’, ‘Q contradicts NQ’, ‘R contradicts
NR’}, the coherence function does not depend on the value of P , if Q and
NQ as well as R and NR have contrary truth values.
Another problem is directly connected with the connectionist
paradigma. Since Thagard chose I = [−1, 1] as the range of values, as is
usual for artificial neural nets, the coherence function (1.2) is symmetrical
under the negation of all values, VT (x) = VT (−x), if vi = 0 for all i̇. In
other words, if there is no confidence in a proposition besides that stem-
ming from coherence, the two sets of accepted and rejected propositions
may be interchanged. If, for example, P and Q are contradictory hypo-
theses, and exactly one can be accepted, then it is possible to accept either,
irrespective of how many ‘coherent’ links they have to other propositions.
A FUZZY MEASURE FOR EXPLANATORY COHERENCE 297
E = P ∪ {¬P |P ∈ P }.
Since we identify ¬¬P and P for each P , the set of signed propositions
is always closed under negation.
VPi (x1 , . . . , xn ) : = xi
V¬Pi (x1 , . . . , xn ) : = 1 − xQi
VP (x1 , . . . , xn ) : = cPP · P ∈P VPP
VhC,Ii (x1 , . . . , xn ) : = P ∈C VP − P ∈I VP
P1 · · · · · Pm · Q − P1 · · · · · Pm · (1 − Q) = P1 · · · · · Pm · (2Q − 1).
(2.2) P1 · · · · · Pm · (2Q − 1)
(2.3) −P1 · · · · · Pm
(2.4) P1 , . . . , Pm Q,
A FUZZY MEASURE FOR EXPLANATORY COHERENCE 301
(2.5) P1 , . . . , Pm ⊥.
If P2 and Q are both false, then VR2 = −2 · P1 . If they are both true,
then there is always positive coherence VR2 = 2 · P1 + 1 > 0, since Q is
successfully explained by P2 .
In Thagard’s theory, pairs of propositions which are connected by a
positive coherence link can nevertheless contribute with a negative value
to coherence if the elements have opposite truth values. This is not possible
in our approach. Propositions which appear as an explanandum in a rule
only contribute to the decision if other parts of the belief system lead
to a conflict, either by competing with it or by explaining the negation.
Otherwise, they can be accepted regardless of the truth values of the other
propositions. This is the statement of the following lemma.
P1 · · · · · Pm ≥ P1 · · · · · Pm · (2Q − 1).
0 ≥ −P1 · · · · · Pm · (1 − Q).
Figure 2. Unification.
isolated, it can be removed from the explanatory structure. For each com-
peting system of hypotheses, a negative term of the form (2.3) is added to
the coherence function.
The measure of coherence should be sensitive not only to the number
of explanatory relations, but also to the degree to which the subsystems
are interconnected. In Figure 2, R2 needs two distinct theories to explain
the two pieces of evidence, while R3 uses a stronger theory capable of
explaining both. We may also say that R3 is a unified theory. The trivial
way in R1 of defining such a theory, namely taking the conjunction of both
subtheories, is explicitly ruled out from being regarded as unification.
Theory unification is the most important form of scientific progress. It is
extensively discussed by Friedman et al. (1989) and Watkins (1984), and
further developed by Redhead (1989) and Bartelborth (1996). The core
idea states that the more non-trivial components of conjunction a theory
has, the less unified it is. The elements of the decomposition should have
disjoint theoretical terms. Otherwise the four Maxwell Equations are not
considered as a unified theory of electromagnetism, since their theoretical
terms, the components of the E- and B-field, are entangled. If such a
decomposition is possible, then each explanandum could be explained by
only one component.3 Therefore, either R1 reduces to R2 , or R1 contains
redundant elements in the explanans. We introduce the following concept
of an irreducible ‘proper’ explanation and define the weight factors only
for them.4
cP = f (NR (P )).
for n, m ≥ 1, or
n m
f (n + m) > · f (n) + · f (m).
n+m n+m
This is just the convexity of f !
We can obtain a stronger inequality by considering the following case:
P1 explains E1 , . . . , En , P2 explains E2 , . . . , En+1 and the unified theory
system Q explains all of them (E1 , . . . , En+1 ). Since Q explains all that
P1 and P2 does, we expect it to be superior to the combined system. It
follows
n
f (n + 1) > · 2 · f (n),
n+1
which is satisfied by f (n) = 2n . Therefore I propose the following formula
for the weight factor
(2.6) cP = 2NR (P ) .
3. EXAMPLES
[w :]P1 [. . . Pm ]− > Q,
[w :]P1 [. . . Pm ]− >
adds a term (2.3) to the coherence function. This corresponds to the nota-
tion (2.5), where the empty explanandum stands for the constantly false
value.
Data evidence is specified in the form ‘[w :]− > E’ for positive
and ‘[w :]E− >’ for negative convictions. Comments are preceded by
a double slash ‘//’.
A negated proposition is expressed by a leading exclamation mark ‘!’,
as in the well-known programming language C. Thus, the two rules ‘P
explains E’ and ‘Q explains ¬E’ are written as follows:
P- > E
Q - > !E
3.2. Lavoisier
The case of Lavoisier’s oxygen hypothesis (1783) against the phlogiston
theory is one of the most famous and successful applications of Thagard’s
program. It is interesting for our proposes since some of the explanatory
rules have multiple-proposition explanans. We adopt the reconstruction
and notation of Thagard (1989, 444). Although not all Lavoisier’s argu-
ments are contained by these propositions, they recapitulate the major
points. According to Zytkow, many improvements of the late phlogiston
theory after 1780 concerning E3-E7 are not incorporated.6
306 DANIEL SCHOCH
Oxygene hypotheses
OH1 Pure air contains oxygen principle.
OH2 Pure air contains matter of fire and heat.
OH3 During combustion, oxygen from the air combines with the burning
body.
OH4 Oxygen has weight.
OH5 During calcination, metals add oxygen to become calxes.
OH6 During reduction, oxygen is given off.
Phlogiston hypotheses
PH1 Combustible bodies contain phlogiston.
PH2 Combustible bodies contain matter of heat.
PH3 During combustion, phlogiston is given off.
PH4 Phlogiston can pass from one body to another.
PH5 Metals contain phlogiston.
PH6 During calcination, phlogiston is given off.
The interesting point about this example is that there are only two ana-
lytical contradictions. There is no need to implement the main hypotheses
OH1 and PH1 as competing.
ACKNOWLEDGEMENT
This paper was written within the research project “Explanatory Coher-
ence”, which is part of the research group “Kommunikatives Verstehen”
supported by the Deutsche Forschungsgemeinschaft (DFG).
NOTES
1 In the discussion part of Thagard’s article, Cohen also criticizes that it is impossible to
deal with conjunctions of hypotheses in his framework. Bereiter and Scardamalia mention
at the same location that a hypothesis can undergo positive activation if contradicting a
negatively activated proposition – a consequence of choosing the [−1, +1]-interval for
activation potentials. Thagard’s attempt to resolve the problem by raising the threshold
value of the associated neuron does not seem to be very coherent with his other rules (see
the reply part of Thagard (1989).
2 In sentential approaches to fuzzy logic, it is generally required that ‘x and x’ yields
the same fuzzy value as x. The multiplication function does not have the property of
idempotence, since x · x 6= x for 0 < x < 1. However, this is not a problem in our
approach. The representation of sentences by sets explicitly excludes the possibility that
a proposition appears more than once in a sentential expression. Moreover, according to
Definition 2.1, a proposition must not occur together with its contradiction.
The only idempotent fuzzy conjunction, the minimum function, does not satisfy the
principle of language independence. For example, take the positive data evidence E as the
only part of the explanatory structure, hC, Ii = h{E}, ∅i. Then for P , 1 − P < E < 1 we
find
REFERENCES
Department of Philosophy
University of Saarland
P.O. Box 15 11 50
D-66041 Saarbrücken
Germany
E-mail: d.schoch@mx.uni-saarland.de