Professional Documents
Culture Documents
Author(s): P. H. A. Sneath
Source: Taxon, Vol. 25, No. 4 (Aug., 1976), pp. 437-450
Published by: International Association for Plant Taxonomy (IAPT)
Stable URL: http://www.jstor.org/stable/1220526 .
Accessed: 17/06/2014 21:55
Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at .
http://www.jstor.org/page/info/about/policies/terms.jsp
.
JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of
content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms
of scholarship. For more information about JSTOR, please contact support@jstor.org.
International Association for Plant Taxonomy (IAPT) is collaborating with JSTOR to digitize, preserve and
extend access to Taxon.
http://www.jstor.org
P. H. A. Sneath*
Summary
Phenetic analyses are valuable at all taxonomic ranks. A review of work at higher
ranks is presented, and certain difficulties are discussed: these are just as serious for
traditional studies, though often glossed over. Difficulties include determination of
homologies, incompleteness of data and shortage of constant characters. It is also
necessaryto employ similarity coefficients and cluster methods that are not too sensitive,
respectively, to effects of gross size and numbers of OTU's. The number of OTU's
required to represent a homogeneouscluster is apt to be underestimated:even with a
simplified model of phenetic variation this number should be at least io and preferably
25 or more. Some newer developments in numerical analysis of phylogeny are briefly
reviewed.
?
Sums of
WPGMA UPGMA Squares
a
0
d
10
A
Fig. i. Illustrationof the effect of numberof OTU's in a cluster upon different clustering
methods.
(A) The upper part of the figure shows the positions of OTU's a to g two dimensions.
The lower part shows the phenograms based on Euclidean distances resulting from
three methods of clustering. These are the Weighted Pair Group Method, with
arithmetic averages (WPGMA), the Unweighted Pair Group Method with arithmetic
averages (UPGMA), and Sums of Squares (in which the OTU's or clusters are joined
to maintain at a minimum the total of the within-group sums of squares). Information
Analysis is known to behave very similarly to Sums of Squares but is not readily
illustrated with two-dimensional data. All three phenograms show the same topology.
that minimize the sums of squared differences within clusters, behave similarly.
A geometric interpretation is that the distances (more strictly the squared
distances) are summed, rather than averaged as in the Average Link methods.
Indeed it is known that Information Analysis behaves much like sums of squares,
because of its relation to the chi-square distribution. Examples of distortion by
these cluster methods are seen in the studies of Clifford and Williams (1973) and
Tadauchi(I975)
The opposite behaviour is seen with the weighted group methods. Thus, in
contrast to the unweighted pair-group method using arithmetic averages
(UPGMA), the weighted group method, WPGMA, is not disturbed by a large
number of very similar OTU's within one of the clusters (Fig. I). This is because
Sums of
WPGMA UPGMA Squares
oa 0
5- 5
d d1
10 10
15 15
(B) OTU c has been replicated so that there are 6 identical OTU's at this position
on the two-dimensional diagram. The WPGMA phenogram is unchanged except for
the furcation carrying 6 members of phenon c. The UPGMA phenogram now shows a
united first with d, because the average distance between a and the phenon formed
by b plus six c's is slightly greater than the distance between a and d. The Sums of
Squares phenogram shows that a, b and d have been united before the six c's join,
because the sums of the squares of the cluster a, b, d is less than the sums of the
squares given by a cluster formed by b and 6 c's.
WPGMA does not, - as the various clusters join, - give as much weight to each
OTU in the dense cluster as it does to the OTU's of the sparse clusters. Despite
the fact that the cophenetic correlation is always higher with UPGMA than with
WPGMA (Sneath, 1969), this insensitivity to the number of OTU's is an advantage
that may be important for high level studies. It may be noted in passing that
Single Link clustering is also relatively insensitive to the number of OTU's,
except insofar as an increased number is likely to give a maximum similarity
between clusters that is slightly higher than the maximum when few OTU's are
present.
There have been several recent critiques of numerical phenetics (Johnson, I968;
Sneath, I97I; Moss and Hendrickson, I973; Sneath and Sokal, I973), so I shall
?
Sums of
Squares
0-
5-
d
10
15-
(C) The number of identical c's has been increased to 17. The WPGMA phenogram is
still unaffected by the numerous OTU's in phenon c. The UPGMA phenogram has
scarcely been further affected. The Sums of Squares phenogram, however, shows that
all the OTU's outside phenon c have been forced together into a group which c
joins at a low level. This is because the sums of squares of the group a b d e f g
is less than that of any group that contains 17 c's plus another OTU from a, b, d, e,
f or g.
References
ANDERSON, A. J. B. 1971 - Similaritymeasurefor mixed attributetypes. Nature
232: 416-417.
BARNETT,R. S. I974 - An application of numerical taxonomy to the classification of
the Nummulitidae(Foraminiferida). J. Paleontol.48: 1249-2163.
BAUM, B. R. I973 - The conceptof relevancein taxonomywith specialemphasison
automaticclassification.Taxon22: 329-332.
BIDAULT, M. 1968 - Essai taxonomieexperimental et numeriquesur Festuca ovina L.
s.l. dansle sud-estde la France.Rev. Cytol. Biol. Veg. 31: 2I7-356.
BOULTER,D. I974 - The evolutionof plant proteinswith specialreference to higher
plant cytochromes c. Curr.Adv. Plant Sci. 8: i-I6.
BOULTER, D., B. G. HASLET, D. PEACOCK, J. A. M. RAMSHAW and M. D. SCOWAN 1976 -
The chemistry,functionand evolutionof plastocyanin.In D. H. Northcote,(ed.)
Medicaland TechnicalPubl.Co., Lancaster.
PlantBiochemistry. in press
AUGUST 1976 447