You are on page 1of 13

Ecology, 85(3), 2004, pp.

834–846
q 2004 by the Ecological Society of America

CO-CORRESPONDENCE ANALYSIS: A NEW ORDINATION METHOD TO


RELATE TWO COMMUNITY COMPOSITIONS
CAJO J. F. TER BRAAK1,3 AND ANDRÉ P. SCHAFFERS2
1Biometris, Wageningen University and Research Centre, P. O. Box 100, 6700 AC Wageningen, The Netherlands
2Nature Conservation and Plant Ecology Group, Department of Environmental Sciences, Wageningen University and
Research Centre, Bornsesteeg 69, 6708 PD Wageningen, The Netherlands

Abstract. A new ordination method, called co-correspondence analysis, is developed


to relate two types of communities (e.g., a plant community and an animal community)
sampled at a common set of sites in a direct way. The method improves the simple, indirect
approach of applying correspondence analysis (reciprocal averaging) to the separate species
data sets and correlating the resulting ordination axes. Co-correspondence analysis maxi-
mizes the weighted covariance between weighted averaged species scores of one community
and weighted averaged species scores of the other community. It thus attempts to identify
the patterns that are common to both communities. Both a symmetric descriptive and an
asymmetric predictive form are developed. The symmetric form relates to co-inertia analysis
and the asymmetric, predictive form to partial least-squares regression. In two examples
the predictive power of co-correspondence analysis is compared with that of canonical
correspondence analyses on syntaxonomic and environmental data. In the first example,
carabid beetles in roadside verges are shown to be more closely related to plant species
composition than to vegetation structure (biomass, height, roughness, among others), and,
in the second example, bryophytes in spring meadows are shown to be more closely related
to the species composition of the vascular plants than to the measured water chemistry.
Key words: bryophytes; canonical correspondence analysis; carabid beetles; co-inertia analysis;
community data; correspondence analysis; gradient analysis; ordination; partial least-squares re-
gression; plant species composition; species–environment relationships.

INTRODUCTION dence analysis for community data showing unimodal


Are carabid beetles in roadside verges more closely responses (ter Braak 1986) and redundancy analysis
related to vegetation structure than to plant species for data with linear responses (Jongman et al. 1995,
composition or is the reverse true? To answer such a Legendre and Legendre 1998). These methods are part
question from field data, we need quantitative methods, of the computer program CANOCO (ter Braak and
one to predict the beetle community (that is, the in- Šmilauer 2002). Canonical correspondence analysis
(Table 1) is the method of choice (ter Braak and Pren-
cidences and abundances of each of the beetle species)
tice 1988, Jongman et al. 1995) when the community
from a set of variables characterizing vegetation struc-
data show unimodal responses, have a strong qualita-
ture (biomass, height, and roughness, among others)
tive (presence/absence) nature, and/or are sum con-
and another to predict the beetle community data from
strained (which means that the total abundance per site
the plant community data. (The word ‘‘community’’ is
is fixed or determined by the sampling method). This
used here to mean an assemblage of taxa.) The plant
method works to predict the beetle community from
community can be thought of as a bio-assay of the
the variables on the vegetation structure or the envi-
environment (Persson 1981, Prentice and Cramer
ronment but not to predict the beetle community from
1990). A related question is then: Is the beetle com-
the plant community data. The reasons are twofold: (1)
munity better predicted from the measured environ-
canonical correspondence analysis breaks down when
mental variables than from the plant community? The
the number of predictor variables, the individual plant
data on beetles, plants, vegetation structure, and en-
species in our case, is larger than the number of sites,
vironment that we consider are all from a common set
and (2) the method uses linear combinations—weight-
of sites.
ed sums (Jongman et al. 1995)—of the predictor var-
For relating a biological community to a set of (en-
iables, which does not work well when the community
vironmental) predictor variables, we have at our dis-
data in the predictor role show a unimodal structure, a
posal methods of multivariate direct gradient analysis
strong qualitative nature and/or are sum constrained,
(ter Braak and Prentice 1988): canonical correspon-
i.e., when the predictor community is better analyzed
by a correspondence analysis technique than by a linear
Manuscript received 9 January 2003; revised 18 April 2003;
accepted 5 May 2003; final version received 22 July 2003. Cor- technique such as principal components analysis (ter
responding Editor: N. C. Kenkel. Braak and Juggins 1993, ter Braak et al. 1993, Frisvad
3 E-mail: cajo.terBraak@wur.nl and Norsker 1996).
834
March 2004 CO-CORRESPONDENCE ANALYSIS 835

TABLE 1. Overview of gradient analysis methods based on weighted averaging.

Response
Method Abbreviation variables Predictors
Correspondence analysis CA Community data ···
Canonical correspondence analysis CCA Community data Environmental variables
CCA partial least squares CCA-PLS Community data Many environmental variables
Weighted averaging calibration WA Environmental variable Community data
WA partial least squares WA-PLS Environmental variable(s) Community data
Co-correspondence analysis CO-CA Community data Community data
Notes: The community data consist of incidences or abundances ($0) of a set of species at a set of sites. The environmental
variables, measured at the same set of sites, are quantitative and/or qualitative (0/1). The methods use weighted averages of
species scores, appropriate for unimodal data, and linear combinations of environmental variables, appropriate for linear data
(ter Braak and Prentice 1988).

A possible approach is to analyze the community we show how to relate two correspondence analysis
data sets separately by (detrended) correspondence tables in the co-inertia framework.
analysis (or another suitable indirect gradient analysis Partial least squares (PLS; Martens and Naes 1992)
method) and correlate the first few axes from these is a methodology for multivariate linear regression,
analyses by calculating pairwise correlations (Hájek et popular in chemometrics, that can handle large num-
al. 2002). This method is correlative instead of pre- bers of predictor variables without much loss of pre-
dictive. It can be made predictive by applying a (de- dictive power. As in co-inertia analysis, the ordination
trended) correspondence analysis solely to the com- axes of PLS maximize covariance, but the constraints
munity data in the predictor role and taking the first used in the maximization differ. We consider PLS the
few resulting axes as predictor variables in a canonical predictive counterpart of unweighted co-inertia anal-
correspondence analysis of the community data in the ysis (de Jong and ter Braak 1994). C. J. F. ter Braak
response role. This is a two-step (indirect) approach, and coworkers proposed PLS extensions of existing
which works fine if the major pattern of variation in ecological methods (Table 1). The PLS extension of
the predictor community is important for the response canonical correspondence analysis, called CCA-PLS,
community. This need not to be the case. We therefore can handle large numbers of predictor variables (ter
need a more generally applicable one-step (direct) ap- Braak and Verdonschot 1995). CCA-PLS maximizes
proach that integrates the two steps of the analysis. the covariance between weighted averaged species
With a one-step method, the most important relation- scores and linear combinations of the predictor vari-
ships are expressed in the first few axes so that one ables. Because CCA-PLS takes linear combinations of
can be sure one does not miss something important. the predictor variables, it is not the method we are
The new method needs to integrate two correspondence looking for. Weighted averaging calibration (Table 1)
analyses in a predictive fashion. Before we describe predicts an environmental variable from community
our new method, it is instructive to briefly discuss two data by averaging indicator values of the species pres-
related methodologies. ent at a site (e.g., Persson 1981, Jongman et al. 1995).
Co-inertia analysis (Dolédec and Chessel 1994) is a When the indicator values (the species scores of this
general eigenvector framework to relate two data sets method) for a particular environmental variable are un-
in a symmetric way, i.e., neither set takes the response known, they can be estimated from training data by
or predictor role. The ordination axes of co-inertia anal- weighted averaging regression (ter Braak and van Dam
ysis maximize covariance. The strong points of co- 1989, Birks et al. 1990, Fritz et al. 1991). The PLS
inertia analysis are that it can deal with large numbers extension of weighted averaging regression and cali-
of variables in either set and that it includes, by way bration (WA-PLS; ter Braak and Juggins 1993, ter
of preprocessing and row and column weighing, both Braak et al. 1993) uses ideas from PLS to estimate the
methods related to principal components analysis (lin- indicator values. WA-PLS is popular in palaeoecology
ear response) and methods related to correspondence for reconstructing palaeoenvironments from fossil as-
analysis (unimodal response). To the best of our knowl- semblages (Birks 1998). WA-PLS is called correspon-
edge, co-inertia analysis has, however, never been used dence analysis partial least squares by Frisvad and Nor-
to relate two correspondence analysis tables. The rea- sker (1996). The community data take the role of re-
son perhaps is that it is not immediately clear how to sponse variables in CCA-PLS and the role of predictor
carry out such a co-inertia analysis because each cor- variables in WA-PLS (Table 1). These methods can be
respondence analysis implies its own site weights (the applied by using existing computer programs for PLS
site’s total abundance of the species in the analysis) (Martens 1986, Wise and Gallagher 2000, Wold 2002)
whereas co-inertia analysis requires common site by preprocessing the community data before it enters
weights. In this paper we overcome this difficulty and PLS and by postprocessing the results of PLS (ter Braak
836 CAJO J. F. TER BRAAK AND ANDRÉ P. SCHAFFERS Ecology, Vol. 85, No. 3

et al. 1993, ter Braak and Verdonschot 1995). It can and K2 5 diag ({y21l}). To keep the formulae as simple
thus be expected that the new method proposed in this as possible, we preprocess each abundance table by
paper can formally be expressed as a PLS with partic- dividing its values by the grand total, so that y111 5 1
ular pre- and postprocessing steps. The pre- and post- and y211 5 1. This does not change the results of the
processing steps are, however, not simply those applied analyses in this paper.
to the community data in CCA-PLS and WA-PLS: we In addition, R0 will contain user-defined site weights
must first solve the problem of differences in implied in the combined analysis of Y1 and Y2. In predictive
site weights. co-correspondence analysis, a logical choice is to make
Here we propose a new ordination method, called R0 equal to R1, whereas in the symmetric analysis, one
co-correspondence analysis, to relate one community can make R0 5 (R1 1 R2)/2. These choices are justified
data set to another in a direct way (Table 1). Our start- in the Discussion. In the general case, R0 5 diag({ri0})
ing point is the reciprocal averaging approach to cor- with ri0 . 0 and r10 5 1. The results of the analysis
respondence analysis (Hill 1973, ter Braak and Prentice are sets of scores for species and sites. We denote the
1988). We resolve the problem of different sets of site vectors of species scores for the two sets by u1 and u2,
weights, noted above, by posing an explicit maximi- and the vectors of site scores derived from these by x1
zation criterion for co-correspondence analysis: the and x2. The superscript T denotes the transpose of a
(possibly weighted) covariance between weighted av- vector or matrix, Ip the p 3 p identity matrix and 1p a
eraged species scores of one community with weighted column vector of p ones.
averaged species scores of the other community. The Computations were carried out in MATLAB (MAT-
weighted averages in the maximization criterion re- LAB 2000) using the PLS Toolbox, version 2.1 (Wise
place the linear combinations used in co-inertia anal- and Gallagher 2000) and in Canoco for Windows 4.5
ysis, PLS, and principal component analysis. This re- (ter Braak and Šmilauer 2002). The extra MATLAB
placement makes the new method a correspondence functions for the methods presented in this paper and
analysis type of method, suited for unimodal data. We the example data are available in Ecological Archives
define a symmetric, descriptive form, which fits in the as Supplements 1, 2, and 3.
framework of co-inertia analysis, and an asymmetric,
predictive form, which is a weighted form of PLS. Pre- Co-correspondence analysis: definition
dictive co-correspondence analysis relates to corre-
spondence analysis just as nonweighted PLS relates to Correspondence analysis of community data is a
principal component analysis. method that assigns scores to species and sites that have
We demonstrate the use of co-correspondence anal- certain optimality properties. Weighted averaging is
ysis in two examples and compare its predictive power one of the key concepts herein (Jongman et al. 1995).
with that of canonical correspondence analysis on, When looking for optimal species scores, we derive
among others, environmental data. In the first example site scores from the species scores by the method of
we show that carabid beetles in roadside verges are weighted averaging. Correspondence analysis assigns
more closely related to plant species composition than species scores so as to maximize the variance of these
to vegetation structure or environmental data, and in site scores under the constraint that the assigned species
the second that bryophytes in spring meadows are more scores have unit variance. (The method is symmetrical
closely related to the species composition of the vas- in that species and site can be interchanged in the op-
cular plants than to the measured water chemistry. timization criterion). For mathematical reasons, the
variances are weighted with each site and each species
THEORY receiving a weight proportional to its abundance total.
Notation and computation When the species can be subdivided in two sets (for
example, beetles and plants) and the object is to relate
Let Y1 5 {y1ik}[i 5 1 . . . n; k 5 1 . . . p] and Y2 these two sets, there are other ways to assign species
5 {y2il}[i 5 1 . . . n; l 5 1 . . . q] be n 3 p and n 3 scores, simply because we can calculate two sets of
q matrices containing the abundances of each of p and site scores, one from each species set. Instead of aiming
q species of community 1 and 2 at each of n sites, at maximum variance we can now aim at maximum
respectively. In the theory and examples that follow covariance between the sets of site scores, as is done
later on, one data set (Y1) is designated as the com- in co-inertia analysis (Dolédec and Chessel 1994) and
munity in the ‘‘response role’’ and the other (Y2) as in PLS (Martens and Naes 1992, ter Braak and de Jong
the community in the ‘‘predictor role.’’ We then call 1998). More precisely, co-correspondence analysis seeks
the species in Y1 and Y2 response species and predictor vectors of species scores u1 and u2 that maximize
species, respectively. By denoting summation across
an index by a 1, the site (row) weights are y1i1 and x1T R0 x 2 with x1 5 R21 1 Y1 u1
y2i1 and the species weights are y11k and y21l . These
x 2 5 R22 1 Y2 u 2 (1)
weights are also collected in the diagonal matrices R1
5 diag ({y1i1}), R2 5 diag ({y2i1}), K1 5 diag ({y11k}) with R0 an n 3 n diagonal matrix with user-defined
March 2004 CO-CORRESPONDENCE ANALYSIS 837

weights for the sites, and subject to the constraints that function COCA-trans in Supplement 1 gives all details.
the species scores have zero mean and unit variance: The algorithm yields the scores for the first ordination
axis of co-correspondence analysis.
1pT K1 u1 5 0 1qT K 2 u 2 5 0
The transition equations can be condensed to an ei-
u1T K1 u1 5 1 u T2 K 2 u 2 5 1. (2) genvalue problem. Solving for the dominant eigenvalue
l yields the scores for the first ordination axis. The
T
In words, what is maximized (x 1 R 0 x2) is the co- maximum covariance in Eq. 1 is the square root of l.
variance between the two sets of site scores with com- In the next two subsections we consider two different
mon site weights; the covariance is maximized by find- ways of extracting further ordination axes. In the first
ing the appropriate vectors of species scores u1 and u2. way the analysis is fully symmetrical in Y1 and Y2,
Note, however, that what is maximized is formally not whereas it is asymmetric in the second, yielding sym-
a covariance, because it is not guaranteed that the site metric and predictive co-correspondence analysis, re-
scores x1 and x2 have zero mean. Yet, their mean will spectively. Despite the theoretical differences, the re-
be close to zero if R0 ø R1 ø R2, and if, for example, sults of two methods need not be very different in
R0 5 R1, then the R0-weighted mean of x1 is zero and practice (Burnham et al. 1996).
adding a constant to x2 would not change the criterion.
Eqs. 1 and 2 are chosen to give a close link with stan- Symmetric co-correspondence analysis
dard correspondence analysis. Specifically, if Y1 5 Y2 as a form of co-inertia analysis
and we choose R0 5 R1, then the Eqs. 1 and 2 specify
a correspondence analysis. By applying the Lagrange In symmetric co-correspondence analysis the eigen-
multiplier method (Magnus and Neudecker 1988) as in value problem of the previous subsection is solved not
ter Braak and de Jong (1998), the maximization prob- only for the first, dominant, eigenvalue, but also for
lem Eq. 1–Eq. 2 leads to the transition formulae of co- subdominant eigenvalues, giving rise to further ordi-
correspondence analysis (with l the Lagrange multi- nation axes. The resulting species scores of different
plier, which turns out to be an eigenvalue): axes are orthonormal, i.e., U T1 K1U1 5 IA and U T2 K2U2
5 IA with A axes arranged in columns (A 5 min(n, p,
l u1 5 K21 1 Y1T x*2 with x*2 5 R21 1 R0 x 2 (3) q) 2 1; see ter Braak and de Jong 1998). This analysis

Oy
p fits in the framework of co-inertia analysis (Dolédec
x1 5 R21 1 Y1 (u1 2 ū1 ) with ū1 5 y1211
1
11k u1 (4) and Chessel 1994) for relating two data tables, each
k51 represented by a statistical triplet consisting of the data
u 2 5 K22 1 Y2T x*1 with x*1 5 R22 1 R0 x1 (5) table itself and column and row weights. Together the
two statistical triplets define an eigenvalue problem.

Oy
q By straightforward (but tedious) algebra, it can be
x 2 5 R22 1 Y2 (u 2 2 ū 2 ) with ū 2 5 y2211
1
21l u 2 . (6) shown that symmetric co-correspondence analysis is
l51
the co-inertia analysis of the statistical triplets (Q1, K1,
The required optimal species scores are the centered R0) and (Q2, K2, R0) with
ones, u1 2 ū1 and u2 2 ū2. Apart from terms like
R21 1R0 which account for differences in weights among Q1 5 R21 1 Y1 K21 1 2 1n 1pT
the sets, all equations are weighted averages: Q 2 5 R22 1 Y2 K22 1 2 1n 1qT . (7)
1) the species scores of one set are obtained as Each Qs simply contains the residuals (o 2 e)/e with
weighted averages of the other set’s site scores o 5 the observed abundance and e 5 (row total 3
(Eqs. 3 and 5); column total)/(grand total), the expected abundance un-
2) the site scores are weighted averages of the spe- der row–column independence in the original abun-
cies scores of their own set, as required by Eq. 1. dance table Ys (s 5 1, 2), when treated as a contingency
By comparison, the transition formulae of canonical table. Therefore symmetric co-correspondence analysis
correspondence analysis (ter Braak 1986) mix weighted can be carried out by any computer program that can
averaging equations with equations from multiple re- perform co-inertia analysis, such as ADE-4 (Thiou-
gression. louse et al. 1995) as outlined in the Appendix. To the
As in correspondence analysis (Jongman et al. 1995) best of our knowledge, this form of co-inertia analysis
and canonical correspondence analysis (ter Braak has not been used before.
1986), the transition formulae can be solved by an it- For comparison, separate correspondence analyses
eration algorithm, which starts with arbitrary site are based on the statistical triplets (Q1, K1, R1) and (Q2,
scores and then applies the transition formulae in turn K2, R2). These triplets cannot be used in a co-inertia
(ignoring l). In contrast with the iteration algorithm of analysis because of the differences in site weights be-
canonical correspondence analysis (ter Braak 1986), tween the sets if R1 ± R2. The analysis of the previous
centering and standardization should be applied to each section (Theory: Co-correspondence analysis: defini-
set of species scores, and not to the site scores, as is tion) shows that the weighted averaging properties of
evident from Eq. 2 and Eqs. 4 and 6. The MATLAB correspondence analysis can be retained by replacing
838 CAJO J. F. TER BRAAK AND ANDRÉ P. SCHAFFERS Ecology, Vol. 85, No. 3

R1 and R2 by a single set of weights R 0, which can mathematical trick to carry out a weighted analysis by
be chosen by the user. This is not the first paper to a computer program that does not use weights (Seber
combine a double-centered table, such as Q 1 in Eq. 1977). The trick is to multiply each row of the data
7, with non-standard row and column weights. Thiou- matrix by the square root of the row weight and, sim-
louse et al. (1995) did so in their multivariate analysis ilarly, each column of the data matrix by the square
of local and global spatial patterns. Their analysis root of the column weight. By applying this trick we
derives from maximization of the local (or global) obtain that predictive co-correspondence analysis of Y1
spatial autocovariance, which, like our Eq. 1, is not with respect to Y2 is PLS of the response matrix Y with
a real covariance. respect to the predictor matrix X with
For further comparison, the simplest form of co-in-
Y 5 R1/2 1/2
0 Q1K1 X 5 R1/2 1/2
0 Q2K2 (8)
ertia, based on two principal components analyses, uses
the triplets (Y1, Ip, In) and (Y2, Iq, In) and equals in- in which Q1 and Q2 are derived from the original data
terbattery factor analysis (Tucker 1958, ter Braak Y1 and Y2 by Eq. 7. If R0 5 R1, then Y contains the
1990). It leads to transition formulae in which, com- chi-square residuals (o 2 e)/Ïe (Legendre and Legen-
pared to Eqs. 3, 4, 5, and 6, weighted averages are dre 1998). By applying noncentered SIMPLS to these
replaced by weighted sums (with centering of site Y and X we obtain, for axis a, species score vectors
scores rather than species scores). Also for comparison, denoted by qa and ra in de Jong (1993) and ter Braak
canonical correspondence analysis is formally the co- and de Jong (1998). The species scores u1a and u2a for
inertia of the triplets (Q1, K1, R1) and (Z, (Z T R1Z)21, axis a are obtained by the back-transformation
R1) with Z being the matrix containing the environ-
u1a 5 K121/2qa u2a 5 K221/2ra. (9)
mental data. To facilitate the comparison of the vari-
ance explained by co-correspondence analysis with that The first axis so obtained is the same as in co-inertia
of a canonical correspondence analysis of Y1 with re- analysis and maximizes Eq. 1 subject to Eq. 2. For axis
spect to Z we choose in the examples R0 5 R1, although a . 1, the additional constraint is that axis a, derived
this destroys, strictly speaking, the symmetry of the from Y2, is orthogonal to the previous axes, i.e.,
method. For further arguments on this choice, see the
x T2a R 0 x 2b 5 0 for a . b with
Discussion.
x 2a 5 R22 1 Y2 u 2a . (10)
Predictive co-correspondence analysis as a form of
partial least squares (PLS) This constraint makes the difference with symmetric
Partial least squares regression (PLS) works with co-correspondence analysis.
ordination axes (called components or latent variables Fitted values for the response variables are obtained
in this context) that maximize the covariance between in PLS by regression of Y on the A ordination axes t1,
linear combinations of response variables and of pre- . . . tA, derived from X (i.e., ta 5 Xra). The values so
dictor variables, subject to particular constraints (Mar- obtained (for instance Ŷ) must be back-transformed to
tens and Naes 1992). PLS differs from co-inertia anal- obtain fitted values for the original abundance data Y1.
ysis in two ways. First, PLS is less general than co- The back-transformation is
inertia analysis in that it does not use row and column ˆ 1 5 R1(R021/2YK
Y ˆ 21 1/2 1 1n1pT )K1. (11)
weights. Second, the constraints used in PLS for the
second and higher axes differ from those used in co- If for a new site, i, the abundances of predictor spe-
inertia analysis. Rather than requiring that the species cies are known (and the assigned weight is ri0), we can
scores be uncorrelated to those of previous axes, PLS similarly obtain predicted values for the response spe-
requires that the site scores derived from the predictor cies.
variables be uncorrelated with the previously derived For completeness we note that symmetric co-corre-
site scores. [There are actually two slightly different spondence analysis can be obtained by calculating the
versions of multivariate PLS, NIPALS and SIMPLS singular value decomposition of Y T X, with Y and X
(ter Braak and de Jong 1998), which need not concern from Eq. 8. The transformations of Eq. 9 have to be
us here.] This simple difference from co-inertia anal- applied to the left (qa) and right (ra) singular vectors
ysis makes PLS asymmetric in the two data sets; PLS to obtain the species scores u1a and u2a. The singular
is a true regression method, and its main use is pre- values are the square root of the eigenvalues.
dictive: to predict one set of variables from the other
Number of ordination axes and cross-validatory fit
set. In contrast, co-inertia analysis is symmetric in the
two sets. If q . n (more predictor species than sites, as is
The question arises whether we might find a way to common in ecological data sets), the response data can
define a predictive version of co-correspondence anal- be fitted without error by taking as many PLS axes as
ysis by placing it in the PLS framework instead of in there are sites, even if there is no relation between the
the co-inertia framework. For this we must take account two sets of data. Such fit has no predictive value. The
of the row and column weights. But there is a general number of axes to use is therefore an essential ingre-
March 2004 CO-CORRESPONDENCE ANALYSIS 839

dient of PLS: the selected number minimizes the pre- iables also orthonormal (ter Braak and de Jong 1998)
diction error as estimated by cross-validation methods by a singular value decomposition. The explained in-
(Martens and Naes 1992). We applied ‘‘leave-one-out’’ ertias of the axes so obtained were the same as those
cross-validation, that is, PLS is carried out n times, obtained with CANOCO (ter Braak and Šmilauer
leaving out in turn one of the sites and applying the 2002).
obtained PLS model to the omitted site to predict its
response species from the predictor species. The se- Ordination diagrams
lected number of axes is the number that minimizes In symmetric co-correspondence analysis, ordination
the sum of squared prediction errors. We report in the diagrams can be made in the usual way: by jointly
example the cross-validatory fit as 100(1 2 ssp a /ssp0) plotting the species scores and site scores (u1 with x1
with sspa the sum of squared prediction errors using a and u2 with x2) for the first A axes (typically A 5 2 for
PLS axes and ssp0 the sum of squared prediction errors easy visual inspection). As in correspondence analysis,
under the null model, in our case the row–column in- the interpretation of such diagrams can proceed by the
dependence model for Y1. With transformation (Eq. 8) centroid principle and the biplot rule (ter Braak and
and R0 5 R1, the sums of squares of the elements of Verdonschot 1995, ter Braak and Šmilauer 2002). Ac-
Y and Y 2 Ŷ are the total inertia and the residual inertia cording to Eqs. 1 and 2, sites are at the center of the
of Y1, respectively. The cross-validatory fit calculated species they contain; the diagram so conveys infor-
on the transformed data is thus in terms of inertia. It mation on the species they are likely to contain (the
can be negative when the model fits so badly that the centroid principle). The u scores and x scores together
null model predicts the data better. also form correspondence analysis biplots for Y1 and
There is a detail that deserves attention. When leav- Y2. The interpretation of such biplots is described in
ing out a site, the species totals and, thus, the weights full in ter Braak and Verdonschot (1995:273). In brief,
K1 and K2, change (the site weights change propor- the correspondence analysis biplot displays for each
tionally with the weights). In our examples we use these particular species the approximate share that species
modified weights to transform the data by Eqs. 7 and has in the abundance at each site, with share defined
8. The omitted site is given a weight proportional to as ysik / ysi1 (s 5 1, 2 indicating the community) and,
ri0 5 y1i1 in the complete data set. If a species occurs conversely, for each particular site the approximate
only once, it receives weight 0 in one of the analyses share that site has in the abundance of each species,
of n 2 1 sites. Without modification of the weights, with share defined as ysik / ys1k.
ssp0 is simply the total inertia of Y1, but with modifi- Symmetric co-correspondence analysis puts empha-
cation it is somewhat larger. sis on the association among species from different
An alternative approach to select the number of axes communities. The measure of association is the R0-
is to test the statistical significance of each ordination weighted covariance between species calculated on the
axis using permutation tests. By applying the permu- basis of Q matrices in Equation 7; it is a covariance
tations to the rows of Y from Eq. 8, for fixed X, we between relative abundances. For the optimal repre-
do not need to worry about differential weights, be- sentation of this association in a biplot, the u scores of
cause the rows and columns of these matrices have each axis must be multiplied by the quarter root of the
equal weight in the analysis. The test statistic we use eigenvalue of the axis (this result follows from the
is the F ratio based on the fit by the first axis to the above mentioned singular value decomposition; see
response data (ter Braak and Šmilauer 2002). The sec- also ter Braak 1990). This is important only when the
ond axis was tested by treating the first axis as co- multipliers differ strongly among axes. To retain the
variable (i.e., by analyzing residuals of Y and X after biplot interpretation for Y1 and Y2, the x scores must
fitting the first axis) so that the second axis of the be divided by the same multipliers, but we do not rec-
original data becomes the first axis of the residualized ommend to do so. The rescaled u scores and original
data, and so on for further axes. Strictly speaking, this x scores together form a Benzécri plot, an excellent
approach does not test the significance of SIMPLS compromise of conflicting aims in biplots (Gabriel
axes, but of NIPALS-PLS axes (ter Braak and de Jong 2002). In such a plot, distances among sites and among
1998). species represent chi-square distances and points of
In the examples we obtained the cross-validatory fit different items allow, for all practical purposes, a biplot
of canonical correspondence analysis and CCA-PLS by interpretation (Gabriel 2002).
using SIMPLS. The fit of CCA-PLS was obtained by Predictive co-correspondence analysis puts emphasis
transforming the species data Y1 as in Eq. 8 with R0 on the regression of transformed Y1 on transformed Y2,
5 R1 and centering and standardizing (autoscaling) the and thus on the matrix of fitted values Ŷ1 (Eq. 11) and
environmental variables in the R1 metric, followed by the rank A matrix of regression coefficients (ter Braak
premultiplication by the square root of R1 and sub- 1990). The fit of the response community to the pre-
mitting the transformed data to noncentered SIMPLS. dictor community (Ŷ1) can be displayed in a corre-
The fit of canonical correspondence analysis was ob- spondence analysis biplot of u1 and x2 (instead of with
tained by making the transformed environmental var- x1) and the matrix of regression coefficients by a biplot
840 CAJO J. F. TER BRAAK AND ANDRÉ P. SCHAFFERS Ecology, Vol. 85, No. 3

TABLE 2. Eigenvalues of the first three axes of separate CAs formed. The abundances of the plant species were on
and DCAs and of symmetric CO-CA of beetles and plants.
a 1–9 van der Maarel scale (Jongman et al. 1995).
Axis Both data sets are highly structured as judged from
the eigenvalues and lengths of gradient (Table 2) ob-
Factor Method 1 2 3
tained from separate ordinations by correspondence
Beetles CA 0.50 0.36 0.32 analysis (CA) and detrended correspondence analysis
DCA 0.50 0.32 0.21
Length of gradient 3.22 2.74 2.57 (DCA). Table 2 also shows the largest three eigenvalues
Plants CA 0.57 0.53 0.42 of symmetric co-correspondence analysis (CO-CA) but
DCA 0.57 0.41 0.27 these cannot be compared directly with those of CA
Length of gradient 3.44 2.99 2.88 and DCA (if Y1 5 Y2, for example, the CO-CA eigen-
Beetles–plants CO-CA 0.25 0.13 0.08
values are the square of the CA-eigenvalues; see also
Notes: The total inertia (sum of all eigenvalues of CA) is Eq. 12 in Dolédec and Chessel (1994)). The gain of
4.99 for beetles and 5.65 for plants. The sum of all eigen-
values of CO-CA is 0.94. directly relating beetles and plants by CO-CA over re-
lating them through two separate analyses is expressed
in terms of correlation coefficients in Table 3. The cor-
of u1 and u2 (see de Jong 1993:37). To infer also about relation between the axes of the separate analyses is
the predictor community (Y2) one should not use u2 high for the first axis (almost 0.9) but only moderate
with x2, but the loadings of the predictor species with (,0.6) for subsequent axes (Table 3). CO-CA finds an
x2 (the coefficients of the R0-weighted regression of Q2 even higher correlation on the first axis and correlations
on the site scores x2). Together with the u scores of the close to 0.90 for the second and third axes. High cor-
response species, the loadings of the predictor species relations on subsequent, unimportant axes may be
form a biplot of their association (the covariance based meaningless. Therefore a further comparison is made
on the Q matrices). In this biplot, distances among in terms of the variance in the beetle data that is ex-
species represent chi-square distances. For complete- plained by ordination axes that are derived from the
ness, we remark that in predictive co-correspondence plant data. When such axes are obtained by CA or DCA
analysis (and SIMPLS) the loadings of the response and two of them are used for prediction, the percentage
species (with respect to x2) are equal to their u scores, variance explained is ;15–16%, whereas the first two
and that the predictor site scores x2 are by default R0 plant-derived axes of CO-CA explain 19% (Table 3,
normalized. In the examples, with A 5 2 and large last column). CO-CA for relating beetles to plants thus
numbers of species, the differences between u scores achieves for these data a small improvement over the
and loadings of the predictor species were not very indirect method of relating the results of two separate
large (r . 0.97). analyses.
Since we made the distinction between u scores and Fig. 1 displays the ordination diagram of symmetric
loadings, it is instructive to note why the diagrams CO-CA. Selected beetle and plant species are displayed
suggested above for symmetric co-correspondence by their species scores (u1 and u2) (abbreviations are
analysis (u1 with x1 and u2 with x2) are biplots (Thiou- listed in Table 4). Site points are weighted averaged
louse et al. 1995): given the u scores, the x scores are species scores. The multipliers for the u scores to turn
optimal. For example, the scores x1 (Eq. 1) are equal Fig. 1 into a Benzécri plot are 0.7 and 0.6 for axes 1
to the regression coefficients of the K1-weighted re- and 2, respectively. For the optimal representation of
gression of QT1 on the species scores (u1) in the diagram, beetle–plant association the aspect ratio of Fig. 1
because the u scores are orthonormal in symmetric cor- (width to height) should thus be changed from 1:1 to
respondence analysis (and other forms of co-inertia 1:0.85 (5 0.6/0.7 5 the quarter root of the ratio of the
analysis). second to the first eigenvalue). This change does not
really influence the global interpretation of Fig. 1: bee-
TESTS ON REAL DATA tles and plants in corresponding positions with respect
Carabid beetles and vegetation in
Dutch roadside verges TABLE 3. Correlation coefficients between beetle-derived
Raemakers et al. (2001) studied the relation between and plant-derived site scores of the first three axes of sep-
arate CAs and DCAs and of symmetric CO-CA.
carabid beetles and vascular plants along roadside verg-
es in the Netherlands. Roadside verges contribute con- Axis
siderably to the amount of natural area in a highly Percentage
Method 1 2 3 fit†
cultivated country as the Netherlands. The ultimate aim
of the research is to develop management methods that CA 0.88 0.27 0.46 15
DCA 0.89 0.53 0.07 16
increase the ecological value of roadside verges. Here CO-CA 0.96 0.94 0.88 19
we analyze the ‘‘moist’’ subset of their data, compris-
Note: Site weights are beetle based.
ing 30 sites with 91 carabid beetle species and 173 † Percentage fit of the beetle data by the first two plant-
plant species. The beetle counts were log(y 1 1) trans- derived axes.
March 2004 CO-CORRESPONDENCE ANALYSIS 841

FIG. 1. Biplot based on symmetric co-correspondence analysis of carabid beetles (left: u1 and x1) and plants (right: u2
and x2) in roadside verges showing ;19% of the total variance of each data set. The species displayed (triangles) have a
more than average fit and occur five or more times in the data. Symbols of sites indicate their syntaxonomic unit. Scores of
species and sites are scaled according to Eqs. 1 and 2. The syntaxonomic units are (approximately from left to right): Molinio–
Arrhenatheretea/Koelerio–Corynephoretea (black rectangle), Tanaceto–Artemisietum/Arrhenatheretalia (squares), Arrhenath-
eretum medicaginetosum (gray circles), Arrhenatheretum elatiorus (stars), Galio–Alliarion/Arrhenatherion (gray rectangles),
Galio–Alliarion/Alopecurion (diamonds), Valeriano–Filipenduletum (open circles), Molinietalia (Calthion) (down triangles).
Abbreviations for the carabid beetle and plant species are given in Table 4.

TABLE 4. Carabid beetle and plant species abbreviations as used in Fig. 1.

Carabid beetles Plants


Abbreviation Species name Abbreviation Species name
ACUPPARV Acupalpus parvulus achimill Achillea millefolium
AMARAAEN Amara aenea agrocapi Agrostis capillaris
AMARABIF Amara bifida agrostol Agrostis stolonifera
AMARALUN Amara lunicollis alopprat Alopecurus pratensis
ANISOBIN Anisodactylus binotatus bellpere Bellis perennis
BEMBIPRO Bembidion properans bracruta Brachythecium rutabulum
BRADYHAR Bradycellus harpalinus callicus Calliergonella cuspidata
CALATFUS Calathus fuscipes cardprat Cardamine pratensis
CALATMEL Calathus melanocephalus cerafont Cerastium fontanum
CARABGRA Carabus granulatus festrubr Festuca rubra
CARABMON Carabus monilis glechede Glechoma hederacea
DYSCHGLO Dyschirius globosus heraspho Heracleum sphondylium
HARPAAFF Harpalus affinis holclana Holcus lanatus
HARPARUF Harpalus rufibarbis lotucorn Lotus corniculatus
LORICPIL Loricera pilicornis phalarun Phalaris arundinacea
PTEROANT Pterostichus anthracinus phraaust Phragmites australis
PTEROMEL Pterostichus melanarius planlanc Plantago lanceolata
PTEROMIN Pterostichus minor poa prat Poa pratensis
PTERONIG Pterostichus niger poa triv Poa trivialis
PTERONIH Pterostichus nigrita ranubulb Ranunculus bulbosus
PTEROSTR Pterostichus strenuus rhytsqua Rhytidiadelphus squarrosus
TRECH-SP Trechus species senejaco Senecio jacobea
tanavulg Tanacetum vulgare
trifdubi Trifolium dubium
842 CAJO J. F. TER BRAAK AND ANDRÉ P. SCHAFFERS Ecology, Vol. 85, No. 3

FIG. 2. Cross-validatory fit of the beetle data set against the number of ordination axes for different predictor data sets
(plants, closed squares; syntaxa, open squares; environment, triangles; vegetation structure, circles) and methods (predictive
CO-CA, solid line; CCA, long-dashed line; CCA-PLS, short-dashed line).

to the origin in each figure are positively associated, are low, but that is inherent to data that are largely
with stronger associations for species far from the or- qualitative (presence/absence), rather than quantitative.
igin. Symmetric and predictive CO-CA yield always CO-CA has a local maximum (7.8%) for two axes and
the same first axes. In this example, the second axes a global maximum (8.7%) at seven axes. We retain the
are also nearly the same (r 5 0.98). two-dimensional solution to keep the model as simple
Raemakers et al. (2001) also classified the vegetation as possible. When using CO-CA, the plants thus predict
samples in eight syntaxonomic units, characterized the 7.8% of the beetle inertia. With the optimal dimension
environment by 13 variables representing soil and mi- in brackets, the syntaxa predict 8.1% using CCA [2],
croclimate (acidity, moisture content in spring and the environmental variables predict 4.8% using CCA
summer, organic matter content, sandiness, availability [2] and 3.1% using CCA-PLS [1], whereas vegetation
of nitrate, ammonium and mineral N, soluble P and K, structure gives negative percentages and thus has no
degree of nitrification, exposure to sun, and a temper- predictive value for the beetle community using either
ature index) and characterized the vegetation structure CCA or CCA-PLS. In conclusion, the vegetation either
at each site by nine variables, including total biomass, expressed as abundances of individual species or as
maximum and mean vegetation height, and roughness. syntaxonomic units, predicted the beetle data better
Nearly all the quantitative variables were log trans- than the environmental measurements, and vegetation
formed to make their distributions less skewed. Sep- structure had no predictive value for the beetle data in
arate canonical correspondence analyses (CCA) of the this study.
beetle data with respect to these three data sets ex- The first two axes of CO-CA were significant (P ,
plained, in two dimensions, 19%, 18%, and 13% of the 0.001), whereas subsequent axes were not ( P . 0.60).
beetle variance, which can be compared with the 19% In Fig. 1, the horizontal axis is positively correlated
explained by CO-CA. In four dimensions these figures with the moisture content variables and organic matter
are 27%, 28%, 22%, and 28%, respectively, and in the (all r ø 0.6), the vertical is negatively correlated with
maximum dimension in each analysis 33%, 51%, 34%, acidity (r 5 20.43). The CCAs with respect to the
and 100%. Clearly, comparing these statistics has little syntaxonomic data and the environmental data also
meaning because the numbers of predictor variables showed two significant axes each (P 5 0.002 and 0.01
differ. To place the analyses on equal footing we ap- for the syntaxonomic data and P 5 0.001 and 0.038
plied ‘‘leave-one-out’’ cross-validation to determine for the environmental data), whereas further axes were
the optimal dimension for each (Fig. 2). In the cross- not (P . 0.08). With the structure variables none of
validation, we used predictive CO-CA. All percentages the CCA axes was significant (P . 0.30). The number
March 2004 CO-CORRESPONDENCE ANALYSIS 843

FIG. 3. Biplot based on predictive co-correspondence analysis of bryophytes (left) against vascular plants (right) in
Carpathian spring meadows, showing 30% and 21% of the total variance in the bryophyte data and vascular plant data,
respectively. The species (triangles) are positioned in the graph according to their loadings with respect to normalized site
scores (circles) derived from the vascular plants (x2). Only vascular plants with more than average fit are shown. Full names
of bryophytes and vascular plants are given in Hájek et al. (2002).

of significant axes thus agreed in this study with the analyses are strongly correlated (r ø 0.9) whereas the
optimal dimension chosen by ‘‘leave-one-out’’ cross- second axes are not (r ø 0.1) (Hájek et al. 2002). Cor-
validation. respondence analyses show clearly arched configura-
The use of individual plant species to predict the tions of species and site points for these data. Fig. 3,
beetle community avoids the sometimes debatable clas- based on predictive CO-CA of the bryophytes against
sification of sites into syntaxonomic units. But, because the vascular plants, illustrates that CO-CA also suffers
the analyses showed that there is not much to be gained from the arch effect. We have not yet implemented a
by using all plant species to predict the beetle com- detrended version.
munity, we may equally well conclude that it is suf- For predictive purposes, detrending may not be nec-
ficient in this case to summarize the plant species com- essary. Fig. 4 shows that the bryophytes are better pre-
position into syntaxomonic units, and also that it is dicted by the vascular plants (28% with five CO-CA
unlikely that a modification of the vegetation classifi- axes) than by the environmental variables (17% with
cation would result in a better fit to the beetle com- two CCA axes and 18% with four CCA-PLS axes). In
munity. terms of cross-validatory fit, the arched two-dimen-
sional CO-CA (Fig. 3) accounts already for 25% of the
Bryophytes and vascular plants in bryophyte inertia. Only these two axes are significant
Carpathian spring meadows (P 5 0.001), whereas subsequent axes are not ( P .
Hájek, Hekera, and Hájková (2002) explored data on 0.10). In the CCA, the first two axes are significant (P
the community composition of bryophytes and vascular 5 0.001 and 0.04), whereas the third is not ( P 5 0.52).
plants in 70 spring meadows in the Western Carpathians
and environmental correlates expressed as 14 water- DISCUSSION
chemical variables and one variable representing slope. No direct quantitative method existed so far for pre-
We log-transformed all concentration variables. The dicting one biological community from another. Co-
data are also analyzed in the second case study of Lepš correspondence analysis fills this gap. It fits in the
and Šmilauer (2003). For practical purposes, we limited weighted averaging family of methods for ecological
the data to species that occur five or more times, giving gradient analysis (Table 1; ter Braak and Prentice
30 bryophyte species and 123 vascular plants. The data 1988), of which weighted averaging, (detrended) cor-
contain a very strong first gradient as judged by DCA respondence analysis, and canonical correspondence
(5.2 SD for the bryophytes and 3.0 SD for the vascular analysis are the most widely applied (Birks et al. 1996).
plants), whereas the second gradient is ;2.5 SD for Within this family, it has a well-defined domain of
both data sets (using SD of species turnover as defined application; it is the direct ordination method for re-
by Hill and Gauch 1980). The first axes of the two lating one community data set to another (Table 1). It
844 CAJO J. F. TER BRAAK AND ANDRÉ P. SCHAFFERS Ecology, Vol. 85, No. 3

predictions than the second community’s data, as in our


examples.
Here we resolved the problem of differences in im-
plied site weights (see Introduction) by working from
first principles: the maximization of the weighted co-
variance between weighted averaged species scores.
This resolved the weight problem in a versatile way.
Rather than being implied, the weights for use in the
covariance (denoted by R0) can be chosen freely by the
user. As the choice influences the results of the analysis,
this brings up a new problem: Which weights to
choose? In the examples, we chose to use the implied
weights of the community in the response role (R0 5
R1) in order to facilitate the comparison with canonical
FIG. 4. Cross-validatory fit of the bryophyte data set
against the number of ordination axes for different predictor correspondence analysis. This is also the logical choice
data sets (higher plants, squares; environment, triangles) and in regression (Seber 1977)—and thus in predictive co-
methods (predictive CO-CA, solid line; CCA, long-dashed correspondence analysis and canonical correspondence
line; CCA-PLS, short-dashed line). analysis—when these site weights are indicative for
the precision of the abundance data of the response
species. This is typically the case for count data and
presence/absence (1/0) data with low incidence prob-
improves upon the indirect method of correlating the
ability (Jongman et al. 1995). With this choice, the
ordination axes of separate (detrended) correspondence
scores of the response species are (proportional to)
analyses of the data sets, the main improvement being
weighted averages of site scores derived from the pre-
that co-correspondence analysis can be used in a pre-
dictors (Eq. 3), precisely as in canonical correspon-
dictive way, as our examples show.
dence analysis (ter Braak 1986). This choice thus
Co-correspondence analysis combines the ecological
makes co-correspondence analysis closest to a (canon-
method of weighted averaging and the chemometric
ical) correspondence analysis of the response species.
method of partial least squares, and derives data ana-
In symmetric co-correspondence analysis, a symmetric
lytic strength from both. By maximizing the covariance
choice has perhaps more appeal, e.g., R0 5 (R1 1 R2)/
between the weighted averaged species scores of one
2. We propose the term co-correspondence analysis to
community with those of the other community, co-cor-
default to predictive co-correspondence analysis using
respondence analysis attempts to identify the ecolog-
SIMPLS and R0 5 R1.
ical gradients that are common to both communities.
Co-correspondence analysis inherits not only the
To further clarify the logic of the method, let us assume
good properties from correspondence analysis, but also
for a moment that these ecological gradients coincide the bad ones, most notably the arch effect (Jongman
with particular environmental variables. If measured, et al. 1995, Legendre and Legendre 1998). The arch
these could be used to predict one community (or both) effect—the effect that the second axis’ scores show a
through canonical correspondence analysis. But, what systematic, often parabolic, relation with the first axis’
if in a future application the environmental measure- scores (Fig. 3)—may hamper ecological understanding
ments were missing and we want to infer about the first of the underlying gradients. The proposed remedy, de-
community from the second one? The data on the sec- trending (Hill and Gauch 1980), can in principle be
ond community could then be used to infer about the applied in co-correspondence analysis, but we have not
environmental data—a calibration problem (ter Braak yet attempted to apply it. Detrending is no panacea
and Prentice 1988)—and the inferred environmental because it may flatten out some real variation (Minchin
data could then be entered to canonical correspondence 1987), and is therefore rather controversial (Legendre
analysis to predict the first community. Herein, the cal- and Legendre 1998). Moreover, detrending is not nec-
ibration problem could be solved with WA-PLS (Table essary for prediction purposes, as we showed in our
1). This method of predicting one community from second example.
another thus consists of two steps. Clearly, co-corre- The transformation in Eq. 8 is akin to one of the
spondence analysis is a one-step method: it integrates ecologically meaningful transformations for ordination
the calibration method of WA-PLS with the constrained of species data proposed by Legendre and Gallagher
ordination method of canonical correspondence anal- (2001). Their idea was to transform the community data
ysis. The second (predictor) community is used as a in such a way that the Euclidean distance between sites
multivariate bio-assay for the true underlying gradi- after transformation is identical to the chi-square dis-
ents. In practice, the environmental basis of the eco- tance between the sites before transformation. The
logical gradients is not precisely known, and that is transformation that achieves this is y* 2ik 5 y2ik /
why environmental measurements may yield worse y2i1Ïy21k). Pinel-Alloul et al. (1995) applied this trans-
March 2004 CO-CORRESPONDENCE ANALYSIS 845

formation to phytoplankton and fish data that were used Dolédec, S., and D. Chessel. 1994. Co-inertia analysis: an
as predictors in a canonical correspondence analysis of alternative method for studying species–environment re-
lationships. Freshwater Biology 31:277–294.
a zooplankton community. This analysis implies row Frisvad, J. C., and M. Norsker. 1996. Use of correspondence
weights {y11}. The resulting transformation is almost analysis partial least squares on linear and unimodal data.
identical to X (and also Y) in Eq. 8 with R0 5 R1 (there Journal of Chemometrics 10:677–685.
is a small difference in the implied centering of Y2). Fritz, S. C., S. Juggins, R. W. Battarbee, and D. R. Engstrom.
1991. Reconstruction of past changes in salinity and cli-
What makes their method really different from ours is mate using a diatom-based transfer function. Nature 352:
that Pinel-Alloul et al. (1995) used forward selection 706–708.
(instead of PLS) to overcome the problem of the num- Gabriel, K. R. 2002. Goodness of fit of biplots and corre-
ber of predictor species being greater than the number spondence analysis. Biometrika 89:423–436.
of sites. Selection of predictor species has four dis- Hájek, M., P. Hekera, and P. Hájková. 2002. Spring fen veg-
etation and water chemistry in the Western Carpathian
advantages: (1) it complicates the formal statistical flysch zone. Folia Geobotanica 37:205–224.
testing of the association between the communities, (2) Hill, M. O. 1973. Reciprocal averaging: an eigenvector meth-
it does not aim to preserve (chi-square) distances od of ordination. Journal of Ecology 61:237–249.
among sites in the predictor space, (3) it destroys the Hill, M. O., and H. G. Gauch. 1980. Detrended correspon-
dence analysis, an improved ordination technique. Vege-
logic of the original transformation because the total tatio 42:47–58.
abundance of the selected species is not equal to that Jongman, R. H. G., C. J. F. ter Braak, and O. F. R. van
of all predictors species, and (4) it does not use all the Tongeren. 1995. Data analysis in community and landscape
information available in the predictor set. In particular, ecology. Cambridge University Press, Cambridge, UK.
their method does not have the weighted averaging Legendre, P., and E. D. Gallagher. 2001. Ecologically mean-
ingful transformations for ordination of species data. Oe-
properties of our method (Eqs. 3, 4, 5, and 6). cologia 129:271–280.
Co-correspondence analysis may help in the search Legendre, L., and P. Legendre. 1998. Numerical ecology.
for good indicators for biodiversity (Noss 1990). Not Elsevier, Amsterdam, The Netherlands.
all species groups are equally easy to sample or iden- Lepš, J., and P. Šmilauer. 2003. Multivariate analysis of eco-
logical data using CANOCO. Cambridge University Press,
tify. One could try to predict a difficult species group
Cambridge, UK.
from an easy one. The building of the prediction model Magnus, J. R., and H. Neudecker. 1988. Matrix differential
with co-correspondence analysis requires representa- calculus with applications in statistics and econometrics.
tive training data for both species groups from a com- Wiley, New York, New York, USA.
mon set of sites, but from there only the easy group Martens, H. 1986. ‘‘Unscrambler’’ program for calibration.
CAMO, Trondheim, Norway. [Online: ^http://www.camo.
need to be sampled and identified. Co-correspondence no&.]
analysis may help in the search for the most suitable Martens, H., and T. Naes. 1992. Multivariate calibration. Wi-
indicators. ley, Chicester, UK.
MATLAB. 2000. Using MATLAB version 6. The Math
ACKNOWLEDGMENTS Works, Natrick, Massachusetss, USA. [Online: ^http://
www.mathworks.com&.]
We thank Dean Fairbanks and Karlè Sýkora for their ques-
Minchin, P. R. 1987. An evaluation of the relative robustness
tions that led to this research, Ivo Raemakers and Michal
of techniques for ecological ordination. Vegetatio 67:1167–
Hájek for permission to use their data, and Petr Šmilauer,
1179.
Hilko van der Voet, Theo Reijmers, John Birks, Richard Fur-
Noss, R. F. 1990. Indicators for monitoring biodiversity: a
nas, Pierre Legendre and the subject editor Norman Kenkel
hierarchical approach. Conservation Biology 4:355–364.
for comments on the manucript.
Persson, S. 1981. Ecological indicator values as an aid in the
interpretation of ordination diagrams. Journal of Ecology
LITERATURE CITED
69:71–84.
Birks, H. J. B. 1998. Numerical tools in palaeolimnology— Pinel-Alloul, B., T. Niyonsenga, and P. Legendre. 1995. Spa-
progress, potentialities, and problems. Journal of Paleolim- tial and environmental components of fresh-water zoo-
nology 20:307–332. plankton structure. Ecoscience 2:1–19.
Birks, H. J. B., J. M. Line, S. Juggins, A. C. Stevenson, and Prentice, H. C., and W. Cramer. 1990. The plant community
C. J. F. ter Braak. 1990. Diatoms and pH reconstruction. as a niche bioassay: environmental correlates of local var-
Philosophical Transactions of the Royal Society (London), iation in Gypsophila fastigiata. Journal of Ecology 78:313–
Series B 327:263–278. 325.
Birks, H. J. B., S. M. Peglar, and H. A. Austin. 1996. An Raemakers, I. P., A. P. Schaffers, K. V. Sýkora, and T. Hei-
annotated bibliography of canonical correspondence anal- jerman. 2001. The importance of plant communities in road
ysis and related constrained ordination methods 1986– verges as habitat for insects. Proceedings of the Section
1993. Abstracta Botanica 29:17–36. Exprimental and Applied Entomology of the Netherlands
Burnham, A. J., R. Viveros, and J. F. MacGregor. 1996. Entomological Society 12:101–106.
Frameworks for latent variable multivariate regression. Seber, G. A. F. 1977. Linear regression analysis. Wiley, New
Journal of Chemometrics 10:31–45. York, New York, USA.
de Jong, S. 1993. SIMPLS: an alternative approach to partial ter Braak, C. J. F. 1986. Canonical correspondence analysis:
least squares regression. Chemometrics and Intelligent a new eigenvector technique for multivariate direct gradient
Laboratory Systems 18:251–263. analysis. Ecology 67:1167–1179.
de Jong, S., and C. J. F. ter Braak. 1994. Comments on the ter Braak, C. J. F. 1990. Interpreting canonical correlation
PLS kernel algorithm. Journal of Chemometrics 8:169– analysis through biplots of structural correlations and
174. weights. Psychometrika 55:519–531.
846 CAJO J. F. TER BRAAK AND ANDRÉ P. SCHAFFERS Ecology, Vol. 85, No. 3

ter Braak, C. J. F., and S. de Jong. 1998. The objective func- ter Braak, C. J. F., and H. van Dam. 1989. Inferring pH from
tion of partial least squares regression. Journal of Che- diatoms: a comparison of old and new calibration methods.
mometrics 12:41–54. Hydrobiologia 178:209–223.
ter Braak, C. J. F., and S. Juggins. 1993. Weighted averaging ter Braak, C. J. F., and P. F. M. Verdonschot. 1995. Canonical
partial least squares regression (WA-PLS): an improved correspondence analysis and related multivariate methods
method for reconstructing environmental variables from in aquatic ecology. Aquatic Sciences 57:255–289.
species assemblages. Hydrobiologia 269:485–502. Thioulouse, J., D. Chessel, and S. Champely. 1995. Multi-
ter Braak, C. J. F., S. Juggins, H. J. B. Birks, and H. Van der variate analysis of spatial patterns: a unified approach to
Voet. 1993. Weighted averaging partial least squares re- local and global structures. Environmental and Ecological
gression (WA-PLS): definition and comparison with other Statistics 2:1–14.
methods for species–environment calibration. Pages 525– Thioulouse, J., S. Dolédec, D. Chessel, and J. M. Olivier.
560 in G. P. Patil and C. R. Rao, editors. Multivariate en- 1995. ADE-4 program library: multivariate analysis and
vironmental statistics. North-Holland, Amsterdam, The graphical display for environmental data. Université
Netherlands. Claude Bernard Lyon I, Villeurbanne, France. [Online:
ter Braak, C. J. F., and I. C. Prentice. 1988. A theory of ^http://pbil.univ-lyon1.fr/ADE-4/&.]
gradient analysis. Advances in Ecological Research 18: Tucker, L. R. 1958. An inter-battery method of factor anal-
271–317. ysis. Psychometrika 23:111–136.
ter Braak, C. J. F., and P. Šmilauer. 2002. Canoco reference Wise, B. M., and N. B. Gallagher. 2000. PLS Toolbox 2.1.
manual and CanoDraw for Windows user’s guide: software Eigenvector Research, Manson, Washington, USA. [On-
for canonical community ordination. Version 4.5. Micro- line: ^http://www.eigenvector.com&.]
computer Power, Ithaca, New York, USA. [Online: ^http:// Wold, S. 2002. SIMCA, version 10. Umetrics, Kinnelon, New
www.canoco.com&.] Jersey, USA. [Online: ^http://www.umetrics.com&.]

APPENDIX
How to carry out symmetric co-correspondence analysis in 4) Carry out two noncentered principal component analy-
the computer program ADE-4 for ecological data analysis ses via the ‘‘PCA’’ section, one on Q1 using row weights
(Thioulouse et al. 1995). R0 and column weights K1, and one on Q2 using row
1) Divide the data matrices by their overall totals, yielding weights R0 and column weights K2. In each analysis,
new data matrices Y1 and Y2. Calculate and save the save all axes with nonzero eigenvalues.
row totals (R1 and R2) and column totals (K1 and K2) 5) Carry out the co-inertia analysis via the ‘‘CoInertia’’
of Y1 and Y2. section. From this menu, first choose ‘‘Matching two
2) Decide on R0, for example, R0 5 (R1 1 R2)/2 or R0 5 statistical triplets’’ and specify as input files the two
R 1. output files with extension ‘‘.ncta’’ of step 3. This yields
3) Calculate the chi-square residuals as indicated in and an output file with extension ‘‘.iita’’. Second, choose
below Eq. 7 in the text, either in a spreadsheet or in ‘‘Coinertia analysis,’’ with the ‘‘.iita’’-file as input, to
ADE-4, yielding the data matrices Q1 and Q2. run the actual analysis.

SUPPLEMENT 1
MATLAB software for symmetric and predictive co-correspondence analysis and cross-validatory canonical correspondence
analysis is available in ESA’s Electronic Data Archive: Ecological Archives E085-022-S1.

SUPPLEMENT 2
Data on carabid beetles and vegetation in Dutch roadside verges is available in ESA’s Electronic Data Archive: Ecological
Archives E085-022-S2.

SUPPLEMENT 3
Data on bryophytes and vascular plants in Carpathian spring meadows is available in ESA’s Electronic Data Archive:
Ecological Archives E085-022-S3.

You might also like