Professional Documents
Culture Documents
Abstract
1. Introduction
A C E L S
English 2 0 1 25 3
German 15.5 1 4 9.5 1
[Haspelmath 14.5 0 4 11.5 1]
Swedish 19 4 3 4 1
French 20.5 2 0 7.5 1
Rumanian 24 1 0 3 2 þ1 unclear
Russian 23 0 5 0 3
Lithuanian 17.5 6 6 0.5 1
Greek, Modern 13.5 0 0 16.5 1
Armenian 16 8.5 5.5 0 1
Hindi-Urdu 7.5 14 7.5 2 0
Finnish 4.5 14 6.5 0.5 1.5 þ4 unclear
[Haspelmath 12 13.5 0.5 0.5 1.5 þ3 unclear]
Udmurt 10.5 12.5 4.5 2.5 1
Hungarian 7 9 12 0 3
Turkish 9 17.5 2.5 0 1
Mongolian 6 22 2 0 1
Georgian 9 4.5 15.5 0 2
Tsez 4 18 8 0 1
Lezgian 8 12 6 5 0
Arabic, Standard 19 7.5 1 1 1.5
Maltese 17 11 0 2 1
Hebrew 20.5 7.5 2 1 0
Japanese 3.5 5.5 20.5 0.5 1
Indonesian 0 14 17 0 0
Swahili 11 11 8 0 1
language tables for the languages he considers. The Swedish data are
given in Comrie (2005), the Tsez data in Comrie (2000), the Maltese
data in Table 1. I have made some small changes to Haspelmath’s analy-
sis of German, in particular the consideration of enden — be-enden to
constitute a causative pair, and of er-löschen — löschen as an anticausa-
tive pair. Changes made to the analysis of the Finnish data are more sub-
stantial, and are discussed in more detail in Comrie (2005); they hinge pri-
marily on the analysis of pairs like parant-a-a — parant-u-a ‘improve’,
which I treat as having an opposition between intransitive su‰x -a and
transitive su‰x -u (the final -a is the infinitive marker). Haspelmath’s
original statistics are given in square brackets in Table 2.
It will be noted that the profiles of individual languages can di¤er
quite considerably. Considering only languages where the most popular
type has more than twice as many members as the next most popular,
2. Markedness
data from the individual languages form part of the crosslinguistic data
pool that initially informed Haspelmath’s hypothesis. But nonetheless,
if the ordering is valid not only for the pooled data but also for each
individual language, or almost so, then the hypothesis is strengthened.
Table 3 provides the relevant information.
This table gives, separately for anticausatives and for causatives, and
for each language in Table 2, the number of verbs falling in the top half
(1–15) of the list of verbs as given in Table 1 versus the bottom half (16–
30) — item 31 is omitted since it usually involves suppletion.6 Since Indo-
nesian has no anticausatives, and English, Modern Greek, and Russian
no causatives, these combinations do not figure in Table 3. If our hypoth-
esis is correct, then the overall pattern should be that anticausatives have
fewer verbs in the top half of the verb list than in the bottom half, and
this is expressed in the right-hand column of Table 3 by dividing the num-
ber found in the top half of the verb list by the number found in the bot-
tom half. Conversely, causatives should have fewer verbs in the bottom
half of the verb list than in the top half, and this is expressed in the
right-hand column by dividing the number found in the bottom half of
the list by the number found in the top half. Doing the calculations this
way around almost avoids having to divide by zero. If the hypothesis
held in its strongest form, all the numbers in the right-hand column of
Table 3 would be less than zero.
Examination of the right-hand column shows that the hypothesis is al-
most borne out in its strongest form. Among the anticausatives, all lan-
guages but one fall in the range 0.00–0.80, while among the causatives,
all languages but two fall in the range 0.00–0.55. Two of the exceptions
involve very small numbers of instances, namely English with only two
anticausatives (both in the top half of the table, thus requiring division
by zero), and Swedish with only three causatives (one in the top half,
two in the bottom half ). For consistency, if one excludes these on the
basis of low numbers, one should also exclude confirmatory instances in-
volving three or fewer instances, namely causatives in French, German,
and Rumanian. The remaining exception is provided by causatives in
Swahili, with slightly more in the bottom half than in the top half (6
in the bottom versus 5.5 in the top, to give a ratio of 1.09), although
anticausatives in Swahili are well behaved (4 in the top versus 7 in the
bottom, to give a ratio of 0.57). Overall, the hypothesis is very strongly
confirmed.
Given that possibilities for future work have been introduced, it is
worth considering some further possibilities, in additional to tightening
up the statistical analysis. One could test the ordering by means of psy-
cholinguistic experiments, to see if it is indeed the case that subjects are
Distribution of anticausatives:
Tsez 0 4 0.00
Mongolian 0.5 5.5 0.09
Udmurt 1 9.5 0.11
Finnish 1 4.5 0.22
Arabic 3.5 13.5 0.26
Turkish 2 7 0.29
Lezgian 2 6 0.33
German 4 11.5 0.35
Hungarian 2 5 0.40
Japanese 1 2.5 0.40
Georgian 3 6 0.50
Greek, Modern 4.5 9 0.50
Lithuanian 6 11.5 0.52
Armenian 5.5 10.5 0.52
Russian 8 15 0.53
Maltese 6 11 0.55
Swahili 4 7 0.57
Hebrew 7.5 13 0.58
Swedish 7 12 0.58
French 8.5 12 0.71
Rumanian 10 14 0.71
Hindi–Urdu 4 5 0.80
English 2 0 —
Distribution of causatives:
Arabic 8.5 0 0.00
French 1 0 0.00
Lithuanian 6 0 0.00
Rumanian 1 0 0.00
Hebrew 6 0.5 0.08
Hindi–Urdu 9.5 2 0.21
Maltese 9 2 0.22
Udmurt 10 2.5 0.25
Georgian 3.5 1 0.29
Hungarian 7 2 0.29
Lezgian 9 3 0.33
Japanese 4 1.5 0.38
Finnish 10 4 0.40
Indonesian 8.5 3.5 0.41
Armenian 6 2.5 0.42
Turkish 11 5.5 0.50
Mongolian 14.5 7.5 0.52
Tsez 11 6 0.55
Swahili 5.5 6 1.09
Swedish 1 2 2.00
more likely to treat concepts with lower numbers in the list as more typi-
cally inchoative, those with higher numbers as more typically causative.
A di¤erent kind of test would involve searching for the relevant verbs in
a large corpus and seeing whether the hypothesized cognitive markedness
also has correlated frequency e¤ects, with the lower numbers occurring
more frequently as inchoatives, the higher numbers more frequently as
causatives.
In Japanese, it would also be possible to carry out such tests against a
more specific set of data. One of the striking characteristics of inchoative–
causative pairs in Japanese is that the su‰x -e is used with some verbs to
mark the inchoative, with other verbs to mark the causative. Comprehen-
sive lists of 36 pairs where -e marks the anticausative and 57 where it
marks the causative are given in Tables 4 and 5 respectively.
The lists are adapted from Jacobsen (1982: 197–199) and encompass
only verbs with the -e su‰x, whereas many verbs in Japanese, including
some in Haspelmath’s list, use other su‰xes to mark causative or anticau-
sative.7 In general, only those pairs are listed where the formal relation-
ship is absolutely transparent and the semantic relationship at least plau-
sible; the only exception to morphological transparency is the pair hairu
— ireru, where the intransitive member has an additional initial ha, his-
torically a separate morpheme.
With respect to the data in Tables 4 and 5, one can now ask the same
question as gave rise to the data in Table 3, namely, whether anticausa-
tive pairings tend to occur lower down in Table 1 and causatives higher
up. Unfortunately, the number of concepts in Tables 4 and 5 that corre-
spond closely to the verbal concepts identified by Haspelmath is small, so
that a detailed execution of the proposed task may not be particularly
valid. The fact that there are more verbs in Table 5 than in Table 4 might
suggest that causative pairings are more normal in Japanese than anticau-
sative pairings, a generalization that finds some support from the Japa-
nese entry in Table 2. If we start with the causative pairings of Table 5,
then my assessment is that two concepts from the top half are found
(‘sink’, ‘stop’) and three from the bottom half (‘connect’, ‘open’, ‘close’);
the prediction that the numbers should decline as we progress from lower
to higher decades is not borne out, but the number of items is small and
causative pairings do seem to be less marked in Japanese than anticausa-
tive pairings. Turning now to the anticausative pairings of Table 4, I find
three concepts from the top half (‘boil’, ‘melt’, ‘burn’) and four from
the bottom half (‘rock’, ‘open’, ‘break’, ‘split’); the prediction is that the
number of items should increase as we progress from lower to higher de-
cades, and this is rather weakly confirmed, in that the number in the third
decade is higher than in the lower decades. But probably one should
rather say that with so few items, the relevant hypotheses cannot really be
tested.
3. Diachronic stability
Table 5 (Continued )
The question that I wish to address in this section is whether there is any
principled basis to these di¤erences in terms of either the genealogical
a‰liation of the languages concerned or their geographical distribution.
This investigation is particularly important in light of claims made by
Nichols (1993 and elsewhere) about the historical stability of patterns of
inchoative–causative pairing in the lexicon.
In her work on this topic, Nichols notes that crosslinguistically there is
a preference for causative pairings across the languages of the world in
general, although much of Europe constitutes a significant exception,
with a preference for anticausative pairings. This feature of European
languages was already noted in Section 1. Many European languages, in
particular Indo-European languages, make widespread use of what is ety-
mologically a reflexive marker (e.g. French se, German sich, Russian -sja)
as a means of deriving intransitives from transitives, although other mor-
phological strategies are also found, such as the use of the middle voice in
Modern Greek. Before proceeding to a more detailed examination of the
data, it is necessary to clarify certain important points. First, the nature
of the morphological relationship between inchoative and causative mem-
bers of a pair is only one of a number of features taken into consideration
by Nichols in assessing relevant aspects of the typological profile of a lan-
guage, so the material to be presented is not an exhaustive confrontation
with her claims. Second, Nichols’ claims concerning diachronic stability
relate primarily to the anticausative and causative types, suggesting that
it requires a considerable time period for a language to develop either an
overwhelmingly anticausative or an overwhelmingly causative profile.
Changes leading towards at least some of the other types are less decisive.
In particular, it is always possible for a language to lose morphology, as
has happened in the case of English, with its striking and areally atypical
predominance of labile pairs.
Three sets of data from among those discussed in this article are rele-
vant to the question of diachronic stability. The first concerns a restricted
set of languages, namely the relationship between Standard Arabic and
Maltese, two closely related Semitic languages — indeed, from a purely
linguistic point of view Maltese can be considered a variety of vernacular
Arabic. The time depth separating the two varieties from their common
ancestor is just over a millennium, with the proviso that Standard Arabic
has been maintained artificially close to that ancestor while Maltese has
undergone substantial influence from Sicilian and more recently Standard
Italian, Romance varieties with the typical European propensity for anti-
causative pairings. Over this time period, however, Maltese and Standard
Arabic have diverged little from one another in terms of their transitivity
profiles, both having a preference for anticausative formations, these out-
numbering causative formations by about 2 to 1, with little more than a
token presence of the other types. The numbers are perhaps too small to
make any more specific claims, in addition to which slight di¤erences in
decisions as to which lexical item best corresponds to the relevant verbal
concept might have led to slight di¤erences in the overall statistics. None-
theless, it is worth noting that the slight increase in causative formations
in Maltese vis-à-vis Standard Arabic actually goes against the dominant
European pattern, to which Italian conforms.
One can, however, take the comparison of Standard Arabic and Mal-
tese somewhat further, by examining particular concepts in the list. It
turns out that for 23 of the 31 items, Maltese and Standard Arabic agree
in the assignment of that item to a particular type. (More specifically,
in nineteen cases the type assigned by me for Maltese is the only type
assigned by Haspelmath for Arabic; in four cases one of the two types
assigned by Haspelmath for Arabic). In ten cases, shown in Table 6, the
two languages agree even though etymologically unrelated items are
given in the source lists — although this number should perhaps be re-
duced to seven, since in three cases Haspelmath gives alternative lexical
choices, belonging to di¤erent types, for Standard Arabic. But at least
a high degree of overall diachronic stability, extending down even into
some of the details, is manifested by these two languages over a period
of about a millennium.
One item in the Maltese list of Table 1 is worthy of specific comment,
namely the one Italian/Sicilian loan żviluppa ‘develop’, which has the
same form as inchoative and as causative. In Italian, the two are, or
at least can be, distinguished as intransitive sviluppar-si and transitive
sviluppare, the former having the etymologically reflexive su‰x to form a
Table 6. Standard Arabic and Maltese pairs with the same type but di¤erent lexical choices
middle voice has also fallen out of use across most of the Indo-European
languages of Europe, surviving intact only in Modern Greek; in Ger-
manic, the middle voice was intact in Gothic, but is found only in fossil-
ized traces in the other oldest Germanic languages; it was alive in Latin
deponent verbs (though with a di¤erent formation), but does not survive
into the Romance languages; it is not found in Balto-Slavic. Rather, the
European languages, with few exceptions (such as English) have partici-
pated in what must have been an areally conditioned renewal of the se-
mantics of the middle voice by means of reflexive morphology. Only at
the periphery do we find languages like Modern Greek that have not lost
the inherited middle voice or English that have not developed a reflexive-
based anticausative formation. (In the case of English, the early loss of
the inherited reflexive, cognate with German sich, Danish sig, may have
been a contributory factor. Even in the earliest Old English the only sur-
viving descendant of the Common Germanic reflexive is the possessive
sin, and even this is only sporadically attested in verse.)
In the case of the Indo-European languages of Europe, then, we are
looking at a period of perhaps two millennia (more or less in the case
of individual languages and branches) in which an innovative system of
marking anticausatives arose, albeit at least in part as a replacement for
the earlier middle voice. The predominance of anticausative pairings in
European Indo-European languages nonetheless gives some indication of
the time span needed by a language to develop this crosslinguistically
marked pattern as its basic typological profile.
Third, we can turn to the Uralic languages that have been in close con-
tact with European languages, to see to what extent they have assimilated
to the predominantly anticausative pattern of their Indo-European neigh-
bors. The languages for which analyzed data are available from Haspel-
math (1993) are Finnish (in close contact with Balto-Slavic and Germanic
languages), Hungarian (in close contact with Slavic, Germanic, and Ro-
mance languages), and Udmurt (in close contact with Russian). I have ex-
amined this question in more detail elsewhere (Comrie 2005), and a brief
summary will be given here. These languages do indeed seem to occupy
an intermediate position between the globally preferred pattern of caus-
ative pairings and the locally preferred pattern of anticausative pairings
in Europe, although a detailed evaluation of this claim would require
comparison of other Uralic languages, in particular those whose contact
with European languages has been less long-lasting. Udmurt is a clear
case of a language with roughly equal numbers of causative and anti-
causative pairings, with a slight preference for the former in the con-
cepts selected by Haspelmath. By my reanalysis, Finnish is overwhelm-
ingly characterized by causative pairings, although Haspelmath’s original
4. Conclusions
Notes
References
Comrie, Bernard (2000). Tsez as an intransitive language: an aspect of the structure of the
Tsez verbal lexicon. In Slovo v tekste i v slovare: sbornik statej k semidesjatiletiju akade-
mika Ju. D. Apresjana, L. L. Iomdin and L. P. Krysin (eds.), 312–316. Moscow: Jazyki
russkoj kul’tury.
— (2005). La typologie des langues ouraliennes, comparées avec les autres grandes familles
d’Eurasie septentrionale. In Les langues ouraliennes aujoudh’hui, Jocelyne Fernandez-Vest
(ed.), 75–85. Paris: Librairie Honoré Champion.
Dryer, Matthew (1989). Large linguistic areas and languages sampling. Studies in Language
13, 257–292.
Haspelmath, Martin (1993). More on the typology of inchoative/causative verb alterna-
tions. In Causatives and Transitivity, Bernard Comrie and Maria Polinsky (eds.), 87–120.
Amsterdam: John Benjamins.
Jacobsen, Wesley (1982). Transitivity in the Japanese Verbal System. Bloomington: Indiana
University Linguistics Club.
Nedjalkov, Vladimir P. (1969). Nekotorye verojatnostnye universalii v glagol’nom slovo-
obrazovanii. In Jazykovye universalii i lingvističeskaja tipologija, Igor’ F. Vardul’ (ed.),
106–114. Moscow: Nauka.
Nichols, Johanna (1993). Transitive and causative in the Slavic lexicon: evidence from Rus-
sian. In Causatives and Transitivity, Bernard Comrie and Maria Polinsky (eds.), 69–86.
Amsterdam: John Benjamins.