Professional Documents
Culture Documents
Series Editor
Alexander Vovin (Ecole des Hautes Etudes en Sciences Sociales (EHESS), France)
Editorial Board
VOLUME 15
Edited by
LEIDEN | BOSTON
Cover illustration: “FLYING ‘WHITE’ SHAMAN IN ‘T’ LETTER SHAPE” by Ahmet Ali Aslan.
Reproduced with kind permission. According to Altaian belief, the soul is carried off by the spirits
eastward if the youth is destined to become a ‘White’ shaman. The flying shaman form, full of symbols
referring to the cycle of life (snake), and the divine (Upper, Middle, and Lower World, Black and White),
creates the letter “T” in the original alphabet used by Turkic peoples in their Orhun and Yenisei script.
Typeface for the Latin, Greek, and Cyrillic scripts: “Brill”. See and download: brill.com/brill-typeface.
issn 2452-2961
isbn 978-90-04-32564-7 (hardback)
isbn 978-90-04-32869-3 (e-book)
Brill has made all reasonable efforts to trace all rights holders to any copyrighted material used in this work.
In cases where these efforts have not been successful the publisher welcomes communications from
copyright holders, so that the appropriate acknowledgements can be made in future editions, and to settle
other permission matters.
Preface vii
Index 289
Preface
According to UNESCO, it is believed that at least half of the nearly 7,000 lan-
guages spoken around the world will cease to be used within the next 100 years.
If this issue is neglected, people will lose not only their cultural heritage but
also invaluable understandings about the history of all humankind. In other
words, with the disappearance of unwritten and undocumented languages,
humanity would lose not only cultural wealth but also important ancestral
knowledge embedded, in particular, in indigenous languages. As many of these
languages are no longer being handed down from generation to generation,
the number of the native speakers of these languages is decreasing, leaving
them endangered. For instance, the Ubykh language, one of the languages spo-
ken in the Northwest Caucasus, became extinct when the last speaker Tevfik
Esenç died in 1992. For such reasons, if efforts are not made to document the
speech and cultural practices of those who use these languages, many speech
forms will disappear along with the cultural heritage they embody.
Within the context above, the 1st International CUA Conference on
Endangered Languages was held by the Caucasus University Association (CUA)
on 13-16 October 2014, at Ardahan, Turkey, in collaboration with the Turkish
Language Society, Ardahan University and Harvard University. The goal of this
“open by invitation conference” was to bring together 35 prominent scholars
from all over the world who were actively engaged in research projects and
partnerships on different aspects of language endangered, documentation and
revitalization to create a forum to be able to discuss the problems of dying lan-
guages on both theoretical and practical levels. The motto of the Conference
was “before shooting stars vanish” in the sense that the CUA views each lan-
guage as a star in the sky that could turn into a meteor and believes that as
many of these languages should be documented and/or revitalised as possible
before they fade away. The regional focus of the Conference was global, with
a particular emphasis on the languages and cultures of the Caucasus and the
Asia. The aim was to inspire international discussion to a linguistic diverse and
unique corner of the globe, through which glocal scholars would be able to
benefit in their work. The conference program is available on the official web
site (https://www.ardahan.edu.tr/CUAConference2014/).
The region in which the 124 participant universities of the CUA are situated
is home to a rich linguistic diversity, much of it highly endangered, making it
particularly appropriate to situate the Conference at Ardahan.
The primary intent of the Conference was to offer the invited scholars the
opportunity to address relevant linguistic issues to articulate best practices
viii preface
in the field. Hence the participants addressed issues such as the state of the
field of language documentation, conservation and revitalization; experi-
ences that reflect on establishing research centres and international research
collaborations; or topics related to technology, data collection, archiving and
preservation.
The conference concluded with an action plan for which the ultimate goal
was to establish an “Endangered Languages Research Centre” at Ardahan. We
are very happy to announce that the Turkish Language Society kindly declared
to support this Centre financially.
All 35 participants were invited to submit their papers for this publica-
tion and 19 scholars submitted their work which represent the papers in this
volume.
We would like to thank all of the presenters in the panel sessions, who made
the conference not only interesting but provocative as well.
The Brill’s editors kindly accepted the current editor’s suggestion to publish
these papers in the form of a special volume.
We believe that the present volume forms a coherent collection to com-
plement the previously published volumes of the Global Oriental. Special
thanks go to our conference sponsors for their continued support: Turkish
Language Society, Kyrgz – Turkish Manas University and the Turkish National
Commission for UNESCO. Last but not least we wish to thank Smithsonion
Institution, Foundation for Endangered Languages, and Living Tongues
(Institute for Endangered Laguages) for their invaluable academic support.
Gregory D. S. Anderson
Introduction
Siberia – the very name can evoke a shudder. Why? True, it is the coldest
inhabited part of the earth so a shiver, yes, but why a shudder? This is due
to the fact that Siberia is known around the world and in Russia itself, where
it constitutes the vast majority of the land mass of that giant nation,1 as a
frozen wall-less prison. To be sure, Siberia served as a penal colony for the
Tsarist Imperial Russian leaders and this tradition was institutionalized with
murderous zeal by their successor Soviet hegemons. But this is only part of the
sad story of the consequences of Russian/Soviet colonialism on the diverse
native populations of Siberia. For the purposes of the present study I focus
only on the linguistic consequences of Russian imperialism and hegemony on
the Native Siberian peoples.
I start with a brief overview of the diverse Native Siberian groups as they
stood at the time of the initial colonialist expansion and exploitation in the
16th century in section 1. I turn in section 2 to an introduction to various pre-
Soviet phases of Russian colonialism and hegemony over Native Siberian
populations. In section 3, using post-Soviet census data, I discuss the issues
of language shift and ethnic shame that move ever forward among the Native
Siberian population groups in the post-Soviet colonial space. Finally in section
4 I present some structural linguistic consequences of Russian linguistic hege-
mony on the grammatical structures of the dwindling and receding languages
of the vast Siberian territory.
1 Indeed we should say the entire Asian portion of Russia in a traditional, non-administrative
understanding of the term Siberia.
Siberia at the time of contact was home to several dozen languages belonging
to a range of different language families. Moving west to east we find various
northern Samoyedic (Nenets, Enets, Nganasan) and Ob-Ugric (Khanty, Mansi)
speaking peoples in the western edge of Siberia between the Urals and the
Ob-Irtysh River complex. These people mainly pursued reindeer breeding in
the north and hunting and fishing economies in the southern parts of this
region. To their east were found a range of Siberian Turkic groups (Tuvan,
several Altai and Xakas groups, Shor, Chulym Turks, the Tofa and Siberian Tatars
groups) in the southern regions and southern Samoyedic (Selkup, Kamasian,
etc.) and Yeniseic groups, today represented only by the Ket and Yugh. The
Turkic speakers were largely pastoral nomads, but this mixed with hunting/
fishing or reindeer breeding in the northern mountainous and swampy regions
where these economic pursuits were more viable, while the Samoyedic and
Yeniseic peoples originally pursued hunter/fishing economies with some
limited reindeer-based economies, for example among the northern Selkup.
The Tungusic speaking groups occupied a vast territory stretching eastward
from central Siberia all the way to Sakhalin in the south, Kamchatka in the east,
and the Russian Arctic Far East in the north. These include such languages as
Evenki, Even, Negidal, Nanai, Udihe, Ulcha, Oroch and Orok. Tungusic peoples
were engaged in either hunting pursuits or reindeer economies, depending
on the environmental conditions. In the southern part of the western half of
this territory, Tungusic speakers were in contact with the pastoralist Buriat,
speakers of a Mongolic language, and their now extinct linguistic cousins
the Soyot. On Sakhalin and in the Amur river area, Tungusic speakers were
in contact with Nivkh (Gikyak) a riverine fishing-oriented people who speak
a language isolate. In the southern and central part of Kamchatka, Itelmen-
speaking people were found, and in the north Koryak who speak a Chukotko-
Kamchatkan language. In the northern part of the Tungus-speaking area were
found to the east the reindeer-breeding Yukaghiric-speaking peoples like the
Odul, Wadul, Chuvan and Omok, the latter two now extinct linguistically, and
the Omok ethnically as well too. To the east of the Yukaghiric peoples were
the reindeer-herding Chukchi, also of the Chukotko-Kamchatkan language
family (which also includes Chukchi’s sister languages Kerek and Al’utor),
while in the coastal parts of Chukotka, Chukchi-speaking people pursued sea-
mammal hunting oriented economies similar to the local Eskimoic-speaking
populations, the Sireniki, Naukan and Siberian Yupik. In short, Native Siberia
at the time of Russian contact was home to a vast array of different peoples
Consequences of Russian Linguistic Hegemony 3
All things being equal, minority population groups should grow steadily unless
some catastrophe has occurred like epidemic disease or war that has fractioned
the population. Indeed we can see this trend when looking at census returns
on ethnic affiliation for some (but not all) Native Siberian groups. If we look at
these trends as reported in six successive census returns from 1959-2010, four in
the Soviet period and two in the post-Soviet period, for groups like the Nenets,
Consequences of Russian Linguistic Hegemony 5
the Evenki, the Dolgan or the Yukaghir, everything appears normal in terms of
demographic growth.
For many other groups however, there are disturbing trends revealed, and
the number actually declines in many cases. What the demographic decline
numbers actually reveal is not loss of physical representatives/actual
population members, but rather the decline in the subjective evaluation
of being associated with that identity. This is a process called ethnic shame.
Thus, most Native Siberian groups have experienced, or are now undergoing,
rapid spread of ethnic shame and are hiding their identity in favor of reporting
as Russian. We can divide the Native Siberian groups in roughly four such
groups as they reflect somewhat different trends in the spread of ethnic
shame. Two groups from western Siberia, the Khanty and Mansi experienced
this only during the Soviet era between 1970 and 1979. This period marked
a massive upturn in the exploitation of the extractive oil and gas industries
on their territories. Since then, the groups that had self-identified as Mansi
has grown steadily while Khanty experienced an up-turn in ethnic identity
between 1989 and 2002, but then a more plausible increase between 2002
and 2010.
(2)
Four other Native Siberian groups began showing this spread of ethnic shame
already in the same period, but this has continued and/or accelerated. These
are the Oroch, Nganasan, Ket and Nivkh. The various fluctuating numbers show
local trends that distinguish each of these groups. Oroch began this decline in
ethinc identity and spread of ethnic shame in the period between 1979 and 1989,
the other three between 1970 and 1979. Nivkhs experienced normal growth reports
between 1970 and 1979. Nganasans returned a serious upturn in self-reported
ethnic identity between 1979 and 1989, but experienced steep decline between
1989 and 2002. Kets showed a reverse trend with continued decline between
1979 and 1989 and an up-turn in the immediate post-Soviet decade (1989-2002).
However, all four report declines in the most recent census.
(3)
Indeed this is the trend one finds across Siberia: the process of ethnic shame
has advanced and spread considerably in the post-Soviet period. Some groups
began to show this decline in the period between the end of the USSR and
the first post-Soviet census (1989-2002), a group which includes four Tungusic
languages, Nanai, Ulcha, Udihe and Negidal, plus Koryak and Aleut. This
process has continued in the 2010 census returns.
(4)
The last group are just recently revealing these trends in ethnic shame and self-
invisibilization. The 2010 census returns show the first reported demographic
decline among the Chukchi, Sel’kup, Itelmen and Tofa.
(5)
(6)
a stretch to say that most of the language families of Siberia will be extinct
before 2100.
Since basically all Native Siberian people outside of Tuva (which only joined
the USSR in the 1940s) are fluent in Russian, it is no surprise that codemixing
or codeswitched utterances are commonplace in the speech of those Native
Siberians who continue to use the minority languages, such as the following
mixed Russian-Chulym Turkic sentence.
Code-switching with Russian is still not very well explored for almost any
minority language of the Russian Federation and will likely advance the study
of this phenomenon greatly when it can be better investigated. For example,
there is complex gender agreement interactions found in Erzya-Russian
codeswitching (Janurik 2015) and similar phenomena are found in most
languages of Russia today, but the topic has not been explored in most of them.
resulting directly from Russian contact and the adoption of Russian linguistic
norms into these originally quite different and distinct language systems.
One area of the syntax of complex sentences that has a clear Russian origin
and one which was originally entirely alien to Siberian Turkic structure is the
use of a clause-initial complementizer followed by a finite verb of the type
S <finite.matrix> COMP S <finite.“embedded”> (8) instead of the original structure of
S <non-finite.“embedded”> + S <finite.matrix> (9), often with a genitive marking on the
subject of the embedded clause to further show the non-finite or nominalized
quality of the embedded predicate. The borrowed Russian complementizer ʃto
introduces this new finite complement clause in (8).
Another Russian feature that has found its way into Native Siberian languages
is the use of a scopeless negative operator and a finite verb together with a
borrowed subordinator in a type of temporally subordinate clause generally
introduced by ‘until’ or ‘before’ in English translations, and by poka in
the original Russian and restructured Siberian Turkic forms. So compare
for example (12) and (14) with borrowing/restructuring with the original
formations in (13) and (15).
Intermediate stages in this shift can be found in the speech of some speakers
too. Thus in (16) we find a form introduced by the borrowed subordinator and
with the scopeless negative operator, but with a non-finite verb as well (here in
a participle form marked also with the locative case).
Lastly, the introduction of new functions to cases also extends to the genitive
in the speech of high-contact varieties of Siberian Turkic, such as Tofa (21) or
Abakan Xakas (22). Genitive case previously never was the case subcategorized
for by any verb as the form of its complement/object/2nd argument in the
Turkic languages, but now it is used in these highly restructured varieties
as the case that is projected onto the complement of the verb ‘fear, be afraid of’,
a pattern of usage that clearly and obviously reflects Russian norms. Originally
and still in less restructured varieties of these Turkic languages the ablative case
was/is used (23).
(21) Tofa
kør-gen-ɪ irezaŋ-nɯŋ men kòrt-pa-an men
see-pst.prtcpl-def bear-gen I fear-neg-pst 1
‘I was not afraid of the bear I saw’
(Field Notes 2001, SDA-Bear Story)
(24) Nivkh
t‘ana ñ-aχ lu-gu-ja
give.imp.2sg I-acc sing-caus-imp.2sg
‘let me sing’
(Gruzdeva 2015: 171)
(25) Russian
daj spo-ju
give.imp sing-fut.1sg
‘let me sing’
(Gruzdeva 2015: 171)
5 Summary
Five centuries of Russian colonialism and hegemony have left the once rich
diversity of languages in Native Siberia in steep decline. Economic exploitation
followed by waves of settlement colonization altered the demographics of
Siberia forever, and Russian emerged as the only language with a place long-
term in the linguistic market. Pressures came from below and above, and
language abandonment and ethnic shame have arisen and spread. The future is
very bleak for the languages of Siberia. Before they disappear entirely, massive
structural borrowing and incorporation of loans from Russian is their fate.
Only adequate documentation undertaken now can leave future generations
of Native Siberians the legacy they deserve should the social and economic
conditions ever once again become favorable to reclaim these identities.
References
Stolz, Christel (ed.) 2015. Language Empires in Comparative Perspective. Berlin: Mouton
de Gruyter.
Subrakova, O. P. 1992. Padezhnaja sistema v bel’tirskix govorax xakasskogo jazyka.
[The case system of the Bel’tir variety of Xakas] Xakasskaja dialektologija, pp. 32-50.
Abakan: XakNIIJALiI.
Verner, G. K. 1997. Jugskij jazyk. [The Yugh language] In A. P. Volodin (ed.) Jazyki mira:
Paleoaziatskie jazyki. Moscow: Indrik, pp. 187-195.
Internet Sources
http://www.perepis2002.ru/
http://www.tooyoo.l.u-tokyo.ac.jp/Russia/bibl/
Chapter 2
⸪
1 Introduction
to the same census, 212,522 of the combined total of Karachays and Balkars
declare to use Karachay-Balkar natively. The Karachay-Balkar settlements are
divided in two contiguous political units, but unlike the Ossetians in their
home regions, the Karachays and Balkars do not constitute a majority in their
respective republics, where there are sizeable, ethnic Russians and Caucasian-
speaking Kabardino-Cherkess (also known as Circassians).
Historically, the Scythians, Alans, and Sarmatians are considered the linguistic
ancestors of the modern Ossetians, although, evidently, the linguistic
documentation is rather meagre and often limited to personal names, the
occasional quote in a Classical Greek source (such as Herodotus), and grave
inscriptions.
The same may apply to the attempts to establish a direct linguistic link
between the modern day Karachay-Balkars and the (presumably) Kıpçak
speaking Cumans and Pechenegs, if not including the other elusive Bolghars
with their unclear Turkic affiliation. This is obviously a cause of disagreement.
Of course, there are other, mostly lesser-known Turkic languages spoken in the
Caucasus, such as Nogay (a South/Central Kıpçak or “Aralo-Kaspian” Turkic
language), Kumyk (West Kıpçak), who might also lay claim on these historically
attested peoples and tribes.
The scholarly study of Ossetic and Karachay-Balkar did not start in earnest
until the second half of the 19th century, right after the conquest of the
Caucasus by the Russians. Ossetic was studied in depth, thanks to the efforts of
the Finnish scholar Anders Sjögren and a prominent Russian scholar, Vsevolod
Miller. Subsequently, the native Ossetian scholar Vassiliy Abaev would build on
their works. Especially the historic relations of Ossetic with the other Iranian
languages, several European language groups (Slavic, Celtic), and, of course,
with Turkic too, became much better known. Abaev has also published several
important synchronic descriptions of Ossetic and textual editions of Ossetic
myths and folklore.
Karachay-Balkar, on the other hand, lacks similar kinds of wide-ranging
research, such as the interpretation of the customs, the historic dimensions
of the language, possible contacts with other ethno-linguistic groups, and so
on. Actually, it was not until at the turn of the 20th century that the Russian
linguist Nikolaj Karaulov recorded the Karachay-Balkar in earnest (Karaulov
1908). Even an in-depth description of the dialects of Karachay-Balkar has yet
to appear. A bilingual Karachay-Balkar – Russian dictionary did not appear
until 1989 (Tenišev 1989). The older Russian-written literature on Karachay-
Balkar was meant as an aid to help the Karachay-Balkars to master Russian.
The famous Ossetian linguist and prominent Ossetologist Vasilij I. Abaev seized
upon this historic “cohabitation” to confirm Nikolaj Marr’s Japhetic theory,
which was proclaimed in the early 1920s.4 According to this pseudo-scientific
theory, also known as the New Study of Language, languages rather reflect a
continuous merger of previous languages.5 Nikolaj Marr gave a Marxist twist
to the European mediaeval idea that, analogous to the legendary origin of the
Semitic peoples and their languages from Noah’s son Shem, most European
nations and ethnicities descended from Noah’s other son, Japheth. According
to Marr’ own interpretation, the languages spoken by Japheth’s children would
be the substrate that was later overlaid by Indo-European languages. The
different layers (of borrowing) would correspond to the different social classes
of ancient societies (in Europe). Language was considered a superstructure on
the base of society, concurrent to the creation of a (single) socialist economy.
As language mixing was therefore the logical consequence, the notion that the
languages of peoples could be traced back and therefore classified according
3 Other explanations are conceivable: a large group of mostly, Iranophone Alans may have
vacated these areas, voluntarily or involuntarily (after the well-documented Mongol inva-
sions). Subsequently, the Turkic Karachay-Balkar population occupied or dominated the
region, while they would also impose their language on the remaining (Iranophone) sheep-
herders. Neighbouring groups would still have called them by their older names, etc. It is
also conceivable that a previous, centuries-old situation of active bilingualism in these areas
had shifted in favour of a predominant monolingual Turkic environment, when an external,
Turkic language, i.e. Azeri, became the lingua franca of the Caucasus. This kind of linguistic
symbiosis and co-existence between two linguistically unrelated groups is well known else-
where in the world, e.g. between the Iranophone Balochis and Dravidian-speaking Brahuis
in Pakistan. The dominance of one language over another depended on the political constel-
lation and the linguistic preference often alternated with each generation.
4 Most of his articles on the Japhetic Theory can be found in Marr (1933).
5 This has led to a complaint by a Soviet scholar critical of Marr that “the study of the connec-
tions between related languages is turned over as a monopoly to bourgeois linguistics” (cited
by Pollock 2006, 122).
The Contacts between the Ossetians and the Karachay-Balkars 23
In the article published in the journal, Language and Thought, Abaev (1933)
discussed precisely the intermingling and mixing of the Karachay-Balkars and
Ossetians, being reflected in the mutual borrowings between these two groups.
Later on, he incorporated this study in his collected writings known as Ossetic
Language and folklore (Abaev 1949).6
Apparently, shortly after the official endorsement by Marr, a youthful Abaev
undertook a field trip to Baksan and its surroundings (located in the Kabardino-
Balkarian Republic). He then compiled a comprehensive list of putative Ossetic
and Karachay-Balkar parallels as published in Abaev (1933), which reflected
his faithful adherence to Marr’ Japhetic theory, the prevailing dogma of that
period. It comes as no surprise that this publication shows severe, method-
ological shortcomings. Notably, the article does not give an ultimate origin of
the forms, i.e. whether derived from Proto-Iranian, Old / Proto-Turkic or from
“Caucasian”. Even if we ignore the ideological bias, Abaev’s paper also contains
numerous factual errors, which he did not correct, when, subsequently, these
forms were incorporated in his famous Historical-Etymological Dictionary
of Ossetic. In this dictionary, he frequently assigned an older origin of these
parallels, which was finally permitted after Stalin had denounced the Japhetic
theory in an article.7
Unfortunately, Abaev often suggested etymologies that were a priori implau-
sible. Also, it seems that some of the forms cited by Abaev were rather ephem-
eral, such as the counting system with Digoron sounding names. This system
was apparently used in the southern Kabardino-Balkar region of Greater
Khulam (near to the Russian – Georgian border). Actually, these numerals do
not appear to be attested in other Karachay-Balkar-speaking areas (and there-
fore, not incorporated in the standard(ized) Karachay-Balkar language). The
publication includes other terms too. A good example is Ossetic gæmæx ‘bare,
with bare spots’, for which a Karachay-Balkar form gǝmǝx ‘a spot covered by
scarce vegetation’ was cited by Abaev as parallel, without giving any source.
So far, I have not found any corroboration for this, only Karachay qımıja ‘bare
(footed)’ (?). Forms such as gǝmǝx are perhaps no more than ad hoc borrow-
ings that we may very well encounter in the vocabulary of the few (bilingual?)
Balkar speakers who happened to have been in intensive contact with local
Digoron speakers (by marriage, trade or otherwise).
Although Abaev introduced the region as a kind of melting pot of customs,
traditions and languages of the local peoples, he did not explain the exact social
or sociolinguistic circumstances (such as code-switching, active bilingual-
ism, and other aspects of interlinguistic and multilingual communications)
of this region. It remains, for instance, unclear how competent those inform-
ers were in either Digoron Ossetic or Balkar, and how the linguistic skills were
acquired, through marriage, upbringing, trade or otherwise. There were argu-
ably no religious objections against intermarriages between Sunni Digoron
and Balkar speakers. Abaev asserted that the Ossetic elements in Karachay-
Balkar were not recent but the result of “the legacy of ancient Alanic-Turkic
mingling, which took place in the areas of all the gorges, from the Terek to the
Upper Kuban river”8 (Abaev 1949, 18). However, many of the claimed Ossetic
loanwords in Karachay-Balkar are not attested elsewhere, which would rather
suggest recent or ad hoc borrowing. This could be an indication of (recent)
bilingualism.
For this conference, I would like to present a few of my own observations and a
personal assessment of Abaev’s treatment of the Ossetic and Karachay-Balkar
“parallels”. I will limit myself to the categories A. and B. Abaev’s work is a very
instructive example of Soviet linguistics of the interbellum.
As for the assessment of the Karachay-Balkar material,9 I have relied prin-
cipally on the dictionary that Tenišev published in 1989, in order to find a con-
firmation whether Karachay-Balkar has genuinely assimilated these loanwords
mentioned by Abaev in its vocabulary. Another valuable publication is that
from Gustav Schmidt, who more explicitly considered the Ossetic borrowings
into Karachay (Schmidt 1931). Again, regrettably, he did not always identify
the ultimate origin of the Ossetic elements, whether they were inherited from
Old Iranian or merely local, Caucasian Wanderwörter, remained unanswered.
Recently, Ewa Siemieniec-Gołas has carried out a very valuable lexical study
on the Turkic “Erbwortschatz” of Karachay-Balkar (Siemieniec-Gołas 2000).
I have considered this work as well for the present talk.
ii. On the other hand, Ossetic must have borrowed quite substantially from
Karachay-Balkar as well. The difficulty is that quite often the Karachay-
Balkar forms are almost indistinguishable from their Turkic correspondences.
The following forms may derive from Karachay-Balkar due to its typical
phonological features:
– KB töppe ‘top, crown (of the head); peak; tuft’ (< PT *töppe, cf. OT töpi,
Kumyk töbe, Turkish tepe) ~ Oss. (Dig.) c’opp ‘pluck, wool’, (Iron) c’upp sum-
mit, peak’ (c- < *ti).
– KB kaya ‘rock, boulder’ (< PT *kaya, Turkish kaya etc.) ~ Oss. k’æj/ k’æjæ
‘slate’. Evidently, the Ossetic forms may also derive from another Turkic lan-
guage. The Svanetic form k’a ‘slate’ however is rather a direct loanword from
Iron Ossetic k’æj.
– KB ırxı ‘stream, creek’ (< OT arık ‘irrigation canal’, cf. Chagatay arığ, Turkish
ark, etc.) ~ Oss. (Dig.) ærxæ ‘gorge, dry riverbed’.
– KB tılpıw ‘vapour; air’ ~ Oss. (Dig.) tulfæ ‘vapour, steam’, see further below.
iv. The following forms may stem from a third source, perhaps independently:
– Oss. (æ)zme(n)sæ (Dig.) ‘sand’ (Iron yzmis) ~ KB (Balk.) üzmez ‘id.’. The Balkar
form does not conform to Turkic morphology, hence it might be a borrow-
ing from Ossetic, although it has no further correspondences in Iranian or
in the neighbouring Caucasian languages. An Iranian preform *uz-maišā-
‘mixture, being mixed up’ (*maiz- ‘to mix, mingle’) has often been suggested
(cf. Abaev 1958-1995, 4: 282), but this reconstruction is fraught with prob-
lems, both semantically and morphologically.
– Oss. xuræ ‘gravel’ (Iron xoyr) ~ ? KB (Balk.) xuru ‘stony place, cobblestones’
(no further documentation).
– Oss. cuxcur ‘flowing water’ ~ KB çuçxur ‘water fall’ ← South Caucasian /
Kartvelian ?, cf. *me-rčx-e ‘shallow (of water)’, *rečx-/rčx- ‘to purl, babble,
murmur’ (Klimov 1998, 119, 157).
– typpyr / tuppur ‘bloated, fat’; [Dig.] hill’ ~ duppur ‘hill’, with similar forms in
Darginian dupur ‘mountain’, Persian topoli ‘fat’, derived from a Turkic forma-
tion with *töppe?
28 Cheung
B. The 32 terms from the animate natural field are largely neither from Iranian
nor Turkic. The botanical terms are usually indigenous (Caucasian). Of the
parallels, 7 are Ossetic forms borrowed into Karachay-Balkar, 5 from Karachay-
Balkar into Ossetic, whereas the remaining 14 may be most likely from a third
source (independently). Finally, 6 borrowed forms may be just ephemeral (4)
or misinterpreted (2).
ii. Several Karachay-Balkar forms from the animate realm have entered
Ossetic. We may cite the following forms, which in turn may be borrowings
from another language:
– KB bittir ‘bat’ ~ Oss. (Dig.) bittir (Iron xælyn-byttyr) ‘id.’, see below.
– KB gabu ‘dandruff’, (Karachay) gıbı, (Balkar) gubu ‘spider’ ~ Oss. gæby, gyby /
gæbu ‘mite’. The Ossetic forms appear to be borrowings from Karachay
Balkar gabu, etc., which again may be an adaptation of a Kartvelian
The Contacts between the Ossetians and the Karachay-Balkars 29
formation, notably from a Georgian dialect form, cf. Gurian ǯɣiba- ‘tick’
(Klimov 1998, 100).
– KB gılıw ‘foal; rat’ ~ Oss. (Dig.) gælæw ‘rat’. According to Abaev, gælæw is
an “infantile deformation” of k’ælæw ‘foal’, which would be comparable to
Kabardian qolow ‘piglet’, Georgian qoqo, Megrelian ɣoɣo ‘calf of buffalo’.
Rather than considering “infantile deformation”, gælæw may simply be a
loanword from Karachay-Balkar, as gılıw has retained the two meanings
‘foal; rat’. Of course, Karachay-Balkar gılıw may well be Caucasian in origin.
– KB mıga ‘quail’ ~ Oss. mæga ‘snipe’. The Balkar form probably directly stems
from Kartvelic, notably Georgian mc̣qer- ‘quail’. Balkar would have simpli-
fied the consonant cluster of the Kartvelian original formation, and, then,
have passed on the term to Ossetic: Oss. mæga clearly shows a semantic
shift.
– Oss. (Dig.) pursa (Iron pysyra) ‘nettle, Urtica urens’ ~ mursa ~ ‘id.’, see below.
iii. The following forms feature borrowings from a third source. They consists
mostly of terms from the local flora, which are often Caucasian:
– Oss. æxsæli, æxsælæ, (Iron æxsæly) ‘juniper’ ~ KB (Balk.) şkeyli, şkildi
‘id.’ ← South Caucasian?, cf. Georgian ašk’ili ‘wild rose’, Mingrelian
šker- ‘rhododendron’.
– Oss. sk’eldu ‘cowberry’ ~ KB şkildi ‘juniper’ (‘можжевельник’, Tenišev 1989,
751), kızıl şkildi, (Kar.) kızıl işkildi ‘cowberry, Vaccinium vitis-idaea’, also dial.
ışkıldı ?) ← a variant of the Caucasian ‘juniper’ forms (-di: unanalyzable suf-
fix in both Ossetic and Karachay-Balkar). The Karachay-Balkar forms seem
to be borrowings from an unattested Iron correspondence *(y)sk’ildy, cf.
Dig. sk’eldu.
– Oss. cym/cumæ ‘dogwood, Cornus’ ~ KB çum ‘id.’, cf. Lezgian čumal,
Tabassaran čemel ‘id.’ (similar forms: Turkish çim ‘grass’).
– Oss. ʒedyr, ʒeʒyr, ʒedyræg / ʒæduræ ‘blackberry’ ~ KB züdür ‘id.’ ← a Caucasian
language ?, perhaps to be analyzed as *zǝ ‘red; blackberry’ (cf. Adyghe zǝ
‘red’ or Abkhaz Bzyp a-z ‘blackberry shrub, bush’, Chirikba 1996, 87), and
*dur ‘fruit’ ? (cf. Lezgian dur ‘dried fruit’, Ossetic dyrǧ ‘fruit’, loanword).
Alternatively, it may be a borrowing from Finno-Ugric, according to Tenišev
(1989, 807), who apparently follows Abaev (1958-1995, 1: 396).
– Oss. ʒæbidyr / ʒæbodur, ʒæbedur ‘mountain goat, Capra caucasica’ ~ cuǧutur
‘id.’ ← a preform *ǯəɣʷətur, undoubtedly of Caucasian origin, probably West-
Caucasian, cf. Adyghe šəquɫtər ‘id.’ (Apažev – Kokov 2008, 576).
– Oss. mæntæg / mæntæg, mont ‘burdock’ ~ KB mant ‘id.’ ← Wanderwort ?, cf.
Svan mant ‘id.’, Greek mínthē ‘mint’.
30 Cheung
– Oss. næzy / næzi ‘pine, Pinus sylvestris’ ~ KB (Balk.) nazı, (Karachay) nızı
‘fir’ ← Kartvelic *naʒw ‘spruce, fir(-tree)’, cf. Georgian naʒv (but also as a
regional Wanderwort in other Middle Eastern languages, cf. Persian nāz,
nāžu, nājū ?).
– Oss. murtgæ, murk’æ ‘Viburnum’ ~ KB (Karachay) murtxu, from Kartvelic, cf.
Georgian marc̣qv- ‘strawberry’.
– Oss. mæra / mura, pura ‘hollow’ ~ KB pura ‘hollow, rotten (tree)’ ← ?, cf.
Chechen, Ingush mur ‘hollow tree’.
– Oss. ninæǧ ‘raspberry, Rubus idaeus’ (Iron mænærǧ) ~ KB nanık ‘id.’ ← ?
– Oss. tægær ‘maple’ ~ tıgır, (Balk.) tıkır ← Caucasian, cf. Svan tek’er, tek’ra
‘maple’.
– Oss. turtu, (Iron) tyrty ‘barbarry, Berberis vulgaris’ ~ KB türtü ← Wanderwort ?,
cf. Lezgian turt ‘id.’, similar forms such as Persian tūt.
– Oss. ug ‘owl’ (Iron wyg) ~ KB uku. No doubt, these forms are onomatopoetic
in origin, cf. Megrelian, Laz ɣu, Svan ɣu, etc. Ossetic and Karachay-Balkar
have probably borrowed the forms independently from each other, perhaps
from another Caucasian language (if we are not dealing with “spontaneous”
expressive forms).
– Oss. gælæbo, gæbælo, (Iron) gælæbu ‘butterfly’ ~ KB (Karachay) göbelek
‘id.’ Abaev also cites the Balkar forms gebelo, gelbo (← Digoron?), probably,
ultimately, of Turkic origin (cf. Turkish kelebek). Almost all Turkic corre-
spondences of kelebek have retained a final velar (with the exception of geo-
graphically distant Uyghur kepilɛ). In addition, the voiced velar g- needs an
explanation.
– Oss. mæga ‘snipe’ ~ KB (Balk.) mıga ‘quail’ ← a Kartvelic preform * mc̣q̣a-,
cf. Georgian mc̣q̣er- ‘quail’. Both Ossetic and Karachay-Balkar would have
simplified a difficult to pronounce consonant cluster mc̣q̣°. On final *–r in
Proto-Kartvelic, cf. Klimov (1998, 317f.).
12 Abaev also entertains the possibility of a connection with several European designa-
tions for ‘wood; log’, e.g. Greek kládos ‘branch’, Slavic *kòlda ‘block, log’ (Russian kolóda),
Germanic (Old Icelandic) holt, German Holz ‘wood’. This may be co-incidental rather
than an instance of “Scytho-European” borrowing.
The Contacts between the Ossetians and the Karachay-Balkars 31
v. The following cited forms are fully unclear, also because of the unclear
meaning (misinterpretation, misheard?):
– Oss. bynʒ / binʒæ ‘fly’ ~ KB didin ‘wasp’ (not confirmed elsewhere, also not
included in Abaev 1958, 280).
– Oss. ʒægæræg ‘not fully bloomed flower’ ~ KB cıgıra, zıǧıra a kind of edible
plant.
3 Some Observations
We can notice several highly interesting forms that the linguistic ancestors
of Karachay-Balkar and Ossetic must have borrowed from the period prior to
their arrival in the (northern) Caucasus, i.e. before the Mongol invasions in
the 13th century. Abaev was the first scholar to label these ancient borrowings
as “Scytho-European” isoglosses, which in practice, meant that the ancestors
of the Ossetians would have borrowed, mainly, from Germanic and Slavic
(also Celtic and Latin), on which see Abaev (1965). However, similar, ancient
borrowings from Hungarian were not included in this label, simply because of
the fact that Hungarian was not part of the Indo-European language family.
A typical example of such a “Scytho-European” isogloss as defined by Abaev is
the following “Ossetic ~ Karachay-Balkar parallel”:
– Oss. synʒ /sinʒæ ‘thorn; blackthorn; splinter ~ KB (Balk.) şinji ‘spine, (plant)
needle’. In this case, sinʒæ may reflect older *spina-13 + dimin. suff. *čī. The
preform *spina- would be a loanword, most conceivably from East Slavic,
cf. Russian spiná ‘spine’, Old Polish spina ‘id.’ (inherited forms or, loanwords
ultimately from Latin ?). The “spine” form appears to be a widespread
European cultural term, attested in Latin spina ‘thorn’, Baltic (Latvian) spina
‘rod’, Germanic (e.g. Old High German spinela ‘hairpin’), English spine, etc.
13 Initial *sp- > *sf- > *s’s’ (palatalization) > modern Oss. s-, cf. sistæ, Iron syst ‘louse’ < PIr.
*spiš + *čī (e.g. Avestan nom. sg. spiš, Persian šepeš ‘id.’).
32 Cheung
We can cite several borrowed forms for which Abaev claims Ossetic as the
initial adopter, but actually, they most likely have entered an earlier stage of
Karachay-Balkar first, before their adoption into Ossetic:
– KB bittir ‘bat’ ~ Oss. (Dig.) bittir (Iron xælyn-byttyr) ‘id.’ ← South Slavic, espe-
cially Church Slavic nepŭtyrǐ ‘bat’ (which shows metathesis of t . . . p > p . . . t,
cf. Russian netopyr’). The Ossetic and Karachay-Balkar forms appear to be
an ancient borrowing from (South) Slavic. The question of course is which
language has borrowed first. The apparent loss of ne˚ may give us a clue.
Karachay-Balkar (and other Turkic languages) does not have native nouns
with initial ne˚, only derivatives of the pronoun ne ‘what’ are attested, cf.
Siemieniec-Gołas (2000, 158 f.). A Turkic speaker would have most likely
re-analyzed such a foreign formation, South Slavic nepŭtyrǐ, as an expres-
sion with the interrogative pronoun ne. In contrast, there would be no
apparent reason to resort to such a re-interpretation in Ossetic. There are
several inherited formations with initial (Proto-Ossetic) *ne˚, e.g. *nez
(= Digoron nez, Iron niz) ‘disease’, *new- ‘to cry’ (= Dig. new-, Iron niw-), *ne-
negative prefix. Initially, an early predecessor of Karachay-Balkar would
thus have borrowed the South Slavic form, after which it was passed on
to Ossetic.
The Contacts between the Ossetians and the Karachay-Balkars 33
– KB mursa ~ Oss. (Dig.) pursa (Iron pysyra) ‘nettle, Urtica urens’ ‘id.’. According
to Abaev (1949) the Ossetic form has been borrowed into Karachay-Balkar,
with the initial labial stop becoming the corresponding nasal m-. This
however cannot be correct, as only older voiced b-14 may become m- in
Karachay-Balkar, e.g. the indigenous name for the Balkar is Malkar. It is
more likely that an earlier Karachay-Balkar form *bursa is the source of the
Ossetic form, which looks not very ancient anyway (with atypical p- and
final -a, rather than f- and -æ respectively). Therefore, those Cumans with
their extensive contacts may have passed on this *bursa to Karachay-Balkar.
If so, Karachay-Balkar mursa < *bursa may have been an old borrowing from
Hungarian, viz. borsó ‘pea’ (Old Hungarian burso 1254, a place name), with
final -ó < *-Vk(V) and de-affricatisation of *č > s).15 The Hungarian form
itself reflects a Turkic loanword *burčak (Benkő 1992-1997, 1: 129), which is
the term for a legume, pulse(-like) plant, notably pea, vetch (and also ‘hail-
stone’), cf. Turkish burçak ‘vetch’, Karachay-Balkar burçak ‘hail’, (Balkar) ‘pea’
(Siemieniec-Gołas 2000, 70 f.; Clauson 1972, 357; Sevortjan 1978, 275 f.). The
semantic shift from ‘a legume’ to ‘nettle’ in Karachay-Balkar mursa needs an
explanation16 though.
14 Admittedly, the fate of the initial labial stops in Turkic is rather complicated. According to
Pritsak (1958, 352), b- becoming m- is a typical Kıpçak development (“echt kiptschakisch!”),
e.g. maka ‘frog’ (< PT *bāka, cf. Kumyk baka), (Karachay) miyik ‘big’ (but Balkar biyik, cf.
Tatar, Nogay biyik < PT *bädük, cf. Turkish büyük). It is difficult to postulate a watertight
phonetic rule though, especially since there are relatively few cases within Karachay-
Balkar (and borrowings from other Turkic languages may have distorted a possible pho-
netic distribution).
15 Besides, Chuvash părça ‘pea’ shows loss of the final velar. Assuming that Karachay-Balkar
mursa is a loanword from Chuvash is fraught with phonological and historical inconsis-
tencies. The older suggestion that pre-historic Ossetic would have borrowed somehow
from Chuvash is equally problematic. According to Gombocz (1912, 52), the Finno-Ugric
forms, Mari pursa, pırsa (both modern Chuvash and Mari lack indigenous voiced stops)
and Hungarian borsó would all have been borrowed from Old Chuvash *burčaɣ, but this
reconstructed Old Chuvash *burčaɣ is simply too close to all the other Turkic forms to
corroborate this statement, at least, with regard to Hungarian borsó.
16 Perhaps, the preform *bursa is a blend formation of two similar Hungarian forms: borsó
‘pea, vetch’ and bors ‘pepper’ (bors ← Turkic burç ← ultimately Sanskrit marica, Clauson
1972, 771 f.; Sevortjan 1978, 274 f.).
34 Cheung
Finally, from the inanimate sphere (cat. A., see above), we can cite another
likely instance of ancient Cumanic borrowing:
17 On the development of intervocalic *p > Ossetic v, (after *u) b, cf. Cheung (2002, 18f.).
The Contacts between the Ossetians and the Karachay-Balkars 35
modern Karachay-Balkar also in certain lexicalized phrases, e.g. bu-kün ‘today’ >
bügün. However, the exact circumstances of this kind of umlaut are unclear.
If the direction of the borrowing were the other way round, Ossetic initial -u-
would have been consistently adapted as -u-/-ü- in Karachay-Balkar.
A long exposé published by the Ossetian scholar Vassilij Abaev (Abaev 1933)
illustrates this situation on the relation between the linguistic ancestors of
modern Iranophone Ossetic and Turcophone Karachay-Balkar speakers.
He interpreted the Ossetic – Karachay-Balkar parallels found in the local
Balkar dialect as the outcome of ancient “Alanic-Turkic” mingling, on top of
a Japhetic/Caucasian substrate. Nevertheless, the bias is not only due to the
adoption of Marr’s Japhetic Theory. It also had a personal bias, as he ascribed
the great majority of these cases to an earlier Ossetic provenance, giving little
thought to the possibility that Karachay-Balkar could also have passed on
many borrowings to Ossetic as well.
We can summarize our conclusions drawn from the assessment of the Ossetic–
Karachay-Balkar parallels discussed by Abaev (1933), as follows (based on two
semantic categories):
The main criteria that have allowed us to distinguish the direction of borrowing
between Ossetic and Karachay-Balkar are:
References
Abaev, Vasilij I. 1933. “Poezdka k verxov’jam Kubani, Baksana i Čereka [Trip to the
upper reaches of the Kuban, Baksan and Cherek rivers].” Jazyk i Myšlenie I: 71-89
[= Abaev 1949, 271-290].
Abaev, Vasilij I. 1949. Osetinskij Jazyk i Folk’lor, vypusk I [Ossetian Language and Folklore,
part I]. Moskva – Leningrad: Izdatelʹstvo Akademii nauk SSSR.
Abaev, Vasilij I. 1958-1995. Istoriko-ètimologičeskij slovar’ osetinskogo jazyka [Historical-
Etymological Dictionary of the Ossetic Language]. 5 vols. Moskva – Leningrad:
Institut jazykoznanija RAN.
Abaev, Vasilij I. 1965. Skifo-evropejskie izoglossy na styke Vostoka i Zapada [Scytho-
European isoglosses on the crossroad between East and West]. Moskva: Nauka.
Anfert’eva, Antonina N. and Nikolaj N. Kazanskij. 2013. “Materialy k istorii Instituta
Lingvističestix issledovanij RAN 1921-1934 gg. (ot Instituta jafetidologičeskix
Izyskanij do Instituta jazyka i myšlenija im. N. Ja. Marra) [= Materials on the his-
tory of the Institute of Linguistic Research RAN from the years 1921-1934 (from
the Institute of Japhethological Investigation to the Institute of Language and
Thought, named after N. Ja. Marr)].” Acta Linguistica Petropolitana, Trudy Instituta
Lingvističestix issledovanij RAN 9, no. 1: 1-437.
Apažev, Muxamed and Džamaldin N. Kokov. 2008. Adygè-urys psal’al’è / Kabardino-
čerkessko-russkij slovar’ [Kabardino-Cherkess Russian Dictionary]. Nalčik:
Gosudarstvennoe izdatelʹstvo inostrannyx i nacionalʹnyx slovarej.
Benkő, Lórand. 1992-1997. Etymologisches Wörterbuch des Ungarischen. 3 vols.
Budapest: Akadémiai Kiadó.
Bennigsen, Alexandre. 1983. The Islamic Threat to the Soviet State. London – Canberra:
Croom Helm.
Cheung, Johnny. 2002. Studies in the Historical Development of the Ossetic Vocalism.
Wiesbaden: Dr. Ludwig Reichert Verlag.
Chirikba, Viacheslav. 1996. Common West Caucasian: the reconstruction of its phonologi-
cal system and parts of its lexicon and morphology. Leiden: Research School CNWS,
School of Asian, African, and Amerindian studies.
38 Cheung
Introduction
The articles in this volume are for the most part concerned with sociolinguistic
aspects of languages of the Caucasus, but in the present article I want to
draw attention to the fact that these languages also have unusual features
of grammatical interest. In addition to the importance of documenting and
preserving languages of the Caucasus as part of their communities’ cultural
heritage, the languages are also of scientific importance because of their
structural properties.
I have selected six features for brief discussion, with reference to more
extensive treatment in the literature. The examples represent all three of the
indigenous language families of the Caucasus, with one each from Kartvelian
and West Caucasian (Abkhaz-Adyghe), and four from East Caucasian (Nakh-
Daghestanian), more specifically the Tsezic branch of East Caucasian; the
greater concentration on the last mentioned simply reflects my own greater
familiarity with these languages.
In Georgian, the finite verb indexes (or in more traditional terminology: agrees
with) its subject and object in person and number. This means that a transitive
verb indexes both its subject and its object. The relevant forms can be found in
any standard grammar of Georgian, and are to a large extent straightforward.
The forms in (1) show a selection of relevant forms in the present tense.
1 A version of this article was presented at the 1st International CUA Conference on Endangered
Languages, Ardahan, Turkey, 13-16 October 2014. I am grateful to the conference organizers
for making this event possible and to all those who participated in the discussion of my pre-
sentation. The following abbreviations are used: abs absolutive, ad ad(essive), adj adjec-
tive, all allative, antip antipassive, attr attributive, caus causative, erg ergative, in
in(essive), ipfvcvb imperfective converb, loc locative, obl oblique, prf perfect, prs pres-
ent, pstunw past unwitnessed, rel relative, resptcp resultative participle, sg singular.
The stem of the verb is invariable, xedav, to which can be attached a prefix
and/or a suffix, depending on the person of the arguments to be indexed.
Each of the overt subject prefix, overt subject suffix, and the two overt object
prefixes receives a consistent interpretation. Beyond that, inferencing is
required, as developed in more detail in Comrie (2013: 24-26).
First, absence of an object prefix on a transitive verb always indicates a
third person object. Thus, having identified that none of the object prefixes
is present, the hearer can make this inference. The case of absence of subject
prefixes is more complex, and here the hearer may have to go through several
stages. There is no overt second person affix, whether prefix or suffix, so as a
2 Although the inferences are presented as if conscious processes undertaken by the hearer,
such processing is of course below the level of consciousness, and combinations may well be
routinized by the time a child acquiring Georgian achieves fluency in the system.
Why Caucasian Languages ? 41
general rule the absence of both the first person subject prefix and the third
person subject suffix can be taken as an indication of a second person sub-
ject. There is, however, one exception to this, which follows from the fact that
Georgian does not permit combinations of person prefixes. The form g-xedav
has a second person object prefix, so clearly the object is ‘you’. What is the
subject? It cannot be third person, since this would require the third person
subject suffix -s, as indeed shows up in the form g-xedav-s in (1). Normally,
absence of a subject affix indicates a second person subject, so can g-xedav
mean ‘you see yourself’, with coreferential subject and object second person
arguments? The answer is negative, because another rule of Georgian inter-
venes, namely one that says that objects coreferential with the subject are not
expressed by means of an object affix on the verb, but rather by means of a
separate word tavi, literally ‘head’, so that a combination of second person sub-
ject and second person object is expressed literally as ‘you hit your head’; since
‘head’ counts as third person, it has no overt person affix in the verb morphol-
ogy. For g-xedav this leaves only one remaining interpretation, namely ‘I see
you’, and this turns out to be correct, since in Georgian when a subject and an
object prefix compete for the prefixal person position in the verb morphology,
the object wins out and the subject remains unexpressed. In g-xedav, there
is no piece of the verb’s structure that we can identify as expressing the first
person subject, and a zero subject marker does not in itself indicate a first per-
son subject; rather, it is the interaction of a number of principles of Georgian
verb morphology that conspire to determine the unique correct interpretation
of g-xedav.
In examples (3) and (4), the relative clause consists of a sequence of several
words (in fact, two in (3) and three in (4)), and the head is a separate word. This
can be seen most obviously from the presence of one stress per word, marked
in this transcription by means of an acute accent on the vowel in question.
In addition, Kabardian also has a postnominal relative clause, with basi-
cally the same internal structure as the prenominal relative clause, except that
there are much heavier restrictions and a very different prosodic structure. An
example is provided in (5).
3 Though not directly relevant to the structure of relative clauses, the voicing of the initial
fricative of ‘boy’ in (5) shows that there is no phonological word boundary between it and the
preceding article.
Why Caucasian Languages ? 43
more than a single grammatical word in the relative clause must be expressed
by means of prenominal relatives.
Kabardian illustrates first a clear example of language that has both pre-
nominal and postnominal relative clauses but where it is the postnominal rela-
tive clause that is more restricted in its expressive possibilities. Moreover, it
provides excellent illustration of the need to include phonological information
in carrying out syntactic analysis.
3 Tsezic Pharyngealization
This section examines two languages from the Tsezic branch of East Caucasian,
Tsez (based on the fuller discussion in Maddieson, Rajabov, & Sonnenschein
1996) and Bezhta, with the link between the relevant phenomenon in the two
languages following the discussion in Comrie (2003).
One of the characteristic phonetic features of Tsez is pharyngealization. In
the indigenous vocabulary, phonemic pharyngealization occurs in two sets
of circumstances. First, uvulars may occur pharyngealized in any position in
the word, including word-finally, cf. the contrast between raq ‘side’ and raqˁ
‘wound’. In this position, phonetic pharyngealization characterizes basically
just the consonant, including its release, with only minimal effect on the pre-
ceding vowel.
Second, and more interestingly for present purposes, phonemic pharynge-
alization may characterize the initial (C)V of a word, cf. the contrast between
-oƛo4 ‘amongst’ and ˁoƛno ‘seven’, and between mo ‘(eye) tear’ and mˁow ‘kind
of mushroom’. Here, phonetic pharyngealization characterizes the whole of
the vowel, and the last part of the initial consonant if there is one; it does not
extend to the rest of the word, i.e. beyond the initial (C)V.
Corresponding to a pharyngealized vowel (or CV sequence) in Tsez, in
Bezhta one finds what has traditionally been described as an “umlauted”
vowel, as seen when comparing Tsez ˁaƛ ‘village’ with Bezhta äƛ. The precise
distinctive phonetic value of umlauted vowels in Bezhta remains to be investi-
gated in detail, although impressionistically they are less pharyngealized and
4 The leading hyphen indicates that the postposition is normally preceded by a gender-number
prefix. While pharyngealization is frequent in Tsez lexical representations and clearly phone-
mic, it is not easy to find minimal or even near-minimal pairs.
44 Comrie
more fronted than their Tsez cognates.5 But there is a further, perhaps more
interesting difference.
In Tsez, as noted, pharyngealization on the initial syllable is restricted pho-
netically to that syllable. By contrast, in Bezhta umlauting of the vowel of an ini-
tial syllable extends to following syllables, though tending to fall off in intensity
as one gets towards the end of the word, especially with longer words. Thus, the
past tense of the verb ‘cough’ in Bezhta is öhƛö-yö, where not only the second
vowel of the stem but also the vowel of the suffix is umlauted, giving rise to vowel
alternations in suffixes, cf. xuƛo-yo, past tense of ‘drink’. In other words, Bezhta
has developed vowel harmony: In general, the vowels of a word are either all
umlauted or all non-umlauted. The comparison of Tsez and Bezhta thus shows
us one possible scenario for the origin of vowel harmony, through the extension
of what was originally an opposition phonetically characterizing only the first
syllable to one that characterizes phonetically the whole of the word.
4 Personification in Tsez
Tsez, like most East Caucasian languages, has a gender system.6 In Tsez, there
are four genders, distinguished in the singular for instance by the prefix which
they require on vowel-initial verbs and adjectives that agree with that noun
phrase. The four classes are set out in (6), with the prefix used for agreeing with
a noun of that gender, and a brief semantic characterization of the nouns that
constitute each class.
5 While the Bezhta umlauted vowels ä, ö, and ü are often traditionally characterized as front
vowels, acoustic investigation of their formant structure by Sven Grawunder (Max Planck
Institute for Evolutionary Anthropology) suggests that while there is some fronting, at least
for ö and ü, it does not advance even as far as the central position. The Tsez and Bezhta coun-
terparts sound different, but the precise phonetic characterization of the difference remains
open.
6 Traditionally, this often referred to as a noun class system in speaking of East Caucasian lan-
guages. For present purposes at least, this is purely terminological. In general, the gender of a
noun is not reflected in its own form, much as in German one simply has to learn the genders
of Löffel ‘spoon’ (masculine), Gabel ‘fork’ (feminine), and Messer ‘knife’ (neuter).
Why Caucasian Languages ? 45
In somewhat more detail: Gender I contains all and only nouns denoting
male humans (plus supernatural beings assimilated to male humans).
Gender II contains all nouns denoting female humans plus a restricted
number of inanimate nouns. Gender III includes all nouns denoting animals,
plus a large number of inanimate nouns. Gender IV contains a large number
of nouns, all of them inanimate. While in one sense the main problem in
learning the gender of nouns in Tsez is the fact that inanimates are assigned
across three genders, for present purposes we are only concerned with the
semantically determined features of the system: Male humans are in gender I,
female humans in gender II, animals in gender III.
What, then, happens in a traditional tale where the participants are ani-
mals, but playing the role of humans? One traditional Tsez story that I discuss
in more detail in Comrie (2005) involves as main participants a rooster and a
hen, who play the role of a husband and wife, with traditional family roles, as
can be seen in (7), where the rooster goes out to work while the hen stays home
to look after the house.7 There is thus a potential conflict between the fact that
the protagonists are animals (which would require gender III), but behave as
humans (which would require genders I and II).
A similar problem arises in English with the choice of pronouns in the third
person singular, where English distinguishes, roughly speaking, male human
he, female human she, non-human (including animals and inanimates) it.
If I were to tell the story of the rooster and the hen in English, then I would
almost certainly use the pronouns he for the rooster, she for the hen, since what
is deemed relevant is the human social role that each illustrates. In Tsez, by
contrast, gender III, appropriate for animals, is used, so that in the first line
of (b) the verb b-ik’i-x has the gender III prefix agreeing with ‘rooster’, and in
the second line the verb b-eynoy-xo again has the gender III prefix, this time
agreeing with ‘hen’.
7 The full Tsez text of the story, with a Russian translation, is available in Abdulaev & Abdullaev
(2010: 44-47); the version there is slightly different from the one used here, but the differences
are not significant.
46 Comrie
5 Bezhta Antipassive
8 The formation of antipassives in Bezhta is, however, rather idiosyncratic, lexicalized, as will
be seen in the following examples.
Why Caucasian Languages ? 47
However, unlike the English passive, the Bezhta antipassive not only changes
the grammatical relations, it also changes the TAM of the clause, adding the
semantic component of durative, i.e. extended in time, which depending on
the lexical meaning of the verb might involve simply extending the particular
action in time, as in (12), or repeating it so that the resultant complex action
takes up more time than a single instance of the action in question, as in (14)
and (16) below. The question therefore arises whether the Bezhta antipassive
is really to be characterized as a voice, even though in (12) the grammatical
relations are different from in (11), or whether it should rather be characterized
as an aspectual, durative form, since this is its semantic import.
The situation becomes more complex if we include intransitive verbs. First,
Bezhta has a small class of intransitive verbs with onomatopoeic semantics
that take their single argument in the ergative case, as in (13). Diachronically,
these may originally have been transitive constructions (of the general type
‘the boy uttered X’), but synchronically they are one-place predicates.
With the general run of intransitive verbs, the antipassive, as in (16), has exactly
the same case frame as the basic construction, as in (15); although the form of
the verb shifts to indicate the antipassive.
6 Bezhta Adjectives
As the final set of data on Tsezic languages, we consider the word classes
represented in the native and borrowed vocabulary of Bezhta, following
Comrie & Khalilov (2009). Of particular interest here are adjectives. Bezhta has
only a small number of indigenous underived adjectives, such as -uk’o9 ‘big’,
k’et’o ‘good’. In addition, within the indigenous vocabulary there are adjectives
derived from other word classes, such as participles derived from verbs, e.g.
-uɣo-yo ‘dead’, morphologically -die-resptcp, i.e. something like ‘having died’.
A large proportion of Bezhta’s adjectives, however, are loans from Avar, the
traditional lingua franca of the area, such as bercinab ‘beautiful’.
Within the Loanword Typology project, of which Comrie & Khalilov (2009)
forms part, for each language a fixed list of lexical meanings was used, trans-
lated into the language in question, and then identified as an indigenous for-
mation or a loan, in the latter case with further identification of the immediate
9 A leading hyphen indicates the position where an agreement (indexing) prefix precedes
the stem.
Why Caucasian Languages ? 49
source. The data in (17) show, for each word class in Bezhta, the percentage of
words on the fixed list that are indigenous versus borrowed from Avar, from
Russian, from Georgian, or from some other source.
(18) Loanwords in English by part of speech and source language (Grant 2009)
Bezhta is unusual in that among its substantial number of loans from Avar, the
main source of its borrowings in the LWT wordlist, the percentage of adjectives
that is borrowed is greater than the corresponding percentage for any other
word class. These adjectives also, incidentally, include many frequent words,
such as bercinab.
7 Conclusion
10 Likewise from Latin into English, although the numbers are smaller and therefore less
probative—a single item can shift the percentages of word classes significantly. From Old
Norse into English there are actually more adjectives than nouns or verbs, although the
numbers are again small.
50 Comrie
References
Abdulaev, Arsen K. & Isa K. Abdullaev. 2010. Cezyas folklor [Tsez folklore]. Leipzig–
Makhachkala: “Lotos”.
Applebaum, Ayla Ayda Bozkurt. 2013. Prosody and grammar in Kabardian. PhD disser-
tation, University of California, Santa Barbara.
Applebaum, Ayla B. & Andrea L. Berez. 2009. A theory is only as good as the data: cast-
ing a wide net in Kabardian and Ahtna documentation. In Peter K. Austin, Oliver
Bond, Monik Charette, David Nathan, & Peter Sells (eds) Proceedings of Conference
on Language Documentation and Linguistic Theory 2, 29-38. London: SOAS.
Comrie, Bernard. 2003. A note on pharyngealization and umlaut in two Tsezic lan-
guages. In Winfried Boeder (ed.) Kaukasische Sprachprobleme, 105-109. Oldenburg:
Bibliotheks- und Informationssystem der Universität Oldenburg.
Comrie, Bernard. 2005. Grammatical gender and personification. In Dorit Diskin Ravid
& Hava Bat-Zeev Shyldkrot (eds) Perspectives on language and language develop-
ment: Essays in honor of Ruth A. Berman, 105-114. Dordrecht: Kluwer.
Comrie, Bernard. 2013. Ergativity: some recurrent themes. In Edith L. Bavin & Sabine
Stoll (eds) The acquisition of ergativity, 15-34. Amsterdam: John Benjamins.
Comrie, Bernard & Madzhid Khalilov. 2009. Loanwords in Bezhta, a Nakh-Daghestanian
language of the North Caucasus. In Martin Haspelmath & Uri Tadmor (eds)
Loanwords in the world’s languages: A comparative handbook, 414-429. Berlin: De
Gruyter Mouton.
Comrie, Bernard, Madzhid Khalilov, & Zaira Khalilova. 2015. Valency in Bezhta. In
Andrej Malchukov & Bernard Comrie (eds) Valency classes in the world’s languages,
541-570. Berlin: De Gruyter Mouton.
Grant, Anthony P. 2009. Loanwords in British English. In Martin Haspelmath & Uri
Tadmor (eds) Loanwords in the world’s languages: A comparative handbook, 360-383.
Berlin: De Gruyter Mouton.
Maddieson, Ian, Ramazan Rajabov, & Aaron Sonnenschein. 1996. The main features of
Tsez phonetics. UCLA Working Papers in Phonetics 93.
Chapter 4
İryna M. Dryga
I would like to begin by thanking Ardahan University and Turk Dil Kurumu
for inviting me to speak here today. It’s a real pleasure to have this opportunity
to share my views on documentation and revitalization with you. I represent
A. Krymsky Institute of Oriental Studies, National Academy of Sciences of
Ukraine and at the same time National Network organized for the revitalization
of the Turkic minority languages in our country.
This initiative of combined efforts of professional linguists, politicians and
legislators was demonstrated not at once. It all started years ago with the rare
revitalisation attempts and efforts.
All of the Turkic languages in Ukraine (Crimean Tatar, Gagauz, Karaim,
Krymchak, Urum) are endangered, notably they are at risk of extinction in
the short or in the long terms. According to UNESCO’s “Atlas of Endangered
Languages” [http://www.unesco.org/languages-atlas/index.php] the degree of
their vitality varies from definitely endangered (Gagauz) to extinct (Karaim)
[Czató 2010]. Even the most numerous and demographically sizeable Crimean
Tatar population is considered linguistically endangered because the language
is spoken mainly by the representatives of the older generation.
The assessment of this language in the Atlas of Endangered Languages has
long appeared to be too optimistic and it is necessary to revise it on the basis
of a more profound analysis.
First, for the last several years we have spared no effort to collect and
preserve the remains of ‘small’ or ‘insular’ Turkic languages in our country
[Dryga 2010: 195-200, 220-233, 355-362, 406-419]. In 2006-2007 we conducted
a field study in Ukraine and Lithuania in cooperation with the Altaic Society
of Korea with a goal to revive, preserve and study the languages and, if pos-
sible, cultures of numerically small Turkic language speaking minorities such
as Karay, Qrymchaq and Urum, residing mainly in rural localities spread
over the multilingual regions of the Crimea, Trakai and Azov and having no
native language education [Author’s recordings 2006]. As a result Korean and
Ukrainian linguists have collected an invaluable language material given that
there was only one speaker of Qrymchak remaining alive, about ten speakers
of Qypchaq – Polovets dialects of Urum, eight speakers of the Crimean dialect
and two speakers of Halych-Volyn’ dialect of Karaim as of 2008 [Altaic Society
of Korea, 2006: 198-203; http://altaireal.snu.ac.kr/askreal_v25/photogroup
.html]. The age of all the speakers was over 60, and mostly 80 or more.
Second, Lenara Kubedinova and Radovan Garabik from Slovakia began to
develop the corpus of Crimean Tatar Wikipedia as a collaborative free con-
tent internet encyclopedia [Yavorska, Dryga 2015: 130-136]. Natural language
processing often turns to Wikipedia for the source of material, because it
exhibits the content, represents contemporary living language and is a valu-
able source of texts, for example for Linguistic corpus of Crimean Tatar lan-
guage. For Crimean Tatar it became a point of grassroot prestige, and a major
source of accessible online texts in the language. For comparison, no other
Turkic language endangered in Ukraine has a Wikipedia corpus. Crimean Tatar
Wikipedia (in Latin script) officially started on 12 January 2008, though the first
pilot version dates from September 2006. It contains about 4000 articles, rank-
ing 164th by the number of articles.
Moreover, Miquel Cabal-Guarro on the basis of the data drawn from the
sociolinguistic survey that he conducted in 2011 amongst Crimean Tatars
across the peninsula of Crimea, provided an analysis of the language uses and
transmission [http://ru.krymr.com/a/25467619.html]. Although the Crimean
Tatar language was either seldom or never spoken, especially amongst
the individuals of the younger generation (that tend to use only or mostly
Russian in their everyday communication, even with their relatives), it is
still one of the main identification elements of the Crimean Tatar ethnicity
and nearly in all cases claimed to be the identity language of the respond-
ents, almost always even their declared native language. Miquel Cabal-
Guarro also tried to elucidate the degree of endangerment of the Crimean
Tatar language.
Tudora Arnaut organized 3 international symposiums in Kyiv in 2005-2010
[Arnaut 2005, 2010, 2013] on the problems of the Turkic speaking peoples,
and it became our initial experience of public collaborative discussions on
the endangered Turkic peoples’ linguistic problems. Quite important deci-
sions taken by the symposiums were quite commendable; they were trans-
ferred to the relevant governmental departments and commissions but had
no effect [Arnaut 2007]. Moreover, the new language law offered in 2010 by
ENDANGERED TURKIC LANGUAGES IN UKRAINE 53
1 http://oriental-studies.org.ua/uk/перша-міжнародна-конференція-мови-п/.
54 Dryga
of extinction and dormant languages. It was also told us about the initiatives
of Tatar activists in Poland on revitalization of their native language [Yavorska,
Dryga 2015: 10-11].
And the last lesson taught us by the recent events and finally by The lan-
guage law is that the definition of a particular language endangerment must
clearly display the language status in synchrony and diachrony. The fact that
this definition is not clearly established created the situation when in certain
countries the vigorous languages like German and Russian receive the same
protection level as languages on the verge of extinction.2
Conclusions based on reports presented and discussions conducted on
the topic:
– the need for digitizing texts including old and new oral ones while
native speakers of dialects are still alive;
– the need for both bi-lingual and explanatory dictionaries;
– the need for oral accounts to be collected while there is still something
to collect;
– the need for translation as an instrument of enhancing the corpus of
modern texts, sustaining and developing language.
8. UNESCO’s Atlas of World’s Endangered Languages still has inadequate
characteristics for a number of Turkic languages. The cause is the
lack of sources. Consequently, the situation demands a professional
re-assessment of the current status of every of Turkic languages.
Our research Institute and our organization are ready to enter as a partner
organization into every kind of international research projects, groups, and
to join international language corpora and databases with our own language
databases and accomplishments, etc.
We believe that professional discussion on endangered languages’ preser-
vation issues will contribute to the promotion of cultural diversity as a value,
of tolerance towards different ethnic groups and people living in Ukraine.
This would ensure the dialog and the understanding between Crimean Tatars,
Ukrainian- and Russian-speaking population of the Crimea. It will also create
additional opportunities for Crimean Tatars and other Turkic-speaking peo-
ples of Ukraine to take finally, though may be too late, a worthy place in the
social and cultural life of the country. It will also help to overcome the political
tension around the language issue in Ukraine.
Primary Sources
Secondary Sources
Ajniuk, Bohdan (ed.), 2012. Ekologia movy i movna polityka v suchasnomu suspilstvi
(zbirnyk naukovyh prats).—Кyiv: Vydavnychyj dim Dmytra Burago, 376 p.
ENDANGERED TURKIC LANGUAGES IN UKRAINE 57
Altaic Society of Korea, 2006. Fieldwork Studies of Endangered Altaic Languages. For
the Genealogical Study of Korean and the Preservation of Endangered Languages.The
Language and Cultural Studies Series 2. Seoul: Altaic Society of Korea.
Arnaut, Fedora and Dermenci, Ömer (eds.), 2005. Kruhlyi stil: Tiurkomovni narody
Ukrajiny (movy ta kultury tatar, gagauziv, urumiv, karajimiv i krymchakiv). Kyiv:
Vydavnychyj dim Dmytra Burago, 112 p.
Arnaut, Tudora and Nasrattinoglu Irfan (eds.), 2010. Materialy druhoho Vseukrajinskoho
kruhloho stolu “Problemy osvity tiurkomovnyh narodiv Ukrajiny: gagauziv, urumiv,
karajimiv, krymchakiv i krymskyh tatar” / II.Ukrayna’daki Türkçe Konuşan Halklar
(Gagauz, Urum, Karay, Kırımçak ve Kırım Tatarları’nın eğitim sorunları) Paneli. Kyiv:
Vydavnycho—polihrafichnyj tsentr “Kyivskyj universytet”, 257 p.
Arnaut, Tudora and Nasrattinoglu Irfan (eds.), 2013. Materialy tretioho Mizhnarodnoho
sympoziumu “Tiurkomovni narody Ukrajiny” / III.Uluslararası Ukrayna’da Türkçe
Konuşan Halklar Sempozyumu. Кyiv: Vydavnycho—polihrafichnyj tsentr “Kyivskyj
universytet”, 335 p.
Arnaut, Tudora, 2007. Ukrayna Gagavuzların Arasında Ana Dilinin Gelişmesi, In:
Gagavuz Türkçesi Araştırmaları 27-29 Aralık 2007 Bilgi Şöleni. Ankara: Türk Dil
Kurumu Yayınları, p. 52.
Czató, Èva Á., 2010. ‘Report on an Uppsala workshop on Karaim studies’, In: Johanson,
Lars and Csató, Éva Á. (eds.), Turkic languages, Wiesbaden: Routledge.
Dryga Iryna, 2010. Pontika: Türkoloji yazıları. Тюркологічні студії. Кyiv: Chetverta
hvylia, 530 p.
Yavorska, Halyna and Bogomolov, Alexander, 2010. Nepevny object bazhannia: Jevropa v
ukrajins’komu politychnomu dyskursi. Кyiv: Vydavnychyj dim Dmytra Burago, 136 p.
Yavorska, Halyna, Dryga, Iryna (eds.), 2016 Zahrozheni movy. Krymskotatars’ka ta
inshi tiurkski movy v Ukrajini: zbirnyk naukovyh prats’. 1st International Conference
Endangered languages: Crimean Tatar and other Turkic languages in Ukraine
(26-27th September 2014). Proceedings. Kyiv: Aksioma Medobory, 344 p.
Internet Resources
http://altaireal.snu.ac.kr/askreal_v25/photogroup.html
http://oriental-studies.org.ua/index.php?news=5379.
http://ru.krymr.com/a/25467619.html
http://sonseslerduyulmadan.hacettepe.edu.tr
http://www.turkiyat.hacettepe.edu.tr/kitap/tehlikedekidillerbildirileri_090413.pdf
http://www.unesco.org/languages-atlas/index.php
https://en.wikipedia.org/wiki/Language_policy_in_Ukraine
58
Appendix
Dryga
Diana Forker
1 Introduction
The Tsezic languages are a group of closely related languages that form one
subbranch within the Nakh-Daghestanian (or East-Caucasian) language
family. They can be divided into East Tsezic, comprising Hunzib and Bezhta,
and West Tsezic, comprising Khwarshi, Tsez and Hinuq. Tsezic languages are
spoken in the Republic of Daghestan, which belongs to the Russian Federation.
Daghestan is located in the north-eastern part of the Caucasus. Smaller groups
of Tsez and Bezhta speakers also live in Turkey, and some Bezhta speakers live
in Georgia. The largest Tsezic language is Tsez with about 12,000 speakers
(according to the Russian census of 2010); the smallest language is Hinuq with
around 600 speakers.
Case assignment in the Tsezic languages is largely semantically moti-
vated, and morphosyntactic features play only a marginal role (cf. Kibrik
1997). Due to the dominant role of semantics in the assignment of case it
seems that it is relatively simple to extend the case inventory. That is, suffixes
and enclitics with an autonomous distinguished form paired with a clear-
cut meaning can, in principle, develop into cases. This seems to explain the
origin of many spatial cases in Tsezic that most probably go back to spatial
postpositions.
The aim of this paper is to explore a number of nominal markers that
resemble cases and compare them with genuine case markers with respect
to functional and formal similarities and differences. I will adopt the canoni-
cal approach as exemplified by Corbett (2008) for the feature of case. Corbett
(2008) provides ten criteria for canonical case markers and examines the
Russian cases in regard to the criteria. I will use these criteria for investigating
whether the respective nominal markers from the Tsezic languages could be
analyzed as case markers.
Inflectional categories of nouns in Tsezic languages are number and case. The
languages have elaborate gender systems with up to five genders, but nouns
are normally not inflected for gender. Inflection is almost exclusively suffixing
and largely agglutinative. For the nominal inflection this means that number
and case are expressed by different suffixes.
Tsezic languages have a rich case inventory with a large number of spatial
cases. Non-spatial cases found in all Tsezic languages are absolutive, ergative,
genitive, and instrumental. All cases except for the absolutive are expressed by
suffixes that are added to the so-called ‘oblique stem’. By contrast, the form that
the noun takes in the absolutive case is called ‘direct stem’. The same distinc-
tion between direct and oblique stem is found with all sorts of pronouns, and
partially also with adjectives and participles.
Case formation is straightforward and regular. The main difficulty in the
nominal morphology of the Tsezic languages is the formation of the oblique
stem from the direct stem. There are several operations that are used in order
to form the oblique stem: insertion of vowels, consonants or glides, ablaut,
stress shift, conversion, and suffixation of oblique stem markers (Forker 2010a).
The last process, the suffixation of an oblique stem marker, is the most fre-
quent and the only productive way of forming oblique stems. The number
of oblique markers differs from language to language. In Khwarshi, there are
only six markers whereas Hunzib has more than twenty. In all languages,
the number of markers is much lower for plural nouns than it is for nouns
in the singular. Table 1 illustrates the formation of oblique singular stems to
which the genitive suffix has been added.
There is some variation in the stem formation systems. This means that often
nouns have more than one way of forming the oblique stem; they follow a
rare and unproductive pattern but can also be used with a common oblique
marker. Moreover, with spatial cases the case suffix is sometimes attached to
the unmodified noun. Depending on the analysis we can either say that in such
examples the spatial cases are suffixed to the direct stem or that we deal with
conversion.
It is plausible to assume that Proto-Tsezic had a case marker *-ɣo that is still
found in Bezhta and Khwarshi, but has almost been lost in Hinuq and is absent
from Tsez and Hunzib (see Forker 2010b, 2012a for more information on the
spatial case systems of Tsezic languages). In Bezhta, the spatial suffix -ɣa has
the meaning ‘next to, by, at, near’ (1a) (Kibrik & Testelec 2004: 236). It is also
used for the standard of comparison, and occasionally with temporal and
metaphorical meaning. The Khwarshi suffix -ɣo translates as ‘in close contact
with, nearby’ (1b) (Khalilova 2009: 82). In both languages, further directional
cases (lative, ablative, etc.) can be added to the suffix as the Khwarshi example
illustrates.
(1a) Bezhta
buxari-ya-ɣa gäʔä sukʼo=na
flue-obl-next be.neg who=add
‘There is nobody next to the flue.’
In Hinuq, the situation is different because -ɣo is poorly integrated and only
occurs in very few spatial and temporal adverbs (Forker 2013: 103). The spatial
adverbs č’ek’k’uzaɣo ‘everywhere’ and seda-ɣo ‘in one place’ are formed from
quantifiers. The temporal adverbs are sebedoɣo ‘in autumn’ (< sebe ‘autumn’)
and aldoɣo ‘formerly, before, in front’. As can be seen in the spatial adverbs the
suffix is added to the oblique stem, i.e., seda is the oblique form of the numeral
Cases-Non-cases: At the Margins of the Tsezic Case System 63
hes ‘one’ and -za is an oblique plural stem suffix that has been attached to the
quantifier č’ek’k’u ‘all’.
Hinuq and Khwarshi are the only Tsezic Nakh-Daghestanian languages that
have a vocative suffix -(i)yu. In Hinuq, it is only used with two common nouns
(uži-yu ‘boy-voc’, ked-bi-ža-yu ‘girl-pl-obl.pl-voc’), and it also expresses
affectionateness and endearment (Forker 2013: 433-434). In Khwarshi, it seems
that it is more widely used because it can occur with various types of common
nouns, e.g. cana-yu ‘she.goat.obl-voc’ (Khalilova 2009: 72-73). Neither in
Hinuq nor in Khwarshi can the vocative be suffixed to proper names. In both
languages the vocative is added to the oblique stem of the noun.
i. professions and other social functions (‘as a teacher’, ‘as a friend’, etc.)
(2) Hinuq
hago zoq’ʷe-n toxtor-ɬun
he be-uwpst doctor-func
‘He was a doctor.’
In other Tsezic languages, cognates of the Hinuq suffix are attested (with the
exception of Hunzib for which the relevant information is lacking). It seems
that in all Tsezic languages the functions i and partially ii are found. The
functive suffixes can be added to nouns bearing case suffixes and if they are
added to items without overt case suffixes they are attached to the direct stem
(not to the oblique stem). In Khwarshi, the suffix is -ɬun / -ɬin (Khalilova 2009:
257-258); in Tsez it is -ɬun (5a). In Bezhta, the functive suffix -ɬun is also used
with participles of verbs in a construction with the meaning ‘as if’, e.g. when
people are pretending to act in a certain way (5b). Furthermore, all Tsezic
languages including Hunzib have a postposition sababɬun consisting of the
Avar noun sabab ‘cause, reason’ and the functive suffix. The complex word
sababɬun is also used in Avar, but it is not regarded as a postposition in the
recent Avar grammar by Alekseev et al. (2014).
(5b) Bezhta
y-egay-ʔeš-ɬun=na ∅-aq-na giɣa
ii-see-ptcp.pst.neg-func=add i-happen-cvb down
ẽxe-ɣa=na gowacʼo-na ∅-ẽƛʼe-š Tʼahir
river-next=add look.i-cvb i-go-prs Tahir(i)
‘Pretending not to see (her) and looking at the river Tahir is walking by.’
Cases-Non-cases: At the Margins of the Tsezic Case System 65
Avar has the suffix -ɬun that is used in the same or at least in a very similar
manner as the functive suffixes of the Tsezic languages (6). The suffix is not
explicitly mentioned in the grammars by Charachidzé (1981) and Alekseev
et al. (2014). Ebeling (1966: 72), following an older grammar of Avar written in
Georgian, writes that it “forms the ‘predicative’, meaning ‘as, in the quality of’
(see Creissels 2014b: 442 for further references).
In most of the Andic languages functive suffixes are attested as well, and the
meaning corresponds to the meaning of the cognate suffixes in Avar and
Tsezic. In Karata, Godoberi, Tindi and Botlikh the suffixes are formally very
similar and in one case even identical to the Avar and Tsezic suffixes:
– Karata -ɬe (Creissels 2014b: 444, citing Magomedova & Xalidova 2001)
– Godoberi -ɬu (Saidova 2006: 226)
– Tindi -ɬo (Magomedova 2003)
– Botlikh -ɬun (Creissels 2014b: 444, citing Saidova & Abusov 2012: 109)
In the languages Northern Akhvakh and Bagvalal, which also belong to the
Andic branch of Nakh-Daghestanian, the functive suffix is formally slightly
66 Forker
The Bagvalal suffix -l(h)i is analyzed by Daniel et al. (2001: 191-193) as a verb-
forming suffix that attaches to nouns, adjectives, adverbs, and postpositions.
Like in Akhvakh it is followed by a gender agreement marker, and in the
available examples also by a converbal suffix (9a, b).
(9a) Bagvalal
gurǯija-j hak’uj-li-j-o j-ah-aː
Georgian-f wife-vblz-f-cvb f-take-pot.inf
‘to take a Georgian (woman) as wife’ (Daniel et al. 2001: 193)
1 There is a verb ɬeze ‘put’, but I am not sure about its relation to the functive suffix.
Cases-Non-cases: At the Margins of the Tsezic Case System 67
3. nouns bearing the functive suffix can take modifiers (e.g. genitive modi-
fiers or adjectives) (8), (10)
4. nouns with the functive suffix serve not only as adjuncts but also as
predicative arguments of verbs such as ‘be’, ‘become’, ‘consider’, etc. (2),
(5a), (10)
However, despite the functional similarity to cases such as the Uralic essive,
I want to argue that it does not seem the best solution to treat the functive as
belonging to the case systems of Tsezic, Avar or Andic. I will concentrate in my
argumentation on the functive in Hinuq because this is the language for which
I have most data, but my first and my second argument equally apply to the
other Tsezic languages.
Why should the Hinuq functive not be considered a case marker? How does
it diverge from the other case suffixes? First of all, it can be added to other
case markers (4). Second, it is suffixed to the direct form of the nominal or
nominalized adjective or verb. For instance, if the resultative participle takes
case suffixes, it must be in its oblique form -za, but, as (3b) illustrates, this
is not the case for the functive. Third, nouns to which the functive is added
do not trigger the use of the second genitive (10). These three properties
are not attested for the case suffixes. Case suffixes are not added to other
case suffixes (though there are complex spatial cases, but they always have as
the second suffix a directional marker). Case suffixes are added to the oblique
stem (though for some nominals oblique and direct stem are identical). All
nominals inflected for any case other than the absolutive trigger the use of
the second genitive (10b). The first genitive is only available for nouns in the
absolutive case.
(10a) Hinuq
hago di ∅-egennu essu-ɬun ∅-iči-ƛo!
he 1sg.gen1 i-young sibling(i)-func i-be-opt
‘May he be my younger brother!’
The similarity of the Hinuq suffix -ɬun with its Avar source suggests that it must
be a more recent borrowing and considerably younger than the verb-deriving
suffix -ɬ. The derivational suffix is used with a wide range of base words
including many native Hinuq words and is rather deeply integrated into the
linguistic system. The derived verbs behave just like simple verbs. Therefore,
Cases-Non-cases: At the Margins of the Tsezic Case System 69
Hinuq -ɬun and -ɬ cannot directly be traced back to the same origin, even if this
is probably the case for the Avar equivalents.
The last case study examines the suffix -ɬi and its use in the Tsezic languages.
This suffix has again been borrowed from Avar. Its functions in Tsezic largely
overlap with its functions in Avar, but there are also some differences between
Avar -ɬi and Tsezic -ɬi as well as among the Tsezic languages. The functions are:
(11a) Bezhta
[do hoƛoʔ biƛo-ʔ ∅-eče-yo-ɬi] ∅-iqʼe-da . . .
1sg here house-in i-be-ptcp-abst i-know-cond
‘if they knew that I lived here in the house . . .’
(12b) Hinuq
dižo tʼek-mo-za-ɬi r-egi roži eƛi-š
1sg.gen2 book-obl-obl.pl-abst v-good word(v) say-pst
batʼi-batʼiyaw poʔet-za-y=no, ʡalim-za-y=no
various poet-obl.pl-erg=add intellectual-obl.pl-erg=add
‘Various poets and intellectuals said good things about my books.’
iv. Marking X in phrases like ‘X turns into Y’ in Hinuq (13), and apparently
also in Tsez (in the latter language this is done in combination with the
spr-Lative)
(13) Hinuq
haɬu ked-i zon-ɬi b-uː-ho arxi,
that.obl girl-erg refl.obl-abst iii-make-prs ditch(iii)
gulu-za-ɬi r-uː-ho ɬe
horse-obl.pl-abst v-make-prs water(v)
‘The girl turns herself into a ditch and the horses into water.’
(14) Hinuq
xexza-ɬi ʡolo eli aƛ-a-do nox-iš
child.obl.pl-abst because.of 1pl village-in-dir come-pst
‘Because of the children we came to the village.’
Most of the usages are functionally related because they involve the formation
and the use of abstract nominals, either as genuine parts of the nominal lexicon,
or in a more abstract sense. For example, complements can be analyzed as
nominalized propositions that occur in a position where otherwise nouns
occur. The topic of a conversation or narration is an abstract object that can be
expressed in the form of a nominalized proposition or other item.
Once again we ask the question whether the suffix -ɬi can be analyzed as a
case suffix. If we restrict ourselves to Hinuq, which seems to be the language
with the broadest range of use, the functions iii-v are functions that are typically
Cases-Non-cases: At the Margins of the Tsezic Case System 71
In order to explore the borderline cases of the Tsezic case systems I adopt the
canonical approach as developed by Corbett and his colleagues (see Corbett
(2005, 2008) and Brown, Chumakina & Corbett (2013) among others). The basic
idea of Canonical Typology is that when investigating a linguistic phenomenon
we should start by modelling a canonical item that represents an idealized
instance of the examined phenomenon. The properties of the canonical item
will help us to classify specific examples with respect to their similarity to
the canonical item. In the following, I take Corbett’s (2008) analysis of case
and apply it to the four linguistic items that have been treated in Sections 4-7
thereby concentrating on the Hinuq data. In other words, I will clarify to what
extend the following suffixes can be analyzed as belonging to the case system
of Hinuq: the spatial suffix -ɣo, the vocative -(i)yu, the functive -ɬun, and the
abstract suffix -ɬi.
2 Function vi, i.e. the formation of verb forms employed in adverbial clauses, is also attested
with some spatial cases in Hinuq, in particular the spr-essive (Forker 2013: 240-241; 255-257).
72 Forker
3 There are even more semantic restrictions with respect to the spatial cases. For example, cer-
tain place names and microtoponyms can be only inflected for directional cases but not for
the spatial cases expressing locations because of the inherent locational semantics of those
nominals. For nouns with animate referents it is difficult if not impossible to find a context
of use for spatial cases such as the in-directional. But these restrictions are purely semantic
and do not concern the morphology or the syntax.
74 Forker
Criteria 8 & 9: Canonical use of morphosyntactic features and their values does
not admit additional lexical conditions from the target (governee) or from the
controller (governor). The controller has a single requirement (e.g. it governs the
dative).
There are (almost) no special lexical conditions on the side of targets, e.g. all
nouns take the same case if the same meaning needs to be expressed. There
are occasional cases of idiosyncrasy (e.g. the noun zoro ‘barn’ is inflected for
the sub-essive that normally means ‘under’ to express the meaning ‘in the
barn’), but they perhaps have a semantic explanation. From the side of the
controllers there are also (almost) no lexical conditions, e.g. all transitive verbs
require the agent to be marked with the ergative, and all affective verbs require
the experiencer to be marked with the dative. However, the affective verb -aši-
‘find’ additionally allows for the at-essive to occur on the experiencer and this
might be considered as an additional construction, but again it is a variation
between a grammatical case and a spatial case.
Criterion 10: The use of canonical morphosyntactic features and their values
is sufficient (they are independent).
This criterion is fully met in Hinuq because the non-spatial cases do not permit
additional markers such as postpositions.4
4 The spatial cases allow them, but the use of postpositions with spatial cases is mostly
optional.
Cases-Non-cases: At the Margins of the Tsezic Case System 75
The ergative behaves relatively canonical except for its partial formal overlap
with the absolutive and the in-essive and its alternation with the absolutive
and the at-essive in non-canonical agent constructions (indicated by * in
Table 3). The dative is even more canonical since it has a unique ending. Again
the only instances that might count as non-canonical behavior are alternations
with certain spatial cases. The vocative is the least canonical nominal marker.
Its use is not obligatory because the absolutive can be used instead, and it
occurs only with a very limited number of nouns. However, it is suffixed to
the oblique stem. With respect to the criteria 1-10, the functive is as canonical
as the ergative or the dative. The only non-canonical feature is found with
respect to criterion 5 because the functive can occasionally be replaced with
the absolutive, e.g. compare (15) with example (2) above.
(15) Hinuq
Abdukarim ħaži Buynaksk šahar-mo-s imam goɬ
Abdukarim Gadzhi Buynaksk town-obl-gen1 imam be
‘Abdukarim Gadzhi is the imam of the town of Buynaksk.’
Thus, Creissels’ (2014b: 442-443) critique of excluding the functive from the
case inventory as it is done in all grammars of Tsezic languages (as well as in
the grammars of Avar and Andic languages) is surely justified. Yet it does not
confirm to all other cases when it comes to the specific property of requiring
nominals to occur with the oblique stems and modifiers to appear in their
76 Forker
8 Conclusion
In this paper, I investigated borderline cases of the Tsezic case systems that
resemble core cases, but also deviate from them in various ways. This was
done by applying the canonical approach as illustrated by Corbett’s (2008) ten
criteria for canonical case markers. The examined nominal markers can be
ordered along a scale of canonicity from being more canonical (functive, -ɬi) to
less canonical (vocative).
The Tsezic case systems are far from being homogenous because of the divi-
sion into non-spatial and spatial cases with concomitant differences in their
formal and functional properties. They lose members (e.g. the spatial suffix -ɣo
that was described in Section 4, certain spatial case combinations seem to have
fallen out of use in Bezhta, see Forker 2012a, b). But that they also gain new
members. The functive and the abstract suffix -ɬi are both borrowings from
Avar that are functional equivalents of cases in other languages, and the latter
suffix in Hinuq also shares some formal properties with the core case suffixes.
Abbreviations
i-v: genders i-v, abst: abstract suffix, add: additive, adv: adverbial, apud:
apud-essive, at: location ‘at, by, near’, conc: concessive, cond: conditional,
cont: cont-essive, cop: copula, cvb: converb, dat: dative, dir: directional,
dist: distal, erg: ergative, f: feminine, func: functive, gen: genitive, imp:
imperative, in: in-essive, inf: infinitive, ipfv: imperfective, irr: irrealis,
lat: lative, loc: locative, n: neuter, neg: negation, next: location ‘next to’,
obl: oblique stem, opt: optative, pf: perfective, pl: plural, pot: potential,
prs: present, prt: particle, pst: past, ptcp: participle, q: question marker,
quot: quotative, res: resultative, sg: singular, spr: location ‘on’, uwpst:
unwitnessed past, vblz: verbalizer
Cases-Non-cases: At the Margins of the Tsezic Case System 77
References
Victor A. Friedman
1 Introduction
1 On the situation in the Caucasus, see Friedman (2010). There is no literature to speak of
on language endangerment in the Balkans beyond the third edition of the UNESCO atlas
(Mosley 2010).
2 A clear example of this instability occurs in Webster’s Geographical Dictionary, where the
political border but the northern borders of Iran and Turkey define the southern border of
Europe in the entry for Asia, but the Caucasian ridge defines the boundary between Europe
and Asia in the entry for Europe (Bethel 1949: 74, 347).
Among the language ecological commonalities between the Balkans and the
Caucasus in the twentieth century we can identify are the effects of forced
migration (including genocide), colonization, marginalization, economic
transformation (including urbanization, economic migration, and shifts
in traditional market patterns), and the rise of new polities.3 One way of
combating endangerment is to be found in literacy practices, but this is not
in and of itself a simple or sufficient solution. It is now well understood that
language preservation cannot be divorced from social context (e.g., Mülhäusler
2002). At the same time, however, in the context of symbolic capital in societies
that are already deeply embedded in modern nation-states, literacy becomes
important (cf. Friedman 2012a).
In order to set this discussion in the present context, a look at history and
the concept of indigeneity is useful. For most of the world’s endangered lan-
guages, indigeneity is part of both the discourse involved in the exercising of
rights including, among others, language rights.
3 The literature on these phenomena is too vast to cite, but among the relevant works dealing
with events that are less well known we can note particularly Ladas (1932), Üngör (2011), and
Richmond (2013).
Language Endangerment in the Balkans 81
3 Imperial Contexts
Turning for a moment to the competing empires, and beginning with the
early modern period, it was the Ottoman, Safavid, and Russian empires that
impacted the Caucasus and the Ottoman, Russian, and Austro-Hungarian that
impacted the Balkans. On a scale of linguicidal practices, Muslim empires
were, at least until the early twentieth century, relatively benign, the Austro-
Hungarian Empire had competing destructive and protective tendencies
(cf. Gal 2015), while the Russian Empire has been linguicidal (cf. Wixman 1980).
In both the Balkans and the Caucasus, ideologies of totalizing, mono-ethnic
nation-states that had their origins in Western Europe worked to destroy
traditional multilingualisms, albeit through different mechanisms, and with
different results in different locales.
82 Friedman
Among the Balkan languages, all of those without titular nation-states suffer
from some degree of endangerment ranging from moribund to ecological
fragility.
4.1 Judezmo
Of the moribund languages, the worst affected is Judezmo, the majority of
whose speakers were murdered by the Nazis and their collaborators during
World War Two. Unlike Yiddish, which has seen a genuine revival in recent
years owing in large part to the spread of Lubavitcher Hasidism, now known as
Chabad, Judezmo is retreating. In Macedonia, by 2008 there were between five
and ten fluent speakers left, all survivors of World War Two. Their numbers have
declined since then, although there is still a handful of semi-speakers among
their children, prospects for the future are non-existent as of this writing. The
situation is similar albeit not as dire, in places such as Sofia, Salonika, Sarajevo,
and Bucharest, as well as Istanbul, although there are some younger activists
in Istanbul. The communities in the USA, Canada, and Israel are, for the most
part, all members of older generations. In the Caucasus, several hundred Juhuro
speakers were killed by Nazi occupiers, but for the most part the community
survived, although today its dispersal in Israel, the US, and Russia leave the
language endangered in Daghestan and Azerbaijan (cf. Altschuler 1990). We can
mention here in passing both Judeo-Greek (Romaniote) and Judeo-Georgian
(Q’ivruli), which are basically ethnolects of their respective languages. In both
cases, the number of speakers is dwindling owing to the Holocaust in the case
of the former and migration to Israel in the case of the latter.
4.2 Meglenoromanian
The other most highly endangered language in the Balkans is Meglenoromanian,
which was originally limited to a cluster of villages on and near Mt. Pajak, in a
region that is today divided between the Hellenic Republic and the Republic
of Macedonia. By the time this territory was ceded by Turkey to Serbia and
Greece in 1913, it was limited to eleven villages—three ended up on the Serbian
side, later the Republic of Macedonia, and eight were on the Greek side. Of
the eight that were ceded to Greece, however, the largest, Nãte/Nãnte, was
Muslim, and so all its inhabitants were sent to Turkey in the 1923 exchange of
populations mandated by the Treaty of Lausanne. There they disappeared from
view until being rediscovered in Turkish Thrace, independently by Turkish
and by Austrian and Turkish ethnologists working in the region (Kahl 2006,
Kurtişoğlu and Kurtişoğlu 2012). But the modern Nãntintsi preserve only a few
Language Endangerment in the Balkans 83
Meglenoromanian songs and dances and the memory of where their forebears
came from. Meanwhile, the language has gone extinct in two other villages
(Atanasov 1990, 2002)—one on each side of the border—leaving a total of
seven villages. Prior to the Balkan Wars, Macedonian was the primary contact
language for all Meglenoromanian villages, but on the Greek side Macedonian
was outlawed in the 1930s and practically all the Macedonian-speaking villages
in the Meglen were emptied and/or destroyed at the end of the 1940s. Thus, at
present Meglenoromanian is assimilating to Greek in Greece but Macedonian
in Macedonia. The future of Meglenoromanian is particularly grim since 1) in
Romania it is considered a dialect of Romanian (Rusu 1984), 2) in Greece all
citizens of Greece are Greeks and minority languages are actively discouraged
(Friedman 2012a), and 3) in Macedonia there is no support for Meglenoromanian
maintenance and the speakers are counted with Aromanians (Friedman 2001).
4.3 Aromanian
Aromanian is in a distinctly different situation from Meglenoromanian in that
there are till hundreds of thousands of speakers. They live in northern Greece,
southern Albania, the Republic of Macedonia, and southwestern Bulgaria.
Thousands also emigrated to Romania in the wake of the Balkan Wars and
World War One. In Romania, Aromanian immigrants were settled mostly
around Tulcea in Dobrudja, which at the time was the least Romanian and
most ethnically mixed region of Romania and had only recently been assigned
to Romania. The situation can be compared to the settlement of Pontic and
Cappadocian Greeks in northern Greece, i.e., Greek Macedonia, after the 1923
exchange of populations.4 More or less officially, however, Romania considers
Aromanian to be a dialect of Romanian and it thus receives no support aside
from the occasional folklore production. In the countries where Aromanian
originated, it is only in the Republic of Macedonia, which has one of the
smallest number of Aromanian speakers, that the language has any sort of
official recognition and support. In Greece, the language is mostly limited to
speakers over 40, although in some more isolated villages, as elsewhere in the
original Aromanian territory, the language is still being passed on to children
(cf. Bara 2005). Despite official support for Aromanian in the Republic of
Macedonia, however, numbers of speakers are declining, and the language
remains vital mostly in isolated villages. There is also a significant Aromanian
diaspora in Western Europe and North America, but Constantin Belamaci’s
(1888, cited in Balamaci 1987) rage remains a forlorn call, and Hadzhi Daniil’s
(1802, reproduced in Ninčev 1977) poem continues as Greek policy.5
4.4 Romani
Of the remaining stateless languages, Romani has some speakers in the
Caucasus, but, perhaps most significantly, its apparent closest relatives—
Lomavren and Domari—are also spoken in the Caucasus (Marushiakova and
Popov 2014). This region is thus unique in having—or at least having had—
representatives of all three of these diasporic Indic languages, of which Romani
took its definite shape in the Byzantine Empire, Domari (also called Karachi,
Garbet, Zutt, Nawar, etc.) in the Middle East, and Lomavren (also called Bosha,
5 Constantin Belamaci was the author of an Aromanian poem entitled Parinteascã dimandare
‘Paternal commandment’ that became the Aromanian anthem. The verse cited here gives a
sense of the content:
Cari-shi alasã limba a lui For whomever leaves his language:
S’lu ardã pira focului Let him be burned by flame
Si s-dirinã yiu pri loc Let him be destroyed alive where he stands
Si lli si frigã limba n-foc. Let his tongue be burned in flame.
In contrast, Hadži Daniil of Moschopole (modern Voskopoja, Albania) was a Hellenized
Aromanian who published a quadrilingual phrasebook the express purpose of which was
the extinction of Balkan languages other than Greek, as seen in the opening lines of his intro-
duction (reproduced in Ninčev 1977:83, given here in original and transliterated Greek and
in Wace and Thompson’s (1913:6) English verse translation.
Ἀλβανοὶ, Βλάχοι, Βούλγαροι, Ἀλλόγλωσσοι χαρῆτε,
Κ᾽ἑτοιμασθῆτε ὅλοι σας Ῥωμαῖοι νὰ γενῆτε.
Βαρβαρικὴν ἀφήνοντες γλῶσσαν, φωνὴν καὶ ἢθη,
Ὁποῦ στοὺς Ἀπογόνους σας νὰ φαίνωνται σὰν μῦθοι.
Albanoi, Vlakhoi, Voulgaroi, Alloglōssoi kharēte,
K’etoimasthēte oloi sas Rōmaioi na genēte.
Varvarikēn aphēnontes glōssan, fōnēn kai ēthē,
Opou stous Apogonous sas na phainōntai san mēthoi.
Albanians, Bulgars, Vlachs and all who now do speak
An alien tongue rejoice, prepare to make you Greek,
Change your barbaric tongue, your customs rude forego,
So that as byegone myths your children may them know.
Language Endangerment in the Balkans 85
6 The only datum we have is Besirov and Tlebzu (1981), who mention the replacement of the
ergative by the instrumental. We can speculate that this is a result of Serbian influence.
Kănčev (1900: 116, 178, 215) gives locations and statistics for Circassians in Macedonia, but
their dialect is completely lost to us.
86 Friedman
completed 1083), when the Georgian province of Tao and Bulgaria were both
part of the Byzantine Empire. Two inscriptions and old copies (thirteenth cen-
tury) of the monastery’s tipikon survive (Šanidze 1971).
7 In linguistic terms, Rhodopian dialects are very distinctive, albeit part of the Balkan Slavic
dialect continuum that includes Bulgarian, Macedonian, Goran, Torlak, etc. The singling out
of Torlak as a separate language in Mosley (2010), while understandable in the context of the
former Serbo-Croatian, is more problematic in the context of Balkan Slavic.
Language Endangerment in the Balkans 87
5 Conclusion
Acknowledgments
This article draws on more than forty years of my own field work in the Balkans
and the Caucasus. I wish to acknowledge support from fellowships from the
following US granting agencies: American Council of Learned Societies for
a Fellowship in East European Studies (1986) and (2000-01) financed in part
by the National Endowment for the Humanities and the Ford Foundation,
International Research and Exchanges Board for travel grants to Macedonia
in 1991and 1992, National Endowment for the Humanities (2001, Reference
FA-36517-01), and Fulbright-Hays Post-Doctoral Fellowship from the U.S.
Department of Education as well as a fellowship from the John Simon
Guggenheim Memorial Foundation in 2008-2009, American Council of
88 Friedman
Learned Societies for a Fellowship in East European Studies with support from
the National Endowment for the Humanities and the Social Science Research
Council (2012-2013), an American Councils for International Education (ACTR/
ACCELS) Title VIII Research Fellowship with support from the U.S. Department
of State, Title VIII Program for Research and Training on Eastern Europe and
Eurasia (Independent States of the former Soviet Union) (2012). Some of the
research reported here from other sources was conducted while I was an
honorary visitor at the Center for Research on Language Diversity at La Trobe
University. The opinions expressed herein are entirely my own.
References
Friedman, Victor A. 2011. Review of Matras, Yaron. 2010. Romani in Britain: The Afterlife
of a Language. Edinburgh: Edinburgh University Press. The Journal of Language
Contact, 4. 295-301.
Friedman, Victor A. 2012a. A Tantrum from the Cradle of Democracy: On the Dangers
of Studying Macedonian. In Victor C. de Munck, and Ljupcho Risteski (eds.),
Macedonia: The Political, Social, Economic and Cultural Foundations of a Balkan
State, 22-43. London: I.B. Tauris.
Friedman, Victor A. 2012b. Copying and Cognates in the Balkan Sprachbund. In Lars
Johanson and Martine Robeets (eds.), Copies vs Cognates in Bound Morphology,
323-336. Leiden: Brill.
Friedman, Victor. 2016. The Importance of Aromanian for the Study of Balkan Language
Contact in the Context of Balkan-Caucasian Parallels. In Thede Kahl and Ioana
Nechiti, (eds.), Ethnic and Linguistic Diversity in Southeast Europe and the Caucasus.
Vienna: Austrian Academy of Sciences.
Friedman, Victor A. and Brian D. Joseph. Forthcoming. The Balkan Languages.
Cambridge: Cambridge University.
Gal, Suan. 2015. Imperial Linguistics and Polyglot Nationalism in Austria-Hungary:
Hunfalvy, Gumplowicz, Schuchardt. Balkanistica 28. 151-173.
Kahl, Thede. 2006. The Islamisation of the Meglen Vlachs (Megleno-Romanians): The
Village of Nânti (Nótia) and the “Nântinets” in Present-Day Turkey. Nationalities
Papers 34. 71-90.
Kokkas, N. 2004. Úchem so pomátsko. Ksanthi: Politistiko Anaptyksiako Kentro Thrakēs
Kănčov, Vasil. 1900. Makedonija: Etnografija i statistika. Sofia: Bălgarsko knižovno
družestvo.
Kurmel, Ömer Aytek. 1994. Kosova ve Çerkesler. Yedi Yıldız 4.14-15.
Kurtişoğlu Belma and Bülent Kurtişoğlu. 2012. Hidden Latin in Thrace: Notyalılar.
Paper delivered at the 3rd Symposium of the Study Group on Music and Dance
in Southeast Europe, Berovo, Republic of Macedonia, 4/17/2012-4/23/2012,
<http://www.ictmusic.org/group/music-and-dance-southeastern-europe>
accessed 11 December 2015.
Ladas, Stephen P. 1932. The Exchange of Minorities : Bulgaria, Greece, and Turkey. New
York: Macmillan.
Matras, Yaron. 2012. A Grammar of Domari. Berlin/Boston: Walter de Gruyter.
Mosley, Christopher (chief ed.) 2010. Atlas of the World’s Languages in Danger. Paris:
UNESCO.
Marushiakova, Elena and Veselin Popov. 2014. The Gypsies (Dom—Lom—Rom) in
Georgia. Proceedings of Annual Meeting of the Gypsy Lore Society and Conference
on Romani Studies, Bratislava, September 11-13, 2014. <https://sites.google.com/site/
glsproceedings/presentations> accessed 11 december 2015.
90 Friedman
Mülhäusler, Peter. 2002. How One Cannot Preserve Languages (but can preserve lan-
guage ecologies). In David Bradley and Maya Bradley (eds.), Language Endangerment
and Language Maintenance, 34-38. London: Routledge Curzon.
Ninčev, A. 1977. Četiriezičnijat rečnik na Hadži Daniil. Sofia: BAN.
Patkanov, K. I. 1887. O narečijah zakavkazskih Cigan: Boša i Karači. St. Petersburg:
Imperatorskaja Akademija Nauk.
Richmond, Walter. 2013. Circassian Genocide. New Brunswick, NJ: Rutgers University
Press.
Rusu, Valeriu (chief ed.) 1984. Tratat de dialectologie românească. Craiova: Scrisul
românesc.
Šanidze, A. 1971. Kartvelta monast’iri bulgaretši da misi t’ip’ik’oni. Tbilisi: Mecniereba.
Steinke, Klaus and Christian Voss (eds.) 2007. The Pomaks in Greece and Bulgaria: A
Model Case for Borderland Minorities in the Balkans. Munich: Verlag Otto Sagner/
Südoosteuropa Gesellschaft.
Üngör, Uğur Ümit. 2011. The Making of Modern Tukey. Oxford: Oxford University.
Wace, A. J. B. & M. S. Thompson. 1913. The Nomads of the Balkans. New York: Dutton.
Wixman, Ronald. 1980. Language aspects of Ethnic Patterns and Processes in the North
Caucasus. Chicago: University of Chicago Department of Geography.
Chapter 7
1 The number of ethnic Mingrelians (let alone that of competent speakers) is unknown, but it
has been estimated that speakers might number up to half a million.
Instilling Pride by Raising a Language ’ s Prestige 93
close sister Mingrelian (plus that of the more distant sister Svan) in Georgia,
compared the treatment of Laz in Turkey with that of Mingrelian and Svan in
Georgia (see Feurstein 1992), and I myself have contributed to the discussion
(Hewitt 1995). More recent examples could be cited, but simply consider this
brief summary: the leading Mingrelian politician of the 1920s, Ishak Zhvania,
advocated autonomy for Mingrelia together with the teaching of, and publi-
cation in, Mingrelian; books such as Ch’ita Chxoria ‘Red Ray’ and the Zugdidi
newspaper Q’azaxishi Gazeti ‘Peasant’s Paper’ appeared from 1930, but around
this time, as Stalin secured control of the Kremlin and Lavrent’i Beria assumed
Zhvania’s mantle as the leading Mingrelian politician, all talk of autonomy
ceased, Mingrelians were officially classified as ‘Georgians’, and publishing
in Mingrelian for Mingrelians (as opposed to linguistic or folkloristic works
for specialists) ended c.1938. Whilst it is popularly believed in Georgia that
Mingrelian, Laz and Svan are mere dialects of Georgian, even local linguists
who know this not to be the case but who do not wish to upset the official des-
ignation of these peoples as ‘Georgians’ have invented the (to my mind) odd
term ‘sociolinguistic dialect’ to describe the status of these tongues in Georgia.
This general downplaying of Mingrelian must surely have contributed to the
sad remark made by that old lady back in the 1980s.
Of course, the clue as to why Tbilisi has wanted as many residents of
Georgia as possible to identify themselves as ‘Georgians’ (just as for most of its
existence the Turkish Republic has tended to regard all residents of Turkey as
‘Turks’) lies in Zhvania’s desire to establish a Mingrelian autonomy. Separatism
was feared and remains feared to the present day.2 Let us recall Stalin’s own
words from his famous 1913 article on Marxism and the National Question:
‘What is to be done with the Mingrelians, the Abkhasians, the Adjarians, the
Svanetians, the Lesghians [lek’ebi in the Georgian version—BGH], and so on,
who speak different languages but do not possess a literature of their own? To
what nations are they to be attached? . . . What is to be done with the Ossets, of
whom the Transcaucasian Ossets are becoming assimilated (but are as yet by
no means wholly assimilated) by the Georgians and the Ciscaucasian Ossets
are partly being assimilated by the Russians and partly continuing to develop
and are creating their own literature? How are they to be “organised” into a
single national union?’ (pp. 48-9 of an undated English translation).
I asked an elderly Mingrelian in Upper Gal in the autumn of 2013 about his
ethnic self-awareness and how he distinguished between Mingrelians and
2 Interestingly, it seems that the reason for the quick demise of just 2 issues of the Laz news-
paper in Georgia in 1929 may have been the result of a request from Turkey to discontinue it
(Feurstein 1992.299).
94 Hewitt
Georgians. He replied: ‘We Mingrelians have our own language, but we are only
a people [Georgian xalxi], whereas the Georgians are a nation [Georgian eri].’
Asked how he differentiated between peoples and nations, he said that the
Georgians formed a nation ‘because they have a literature’! Whilst Stalin’s view
might have prevailed amongst (?most/many/all) Mingrelians, recent attempts
to introduce this categorisation to the Laz in Turkey by visitors from Georgia
have caused what I would judge to be entirely natural resentment. Consider
these quotes from a statement released in 2013 by a group of Laz intellectuals:
1) From September 2013, in the 5th-8th classes of the secondary schools Laz can be taken
as an elective lesson. In 2013, there were only five classes in the towns of Fındıklı (Rize)
and Arhavi (Artvin). In September 2014, the number of classes went up to 15, with pupils
taking Laz as an elective lesson in the towns of Ardeşen, Pazar and Fındıklı in the province
of Rize, and in the towns of Borcka and Arhavi in the province of Artvin. We are hoping that
the number of classes will go up to at least 30 in the next academic year and that classes will
also open in Hopa.
2) The Laz Institute, which was established in 2013, is developing relations with the lin-
guists and scholars around the world. It introduced courses for the teaching of Laz for adults
in Istanbul. The Institue has been commissioned by the Turkish Ministry of Education to
prepare text books to be used during the elective Laz lessons at the secondary schools.
The institute is also having talks with a number of universities in Turkey for Laz courses to
be introduced at these universities. Laz elective courses will continue to take place at the
University of Bosphorus (Bogazici Universitesi) in the 2014 academic year.
3) Laz intellectuals continue to work towards creating a Laz literature. The Lazika Yayın
Kollektifi (The Lazika Publication Collective) has published more than 60 books in the last
four years of its establishment, of which only five are in Turkish and all the rest are in Laz.
We have published Laz dictionaries, a Laz periodical (Tanura), five Laz novels, books on the
Laz history, Laz poem books, a high number of children’s books, and a variety of translations
into Laz from famous fairytales/stories from around the world. The fairytales/stories that
have been translated into Laz so far are as below and more are to come. We will be attending
to the Tüyap Istanbul Book Fair (the largest book fair in the country) in November. The Little
Prince (also translated into Mingrelian by the Laz Cultural Association), Romeo and Juliet,
The Little Black Fish, Polly-Anna, The Snow White, Don Quixote, Pinocchio.
5 This is not to suggest that the prime responsibility for passing on a language should lie with
schools. Native speakers are the ones who can and should most easily fulfil this task by
merely opening their mouths and speaking to children. But they might not feel inclined to
do so, if they do not recognise the importance of this simple task.
96 Hewitt
Georgian nationalism. With the aim of encouraging universal study and use
of the language, a law was introduced on 27 November 2007 under the late
Pres. S. Bagapsh requiring all official business to be conducted in Abkhaz from
1 January 2015. Whilst this might have been (and indeed remains as of October
2015) a noble aspiration, passing a law without making provision in terms
of teaching, publication of relevant language-materials, etc . . . means that it
will be impossible to enforce (the Vice-President elected on 24 August 2014
does not, for example, know Abkhaz!) and renders that law pointless. With
Abkhazia’s economy still in a parlous state, any assistance in terms of help with
producing suitable manuals and the training of teachers (especially for those
classes and subjects that have traditionally been taught in Russian even in
Abkhaz-language schools once tuition switches after the first few grades from
Abkhaz to Russian) would be welcome.
And, of course, the question of Mingrelian is also relevant to Abkhazia.
Here I have long advocated that, if the local Mingrelians want their children to
be educated through the medium of Georgian (as seems to be the case), this
should be allowed on condition that Mingrelian too is taught in local schools
up to a certain level of competence in order to raise its profile and prestige
amongst its native speakers. This view is not popular, however, with many
Abkhazians, I have to admit! But if the cause is noble, the battle is worth fight-
ing, as I hope we can all agree.
Postscript
I should add in conclusion that in an e-mail received from Georgia only a week
before this paper was read at Ardahan University a Mingrelian correspondent
who has translated and published a Mingrelian translation of ‘The Little
Prince’ = ch’ich’e mapaskiri6 (organised by the Laz [sic] Cultural Organisation
in Turkey, as noted in Footnote 4 above) told me that attitudes in Georgia
to publishing Mingrelian materials seem to be changing, a greater readiness
than, say, 10-15 years ago to accept such publications being noticeable. This was
welcome news, and one can only hope firstly that it is true and secondly that
such tolerance widens and deepens there.
6 See http://www.petit-prince-collection.com/lang/show_livre.php?lang=en&id=2615.
Instilling Pride by Raising a Language ’ s Prestige 97
References
Feurstein, Wolfgang. 1992. Mingrelisch, Lazisch, Swanisch. Alte Sprachen und Kulturen
der Kolchis vor dem baldigen Untergang, in [B.] George Hewitt (ed.) Caucasian
Perspectives, 285-328. Unterschleissheim/München: Lincom Europa.
Hewitt, [B.] George. 1995. Yet a third consideration of Völker, Sprachen und Kulturen
des südlichen Kaukasus., Central Asian Survey, 14.2, 285-310.
Wheatley, Jonathan. 2009. Georgia and the European Charter for Regional or Minority
Languages, European Centre for Minority Issues (ECMI), Working Paper 42.
Chapter 8
which form the Nakh-Daghestanian language family. There are also a significant
number of languages spoken in Daghestan that belong to other families:
Azeri, Kumyk, Nogay, which belong to the Turkic family, and Tat, which
belongs to the Iranian branch of the Indo-European family.
for example, Hinukh spoken by 600 people. The largest of the unwritten ethnic
groups is Andi whose language is spoken by about 30 000-40 000.
The lack of reliable data on the number of the so-called small ethnic groups
falls in the category of curiosities of demographic statistics in Daghestan.
However, according to the most precise data it is assumed that the number of
speakers of the unwritten languages, as part of the three million people inhab-
iting the region, exceeds 150 000. More than 100 000 people represent 13 ethnic
groups of the Andi-Tsez language group. Agul, Tsakhur, Rutul, and Archi that
belong to the Lezgic language group have recently received writing systems
and become newly literate, are spoken by ca. 50 000. It is very difficult to deter-
mine the number of speakers of other unwritten languages of the Lezgic group
such as Budukh, Khinalug, Kryz, Udi, which are located outside Daghestan and
almost totally assimilated to Azeri (Gamzatov 1995). Azeri was an inter-ethnic
language of communication for Aguls, Rutuls, Tabasarans, and Tsakhurs until
their languages got their official status.
An interesting situation is within the Avar-Ando-Tsezic group which consist
of fourteen languages: Andi, Akhwakh, Bagwalal, Botlikh, Godoberi, Karata,
Tindi, Chamalal-Andic subgroup; Bezhta, Hinukh, Hunzib, Khwarshi, Tsez-
Tsezic subgroup, among which Avar is the only written and literary language,
and the other thirteen languages have no writing tradition and are limited
to domestic and family use. Of course, it is expected that multilingualism
would be widespread in this region (i.e. knowledge of more than two or three
languages). The pattern of multilingualism is the so-called ethnic-ethnic-
Avar-Russian multilingualism (which is, for example, widespread among
Hinukh speakers: Hinukh-Tsez-Avar-Russian or Hinukh-Bezhta-Tsez-Avar-
Russian multilingualism). Another pattern of multilingualism is ethnic-Avar-
Russian trilingualism (which is widespread), against which the ethnic identity
of the representative of an unwritten language is developed. Trilingualism is
a standard way of communication for the Andi-Tsezic speakers. The speakers
of the Andi language, for example, identify themselves primarily as Andi and
yet call themselves Avar, as Avar language is the lingua franca in this group.
Interestingly, outside the republic, the Andi speakers identify themselves as
Daghestanians, and outside the country, they are simply Russians. This is the
multi-ethnic hierarchy of ethnic and linguistic identity of the Daghestanian
highlanders (Gamzatov 2005).
The majority of the Andic languages’ speakers live in the most mountain-
ous part of Daghestan, in the Andi Koysu river basin between Andi and Bogos
ridges. The western border between Daghestan, Chechnya and Georgia coin-
cides with the ethnic boundaries. Some Andic languages are also represented
in the republic of Azerbaijan (for example, village Ahvahdere, where Akhvakh
speakers live). The main language of communication there is Avar.
Unwritten Minority Languages of Daghestan 101
Since ancient times, the Tsez people (also known by the Georgian name
Dido) inhabit the Western part of Daghestan and partially Georgia, keeping
close ties with each other. Interestingly, the languages in the intra-group com-
munication for the Tsezic speakers are Bezhta (which is used with Hunzib and
Hinukh speakers) and Tsez (with Hinukh speakers); today Bezhta and Tsez still
partially provide these functions. Due to the economic factors and geographi-
cal proximity, the Tsezic languages’ speakers were influenced by Georgia, and
have been linked more closely with the Georgians, rather than with the Avar.
Currently, Avar serves as the language of interethnic communication among
the Tsezic peoples.
Analysis of the Daghestanian languages clearly confirms that the declara-
tions and decisions of language development in the given circumstances are
impossible without taking into account unwritten languages, since there has
been a large and prolific branch of linguistics dedicated to these languages.
Moreover, unwritten languages of Daghestan have become not just subfield,
but an independent field of modern Daghestanian linguistic and ethnographic
research. As a result, various projects and special programs for the study of
minority peoples, their genesis, historical and cultural past and present have
been developed in the historical, philological and sociological departments
of the Daghestan Scientific Center of the Russian Academy of Sciences. Serial
publications of historical and ethnographic materials titled ‘Small peoples of
Daghestan’ are ongoing. As a result, historical and ethnographic descriptions
of all Andi-Tsez and Archi languages have been published. Collection of oral
and poetic heritage of unwritten peoples and ethnic groups by folklorists have
become a very successful endeavor. Much work has been done in the study of
folk-art traditions and handicraft culture of these ethnic groups.
The idea of preservation of small unwritten languages and protection of
ethnic identities assumes a number of plans, projects and programs. The most
important current issues for the Daghestanian, particularly small unwritten,
languages are collection and documentation, comprehension and interpreta-
tion of the lexical data of each language. Therefore, the department of lexi-
cography and lexicology of the Daghestan Scientific Center of the Russian
Academy of Sciences carries out a multi-year program of making ethnic-
Russian bilingual dictionaries of unwritten languages. It is well known, that the
only reliable way to gather the lexical material of these minority languages is to
work among ethnic groups themselves, i.e. though direct fieldwork with their
speakers. Lexical information is being documented during extensive fieldwork
in the highland regions of Daghestan, and nowadays, almost all unwritten lan-
guages have their dictionary manuscripts, which include about seven to nine
thousand of lexical units of up to eight hundred pages. Thirteen of eighteen
dictionaries of this type have been published within two decades. This became
102 Alieva and Khalilov
Therefore, the main and urgent task is collecting and documentation of the
still existing linguistic, historical and cultural, folklore and ethnographic mate-
rial of the indigenous ethnic groups in Daghestan, which previously had not
been studied. Keeping all this wealth is a necessary prerequisite for the study
of history, ethnography and language, familiarizing the younger generation to
folk wisdom and transmitting the language and the whole bulk of folklore to
future generations.
A great amount of work in documenting the culture and the unwritten lan-
guages of Daghestan was done in the Department of Linguistics in Max Planck
Institute for Evolutionary Anthropology (Leipzig, Germany) under the super-
vision and with the financial support of professor, Ph.D., former director of
Department of Linguistics, Bernard Comrie. In particular, fundamental work
on folklore of Tsezic people was made within a few years. Today the follow-
ing books have been prepared and published: Tsez folklore by Abdulaev and
Abdulaev and Khwarshi folklore by Karimova. These books include Khwarshi
and Tsez tales, legends, stories, anecdotes, folk songs and ritual laments, as well
as the sample texts of spoken language (dialogues and monologues).
‘Bezhta-Russian phraseological, folklore-ethnographic dictionary’, of 420
pages, has been prepared by Khalilov and first released in 2014. It includes
rich folklore ethnographic material of the Bezhta people. The dictionary is the
first work on lexicographical description of phraseology, including paremiol-
ogy and folklore-ethnographic expressions of Bezhta. The dictionary includes
phraseological units, proverbs and a full list of common set expressions. The
dictionary lists independent Bezhta good wishes, teachings, swear words,
curses, folk beliefs, omens, divinations, spells, riddles, rhymes, tongue twisters,
funny expressions, as well as tribal (domestic) labels, signs in order to denote
appurtenance. The dictionary can be a source for future theoretical research
on phraseology, paremiology and, in particular, Daghestan-Russian phraseog-
raphy and paremiography.
Bezhta phraseological, paremiological and folklore-ethnographic units
are diverse in their content and function. The Bezhta folklore emerged from
the everyday observations of social and natural phenomena, some units are
associated with mythology and real historical events, some of them are bor-
rowed from related (Avar) or unrelated (Georgian and Russian) languages. A
certain part of idiomatic, paremiological and folklore-ethnographic material
of Bezhta dates back to the oriental languages, namely, Arabic. In general, this
folklore work is the most valuable material for studying the life of Bezhta peo-
ple in its historical aspect.
As can be seen from a brief analysis, the study of languages of the Tsezic sub-
group has been done relatively well. A great work has been done on compiling
Unwritten Minority Languages of Daghestan 105
their position. The law could clearly specify the place and the role of the native
languages in the life of the Daghestanian peoples.
The serious social support is still needed for another category of Daghestanian
languages, which are literary languages and languages that recently became
written (they are about fourteen). These languages already have educational,
scientific and artistic material, printed books, and mass media (radio and tele-
vision). Thus, they can be considered to be “safe” or not endangered. However,
nowadays there is a dangerous recurrence of narrowing the quantitative and
qualitative areas of native language use, reduction of school programs of the
native language and native literature. Many children, who grow up in the
urban areas, do not speak their mother tongue. This happens under irrevers-
ible process of urbanization, globalization and under the influence of Russian,
which is a means of inter-ethnic communication today. Native languages are
being moved into the background of public attention.
The change in the language situation, a revival of the ethnolinguistic life
in the country is only possible with the help of reasoned, comprehensive and
radical measures aimed for the future. At present, there is an ongoing mass
ethno-linguistic assimilation: out of three million Daghestanian people almost
two-third live in ethnic amalgamation. In addition, more than six hundred
thousand live outside Daghestan. Such territorial dispersal, as well as ethnic
intermarriages (almost every tenth Daghestanian family is exogamous, i.e. the
marriage partners are representatives of different ethnic groups), lead to the
state in which many traditional forms of culture are not associated with ethnic
identity any more.
Bibliography
Abdulaev, Arsen K. & Isa K. Abdullaev. 2010. Didojskij (cezskij) fol’klor. Makhachkala:
Lotos.
Comrie, Bernard, Khalilov, Madzhid and Khalilova, Zaira. 2015. Grammatika
bežtinskogo jazyka: fonetika, morfologija, slovoobrazovanie (A grammar of Bezhta).
Leipzig-Makhachkala: ALEF.
Forker, Diana. 2013. A Grammar of Hinuq. Berlin: De Gruyter.
Gamzatov, Gadži G. 1995. Bespis’mennyi, no Zhivoi, Real’nyi (Predislovie k serii
«Bespis’mennye jazyki Dagestana») // Khalilov, Madžid Š. “Bežtinsko-russkij slovar’.
Makhachkala: Nauka, p. 12.
Gamzatov, Gadži G. 2005. Lingvističeskaja planeta Daghestan. Etnojazykovoj Aspekt
Osvoenija. Moskva: Nauka, p. 63.
Ganieva, Faida A. 2002. Khinalug-Russian Dictionary. Makhachkala: Nauka.
Unwritten Minority Languages of Daghestan 107
Isakov, Isak A. & Khalilov, Madžid Š. 2012. Gunzibskiy Jazyk (A grammar of Hunzib):
Fonetika. Morfologiya. Slovoobrazovanie. Leksika. Teksty). Makhachkala: Nauka.
Jazyki Narodov Rossii. Krasnaja Kniga. Enciklopedičeskij slovar’-spravočnik. Moskva:
Academia, 2002.
Karimova, Raisat Š. 2014. Fol’klor xvaršincev. Makhachkala: ALEF.
Khalilov, Madžid Š. 1995. Bežtinkso-russkij slovar’. Makhachkala: Institut JaLI DNC RAN.
Khalilov, Madžid Š. 1999. Cezsko-russkij slovar’. Makhachkala: Institut JaLI DNC RAN.
Khalilov, Madžid Š. 2014. Bežtinsko-russkij frazeologičeskij, fol’klorno-etnografičeskij
slovar’. Makhachkala: ALEF.
Khalilov, Madžid Š. 2015. Slovar` bežtinskogo jazyka. Makhachkala: Nauka.
Khalilov, Madžid Š. & Isakov, Isak. 2001. Gunzibsko-russkij slovar’. Moskva: Nauka.
Khalilov, Madžid Š. & Isakov, Isak A. 2005. Ginuxsko-russkij slovar’. Makhachkala:
Nauka.
Khalilova, Zaira. 2009. A grammar of Khwarshi. Utrecht: LOT.
Kibrik, Aleksandr E., Sandro V. Kodzasov, Irina P. Olovjannikova, and Džalil S. Samedov.
(1977). Opyt strukturnogo opisanija arčinskogo jazyka. Tom 1. Leksika. Fonetika (in
Russian). Moscow: Izdatel’stvo moskovskogo universiteta.
Kibrik, Aleksandr E. et al. 2001. Bagvalinskij jazyk: Grammatika. Teksty. Slovar’ (A gram-
mar of Bagwalal). Moskva: Nasledie.
Magomedova, Patimat T. 1999. Čamalinsko-russkij slovar’. Makhachkala: Nauka.
Magomedova, Patimat T. 2003. Tindinsko-russkij slovar’. Makhachkala: Institut JaLI
DNC RAN.
Magomedova, Patimat T. 2004. Bagvalinsko-russkij slovar’. Makhachkala: Institut JaLI
DNC RAN.
Magomedova, Patimat T. 2012. Tindinskij jazyk. Makhachkala: Nauka.
Magomedova, Patimat T. & Rašidat Š. Xalidova. 2001. Karatinsko-russkij slovar’. Sankt-
Peterburg: Scriptorium.
Magomedova, Patimat T. & Abdullaeva, Indira I. 2007. Axvaxsko-russkij slovar’.
Makhachkala: Nauka.
Meilanova, Unejzat A. 1984. Buduxsko-russkij slovar’. Moskva: Nauka.
Radžabov, Ramazan N. 1999. Sinstaksis cezskogo jazyka. Moskva: Academia.
Saidova, Patimat A. 2006. Godoberinsko-russkij slovar’. Makhachkala: Institut JaLI DNC
RAN.
Saidova, Patimat A. & Magomed G. Abusov. 2012. Botlixsko-russkij slovar’. Makhachkala:
Institut JaLI DNC RAN.
van den Berg, Helma. 1995. A Grammar of Hunzib (with texts and lexicon). Lincom
Europa, München.
chapter 9
Yong-Sŏng Li
1 Introduction
Having realized the importance of the research and the documentation of the
endangered Altaic languages, 8 scholars of the Altaic Society of Korea carried
out at their own expense or on a personal level field researches at least 15 times
from 1972 to 2002, on Dagur, Sibe, Uilta, spoken Manchu, Ewenki, Orochen, and
Hezhe.1 All of these 8 scholars are male and graduated from the Department of
Linguistics at Seoul National University. There is no Turkologist among them.
These field researches are tabulated as follows:2
Members of this small group began to work to organize the ASK REAL (Altaic
Society of Korea, Researches on Endangered Altaic Languages), the full-
scale research and documentation project on endangered Altaic languages.3
Supported by the Korea Research Foundation Grant, the following two succes-
sive research projects were carried out:4
These two research projects are called ASK REAL unofficially. The director of
them was Ju-won KIM.5
These two research projects had the following two main goals:6
(1) Accumulating the extensive data on Altaic languages for the future study
on the genealogy of the Korean language;
(2) Joining the world-wide efforts for the documentation of endangered
languages.
2 Field Researches
7 See http://altaireal.snu.ac.kr/askreal_v25/altlang_main.html.
8 See Kim 2008: 136, Kim 2011: 51, and Yu 2014: 24.
9 See Kim 2008: 139, Kim 2011: 52, and Yu 2014: 24. For field research equipment, see Kim 2008: 143-
151, Kim 2011: 71-77, http://www.cld-korea.org/eng/fieldwork/fieldwork4_1.php, http://www
.cld-korea.org/eng/fieldwork/fieldwork4_2.php, and http://www.cld-korea.org/eng/field
work/fieldwork4_3.php.
FIELDWORK STUDIES OF THE ENDANGERED TURKIC LANGUAGES 111
According to the circumstances, the roles of the team members could overlap
and the number of the team members increased or decreased one or two.10
2.3 Questionnaire
Referred to Èwēnkèyǔ Jiǎnzhì (鄂温克语简志 “Brief record of the Ewenki
language”) of China, the first questionnaire of the Altaic Society of Korea was
prepared and used for Ewenki and Dagur in 2000.11 This questionnaire con-
sisted of vocabulary, grammar and conversational components. It was revised
and enlarged. In the version of September 2003, i.e. the first version for the first
research project, it had 1,625 vocabulary items, 219 conversational sentences,
and 182 grammatical sentences.12 Based on this version for China, the ver-
sions for Russia and Mongolia were prepared in 2003 and in 2005 respectively.13
These questionnaires were revised and enlarged continuously. Now they typi-
cally list around 2,750 vocabulary items, 340 conversational sentences, and 380
grammatical sentences.14
The Altaic Society of Korea has selected 24 lexical item classifications, seven
grammatical categories, and 17 situations for conversational settings.15 These
questionnaires are fit for the surveys which typically last 3-4 days for 6 hours
a day.16 The informants were requested to answer twice for each entry. In addi-
tion, spontaneous speech was often gathered in an impromptu manner.17
The phonological portion is withheld from these questionnaires, because in
many cases it is not possible to know the phoneme inventory due to the lack of
description of phonemes for the language to survey. Research into the inven-
tory is later performed using lexical data.18
When considering content placement, it is important to record informa-
tion about the native speaker informant in the front of the questionnaire. The
informant’s name, age, gender, ethnicity, family situation, information about
language use, birthplace, residence history, and other relevant pieces of infor-
mation should be recorded.19
2.5 Publications
The ASK REAL published three books for the consolidated reports of the proj-
ect in Korean:22
(1) The Altaic Society of Korea (ed.) (2006). Fieldwork Studies of Endangered
Altaic Languages – For the Genealogical Study of Korean and the
Preservation of Endangered Languages –. The Language and Cultural
Studies Series 2. Paju: Taehaksa. [한국알타이학회 (엮음) 2006. 절멸 위
기의 알타이언어 현지 조사 – 한국어 계통 연구와 알타이언어
보존을 위하여 –. 알타이학회 언어 문화 연구 2. 파주: 태학사.]
This book is a general report on the ASK REAL project done from
September 2003 to February 2006 (Fieldwork Studies of Altaic Languages
for Genealogy of Korean).
(2) Kim, Ju-won, et al. 2008. Documentation of Endangered Altaic Languages.
Paju: Taehaksa. [김주원 외. 2008. 사라져가는 알타이언어를 찾아서. 파
주: 태학사.]
This book is a general report on the ASK REAL project done from
September 2003 to August 2006 (Fieldwork Studies of Altaic Languages
for Genealogy of Korean).
(3) Kim, Ju-won, et al. 2011. Documentation of Altaic Languages for the
Maintenance of Language Diversity. Paju: Taehaksa. [김주원 외. 2011. 언어
다양성 보존을 위한 알타이 언어 문서화. 파주: 태학사.]
This book is a general report on the ASK REAL project done from July
2006 to June 2009 (Building Digital Archive of Altaic Languages for the
Study of Genealogy of Korean).
19 See Kim 2008: 156, Kim 2011: 58, and http://www.cld-korea.org/eng/fieldwork/field-
work3_1.php.
20 See http://www.cld-korea.org/eng/archives/archives_1.php. For detail, see Kim 2008: 176-
193, Kim 2011: 77-87, and Yu 2014: 27-28.
21 See Kim 2008: 176.
22 See Yu 2014: 28 and http://www.cld-korea.org/eng/fieldwork/fieldwork3_1.php.
FIELDWORK STUDIES OF THE ENDANGERED TURKIC LANGUAGES 113
The Members of the ASK REAL published seven descriptive grammars on some
endangered Manchu-Tungusic, Mongolic, and Turkic languages in English as
Altaic Languages Series:23
(1) Kim, Ju-won, et al. 2008. Materials of Spoken Manchu. Altaic Languages
Series 01. Seoul: Seoul National University Press.
(2) Yu, Won-soo, et al. 2008. A Study of the Tacheng dialect of the Dagur lan-
guage. Altaic Languages Series 02. Seoul: Seoul National University Press.
(3) Li, Yong-Sŏng, et al. 2008. A Study of the Middle Chulym dialect of the
Chulym language, Altaic Languages Series 03, Seoul: Seoul National
University Press.
(4) Yu, Won-soo. A Study of the Mongol Khamnigan spoken in northeastern
Mongolia. Altaic Languages Series 04. Seoul: Seoul National University
Press.
(5) Li, Yong-Sŏng. 2011. A Study of Dolgan. Altaic Languages Series 05. Seoul:
Seoul National University Press.
(6) Kim, Ju-won. 2011. A grammar of Ewen. Altaic Languages Series 06. Seoul:
Seoul National University Press.
(7) Ko, Dong-ho & Gyu-dong Yurn. 2011. A Description of Najkhin Nanai.
Altaic Languages Series 07. Seoul: Seoul National University Press.
The first three books were selected among the excellent scholarly book
(and the only books in a foreign language) for 2009 by the National
Academy of Sciences of Korea.
The Members of the ASK REAL also published two monographs on the meth-
odology of field work researches and documentation in Korean:24
23 See Yu 2014: 29-30. These books do not contain spontaneous speech. The 8th volume
of this series was also published in 2015 as follows: Kim, Ju-won, Dong-ho Ko, Antonina
Kile & Moon-jeong Choi. 2015. The Life and Rituals of the Nanai People. Altaic Languages
Series 08. Seoul: Seoul National University Press.
24 See Yu 2014: 29.
114 Li
Papers on the findings of the fieldwork studies have been published in domes-
tic and foreign academic journals, or presented in conferences in Korean, or in
English.25
The data obtained was very small in number. Therefore, the second fieldwork
for Fuyü Kyrgyz was carried out during January 15-16, 2004 in the village of
Qijiazi. However, it was only possible to obtain 363 words and 43 conversa-
tional sentences. It was also impossible to obtain any data on grammatical
sentences.
Upon the result of the fieldworks on Fuyü Kyrgyz, the original plan of the
research project was changed to survey as many Altaic (Turkic in particular)
languages as possible. Each team tried to save expenses. Consequently it was
possible to carry out more fieldwork studies than in the original plan.
It was impossible to get a visa for Turkmenistan in Seoul at that time.
Therefore, Prof. Dr. Mehmet Ölmez, one of the foreign collaborators, carried
out with his colleagues the fieldwork studies on the Oghuz dialects of Amu
Darya region in Turkmenistan during May 7-17, 2004.
The following Turkic languages were surveyed in the 1st research project.26
(Information is given in the following order: name of the object language
(or dialect); place of the fieldwork study; time period of the fieldwork study):
(1) Fuyü Kyrgyz; Wujiazi and Qijiazi, Fuyu County, Qiqihar, Heilongjiang
Province, China; September 23-24, 2003
(2) Fuyü Kyrgyz; Qijiazi, Fuyu County, Qiqihar, Heilongjiang Province, China;
January 15-16, 2004
(3) Shor (Mrass dialect); Myski, Kemerovo Province, Russia; April 20-23, 2004
(4) Oghuz dialects of Amu Darya region; Ashgabat, Turkmenistan; May 12-17,
2004
(5) Tuvan (Kök Monchak dialect); Aqqaba, Altay Prefecture, Xinjiang Uyghur
Autonomous Region, China; October 21-22, 2004
(6) Kazakh; Almaty, Kazakhstan; January 4-7, 2005
(7) Yakut; Yakutsk, Sakha Republic, Russia; February 16-20, 2005
(8) Chuvash; Cheboksary, Chuvash Republic, Russia; April 19-22, 2005
(9) Tuvan (Kök Monchak dialect); Qanas and Aqqaba, Altay Prefecture,
Xinjiang Uyghur Autonomous Region, China; April 28-May 4, 2005
(10) Tuvan (Tsaatan dialect); Khatgal, Khövsgöl Aymag, Mongolia; June 23-26,
2005
(11) West Yugur; Hongwansi, Sunan Yugur Autonomous County, Gansu
Province, China: October 17-21, 2005
(12) Gagauz in Kiev, Ukraine: February 5 and 17, 2006
(13) Urum (Oghuz dialect); Mariupol’, Donetsk Province, Ukraine; February
7-11, 2006
(14) Urum (Kypchak dialect); Mariupol’, Donetsk Province, Ukraine; February
8-9, 2006
(15) Krymchak; Simferopol’, Autonomous Republic of Crimea, Ukraine;
February 12-14, 2006
(16) Karaim (Crimean dialect); Jevpatorija and Simferopol’, Autonomous
Republic of Crimea, Ukraine; February 15-16, 2006
(17) Chulym Tatar (Middle Chulym dialect); Tomsk, Tomsk Province, Russia:
May 15-18, 2006
(18) Siberian Tatar; Tomsk, Tomsk Province, Russia; May 18 and 20, 2006
26 Two co-researchers performed a preliminary survey of Altai and Khakas during
February 17-19, 2004 in Novosibirsk, Russia. This survey is not included here.
116 Li
(19) Chulym Tatar (Lower Chulym dialect); Tomsk, Tomsk Province, Russia:
May 19-21, 2006
(20) Tuvan (Uriankhai dialect) in Tsagaan-Üür, Khövsgöl Aymag, Mongolia:
July 1-3, 2006
The author participated in all the fieldwork studies listed here except for
4-7. The informant of Lower Chulym was the last speaker of this dialect
and died in 2011. Therefore, the linguistic data collected by the author is
probably the last one for this dialect.
The Tofa-speaking region was not easily and safely accessible. It would be very
expensive to go there. Moreover, it was informed that there was no fluent Tofa
speaker. Therefore, Khakas was selected instead of Tofa.
Altai was selected instead of Gagauz, because it was almost impossible to
find informants at that time in winter.
118 Li
(1) Turkic: Fuyü Kyrgyz, Shor, Tuvan, Kumyk, Gagauz, Karaim, Urum
(2) Mongolic: Mongolian, Buryat, Kangjia, Oirat-Kalmyk, Bonan
(3) Manchu-Tungusic: Manchu, Uilta, Nanai, Ewenki, Ewen, Sibe, Orochi,
Solon
However, this research project was rejected by the examiners of the National
Research Foundation of Korea on the pretext that the members of the
ASK REAL did nothing in spite of the enormous support of the Korea Research
Foundation. This is simply not true at all. As mentioned above, the members
of the ASK REAL wrote a lot of books, articles, etc. Moreover, three books were
among the excellent scholarly book (and the only books in a foreign language)
for 2009 selected by the National Academy of Sciences of Korea. The real rea-
son is the common practice of funding two successive research projects at
most regardless of good result in Korea.
From July 2009 to January 2010, Seoul National University supported the
members of the ASK REAL to prepare a web site in English providing infor-
mation on the ASK REAL. The result is the web site http://altaireal.snu.ac.kr/
askreal_v25.
With funding from the National Research Foundation of Korea, Ju-won KIM
and his colleagues carried out the “Languages and Culture of the Indigenous
Peoples of the Amur River” Project from May 1, 2010 to April 30, 2013. The
author participated in this project for the first six months.
Sponsored by the Ministry of Culture, Sports and Tourism of Korea,
Ju-won KIM and his colleagues including the author prepared the web site
of the Center for Language Diversity (http://www.cld-korea.org/index.php)
from October 2010 to March 2011. This site has also English version consist-
ing of 4 sections of About, Fieldwork, Archives, and Bibliography. The materi-
als of the sections Fieldwork, Archives, and Bibliography are related with the
ASK REAL.
FIELDWORK STUDIES OF THE ENDANGERED TURKIC LANGUAGES 119
4 Conclusion
(1) The period of fieldwork studies was not sufficient. So, it was not possible
to survey many informants. However, it would cost too much to survey for
a long time, for example, one or two months. Moreover, there was almost
nobody to do so, especially among the graduate research assistants.
(2) The number of informants was usually very small for each language. In
many cases, there were only one or two informants. Therefore, the reli-
ability of the collected data may be questionable. However, the members
of the ASK REAL did their best in a given situation.
(3) It was not possible for the members of the ASK REAL to continue their
fieldwork studies on (endangered) Altaic languages due to the common
practice of funding two successive research projects at most regardless
of good result in Korea. Financial support is needed for those fieldwork
studies.
(4) The members of the ASK REAL digitalized the collected data, and
marked and extracted each item with Sony Sound Forge for audio data
and Window Movie Maker for video data. These works were taken by
the graduate research assistants in almost all cases. However, the most of
these data are not transcribed by the experts. Practically no one among
the graduate students, the next generation of the discipline, wants to
devout himself/herself to Altaic studies in Korea. Because there is little
possibility for finding any chance in Altaic studies.
References
The Altaic Society of Korea (ed.) (2006). Fieldwork Studies of Endangered Altaic
Languages – For the Genealogical Study of Korean and the Preservation of Endangered
Languages –. The Language and Cultural Studies Series 2. Paju: Taehaksa. [한국알
타이학회 (엮음) 2006. 절멸 위기의 알타이언어 현지 조사 – 한국어 계통 연구
와 알타이언어 보존을 위하여 –. 알타이학회 언어 문화 연구 2. 파주: 태학사.]
Baskakov, Nikolaj A. (ed.) (1966). Jazyki narodov SSSR: 2. Tjurkskije jazyki. Moskva:
Nauka.
120 Li
Nicholas Ostler
1 Burenhult 2000.
they will have developed different vocabularies to support their very different
life-styles; but they will also have perpetuated their languages by means that
are as old as language itself: the young learn to speak and understand from
the old. This process of learning requires imitation of sound in the context of
stimulus, but also rational reconstruction of the system and the meanings it
conveys. As a process, it does not give perfect copies of the elders’ speech, and
there is always the potential, in every generation, for the language to be a little
different from how the elders spoke it. And since there is no general opportu-
nity for distant tribes to contact one another and communicate, this process
of imperfect learning will have led – over 100,000 years – to all the diversity of
language, in phonetics, vocabulary and structures that we now discover when
we review the languages humanity still speaks and (where documented in
writing) has spoken.
This account of the origin of human language diversity is not just a ratio-
nal reconstruction. The purported route of humanity round the world is
confirmed by the pattern of gene mutations (Y chromosome for men,2 mito-
chondrial DNA for women),3 and also the gross positioning of detectable fami-
lies of languages.4
The ability to use language must have been important in making this
explosion of humanity’s homelands the success that it was. Language makes
thoughts, ideas and plans discrete and potentially explicit: therefore, they
become available for inspection, and for discussion with, and transmission to
others who use the same code. Joint planning is possible, and so is co-ordination
of action beyond the instinctive or habitual. Imagined possibilities can be
explored, loyalties pledged, memories recalled. This makes a tribe with lan-
guage much more powerful and effective in the game against nature.
But this does not exhaust language as it was experienced. It was not just
language as such: rather, it was a multitude of languages.
Given the spread of people out beyond the bounds of regular contact and
the iron law of imperfect transmission from generation to generation, differ-
ent, incompatible codes of language came to be used. This had little ill-effect
in these early days of human spread and residence: by its nature, the people
who spoke languages other than yours were the people you were unlikely to
meet, or to collaborate with. But even within a single code well within your
understanding, regional groups would speak with perceivable variation, so that
2 https://en.wikipedia.org/wiki/Human_Y-chromosome_DNA_haplogroup#Major_Y-DNA_
haplogroups.
3 https://en.wikipedia.org/wiki/Human_mitochondrial_DNA_haplogroup.
4 http://emeld.org/workshop/2004/bibiko/bibiko-original.html.
124 Ostler
speech became a marker of tribe or kin-group: how much more so, when one
encountered people who were incomprehensible. The way one spoke became
a badge of belonging – both reinforcing solidarity among comrades, but also –
when among enemies – providing a give-away shibboleth of one’s true loyalties.
This badge function of language patterned with, and perhaps sometimes
also against, other markers in dress and custom. But it also fed into the reserves
of remembered lore which came with an upbringing in a particular language,
so that each language grew its own traditions and literature, recall of which
served to reinforce, in a different way, the conscious identity of speakers.
However, this scattering of humanity into small, self-conscious and self-loyal
groups ran up against another characteristic of the environment, sometime
after the human species had taken effective occupation of all the landmasses
of the earth. This was man’s ability to domesticate and intensify the food ani-
mals and food plants: pastoralism and agriculture. Rather than living off the
land, seeking whatever might be hunted or gathered, some groups began to
adapt their life-style to control of foodstuffs.
This had various effects on their lived world. Importantly, they became more
sedentary, staying with their fields and pastures, and growing their population
in place, since the new methods meant more food could be produced with the
new approach to applying human effort. Sedentary bases needed to be pro-
tected, both from weeds and wild beasts, but also from other human beings
applying the world’s oldest labour-saving device: robbery. Protection and
predation – defensive and aggressive warfare – became specialist occupations;
and the utility of language in organizing large numbers of people was increas-
ingly valued. Large groups came to be safer – and more powerful – than small
ones; and the large groups began to define particular roles – ranks, functions
and jobs – for individuals. It became possible to progress through a career, but
even more to destine people for particular roles – including priests to preside
over tradition, and rulers to preside over the whole structure of agriculural and
pastoral society.
All in all, the economic principle of falling marginal costs began to assert
itself. When controlled units of production were large, the cost of adding to
them was less than where the units were smaller: growth fed on itself, mak-
ing bigger units natural winners – and conversely the traditional, small self-
sufficient units, natural losers. At least, this was the case where resources were
relatively easy to access: so the rich lands fell to larger and fiercer tribes, with
marginal land being left to the traditional hunter-gatherers.
Two of the problems of running large units is the threat of anonymous theft,
and the difficulty of enforcing distributive contracts when they rely solely on
the memory of (corruptible) participants and witnesses. A useful response to
Empire, Lingua Franca, Vernacular 125
this was the development of tally systems, and as their flexibility was appreci-
ated, they changed into complete system writing, which could represent dis-
tinctly anything that could be said. Once this was available (defining another
rank, function and job – the scribe) it soon became clear that there were many
other uses to which this frozen speech could be put, not least correspondence
which would make rulers’ decisions knowable, and enforceable, wherever
their language was understood.
Writing made possible the control of large-scale empires, the use of written
orders and intelligence overcoming a natural limitation on an oral command
structure. It also facilitated long-distance trade, by clarifying the content of
orders and payments. But the effectiveness of these could only be realized if
the recipients of messages could understand them. There was now a premium
on understanding the language of the ruler or the merchant – in other words,
the need for a common language between instructor and agent.
People now had a motive to learn other languages, at least the language of
those in power. This was the simplest case, but in fact, there was no pressing
need for the language of communication to be the instructor’s own, as long
as there was some language – a lingua franca – which was accessible both to
instructor and agent. Hence the Achaemenid empire of the Persians, founded
by Cyrus and destroyed by Alexander the Great, had for 400 years kept Persian
as the language of the rulers, while imperial communications over their mas-
sively multilingual domain “from Hōdu to Kush” (i.e from India to Sudan) were
conducted in Aramaic, which had been the native language of the (preceding)
Babylonian empire.
A further use that was discovered for written language during this period
of the mid first millennium BC (called by German philosopher Karl Jaspers
“the Axial Age”). This was the writing down of religious scriptures, making
permanent and evident (just like the ruler’s commands) the tradition passed
down by the priests. Since writing was highly explicit in its form, this also had
the effect of sanctifying a particular style and dialect of language. Once writ-
ten down, language does not change, even if it is read by successive genrations
whose spoken languge is different, and the result was to give scriptural lan-
guage an authority that surpassed any vernacular dialect.
The institution of human empire, then, with its three clear prongs of mili-
tary command (including taxation), commercial exchange and religious wor-
ship, all created a need for a large scale super-language – a lingua franca – with
currency that extended beyond the range of any vernacular, and inevitably a
language which could be written down fully and clearly.
With this established, the tendencies which have since made for language
endangerment were all in place. As the empires expanded, and became
126 Ostler
hardened by inherited loyalties, there was less space left for communities
which relied only on their own traditional vernaculars; inevitably, these would
only survive in less favoured sites, such as marshes, mountains and deserts. As
the empires tended to use their power to monopolize wealth, there would be
little left for the free peoples oustide. The hierarchy of command and wealth in
the empires would be reflected in a class system with defined échelons of rank:
but the outsiders who spoke their own language would be seen as beyond the
pale. And with the establishment of religions, notably the missionary faiths
Buddhism, Christianity and Islam, the denizens of the large empires would
also see themselves as having virtue too that was superior to the “lesser breeds
without the law”.
Although the small tribes had shown the variability and innovations which
once provided the wherewithal to spread humanity into every corner of the
habitable earth, they were in effect disrespected and humbled when humanity
had turned the world – and above all, its sources of food – into its own domain.
Language endangerment, then, which can been defined as the pressure to
give up on the use of the languages of smaller communities, is a tendency with
a long tradition. Looking, for example, at the area of western Europe, which
was largely conquered by Rome up to the first century AD, the 16 or more lan-
guages spoken in 500 BC had declined to some half dozen by 500 AD.
Most had been replaced by Latin, the languages of a large-scale empire; but
there was still a lot of use of Gothic, which had come in latterly with maraud-
ing tribes. The old languages that survived in Europe (Aquitanian aka Basque,
and Celtic in Brythonic and Goidelic forms), together with Punic and Albanian
slightly further afield, were all spoken in areas which had been on the edge of,
or quite beyond, the land occupied by Rome. Their territories were also moun-
tainous, making them of less economic value.
In the east of the Mediterranean, the Greek language, which had spread as a
lingua franca with Alexander’s empire (following on from a widespread Greek
commercial network round the whole sea), proved quite able to resist the con-
quering might of Rome: this was a case of one lingua franca (Greek) ranged
against another (Latin), and Greek won: it was already in place, already widely
known to many Romans (in both elite and slave classes), and did not yield in
prestige to Latin. It may also have benefited from the fact that there was rela-
tively less settlement of Roman colonies in the eastern Mediterranean (though
there was some – e.g. Corinth, sacked by Rome in 146 BC but re-settled only a
century later, in 44 BC, and with many Romans. It would become the capital of
the Roman province of Achaea, but still reverted to Greek.)
Triumphs have their rewards here, but events provoke tears too, and the
mortality of things touches us. Still, dissolve your fears: the renown of
this will bring you some relief.5
These words are spoken by Virgil’s (Anatolian) hero Aeneas, once a Trojan, but
soon to be Roman, as he contemplates a monument to his own city’s destruc-
tion. The fact that we may understand the vast forces that annihilate some of
our valued possessions will not lessen the regret at their loss: and loss is just
that – it does not necessarily lead to a corresponding, or even greater, gain
further down the line.
Look at the loss of the Celtic languages, once spoken all over the west of
Europe and Iberia, but all overwhelmed in Gaul and in Spain by Latin. (In
fact the modern Romance language are spoken almost exactly in the same
places where once was Celtic.) An artefact like the Gundestrup Cauldron from
Denmark, which shows a variety of strange details associated with the antler-
bearing god Cernunnos, makes it evident that he had a highly complex myth:
but we shall never know its content, since Greeks and Romans saw no reason
to write it down – neither in their own languages, nor (much less) in its own
Gaulish.
In fact, the loss of a language, and the community that was identified by
it, is not actually necessary in these cases, since the lingua-franca which has
endangered it could just as well be acquired in bilingualism with it. What
causes the loss is the lack of respect or affection for the old order – the
class-based contempt for others’ way of life and web of old relationships, all
articulated in a language which has fewer speakers.
Indeed, the survival of diversity can be a source of strength, though it is
not typically seen as such by those who have no concern for endangerment.
Andaman Islanders showed amazing savoir faire, and lack of casualties, when
confronted with the vast tsunami that devastated coasts all round the Indian
Ocean; and in both world wars, the USA benefited from having access to
American Indian language speakers to use as “code-talkers” – since their lan-
guages (such as Navajo, Choctaw and Cherokee) were effectively undecipher-
able to intelligence officers in enemy powers such as Germany and Japan. It
takes an extreme situation to bring out the salutary value of diversity and flex-
ibility. But it is evident to those who value their own traditions that something
important stands to be lost.
There is in fact a curious irony about knowledge and the value of diversity.
Typically, those who represent, by their language and their lives, a minority
culture will not see its value as stemming from its diversity: they just want to
hold on to what has been their own. It is the representatives of empire – often
native speakers of the lingua franca – who are in a position to be conscious of
diversity as a value in itself, indeed a form of cultural wealth.
Georgia Russia
Tbilisi
Ordu Trabzon Rize
Kars
Armenia Azerbaijan
Erzurum
Iḡdir
Turkey
Malatya Van
Tabriz
Diyarbakir Siirt
Batman
Mardin
Urfa
Hamedan
Syria Iraq
Kenmanshah
Terek Scale
C 0 50 100 km.
A
U 0 50 100 ml.
C
A
E A S T E R N
( C A S P I A N
S
U
42○ S 42○
GANI
M
O
U P P E R S E A (W E S T E R N G R E AT S E A ) Kur Ala U
zon
i N
k)
(BLACK SEA) T
Irma A
(Kizil I
N
Ku S
r
P
O N
T
(Keik I C M T S . M. Aragatz
it) L.Gagham
40○
G R E A T
S E A )
40○ (Sevam)
) ARGISHTINHINELE
ys Kizil Irm
ak) axes EREBOUNI
(Hal (Ar
Kur
se M. Ararat
DOUCKAMMA Euph rates Era
(Firat)
U R A R T U
ARZASKU Erase (Araxes)
S E A
l
KIA cani M. Sipan
Ara Sea ANIASTANIA
iri
Na
MALATYA per n)
T A U Up (Va TUSHPAH (VAN)
38○ R NIHIRIANI 38○
an
U S
Lowe
M
Seyh
de
O U N T A I N S
na
r Na
Tigris Sahand
ra
MARASH
iri Se
.
Pu
S A S S Y ARDINI/
T
n
R I
a
ha
Euphrates
A N
(Urm
M KUMAHA MUSASIR
Jey
S E M
ia)
Se
SAMAL O P I
fid
N R Grea
A E t Za MEYSHTA SIBAR
GREAT SEA OF
ARPAD HARRAN
ALALAH SHNGAR NINIVE ARBAILU
36○ (ERBIL) IZIRTU 36○
RATSAPA ISANA
UGARIT QULHU IMGUR-ENLIL
36○ 38○ 40○ 42○ 44○ 46○ 48○
east and west to the south of lake Van. It finally yielded to the Mede Cyaxares
in 612 BC, but shortly afterwards it was incorporated by Cyrus into the Persian
(Achaemenian) empire.
Urartian is a language attested in cuneiform inscriptions which have survived
to the present day (also called Haldian, after the empire’s ruling deity H̬ aldi,
a warrior god who usually depicted standing on a lion), and is related both to
Hurrian to its west, and Lezgian and Avar to the north in the Caucasus. But
it did not survive the collapse of the Urartian state. Instead, the Armenians,
with their Indo-European language separate from Iranian, appear to have
replaced it. The first Armenian-speaking dynasty was called Orontid (prop-
erly Eruand, later Yervanduni). It is clear, however, that there was consider-
able bilingualism between Armenians and Iranians in this area, since the
vocabulary of the Armenian language (called Hai by its speakers) is full of
Iranian loans.
This replacement seems comparable to the emergence of English from
under French in 14th-century England, after that country recovered from the
demographic (and social) shock of the Black Death. Perhaps because there was
Empire, Lingua Franca, Vernacular 131
little social mobility before the shock, it had not been possible for the elite
lingua-franca (respectively Urartian or French) to permeate into lower orders;
but when the aristocracy were shattered, the vernacular could emerge.
In fact, there is no literacy to attest the actual language being spoken until
two more conquests had passed over and around the Armenians: first the
Macedonian Greek conquest from the late 4th century (led by Alexander,
reinforced by his general Seleucus, and under Antiochus III finally depos-
ing the Orontids in 212 BC); then the Roman conquest (the general Lucullus
defeated Pontus and Armenia at the battle of Tigranocerta in 66 BC). One long-
lasting effect of the Roman domination of Anatolia was the penetration of
Christianity. This reached Armenia at the end of the 3rd century AD (hitherto
largely a Zoroastrian country), and the country has the distinction of being
the first officially Christian state. This led after a century (in 405) to the inven-
tion of the Armenian alphabet by Mesrop Mashtots, and the translation of the
scriptures and liturgy into Armenian.
Scale
S A R M A T I A ????? 0 50 100 km.
C
A 0 50 100 ml.
PITYOUS U
SEBASTOUPOLIS S
A
C A S P
KOL KH I S
NAKALAKEVI KUTATISI S
42○
U 42○
IBERIA
S
PHASIS
?? URBNISI MTSKHETA
B L A C K S E A ?? UJARMA M
I A N
? TCHOR
? AMINOS RUSTAVI A ????O U
???? L ?
N
B
A
N T
A I
TRAPEZOUS I A
KAPALAK N S
NEO CAESAREA
AMASEIA P O S LPNATS
N T I
C M O U N T A I N
??
Ge
??
R gh
?
A A ra x A r a xe s
??
N VAGHARSHAPAT ARTASHAT
SEBASTEIA tes PAYTAKARAN
E Euphra G R E A T E R A R M E N I A
M
???
? P ZAREHAVAN
IR ZARISHAT
NAKHICHEVAN
A r a xe s
CAESAREA E ????
k
MELITENE uni HER
T A B z nVa n )
38○ U R ( VAN/SHAMIRAMARKERT 38○
U S MARTYROPOLIS
M S
????
O U N T A I N
Z
????
Kap
Tig ri s
A
GERMANIKEIA AMIDA P E R S I A N
es
ANAZARBA
G
uta
rat
AFRAZA
ph
HIERAPOLIS
n
BIRTHA
Eu
MOPSOUESTIA ZEUGMA
O
EDESSA TELA Gr
KYRRHOS SELEUKIA APAMEIA ea GANJAK
S
ANTHEMOUSIA Tig t Za
NISIBIS
A
b E M P I R E
E
RHOSOS SHIZ
HIERAPOLIS
M
INTER
36○ 36○
T
The Arsacids (a Persian dynasty) had gained control of Armenia in the first
century AD, and may have reinstated elite use of Persian in this region, but this
appears to have had little effect on continuing vernacular use of Armenian.
Politically and religiously, Armenia was even autonomous from Persia after 451,
hence effectively a monolingual country.
This would only change in the 11th century. First the Byzantine Greeks
invaded. This would have had little effect on language use, except in court cir-
cles. But then the Seljuq Turks (under Alp Arslan) invaded Armenia, as a first
fruit of the Turkish invasion of Anatolia more generally, enabled by their vic-
tory at Manzikert in 1071. This invasion actually involved long-term settlement,
hence penetration of Anatolia by the Oǧuz Turkish language, side by side with
Armenian, but not as an elite lingua-franca, since for the Turks this role would
be played by Persian (or the mixture of Persian and Turkish known as Lisân-ı
Osmânî – “Ottoman”).
So far, our story has extended over at most 2000 years. But for most of the
following millennium, the pace of change slows down. Armenian and Turkish
co-existed (with Greek) as vernacular languages within the domain of the
Ottoman empire.
The latest accident of history (in the last century) is to segregate languages
much more into discrete states, so that Ardahan is capital of a Turkish-speaking
Empire, Lingua Franca, Vernacular 133
In the Ardahan region over all these millennia, the languages which have been
endangered – indeed which have ultimately disappeared – have not been the
languages of down-trodden lower orders, but the reverse, the languages of
elites.
These are the languages which – ironically enough tend to survive in the
written record, because they are used for royal inscriptions: Urartian from the
10th to 6th centuries BC, Old Persian and Imperial Aramaic from the 6th to
3rd, Greek from the 3rd BC to 10th AD, Persian and Osmani from the 10th to the
20th AD. But actual speakers and users have ultimately deserted them.
It is the languages of the people, Kurdish, Armenian and Oǧuz Turkish
which have shown permanence: Kurdish and Armenian throughout the 3000
years we surveyed, (though the presence of Armenian in eastern and central
Anatolia have recently been restricted artificially); and Turkish ever since its
arrival with Seljuq settlers in the 11th century.
How has this divergence been possible, from the general human pattern
outlined at the beginning of this paper? Are not dominant classes more capa-
ble of hanging on to, and indeed spreading use of, their languages?
The answer seems to be that this only applies when dominant urban upper
classes are in contest with an urban proletariat. By contrast, farming, i.e. set-
tled cultivation of the land, is more effective in perpetuating a language com-
munity not only than the hunter-gather lifestyle which historically it replaced,
but also than the elite urban cultures which grow up, ultimately supported by
the surplus of food created by farming. It was the urbanized classes, upper and
lower, which were at risk of being “driven out” when their cities are conquered
and put under new rulers.
As an Irish immigrant put it in Margaret Mitchell’s popular 1936 novel
Gone with the Wind, “Land is the only thing in the world that amounts to
anything, . . . for ‘tis the only thing in this world that lasts, and don’t you be
forgetting it!”
Empires, then, do have an effect of oppressing, and ultimately suppressing,
minority languages within their domains, provided that there is a lively pros-
pect of recruitment of members of lower orders into richer and more domi-
nant classes. Demeaned classes may try to ape their presumed betters, leaving
their own languages behind. If this kind of mobility is ruled out, then there
134 Ostler
Reference
The Caucasus is perhaps the foremost region in the world in terms of endan-
gered languages. When we analyze the linguistic connection between Turkey
and the Caucasus, among the first languages that stand out are the Laz lan-
guage, and Hamshin Armenian. What I shall be focusing on in this study
however, takes us farther east to the Altai Mountains; a region that bears strik-
ing similarities in its topography to Ardahan and the Caucasus. A number of
Chinese sources name Jin Shan (金山 “Gold Mountains”) as the region Turkic
peoples settled in.1 On that note, let us mention the peoples that first appeared
in this area and belong to the Turkic ethno-linguistic group. Those that remain
to this day within the borders of China are:
Only six group of peoples among those listed continue to effectively use their
native language. Of these languages, the speakers of Uzbek have recently given
up speaking their own language and started speaking Uyghur. Consequently,
it is hard to consider Uzbek as one of the Turkic languages spoken in China.
The same applies for Tatar and Fu-yü Kirghiz as well. Middle and Old genera-
tions speak Mongolian (“Ölöt Mongolian”) while the young generation speaks
Chinese. The related details will be given below.
In China, more than one language belonging to the Turkic language group
are spoken. And notably, we see isolated Turkic languages spoken in China.
In 1970s, nine Turkic languages existed in China. Unlike the above mentioned
classification, these languages can be listed based on population/speaker as
follows: Uyghur, Kazakh, Kirghiz, Salïr, Uzbek, Yellow Uyghur, Tatar, Tuvan
and Fü-yu Kirghiz. Although not up-to-date, the demographics of China can
be viewed on Chinese articles.2 The list shows the languages and the number
Uzbek
The Uzbeks living in China do not have their own autonomous prefecture.
Having been scattered to the north and south of Tian Shan 天山 (Uyg. Tengri
Tagh; literally “God’s Mountains”) in Xinjiang Uyghur Autonomous Region,
most Uzbeks live in the cities. The Uzbek population is centered in İli, Kashgar,
Yarkant, Urumchi, Aksu, Uchturpan, Chöchek and Karghilik. The population is
about 20,000 (Özbek Edebiyati Tarihi, 2005, p. 9). In China, a written language
or an alphabet peculiar to the Uzbeks do not exist. As mentioned above, the
Uzbeks use Uyghur as a written language. Adalaiti Abdulla’s study will give us
more detailed information (Abdulla 2013).
Tatar
According to the 2006 census, the Tatar population in China is around 5400.
Most of the Tatars live in Xinjiang Uyghur Autonomous Region, densely in
Urumchi, Gulja and Chöchek. Like the Uzbeks, the Tatars do not have their
own written language or alphabet. Ersin Teres’ study will give us new informa-
tion (Teres 2011).
Within the scope of my research, let us now analyze the four languages three
of which are spoken outside of Xinjiang, and one in Xinjiang.
Fuyu Kirghiz
The Fuyu Kirghiz are a turkic community whose language and society we
have the least information and material on. The earliest information we have
about the subject relies on the collected works and studies of Hu Zhenhua,
faculty member of Kirghiz/Kazakh Language and Literature Department
in Zhongyang Minzu Daxue (Minzu University of China; formerly known as
Central University for Nationalities). The information I will be providing here,
however, is mainly from the field work that the faculty members of Philology
Department in Seoul University, my colleagues from Korea Altay Research
Institute and I did in September, 2003. The Fuyu Kirghiz live in the villages of
Fuyu County, which is under the administration of Qiqihaer City in China’s
Heilongjiang (widely known as Manchuria) province. Named after the afflu-
ents of the Amur River (Heilongjiang “Black Dragon River”), Fuyu is a residen-
tial area of 30,000 people and 1 hour away from Qiqihaer. Although the Fuyu
Kirghiz are scattered around a few villages, they mainly live in two villages. The
research group I was a part of made a compilation in the villages of Wujiazi and
Qijiazi in the September of 2003 (my studies on the Fuyu Kirghiz people and
Fuyu Kirghiz was carried out within the frame of “Korea Research Institute”
project numbered KRF-2003-072-AL2002).
Since Hu Zhenhua visited this region, the number of speakers of Fuyu
Kirghiz have decreased and as of 2000s no native speakers remain. Today, the
138 Ölmez
Fuyu Kirghiz use Ölöt Mongolian and Chinese in their daily lives. During the
compilation work we did among the Fuyu Kirghiz, whose number reaches up
to 1000, we could talk to only seven people. Two of them were over 50 and the
others were over 70. Our best source who was over 70 stated that she used their
native tongue when she was 18. The population of Wujiazi and Qijiazi villages
consists of different communities and the Fuyu Kirghiz make up nearly half of
it. Despite all our efforts, it was not possible to fill in the half of 3000 question-
naires. However, even the limited material compiled is enough to show that
this language is a branch of Khakas and Shor. After all of the compilations are
discussed and published, useful results will come up for South Siberian Turkic
languages in particular and for all Turkic languages in general. The first studies
related to the data we collected were published in 2007 and 2010/2011 (see. Li,
Ölmez and Kim, 2007 and 2010/2011).
With respect to the importance of Fuyu Kirghiz, we can safely conclude
from our work based on limited language material that Fuyu Kirghiz is a
Turkic language that falls into the same category as South Siberian Turkic lan-
guages and basically modern Khakas written language. On the basis of the data
related to that language, we can reach several results about the phonetic prop-
erties and vocabulary of South Siberian Turkic languages and modern Turkic
languages. First of all, the general belief among the local people is that they
came to that region after the second half of the 18th century. In fact, after the
Manchus started to rule over China, the Manchus had several Mongol clans
settle (mainly Dagur Mongols) in the region. Among those clans coming from
Southern Siberia, there was the Fuyu Kirghiz who today call themselves Kirghiz
and speak the same Turkic language with today’s Khakas.
When Fuyu Kirghiz is compared with Old Turkic and modern Turkic lan-
guages in the light of previous studies, a few characteristic features can be
listed as follows:
1. Old Turkic and Common Turkish sound y- becomes c-: OT yap- “to cover,
close” = ǰap-; OT yay “summer” = ǰay; OT yė- “to eat” = ǰe-; OT yürek “heart”
= ǰürüh;
2. Old Turkic and Common Turkish d sound becomes z: uzï- “to sleep”, ɢizin-
“to get dressed”, güzi “groom”;
3. Final -g of a word becomes -h: OT tag “mountain” = tah, OT sarïg “yellow”
= sarïh, OT ulug “great, big” = uluh.
In order to make a comparison with modern Turkic languages, I can give a few
short sentences from our field reseach result:
Endangered Turkic Languages From China 139
What we see here as tart- (to smoke) corresponds to that of most Turkic
languages.
Likewise, aragï “drink” has the same root as Turkish rakï (a kind of alcoholic
beverage), now common in all Altaic languages as a borrowing from Arabic
through Persian (?).
ïzïrt- is related to Old and Middle Turkish esür-, esürt- “to become drunk”
(cf. Anatolian dialects: esirik, esirikli “fractious; little bit crazy”).
In the fifth sentence, gïlçïk is not anything else but Old Turkic < kelyük “has
come” (for details see. Ölmez, 2007).
boz is another form of Old Turkic bod “himself”.
buzïh is Fuyu Kirghiz form of Old Turkic bedük “big”.
olïh is the word ulug in Old Turkic.
Regarding these examples, you can see Ölmez, 2001; Ölmez 2006a, b and c.
The first work on the Fuyu Kirghiz and their social life was prepared by Gundula
Salk and Mambet Turdi. The last and detailed work belongs to Mixail Čertïkov
(Čertïkov, 2008a and 2008b).
Below I provide the words “moon”, “day” and some number names in order
to show the phonetic similarities or differences between Fuyu Kirghiz and
Turkic languages and Turkish.
Salïr
The first data on the Salïrs are based on the works conducted in the West which
are higher in number when compared to Turkish sources. At this point we need
to mention Russian scholar Potapov’s compilations based on the last decades
of the 19th century (and based on that N. Poppe’s work), the compilations and
publications of E. R. Tenišev, text publications of Zsuzsa Kakuk, the works of
anthropologist Kevin Stuart, the articles of R. Hahn and recently the works of
Arienne M. Dwyer.
140 Ölmez
As to the origin of the Salïr, we cannot find detailed information about their
history in Chinese resources. Related to the subject, one-folio long data found
and published by Tenišev is important and interesting (Tenišev, 1977); accord-
ing to the story circulating among themselves, they came to the region from
Samarkand at the end of the 14th century; they chose that place as their home-
land since the soil and water was similar to those of their motherland and the
accompanying camel, which was carrying a Qur’an on his neck, was found pet-
rified exactly at that point (Altiūli) after getting lost: that event is the subject of
the story named döye yül ~ döye yuli “camel spring” (Chin. 駱駝泉 luo tuo quan).
The altitude of the land they live is about 1800 meters. In a region where the
Yellow River originates and reaches the inner parts of China, the Salïr usually
settle along the “river”. Since it is the sole and closest river, the Salïr call the
Yellow River morun ~ morïn (< Mo.) for short; they never use the word “yellow”.
They have very good relationships with the Tibetans, especially with those liv-
ing on the mountainous parts. The Tibetan villagers usually shop from Salïr
shops. They speak Salïr effectively among themselves and in their daily lives.
They are engaged in agriculture relying on irrigation on vast plains. Apart from
Qinghai, there are a few thousand Salïr living in the city of Gulja, in Xinjiang
Uyghur Autonomous Region (I will deal with this issue in another paper; the
details are in the book of Abdurishid Yakup, Yakup, 2002). Those living within
the borders of Gansu (according to the information Yunus provided) have
almost forgotten the Salïr language.
Today, the Salïr call Xunhua as “Salïr” among themselves: Men salïrga
(~ Salïːga) vargur (~ vaːgur) “I will go to Xunhua”. They don not use the name
Xunhua.
With intensive efforts to learn their history and language as of 1990s, the
Salïr started to publish a periodical, which includes Latin lettered texts, issued
twice a year since 2008.
Turkic origin. Today, Tibetan gača ~ geče are used even for the words “word,
utterance, language”:
Yaxçux gačanï hemme kiš yišar, yaxçux išnï hemme kiš etmes “anybody
(can) say a good word, not everybody does a good job / not everybody
can exhibit good behaviors.”
We separate the Salïr language from Kipchak and Chaghatay and approximate
it to the Oghuz family of languages since the Salïr use the verb et- ~ ėt-, and the
word el “hand” (in Kipchak and Chaghatay qol is used instead). Certainly, com-
mon use of the sound d- (dört “four” etc.) at the beginning of a word plays an
important role in this approximation. (Several important features along with
this will be discussed in detail in another paper; this paper is a report of a short
field work carried out in August, 2009).
The number systems do not go over “forty”. Especially the men, middle-aged
people or the elderly do not know much about numbers. As to the women, I
have heard the numbers forty, fifty, sixty, even from the young and middle-aged
ones. We can relate this to the fact that the men are in more contact with other
communities then women(the Chinese, the Tibetan). According to Musa Haji
at the age of 64:
Bir elli ma on dört “fifty and plus fourteen, fifty-four” (Ol Turkic *bir elig
yme on tört)
Below are the words which I heard on the first day and noted in a hurry in order
of subject and hearing. More information on the material I recorded and the
resources I collected will be included in another paper.
Atuh (~kȯp) išse kursagïm agrïr. “if I eat a lot, I will become ill; if I eat a lot,
I will have a stomachache”
Saŋa bala neče (~neǰe?) var? segis, oːl dört, ane dört; suŋzï on segis vara. “how
many kids do you have? Eight, son four, daughter four, grandchildren eighteen”.
sïh “right, right hand side; well, healthy”
dal “tree”
ağaš “wooden, timber; wood”
ağašli “forested; the forested village”
emih “bread”
hos “walnut”
ǰiǰek “flower”
ėt- “to do, to make”
Endangered Turkic Languages From China 143
Salïr and Modern Uyghur can be compared in terms of similarities and differ-
ences with the following examples:
Salïr Uyghur
sarï sėriq “yellow”
sağal saqal “beard”
satïhǰï sėtiqçi “seller, merchant”
satïhlï iş soda “trade”
sen- öçmek “go out”
sender- söndürmek “extinguish”
sinih söŋek “bone”
sïh oŋ “healthy”
soğan piyaz “onion”
sor- sori-/sora- “to ask”
sorma harak “drink, alcoholic beverage”
suva- ussi- “to become thirsty”
süt süt “milk”
Yellow Uyghur
The population of the Yellow Uyghur living in the Gansu province of China,
in Sunan Uyghur Autonomous Region close to the city of Zhangye and on the
surrounding tablelands is over 10,000. Originally called Yellow Uyghur, this
community refer to themselves shortly as Yogur ~ Yugur (Chin. Yugu). Those
who speak a Turkic language are called Sarïg Yogur, while the others speaking
Mongolian are called Shira Yogur. Their names are Western Uyghur (Xibu Yugu
西部裕固) and Eastern Uyghur (Dung Yugu 东部裕固), and their languages are
Western Uyghur and Eastern Uyghur (Xibu Yuguyu and Dongbu Yuguyu 西部裕
固语 and 东部裕固语), respectively.
The history and the language of the Yellow Uyghur belonging to old peri-
ods are known quite well compared to the other Turkic people in China. The
144 Ölmez
Yaglakar clan of the Uyghurs who established the khaganate and ruled for
almost a hundred years (744-840) in Mongolia after the Uyghurs ended the
Turkish rule (Tujue 突厥) live in the Mongolian-speaking region (see Ölmez
2012). We can look at Ariz, 2002, a work that gathers other works regarding the
Yellow Uyghur.
Living on the tablelands around Sunen, today the Yellow Uyghurs move to
the tablelands of 2000-2500 metres high. With the coming of Spring, they go up
to 3000-3500 metres high tablelands and to 4000 metres high tablelands with
the coming of Summer (personal information received from Arslan).
We need to mention G. N. Potanin as the first Western scholar doing research
about the Yellow Uyghur. Having organized excursions to the Tangut region
between the years 1884-1886, Potanin visited Yugur villages and residences; he
collected data about the Mongolian-speaking Shira Yugurs and Turkic-speaking
Kara Yugurs. According to Potanin, the Kara Yogurs are divided into two fac-
tions as Yaglak and Hurungut. These factions also are divided into smaller
families (= otok). In the following years (1906-1908), C. G. Mannerheim arrived
in the region and compiled texts both from the Yellow Uyghur and the Shira
Yugurs. That was followed by Malov’s excursions (1909-1913) and detailed stud-
ies. After Malov, E. Tenišev took part in the activities of the People’s Republic
of China about compiling Chinese minority languages; he made publications
about text, dictionary and grammar studies. Afterwards, Lei Xuanchun and
Chen Zongzhen made related publications in China. Today, Martti Roos, Erkin
Ariz and Zhong Jingwen conduct studies in the field.
With regard to the language of the Yellow Uyghur, they generally have been
assumed to be the descendants of the Old Uyghurs presumably due to their
names and some secondary language properties. As we mentioned above, it
would be more accurate to classify them as the relatives of the Uyghurs who
migrated to the region from Mongolia rather than assuming them to be the
direct relatives of the Turfan Uyghurs. They have, naturally, connection with
the Turfan Uyghurs. We can compare that with the migration of some Buddhist
Uyghurs to the east, Dunhuang due to expansion of Islam in the Turfan region
and with the Old Uyghur Altun Yaruk Sudur found by S. Ye. Malov.
We should note that Yellow Uyghur shows similarities with Khakas language
in terms of some phonetic evolution: OT -d-, -d becomes -z-, -z (OT adak “foot”
> YUyg. azak, OT adgïr “stallion” > YUyg. azgïr, OT ïd- “to send” > YUyg. ïz-).
However, it differs from Khakas in some aspects, for example OT vowel y- regu-
larly becomes č- in Khakas, while sometimes it retains itself in Yellow Uyghur:
OT yïl “year”, Khak. čïl, YUyg. yïl; OT yïltïz “root”, YUyg. yiltïs; OT yigit “young;
strong” Khak. čit, YUyg. yïgït, yigit. OT consonant b- regularly becomes p- in
Endangered Turkic Languages From China 145
Khakas while it is seen both as b- and p- sound in Yellow Uyghur (see Ölmez,
1996 and 1998).
To put it precisely, in Yellow Uyghur the OT consonant -g seen at the end of
the polysyllabic words is retained as -k/-g but the consonant d becomes z as in
the above given examples.
Another old feature of Yellow Uyghur is seen in the number system. It shows
similarities with Old Turkic: yidigirma “17” < yėti yėgirmi; sagïs yigirma “18”
< sekiz yėgirmi, per otut “21” < bir otuz (see Clark, Geng & Clark).
In some words, glottalization occurs before the unvoiced consonants k and t.
YUyg. ahldï “altï” < OT altï, YUyg. tohɢïs “dokuz” < OT tokuz.
Tuvan
Unlike the above mentioned Fuyu Kirghiz, Salïr and Yellow Uyghur, Tuvan is
not a Turkic language exclusively spoken in China. Most Tuvans live in the
Tuva Republic within Russia (in fact an autonomous region), in several cities
of Russian Federation and in some regions of Mongolia. The Tuvan language
spoken in China does not differ much from the original Tuvan; therefore, I will
not dwell on the language of the Tuvan people of China in this short paper. I
will just touch upon the differences between the Chinese Tuvan and the Tuvan
spoken in Tuva. For Tuvan, see Ölmez, 2007, p. 25 and other. For the Tuvans liv-
ing in Mongolia see Erika Taube’s works. The Tuvans of China became known
better with the works of Geng Shimin, Talant Mawkanuli and Song Zengchun.
The Tuvans living in Mongolia, their folklore, population and traditions have
long been known thanks to the studies of Erika Taube. However, the villages
and the towns and the population of the Tuvans in China are not documented
as much when compared to Mongolia. Therefore, Marina V. Monguš went to
the region, conducted field work and gave information about the Tuvan vil-
lages and towns with pictures. According to that information, the Tuvan living
in China belong to the Ak Soyan (White Soyan) and Kara Soyan (Black Soyan)
clans (2002, p. 21). Marina Monguš touches on the current situation of the
Tuvan language and adds an interview at the end of her book. The interview
made with Daš Čömblöv is given below. I would like to add that I interviewed
the same source in Altay in 2004 (2002, p. 105):
As is understood from the answer, Daš Čömblöv speaks three languages very
well but speaks the fourth language poorly. The interview is from the year 1993.
Yet, considering my personal observation I can safely state that the situation of
the language in the Altay region of China has changed quite a lot since then;
the Tuvan improved their Chinese; especially the Tuvan state officers speak
Chinese very well. Those who receive education in Urumchi also speak Uyghur
in addition to these four languages. To conclude, a well-educated “Monchak”
speaks five languages including their mother tongue.
This interview was made in 1993 with Daš Čömblöv who was born in 1962.
The interview was written with present-day Tuvan alphabet but the punctua-
tion reflects the punctuation of the Tuvans living in China. For example, in
Tuva хꝋвей is used with x-, however, only кꝋвей is used in the interview made
with Daš Čömblöv. Also, the word газыр which is derived from Kazakh qazïr is
used to mean “now, at the moment, today” (Kazakh < Arabic). Another notice-
able usage is the use of ивяааш instead of Tuvan эвээш.
In the later parts of the interview, Daš Čömblöv Oronbayv briefly tells how
their grandfather migrated from Russia to Kanas and met other Tuvans there;
according to him, that migration occurred in 1913. Two women whom I talked
to in Kanas told me a similar story. They, however, moved from Mongolia.
As mentioned before, the Tuvan population concentrated in 3 villages:
Kanas (Hanas), Akkaba and Kom. Kom is essentially a Tuvan village. Despite
being mainly populated by the Tuvans, Kanas is also home to the Kazakhs and
the Mongolians. Half of Akkaba’s population is Kazak and the other is Tuvan.
The Kazakhs call the Tuvan as “Kök Monchak”. The Tuvans, however, call
themselves diva or monchak. For population etc. see Mawkanuli, 1999, pp. 1-36;
Ölmez, 2007, pp. 25-29; Yolboldi and Kasi̇, 1987, pp. 287-289. Among the places
where the Tuvans live are the cities of Bowurǰin and Altay, and the towns of
Köktogay and Lamajao. Since it is difficult to distinguish the Tuvans from the
Mongols regarding lifestyles and beliefs, the Tuvans were accepted as Mongols
and were not included among the 56 minority groups in China’s censuses.
Chinese Tuvan was first made known by Geng Shimin. Russian Mongolist
B. H. Todayeva met people with a language resembling Mongolian but unfa-
miliar to him while he was identifying and recording Mongol languages in the
Altay region. With the opinion that the people were speaking a Turkic lan-
guage, he informs Geng Shimin, and thus the research of Geng Shimin begins
(this is what I was told by Geng Shimin).
Endangered Turkic Languages From China 147
Unlike Tuvan, words do not start with an h- sound but always a k- sound:
kep, kerek, köl, kün, küreş (Tuvan hep “shape, form”, herek “necessary”, höl “lake”,
hün “day, sun”, hüreš “wrestling”) etc.
In Tuvan standard written language č- is systematically used instead of OT
y-, but there are some words pronounced with ǰ- sound. In Chinese Tuvan,
however, only c- sound occurs: ǰan-, ǰït-, ǰi- (Tuvan čan- “to turn”, čït- “to sleep”,
či- “to eat”).
The Tuvan language spoken in Chinese Altay includes many Kazakh
(<Arabic) words such as mekeme, mekdep etc. while Tuvan from Russia do not.
We can compare some words used in both regions:
Bibliography
Abdulla, Adalaiti. 2012. Field Research on the Uzbek Language in China. Dilleri ve
Kültürleri Yok Olma Tehlikesine Maruz Türk Toplulukları, 4. Uluslararası Türkiyat
Araştırmaları Sempozyumu Bildirileri, ed. M. Erdal, Y. Koç, M. Cengiz, Ankara,
pp. 25-34.
Ariz, E. 2002. Sarı Uygur Araştırmalarının Toplu Bir Kaynakçası. KÖK Araştırmalar: Kök
Sosyal ve Stratejik Araştırmalar Dergisi: Vol. IV, S 1, pp. 117-148.
Chen, Z. 2004. Xibu Yuguyu yanjiu: Beijing.
Clark, L. 1996. Sarig Yugur Historical Linguistics and Early Turkic Counting system.
Turfan, Khotan und Dunhuang. Vorträge der Tagung „Annemarie von Gabain und
die Turfanforschung“, veranstaltet von der Berlin-Brandenburgische Akademie der
Wissenschaften in Berlin (9.-12.12.1994): (R. E. Emmerick, W. Sundermann, I. Warnke
u. P. Zieme, Ed.), pp. 17-49.
Çağatay, S. 1962. Sarı Uygurların Dili. Türk Dili Araştırmaları Yıllığı Belleten 1961:
pp. 37-42.
Čertïkov, M. A. 2008a. “Fuyuyskie Kırgızı. İstoriko-etnogragiçeskoe Issledovanie” [I].
Sibirische Studien: Vol. 3, S 1, pp. 45-126.
Čertïkov, M. A. 2008b. “Fuyuyskie Kırgızı. İstoriko-etnogragiçeskoe İssledovanie” [II].
Sibirische Studien: Vol. 3, S 2, pp. 141-256 (and following articles).
Čertïkov, M. A. Tarbagataiskie Kırgızı. http://www.kyrgyz.ru/?page=297.
Dwyer, A. M. 2007. Salar, A Study in Inner Asian Language Contact Processes, Part I:
Phonology: Wiesbaden.
148 Ölmez
Geng, S.; Clark, L. 1992-93. Saryg Yugur Materials. Acta Orientalia Academiae Scientarum
Hungaricum: Vol. XLVI, pp. 189-224.
Geng, Shimin. 2001. Materials of Tuvanian Language in China (4). Türk Dilleri
Araştırmaları: Vol. 11, pp. 5-21.
Hahn, R. F. 1998. Yellow Uyghur and Salar. The Turkic Languages: (Ed. L. Johanson and
É. Á. Csató), Routledge, London and New York, pp. 397-402.
Hermanns, P. M. 1942-44. Uiguren und ihre neuentdeckten Nachkommen. Revue
Internationale d’Ethnologie et de Linguistique / Internationale Zeitschrift für Völker-
und Sprachkunde: ANTHROPOS, Xxxv-Xxxvi, 1940-1941, pp. 78-99.
Hu, Zhenhua H. 1983. Heilongjiang Fuyuxiande Ke’erkezi-zu ji Qiyuyantedian.
Zhongyang minzuxue yuanxue bao: pp. 65-69.
Hu, Zhenhua H. 1991. Heilongjiangsheng Fuyuxiande Ke’erkezizu ji Qiyuyantedian.
Zhongguo Tujueyu Yanjiulunwenji: pp. 253-263.
Hu, Zhenhua H.; Imart, G. 1987. Fu-Yü Gırgıs: A tentative description of the easternmost
Turkic Language: Bloomington, Indiana.
Kakuk, S. 1961. Textes salar. AOH XIV: Vol. 1, S 2, pp. 95-117.
Lei, X.; Chen Z. 1992. Xibu Yugu Hanyucidian: Chengdu.
Li, Y.; Ölmez, M.; Kim, J. 2007. Some New Identified Words in “Fuyu Kirghiz (Part 1)”.
Ural-Altaische Jahrbücher: Vol. 21, pp. 141-169. Neue Folge.
Li, Y.; Ölmez, M.; Kim, J. 2010/2011. Some New Identified Words in “Fuyu Kirghiz
(Part 2)”. Ural- Altaische Jahrbücher: Vol. 24, pp. 165-188. Neue Folge.
Malov, S. Ye. 1957. Yazık jyoltıh Uygurov: Slovar’ i grammatika: Alma-Ata.
Malov, S. Ye. 1967. Yazık jyoltıh Uygurov: Teksti i perevodı: Moskva.
Malov, S. Ye. 2005. Sarı Uygurlarda Şamanlığın Kalıntıları. (Çev. M. Duranlı). Türk
Dünyası İncelemeleri Dergisi. Vol. 5, Nr. 2, pp. 391-399.
Mannerheim, C. G. E. 1911. Sarö and Shera Yögurs. Journal de la Société finnoougrienne:
Vol. 27, Nr. 2, pp. 1-72 + 5˚ + 1 harita.
Mawkanuli, T. 1999. The Phonology and Morpfology of Jungar Tuva: Unpublished
Doctoral Dissertation. Indiana University.
Mawkanuli, T. 2001. The Jungar Tuvas: language and national identity in the Prc. Central
Asian Survey: Vol. 20, S 4, pp. 497-517.
Mawkanuli, T. 2005. Jungar Tuwan Texts: Indiana University.
Monguş, Marina V. 2002. Tuvintsı mongolii i kitaya. Etnodispersnıye gruppı (İstoriya i
sovremennost’): Novosibirsk.
Ölmez, M. 1996. Sarı Uygurlar Sarı Uygurca. Çağdaş Türk Dili, Nr. 98, pp. 31-37.
Ölmez, M. 1996. Tuvalar ve Tuvaca. Çağdaş Türk Dili, Nr. 95, pp. 10-17.
Ölmez, M. 1998. Potanin’s Yellow Uigur Material and its Importance Today. Studia
Turcologia Cracoviensia 5: (= Languages and Culture of the Turkic Peoples,
M. Stachowski, Ed.), pp. 149-182.
Ölmez, M. 2001. Fuyu Kırgızcası ve Akrabaları. Türk Dilleri Araştırmaları: Vol. 11,
pp. 137-152.
Endangered Turkic Languages From China 149
Ölmez, M. 2006a. Fuyu Kırgızcasında ‘değil’. Ege Üniversitesi, Türk Dünyası Araştırmaları
Enstitüsü, I. Uluslararası, Türk Dünyası Kültür Kurultayı 9-15 Nisan 2006, Çeşme-:
İzmir.
Ölmez, M. 2006b. Fuyu Kırgızcasında Geçmiş Zaman Biçimleri. Sibirische Studien:
Vol. 1, S 1, pp. 117-124.
Ölmez, M. 2006c. Fuyu Kırgızcası Hakkında Yeni Bilgiler ve Türkolojiye Katkıları.
Sibirische Studien: Vol. 2, S 1, pp. 65-70.
Ölmez, M. 2007. Tuwinischer Wortschatz (mit alttürkischen und mongolischen
Parallelen), Tuvacanın Sözvarlığı (Eski Türkçe ve Moğolca Denkleriyle): Wiesbaden.
Ölmez, M. 2012. Oğuzların En Doğu’daki Kolları: Salırlar ve Dilleri. Türk Dili: C Ciii, Nr.
732, pp. 38-43.
Ölmez, M. 2013. Çin’deki Türk Dilleri ve Bugünü. Dilleri ve Kültürleri Yok Olma
Tehlikesine Maruz Türk Toplulukları, 4. Uluslararası Türkiyat Araştırmaları Sempo-
zyumu Bildirileri, ed. M. Erdal, Y. Koç, M. Cengiz, Ankara, pp. 397-422.
Ölmez, M. 32015. Moğolistan’daki Eski Türk Yazıtları: Ankara.
Özbek Edebiyati Tarihi: 2005. Ürümči, Šincaŋ Helk Nešriyati.
Poppe, N. 1953. Remarks on the Salar language. Harvard Journal of Asiatic Studies:
Vol. 16, N 3-4, pp. 438-477.
Potanin, G. N. 1950. Tangutsko-Tibetskaya Okraina Kitaya Tsentralnaya Mongoliya:
Sankt-Peterburg, 1893, Sobranie Slov Salarskago Nareçiya: The new edition, Moskva
(1950), pp. 426-434.
Qualin, M.; Wanxiang, M.; Zhicheng, M. 1993. Salar Language Materials: (K. Stuart
Ed.). Sino-Platonic Papers Nu: 43, Department of Asian and Middle Eastern Studies
University of Pennsylvania, Pennsylvania.
Saguchi, T. 1986. Historical Development of the Sarïgh Uyghurs. Memoirs of the
Research Department of the Toyo Bunko: Vol. 44, pp. 1-26.
Salk, Gundula; Mambet, Turdı. 1998. The “Fu-Yu Gırgıs” according to the present-day
situation and the legendary past: Kraków.
Schönig, C. 1997. A New attempt to classify the Turkic languages. Turkic Languages: Vol.
1-2, pp. 117-133; pp. 262-277.
Schönig, C. 1998. A New attempt to classify the Turkic languages. Turkic Languages: 3,
pp. 130-151.
Schönig, C. 1998. Bemerkungen zum Fuyu-Kirgisischen. Bahşı Ögdisi. Festschrift für
Klaus Röhrborn/Klaus Röhrborn Armağanı: (ed. von J. P. Laut, M. Ölmez). Freiburg/
İstanbul, pp. 317-340.
Schönig, C. 1999. The Internal Division of Modern Turkic and its Historical Implications.
Acta Orientalia Academiae Scientiarum Hungaricae: Vol. 52, p. 85.
Song, Z. 1981. Tuwayu yanjiu: Beijing.
Taube, E. 1993. Zur aktuellen Situation der Tuwiner im westmongolischen Altai. http://
userpage.fu-berlin.de/~corff/im/Landeskunde/Taube_Tuwa [December 1993].
150 Ölmez
1 Introduction
Can a language itself play a role in bringing about its own extinction? Can the
complexity of the linguistic structure of a language have a negative effect on
its transmission to the next generation? With respect to the conditions that
lead to language endangerment and extinction, the issues have generally been
approached in terms of extra-linguistic factors such as forced language shift as a
consequence of military, economic, religious, cultural, or educational interven-
tion or voluntary language shift due to the speakers’ negative attitude towards
their own language (UNESCO, Nettle and Romaine 2014, Farf’an and Ramallo
2010, Tsunoda 2006). This paper considers the case of Ubykh, a Northwestern
Caucasian language which became extinct with the quiet passing away of
its last speaker, the 88-year-old Tevfik Esenç, in his home in the Hacı Osman
village of the Manyas region in Balıkesir, Turkey, on Oct. 8, 1992. The paper
basically claims that, although some of the external factors of extinction did
prevail for Ubykh, among the factors that led to its own extinction was in fact
also the language itself. In contrast to the relatively longer survival of its related
languages which were exposed to the same external factors, the fact that it was
Ubykh which became extinct seems to suggest that this might indeed be the
case, i.e. that the complexity of the linguistic system, specifically of the phono-
logical system, of the language contributed negatively to the transmission of
the language to the younger generations and therefore constitutes one of the
factors to be considered in the endangerment of the language.
It is a fact that the Ubykh children acquiring language were exposed to
linguistic systems with much simpler phonological systems than Ubykh’s. It
is therefore not inconceivable that this had an adverse effect of the acquisi-
tion of the language to the degree that the inter-generational transfer of the
language took place at a diminishing pace, resulting in the extinction of the
language, while the other Caucasian languages with simpler systems, though
exposed to the same external factors of endangerment, did not share the same
fate. Despite the fact that they lost their linguistic heritage, the majority of
the Ubykh speakers retained their Caucasian and Ubykh identity as well as
2 Historical Background
The homeland of Ubykh was the northwest region of the Caucasus. Until
1860’s, the Ubykh lived along the eastern shores of the Black Sea, in the area of
the present day Sochi, neighboring the Adyghe speaking Shapzug and Abaza.
With a population of approximately 40,000-50,000 (Russian sources cited in
Landmann 1981). the Ubykh lived in this ‘ethnically (and linguistically) hetero-
geneous’ region until 1864 (Bodenstedt 1848 in Landmann 1981).
The Death of a Language: The Case of Ubykh 153
r.
h r. Kuma
L
ab a
e
r. Kuban
Maykop
r.U
Ad r. Be l a j a
r
up
Cherkessk
y Besleney
Tuapse gh
e
k r. Malka
a Mozdok
Sochi b
a r d r. Terek
r. Psou A j a
Nalchik n
Gagra b
r. Bzyb k h a
z
Abkhaz Sukhumi
Abaza r. Kodor
East Circassian
West Circassian
Mingrelian r. Ingur
The middle of the 19th century was a period of disruption for the Caucasus.
The military invasion of the area by the Russian forces and the drastic changes
in the sociopolitical structure of the region that followed in the mid 1860’s also
brought about a dramatic change in the demography of the Caucasus. More
than 400,000 Caucasians were forced to leave their homeland and emigrate
out of the area to the neighboring Ottoman Empire.1 Of the groups that immi-
grated to the Ottoman state, the Ubykh were unique in that they were the only
group among the Caucasians who migrated out of the region as the whole pop-
ulation. At the three ports of entry into the Ottoman Empire – İstanbul and
the two ports on the Black Sea, Samsun and Trabzon – the immigrants were
assigned locations for settlement across Anatolia. The groups speaking differ-
ent northwestern Caucasian languages were all treated as Circassians.
Landmann (1981) observes that there were approximately 600 Northwest
Caucasian villages in Turkey in mid 20th century. These villages were dispersed
over the large expanse of the Anatolian terrain. As a consequence of the settle-
ment policies of the authorities, the Caucasian groups were split, settling in
locations distant from each other. Landmann identifies 15 of the Caucasian
settlements as Ubykh villages and indicates their geographical distribution
as follows:
1 The historical details of the plight of the Caucasians during the forced migration can be
found in the extensive research listed in the reference list at the end of the paper.
154 Özsoy
Kocaeli- Sapanca 8
Balıkesir 3-4
Samsun 2
Kahraman Maraş 2
(Landmann, 1981)
B L A C K S E A
I stanb u l Düzce z
d 20
e/6 A/84
10 f g/71
66 y B/6
h C/45 K/3
a/16 c/30 Ankara 35
16
b/74 k/27 x/13
1/4 D/20
j/4 L/5 Muş ?
i/5 E u u u
m/4 2 F/63 M/2
n/3 w/2
p/3 t/12
o/3 I/18 J/2
r/2 G/5 N/3
q/1
s/2
v/2 H
6
Maraş, is an indication that the term ‘Ubykh village’ is a relative term which
expresses that the group with the largest percentage of the population in
the village is of Ubykh origin, rather than the village being inhabited only by
Ubykh speakers.
Landmann (1981) remarks that in 1965, 1971 and 1975, the speakers in the two
villages could speak only a few words of Ubykh. The Ubykh considered them-
selves as Cherkess. Landmann further remarks that the same also holds for the
Lezghi and the Chechen of the Kahraman Maraş province.2
2 As Landmann (1981) comments, the Ubykh of the two villages in the Kahraman Maras
province were not researched by Dumezil or Dirr or any other linguist. The same holds for
the Ubykh villages in the Adana area. To the best of my knowledge, that situation has not
changed since then.
156 Özsoy
For Nettle and Romaine (2012), three major causes of language endangerment
and extinction are population loss, forced language shift and voluntary shift.
They propose that vitality of a language can be negatively affected by top-down
policies whereby it is retrieved from official institutions and public domains –
e.g. courts, commerce, politics – and restricted to home use and/or by
bottom-up practices whereby the language “retreats from everyday use and sur-
vives primarily in ceremonial or more formal use, e.g. school”. Tsunoda (2006)
distinguishes between different types of language death, classifying them in
terms of cause, speed, combination of the two as well as the role of register in
language attrition.
That the conditions generally maintained to underlie language endanger-
ment and extinction prevailed for the speakers of the Ubykh language can be
ascertained by the brief account of the dramatic history of the Ubykh people
relayed in Section 2. Crucially, however, the expulsion from homeland and
forced migration to the Ottoman Empire were not confined to be the fate of
the Ubykh people only. The speakers of the other northwestern Caucasian lan-
guages also suffered through the same dramatic events as the Ubykh people
and were forced to migrate out of their homeland, settle in a land and terrain
strange to them, surrounded by an unrelated dominant language. The villages
in which the Caucasian immigrants settled were dispersed throughout various
regions across Turkey, in most cases distant from each other. Their languages
were surrounded by other languages, Turkish being the dominant language in
many regions. It is evident that these immigrants, like the Ubykh speakers, were
exposed to the same external conditions, with the exception of one, which are
generally taken to be the cause of language endangerment and extinction.
The external force, which is maintained by Nettle and Romaine (2012) to be
a crucial factor in language extinction, was loss in population. Population loss
affected the Ubykh people much more drastically than the other Caucasian
groups since the Ubykh had already started to decrease in numbers long before
they were forced to leave their homeland in the Caucasus in 1860’s. Smeets
(2013) summarizes the plight of the Oubykh as follows:
. . . . . . the Ubykh, who have been hit worst of all peoples involved in the
Caucasian War. Their numbers, which were initially already lower than
those of their immediate relatives, were first reduced dramatically by the
losses inflicted during the defense of their homeland, then during the
exode and its aftermath, and again during the turbulent period in Turkey
following the First World War. The survivors lived in a few pockets of vil-
lages dispersed over distant Turkish vilayets, which left their language
with little chances for survival.
The Death of a Language: The Case of Ubykh 157
Given that there does not seem to be any evidence a distinction was made
in the settlement policies administered by the authorities to the different
Caucasian groups, the villages of the immigrants with different linguistic
backgrounds were dispersed over different regions of Anatolia, with relatively
restricted possibility of communication between the villages. Of the other
two factors put forth by Nettle and Romaine (2012), i.e., forced language shift
and voluntary language shift, most Caucasian speakers, initially mainly men
but gradually women as well, went through stages of bilingualism, acquiring
proficiency in Turkish as well as in their native tongues, and in many cases
of the surrounding (or neighboring) Caucasian language(s). Nevertheless, the
Caucasian languages other than Ubykh managed to survive even to the pres-
ent date, albeit with observable language attrition among the members of all
the groups as the level of education of each new generation of the diaspora
improved and mass media, particularly with the introduction of television
which made Turkish accessible to even the oldest female members of the com-
munities who had not had too much exposure to the dominant language until
then, reached even the most remote parts of the country. The question then is
why it was Ubykh, but not the other Caucasian languages, that became extinct.
The claim here is that the answer lies not only in the nature of the extra-
linguistic factors of language endangerment and extinction but that the com-
plexity of the linguistic system of the language, specifically of its phonological
system, should also be taken into consideration. Given the same external fac-
tors, the complexity of a phonological system, particularly when the children
are exposed to languages with much simpler phonological systems, is very
likely to have a negative effect on the transfer of the language from one genera-
tion to the next, eventually leading to the extinction of the language, as in the
case of Ubykh.
3 With its 122 consonants, the Khoisan languages in southern Africa is the family with the
largest number of consonants (Maddieson, 2013). Khoisan languages have click sounds, not
found in Ubykh.
The Death of a Language: The Case of Ubykh 159
Labial Plain p b p’
Pharyngealized p̱ ḇ p̱’
Apico-dental Plain t d t’
Labialized tº̱ dº̱ tº̱’
Apico-alveolar ɫ’
Lamino-alveolar c ʒ c’
Dorso- Plain ċ ʒ̇ ċ’
post-alveolar Labialized ċ º̱ ʒ̇º̱ ċ º̱’
Lamino-post-alveolar č ǯ č’
Apico-palatal ç ƺ ç’
Dorso-velar Plain (k’)
Palatalized k’ g’ k’·
Labialized kº̱ gº̱ kº̱’
Dorso-uvular Plain q q’
Palatalized q’ q’’
Labialized qº̱ qº̱’
Dorso-uvular-pharyngeal Plain q̱ q̱’
Labialized q̱º̱ q̱º̱’
160 Özsoy
Labial Plain
Pharyngealized
Labio- Plain f
dental Pharyngealized v
Apico-dental Plain
Labialized
Apico-alveolar ɫ l
Lamino-alveolar s z
Dorso-post-alveolar Plain ṡ ż
Labialized ṡ º̱ żº̱
Lamino-post-alveolar Plain
Labialized šº žº
Apico-palatal ş Ȥ
Dorso-velar Plain x ɣ
Dorso-uvular Plain X Ɣ
Palatalized X’ Ɣ’
Labialized Xº̱ Ɣº̱
Dorso-uvular-pharyngeal Plain X̱ Ɣ̱
Labialized X̱ º̱ Ɣ̱º̱
Glottal h
As Table 2 reflects, Ubykh has a very complex series of fricative sounds. The
table illustrates that Ubykh has a fricative contrast for every point of articula-
tion with the exception of labials and apico-dental region. There are 10 con-
trasts in the sibilant series and 12 contrasts in the velar/uvular series. At each
point of articulation, there is voicing contrast as well, resulting in voiced and
voiceless members of each pair. As secondary articulation, labialization and
palatalization are the common processes.
The Death of a Language: The Case of Ubykh 161
Labials Plain m w
Pharyngealized m̱ w̱
Apico-dental Plain n
Apico-Alveolar r
for Kabartay; Abaza has 60 consonants (cf. Lomtatidze and Klychef 1989), A
consequence of these facts is that the phonetic features of the consonants
are much simpler than those of Ubykh sounds. Further, the consonant-vowel
ratio of these language is not as disproportionate as it is in Ubykh. All these
indicate that the consonantal system of these languages are not as complex
as Ubykh’s. Table 4 reflects the consonant system of Kabardian adapted from
Colarusso (1989).
Typical of a Caucasian language, the consonantal system of Kabardian
too includes ejectives in its inventory. Nevertheless, the total number of con-
sonants in Kabardian is significantly smaller than in Ubykh. According to
Colarusso (1989), Kabardian has 50 phonemes in total for Kabardian, 48 of
which are consonants and 2 vowels. As can be noted, Kabardian has much sim-
pler plosive and fricative series than Ubykh’s. The contrast in the sibilant and
uvular/pharyngeal series is much smaller and secondary articulations much
more restricted than those of Ubykh, resulting in a much simpler phonological
system within the definition of simplicity provided by WALS.
The Death of a Language: The Case of Ubykh 163
5 Discussion
complex system. Given the fact that the number of speakers of Ubykh was
much smaller than the speakers of the other Caucasian languages, “Many
Ubykh gradually assimilated to things Circassian and all of them eventually
developed into Turkish citizens cherishing their Ubykh (-Circassian) descent”
(Smeets, 2012).
References
Saxton, M. 2010. Child Language: Acquisition and development. Thousand Oaks, (CA):
SAGE Publications Ltd.
Smeets, R. 2013. Reflections on the Caucasus: 21 May 1864-2010. https://circassian
world.wordpress.com/2013/09/06/reflections-on-the-caucasus-21-may-1864-2010/.
Tsunoda, Tasaku. 2006. Language Endangerment and Language Revitalization. Berlin:
Mouton de Gruyter.
Vogt, H. 1963. Dictionnaires de la langue oubykh. Oslo: Universitetsforlaget.
World Atlas of Linguistic Structures (WALS). http://wals.info/chapter/1.
Whaley, Lindsay. 2004. Can a Language that never existed be saved? In Freeland, J. & D.
Patrick (eds.). Language rights and Language Survival. Manchester, UK: St. Jerome
Publications.
chapter 13
1 I wish to thank the Dukhan community for their constant cooperation in documenting their
language and culture.
2 Dukhans follow the so-called Sayan-type of reindeer breeding, characterized by small-size
herds of reindeer used as pack and riding animal and as a source of milk products. On the
Sayan-type of reindeer herding, see Vainshtein (1980). For more recent views of Sayan econo-
mies, see Donahoe & Plumley (2003). Hunting used to be an important part of the Dukhan
economy. However, hunting and fishing proscriptions were recently issued by the Mongolian
government. In order to balance the impact of these proscriptions, the Mongolian govern-
ment has granted Dukhan families dwelling in the taiga and tending to reindeer a state pen-
sion calculated on the base of family numbers.
3 The Mongolian style of pastoral nomadism is based on the so-called five snouted animals:
sheep, goats, cattle (cows and yaks), horses and camels.
4 On the taiga vs. steppe division, though with slight differences from the view presented here,
see Žukovskaja et al. (2002). Furthermore, on Soyot, see Rassadin (2010), on Tofan, Rassadin
(a.o. 1971, 1978, 2014) and Harrison (2003), on Toju, Čadamba (1974), on Tere-Khöl Tuvan,
Seren (2006), and on Dukhan, Ragagnin (2011).
5 On standard Tuvan, see, a.o. Isxakov & Palm’bax (1961) and Anderson & Harrison (1998); on
Tuvan dialects, see Sat (1987) and on Uyghur-Urianxay, see Bold (1975) and Ragagnin (2009).
6 The general view among scholars is that the Darkhat people are of Turkic origin and that
their language and customs have become Mongol in the past few centuries. For a short survey
of Darkhat grammatical features, see Sanžeev (1931) and Gáspár (2006).
168 Ragagnin
To start with, the general term referring to ‘reindeer’ is iβɨ ‘rangifer tarandus
sibiricus’ (cf. Tuvan ivi and Tofan ibi ~ ivi ‘id’). This term is possibly etymo-
logically related with iwiq ‘the she-antelope, which frequents stony tracks and
deserts’, recorded in Maḥmūd Al-Kāšɣarī’s encyclopaedic work Compendium of
the Turkish Dialects and glossed with Arabic ẓabya (Dankoff & Kelly 1982: 108);
cf. Hauenschild (2003: 100). A wild, i.e. not tamed reindeer, on the other hand,
is called tasfanaŋ. This term possibly goes back to taspan (see below) and aŋ
‘wild animal’. Among Taiga Sayan Turkic varieties merely Soyot displays the
close cognate daspanaŋ ‘wild reindeer’ (Rassadin 2006: 43).
With regard to a new-born reindeer calf, Dukhan displays the terms ehsɨrɨk
and anhay. The former is a transparent Turkic agent formation from the ver-
bal stem ehsɨr- ‘to get drunk’ and literally means ‘drunkard’. This denomination
most probably is based on the fact that the new-born baby reindeer tumbles
like a drunkard. Moreover, it surely belongs to the set of taboo names applied
to young creatures (both humans and animals) in order to protect them from
evil spirits. It is quite unlikely that evil spirits would take away a drunkard.
Moreover, from ehsɨrɨk the verbal stem ehsɨrɨkte- ‘to calf/to fowl’ is formed. The
letter term, anhay, is etymologically more obscure. It could be traced back
to *ana ‘mother’ augmented by the hypocoristic suffix -KAy, thus meaning
‘mommy’.8 Both formal and semantic correspondences of Dukhan ehsɨrɨk and
anhay are documented throughout Taiga Sayan Turkic, eg. Toju-Tuvan aˁniy
‘reindeer calf’, eˁzirik ‘affectionate name for reindeer calf’ (Čadamba 1974: 63),
Soyot aˁnay ~ aˁnhay ‘reindeer calf till one year of age’, eˁsirik ‘new-born rein-
deer calf’ (Rassadin 2006: 22, 85, 204). Steppe Sayan Turkic, on the other hand,
displays corresponding items referring to babies of other animals, cf. ezirik
7 Some Dukhan reindeer terms are listed in Badamxatan (1962: 9), Somfai-Kara (1998: 18-19),
Kuular & Suvandi (2011a) and Ragagnin (2012).
8 Similar expressions are used a.o. in Turkish, see Ragagnin (2012: 135-136) for details.
Diversity In Dukhan Reindeer Terminology 169
(eˁzirik) ‘goatling, (kid), deer’s cub (fawn)’ (Tenišev 1968: 608b, Dorlig & Dadar-
ool 1994: 242a), and aˁnay ‘young offspring of a goat or mountain goat’ (Monguš
2003: 130b).
Moreover, with regard to new-born reindeer, Dukhan inventory includes the
terms öskʉsek and hosbayak not documented so far in the rest of Taiga Sayan
Turkic. The former denotes a motherless fawn and morphologically represents
a diminutive formation from öskʉs ‘orphan’. The latter refers to a new-born
reindeer fawn rejected by its own mother and represents a nominal formation
from the verbal stem hos- ‘to refuse animal’s babies’.9
Proceeding along the age axis, a young reindeer up to one year of age is
called hokkaš, a rather obscure term, arguably going back to the diminutive
formation kuškaš ‘small bird’10 from kuš ‘bird’ through phonological distortion,
not uncommon in taboo names. A.o, cf. Tere-Khöl Tuvan xokaš ‘reindeer calf
below one year of age’ (Seren 2006: 81), Tofan hokkaš ‘one-year old reindeer
calf’ (Rassadin 1971: 190) and Soyot hoqaš ~ hokkaš ‘one year-old reindeer calf
(in its second year)’ (Rassadin 2006: 85).
When reaching one year of age, the young reindeer is called taspan.
Cf. Toju Tuvan daspan ‘1 year-old reindeer’ (Sat 1987: 77), Tofan daspan
‘2 year-old young wild reindeer’ (Rassadin 1995: 21a). The etymology of taspan
is obscure; for some proposals, see Tatarincev (2002: 105-106).
Six months later, in the autumn, at the age of 18 months, the male reindeer
is named toŋgǝr and the female toŋgʉr.11 Both toŋgǝr and toŋgʉy are etymo-
logically rather obscure.12 They may be related to a rhotacised form of Turkic
toŋuz ‘pig’.13 This assumption, however, needs further investigation. Cognates
9 On the other hand, Standard Tuvan employs the form xosturgan (xos- ‘to refuse animal
babies’-caus-past part) to characterize a young animal refused by its own mother, eg.
xosturgan xuragan ‘rejected lamb’. I wish to thank my colleague Choduraa Tumat for pro-
viding me with this example.
10 Note, in this respect, that a structurally similar lexeme is documented in Sarigh Yugur:
gohqaš ‘small bird’ (Nugteren & Roos 2006: 110).
11 For all the analyzed terms, to distinguish between male and females, the terms er/erhek
vs. epšɨ may be used.
12 Within the whole of Sayan Turkic, Turkic toŋuz shows modern correspondences merely
in the so-called Uyghur-Uriankhay variety of Eastern Khövsgöl Aimag in the form toos
(Ragagnin 2009: 229).
13 It is worth noting that traces of Turkic toŋuz occur in other documented languages of this
area, such as Mator toŋoi ‘pig’ (Helimski 1997: 365, §1060), Toju Tuvan doŋay ‘two year-old
wild reindeer’ and Tuvan doŋay (Monguš 2003: 474a) ‘bear cub’ may also belong here. In
this respect, it should be kept in mind that names of strong and dangerous animals, such
as the boar, belong to the set of taboo names in use across Siberia and neighbouring areas.
170 Ragagnin
14 According to Helimski (1997: 301-302) Mator méinde may be traced back to Protosamoyedic
*məjan-ce̮ɜ (məjan ‘ground (gen)’ + ce̮ɜ ‘(tamed) reindeer’.
Diversity In Dukhan Reindeer Terminology 171
refers to an older reindeer doe with little fur, kïsǝr mïndɨ to a dry doe and,
finally, a mature reindeer doe is called ulǝɣ mïndɨ, i.e. ‘big mïndɨ’.
With regard to gelded reindeer, Dukhan displays a rather large set of terms,
including guuday, tüktǝɣ mĩis̃ , pir tüktǝɣ guuday, ihx̃ ɨ tüktǝɣ guuday, üš tüktǝɣ
guuday, ǰarɨ and bogana. Castration usually occurs when the reindeer is in
toŋgǝr-age, in autumn;15 the expression toŋgǝr tïhrtaar, literally ‘pull the toŋgǝr-
reindeer’ in fact means ‘to castrate’. Once castrated, the reindeer is called guuday;
also cf. Soyot quuday ‘domesticated three years old reindeer buck’, Tere-Khöl
Tuvan kuuday ‘small/young male reindeer’ (general term), Tofan kuuday ‘rein-
deer buck about 2-3 years old’ (Rassadin 1995: 33a). Ščerbak (1961: 91-92) and
Tatarincev (2004: 327), derived the term kuuday from Turkic kuu ‘grey’ and day
‘foal, young horse’. One year after castration, that is when the reindeer is three
years old, it is called tüktǝɣ mĩis̃ , literally, ‘hairy antler’. A Dukhan synonym for
tüktǝɣ mĩis̃ is pir tüktǝɣ guuday, literally ‘one-haired guuday (one hair-der guu-
day). Based on the same syntactic structure (cardinal number + tük ‘hair’, aug-
mented by the adjectivalizing suffix -LXɣ + guuday) are: ihx̃ ɨ tüktǝɣ guuday and üš
tüktǝɣ guuday referring, respectively, to ‘two-haired guuday’ and ‘three-haired
kuuday’, i.e. ‘four-year vs. five-year old gelded reindeer’. Formally related items
are documented in the other Taiga Sayan Turkic varieties, a.o. Soyot düktɨγ miis
‘domesticated young reindeer buck in its third year of age’ (Rassadin 2006: 47),
Tere-Khöl Tuvan iyi tüktüg kuuday ‘three years old male reindeer’, üš tüktüg
kuuday ‘four years old male reindeer’ (Seren 2006: 82), Toju Tuvan bir düktüg
mïyïs ‘male reindeer about 3 years of age’, iyi düktüg mïyïs ‘male reindeer about
four years of age’, üš düktüg mïyïs ‘male reindeer about five years of age’ (Seren
2006: 82). Furthermore, in Dukhan, the lexeme mĩis̃ occurs in the construc-
tion sããrsǝk mĩĩs with reference to a reindeer with one-dropped antler (sããrsǝk
‘one of two’).
The term ǰarɨ refers to a “calm” riding and packing reindeer, older than
four years of age, at least according to some informants. Cognates are found
throughout Taiga Sayan Turkic, e.g. Tofan ǰarǝ, Soyot čarï ‘riding and pack-
ing reindeer’ (Rassadin 1971: 194; 2006: 153), Tere-Khöl and Toju Tuvan čarï
‘castrated reindeer’ (Seren 2006: 82). Interestingly enough, Steppe Sayan Turkic
varieties show a rather different picture. The standard Tuvan cognate čarï
refers to a breeding male reindeer (Tenišev 1968: 520a) and in the so-called
Uyghur Uriankhay Sayan-Turkic variety of Eastern Kubsugul ǰarǝ is the only
existing term meaning ‘reindeer’ (Ragagnin 2009). Interestingly enough, a
Much ink has been spilled in trying to trace the origin of reindeer taming and
reindeer herding culture across northern Eurasia. According to the renowned
16 Also cf. the information supplied by Marco Polo’s XIII century travelogue concerning
reindeer herding nomads in the Bargu area (Ragagnin 2015).
17 In this regard, also see Hauenschild (2003: 105-106).
18 Cf. Erdal (1991: 87) for the corresponding denominal suffix -gAn deriving zoological and
botanical names in Old Turkic.
Diversity In Dukhan Reindeer Terminology 173
Russian anthropologist Vainshtein (a.o. 1980 and 1986) reindeer breeding origi-
nated in the Baykal-Sayan area.19
There are many Dukhan legends on the origin of reindeer taming. To finish
the present contribution I wish to include one of these. Several further vari-
ants circulating within the Dukhan community have been collected and will
be published in Oyunbadam & Ragagnin (forthcoming).
19 In this respect, also see Laufer (1917), Vasilevič & Levin (1951) and Vitebsky (2005).
174 Ragagnin
Translation20
(1) Once upon a time (lit. in early times before) the Dukhan people, the vari-
ous Dukhan peoples were living going about hunting for large and small ani-
mals through the taiga and high plains, and moving about from one river to
the other. (2) So, after many years, after living from hunting large and small
animals, after living thus, living from hunting, fishing big and small fish, hunt-
ing game and searching for grain, hunting birds and squirrels, sables and
squirrels, and such animals, all the small animals and the resources (of the
forest), among many rivers they settled down inside one of them. (3) Separated
within the rivering land from each other by the rivers, the Dukhans of one river
20 The English translation is faithful to the original Dukhan text. Thus, hesitations and refor-
mulations of the speaker are retained in order to demonstrate the normal flow of speech
when telling a story. Speakers in fact modify and repeat sentences when building up
a story.
Diversity In Dukhan Reindeer Terminology 181
or the other, the Dukhans of one river or the other were separated, and they lived
from hunting for game and collecting grains and berries along one river or another.
(4) After many years, after maybe a hundred years, even after a hundred
years – or something (like that) – there happened to be many wild animals
(coming) from the taiga and high plains: maral deer, antelopes, bears, reindeer
and other animals. (5) Within those, there was such an animal called rein-
deer. Sooo it was. (6) Well, those animals called reindeer were staying beside
the Dukhan peoples not far away, very near. Such animals (called reindeer)
were coming to live (there). (7) Alongside those Dukhans of that taiga and
high plains, in the taigas and high places of the lands and those rivers, where
they were living, so, yeah, after many years, those Dukhans well, the animal
called the reindeer and the Dukhans came very close to each other: (8) Yeah,
the Dukhans came about to speak about the following: “How would it be if we
captured some of these animals, these (animals) called reindeer and we our-
selves were to ride them, use them, milk their milk, drink it, and to make use of
them in this life of ours in that way”. (9) So, they spoke in that way. (10) They
went on speaking for many years, like this, several years, about ten. (11) So, well,
having spoken together and among themselves about capturing that reindeer,
they remained at the point of almost catching them. (12) Yeah, so, well, one day
there was this taiga behind, a taiga called the reindeer taiga, an isolated and
free taiga among the various taigas. (13) In that taiga there were really a lot of
those animals called reindeer. (14) So, well, one day an old woman settled down
there, in that taiga, in that taiga, in the taiga called the reindeer taiga where
many reindeer lived. (15) So, as for those white and blue things called reindeer,
they came to graze very close beside that woman, those animals, those rein-
deer animals. (16) Well, one day that woman, well, that old woman, a woman
around fifty, well, that woman thought, “If I would capture one of those rein-
deer, how would it be?”. In order to teach herself the voice (of the reindeer) she
passed her days just making gugurt-sounds. (17) She just passed her days mak-
ing gugurt-sounds. (18) She tried to get them to lick her urine and in this way
(the reindeer) kept getting closer and closer. (19) Doing gugurt sounds every
day, she happened to make the reindeer familiar to her own voice. (20) And by
doing this, that reindeer, those reindeer, the woman of that taiga was able to
learn all of the gugurt-sounds, cries and wines of those reindeer called reindeer,
all of them. (21) So (they) were really getting very close to that woman. (22) For
about ten fifteen days she kept on saying such kind of things and it (the rein-
deer) became close to her. (23) Those reindeer came up to (her) hand in such
a way to be almost touched, they (gradually) started to be reachable by hand.
(24) Well, and one day that woman twisted together a lasso, a lasso for ani-
mals, she twisted together a long lasso in order to catch that reindeer. Well,
182 Ragagnin
and one day she caught with the lasso a small reindeer that came nicely close
to her. (25) As soon as she caught it, since it was not struggling and kicking a
lot, she roped it immediately. Then she put a collar on its head, a collar and,
now, she has got it tied up. (26) That reindeer she got was a small two year old
tongʉy-reindeer. (27) Yeah, well, the next day she got excited thinking “Isn’t it
possible to catch one of them again?” (28) Beside the reindeer she had caught
the day before, also (other) reindeer that followed that one happened to be
wondering around (there) and not far away from this place there were also
many reindeer. (29) The next day after that, she roped another small reindeer
as well. (30) Well, a rope lasso, with a lasso, she roped it with her collar and
lasso, yeah, well, that woman of more than fifty years old became indeed some-
body who has two reindeer. (31) Now she had one toŋgǝr-reindeer and one
tongʉy-reindeer. Well, that woman of more than fifty years of age (gradually)
became somebody who has two reindeer. (32) Yeah, that woman, now that
toŋgǝr-reindeer, tongʉy-reindeer, having done that, and doing that with her
lasso that she had made herself, and doing that, she took them out to graze
with the lasso, in that way. She was feeding it (the reindeer) little by little by
hand with very tasty things (salt),21 and having done that, now, that woman,
a woman of about fifty years old had two reindeer. (33) Those other Dukhan
people who were living along rivers and in the taigas that were nearby, well,
they heard (it) and they were talking among themselves, well, that this one
woman had reindeer and thought, “Let us now also catch them like that person
(did)”. (34) Yeah, now they spoke about how to catch that reindeer. (35) Some
of them catch it (with) the same kind of lasso of that woman – that woman
person. Some of those people now set up traps in places where there are big
and small gorges/ravine and chase (them), and gather them in great numbers.
Having done that, those Dukhan came to possess reindeer. They caught (them).
Sooo it was. (36) So, after that, our, well, of this Dukhan country, wait let me
think, after that in the end the wild reindeer that wonders around in the taiga
and high plains and that does not approach peoples exists as well. They have
raised the many many of those reindeer from that (time) on, they raised those
few, small reindeer they had caught, and there in the taiga and high plains
there always also existed wild reindeer that do not approach anyone. (37) So
now, growing that reindeer they caught, they move from river to river. Each of
the many Dukhans now have reindeer, and they raise their reindeer, some of
them go around with reindeer, they go around in their own directions with few
reindeer, and they, they ride their reindeer. Now they came to pack up these
21 Reindeer, like other animals, always lick urine for its salt content. The story implies that
the woman is trying to domesticate them in this way.
Diversity In Dukhan Reindeer Terminology 183
reindeer of theirs and go hunting. (38) Now they (i.e. the Dukhans) have come
to be people who pack the meat of wild game on them (i.e. the reindeer), have
their children ride on them, milk them and drink their milk, and in this way
now they use that reindeer, they became people who make much use of the
reindeer in their own life. Sooo it was. (39) Now the Dukhan peoples who used
to move around in that river area living from fishing, hunting and gathering,
after that, they came to have a riding animal, they milked and drank its milk,
they milked and drank the milk of the reindeer; (40) besides, they nomadize
back and forth moving fast, when they nomadize they use the reindeer as pack
animals, they nomadize from river to river, from mountain to mountain, they
came to be moving around like this, they raised them to carry their packs,
many of them. (41) Now they have a riding animal, all of them, the distances
they cover in their trips became long(er). (42) They have come to nomadize
from one river to another river. (43) They have come to be people who live
by nomadizing and hunting. (44) They have come to be people who move
and visit each other. (45) Yeah, so, the Dukhan person . . . the reindeer was
discovered/found in this way.
References
Rassadin, Valentin I. 2014. Tofalarskij jazyk i evo mesto v sisteme tjurskix jazykov. Elista.
Sanžeev, G. D. 1931. Darxatskij govor i fol’klor. Leningrad.
Sat, Šuluu Č. 1987. Tyva dialektologija. Kyzyl.
Seren, Polina. 2006. Tere-Xol’skij dialekt tuvinskogo jazyka. Abakan.
Somfai Kara, Dávid. 1998. A xöwsgöl-i tibák. In: Birtalan, Ágnes (ed.) Öseink nyomán
Belső-Ásziában II. Hitvilág és nyelvészet. (Magyar Felsőoktatás Könyvek 10.)
Budapest. 17-21.
Ščerbak, Aleksandr M. 1961. Nazvanija domašnix i dikix životnyx v tjurkskix jazykax.
In: Ubrjatova, Elizaveta I. (ed.) Istoričeskoe razvitie leksiki tjurkskix jazykov. Moskva.
82-172.
Thackston, Wheeler M. (tr.) 2012. Classical Writings of the Medieval Islamic World:
Persian Histories of the Mongol Dynasties: Rashiduddin Fazlullah (vol. III). London.
Tatár, Magdolna. 1985. Sayan Etymologies. Beşinci Milletlerarası Türkoloji Kongresi.
İstanbul, 28-23 Eylül 1985. Tebliğler. I. Türk Dili 1. İstanbul. 243-252.
Tatarincev, Boris I. 2002. Ėtimologičeskij slovar’ tuvinskogo jazyka. Tom II: D, Yo, I,
J. Novosibirsk.
Tatarincev, Boris I. 2004. Ėtimologičeskij slovar’ tuvinskogo jazyka. Tom III: K,
L. Novosibirsk.
Tenišev, Ėdgem R. (ed.) 1968. Tuvinsko-russkij slovar’. Moskva.
Vainshtein, Sevyan [Vajnštejn, Sev’jan I.] 1980. Nomads of South Siberia. The pasto-
ral economies of Tuva. Cambridge. [English translation of Vajnšeijn, Sev’jan 1972,
Istoričeskaja etnografija tuvincev, by Michael Colenso]
Vainshtein, Sevyan I. 1986. Origin of reindeer-herding in Eurasia. In: Lehtinen, Ildikó
(ed.), Traces of the Central Asian culture in the North. Finnish-Soviet Joint Scientific
Symposium held in Hanasaari, Espoo 14-21 January 1985. (Suomalais-Ugrilaisen
Seuran Toimitukssia/Mémoires de la société finno-ougrienne 194). 279-286.
Vasilevič, G. M. & Levin, M. G. 1951. Tipy olenevodstva i ixh proisxoždenie. Sovietskaya
Etnografya 1.
Vitebsky, Piers. 2005. Reindeer people. London.
Werner, Heinrich. 2002. Vergleichendes Wörterbuch der Jenissej-Sprachen. Band II: L-Š.
Wiesbaden.
Žukovskaja, N. L. & Oreškina, M. V. & Rassadin, Valentin I. 2002. Sojotskij jazyk. In:
Neroznak, V. P. (red.) Jazyki narodov Rossii. Krasnaja Kniga. Moskva. 164-170.
Chapter 14
This short essay entails some reflections about the object of ‘language docu-
mentation’ in a specific setting of functional bilingualism. In this paper, I start
from a sociological stance that can be opposed to a linguistic approach of
defining the object at issue. The perspective taken here is especially relevant
because functional bilingualism grounded in the tradition of first language
bilinguals (as it is the case with the subjects of this paper, namely the ‘Udis’) is
best interpreted in terms of a holistic view (Grosjean 1982), who “argues that
each bilingual is a unique individual who integrates knowledge of and from
both languages to create something more than two languages that function
independently of each other” (Reyes 2008: 79). As for the community of Udis,
a small ethnic group dwelling mainly in the village of Nij in Northwestern
Azerbaijan, the problem is even more complicated, because one of the ‘lan-
guages’ involved in this type of bilingualism can be considered as being mas-
sively influenced by the other language, namely Azeri, the official language of
Azerbaijan. In order to illustrate this point, I refer to one example randomly
taken from published Udi texts (translation of Jona), cf. (1):
Except the few morphological units and the lexical unit bak- ‘to become’, all
terms are taken either from Azeri or are older (Oriental) loans. Hence, one
may ask, to which degree it is justified at all if we relate this phrase to the ‘Udi’
language. In case the speech of the Udis would be under survey in terms of
‘language documentation’, one might thus ask what kind of linguistic object
we have to deal with. According to the holistic perspective as discussed by
Grosjean (1982) it would be appropriate to speak of a unitary linguistic knowl-
edge system expressed in terms of a ‘mixed language’. However, this does
not seem to be the case among many speakers in Nij. Rather, their linguis-
tic practices are characterized by a internally structured, nevertheless holis-
tic knowledge system that is profiled towards Udi or Azeri according to the
social and situational setting of communicative acts (supplemented by role
features and the language biography of the speakers). The question of how to
account for the given linguistic practices in terms of ‘language’ again becomes
complicated, because the language components of bilingualism among Udis
are marked for an unbalanced pattern with respect to language and power. In
fact we have to deal with a typical pattern that contrasts a ‘powerful’ majotar-
ian communicative system (Azeri) with a minotarian one (Udi) which again is
majotarian in the small community of Nij itself. This complex situation invites
us to reconsider the concept of ‘language’ at least when aiming at the doc-
umentation of linguistic practices in communities such as Nij. In my paper,
I will propose some arguments that aim at a more sociological understanding
of ‘language’ in terms of linguistic practices.
2 Linguistic Practices
would be a ‘language’. In this sense the definition starts from two reciprocal
hypotheses: (a) The assumption that a group of people actually represents a
‘community’, and (b) that this ‘community’ defines itself by referring to a collec-
tive knowledge system called ‘language’. However, there is sufficient evidence
to assume that social groups construe themselves as social units (‘communi-
ties’) by referring to a distinctive bundle of inherited or otherwise established
sociocultural patterns of knowledge and of practices resulting there from. This
bundle may include patterns of linguistic practices, but does not necessar-
ily do so. In other words: We cannot equal the ‘speech’ of a community (as
large as it may be) to ‘a’ language as such. Taking the above-given definition by
Himmelmann serious, we have to start from communities delimited by socio-
cultural features by monitoring their linguistic practices whether or not these
practices are grounded in a knowledge system scientifically described as ‘a’
language.
Relating ‘a’ language to dimensions of language use preconditions the
delimitation of a given language and is thus determined by sociological fea-
tures rather than by mere language-internal structural features, cf. the defini-
tion of ‘language’ as “a form of activity of human beings in societies” (Halliday,
McIntosh, Strevens 1964: 4).
Starting from a group-internal perspective we may ask whether the mem-
bers of the corresponding community relate their linguistic practices to what
Mannheim (1980 [1922]) has called “communicative knowledge”, that is to
explicit and reflective knowledge in terms of awareness. This view does not
necessarily mean that communicative knowledge has to show up in terms of
language awareness, in case language is seen as a scientifically identifiable
unit. It may well be the case that members of a speech community are aware
of their linguistic practices even though the underlying collective knowledge
system does not represent a ‘language’ of its own (in the linguistic sense). One
might argue that an indicator of language awareness is the presence of a corre-
sponding linguistic sign in terms of an endonymic language name (be it under-
ived or derived from other units such a geographical terms, ethnic names etc.).
However, note that naming linguistic practices in a community by the com-
munity itself is not always present, cf. Gregersen (1976: 95) who reports: “There
is no native name for the language [s.c. Manam, W.S.] (unnamed vernaculars
seem to be the rule in New Guinea)” (see Mühlhäusler (2006) for discussion).
Most likely, linguistic knowledge given in a community can at least partly be
understood as “conjunctive knowledge” in terms of Mannheim (1980 [1922]),
this is as implicit, experiential, non-reflective, praxeological knowledge
grounded in everyday practices, also see I. Schulze (2014a)). Taking this point
How Much Udi Is Udi ? 191
of Bourdieu), into its objectified version (by producing school books, readers
etc.). Supported by its institutionalization, this cultural capital has become a
major motivator of language awareness among Udis in Nij. This process has
resulted in changes in the attitude of Udi speakers with respect to their original
linguistic practices: Before the Armenian-Azerbaijani conflict had erupted in
1989/90, Udi-Armenian-Azerbaijani trilingualism had been a standard pattern
in Udi settlements (see below). Armenian was by that time part of the linguis-
tic landscape in Nij. Fifteen years later, all public appearances of Armenian
had been eliminated just as actions had been undertaken to ‘purify’ Udi from
Armenian elements.
The processes briefly alluded to here illustrate that language awareness has
to be regarded as a substantial part of social and even political models of lin-
guistic practices. The institutionalization of linguistic practices present in a
corresponding community can be interpreted as a shift from ‘variety’ to ‘lan-
guage’ (in the sense of sociolinguistics). The famous dictum “a shprakh iz a
dialekt mit an armey un flot” (a language is a dialect with an army and a navy)
(Weinreich 1945: 13) illustrates that a ‘language’ can be described as some kind
of ‘armed dialect’, controlled and administered by corresponding official insti-
tutions. In this sense, the total of standard linguistic practices of a single ‘speech
community’ can be regarded as representing a linguistic variety (Ammon and
Arnuzzo-Lanszweert 2001: 815) or – if confined by geographical aspects – a dia-
lect, whether or not there are other related varieties (or dialects) present in
the region. From an internal view, ‘related’ should be paralleled with mutual
comprehensibility to the degree that every-day communication is made pos-
sible somehow. It should be noted that these assumptions do not start from a
scientific definition and delimitation of languages. It makes sense to describe
the total of intercomprehensible varieties/dialects as a Gesamtsprache from
a sociolinguistic point of view (Ammon and Arnuzzo-Lanszweert 2001: 815).
However, the idea that the individuals of a ‘speech community’ are marked for
a common ‘language’ (as opposed to variety of a Gesamtsprache) seems to be
justified only, if we can show that some kind of ‘arming’ of the given variety,
that is the institutionalization of linguistic practices has taken place.
Summing up the point made so far, we can say that the term ‘language docu-
mentation’ is misleading in case we refer to Himmelmann’s definition quoted
in the beginnings of this section. The linguistic practices observable for a given
community may well constitute a linguistic variety of its own even though the
internal structure of these practices (that is grammar, lexicon, phonology etc.)
does not represent what is conventionally called a ‘language’ from a scientific
point of view. In other words: It is not necessarily the case that the linguistic
How Much Udi Is Udi ? 193
In 2009, some 65% of the inhabitants of Nij declared to be ethnic Udis, the
rest being chiefly Azerbaijanis. Nij is divided into sixteen ‘family-based’ quar-
ters (šaq’q’a or mähällä), two of which are mainly inhabited by Azerbaijanis
(Yalgaşlı, Abdallı).
Until 1989, a more or less compact group of ethnic Udis was present in the vil-
lage of Vartashen (now Oğuz), too, located some 20 km northwest of Nij. Before
1989 Vartashen was inhabited by some 5,000 people (roughly 40% Armenians,
15% Jewish Tats, and 30% Udis). Together with the local Armenians, most of
the Udis from Vartashen were forced to leave the village in 1990 due to the
Armenian-Azerbaijani conflict and thus moved to various places of the for-
mer USSR, e.g. to Armenia, more precisely to the borderlands of the Armenian
province Tavush and to the village of Zinobiani (1938-2000: Okt’omberi) in
Eastern Georgia, which had been founded by emigrants from Vartashen in
1922 in the context of the Armenian-Azerbaijani conflict 1918-1920 (see Schulze
2011a; today, some 200 ethnic Udis live in Zinobiani, together with the same
number of Georgians, cf. SSSD 2003: 110-111). Just as it is true for the Nij Udis, the
Zinobiani Udis know a ‘language sage’ (Simon (Mamuli) Nešumašvili), who has
undertaken many efforts to advance language awareness among the local Udis.
He has founded a “Society of Georgia’s Udis” and has developed a Georgian-
based script for the local Udis that he has used when preparing a new transla-
tion of the Gospels (Nešumašvili 2012). The number of Udis who have stayed
in the Oğuz region after 1990 is difficult to determine. Azerbaijani sources talk
about 79 Udis in 2009. However, it is unknown to which degree they live in
mutual contact at all. USSR-internal migration (esp. in the 1970ies) has condi-
tioned that quite a number of Udis are now to be found in scattered places of
the former USSR, e.g. in the Rostov region, near Moscow, and in Kazakhstan.
In sum, we can assume that the number of ethnic Udis does not exceed 10,000
people. This number, however, does not match the actual number of people
who use Udi in every-day communication. In many places outside Nij, Udi
has become an endangered variety, being replaced by the local language as a
general means of communication. For instance, the Udis who have migrated
to Armenia 35 years ago, have switched to Armenian nearly completely (see
Schulze and Schulze 2016).
Disregarding minor patterns present e.g. in Russia, it becomes obvious
that Nij and (to a less extent) Zinobiani are the main places where people
are exposed to Udi as a means of every-day communication. As has been said
above, both villages are marked for bi-ethnic composition: 50% of the inhab-
itants of Zinobiani are ethnic Georgians, and 35% of the inhabitants of Nij
are Azerbaijanis. The local varieties of Georgian (Kakhetian) resp. Azerbaijani
196 Schulze
(Northern variety) are supplemented by the normative and thus ‘armed’ ver-
sion of the corresponding Gesamtsprache. Hence, the total of linguistic prac-
tices observable in the two villages is marked for at least three layers, two of
which are closely related (the two varieties of Georgian resp. the two varieties
of Azerbaijani). In the rest of this paper, I will concentrate on the linguistics of
the village of Nij in order to ask to which extent the third given communicative
system (namely Udi) can be regarded as a distinct pattern of linguistic prac-
tices fully independent from the Azerbaijani varieties.
Based on field work carried out in April 1998, John Clifton (2005) reports
that “[l]anguage mixing is not common in Nic [Nij, W.S.]. According to Keçaari
[the late ‘language sage’ of Nij, W.S., see above], Udi young people in Nic do not
mix Russian and Udi when speaking. And they use Udi consistently at home”
(Clifton and others 2005). It is interesting to see that Keçaari does not refer
to Azerbaijani, but to Russian in order to show that the linguistic practices of
the Udis are not affected by ‘language mixing’. He is silent about Azerbaijani
(just as Clifton et al. (2005) are by large, even though they say that “[t]he goals
of the research were to investigate patterns of language use, bilingualism, and
language attitudes with regard to the Udi, Russian, and Azerbaijani languages
in the Udi community”). The results presented in Clifton’s survey (in fact the
only survey on sociolinguistic attitudes of inhabitants of Nij so far) have thus
to be taken with great care. It should be stressed that Clifton has worked with
34 informants only (less than 1% of the Udis in Nij): “Half of the Udi speakers
who answered questions were chosen by Jora Keçaari, the other half were cho-
sen randomly as we sat in the teahouse or visited various offices. Participants
were given a particular set of questions at random.” Considering Jora Keçaari’s
role as a propagandist of Udi language awareness as well as the fact that he
selected the main body of the small group of informants raises doubts and
sets the survey at risk of reflecting a biased view. In addition, it is remarkable
that the study concentrates on Russian, even though the relevance of
Russian has decreased considerably since the 1990ies especially in rural areas
(3% claiming to be fluent in Russian, see Ramazanova (2014)). The survey is
rather silent about the role of Azerbaijani. From this we may conclude that the
presence of Azerbaijani within every-day linguistic practices in Nij has become
part of the standard habitus of the villagers, no longer featuring a specific phe-
nomenon to be surveyed.
Hence, the question emerges what has to be considered as the linguistic
substance embodied by the linguistic practices of the speech community of
Nij. On the one hand, we have to ask whether Nij represents a single ‘speech
community’ or whether we have to divide it just for linguistic reasons into an
How Much Udi Is Udi ? 197
assuming that the cultural practices of the members of this community waver
between a more Azerbaijani-like and a more Udi-like style, the selection of
which depends on situational frames as well as upon biographical and social
parameters. Both styles can be seen as being grounded in the overall cultural
patterns of Azerbaijan that have become stabilized and habitualized over cen-
turies and that enjoy public propagation in present-day Azerbaijan.
The term ‘speech community’ (see again Himmelmann’s quote given in
section 2) suggests that ‘speech’, that is the ensemble of linguistic practices
can be seen as a delimiting factor that would subdivide (in the given case)
the community of Nij into two ‘sub-communities’. Accordingly, one would
have to start from some kind of ‘language-specific linguistic practices’, that
is one would have to isolate those layers of observed linguistic practices
that are grounded in particular linguistic knowledge systems, called ‘languages’.
However, this procedure presupposes that we have corresponding instruments
and tools available that would allow revealing these layers. One way would be
to ask people how they would label their actual linguistic practices in terms of
‘language’. Still, this way of referring to the internal view of people on their lin-
guistic practices presupposes that the individuals are sufficiently aware of the
different layers of their linguistic practices, which again presupposes that their
understanding of ‘language’ corresponds to that of the scientific perspective.
Moreover, the isolation of language-specific layers when documenting linguis-
tic practices is at risk to produce an artificial output, namely an “abstract, ideal-
ized homogeneous language”, to quote Lippi-Green (1997: 64) again.
It becomes obvious that the percentage of Azeri terms in the domains of ‘war-
fare and hunting’, ‘possession’, ‘clothing’, ‘dwelling’, ‘agriculture’, ‘religion and
belief’ is higher than the average (22.45%). Roughly the same picture emerges,
when we relate the number of Azeri loans both to the total of loans and to the
total of conceptual units, cf. diagram 2.
Given the lack of a comprehensive contemporary corpus of linguistic prac-
tices, it is difficult to judge upon the question which actual frequencies can
be ascribed to the individual loans. In addition, it has to be assumed that the
individual domains are present in accordance with the given communicative
202 Schulze
frame and in accordance with the social roles of the interlocutors. This means
that we have to include parameters related to the communicative habitus of
the corresponding social circle. For instance, it can be assumed that in the
frame of traditional role concepts, Udi men would talk more about topics
like agriculture, hunting and animals, politics and the like in public, whereas
women would address topics related to home work, social relations, cuisine
etc. more frequently when talking to each other in a corresponding setting.
This means that the linguistic practices of the Udi community may be marked
for an even higher percentage of Azeri loans in certain communicative situa-
tions, whereas they are less frequent in others. Personal observation suggests
that the more communication is done in public, the more Azeri loans occur.
The question thus emerges what is meant by ‘Udi’ when aiming at the
“language documentation (. . .) of the linguistic practices characteristic of a
speech community”, to quote Himmelmann (1998) again. One might argue
that the degree of phonological integration of Azeri loans into the collective
communicative knowledge system called ‘Udi’ can be an argument for judg-
ing upon the ‘Udishness’ of these loans. It goes without saying that especially
older loans may have undergone this type of accommodation to the phono-
logical system of Udi. However, the relevant changes are marginal and would
not hinder Azeri speakers to understand them if embedded into Udi phrasing.
On the other hand it is difficult to tell whether the spontaneous use of Azeri
terms in Udi is nothing but another type of code switching that would not
harm communication because in fact all Udis share the corresponding bilin-
gual knowledge system.
These assumptions also question the value of lexical documentation if not
derived from corpus linguistics reflecting the actual linguistic practices of a
‘speech community’. For instance when asking Udis what their term for ‘fish’
How Much Udi Is Udi ? 203
would be they would probably answer “čäli”, that is they would refer to the
non-Azeri term (without clear etymology). However, when talking about fish it
may well be the case that they say something like ayč:ä balıq aq:alzu ‘tomorrow
I will buy a fish’, using the Azeri term balıq ‘fish’ (or even sabah balıq aq:alzu,
using Azeri sabah for ‘tomorrow’). Hence, Udi čäli might be present in the
knowledge system of these speakers, but it would not be part of their every-
day linguistic practices. So, when documenting the linguistic practices of these
Udis, the Azeri term would show up, but not the Udi one.
Turning the perspective around, one may ask to which extent Azeri speak-
ers of Nij are able to follow a conversation in Udi. Informants told me that
they are usually able to understand the general topics of such conversations
by referring to the Azeri terms showing up in the speech of the interlocutors.
A more comprehensive understanding would depend on the question of how
many Azeri terms the speakers use. They likewise admit that under certain
circumstances such as communication in privacy, they would have difficulties
to get at least an idea of the contents of the conversation. From a perceptual
perspective, they would thus be faced with some kind of massive special lan-
guage as described above. The percentage of loans given in Gukasjan’s and
Mobili’s dictionaries illustrate that the number of Azeri loans increased in the
last 40 years (from 26,3% to 37,6%). Naturally, these figures do not tell much
about the actual relevance of the corresponding terms in the linguistic prac-
tices of the members of the Udi speech community. Likewise, it is difficult to
tell to which degree Robert Mobili being a full Udi-Azeri bilingual has tested
his words with the help of other Udi speakers. It should be noted that Mobili
does not include recent loans related to the domains of politics and sciences,
except if they have undergone characteristic phonetic changes (Mobili 2010: 5).
Accordingly, we may assume that the number of Azeri terms in actual Udi
speech is even higher than 37,6 %.
The observations presented in this paper question the notion of ‘Udi’ as ‘a
language’ at least from a sociolinguistic point of view. The bilingual setting of
the Nij community cannot be described as the co-existence of two distinct lin-
guistic knowledge systems. The strong tendency toward the inclusion of Azeri
patterns in the linguistic practices of Udis does not result in some kind of ‘mixed
language’, which would typically result in a monolingual knowledge system.
Rather, we may assume that Udi is on its way of becoming a communicative
style of Azeri retaining to a certain degree its grammatical, phonological, and
in parts lexical idiosyncrasies. The more Udis include Azeri lexical terms in
their linguistic practices the more this communicative system will become
intelligible to Azerbaijanis, at least on a rudimentary level. Udi will neverthe-
less keep on representing a particular way of speech in the village of Nij, just
204 Schulze
as it would be typical for other types of Azeri registers and dialects. Sure, in
case the grammatical system does not change dramatically, Udi will not simply
end up as some kind of Azeri dialect. Rather, we have to assume that it will
results in a new type of communicative style or register that may be termed
a culturally-defined variety or ‘cultural dialect’ of Azeri. However, as has been
said above, the growing language awareness among certain proponents of the
Udi community is a factor to be included in this forecast. The official acknowl-
edgement of Udi as some kind of ‘cultural capital’ for the Azerbaijani state and
society may condition that Udi survives as ‘a language’ because of its surplus
for the Azerbaijani state and – by adopting this external view – for the Udi
community of Nij.
5 Conclusions
The remarks presented in this short paper illustrate that the scientific param-
eters used to define ‘a language’ do not necessarily meet what can be derived
from the observation of the linguistic practices of a ‘speech community’. It
seems more adequate to distinguish between ‘Udi’ as a scientifically defined
unit and ‘Udi’ as representing a particular rhetorical genre that is part of the
habitus of people in Nij. Obviously, the scientifically delimited unit (Udi as ‘a
language’) is part of the knowledge system of many Udis, although we have
to assume that not all Udis share this knowledge system to the same extent.
Hence, Udi as ‘a language’ can be regarded as a catalogue of linguistic phenom-
ena abstracted from the documentation of linguistic practices that are filtered
according to scientifically established criteria. However, even this perspective
only allows delimiting Udi from other communicative systems by referring to
the feature of intelligibility. Over centuries, the communicative system called
‘Udi’ has undergone massive changes in nearly all domains of phonology,
grammar, and lexicon. Many of these changes were induced by language con-
tact. The ‘original’, that is East Caucasian profile of Udi has become distorted
to a degree that it is even doubtful whether we should label Udi as an East
Caucasian language today. Thus the question of intelligibility by people using
another communicative standard seems to be the only valid way of defining
Udi from a linguistic point of view.
However, the linguistic practices of the Udi community become intelligi-
ble to Azerbaijani members in and outside the community the more the Udis
‘tune’ their practices toward the communicative system of these people. Thus,
these linguistic practices do not represent a stable and non-dynamic knowl-
edge system, but are part of the social dynamics given for the members of
How Much Udi Is Udi ? 205
the community. In this sense, the understanding of ‘Udi’ as a stable and well-
defined linguistic system doesn’t seem to be justified. Rather, we have to assume
that this system is just a subset or one of the layers of actual linguistic practices
that may be activated to a different degree depending on the corresponding
social roles, social conditions, frames, and situations.
In this respect, it does not make sense to claim that Udi is that component
of the linguistic practices observable in the Nij community that it marked for
‘Udiness’. Such a claim would presuppose that we know what ‘Udi’ is. Rather
we might say that Udi is that component of the local linguistic practices that
cannot be processed or understood by ‘other’ people. Still, such a definition
would fragment the reality of linguistic practices. ‘Udi’ as ‘a language’ would
then no longer be understood as a typified rhetorical genre present in the com-
munity of Nij (and elsewhere) the practice of which might in parts be intelligi-
ble for others sharing a related rhetorical genre knowledge. From this it follows
that there are at least two ways of defining ‘a language’: First, by referring to
linguistic criteria (of which kind so ever); second by referring to the actual lin-
guistic practices of a community sharing a common rhetorical genre. It will be
up to people engaged in ‘language documentation’ to decide which object one
will chose.
References
Alliot, David. 2009. Larlépem-vous Louchébem? L’argot des boucher. Paris: Éditions
Horay.
Ammon, Ulrich and Arnuzzo-Lanszweert. 2001. Varietätenlinguistik. In G. Holtus,
M. Metzeltin, and C. Schmitt (eds.), Lexikon der Romanistischen Linguistik I,2, 793-
823 Tübingen: Niemeyer.
Baker, Colin. 2011. Foundations of Bilingual Education and Bilingualism. Bristol, Buffalo,
Toronto: Multilingual Matters.
Bawarshi, Anis S. and Mary Jo Reiff. 2010. Genre. An Introduction to History, Theory,
Research, and Pedagogy. West Lafayette, Indiana: Parlor Press.
Berkenkotter, Carol and Thomas N. Huckin. 1993. Rethinking Genre from a
Sociocognitive Perspective. Written Communication 10: 475-509.
Bourdieu, Pierre. 1979. La Distinction – Critique sociale du jugement. Paris: Éditions de
Minuit.
Çabanov, Gəmeršah and Rauf Hüseynov 1999. Udilər. 2nd edition. Baku: Elm.
Clifton, John M., Deborah A. Clifton, Peter Kirk, and Roar Ljøkjell. (2005). The sociolin-
guistic situation of the Udi in Azerbaijan. SIL Electronic Survey Reports 2005-014.
Clot, Yves. 2008. Travail et pouvoir d’agir. Paris: Presses Universitaires de France.
206 Schulze
Comrie, Bernard and Madzhid Khalilov. 2010. The Dictionary of Languages and Dialects
of the Peoples of the Northern Caucasus. Leipzig and Makhachkala: MPI Evolutionary
Anthropology.
Degele, Nina. 1999. Soziale Differenzierung: Eine subjektorientierte Perspektive.
Zeitschrift für Soziologie 28,5: 345-264.
Efing, Christian. 2005. Das Lützenhardter Jenisch. Studien zu einer Sondersprache.
Wiesbaden: Harrassowitz.
Gippert, Jost, Wolfgang Schulze, Zaza Aleksidze, Jean Pierre Mahé. 2009. The Caucasian
Albanian Palimpsests from Mount Sinai. 2 vols. Turnhout: Brépols.
Gregersen, Edgar A. 1976. A note on the Manam language of Papua New Guinea.
Anthropological Linguistics 18,3: 95-111.
Grosjean, François. 1982. Life with two languages: An introduction to bilingualism.
Cambridge, MA: Harvard University Press.
Gukasjan, Vorošil. 1974. Udincǝ-azǝrbajcanca rusca lüǧǝt. Bakı: Elm.
Guvasary, Venera Antonova. 2001. Udiland. Oslo: Bergersen.
Hałas, Elżbieta. 2010. Towards the World Culture Society: Florian Znaniecki’s Culturalism.
Bern: Peter Lang.
Halliday, M. A. K., A. McIntosh, P. Strevens. 1964. The Linguistic Sciences and Language
Teaching. London: Longmans.
Harris, Alice. 2002. Endoclitics and the Origins of Udi Morphosyntax. Oxford: Oxford
University Press.
Heath, Shirley Brice. 1977. Social history. In: Joshua A. Fishman et al. (eds.), Bilingual
Education: Current Perspectives. Vol. 1: Social Science, 53-72. Arlington, VA: Center
for Applied Linguistics.
Himmelmann, Nikolaus P. 1998. Documentary and descriptive linguistics. Linguistics
36: 161-195.
Hoffmann, Charlotte. 2014 [1991]. An Introduction to Bilingualism. New York and
London: Routledge.
Lippi-Green, Rosina. 1997. English with an accent: Language, ideology, and discrimina-
tion in the United States. London: Routledge.
Mobili, Robert. 2010. Udi-azerbaycanin urusin ǝyitluğ. Bakı: Qrifli nǝşr.
Mühlhäusler, Peter. 2006. Naming Languages, Drawing Language Boundaries and
Maintaining Languages with Special Reference to the Linguistic Situation in Papua
New Guinea. In: D. Cunningham, E. E. Ingram and K. Sumbuk (eds.), Language
Diversity in the Pacific. Endangerment and Survival, 24-39. Clevedon etc.: Multilingual
Matters.
Mannheim, Karl. 1980 [1922]. Eine soziologische Theorie der Kultur und ihrer
Erkennbarkeit – konjunktives und kommunikatives Denken. Frankfurt a.M.: Suhrkamp.
Maybury, Mark T. 1999. Communicative Acts for Multimedia and Multimodal Dialogue.
In: M. M. Taylor, F. Néel, and D G. Bouwhuis (eds.), The Structure of Multimodal
Dialogue II, 375-392. Philadelphia/Amsterdam: Benjamins.
How Much Udi Is Udi ? 207
SSSD. 2003. Sakartvelos mosaxleobis 2002 c’lisp’irveli erovnuli saq’oveltao aγc’eris Ʒiritadi
šedegebi. Vol. 2. Tbilisi: Sakartvelos st’at’ist’ik’is saxelc’ipo dep’artmenti.
Volkova, N. 1994. The Udis. In: Paul Friedrich and Norma Diamond (eds.). The
Encyclopedia of World Cultures, Vol. 6. Russia and Eurasia/China, 375-378. Boston:
Hall and Co.
Weinreich, Max. 1945. Der YIVO un di prablemen von undzer tsayt. YIVO bleter 25,1:
3-18.
Woodbury, Anthony. C. 2003. Defining documentary linguistics. In: P. Austin (ed.),
Language Description and Documentation, Vol. 1. (Endangered Languages Project.),
35-51. London: School of Oriental and African Studies.
Chapter 15
1 Introduction
The region of present-day Turkey, throughout history, has been the homeland
of diverse peoples with different cultural, religious and linguistic backgrounds.
Thus, this geographical area of great linguistic diversity is fertile lands for linguis-
tic research on language contact. This study aims to present a brief overview of
Sason Arabic (SA), one of the least documented and understudied dialects of
Arabic spoken in the multilingual and multi-cultural eastern Turkey, as a good
case for a language contact study.1 The town of Sason, which is situated in the
north of the mountainous province of Batman, has slightly more than thirty
thousand inhabitants according to the 2014 census. SA is spoken in a multi-
lingual environment where the main contact languages have been Armenian,
Zazaki (Kurdish) and Turkish, but due to demographic changes that have taken
place in the nineteenth and twentieth centuries SA speakers are now diminish-
ing in number.2 There is no diaglossia among SA speakers.
Jastrow (2006: 144-5) classifies the Arabic dialects spoken in Turkey into
three major groups, and several sub-groups. SA is identified as sub-group of
Mesopotamian Arabic:
1 See the appendix for the location of Batman and Sason on the map.
2 There isn’t a reliable source on the number of SA speakers today or in the past. The estimate
given by our younger informant was about 5,000 to 6,000 people, though it is not clear what
this estimate is based on.
Both Jastrow (2005, 2006) and Talay (2011) refer to Haim Blanc as the person
who first introduced the term Mesopotemian Arabic in his monograph entitled
Communal Dialects in Baghdad (Cambridge, Massachusetts 1964), in which
he gives a description of the Muslim, Christian and Jewish dialects spoken
in Bahgdad until 1950. According to Haim Blanc, based on the pronunciation
of the Classical Arabic qultu ‘I said’, the Jewish and Christian dialects com-
prised the qəltu-dialects and the Muslim gilit dialect.4 Haim Blanc also argued
that the qəltu-dialects spoken by the Christians and Jews in Baghdad were an
older variety spoken by the sedentary population in the time of the Abbasid
caliphs, while the Muslims dialect was a much later development, introduced
by the Bedouins who gradually moved into the country during the reign of
the Ottoman Empire (Jastrow 2006: 157-8). This brief historical perspective
indicates that the qəltu-dialects found in Anatolia must belong to the older
linguistic stratum, which implies that the earlier settlers in the region must
have been Christian and Jewish speakers of qəltu-dialects rather than speak-
ers of the Muslim gilit-dialect. Indeed, our informants when asked about who
their ancestors were, replied that the Arabic speaking population in the area
was believed to have migrated from the Basra region, long time ago, and that
SA was exclusively spoken by the Christian population until only recently. It
should be recalled that, historically speaking, the whole area was the land of
the Armenian people long before the Arabs and the Kurds appeared in the
scene. However, due to the migrations of the non-Muslims from the area in
the late nineteenth and early twentieth century, the remaining SA speakers
today are now predominantly Muslims living in the surrounding villages.5 The
contact languages in the area have basically been SA, Armenian and Zazaki,
the variety of Kurdish spoken in that region. Turkish appeared in the lives of
3 See Talay (2011) for a survey of the work done on the Mesopotemian dialects of Arabic spoken
in Turkey.
4 The qəltu dialects preserve the Old Arabic *q as a voiceless uvular stop /q/ while in the gilit
dialects this sound surfaces as is a voiced velar stop /g/ in most cases. Furthermore, in the
qəltu dialects the 1.SG perfect is -tu, while it is -it in the gilit dialects, Jastrow (2005, 2006).
5 Some of the Muslim SA speakers of today might be Christians who have converted to Islam.
Language Contact In Anatolia: The Case Of Sason Arabic 211
the people of Batman as the language of the state much later, probably, as late
as the early twentieth century.
It is well-known that long-term language contact leads to linguistic change;
the degree and type of change vary depending on a number of factors, such
as nature and duration of contact, social status of the languages in contact,
etc. The linguistic changes contact languages undergo may result in language
loss, borrowing at different levels or mixing of languages. SA, being one of the
languages spoken in the multilingual communities of northern Batman, would
then be expected to exhibit some effects of language contact. Thus, the nature
and extent of language contact and their effects on SA duly call for an investi-
gation. However, before raising any questions regarding how contact with the
languages in the area may have influenced SA, first a brief description of its
major linguistic features will be presented. Then, the findings will be discussed
from the perspective of the possible effects of language contact.
2 Linguistic Features of SA
The SA data presented in this study derives from the data collected in a field
methods class at Boğaziçi University in the spring of 2013 and Akkuş (2014b).6
Data were supplied through two informants, one being a middle aged gentle-
man around age 50-55, who was an illiterate polyglot in SA, Armenian, Zazaki
and Turkish, which he learned after the age of 25, and the other a bilingual
male university student of age 22-23.
When asked to what extent they could understand the other Arabic dia-
lects they were exposed to, our informants emphatically stated that they were
unable to comprehend any other Arabic dialect, be it one spoken in Turkey,
such as the Mardin or Adana/ Hatay dialect or one outside of Turkey. They said
they could only recognize certain words.7
6 The source is specified when data from Akkuş is resorted to. Examples with no source speci-
fied are part of the author’s data collected in the field methods class.
It is worth noting that the first linguistic study on SA is Akkuş (2014a). I would like to
thank him for doing the final check on the SA data in this study.
7 The complete unintelligibility of SA with the other Arabic dialects spoken in the area was
also confirmed by another SA speaker in the conference, who was an employee in Ardahan
University.
Some basic expressions of greetings in SA are provided below to give a feel for the dailect.
(i) bəlxer dʒi:t/e (M/F) (ii) ʃəme kənt/e (M/F).
Welcome How are you?
(iii) Baʃ kəttu.
I am fine.
212 Taylan
When the verb is in the progressive aspect, the auxiliary occurs before the verb,
as shown in (2):
In complement clauses, as shown in (5) and (6)a, and relative clauses as seen
in (6b), the observed order is VS(O), which is noted to be a highly marked order
in simplex sentences.
In non-verbal sentences the copula occurs sentence finally. The copula form,
which is kən for first and second persons (singular or plural), gets inflected
for number and gender, as shown in (7)a-c and (7)f-g. For third person singu-
lar or plural there is no gender distinction on the copula; it is ye for singular
third persons and -nen for plural third persons, regardless of gender, as seen in
(7)d-e and (7)h.8 These examples also demonstrate that the predicate adjective
agrees in gender and number with the subject. The masculine singular form
is the unmarked form of the adjective while the feminine singular carries the
suffix -e (e.g. mamlu:n-e ‘happy-F’). In the plural form of the adjective there is
a single plural suffix, which is -in for both genders (e.g. mamlu:n-in ‘happy-PL’).
8 In quick speech ye was observed to have the variants -yi,- i, -iye conditioned by the preceding
sound.
214 Taylan
d. iya mamlu:n-e ye
she happy-F COP-3SG
‘She is happy.’
e. iyu mamlu:n ye
he happy COP-3SG
‘He is happy.’
h. iyen mamlu:n-in-nen
they happy-PL-COP.3PL
‘They are happy.’
When the predicate is a locative NP, the unmarked word order is now sub-
ject-copula-locative, as seen in (8)a. Furthermore, in (8)b we see the third per-
son plural copula form, namely kənno, surfacing in this order, which was not
observed with predicate adjectives.
(ii) Negation
There are two different morphemes to negate verbs; ma is used when the verb
is in the indicative, as in (9)a, but la when the verb is in the imperative mood,
as in (9)b. Both forms immediately precede the verb.
b. la: tamel
NEG work.2SG.M
‘Don’t work.’
In sentences with a non-verbal predicate, when the subject is third person sin-
gular or plural, number and gender agreement surface on the negative mor-
pheme as the full form of the copula is absent, illustrated in (12)a and (13)-(14).
and this is the only possible order.9 Again, the verb is in final position in
all cases.
b. kən-t f-stanbul ↑
COP-2SG.M in-Istanbul
‘Are you in Istanbul?’
b. bənt gbir-e
girl big-F
‘the big girl.’
Adjectives agree with the gender of the noun they modify. Masculine gender
is unmarked; feminine gender is marked by the suffix -e as observed on the
adjective in (21)b.
Numerals and demonstratives precede the noun, as illustrated in (22) and
(23), respectively:
SA has no definite article. There are a few frozen expressions where the rem-
nants of a definite article are visible, such as in the expression for ‘welcome’
given below.
10 The other demostrative is laga ‘that (M)’. Demonstratives agree in number and gender
with the noun; e.g. lala (F) kitab ‘that book’, lagu kita:ba ‘those books’.
11 The adjective follows the indefinite marker:
bənt ma koisi
girl a beautiful
‘a beautiful girl’
Language Contact In Anatolia: The Case Of Sason Arabic 219
d. Ali ku idu:r-a
Ali COP look=for-3SG.F
‘Ali is looking for her.’
The object in (25)a has an indefinite reading as it has the [N ma] structure
with no object clitic on the verb. However, when the object clitic -a appears on
the verb, as seen in (25)b, the object in the form of a [N ma] structure receives
an indefinite but specific reading. When ma is not present then the object is
interpreted as definite, as seen in (25)c. When there is no overt direct object
but the verb has the object clitic, then we get a pronominal object reading, as
shown in (25)d.
If the direct object is masculine (e.g. ʃi ‘food’ in (26)), then the object clitic
agreeing with it surfaces as -u:
b. Ahmad ayal-u.
Ahmad eat.PST-3SG.M
Ahmad ate it.’
The examples above also illustrate that SA has no case marking on subjects
and objects.
b. mase ən-qaraf-e
table PASS-break.PST-3SG.F
‘The table is broken.’
Note that when the feminine direct object of the active sentence (28)a becomes
the subject of the passive structure (28)b, the verb now exhibits gender agree-
ment with the feminine subject through the suffix -e.
Morphological causative formation has been reduced to only a few verbs;
(29)b illustrates that the verb ‘cut’ is one of the verbs that retains its morpho-
logical causative form where the second consonant of the root is doubled.
For most of the verbs, however, periphrastic means employing verbs such as
‘give’ and ‘make’ are used to express their causatives as shown in (30) and (31).
Note that the agent the causer acts upon is expressed in the form of a PP (i.e.
məʃa N) both in periphrastic causatives and in morphological causatives.
This preliminary survey of the basic properties of SA has shown that this variety
deviates from classical/standard Arabic significantly in a number of ways. SA
is not unique in this respect as other dialects of Arabic have also been noted to
differ notably from classical Arabic. Work done on Arabic dialects spoken in dif-
ferent countries, has revealed that these dialects exhibit a simplification in their
grammar, such as the loss of the dual form on verbs, adjectives and pronouns,
loss of case endings in nouns and adjectives, and loss of mood distinctions in
the verb, etc. (Kaye 2009: 562). SA follows suit, and exhibits the same simplifica-
tions as well as additional ones. Based on the limited data presented in the above
section, the following points stand out as some of the characterizing features
of SA:
222 Taylan
Even in a survey as brief as this one, SA as a contact language with the above
designated features raises questions regarding the social and demographic
structure of the region through time. It is well-known that the Armenian and
Kurdish populations have existed in eastern Turkey for a very long time, though
Armenian is the earlier language spoken in region. What is not exactly known
is when the Arabic speaking population or populations moved into the area.
One possible explanation for their presence may be the Arab conquests of
Mesopotamia, Middle East and North Africa after the 7th century and the inva-
sions in the Abbasid era. The linguistic consequence of this expansion would
be the introduction of Arabic as the superimposed language. Thus, the other
existing languages spoken in the conquered lands, such as the different dialects
of Aramaic, Armenian, Persian, Coptic, etc. must all have come into contact
with Arabic, in varying degrees of intensity. Whether the Arabic speaking pop-
ulation in Batman (Sason) can be tracked down to this first wave of Arab con-
quests or whether their presence in this area is due some later migration from
Basra (the date of which we do not know), as claimed by the informants, needs
to be researched. In either case, we can assume that the first Arabic settlers
brought the social and geographical variety they spoke with them; the question
then is whether this earlier variety of Arabic can be traced down or not.
Furthermore, if SA speakers in the past were exclusively Christians, as told
by our informants, then, who were these Christians? Were they Armenians or,
going back in history, could there have been Syriac/Aramaic speakers, who
lived in the area as well? Assuming Christian population(s) existed in the
Batman region before Arabic speakers appeared, they must then have been
exposed to Arabic over a long period of time, especially when Arabic became
the dominant language socially and/or politically, and thus learned this vari-
ety of Arabic. Or can the Christian SA speakers be traced back to Christian or
Jewish Arabic speakers moving into the region from Baghdad or other parts
of Iraq in an early period? Recall that both SA and the Christian and Jewish
dialects of Baghdad belong to the older qəltu-dialect group. In either scenario,
due to the cultural and linguistic diversity of the region, we can assume that
bilingualism emerged as a consequence of this prolonged social, political and
economic contact between communities speaking different languages, lasting
until today. Looking at the linguistic features of SA and the changes and sim-
plifications observed at different levels of grammar, asking whether SA today
is just another regional Arabic vernacular or whether it has emerged into a
distinct language would be a valid question. One hypothesis would be that
224 Taylan
long term, close contact with other languages in the area has led to borrowing,
language change and possibly to a certain degree of language mixing. If we
assume Arabic was the superstrate, we then need to determine what the sub-
strate languages have been, what layer(s) their presence is witnessed in and
what changes they have triggered or brought about. Another hypothesis could
be that the variety of Arabic spoken in the region, with time, emerged into a
creole, which would mean that SA has now become a separate language. For
this hypothesis to be pursued, however, we would need a thorough analysis of
SA and also information on the properties and structural features of creoles,12
bearing in mind that drawing the line between a creole and a non-creole is no
simple matter. This alternative may appear to be a far-fetched speculation, but
probably one that is worth investigating, as there are two attested Arabic based
creoles that have been identified, namely Nubi Arabic spoken in some regions
of Uganda and Kenya and Sudanese Creole Arabic (Juba).13
The major aim of this study has been to draw attention to the existence
of the unexplored and endangered Sason Arabic spoken in multilingual east-
ern Turkey and introduce its basic linguistic properties which reflect interest-
ing effects of language contact. We hope that this research will trigger interest
in the many issues and questions that call for further investigation.
References
Akkuş, Faruk. 2014a. Functional Categories and Phrase Structure of Sason Arabic. MA
Thesis, Boğaziçi University.
Akkus, Faruk. 2014b. Clause Structure in Contact Contexts: the Case of Sason Arabic.
Talk given at Boğaziçi University.
Bakker, Peter, Aymeric Daval-Markussen, Mikael Parkvall & Ingo Plag. 2011. Creoles are
typologically distinct from non-creoles. In Bhatt, Parth and Tonjes Veenstra (eds.),
Creoles and Typology. Special Issue of the Journal of Pidgin and Creole Languages, 26:
1, 5-42. Amsterdam: John Benjamins.
Bickerton, Derek. 1981. Roots of Language. Ann Arbor: Karoma.
12 Peter Bakker, et all. (2011) after surveying a large set of data and comparing structural
properties which are analogous in terms of complexity, claim that creoles constitute a
typologically distinct category of languages. They argue that creoles are more similar to
one other than they are to other languages, and thus can be identified in terms of their
particular structural features.
Also see Bickerton (1981) for the twelve properties of creoles that he identified.
13 127 pidgin and creole languages are listed in the repertoire of Hancock (1977) given in
Romaine (1988).
Language Contact In Anatolia: The Case Of Sason Arabic 225
Jastrow, Otto. 2006. Arabic dialects in Turkey: Towards a comparative typology. Türk
Dilleri Araştırmaları, 16: 153-164. Istanbul.
Jastrow, Otto. 2005. Anatolian Arabic. In K. Versteegh (ed.), Encyclopedia of Arabic
Language and Linguistics, vol. I: 86-96, Leiden: Brill.
Kaye, Alan. 2009. Arabic. In Comrie Bernard (ed.), The World’s Major Languages (2nd
ed.), 560-578. London: Routledge.
Romaine, Suzanne. 1988. Pidgin and Creole Languages. Longman Linguistics Library.
Talay, S. 2011. Arabic Dialects of Mesopotamia. In Weninger Stefan (ed.), The Semitic
Languages, 909-920. De Gruyter Mouton.
Appendix
Map 15.2
Sason as the northest district of Batman.
Chapter 16
1 Introduction
The Svan-speaking communities of Upper and Lower Svaneti have for centu-
ries been identified as – and identify themselves as – Georgians, even though
Svan speech is not mutually intelligible with Georgian. Depending on whom
you ask, or what sources you consult, Svan has 15,000, or 50,000, speakers, or
some number in between; is endangered, or is not; is a language, or merely a
dialect of Georgian; and its speakers are, or are not, a distinct ethnic group.
Svan is the outlier in the Kartvelian family, having probably separated from
the common ancestor in the Bronze Age. Svan shares the basic morphosyn-
tactic profile of Georgian – bipersonal verb, three series of tense-aspect-mood
paradigms, shifting case assignment by transitive and active intransitive verbs
(“split-ergativity”), a rich variety of dative-subject constructions, the gram-
matical category of “version” – but has very divergent vocabulary (Tuite 1997).
To give an impression of how impenetrable Svan sounds to Georgians from
elsewhere, here is an excerpt from a Svan folk poem with parallel Georgian
translation (Shanidze & Topuria 1939: 54)
The population of Svaneti in 2006 was 22889, of which 14270 in Upper Svaneti
(Mestia Municipality) and 8619 in Lower Svaneti (Lentexi Municipality). By
way of comparison, the population was estimated at 15000 in 1886, 9533 of
whom lived in Upper Svaneti. One recent estimate of the number of Svan
speakers gives a total of 26120, 14709 of whom speak an Upper Svan dia-
lect (Lower Bal or Upper Bal), and the remainder (11411) speak Lower Svan
(Lashx, Lent’ex or Cholur dialect) (Tschantladse, Babluani & Fähnrich
2003: 12).
Accounts of the present-day situation of the Svan language diverge signifi-
cantly. An article in the October 2014 issue of National Geographic paints a
pessimistic picture: an 86-year-old from the remote village Adishi is said to be
“one of the few remaining fully fluent speakers of Svan”, and a 14-year-old is
quoted predicting that “the Svan language will disappear with my generation”
(Larmer 2014). The latest version of Ethnologue (Lewis, Simons & Fennig 2015)
credits Svan with 15000 speakers. Its vitality is evaluated at level 7 (shifting)
on the EGIDS (Expanded Graded Intergenerational Disruption Scale), which
indicates the assessment that the language “is not being transmitted to chil-
dren”. (By comparison, Mingrelian has 500K speakers & EGIDS rating 6a “vig-
orous”; Laz has 22K speakers & EGIDS rating 6b “threatened”). Gippert’s 2005
report on “Endangered Caucasian languages in Georgia” is more optimistic.
The size of the Svan speech community is estimated at 50,000, but Gippert
and his colleagues noted a high incidence of code-mixing and code-switching
with Georgian.
I have had the opportunity to observe Svan usage in Upper Svaneti (mostly
in the commune of Latali), Lower Svaneti (during fieldwork in 1997), and in
two communities of Svans who were relocated to lowland Georgia (Axali
Xaishi & Jandari/Lemshvanier). In Lower Svaneti, children were mostly spo-
ken to in Georgian, although I encountered a handful of older women in
one of the more remote villages who spoke little if any Georgian. In Latali,
children speak and are spoken to in Svan, but several people expressed con-
cern about the extent to which a full command of Svan is being passed on to
the youngest generation. One friend in his mid-40s explained that, whereas
he and other of his age learned Georgian only after acquiring Svan as their
mother tongue, the newest speakers appear to be Georgian-dominant. Svan
is still the principal language within compact ‘diaspora’ settlements of Svans,
many of which are composed of people from specific villages. Here as well I
took note of children speaking Svan; I also had the opportunity to observe an
instance of local conflict resolution, which took place mostly in Svan, although
some participants preferred to speak Georgian. On the whole, the Svan language
remains prevalent where Svans live compactly in homogenous communities,
228 Tuite
but it is rarely heard when Svans move to the cities or migrate abroad in search
of work.1
1 According to Richard Bærug (pers. comm.), the language situation in the provincial capital
Mestia, which has become the center for a burgeoning tourist industry, differs somewhat
from that of the remaining Upper Svan villages. Georgian is widely used in everyday com-
munication, and some children seem to prefer it to to Svan.
Language And Emergent Literacy In Svaneti 229
2 Resistance increased in response to the stepped-up Russification campaign that followed the
succession of Tsar Alexander III to the throne in 1881. Kiril Ianovski, director of the Caucasus
Educational District (Kavkazskij učenbyj okrug) imposed the obligatory teaching of Russian
in Georgian primary schools, and also declared that the medium of instruction in schools
in Mingrelia should be Mingrelian, rather than Georgian, which was removed from the
curriculum.
3 Among those reacting negatively to the Tsarist literacy initiative for Kartvelian minority
languages was the writer Vazha-Pshavela, in the 1902 poem Vin aris k’aci?: “He sows enmity
between Kartli, Imereti and Kakheti [names of Georgian provinces], creates special alphabets
for Mingrelia and Svaneti” (sak’utar anbans šeudgens samegrelos da svanetsa).
4 For some reason, Svan poetry was an exception to this rule, as shown by the Georgian transla-
tions provided in the 1939 Shanidze-Topuria anthology, and the 12-volume Geo. Folk Poetry
collection.
Language And Emergent Literacy In Svaneti 231
would appear to have turned Svan writing inward. The Svan chrestomathies
published from 1910 to the end of the Soviet era are accessible only to a small
circle of Kartvelologists with the necessary training to decode them without
an accompanying translation, and the Svans themselves. The medium of writ-
ten Svan was limited to the transcription of oral literature and ethnographic
descriptions originally produced in that language, and not to be used for any
of the functions already assumed by Georgian.
The Svan orthography developed by Marr and his Georgian colleagues
aimed at phonetic precision. In addition to the obsolete Old Georgian let-
ters for [q] (<ჴ>) and [j] (<ჲ>), new characters were created to represent long
vowels (marked by a macron), [æ] (written as umlauted “a”) and [ə] (<ჷ>).
The linguists hesitated between “v” (<ვ>), “ü” (<ჳ>) and a special character
called “u-brjgu” (<Ⴓ̂>) to represent Svan [w], before settling on the last option.
Even before Marr’s new journal appeared, however, Gvedo Nak’an, a soldier
from Upper Svaneti serving in what is now Turkish territory, worked out a Svan
orthography of his own. Nak’an’s 1908 diary, published in the first volume of
the Svan Prose Texts series (Shanidze & Topuria 1939: 41-48), employed only the
letters available in the standard Modern Georgian alphabet. The schwa vowel
is simply omitted from the spelling, and [æ] is not distinguished from [a]. Long
vowels are occasionally written with double letters, but often not distinguished
from short vowels. The editors of the anthology “cleaned up” Nak’an’s spell-
ing to conform to their phonetically-precise orthographic standards for Upper
Bal Svan, but his original spellings can be found in the endnotes (Shanidze &
Topuria 1939: 459-462). Rather than being an approximative representation of
Svan speech cobbled together by a semi-literate writer, Nak’an’s orthography
is surprisingly adequate, as long as the reader understands the inner workings
of the language. The presence of the schwa vowel, for example, can almost
always be predicted from phonotactic constraints on consonant sequences.5
The raising of [a] to [æ] is provoked by an /i/ or /e/ in the following syllable in a
word’s underlying morphological structure. Georgian “v” is perfectly adequate
for representing Svan [w], since [v] and [w] are nonconstrastive allophones.
Long vowels, which only occur in the Upper Bal and Lashx dialects, are not
predictable, but their semantic load is low. Very few pairs of words are distin-
guished only by vowel length.
Thus, more than a century ago, two approaches to writing Svan were inde-
pendantly devised: a phonetically exact but phonologically redundant system
developed by and for linguists; and a phonetically imprecise but semantically
5 Schwa was also not written in the spelling of Svan place names in medieval manuscripts,
e.g. <lha> for Ləha, <pxt’ler> for Pxət’rer, etc. (Ingoroq’va 1941: 20, 129).
232 Tuite
adequate system created by and for Svan speakers. These are the two options
confronting the new generation of Svan writers today, as will be discussed in
the final section.
The 1897 Russian Imperial census included the number of individuals (pop.
15756) using Svan speech (“narechie”), which was listed in the Kartvelian lan-
guage category along with Mingrelian. Svans were also counted separately, as
a subgroup of Georgians, in the 1926 USSR census (pop. 13218). They were not
counted in Soviet censuses from 1939 onward, nor are Svans recognized as a
distinct group in the post-Soviet Georgian republic. Interestingly, the Svans
were counted separately in the 2002 and 2010 Russian Federation censuses
(pop. 41 in 2002, 45 in 2010), and also included in the total count of Georgians.
The issue of whether the Svans represent a distinct national or ethnic
group is also tied up with the curious debate – difficult for many outsiders
to understand – about the status of their speech: is Svan a “language” or a
“dialect”? The proponents of the latter opinion (see, for example, Putkaradze
2002, 2003) operate with an exclusivist concept of language inherited partly
from Giorgi Merchule, who defined Georgia on the basis of a shared liturgi-
cal language; and the prominent 19th-century intellectuals Chavchavadze and
Gogebashvili, who attributed a single “mother tongue” (deda ena, also the title
of Gogebashvili’s best-known school primer) to the Georgian people. A less-
noticed predecessor is the Soviet definition of nationality: Each officially rec-
ognized national group had a single “native language” (rodnyj jazyk), which had
a written form, and which was used in at least the initial years of education.
Furthermore, each individual was ascribed a single native language, whether
or not they had equal or greater competence in other languages.
If one accepts these presuppositions, it follows that the identification
of Svan as a “language” would be tantamount to recognizing the Svans as a
nationality distinct from the Georgians. By identifying Svan and Mingrelian as
“dialects” – even though acknowledging that they are not mutually intelligible
with Georgian – Putkaradze and others who share his views assert that they
serve the same function as Georgian dialects in the accepted sense (such as
Pshavian, Tushetian or Gurian); that is, as nonliterary vernaculars vis-à-vis the
single literary language of the Georgian nation. As did Gogebashvili, they define
Georgianness on the basis of linguistic affiliation (speakers of Kartvelian lan-
guages) rather than a shared liturgical language. As one would expect, propo-
nents of this view frequently recall the Tsarist educational policy of the 1880s
Language And Emergent Literacy In Svaneti 233
and 1890s, and the simultaneous publication of Svan and Mingrelian texts in
Cyrillic script, to justify their conviction that Russia is trying to stir up separat-
ist sentiment among Mingrelians and Svans as part of a “divide and conquer”
strategy in the South Caucasus.
Also arousing the suspicions of the Svan-as-Georgian-dialect camp are
reports that translations of the Bible into Mingrelian and Svan are planned or
even underway. This has drawn the attention of the Georgian Orthodox church
hierarchy, which officially condemned the publication of religious texts in
Kartvelian languages other than Georgian. The former Georgian ombudsman
Sozar Subari drew heavy criticism – not least from the Church leadership –
for having voiced support in 2005 for a projected Svan-language version of the
New Testament (more on which later).
One prominent target is the European Charter of Regional and Minority
Languages, which has yet to be ratified by Georgia 15 years after it joined the
Council of Europe (Putkaradze, Dadiani, Sherozia 2010). The Charter obliges
participating states to promote the use of regional languages in education, the
justice system, public services, media and culture. As defined by Charter, how-
ever, the category of “regional or minority languages . . . does not include . . . dia-
lects of the official language(s) of the State”, further reinforcing the position of
those who refuse to acknowledge Svan as a language.
Even as debate continues over the status and role of the Svan language, a new
manifestation of Svan literacy is emerging, which seeks to slip between the
Scylla of Georgian exclusivity, and the Charybdis of standardization – which
would entail the selection of one of the Svan dialects as the basis of the written
language, normalization of the orthography, decisions about what is or is not
“correct” Svan vocabulary and grammar, and the fixing of ground rules for the
creation of new words. More significantly, none of the contributors to contem-
porary Svan writing gives the slightest indication that their practice is in any
way incompatible with their identity as Georgians. In this concluding section,
I will present three recent initiatives in Svan writing.6
6 This is by no means an exhaustive list. Other instances of present-day written Svan usage
include blogs by local doctors on topics such as hepatitis and cancer; comments on
Svan-themed videos on YouTube; and posters announcing a skiing contest in Upper Svaneti.
234 Tuite
(And Jesus Christ’s birth thus (was) happened: His mother Mary was
Joseph’s betrothed and before their coming to each other, Mary has
become pregnant from the Holy Spirit)
<19. i ioseb, miča č’äš (leč’šəri), mac’vdi (mare) lasŭ, i made xek’vad eča
liušxe i ušdil ka lipšŭdes laxp’ire (//ka lipšŭded gŭi laxad).>
(And Joseph her husband (fiancé), was an upright (man), and did not
want to expose her, and he intended to secretly release her (//out release-
ADV heart come-to-him))
The Svan terms for “gospel” and “chapter” are calqued from Georgian saxareba
“joyful news” and tavi “head, chapter”, respectively. In verse 19, the translator
appears to hesitate between /la=x-p’ir-e/, employing a root borrowed from
Georgian a-p’ir-eb-s “intends”, and the more idiomatic /gwi la=x-a-d/ “heart
came to him”; and also between ascribing the role of husband or fiancé to
7 Written text is enclosed in angled brackets, with the following coding of languages and
scripts: <Svan in Georgian script>; <Svan in Latin script>; <Georgian in Georgian script>;
<Georgian in Latin script>. Phonological renderings are placed between /slanted bars/.
Language And Emergent Literacy In Svaneti 235
(If you are 12 to 18 years old (born in the years 1995-2001), you can take
part in the Svan literature competition for 2013! Write in Svan on any
theme – whatever you would prefer – your desires, your successes or
whatever you like. [. . .] The story must be from 4 to 16 pages or 2000 to
236 Tuite
8000 words. There will be gifts (prizes) for the best stories. Deadline: 1
March 2013.)
Not all comments posted on the contest’s Facebook page were positive: some
criticized “mistakes” in the use of Svan, or what they took to be unwarranted
code-mixing (Georgian loans in the announcement are marked by underlin-
ing). Another commentator cited the passage in the Georgian constitution
concerning the status of Georgian as sole official language. Other writers,
however, vigorously defended the competition (most of these comments were
posted in Georgian, but quite a few in Svan).8
Six young authors, aged 12 to 17, won prizes in the 2013 competition. The
texts composed by the prize winners, published in an anthology (Bærug 2013),
show interesting variation in orthographic style, since each author had to
work out his or her own norms for writing Svan. (This could also be said of the
authors of the texts accompanying the stories in the anthology, and the posters
and Facebook announcements promoting the contest). The most “authorita-
tive” models for writing Svan are the 20th-century anthologies compiled by
linguists, who, as noted earlier, aimed for a fairly explicit representation of
the pronunciation. Some of the young writers – especially 12-year-old Erek’le
and 14-year-old Mari – appear to have been influenced by the linguists’ ortho-
graphic norms, including their use of apostrophes to mark vowel syncopation
when a clitic is attached to the following word (e.g. <ž’eser> = /ž(i) eser/ “in
QUOT”). Two writers however devised a phonologically-based orthography
reminiscent of that used in the 1908 soldier’s diary. 17-year-old Jemal tended
not to write schwas in contexts where they were automatically inserted before
resonants (/x-a-k’pən-x/ “offers” written <xak’pnx>; /daqəls/ “goat-DAT” writ-
ten <daqls>), or otherwise predictable. On the other hand, schwa was usually
written when it functioned as the root vowel of a word (e.g. <ɣən> “festival”).9
One writer from the Lower Bal dialect area, 17-year-old Giorgi, employed
phonetically-precise spellings, but not necessary those favored by the linguists.
8 Responding to previous comments criticizing the quality of the Svan used in the competi-
tion announcement, one user posted the following sarcastic remark in Svan: <si xochaamd
atdawy lushnud i echqaango axgacxad qa; konkurs> (You have such a good command of
Svan, so announce your own competition).
9 It is worth noting that Sopho and Jemal – the two authors who favored phonological
spellings – are the grandchildren of Goguca Xergiani, now 80 years old, who was one of the
pioneers of the newest phase of Svan writing. She is the author of a 2-volume collection of
Svan-language poetry and prose, “Maxvshi Baba”; vol 1, 1999; vol 2, 2004.
Language And Emergent Literacy In Svaneti 237
To represent the schwa vowel, Giorgi chose the Georgian letter “o” with an
umlaut sign, implying that he pronounced schwa with a degree of lip-rounding
(<löxins> = /ləxin-s/ “good times-DAT”; <daqöl> = /daqəl/ “goat”).10
The orthographic styles of the six prize-winners can be summarized by
comparing three parameters: (1) representation of schwa, (2) representation
of the glide [j], (3) use of apostrophes to signal vowel loss at the point of word
liaison:
10 This vowel was described by the phonetician S. Zhghent’i (1949: 65-66) as a “delabialized
/u/” ([ɯ] or [ɤ]), which is how it sounds to me. One wonders if some Svan speakers are
manifesting the same trend toward rounded schwa as has occurred in some varieties of
European French.
238 Tuite
(a). The first example is a posting headed <talibani mulaxeli q’opila, icodit?>
(The Taliban is from Mulakh, didn’t you know?), accompanied by a photograph
of a man holding a gun. This elicited a sequence of joking comments in both
Georgian and Svan, including a Russian adjective as well. Some participants
switched codes within a single intervention.
11 My sample – which includes groups based in Lower as well as Upper Svaneti – is of course
biased toward people I know personally, or friends of friends.
Language And Emergent Literacy In Svaneti 239
CD: <o---s deesa, xadu k---alšaal isgiidraal> (Not O-, rather he looks a lot
like the K- family) [reference to AB’s relatives]
EF: <k’rasni mlax xo iq’o da axla taliban . . . t’ašiii> :))) (There was [R: red]
Mulakh and a now a Taliban . . . Applause :))))). [This refers to a previous
posting where the expression “Red Mulakh” appeared, albeit entirely in Svan].
CD: <čven “švania txvim” vart, is k’i ara, ra kvia> . . . :))) (We are “the head of
Svaneti”, this is not, what’s it called . . .)
EF: <uoiiiii uoiiiiii dedee>
GH: <mulaxši tu mest’iaši imaleba> (Is he hiding in Mulakh or Mestia?)
[Mulakh and Mestia are neighboring communes in Upper Svaneti].
IJ: )))))))))))))
(b). A prayer for one’s brothers. The following text, also in Georgian script, was
posted by KL in early February 2014, at the time of the mid-winter torch festi-
val (Limp’æri). At this time, Svan men carry lit torches to their neighborhood
church, one for each male in the family, and pray for the peace and well-being
of the community’s menfolk.
The posting was followed by two responses, one in Svan and one in Georgian:
KL’s orthography ignores the distinction between long and short vowels, and
that between [æ] and [a]. He represents schwa with the Georgian letter “u”
(e.g. <ǰgurags> = /ǰgərǟgs/ “St-George-DAT”). MN, however, leaves a space in the
middle of the word <ǰamz ri> where schwa appears (/ǰamzəri/ “blesses you”).
240 Tuite
5 Conclusion
Acknowledgments
12 Standardization is also not strictly necessary in the case of Svan, since the dialects are
largely intelligible with each other. The principal differences concern vowel length (which
as mentioned has little semantic load), umlaut, sycope and some aspects of morphology.
Selection of the Upper Bal dialect spoken in Mestia – the economic and administrative
center of Upper Svaneti – as the basis for a standard language would oblige speakers of
the Lower Bal and Lentekh dialects to learn to write long vowels which do not exist in
their vernaculars.
Language And Emergent Literacy In Svaneti 241
Bibliography
Abuladze, Ilia, chief ed. 1963. Grigol xandztelis cxovreba (The Life of Grigol of Xandzta).
Dzveli kartuli agiograpiuli lit’erat’uris dzeglebi, c’igni I (V-X ss.), 248-319. Tbilisi:
Mecniereba.
Bardavelidze, Vera. 1941. kartvelta udzvelesi sarc’munoebis istoriidan: ɣvtaeba barbar-
babar (From the history of the ancient beliefs of the Georgians: the deity Barbar-
Babar). Tbilisi: Mecniereba.
Bærug, Richard, ed. 2013. lušnu bopšre lit’erat’ura / svanuri sabavšvo lit’erat’ura/ Svan
youth literature. Tbilisi: Sulakauri Publishing.
Cagareli, A. 1873. O grammatičeskoj literature gruzinskago jazyka. Kritičeskij očerk.
Sanktpeterburg: Imperatorskaja Akademija Nauk.
Davitiani, A., V. Topuria and M. Kaldani, eds. 1957. svanuri p’rozauli t’ekst’ebi II: balskve-
mouri k’ilo (Svan prose texts, II: Lower Bal dialect). Tbilisi: Mecniereba.
Gippert, Jost. 2005. Endangered Caucasian languages in Georgia. Lessons from
Documented Endangered Languages, David K. Harrison, David S. Rood & Arienne
Dwyer, eds. Amsterdam 2008, 159-194.
Gogebashvili, Jakob. 1876/1976. deda ena, anu anbani da p’irveli sak’itxavi c’igni (Mother
tongue, or Alphabet and first reader). Tbilisi: Ganatleba.
Gogebashvili, Jakob. 1912/1976. bunebis k’ari. mesame da meotxe c’lisatvis (The Gate of
Nature. For third and fourth grade). Tbilisi: Ganatleba.
Gren, A. N. 1890. Svanetskie teksty. Sbornik materialov dlja opisanija mestnostej i plemën
Kavkaza X, Otdel 2: 76-160.
Güldenstädt, Johann Anton. 1787. Reisen durch Russland und im Caucasischen Gebürge.
Vols. I-II. St. Petersburg: Kayserliche Akademie der Wissenschaften.
Ingoroq’va, P’avle. 1941. svanetis saist’orio dzeglebi (The historical documents of
Svaneti). Tbilisi: Mecniereba.
Klaproth, Julius von. 1814b. Kaukasische Sprachen. Anhang zur Reise in den Kaukasus
und nach Georgien. Halle & Berlin: Buchhandlung des Hallischen Waisenhauses.
Larmer, Brook. 2014. Medieval mountain hideaway. National Geographic, October 2014,
78-99.
Lewis, M. Paul, Gary F. Simons, and Charles D. Fennig (eds.). 2015. Ethnologue:
Languages of the World, Eighteenth edition. Dallas: SIL International. Online ver-
sion: http://www.ethnologue.com.
Lušnu anban / Svanetskaja Azbuka (Svan alphabet) 1864. Tiflis: Tipografija Glavnago
Upravlenija Namestnika Kavkazskago.
Nizharadze, Besarion. 1962 (Vol I), 1964 (Vol II). ist’oriuli-etnograpiuli c’erilebi.
(Historical and ethnographic essays). Tbilisi: Tbilisi State University Press.
Nizharadze, G. & I. 1894. Svanetskie teksty. Sbornik materialov dlja opisanija mestnostej
i plemën Kavkaza XVIII: 91-132.
242 Tuite
Nizharadze, Iv. 1890. Svanetskija skazki. Sbornik materialov dlja opisanija mestnostej i
plemën Kavkaza X, Otdel 2: 161-241.
Nizharadze, I. I. 1910. Russko-svanskij slovar’. Sbornik materialov dlja opisanija mest-
nostej i plemën Kavkaza XLI: 1-520.
Oniani, A., M. Kaldani and Al. Oniani, eds. 1979. svanuri p’rozauli t’ekst’ebi IV: lašxuri
k’ilo (Svan prose texts IV: Lashx dialect). Tbilisi: Mecniereba.
Otčet 1864. Otčet o sostojanija i deistvijax Otdela s 1859 do 1863. Zapiski Kavkazskago
Otdela Imperatorskago Russkago Geografičeskago Obščestva Knižka VI: 1-42.
Put’k’aradze, T’ariel. 2002. kartvelebi, zogadkartuli samc’ignobro ena da kartvelta
dialekt’ebi [The Georgians, the Common-Georgian literary language and the
Georgians’ dialects]. Kartveluri memk’vidreoba 6.187-203.
Put’k’aradze, T’ariel. 2003. monatesave enobriv erteulta k’valipik’aciis sak’itxisatvis
tanamedrove mecnierebaši [On the question of the qualification of related linguis-
tic units in contemporary science]. Kartveluri memk’vidreoba 7.206-227.
Put’k’aradze, T’ariel, Dadiani, Ek’a & Sherozia, Revaz. 2010. evrop’uli kartia regionuli an
umciresobis enis šesaxeb da sakartvelo (The European Charter for Regional or
Minority Languages and Georgia). Kutaisi: National Institute of Education.
Savenko, E. A. 2010. Iz istorii dejatel’nosti obščestva vosstanovlenija pravoslavnogo
xristianstva na Kavkaze (k 150-letniju osnovanija). Universitetskie čtenija 2010:
materialy naučno-metodičeskix čtenij Pjatogorskogo gosudarstvennogo ling-
vističeskogo universiteta. Pjatigorsk, 15. pp 91-97.
Shanidze, A., Kaldani, M. & Ch’umburidze, Z., eds. 1978. svanuri enis krest’omatia.
(A chrestomathy of the Svan language). Tbilisi: Tbilisi University Press.
Shanidze, A., V. Topuria & M. Gujejiani, eds. 1939. svanuri p’oezia (Svan poetry). Tbilisi:
Mecniereba.
Shanidze, A. & V. Topuria, eds. 1939. svanuri p’rozauli t’ekst’ebi I: balszemouri k’ilo (Svan
prose texts, I: Upper Bal dialect). Tbilisi: Mecniereba.
Silogava, Valeri. 1986, 1988. svanetis c’erilobiti dzeglebi (X-XVIII ss). (The written docu-
ments of Svaneti, 10th-18th centuries). Tbilisi: Mecniereba.
Tepcov, V. Ja. 1890. Svanetija (Geografičeskij očerk). Sbornik materialov dlja opisanija
mestnostej i plemën Kavkaza X, Otdel 1: 1-68.
Thurston, William R. 1987. Processes of change in the languages of north-western New
Britain. Pacific linguistics. Series B no. 99. Canberra: Research School of Pacific
Studies, Australian National University.
Topuria, V. & Kaldani, M. eds. 1967. svanuri p’rozauli t’ekst’ebi III: lent’exuri k’ilo. (Svan
prose texts, III: Lent’ex dialect). Tbilisi: Mecniereba.
Tschantladse, Isa, Rusudan Babluani & Heinz Fähnrich. 2003. Tscholurswanisch-
Deutsches Verbenverzeichnis. Jena: Friedrich-Schiller-Universität.
Tuite, Kevin. 1997. Svan. (Languages of the World / Materials, vol. 139); München:
Lincom Europa.
Language And Emergent Literacy In Svaneti 243
Tuite, Kevin. 2008. The Rise and Fall and Revival of the Ibero-Caucasian Hypothesis.
Historiographia Linguistica, 35 #1; 23-82.
Vaxusht’i Bat’onishvili. Aɣc’era sameposa Sakartvelosa (Description of the Kingdom of
Georgia). Kartlis cxovreba, IV. (S. Q’auxchishvili, ed.) Tbilisi. Sabch’ota Sakartvelo,
1973.
Xergiani, Goguca. Maxvshi Baba; vol 1, 1999; vol 2, 2004.
Zhghent’i, Sergi. 1949. svanuri enis ponet’ik’is dziritadi sak’itxebi. (Principal questions of
Svan phonetics). Tbilisi: Mecniereba.
Chapter 17
Karina Vamling
1 Introduction
of the language, as this offers a new (and potentially growing) domain for the
use of the language:
During a short period in the late 1920s and early 1930s the local Megrelian com-
munist party elite aimed at establishing an Autonomous Megrelian Region
and published newspapers in Megrelian. However, the plans were abruptly
stopped, as were the Megrelian newspapers (Gamsakhurdia 1989). From the
end of the 1930s until the latest decades very little research was published
on Megrelian for political reasons. However, several studies of Megrelian
have been published in the post-Soviet period. Among them are a grammar
sketch in English by Harris (1991), a Megrelian–Georgian dictionary in three
volumes (Kajaia 2001-2002), a recent Megrelian–Russian–Georgian dictionary
(Klimov and Kajaia 2013), a reading grammar of Megrelian–Laz in Georgian
(Danelia and Dundua 2006), grammatical descriptions of Megrelian (Kartozia
et al. 2010; Lomaia and Gersamia 2012a) and studies of Megrelian grammar by
the present author (Tchantouria and Vamling 2005, Vamling and Tchantouria
1991, 1993; Vamling 2005) and of use of the language in different domains of soci-
ety (Vamling 2000; Vamling and Tchantouria 2010). Collections of Megrelian
folklore and tales have been published by Kipshidze (1914), Khubua (1937) and
later by others such as Lomaia and Gersamia (2012b).
The increased use of the Internet has opened new communication channels.
Generally, the Internet is highly available in Georgia and particularly in Tbilisi
(IDFI 2013). The Internet penetration in Georgia in 2014-2015 is estimated to be
49% in a recent report presented by Freedom House (2015). They note in their
comments: “Internet access and usage continues to grow rapidly in Georgia,
particularly as interest in connecting with friends through social-networking
sites has increased in recent years.”
For the Megrelians, this increased access to the Internet has meant new –
virtual – possibilities for contacts across borders. What impact has this had for
the use and development of the language? Megrelian is their natural choice
for communication at home and in informal contexts. To a large extent,
computer-mediated communication belongs to this sphere as well. However,
Megrelian has no standardised literary form as the language is not used in
printed media, education and administration. Despite this, a new domain
is emerging for the language in the form of the spontaneous and non-
standardised use of Megrelian on the Internet.
A preliminary study has been conducted of interactions in Megrelian online
forums such as Facebook and YouTube. In total, approximately 800 entries of
the Internet as a Tool for Language Development and Maintenance 249
various lengths have been collected. In such interactions, it has been observed
that elements are often combined in Megrelian, Georgian, Russian and even
Laz. All three scripts, Georgian, Latin and Cyrillic, are used frequently, and
switching between languages is common.
1 The Georgian script does not distinguish small and capital letters. This principle is kept in the
glosses as well.
250 Vamling
It is also the case that different languages may be used within the same thread
of the conversation (FB margaluri nina). In thread 3, the person “Z” opens the
conversation in Megrelian and continues in Georgian. The person “H” answers
in Megrelian, and then “Z” closes the short thread in Georgian.
300 postings, 18% were made by outsiders, persons who were not “friends”
of “margaluri nina”. In such cases, comments were almost always made in
Georgian. In the remaining 246 comments, i.e. comments made by “friends”
(76 persons), the ratio is that approximately 60% of the comments were writ-
ten in Megrelian and 40% in Georgian. Furthermore, the majority of the active
“friends” are Megrelians according to features mentioned above. Three quar-
ters of the active “friends” have surnames ending in -ua, -ava or -ia.
It has also to be taken into account that the conversation on the Facebook
page “margaluri nina” is public and not person-to-person in the ordinary
sense, which means that the participants are aware that when they choose to
write in Megrelian, comments are understood only by those who are fluent in
Megrelian, whereas when they write in Georgian, they know it will be under-
stood by all, whether they have a Megrelian background or not.
5 Concluding Remarks
The general observation that can be made on the basis of this limited study
of the presence of Megrelian on the Internet is that it is quite diverse. Megrelian
is found in social media, in video clips and in various (private) initiatives to
create online publications of literature, folkloristic and language materials,
often used in different combinations (i.e. Megrelian videos and online pub-
lications are promoted and discussed in social media). Moreover, the interest
in Internet communities and web pages with a Megrelian profile seems to be
increasing quite substantially. Social media is a natural arena for the younger
generation, and it functions as a complement in maintaining increasing, often
transnational, community networks. In this sense, the Internet functions as a
tool for the development of the language.
Social media, with its conversation-like mode of communication, pro-
vide a situation that is close to casual oral communication – one of the main
domains of the use of Megrelian. The observed use of Megrelian in chats and
social media is thus a challenge to the written/oral divide in the Georgian/
Megrelian diglossia, and a sign of that it might be in the process of changing.
The Internet and CMC may create and maintain a common arena of Megrelian
the Internet as a Tool for Language Development and Maintenance 255
Abbreviations
References
Baron, Naomi S. 2008. Always On: Language in an Online and Mobile World. Oxford:
Oxford University Press.
Broers, Laurence. 2001. “Who are the Mingrelians? Language, Identity and Politics in
Western Georgia.” Paper presented at the Sixth Annual Convention of the
Association for the Study of Nationalities 2001. Accessed September 14, 2016. http://
georgica.tsu.edu.ge/?p=319.
Chervonnaya, Svetlana. 1994. Conflict in the Caucasus. Georgia, Abkhazia and the
Russian Shadow. London: Gothic Image Publications.
Chikobava, Arnold. 1936. č’anuris gramat’ik’uli analizi. Tbilisi: ssrk’ mecnierebata
ak’ademiis sakartvelos pilialis gamocema.
Crystal, David. 2001. Language and the Internet. Cambridge: Cambridge University
Press.
Crystal, David. 2002. Language Death. Cambridge: Cambridge University Press.
Danelia, Nana and Inga Dundua. 2006. k’olxuri (megrul-lazuri) ena. Tbilisi: Universali.
Deeters, Gerhard. 1930. Das kharthwelische Verbum. Vergleichende Darstellung des
Verbalbaus der südkaukasischen Sprachen, Leipzig.
“Endangered languages: The Full List.” The Guardian Datablog. Accessed January 29,
2013. www.guardian.co.uk/news/datablog/2011/apr/15/language-extinct-endangered.
Freedom House. 2015. “Georgia.” Accessed October 30, 2015. https://freedomhouse.org/
report/ freedom-net/2015/georgia
Gamsakhurdia, Zviad. 1989. “samegrelos sak’itxi”. isak’i žvania da samegrelos
“avt’onomia”. Lit’erat’uruli sakartvelo, 3 Nov: 6-8.
Green, Leila. 2010. The Internet: An introduction to New Media. New York: Berg.
Gvantseladze, Teimuraz. 2006. enisa da dialekt’is sak’itxi kartvelologiaši. Tbilisi:
universali.
Harris, Alice C. 1991. “Mingrelian.” In: The Indigenous Languages of the Caucasus. Vol. 1,
The Kartvelian Languages, 313-394. Delmar N.Y.: Caravan Books.
256 Vamling
Internet Sources
1 This paper was presented at the 1st International Caucasus University Association Conference
on Endangered Languages at Ardahan Üniversitesi on the 15th of October 2014.
Lamarckian. Not only were Wallace and Darwin both deeply influenced by the
1844 English popularisation of Lamarck’s work, entitled Vestiges of the Natural
History of Creation, Darwin explicitly counted ‘the inherited effects of use and
disuse’ as being amongst the ‘general causes’ and ‘general laws’ which govern
whether or not variations are transmitted to offspring (1871, i: 9). Darwin’s
views are clearly spelt out in the Descent of Man (e.g. 1871, i: 116-121). He con-
ceived of ‘natural selection’ as ‘the chief agent of change, though largely aided
by the inherited effects of habit, and slightly by the direct action of the sur-
rounding conditions’ (1871, i: 152-153).
With respect to the inheritance of characteristics acquired during the life-
time of an organism, Darwin was just as much a Lamarckian as Lamarck. As
the celebrated linguist Friedrich Max Müller pointed out, ‘Darwin’s real merit
consisted, not in discovering evolution, but in suggesting new explanations of
evolution, such as natural selection, survival of the fittest, influence of environ-
ment, sexual selection, etc.’ (1889: 273). Meanwhile, in light of the promiscu-
ous intricacies of molecular genetics, the old polemic about Lamarckian vs.
Darwinian evolution today appears a trifle dated, for our understanding of
evolutionary dynamics has progressed well beyond such a simplistic confron-
tation of dogmas.
Charles Darwin’s On the Origin of Species was published on 24 November
1859. The German translation by the palaeontologist Heinrich Georg Bronn
appeared in 1860 as Über die Entstehung der Arten. The maverick German
biologist Ernst Haeckel sent a copy of the German translation to his friend,
the linguist August Schleicher. Inspired by this work, Schleicher adopted the
view of individual languages as species, which compete against each other ‘im
Kampfe ums Dasein’ (1863). A modern proponent of Schleicher’s view of lan-
guages as species subject to natural selection is Salikoko Mufwene (2001, 2005a,
2005b). By contrast, Friedrich Max Müller conceived language as such to be an
organism. On the 6th of January 1870, in the very first issue of Nature, Müller
took issue with Schleicher’s idea of language survival in terms of ‘die Erhaltung
der höher entwickelten Organismen’ and instead argued that language survival
was a more complex issue.
Although this struggle for life among separate languages exhibits some
analogy with the struggle for life among the more or less favoured spe-
cies in the animal and vegetable kingdoms, there is this important dif-
ference that the defect and the gradual extinction of languages depend
frequently on external causes, i.e. not on the weaknesses of the languages
themselves, but on the weakness, physical, moral or political, of those
260 van Driem
who speak them. A much more striking analogy, therefore, than the
struggle for life among separate languages, is the struggle for life among
words and grammatical forms which is constantly going on in each lan-
guage. Here the better, the shorter, the easier forms are constantly gain-
ing the upper hand, and they really owe their success to their inherent
virtue. (1870: 257)
2 Linguistic Topography
(1) The domains of use of a language and the facility of use of the language
(2) What Wilhelm von Humboldt called the Inhalt of a language
(3) The demographics of the human population using the language as a
mother tongue.
(4) The socio-economic situation of the language community in relation to
competing or neighbouring language communities
on occasion still hear similar views innocently expressed by people with regard
to their own native language, which for obvious reasons strikes them as being
the most apt and most natural of all languages.
Yet for reasons which have nothing to do with the aptness, richness or preci-
sion of expression of a language, a language community may cede domains of
usage to the tongue of another language community. The different sociolin-
guistic situations in which speakers of a language either decide to surrender or
acquiesce to ceding a domain of language use to another tongue merit iden-
tification and study. Let us look at one such case, which is ongoing and easily
observable. In 1989, Jo Ritzen became Minister of Education and Sciences in
The Hague. Ritzen introduced the idea and later the practice of using English
medium in university education in the Netherlands. Hitherto most scientific
discourse, whether in experimental physics, astronomy, theoretical physics,
microbial genetics, cell physiology, economics, medicine or linguistics, had
essentially been conducted almost exclusively in Dutch. The language has for
centuries had a continually expanding arsenal of precise specialised lexical
terms in the sciences. Antoni van Leeuwenhoek did not bother to translate his
letters to the Royal Society in London into English.
In terms of precision or richness of expression, nothing whatsoever is
gained by replacing Dutch terms such as eiwitmantel ‘capsid’, geleedpotigen
‘arthropods’, holtedieren ‘coelenterates’, tweezaadlobbigen ‘dicotyledons’, cel-
vocht ‘cytoplasm’, bedektzadigen ‘angiosperms’, achterhoofdskwab ‘occipital
lobe’, traagheid ‘inertia’ and trage massa ‘inertial mass’ with their English
equivalents. In fact, it can be argued quite defensibly that the English forms are
inferior because of their semantic opacity. The motive behind Ritzen’s policy
was to tap into a lucrative global education market. The use of English medium
in tertiary education enables Dutch universities to sell Bachelor’s, Master’s and
Doctoral programmes more competitively to international students, just as do
the universities in the Anglo-Saxon countries. Yet Ritzen’s policies have set into
motion the ultimate surrender of a vital domain of the Dutch language and
may even have sounded the knell for Dutch as a language of science.
As a language of science, Afrikaans has been able to piggy-back on Dutch,
with its over twenty-four million native speakers in the Netherlands, Belgium,
the West Indies and Surinam. Afrikaans has nearly seven million native speak-
ers, and policy makers in the Afrikaans language community have always
been perceptive enough to recognise the importance of using their language
as a medium of science. Scientific articles written in Afrikaans bearing titles
such as Die klassifikasie van ’n sianoprokarioot deur van ligmikroskopie, trans-
missie elektronmikroskopie en molekulêre tegnieke gebruik te maak ‘The clas-
264 van Driem
2 In respective order, the authors of the articles named are L. Labuschange, M. Wescott, S. du
Plessis, A. Venter and A. Levanets; E. Moseley, T. Steynberg and M. Coetzee; P. Bipath and
M. Viljoen, and the issue cited is Suid-Afrikaanse Tydskrif vir Natuurwetenskap en Tegnologie,
Jaargang 28 No. 2: Junie 2009.
Linguistic Topography And Language Survival 265
course, Nepal is one of the few countries in Asia which managed to safeguard
its sovereignty intact throughout the age of European colonial expansion.
In India, by contrast, English is generally used instead of the Hindi neol-
ogisms that have been coined to express certain notions. Of course, Hindi
also has long coined neologisms, such as सूक्ष्म-दर्षक यन्त्र sūkṣma-darṣak yan-
tra ‘microscope’ or द ूरभार संखया dūrbhāṣ-saṅkhyā ‘telephone number’, except
that these terms generally remain unused. In terms of ease or convenience,
English telephone number may perhaps have little to recommend itself in pref-
erence to the Hindi neologism, which is just as apt. However, often enough
the Hindi neologism is so extraordinarily clumsy as to render the coinage
definitively unusable in any natural register of spoken language other than
satire, such as भुमष्मगत पैदल पार पथ bhumigat paidal pār path ‘underground foot
crossing path’ for ‘subway’, often seen written on signage in Delhi. Despite the
far greater number of native speakers of Hindi, the linguistic topography of
Nepali today is immeasurably healthier than that of Hindi, for Hindi has ceded
numerous domains to English. The contrast can be most vividly illustrated in
cases where Nepali and Hindi happen to use the same neologisms. Speakers of
Nepali will usually be heard to say विश्वविद्ालय viśvavidyālaya ‘university’ and
संग्रहालय saṅgrahālaya ‘museum’ in normal speech, whereas speakers of Hindi
are far more likely than not to say what I have sometimes even seen written in
Devanāgarī script as yunivarsiṭī ‘university’ and myuziyum ‘museum’.
Whilst protagonists in Hindi films and speakers in Hindi talk shows glibly,
perennially and almost invariably shift from English to Hindi and back, often
within the same sentence, natural Nepali speech is seldom if ever charac-
terised by the same coquettish code switching. The situation is yet different
again in Bhutan, where Dzongkha has the status of national language and
has long been used in legal, political and religious contexts as a spoken lan-
guage throughout the kingdom. Native to western Bhutan, Dzongkha is also
used throughout the country in official contexts. Dzongkha has only in recent
history become a written language, although some traditionalist advocates
might contend that the language has been used in writing for centuries under
the guise of its literary exponent Chöke, which in reality, however, is a distinct
language, the Classical Tibetan liturgical tongue.
In its traditional domains, the Dzongkha and Chöke terms are often identi-
cal, and Dzongkha suffers from no dearth of vocabulary for notions such as
བཀའ་ཤོག་ (bKaḥ-śog) kasho ‘edict, royal decree’, སྤྲུལ་སྐུ་ (sPrul-sku) trüku ‘reincar-
nation’ or དབང་ (dBan̂ ) ’wang ‘empowering benediction’. Dzongkha struggles
to colonise domains which in Bhutan are presently dominated by English. In
the political and administrative realm, Dzongkha neologisms have made easy
inroads, e.g. རྒྱལ་ཡོངས་ཚོགས་འདུ་ (rGyal-yon̂ s Tshogs-ḥdu) gäyong tshôdu ‘national
266 van Driem
j, འཇ་ (ḥJ) j, མཇ་ (mJ) j, རྒྱ་ (rGy) j, འགྱ་ (ḥGy) j, བརྒྱ་ (brGy) j, སྒྱ་ (sGy) j, བསྒྱ་ (bsGy) j
and སྦ་ (sBy) j.
Practical experience has amply demonstrated that Dzongkha spelling is
experienced as being overly complicated for Bhutanese learners. The complex-
ity of Dzongkha spelling hampers the use of Dzongkha in new media such as
internet chats, text messages and email. The use of ad hoc romanisations is
often experienced as being so unsystematic in nature that in practice English
is most usually used instead. A phonological orthography of Dzongkha in the
native Bhutanese script will be publicly presented this year for the first time
(Karma Tshering and van Driem, forthcoming). Hopefully this phonologically
consistent spelling system in the ’Ucen script, called Phonological Dzongkha,
will, alongside Roman Dzongkha, enhance the facility of use of the national
language in contemporary written media.
Another challenge is that the Bhutanese educational system has severely
restricted the domains into which Dzongkha has been permitted to venture.
When the first two secular schools were opened in Bhutan during the reign
of འབྲུག་རྒྱལཔོ་ཨོ་རྒྱན་དབང་ཕྱུག་ King ’Ugä ’Wangchu (imperabat 1907-1926), Hindi was
chosen as the medium of instruction because of the ready availability of inex-
pensive textbooks. Chöke remained the medium of instruction in the lamasery
schools. In 1961, འབྲུག་རྒྱལཔོ་འཇྲིགས་མེད་རོ་རྗེ་དབང་ཕྱུག་ King Jimi Dôji ’Wangchu decreed
that Dzongkha was the national language. At one level, this decree simply
recognised the status quo. At a deeper level, the intent was vernacularisation,
a move away from Chöke to living Dzongkha. Another aim was to eradicate
instruction in Hindi.
Until 1971, the ‘Dzongkha’ taught in the schools was in fact Chöke. As a con-
sequence of the royal decree of 1961, new English-medium textbooks were
especially developed for the Bhutanese schools. These new course books
replaced the Hindi textbooks in 1964. In 1971, the རོང་ཁ་ཡར་རྒྱས་སེ་ཚན་ Dzongkha
Division of the ཤེས་རྲིག་ལས་ཁུངས་ Department of Education was established in
order to develop materials for instruction in Dzongkha. Textbooks and learn-
ing materials in Dzongkha were developed at a rapid pace for both primary
and secondary education. Initially, English remained the medium of instruc-
tion for subjects other than Dzongkha, but nowadays virtually all subjects are
taught in English. Only Dzongkha is taught in Dzongkha as well as some mod-
ules of certain subjects such as history and geography. Bhutan in effect chose
a language policy in formal education diametrically opposed to the Malaysian
policy of replacing English with Malay as the medium of formal education,
including the coining of Malay neologisms for scientific terms. The results
is that, with the exception of remote villages, young and upwardly mobile
Bhutan, rather than Singapore, is the most English speaking country in Asia
268 van Driem
today. In language endangerment, the loss and gain of domains of use repre-
sent one dimension determining the viability of a language and its potential
for survival.
Both ceding domains of use or failing to colonise new domains of use create
a linguistic topography that is less favourable to the survival of a language.
However, when neologisms merely denote new entities which have come
into use in our material culture, then these new coinages do not enrich the
notional repertoire of the language. Whilst French has ordinateur, Czech has
počítač and Afrikaans has rekenaar, Dutch seems to make do with computer,
and Japanese fares well with コンピュータ konpyūta. The use of native roots
in coining apt and facile neologisms attests to the creativity and vitality of a
language, especially when these coinages catch on by their own virtue and are
not enforced by top-down measures, although administrative interventions
too quite often prove effective. Yet these precise translation equivalents for
referring to newly invented objects do not enhance the conceptual repertoire
of a language more than would an English loan word. They fail to augment or
diversify what Wilhelm von Humboldt called the Inhalt of a language.
The research programme spearheaded by Wierzbicka and Goddard sought to
identify shared semantic primitives presumed to be common to all languages.
Both Wierzbicka and Goddard as well as the participants in their research pro-
gramme earnestly believed in the existence of semantic primes, yet they were
unable to demonstrate the existence of shared universal categories of meaning
without resorting to the methodologically indefensible ploys of polysemy, allo-
lexy and so-called non-compositional polysemy in order to ‘find’ the purported
‘exponents’ of the hypothetical primes (van Driem 2004). The negative result of
their quest represents one of the most significant contributions to linguistics in
recent years, for their inadvertent and unwanted finding provides the strongest
corroboration to date for the theory of linguistic relativity developed in the
writings of Pierre de Maupertuis (1698-1759), Wilhelm von Humboldt (1767-
1835) and other linguists and subsequently popularised in North America by
Edward Sapir (1884-1939) and Benjamin Whorf (1897-1941). Grammatical and
lexical meanings in different languages generally tend to embody semantically
non-equivalent notional repertoires, and part of the resistance to the work on
Pirahã by Daniel Everett stems from a lingering but recalcitrant reluctance to
accept his empirical findings in many linguistic quarters still today.
Linguistic Topography And Language Survival 269
The notional repertoire of English today is not the same as it was at the
time of King Alfred. The categories of meaning available to an English speaker
today, whether grammatically or lexically expressed, are not at all congruent
with those available to a speaker of Old English in the 9th century. Whilst the
language of Kind Alfred lives on today in the form of modern English by virtue
of an unbroken continuity of speech history, it can also defensibly be stated
that Old English is a dead language. Latin too is conventionally termed a dead
language, although through a continuous unbroken line of use the language
still exists as modern French, Romanian, Portuguese and the other Romance
tongues. The inexorable and universal nature of change was long ago expressed
by Heraclitus (ca. 535-475 bc), to whom the phrase πάντα ῥεῖ ‘everything flows’
is traditionally attributed, and this fact is personally experienced by all.
When proponents of linguistic diversity defend the use of native languages
and combat language endangerment in order to preserve mankind’s linguistic
heritage, presumably they are aware that language does and will change. The
relentlessness of change will cause one language to be replaced by another,
whether this takes the form of an alien tongue, as when the Celtic inhabitants
of Britain adopted the Teutonic tongue imported by Anglo-Saxon migrants,
or of drastic cumulative change over time, as in the case of Latin turning into
French or Old English ultimately becoming modern English. What is worth-
while preserving, or at least attempting to document, in addition to phonetic
diversity and the panoply of different types of morphological systems operative
in language is the language-specific repertoire of notions, meanings and con-
cepts which are lexically, grammatically or idiomatically expressed in any given
language. The danger to diversity is not change, but centripetal change in the
same direction in order to conform to one single global semiotic repertoire.
The insidious peril of semantic assimilation through the globalisation of
categories of meaning was a central theme in the writings of David Hubert
Greene, alias Dáithní ó Huaithne (1913-2008). In the context of the Irish lan-
guage, Greene explained what is meant by such semantic assimilation and
convergence.
Unfortunately, many people are under the impression that such mod-
ern terms as development, influence, interesting represent essential con-
cepts of human thought, and that no language can afford to be without
them; yet, although they are all of Latin origin, not one of them occurs in
Latin in anything resembling its modern meaning . . . But most European
languages, from Welsh to Russian, have accepted them either as loan-
words, or calques, as these equivalents of influence indicate: German
270 van Driem
Greene further illustrated this with a random but well-chosen Irish example.
The observations made by Greene alert us to the danger of the loss of linguis-
tic diversity without actual language death. Semantic assimilation of one lan-
guage to another will reduce overall linguistic diversity. In fact, this insidious
phenomenon exerts a far greater impact on diversity, yet remains less ame-
nable to observation by the semantically unsophisticated, the monoglot and
the linguistically naïve observer. This threat raises questions which present a
fundamental challenge to the science of linguistics.
Will the languages of the future be more viable if these languages merely
represent exact or nearly precise translation equivalents of each other? Will
different languages become increasingly superfluous as they are all increas-
ingly compelled by normative influences exerted in the process of globali-
sation, including automated translation, shared international discourse
and the bullying scourge that is called political correctness, to give expression
to the same conceptual repertoire and so to have the same semiotic content?
At the same time, another pressing question which, given the current state,
direction and biases of linguistics, presently defies answering is the following:
Do certain types of conceptual repertoire render a language more resilient
than another language or in some sense intrinsically valuable? Methodologies
should be developed to address this central research query.
Linguistic Topography And Language Survival 271
In addition to the factors which bear directly upon the language, its domains
of use and its semiotic content, there are sets of factors which determine
language viability that are related to the human speakers of the language.
Statistics and sophisticated methods of quantification appear ludicrous in
some extreme cases where a language has disappeared, as very many have,
because entire populations of speakers of these languages have been extermi-
nated by rival groups. Not only has genocide been perpetrated at times during
the colonisation of the Americas, Australia and the Andamans, but the whole-
sale slaughter of rival groups also features in the recorded history of the Old
World. Sometimes populations are wiped out not just by violent aggression
perpetrated by the rival group, but also equally by diseases introduced by an
incursive population. Often the genocide is incomplete, and then the small
contingent of survivors is afterwards easily linguistically assimilated so that
often no trace of the original language remains. Yet demographic change is not
invariably this drastic.
Sometimes demographicaly marginal groups hold on to a distinct ancestral
language alongside an overwhelming linguistic majority, such as the aston-
ishing resilience of Yiddish and Sorbian over time, whereas sometimes large
populations abandon their languages, as in the case of the many now extinct
Celtic languages of Europe and many languages of antiquity, such as Elamite,
272 van Driem
Hittite, Hattic and Hurrian. The phenotypical, cultural, ritual or religious dif-
ferences between populations of speakers also all play a role, as do the specific
dynamics of any process of acculturation, conquest or domination. In future, it
would be desireable to be able to quantify or meaningfully to characterise the
affects of each of such factors. Demographic factors affecting the number and
the fecundity of the population of speakers must be distinguished from those
affecting the socio-economic circumstances of the given language community.
Economy is a determinant of language vitality, but just how economic factors
affect language viability has yet to be fully understood.
Herodatus famously recorded the linguistic experiment ostensibly carried
out by the pharaoh Psammetichus I (664-610 bc) to discover the original lan-
guage of man. Children were brought up by themselves on an island or at some
remote locality, and, when they finally learnt to speak, they turned out to be
saying becos, the Phrygian word for ‘bread’. Yet was the man who supplied the
tiny and isolated experimental population of children with their daily allow-
ance of food not himself a Phrygian? The result of the legendary experiment
may have more to say about the socio-economic factors which determine the
direction of linguistic assimilation than about the original language of man-
kind. Languages are not all economically equally weighted. The languages that
pop up at you from your computer screen each time that a new operating sys-
tem of Apple is introduced reflect the economic weight in terms of consumer
potential of a highly select group of the world’s language communities. Certain
language communities which are an order of magnitude more populous in
terms of numbers of speakers, such as Bengali or Telugu, are not represented
in the same way as the languages of certain affluent but small language com-
munities in Europe, like Norwegian or Finnish, whose numbers of speakers
pale in comparison with the burgeoning populations speaking many of the
neglected languages.
The list of factors that determine the linguistic topography of a language
adduced above requires refinement and enhancement. The aim of this paper
has merely been to formulate the challenge to develop a programme of
research to study the linguistic topography of individual languages. Analysing
and charting the linguistic topography of a language should enable us to pro-
vide an insightful assessment of the viability of a language and a prediction
of its potential for survival. Although the proposed research programme has
been conceived within the Schleicher-Mufwene framework which envis-
ages individual languages as species in competition, the inclusive fitness of a
language can only be properly assessed and quantified when the anatomy of
the relationship between language as a semiotic organism and its human host
Linguistic Topography And Language Survival 273
References
Darwin, Charles Robert. 1871. The Descent of Man and Selection in Relation to Sex (two
volumes). London: John Murray.
Dawkins, Richard. 1982. The Extended Phenotype. Oxford: Oxford University Press.
Demetrius, L., and M. Ziehe. 1984. ‘The measurement of Darwinian fitness in human
populations’, Philosophical Transactions of the Royal Society, 222: 33-50.
Driem, George van. 2001. Languages of the Himalayas: An Ethnolinguistic Handbook of
the Greater Himalayan Region, containing an Introduction to the Symbiotic Theory
of Language (2 volumes). Leiden: Brill.
Driem, George van. 2004. Book Review: ‘Meaning and Universal Grammar: Theory and
Empirical Findings. 2 vols. Ed. by Cliff Goddard and Anna Wierzbicka. (Studies in
Language Companion Series.) Amsterdam: John Benjamins, 2002. ISBN 1588112640.
$80 (Hb.)’, Language, 80 (1): 163-165.
Driem, George van. 2015a. ‘Synoptic grammar of the Bumthang language, a language of
the central Bhutan highlands’, Himalayan Linguistics Archive, 6: 1-77.
Driem, George van. 2015b. ‘Symbiosism, Symbiomism and the perils of memetic man-
agement’, pp. 327-347 in Mark Post, Stephen Morey and Scott Delancey, eds.,
Language and Culture in Northeast India and Beyond. Canberra: Asia-Pacific
Linguistics.
Goropius Becanus, Johannes. 1569. Origines Antwerpianæ sive Cimmeriorium
Becceselana novem libros complexa. Antwerpen: Christophorus Plantinus.
Grace, George William. 1981. An Essay on Language. Columbia, South Carolina:
Hornbeam Press.
274 van Driem
Grace, George William. 1987. The Linguistic Construction of Reality. London: Croom
Helm.
Grafen, Alan. 2009. ‘Formalizing Darwinism and inclusive fitness theory’, Philosophical
Transactions of the Royal Society, 364: 3135-3141.
Greene, David Hubert (Dáithní ó Huaithne). 1966. The Irish Language – An Ghaeilge.
Dublin: Cultural Relations Committee of Ireland.
Karma Tshering of Gaselô and George van Driem [forthcoming]. The Grammar of
Dzongkha Revised and expanded, with a Guide to Roman Dzongkha and to
Phonological Dzongkha. Canberra: Asia-Pacific Linguistics.
Keller, E. F. 1994. ‘Fitness: reproductive ambiguities’, pp. 120-121 in E. F. Keller and
E. Lloyd, eds., Keywords in Evolutionary Biology. Cambridge, Massachusetts: Harvard
University Press.
Leeuwenhoek, Antoni van. 1693. Derde Vervolg der Brieven, Geſchreven aan de
Koninglijke Societeyt tot Londen. Delft: Henrik van Kroonevelt.
Maynard Smith, John. 2000. Evolution and the Theory of Games. Cambridge: Cambridge
University Press.
Maynard Smith, John. 2004. Animal Signals. Cambridge: Cambridge University Press.
Mufwene, Salikoko S. 2001. The Ecology of Language Evolution. Cambridge: Cambridge
University Press.
Mufwene, Salikoko S. 2005a. Créoles, écologie sociale, évolution linquistique: Cours don-
nés au College de France durant l’automne 2003. Paris: L’Harmattan.
Mufwene, Salikoko S. 2005b. ‘Language evolution: The population genetics way’,
pp. 30-52 in Günter Hauska, ed., Gene, Sprachen und ihre Evolution. Regensburg:
Universitätsverlag Regensburg.
Müller, Friedrich Max. 1870. ‘The science of language’, Nature, I: 256-259.
Pott, August Friedrich. 1848. ‘Die wissenschaftliche Gliederung der Sprachwissenschaft:
eine Skizze’, Jahrbücher der freien deutschen Akademie, 1: 185-190.
Schleicher, August. 1863. Die Darwinsche Theorie und die Sprachwissenschaft: Offenes
Sendschreiben an Herrn Dr. Ernst Häckel, a.o. Professor der Zoologie und Direktor des
zoologischen Museums an der Universität Jena. Weimar: Böhlau.
Chapter 19
The three case studies that I am going to present here are Inner Asia, main
Japanese islands, and Korea. In the first and third cases, I review the linguistic
diversity from approximately 500 AD to the present, and in the second case
from about 700 AD to the present, roughly covering 1,500-1,300 years.
Around 500 AD we can observe the existence of the following language fami-
lies and languages co-existing in Inner Asia:
– Ruan-ruan1
– Yeniseian: Xiong-nu, ancestral languages of Ket-Yugh, Kott-Assan-Arin, and
Pumpokol subgroups
Even given the fact that Iranian, Indo-Aryan, and Tocharian languages all
belong to the Indo-European family, and Mongolic and para-Mongolic lan-
guages are certainly related as well, while Tibeto-Burman and Chinese might
constitute together Sino-Tibetan phylum, we still have 7-8 unrelated language
families coexisting in this region.
However, in one thousand years at approximately 1500 AD the situation
changes quite drastically:
We can clearly see that not only the overall reduction of the overall number
of language families and languages has occurred, some of them simply dying
out like Tocharian, Ruan-ruan, Sogdian, Khotanese, Tumshuqese, and Tangut
or being pushed outside of this linguistic area like Yeniseian, but also for the
most part (with a notable exception of Turkic where the internal diversifica-
tion has increased), the internal diversification within the families has been
also greatly reduced.
At the first glance the situation in 2016 AD, 500+ years later does not seem
to be very different:
1 On Ruan-ruan as a language not linguistically related to any other Inner Asian linguistic
family see Vovin (2004, 2010).
And So Flows History 277
Certainly, certain languages, such as Jurchen, Manchu, and Chagatay have died
out as spoken languages. But the main change that occurred is not so much
quantitative, but qualitative. All Tungusic languages, even the most numer-
ous among them, Sibe with more than twenty thousand native speakers, are
severely endangered, and some of them like Solon and Kili are moribund.
The same is true of the most Pamir languages and the Oirat language. On the
Chinese side of the Inner Asia, even the languages with several million speak-
ers like New Uyghur and Tibetan start to be slowly endangered due to the large
influx of Chinese immigration into Xinjiang and Tibet, as well as due to the
PRC language policy, which is felt especially acutely in Xinjiang, where often
even the education in elementary schools is forcibly replaced from the Uyghur
language-based by the Chinese language-based.2 It is no inconceivable, there-
fore, that in another one hundred years all Chinese Inner Asia will become
Chinese-speaking. And unless there is a major cataclysm in China that would
prevent this situation to become a reality, there seems to be no force that could
either stop or contain this process. On the other hand, there is much brighter
perspective in the Western part of Inner Asia, where the new independent
states liberated from the Soviet yoke now seem to have escaped the danger of
linguistic Russianization for good.
– Japonic: Western Old Japanese, Eastern Old Japanese, Kyūshū Old Japanese,
ancestral language of Ryūkyūan.
– Ainu: Northern Honshū Ainu, Azuma Ainu, Kyūshū Ainu.
– Affiliation unknown: Hayato, Kumaso. Possibly some other languages which
we even do not know the names of.
– The language of the Okhotsk culture on Sakhalin and Hokkaidō, probably
some ancestral form of Nivx (Gilyak).
demonstrated that two of them: Japonic and Ainu, had considerable internal
diversity. One thousand years later, ca. 1700, the linguistic situation drastically
changes:
We can observe here the same picture as in the case study #1: not only the
overall reduction in the number of language families is obvious; the internal
diversity within the remaining language families also becomes less significant.
In 2016 AD, 300+ years later, the situation of the language loss is even more
drastic:
The Hachijō language is extremely moribund: it will die out completely in the
next ten to twenty years. Thus, all linguistic diversity of the past is reduced
essentially to a single language, which is, roughly speaking, a descendant of
only one of Japonic languages.
Ca. 500 AD we find three genetically unrelated linguistic families on the Korean
peninsula and in the adjacent territory of Southern Manchuria:
One thousand years later, ca. 1500 only one language family survives on the
Korean peninsula, represented by three languages:
However, similar to the case study #1, there is a very significant qualitative
change, as the Chejudo language is severely endangered, and Yukchin is mori-
bund in Kazakhstan and Russia, where it moved or was forcibly moved within
less than last two hundred years, and severely endangered in China. There are
no data available on its sociolinguistic status in Northern Korea, but given the
overall language policy in North Korea, which has as one of its major goals
the standardization of the existing non-standard varieties, it will be no wonder
if it is also at least severely endangered.
Thus, one can observe the same common developments in the last 1,500 to
1,300 years exhibited in all three case studies, namely:
1) Loss of the external diversity: the gradual reduction of the overall num-
ber of language families and languages.
2) Loss of internal diversity: the gradual reduction of the overall number of
languages within the same family.
3) Gradual language death resulting in the severe endangerment or mori-
bund nature of many surviving languages.
In this section I will briefly survey one (and the only one!) story of the success-
ful language revitalization and three cases of failure (among multiple ones).
The only one success story of the language revitalization is well known: it
is the case of Hebrew. But why did the language revitalization (which was not
yet known under this name when Hebrew had been successfully revitalized)
succeed in this single case when in all others it so far miserably failed? The
answers seem to me to be quite self-evident, although they are not frequently
spelled out. Eliezer Ben-Yehuda is often almost solely credited with this suc-
cess (Blum and Rabin 1982: 5-14). However, the enthusiasts of the language
revitalization existed and continue to appear in other cases, albeit without any
tangible success. In short, these reasons are the following:
280 Vovin
1) For more than two thousand years of its history Hebrew always existed as
a liturgical language
2) It was used as a means of communication between European and Middle
Eastern Jews. Therefore, it always existed as a second language: a privi-
lege that other languages that were subject to revitalization never had.
3) It was the only choice as the language for Israel: not Yiddish, not Arabic,
not German, Polish, or Russian.
Many efforts were made to revitalize Ainu in the twentieth and the twenty-
first centuries, all of them in Japan. No such efforts were made in Russia,
although the last two speakers of Sakhalin Ainu passed away around 1975 AD
(V. M. Alpatov, p.c. around 1998). Moreover, Sakhalin Ainu was not even listed
as one of the minority languages of the USSR, although there is documentary
evidence for its existence (Novikova and Savel’eva 1953: 128-133). In Japan, on
the other hand, there were schools and groups for the study of Ainu, not only
on Hokkaidō, but also in Tokyo and Osaka, where not only ethnic Ainu them-
selves, but also native Japanese took classes. Ainu was taught as a subject in
Waseda University, Chiba University, Hokkaidō University and Asahikawa
Pedagogical University. Five textbooks of Ainu were published: (Tamura 1979)
based on Saru dialect of Hidaka on Hokkaidō, (Nakagawa et al. 1994) com-
bining South-Western and North-Eastern Hokkaidō dialects, (Nakagawa and
Nakamoto 1997) based on Chitose dialect of Hokkaidō, (Izutsu and Tezuka
2006) based on Asahikawa dialect of Hokkaidō, (Nakagawa and Nakamoto
2007) based on the language of folklore, and even a textbook of the Sakhalin
Ainu published after the language became extinct (Murasaki 2009). There is
also one pedagogical grammar with a short reader (Satō 2008), and numerous
teaching aids, such as for example (Izutsu 2006). There were also yearly con-
tests of the Ainu language in the city of Furano on Hokkaidō. And yet, all these
heroic efforts were in vain, and the Ainu language has effectively died out at
the present point. Why?
One of the key answers can be glimpsed from the Ainu language contests.
The author of these lines was present at one of these contests in 2001 in Furano.
What I saw, however, was encouraging only at the first glance. Young men and
women went on the stage and spoke in a very fluent Ainu either individually, or
in groups. But that was the end of it: once they descended from the stage, they
immediately switched to Japanese. That was my first clue to the understanding
And So Flows History 281
of the main reason for the failure of the revitalization efforts: the Ainu lan-
guage could be the show-off case, but it had no communicative function. In
the next few days I played an experiment, trying to buy hot-dog in various con-
venience stores on Hokkaidō using Ainu. Naturally, I was given looks as if I
were an extraterrestrial from space. This convinced me finally that the Ainu
had no communicative function and value in the daily life on Hokkaidō. But
how can a language survive, let alone be revitalized, if it loses its main social
function: the communicative one?
The second observation concerns impossibility of using the Ainu language
even for quite elementary purposes outside of everyday conversation. In 2008
the Japanese National Diet finally recognized the Ainu people as the aboriginal
population of Japan, so in one of my publications (Vovin 2009), I have decided
to write a dedication to the Ainu people on this occasion. The English and
Japanese versions were composed within a minute:
– “To the Ainu people who were finally recognized as the aboriginals of the
Japanese archipelago”
– ついに日本列島の原住民として認められたアイヌ民族の方々へ
But when I proceeded to the Ainu version, I quickly realized that in spite of
many years of study of both modern and classical Ainu, I am completely unable
to do it, so asked my colleague Izutsu Katsunobu for help. He was able to come
out with something after several days, but warned me that it was extremely
unnatural and clumsy:
– tanepo sisam utarpa utar yaykopeker ayne aynu utari anakne yaunmosir
untar hoski okay utar ne ruwe eraman siri ne. nean aynu utari nispa utar
katkematutar ku-koonkami na
One of the striking aspects of the language loss in Okinawa that while the Ainu
language death was more gradual and took over two hundred years to com-
plete, the language loss in Okinawa developed with a catastrophic speed in
282 Vovin
the post-World War II years, and really accelerated in the last ten or fifteen
years. In 2001 a number of local taxi drivers still spoke Okinawan in Naha, but
in 2012 I have not seen a single one: it was strictly Japanese. It might seem
that Okinawan would be in more advantageous position as compared to
Ainu, because in contrast to the latter that had no writing system before mid-
twentieth century the former had its own written tradition dating back to the
early sixteenth century. Okinawa has been de facto an independent kingdom
before 1609 AD, and although it was conquered by Satsuma clan of Kyūshū in
that year, it officially became a prefecture of Japan only in 1879 AD. Satsuma
overlords were more interested in Okinawa’s commerce and did not interfere
much with its political and educational system. This interference became sig-
nificant after Okinawa officially became a Japanese prefecture, and especially
in the years immediately preceding and during World War II, when any educa-
tion in Okinawan was essentially banned. There is however, two sides of the
medal in this story, as a number of elderly persons in Okinawa told me that it
was the initiative of their parents to educate them entirely in Japanese, because
this ensured that they have possibilities for better jobs both in Okinawa and
in the main Japanese islands. Consequently, the infamous hōgen fuda (方言
札) ‘dialect tags’ that were put on a child if s/he spoke in his/her school in
Okinawan as a punishment may not have been the sole responsibility of the
Imperial government.
In any case, the destruction of the education in the native tongue seems
to be the primary cause (but not the most important) of the language death
in Okinawa. As in the case with Ainu, efforts have been made to teach
Okinawans their former mother tongue, (Kawabatake 1982), (Nishioka and
Nakahara 2000), (Karimata n. d.). Recently even a pilot edition of a textbook
of Okinawan designated for Americans and Brazilians of Okinawan ancestry
has been published (Sakihara et al. n. d.). There is also a dictionary that could
be considered to a certain extent pedagogical (Uchima and Nohara 2006). But
overall, there are much less efforts in Okinawa directed at the revitalization
and Okinawan language education than at the Ainu revitalization. In spite
of the fact that it is still possible to find native Okinawan speakers (most of
them are at least in their sixties), the language is certainly moribund. The
second reason after the extermination of the education in Okinawan is that
it stopped to be the language of communication within the family, it is con-
siderably more important. This second reason is certainly directly connected
to the first: the first generation of Okinawans educated entirely in Japanese
continued to use Japanese with their children even within the family circle.
This created a snowball effect: with each following generation the competence
And So Flows History 283
in Okinawan diminished until for those who are in their forties nowadays
and for the following generations it essentially became a foreign language,
which could be either learned or could be ignored altogether, because there
is no compulsory education in Okinawan either in schools or in universities.
Even at the University of Ryūkyū, the Okinawan language is an elective in
the general education program. In short, Okinawan, like Ainu has also lost its
communicative function.
the State of Hawai’i.3 Some elderly native speakers can still be found at the
Hawai’ian homesteads throughout the islands. But there is hardly anyone
younger than sixty or seventy there, who is not a complete English-Hawai’ian
bilingual in the best case scenario. The Hawai’ian language is, of course, spo-
ken at the University of Hawai’i campuses. But without any doubt it is not
native, it is HSL – Hawai’ian as a Second Language. Contrary to the cases of
Okinawan and Ainu, there are even two show-case schools: on the island
of O’ahu, and the other on the island of Hawai’i (Big Island), where almost all
instruction is conducted in Hawai’ian. But these are only two schools in the
whole State of Hawai’i and their main purpose is to impress dignitaries from
the USA mainland and linguists from abroad. Hawai’ian is, of course, taught
at the University of Hawai’i, primarily at Mānoa and Hilo campuses. But it is
not required, and it is not a part of any University or school curriculum. The
result: the Hawai’ian language has lost completely its communicative function
in the State of Hawai’i. One has no greater chance of buying a hotdog in one
of the Honolulu convenience stores speaking in Hawai’ian than s/he has in
Hokkaidō using Ainu. So, who is reading these state documents in Hawai’ian
except fishermen and agriculturalists from the island of Ni’ihau? Probably
no one.
There are numerous textbooks: (Elbert 1970), (Kahananui and Anthony
1970), (Hopkins 1992), (Wight 1992), (Cleeland 2006) and other teaching aids
for Hawai’ian: (Judd et al. 1945), (Judd n. d.), (Hitchcock 1968), (Alexander
1968), (Pukui et al. 1975) and there was no dearth in attempts to revitalize it,
probably more than for Ainu or Okinawan combined. And yet the results are
largely the same: the ultimate failure, albeit the failure in a noble undertaking.
And the one common thing that unites all three failures is the complete loss of
the communicative function.
Before proceeding to the conclusion, I would like to attract the attention of
my readers to one case (which is by no means unique, so it serves here just as
an example), where such a catastrophic failure did not occur in spite of the rel-
atively small number of speakers, and the language death is not to be expected
any time soon.
3 There is an additional problem with this island, which is privately owned by the Robertson
family from Kawai’i. There is a wonderful law inherited, apparently, from feudal times: once a
resident decides to leave, s/he can come back only for visiting relatives. This is, undoubtedly,
the smartest possible strategy the Robertson family could have come up with regarding the
language preservation of Hawai’ian.
And So Flows History 285
Before the advent of the Meiji government in 1868, Western Toyama region of
Japan belonged to the Kaga province, with which it was bound by strong cul-
tural and linguistic ties. Western Toyama dialect is just a variety of Edo period
(1600-1867 AD) Kaga speech, although the former is separated from the latter by
a mountain range. Most of the former Kaga territory is nowadays in Ishikawa
prefecture, and linguistically Western Toyama is just a variety of Ishikawa dia-
lect, and has much less in common with the rest of the Toyama prefecture.
The total population of Western Toyama (former Western Tonami county)
hardly exceeds 50,000 people – almost a drop in the bucket in the ocean of the
total Japanese population of 126 million. In this situation one would expect
that in the modern world Western Toyama would be swallowed linguistically
if not by Standard Japanese then by the rest of the Toyama prefecture. Yet it did
not happen, and is not even on the horizon. The question is: why? The condi-
tions for the complete linguistic destruction seem to be much more favorable
here in the geopolitical sense than in the three cases of failure that I have sur-
veyed above.
The answer again lies in the communicative function. The language of the
school in Western Toyama is Standard Japanese. So is the language of any work
place. But the language of the family is not: it is still the local language. And
one uses the local language, and not Standard Japanese, when one goes to a
local fishmonger, tatami maker, meat seller, or sake dealer on the same street
or in the same quarter. Not only the inner circle of the family, but all neighbors
still use the same language, and this insures its survival, and will continue to
insure it for uncountable generations as long as they stick to the same practice.
Conclusion
References
Alexander, William D. 1968 (1891). A Short Synopsis of the Most Essential Points in
Hawaiian Grammar. Rutland and Tokyo: Charles E. Tuttle.
Blum, Shoshana and Chaim Rabin. 1982. Modern Hebrew. Jerusalem: Tarbut.
Cleeland, Hōkūlani. 2006. ‘Ōlelo ‘Ōiwi. Hawaiian Language Fundamentals. Honolulu:
Kamehameha Publishing.
Elbert, Samuel H. 1970. Spoken Hawaiian. Honolulu: University of Hawaii Press.
Hitchcock, H. R. 1968. An English-Hawaiian Dictionary. Rutland and Tokyo: Charles E.
Tuttle.
Hopkins, Alberta Pualani. 1992. Ka Lei Ha’aheo. Beginning Hawaiian. Honolulu:
University of Hawaii Press.
Izutsu, Katsunobu (ed.) 2006. I/YAY-PAKASNU. Ainu go no gakushū to kyōiku no tame
ni. Asahikawa: Hokkaidō kyōiku daigaku Asahikawa kō.
Izutsu, Katsunobu and Yoritaka Tezuka. 2006. Kiso Ainu go. Sapporo: Sapporo dō
shoten.
Judd, Henry P. n. d. The Hawaiian Language and Hawaiian-English Dictionary: a
Complete Grammar. Honolulu: Hawaiian Service, Inc.
Judd, Henry P., Mary Kawena Pukui, and John F. G. Stokes. 1945. Introduction to the
Hawaiian Language. English-Hawaiian Dictionary. Hawaiian-English Dictionary.
Honolulu: Tongg Publishing Company.
Kahananui, Dorothy M. and Alberta P. Anthony. 1970. Let us Speak Hawaiian. Honolulu:
University of Hawaii Press.
Kapstein, Matthew T. 2006. The Tibetans. Malden and Oxford: Blackwell.
Karimata, Shigehisa, n. d. Wakai hito no tame no Uchināguchi nyūmon. Naha: Ryūkyū
daigaku.
Kawabatake, Yasuo. 1982. Okinawa hōgen nyūmon. Naha: Okinawa kyōiku shuppan.
Murasaki, Kyōko. 2010. Karafuto Ainu go. Nyūmon kaiwa. Kushiro: Midori kujira sha.
Novikova, Klavdiia A. and Vera N. Savel’eva. 1953. K voprosu o iazykakh korennykh
narodnostei Sakhalina (On the problem of the aboriginal languages of Sakhalin). In:
Iazyki i istoriia narodnostei krainego severa SSSR. Uchenye zapiski Leningradskogo
Universiteta. Fakultet narodov severa (Languages and history of the peoples of the
Extreme North of the USSR. Communications of the Leningrad State University.
Faculty of the peoples of the North), #157.2: 84-133.
And So Flows History 287
Nakagawa, Hiroshi et al. 1994. Akor itak. Ainu go tekisuto 1. Sapporo: Hokkaidō utari
kyōkai.
Nakagawa, Hiroshi and Mutsuko Nakamoto. 1997. Ekusupuresu Ainu go. Tokyo:
Hakusuisha.
Nakagawa, Hiroshi and Mutsuko Nakamoto. 2007. Kamuy yukar de Ainu go wo manabu.
Tokyo: Hakusuisha.
Nishioka, Satoshi and Jō Nakahara. 2000. Okinawa go no nyūmon. Tokyo: Hakusuisha.
Pukui, Mary Kawena, Samuel H. Elbert and Esther T. Mookini. 1975. The Pocket
Hawaiian Dictionary. Honolulu: University of Hawaii Press.
Sakihara, Masashi, Shigehisa Karimata, Moriyo Shimabukuro and Lucila Etsuko Gibo.
n. d. Rikka, Uchinaa-nkai! n. p.
Satō, Tomomi. 2008. Ainu go bunpō kiso. Tokyo: Daigaku shorin.
Tamura, Suzuko. 1979. Aynu itak. Ainu go nyūmon. Tokyo: Waseda University.
Uchima, Chokujin and Mitsuyoshi Nohara. 2006. Okinawa go jiten. Tokyo: Kenkyūsha.
Vovin, Alexander. 2004. Some notes on Old Turkic 12-year animal cycle. Central Asiatic
Journal, 48.1: 118-132.
Vovin, Alexander. 2009. 『万葉集と風土記に見られる不思議な言葉と上代日本列島
に お け る ア イ ヌ 語 の 分 布 』 。 京 都 : 国 際 日 本 文 化 研 究 セ ン タ ー 。 2009 。
(Strange Words in the Man’yōshū and the Fudoki and the Distribution of the Ainu
Language in the Japanese Islands in Prehistory). Kyoto: The International Research
Center for the Japanese Studies, 57 pp.
Vovin, Alexander. 2010. Once again on the Ruan-ruan language. In: Ötükenden
Istanbul’a. Türkc̨enin 1290 Yɩlɩ, (From Ötüken to Istanbul. 1290 years of the Turkish lan-
guage). Istanbul, ed. by Mehmet Ölmez et al., pp. 27-36.
Wight, Kahikāhealani. 1992. Learn Hawaiian at Home. Honolulu: Bess Press.
Index
awana 26 türtü 30
balas 24 xans 28
bittir 28, 32, 36 xömpek, xoppug 31
biyik (Balk.) 33 xuru 27
burçak 33 züdür 29
cıgıra, zıǧıra 31 Karachi see Domari
cumarık 31 Karaim 115, 116, 118
çeget (Balk.) 26 Karaim language 51
çuçxur 27 Crimean dialect 52
çum 29 Halych-Volyn’ dialect 52
didin 31 Karaite(s) 54-55
dorbun 26 Karak (Kaya, Mimana) 278
duppur 27 Karaulov, Nikolaj 19
fadawan (Balk.) 28 Karimata, Shigehisa 282
gabu 28 Kartvelian language family 80, 226, 229-233,
gebelo, gelbo 30 246, 255
gıbı (Kar.) 28 Kashub 54
gılıw 29, 36 Kawabatake, Yasuo 282
gǝmǝx (Balk.) 28 Kazakh 115, 276
göbelek 30 Kereit 275
gubu (Bal.) 28 Kerek 2
gumulcuk 28 Ket 2, 6, 12
ırxı 27 Ket-Yugh 276
kaya 27, 36 Khakas 117
kıldı 30 Khalkha Mongolian 110
kırdık 28, 36 Khanty 2, 5
küllüm 27 Khitan 275
kündeş 31 Khotanese 275, 276
maka 33 Khövsgöl region 166
mant 29 Khwarezmian 275
miyik (Balk.) 33 Khwarshi 60-64, 67
mıga 29, 30, 36 Kıpçak 17, 18, 21, 33
mırzı 28 Kili 276, 277
mursa 29, 33 Kolkhian (language) 246
murtxu (Kar.) 30 Koguryöan 278
nanık 30 Korcha 86
nazı (Balk.) 30 Korean 278, 279
nızı (Kar.) 30 Koreanic 278, 279
pura 30 Koryak 2
qımıja (Kar.) 24 Kosova 86 see Üsküp
şinji (Balk.) 28, 31 Kott-Assan-Arin 276
şkeyli, şkildi 29 Krymchak 115
taban 34 Kubedinova, Lenara 52
taqüzük 28, 36 Kumaso 277
tıgır, (Balk.) tıkır 30 Kumyk 80, 118
tılpıw 27, 34 baka 33
tobuk (Balk.) 34 Kurds 129, 133
töppe 27 Kuril Ainu 278
294 Index
Kyiv 51-52 linguistic
Kyrgyz 116 heritage 151, 152, 155, 163
Kyüshü 277 relativity 268-271
Kyüshü Ainu 277 rights 247
Kyüshü Old Japanese 277 topography 258-273
literacy 80
Lamarck, Jean-Baptiste Pierre Antoine de emergent 228-237
Monet, Chevalier de 258 intimate 237-239
language literary standard 246, 248
awareness 190-193, 195, 196, 204 Lithuania 51
background 122 loanword 48-49
causes 122 Lom 85
change 122 Lomavren 84, 85
diversity 122 loyalty 55
endangerment (scale) 7, 106, 126, 128, Lubavitcher 82
133, 258-273
ideology 191 Macedonia 82, 83, 84, 85n6, 86
loss 127 Macedonian 83, 86
maintenance 244-246, 255 Malthus, Thomas Robert 258
sage 191, 195, 196 Manchu 118, 277
shift 151, 156, 157 Manchu-Tungusic 110, 113, 118
standardization 233, 240 Mansi 2, 5
value 128 Manyas 91
language and dialect 232-233 Mari
language and identity 226, 229, 232-233 pursa, pırsa 33
language contact(s) 24, 32-33, 36 Marr, Nikolaj. 20, 22, 230-231
eastern Anatolia 209-211, 223 Maruipol 53
effects of 211, 223-24 Maupertuis, Pierre-Louis Moreau de 258,
language families 123 268
language learning 122, 123 Mbaliote 87
language retention 4, 7, 8 Medes 129
language shift 2, 3, 7, 8 Meglen 83
language survival 258-273 Meglenoromanian 80, 82, 83
Latin 126, 127 Mesrop Mashtots 131
imperial 128 Mészáros, Julius von 91
Laz 92, 93, 94, 95n1, 95n2, 95n3, 246, 248, military command 125
249, 253, 255, 256 Mingrelia 92, 93
Laz Cultural Organisation 96 Megrelian(s) 92, 93, 94, 96, 244-257,
Laz Institute 95n2 246, 255
Lazika Publication Collective 95n3 Megrelian grammars 248
Lesghians 93 Megrelian identity 247, 256
Lausanne, Treaty of 82 Megrelian Wikipedia 254, 257
Lezgian 194 γoγo 29
čumal 29 šker- 29
dur 29 Megrelo-Chan language 246
turt 30 migration 244, 252
Lezgic 92 forced migration 246, 247
lingua franca 125, 127, 128, 131 missionary 126 see religion
linguicide 81 Moldova 86
Index 295
USA 82 word
Uslar, Peter 229-230 grammatical 42
USSR 83n4 phonological 42
Uyghur 277 word class 48-49
Uzbek 136, 276 written vs oral communication 246,
Üsküp 85 252-254