Professional Documents
Culture Documents
Indo European Grammar
Indo European Grammar
MODERN
INDO-EUROPEAN
Second Edition
Language and Culture
Writing System and Phonology
Morphology
Syntax
Texts and Dictionary
Etymology
DNGHU Carlos Quiles
with Fernando Lpez-Menchero
Version 4.15 (10/2009)
Copyright 2007-2009 Asociacin Cultural Dnghu.
2006-2009 Carlos Quiles Casas.
With contributions by Fernando Lpez-Menchero Dez, M.Phil. in IE studies 2007-2009.
Printed in the European Union.
Published by the Indo-European Language Association.
Edition Managed by
Imcrea Diseo Editorial
at <http://www.imcrea.com/>.
All content on this book is licensed under a Dual Licence Creative Commons Attribution/Share-
Alike License 3.0 (Unported), see <http://creativecommons.org/licenses/by-sa/3.0/>, and GNU
Free Documentation License (unversioned, with no invariant sections, front-cover texts, or back-
cover texts), see <http://www.gnu.org/copyleft/fdl.html>.
All images are licensed under the same Dual Licence, most of them coming from Dnghus website
<http://dnghu.org/> or from the Indo-European Wiki <http://indo-european.eu/>, a portal on
Modern Indo-European, which in turn may have copied content from the English Wikipedia and other
online and collaborative sources.
While every precaution has been taken in the preparation of this book, the publisher and authors
assume no responsibility for errors or omissions, or for damages resulting from the use of the
information contained herein.
For corrections, translations and newer versions of this free (e)book, please visit
<http://dnghu.org/Indo-European grammar/>.
Table of Contents
TABLE OF CONTENTS ............................................................................................................... 3
PREFACE ......................................................................................................................................... 9
PREFACE TO THE FIRST EDITION.............................................................................................. 11
WHATS NEW IN THIS EDITION ................................................................................................. 15
ACKNOWLEDGEMENTS .............................................................................................................. 17
CONVENTIONS USED IN THIS BOOK........................................................................................... 18
1. INTRODUCTION .................................................................................................................... 23
1.1. THE INDO-EUROPEAN LANGUAGE FAMILY .........................................................................23
1.2. TRADITIONAL VIEWS ......................................................................................................... 26
1.3. THE THEORY OF THE THREE STAGES ................................................................................. 29
1.4. THE PROTO-INDO-EUROPEAN URHEIMAT OR HOMELAND ............................................... 37
1.5. OTHER LINGUISTIC AND ARCHAEOLOGICAL THEORIES ....................................................... 45
1.6. RELATIONSHIP TO OTHER LANGUAGES ............................................................................. 48
1.7. INDO-EUROPEAN DIALECTS .............................................................................................. 50
Schleichers Fable: From PIE to Modern English .......................................................... 50
1.7.1. Northern Indo-European dialects ........................................................................... 53
1.7.2. Southern Indo-European Dialects .......................................................................... 78
1.7.3. Other Indo-European Dialects of Europe ............................................................. 88
1.7.4. Anatolian Languages ............................................................................................. 98
1.8. MODERN INDO-EUROPEAN .............................................................................................. 102
2. LETTERS AND SOUNDS ................................................................................................... 109
2.1 THE ALPHABETS OF MODERN INDO-EUROPEAN ................................................................ 109
A. Vowels and Vocalic Allophones.................................................................................. 109
B. Consonants and Consonantal Sounds ....................................................................... 110
2.2. CLASSIFICATION OF SOUNDS ............................................................................................ 112
2.3. SOUNDS OF THE LETTERS ................................................................................................. 117
2.4. SYLLABLES ....................................................................................................................... 119
2.5. QUANTITY ........................................................................................................................ 121
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
2.6. ACCENT ...........................................................................................................................122
2.7. VOWEL CHANGE ............................................................................................................... 123
2.8. CONSONANT CHANGE ......................................................................................................124
2.9. PECULIARITIES OF ORTHOGRAPHY .................................................................................. 126
2.10. KINDRED FORMS ........................................................................................................... 128
3. WORDS AND THEIR FORMS ......................................................................................... 129
3.1. THE PARTS OF SPEECH .................................................................................................... 129
3.2. INFLECTION .................................................................................................................... 130
3.3. ROOT, STEM AND BASE .................................................................................................... 131
3.4. GENDER ........................................................................................................................... 133
3.5. GENERAL RULES OF GENDER ........................................................................................... 135
3.6. VOWEL GRADE ................................................................................................................. 137
3.7. WORD FORMATION ..........................................................................................................139
3.8. COMPOUND WORDS .........................................................................................................142
3.9. NAMES OF PERSONS .........................................................................................................143
4. NOUNS ..................................................................................................................................... 145
4.1. DECLENSION OF NOUNS ................................................................................................... 145
4.2. FIRST DECLENSION ......................................................................................................... 148
4.2.1. First Declension Paradigm................................................................................... 148
4.2.2. First Declension in Examples .............................................................................. 149
4.2.3. The Plural in the First Declension ....................................................................... 150
4.3. SECOND DECLENSION ...................................................................................................... 151
4.3.1. Second Declension Paradigm................................................................................ 151
4.3.2. Second Declension in Examples ........................................................................... 152
4.5.3. The Plural in the Second Declension .................................................................... 153
4.4. THIRD DECLENSION ......................................................................................................... 154
4.4.1. Third Declension Paradigm .................................................................................. 154
4.4.2. In i, u ....................................................................................................................... 155
4.4.3. In Diphthong .......................................................................................................... 156
4.4.4. The Plural in the Third Declension ....................................................................... 157
4.5. FOURTH DECLENSION ...................................................................................................... 159
Table of Contents
5
4.5.1. Fourth Declension Paradigm ................................................................................ 159
4.5.2. In Occlusive, m, l ................................................................................................... 160
4.5.3. In r, n, s .................................................................................................................. 161
4.5.4. The Plural in the Fourth Declension .................................................................... 162
4.6. VARIABLE NOUNS ............................................................................................................ 162
4.7. VOCALISM BEFORE THE DECLENSION ............................................................................... 163
4.8. VOCALISM IN THE PLURAL ............................................................................................... 165
4.9. ACCENT IN DECLENSION .................................................................................................. 166
5. ADJECTIVES ......................................................................................................................... 169
5.1. INFLECTION OF ADJECTIVES ............................................................................................. 169
5.2. THE MOTION ................................................................................................................... 169
5.3. ADJECTIVE SPECIALIZATION ............................................................................................. 171
5.4. COMPARISON OF ADJECTIVES ........................................................................................... 171
5.5. NUMERALS ...................................................................................................................... 173
5.5.1. Classification of Numerals .................................................................................... 173
5.5.2. Cardinals and Ordinals ........................................................................................ 173
5.5.3. Declension of Cardinals and Ordinals ................................................................. 176
5.5.4. Distributives........................................................................................................... 178
5.5.5. Numeral Adverbs .................................................................................................. 179
5.5.6. Multiplicatives ....................................................................................................... 179
6. PRONOUNS ........................................................................................................................... 181
6.1. ABOUT THE PRONOUNS .................................................................................................... 181
6.2. PERSONAL PRONOUNS ..................................................................................................... 181
6.3. REFLEXIVE PRONOUNS .................................................................................................... 183
6.4. POSSESSIVE PRONOUNS ................................................................................................... 184
6.5. ANAPHORIC PRONOUNS ................................................................................................... 185
6.6. DEMONSTRATIVE PRONOUNS .......................................................................................... 185
6.7. INTERROGATIVE AND INDEFINITE PRONOUNS .................................................................. 187
6.7.1. Introduction ........................................................................................................... 187
6.7.2. Compounds ............................................................................................................ 189
6.7.3. Correlatives ............................................................................................................ 190
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
6.8. RELATIVE PRONOUNS ...................................................................................................... 191
6.9. OTHER PRONOUNS ......................................................................................................... 192
7. VERBS ...................................................................................................................................... 193
7.1. INTRODUCTION .................................................................................................................193
7.1.1. Voice, Mood, Tense, Person, Number ....................................................................193
7.1.2. Noun and Adjective Forms .................................................................................... 195
7.1.3. Voices ....................................................................................................................... 197
7.1.4. Moods ..................................................................................................................... 198
7.1.5. Tenses of the Finite Verb ....................................................................................... 199
7.2. FORMS OF THE VERB ....................................................................................................... 200
7.2.1. The Verbal Stems ................................................................................................... 200
7.2.2. Verb-Endings ........................................................................................................ 201
7.2.3. The Thematic Vowel ............................................................................................. 205
7.2.4. Verb Creation ........................................................................................................ 206
7.2.5. Separable Verbs .................................................................................................... 209
7.3. THE CONJUGATIONS ....................................................................................................... 209
7.4. THE FOUR STEMS .............................................................................................................214
7.5. MOOD STEMS .................................................................................................................. 233
7.6. THE VOICE ...................................................................................................................... 236
7.7. NOUN AND ADJECTIVE FORMS ........................................................................................ 237
7.8. CONJUGATED EXAMPLES ................................................................................................ 242
7.8.1. Thematic Verbs ...................................................................................................... 242
7.8.2. Athematic Inflection ............................................................................................. 248
7.8.3. Other Common PIE Stems ................................................................................... 254
7.9. The Verbal Accent .................................................................................................... 256
8. PARTICLES ............................................................................................................................ 257
8.1. PARTICLES ...................................................................................................................... 257
8.2. ADVERBS ........................................................................................................................ 258
8.3. DERIVATION OF ADVERBS ............................................................................................... 259
8.4. PREPOSITIONS ................................................................................................................ 260
8.5. CONJUNCTIONS ............................................................................................................... 262
Table of Contents
7
9. PROTO-INDO-EUROPEAN SYNTAX ........................................................................... 265
9.1. THE SENTENCE ............................................................................................................... 265
9.1.1. Kinds of Sentences ................................................................................................. 266
9.1.2. Nominal Sentence .................................................................................................. 267
9.1.3. Verbal Sentence .................................................................................................... 269
9.2. MORPHOSYNTAX ............................................................................................................. 272
9.2.1. Verbal Morphosyntax ........................................................................................... 272
9.2.2. Nominal Morphosyntax ....................................................................................... 281
9.3. SENTENCE MODIFIERS ................................................................................................... 294
9.3.1. Intonation Patterns .............................................................................................. 294
9.3.2. Sentence Delimiting Particles ............................................................................. 296
9.4. VERBAL MODIFIERS ........................................................................................................ 297
9.4.1. Declarative Sentences ........................................................................................... 297
9.4.2. Interrogative Sentences ....................................................................................... 298
9.4.3. Negative Sentences .............................................................................................. 300
9.5. NOMINAL MODIFIERS ...................................................................................................... 301
9.5.1. Adjective and Genitive Constructions .................................................................. 301
9.5.2. Compounds. .......................................................................................................... 302
9.5.3. Determiners in Nominal Phrases. ....................................................................... 305
9.5.4. Apposition ............................................................................................................. 308
9.6. MODIFIED FORMS OF PIE SIMPLE SENTENCES................................................................ 309
9.6.1. Coordination. ........................................................................................................ 309
9.6.2. Complementation. ................................................................................................. 312
9.6.3. Subordinate Clauses. ............................................................................................ 314
9.7. SYNTACTIC CATEGORIES .................................................................................................. 319
9.7.1. Particles as Syntactic Means of Expression ........................................................ 319
9.7.2. Marked Order in Sentences. ................................................................................ 323
9.7.3. Topicalization with Reference to Emphasis. ...................................................... 324
APPENDIX I: INDO-EUROPEAN IN USE ....................................................................... 325
I.1. TEXTS TRANSLATED INTO MODERN INDO-EUROPEAN .......................................................325
I.1.1. Pater Nos (Lords Prayer) ......................................................................................325
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
I.1.2. Slwje Marij (Hail Mary) ................................................................................... 326
I.1.3. Krddhmi (Nicene Creed) .................................................................................... 326
I.1.4. Nouds sns (Parable of the Prodigal Son)....................................................... 329
I.1.5. Newos Bheidhos (New Testament) Jhans, 1, 1-14 ........................................ 333
I.2 KOMTLOQJOM (CONVERSATION) ...................................................................................... 336
I.3 LATE PIE LEXICON ........................................................................................................... 339
I.3.1. English - PIE Dictionary ....................................................................................... 339
I.3.2. PIE - English Dictionary ....................................................................................... 375
APPENDIX II: PROTO-INDO-EUROPEAN PHONOLOGY ........................................ 451
II.1. DORSALS: THE PALATOVELAR QUESTION ........................................................................ 451
II.2. PHONETIC RECONSTRUCTION ........................................................................................ 456
II.2.1. Proto-Indo-European Sound Laws..................................................................... 456
II.2.2. Consonants ........................................................................................................... 465
II.1.3. Vowels and syllabic consonants .......................................................................... 468
II.3. THE LARYNGEAL THEORY ............................................................................................... 471
Laryngeals in morphology............................................................................................. 481
Pronunciation.................................................................................................................. 483
APPENDIX III: ETYMOLOGICAL NOTES....................................................................... 485
III.1. PROTO-INDO-EUROPEAN VOCABULARY ........................................................................ 485
III.2. INDEFINITE, DEMONSTRATIVE, AND PERSONAL PRONOUNS .......................................... 805
III.2.1. Indefinite Pronouns ............................................................................................ 805
III.2.2. Demonstrative Pronouns ................................................................................... 806
III.2.3. Personal Pronouns ............................................................................................. 806
III.3. WORD FORMATION: COMMON PIE LENGTHENINGS AND SUFFIXES ............................... 811
BIBLIOGRAPHY AND FURTHER READING ................................................................ 821
PREFACE
This second edition of A Grammar of Modern Indo-European is a renewed effort to
systematize the reconstructed phonology and morphology of Europes Indo-European.
Modern Indo-European is common to most Europeans, and not only to some of them,
as Latin, Germanic, or Slavic. Unlike Lingua Ignota, Solresol, Volapk, Esperanto,
Quenya, Klingon, Lojban and the thousand invented languages which are imagined by
individuals daily, PIE dialects are natural, i.e. they evolved from an older language
Proto-Indo-European, of which we have extensive knowledge , and were spoken by
prehistoric communities at some time roughly between 2500 and 2000 BC, having
themselves evolved into different dialects already by 2000 BC.
Proto-Indo-European and its dialects have been reconstructed in the past two centuries
(more or less successfully) by hundreds of linguists, having obtained a rough
phonological, morphological, and syntactical system, equivalent to what Jews had of Old
Hebrew before reconstructing a system for its modern use in Israel. Instead of some
inscriptions and oral transmitted tales for the language to be revived, we have a complete
reconstructed grammatical system, as well as hundreds of living languages to be used as
examples to revive a common Modern Indo-European.
Some known philologists, university professors, experts in Classical Languages, still
consider the Proto-Indo-European language reconstruction an invention; also, Spanish
Indo-Europeanist Bernab, a brilliant Spanish IE professor, has left its work on IE
studies to dedicate himself to something more serious. Francisco Villar, professor of
Greek and Latin at the University of Salamanca, deems a complete reconstruction of PIE
impossible; his opinion is not rare, since he supports the glottalic theory and the
Armenian Homeland hypothesis (against the view of the majority), and supports the use
of Latin instead of English within the EU. The work of Elst, Talageri and others
defending the Indigenous Indo-Aryan viewpoint by N. Kazanas, and their support of an
unreconstructible and hypothetical PIE nearest to Vedic Sanskrit opens still more the
gap between the mainstream reconstruction and minority views supported by political or
personal opinions. Also, among convinced Indo-Europeanists, there seems to be no
possible consensus between the different schools as to whether Common PIE
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
distinguished between and (as Gk., Lat. or Cel.) or if those vowels were all initial , as
in the other attested dialects (Villar), or if the Preterites were only one tense (as Latin
praeteritum) with different formations, or if there were actually an Aorist and a Perfect.
Furthermore, Jos Antonio Pascual, a member of the Royal Spanish Academy (RAE),
considers that it is not necessary to be a great sociologist to know that 500 million
people wont agree to adopt Modern Indo-European in the EU (Spa. journal El Mundo,
8
th
April 2007). Of course not, as they wont agree on any possible question not even on
using English, which we use in fact , and still the national and EUs Institutions keep
working, adopting decisions by majorities, not awaiting consensus for any question. And
it was probably not necessary to be a great sociologist a hundred years ago to see e.g. that
the revival of Hebrew under a modern language system was a utopia (an impossible,
unserious invention then), and that Esperanto, the easy and neutral IAL, was going
to succeed by their first so-called World Congress in 1905. Such learned opinions are
only that, opinions, just as if Hebrew and Semitic experts had been questioned a hundred
years ago about a possible revival of Biblical Hebrew in a hypothetic new Land of Israel.
Whether MIEs success is more or less probable and why is not really important for our
current work, but hypotheses dealt with by sociology, anthropology, political science,
economics, psychology, etc. or usually just by personal opinions with no strict rational
and reasonable basis. It remains unclear whether the project will be accepted by the
different existing social movements, such as Pan-Latinism, Pan-Americanism, Pan-
Sanskritism, Pan-Arabism, Pan-Iranism, Pan-Slavism, Pan-Hispanism, Francophonie,
Anglospherism, Atlanticism, and the hundred different pan-nationalist ideas, as well as
the different groups supporting anti-globalization, anti-capitalism, anti-communism, etc.
What we do know now is that the idea of reviving Europes Indo-European as a modern
language for Europe and international organizations is rational, that it is not something
new, that it doesnt mean a revolution as the use of Spanglish, Syndarin or Interlingua
nor an involution as regionalism, nationalism, or the come back to French, German
or Latin predominance , but merely one of the many different ways in which the
European Union linguistic policy could evolve, and maybe one way to unite different
peoples from different cultures, languages and religions (from the Americas to East Asia)
for the sake of stable means of communication. Just that tiny possibility is enough for us
Preface
11
to lose some years trying to give our best making the main Proto-Indo-European
dialects as usable and as known as possible.
PREFACE TO THE FIRST EDITION
According to Dutch sociologist Abram de Swaan, every language in the world fits into
one of four categories according to the ways it enters into (what he calls) the global
language system.
Central: About a hundred languages in the world belong here, widely used and
comprising about 95% of humankind.
Supercentral: Each of these serves to connect speakers of central languages. There
are only twelve supercentral languages, and they are Arabic, Chinese, English,
French, German, Hindi, Japanese, Malay, Portuguese, Russian, Spanish and
Swahili.
Hypercentral: The lone hypercentral language at present is English. It not only
connects central languages (which is why it is on the previous level) but serves to
connect supercentral languages as well. Both Spanish and Russian are supercentral
languages used by speakers of many languages, but when a Spaniard and a Russian
want to communicate, they will usually do it in English.
Peripheral: All the thousands of other languages on the globe occupy a peripheral
position because they are hardly or not at all used to connect any other languages.
In other words, they are mostly not perceived as useful in a multilingual situation
and therefore not worth anyones effort to learn.
De Swaan points out that the admission of new member states to the European Union
brings with it the addition of more languages, making the polyglot identity of the EU ever
more unwieldy and expensive. On the other hand, it is clearly politically impossible to
settle on a single language for all the EUs institutions. It has proved easier for the EU to
agree on a common currency than a common language.
Of the EUs current languages, at least 14 are what we might call a robust language,
whose speakers are hardly likely to surrender its rights. Five of them (English, French,
German, Portuguese and Spanish) are supercentral languages that are already widely
used in international communication, and the rest are all central.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
In the ongoing activity of the EUs institutions, there are inevitably shortcuts taken -
English, French and German are widely used as working languages for informal
discussions. But at the formal level all the EUs official languages (i.e. the language of
each member state) are declared equal.
Using all these languages is very expensive and highly inefficient. There are now 23
official languages: Bulgarian, Czech, Danish, Dutch, English, Estonian, Finnish, French,
German, Greek, Hungarian, Irish Gaelic, Italian, Latvian, Lithuanian, Maltese, Polish,
Portuguese, Romanian, Slovak, Slovene, Spanish and Swedish, and three semiofficial (?):
Catalan, Basque and Galician. This means that all official documents must be translated
into all the members recognized languages, and representatives of each member state
have a right to expect a speech in their language to be interpreted. And each member
state has the right to hear ongoing proceedings interpreted into its own language.
Since each of the twenty one languages needs to be interpreted/translated into all the
rest of the twenty, 23 x 22 (minus one, because a language doesnt need to be translated
into itself) comes to a total of 506 combinations (not taking on accound the semiofficial
languages). So interpreters/translators have to be found for ALL combinations.
In the old Common Market days the costs of using the official languages Dutch,
English, French, and German could be borne, and interpreters and translators could be
readily found. But as each new member is admitted, the costs and practical difficulties
are rapidly becoming intolerably burdensome.
The crucial point here is that each time a new language is added, the total number of
combinations isnt additive but multiplies: 506 + one language is not 507 but 552, i.e. 24 x 23,
since every language has to be translated/interpreted into all the others (except itself).
It is not hard to see that the celebration of linguistic diversity in the EU only lightly
disguises the logistical nightmare that is developing. The EU is now preparing for more
languages to come: Icelandic and Norwegian might be added in the future, with the
incorporation of these two countries to the EU, as well as Albanian, Macedonian,
Serbian, Bosnian and Croatian (the three formerly known as Serbo-Croatian, but further
differentiated after the Yugoslavian wars) if they are admitted to the EU as expected; and
many other regional languages, following the example of Irish Gaelic, and the three semi-
official Spanish languages: Alsatian, Breton, Corsican, Welsh, Luxemburgish and Sami
Preface
13
are likely candidates to follow, as well as Scottish Gaelic, Occitan, Low Saxon, Venetian,
Piedmontese, Ligurian, Emilian, Sardinian, Neapolitan, Sicilian, Asturian, Aragonese,
Frisian, Kashubian, Romany, Rusin, and many others, depending on the political
pressure their speakers and cultural communities can put on EU institutions. It will
probably not be long before Turkish, and with it Kurdish (and possibly Armenian,
Aramaic and Georgian), or maybe Ukrainian, Russian and Belarusian, are other official
languages, not to talk about the eternal candidates languages, Norwegian (in at least two
of its language systems, Bokml and Nynorsk), Icelandic, Romansh, Monegasque
(Monaco) and Emilian-Romagnolo (San Marino), and this could bring the number of EU
languages over 40. The number of possible combinations are at best above 1000, which
doesnt seem within the reach of any organization, no matter how well-meaning.
Many EU administrators feel that to a great extent this diversity can be canceled out by
ever-increasing reliance on the computer translation that is already in heavy use. It is
certainly true that if we couldnt count on computers to do a lot of the translation heavy
lifting, even the most idealistic administrator would never even dream of saddling an
organization with an enterprise that would quickly absorb a major part of its finances
and energy. But no machine has yet been invented or probably ever will be that is able to
produce a translation without, at the very least, a final editing by a human translator or
interpreter.
The rapidly increasing profusion of languages in the EU is quickly becoming intolerably
clumsy and prohibitively expensive. And this doesnt even count the additional expense
caused by printing in the Greek alphabet and soon in the Cyrillic (Bulgarian and
Serbian). Everyone agrees that all languages must have their place in the sun and their
diversity celebrated. But common sense suggests that the EU is going to be forced to
settle on a very small number of working languages, perhaps only one, and the linguistic
future of the EU has become the subject of intense debate.
Only in public numbers, the EU official translation/interpretation costs amount to
more than 1230 M, and it comes to more than 13% of todays administrative
expenditure of the EU institutions. There are also indirect costs of linguistic programmes
aimed at promoting the learning of three or more languages since the Year of Languages
(2001), which also means hundreds of millions of Euros, which havent been counted in
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
the EUs budget as linguistic expenditure, but are usually included in budget sections
such as Cohesion or Citizenship. It is hard to imagine the huge amount of money (real or
potential) lost by EU citizens and companies each day because of communication
problems, not only because they cant speak a third partys language, but because they
wont speak it, even if they can.
Preserving the strict equality is the EUs lifeblood, and it is a very disturbing thought
that the strongest candidate for a one-language EU is the one with an established
dominance in the world, English, which is actually only spoken by a minority within
Europe. Latin and Artificial languages (as Esperanto, Ido or Interlingua) have been
proposed as alternatives, but neither the first, because it is only related to romance
languages, nor the second, because they are (too) artificial (invented by one person or a
small group at best), solve the linguistic theoretical problems, not to talk about the
practical ones.
The Indo-European language that we present in this work, on the contrary, faces not
only the addressed theoretical problems - mainly related to cultural heritage and
sociopolitical proud - but brings also a practical solution for the European Union,
without which there can be no real integration. European nations are not prepared to
give up some of their powers to a greater political entity, unless they dont have to give up
some fundamental rights. Among them, the linguistic ones have proven harder to deal
with than it initially expected, as they are raise very strong national or regional feelings.
Indo-European is already the grandmother of the majority of Europeans. The first
language of more than 97% of EU citizens is Indo-European, and the rest can generally
speak at least one of them as second language. Adopting Indo-European as the main
official language for the EU will not mean giving up linguistic rights, but enhancing
them, as every other official language will have then the same status under their common
ancestor; it wont mean losing the own culture for the sake of unity, but recovering it
altogether for the same purpose; and, above all, it will not mean choosing a lingua franca
to communicate with foreigners within an international organization, but accepting a
National Language to communicate with other nationals within the same country.
NOTE. The above information is mainly copied (literally, adjusted or modified) from two of Mr.
William Z. Shetter Language Miniatures, which he published in his (now dead) website.
Preface
15
WHATS NEW IN THIS EDITION
This is A Grammar of Modern Indo-European, Second Edition, with Modern Indo-
European Language Grammar in Version 4, still adjusting some important linguistic
questions, and lots of minor mistakes, thanks to the contributions of experts and readers.
NOTE. A version number (N) is given to full revisions of the grammar, and each minor
correction published must be given a different number to be later identified, usually ranging from
N.01 to N.99. This book includes a full correction of version 3, following Pre-Version 4, which
means the correction was finished, and it its therefore 4.xx.
Europes Indo-European version 4 continues Modern Indo-European version 3
(first printed edition, since June 2007), and this in turn version 2, which began in March
2007, changing most features of the old Europaio/Sindhueuropaiom concept of
version 1 (Europaio: A Brief Grammar of the European Language, 2005-2006).
1. Apart from the unified Modern Indo-European, based on Europes Indo-European
(also residual or North-Western Indo-European, or Proto-European), this grammar
makes reference to other coeval PIE early dialects, especially Proto-Greek, Proto-Indo-
Iranian (or Proto-Aryan) and Common Anatolian.
2. One of the main changes of this version is the adoption of a writing system with a
clear phonological distinction between i, u and their semivocalic allophones j, w. The
artificial distinction of i/j and u/w in PIE roots and derivatives, hold in versions 1-3, was
untenable in the long term, as it was a labile decision, open to future changes. With the
traditional written differentiation we get a greater degree of stability.
3. Emphasis is on the old Latin-only alphabet, but attention is paid to Greek and
Cyrillic writing systems. Stubs of possible Armenian, Arabo-Persian and Devanagari
(Abugida) systems are also included. The objective is not to define them completely (as
with the Latin alphabet), but merely to show other possible writing systems for Modern
Indo-European, Modern Aryan, and Modern Hellenic languages.
4. The traditional distinction in writings of the controversial palatovelar phonemes has
been extensively discussed and rejected. Whether satemization appeared already as a
dialectal phonological trend in Late PIE, or were just similar individual dialectal
innovations restricted to some phonetic environments (k- before some sounds, as with
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Latin c- before -e and -i), is not important. Reasons for not including the palatovelars in
MIE writing system are 1) that, although possible, their existence is not sufficiently
proven (see Appendix II.2); 2) that their writing because of tradition or even etymology
is not justified, as this would mean a projective writing (i.e., like writing Lat. casa, but
Lat. entum, because the k-sound before -e and -i evolves differently in Romance).
5. The historically alternating Oblique cases Dative, Locative, Instrumental and
Ablative, are shown on a declension-by-declension (and even pronoun-by-pronoun)
basis, as Late PIE shows in some declensions a simpler reconstructible paradigm (for
some more archaic, for others an innovation) while others show almost the same Late
PIE pattern of four differentiated oblique case-endings. The 8 cases traditionally
reconstructed are used and its differentiation recommended in MIE.
5. The so-called Augment in -, attested almost only in Greek, Indo-Iranian and
Armenian, is sometimes left due to tradition of Indo-European studies, although recent
research has shown that it was neither obligatory, nor general in the earliest PIE dialects.
It is believed today that it was just a prefix that had a great success in the southern
dialects, just like per- (<PIE per-) in Latin, or ga- (<PIE ko-) in Germanic.
6. The syntactical framework of Late PIE has been dealt with extensively by some
authors, but, as the material hasnt still been summed up and corrected within
mainstream Indo-European linguistics Indo-Europeanists usually prefer the
phonological or morphological reconstruction , we use literal paragraphs from possibly
the most thorough work available on PIE syntax, Winfred P. Lehmanns Proto-Indo-
European Syntax (1974), adding comments and corrections made since its publication by
other scholars
7. The whole section on Morphosyntax is taken from Michael Meier-Brggers Indo-
European Linguistics (2003).
8. Appendices I and III were written by Fernando Lpez-Menchero and published
2007-2009. The rest of this book has been written thanks to his countless corrections
and additions in those years.
Preface
17
ACKNOWLEDGEMENTS
To Mayte, my best friend, for her support and encouragement before I worked on this
project, even before she knew what was it all about. For the money and time spent in
lunchtimes, books, websites, servers and material. For her excitement when talking
about the changes that Proto-Indo-European revival could bring to the worlds future.
Thank you.
To Fernando Lpez-Menchero, Civil Engineer and Classic Languages Philologist,
expert in Indo-European linguistics, for his invaluable help, revisions, corrections and
innumerable contributions. Without his unending knowledge, this grammar wouldnt
have shown a correct Proto-Indo-European reconstruction.
To Neil (UK), Michael (PL) Olivier (FR), Louis (FR), Alberto (IT), Diego (ES), Michal
(SK); to Mithridates (CA), Simona (SL), feigur (IS), Manuel (ES); and to all other
members and friends of the Dnghu Association, for their collaboration, contribution, and
promotion of the grammar and revival project.
To Prof. Dr. Luis Fernando de la Macorra, expert in Interregional Economics, and Prof.
Dr. Antonio Muoz, Vice-Dean of Academic Affairs in the Faculty of Library Science, for
their support in the University Competition and afterwards.
To the University of Extremadura and the Cabinet of Young Initiative, for their prize in
the Entrepreneurial Competition in Imagination Society (2006).
To the Department of Classical Antiquity of the UEx, for their unconditional support to
the project.
To the Regional Government of Extremadura and its public institutions, for their open
support to the Proto-Indo-European language revival.
To Manuel Romero from Imcrea.com Diseo Editorial, for his help with the design and
editorial management of the first printed edition.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
CONVENTIONS USED IN THIS BOOK
1. Modern Indo-European (MIE), Eurpjm or European are used only to refer to
the European language, i.e. to the modern language system based on the reconstructed
North-West or Europes Indo-European (EIE), also Old European proto-language.
2. The roots of the reconstructed PIE language are basic morphemes carrying a lexical
meaning. By addition of suffixes, they form stems, and by addition of desinences, these
form grammatically inflected words (nouns or verbs).
NOTE. PIE reconstructed roots are subject to ablaut, and except for a very few cases, such
ultimate roots are fully characterized by its constituent consonants, while the vowel may alternate.
PIE roots as a rule have a single syllabic core, and by ablaut may either be monosyllabic or
unsyllabic. PIE roots may be of the following form (where K is a voiceless stop, G an unaspirated
and G
h
an aspirated stop, R a semivowel (r, l, m, n, w, j) and H a laryngeal (or s). According to
Meillet, impossible PIE combinations are voiceless/aspirated (as in *teub
h
or *b
h
eut), as well as
voiced/voiceless (as in *ged or *deg). The following table depicts the general opinion:
stops - K- G- G
h
-
- [HR]e[RH] K[R]e[RH] G[R]e[RH] G
h
[R]e[RH]
-K [HR]e[RH]K - G[R]e[RH]K G
h
[R]e[RH]K
-G [HR]e[RH]G K[R]e[RH]G - G
h
[R]e[RH]G
-G
h
[HR]e[RH]G
h
K[R]e[RH]G
h
G[R]e[RH]G
h
G
h
[R]e[RH]G
h
*
*This combination appears e.g. in b
h
eud
h
-, awake, and b
h
eid
h
-, obey, believe.
A root has at least one consonant, for some at least two (e.g. PIH *hek- vs. EIE ek-, quick,
which is the root for MIE adj. ks). Depending on the interpretation of laryngeals, some roots
seem to have an inherent vowel a or o; as, EIE ar- (vs. PIH *h2ar-), fit, EIE ong
w
- (vs. PIH
*h3eng
w
-) anoint, EIE ak- (vs. PIH *h2ek-) keen.
By root extension, a basic CeC (with C being any consonant) pattern may be extended to CeC-
C, and an s-mobile may extend it to s-CeC.
The total number of consonant, sonant and laryngeal elements that appear in an ordinary
syllable are three i.e., as the triliteral Semitic pattern. Those which have less than three are called
Concave verbs (cf. PIH *Hes-, *Hei-, *g
w
em-); those extended are called Convex verbs (cf. Lat.
plang, sparg, frang, etc., which, apart from the extension in -g, contain a laryngeal); for more
on this, vide infra on MIE Conjugations.
Preface
19
3. In this book PIE roots are usually written with laryngeals. Therefore, we only assume
certainty in the non-laryngeal nature of early PIE dialects, from ca. 2500 BC on. Whether
Late PIE lost them all sooner (ca. 3500 BC?) or later (ca. 2500 BC?), etymological roots
which include laryngeals will often be labelled as PIH, or just as (general) PIE, while
specific Late PIE vocabulary will be shown with an undetermined laryngeal output *.
NOTE. Common PIE (or PIH) roots are reconstructed by most modern Indo-Europeanists with
laryngeals; so e.g. different vowel outputs of early PIE dialects (like North-West IE or Proto-
Greek) are explained through the phonological effects of old aspirated phonemes on adjacent
vowels. See Appendix II.3 for more on this question.
Some linguists still follow the old non-laryngeal PIE concept (see Walde-Pokornys lexica), while
many only conceive a PIE with laryngeals. However, it is not logical to assume that, whereas in
Proto-Anatolian laryngeals were lost or evolved, in Late PIE they were the same (*h1, *h2, *h3, or
any other scheme) as in their common ancestor, Middle PIE. Therefore, some scholars have
adapted the Late PIE reconstruction to a partially laryngeal or non-laryngeal language (see
Adrados, Nikolayev, etc.), coeval with the partially laryngeal PAn, thus supposing a similar
laryngeal loss in both Middle PIE dialects, usually implying a quicker loss in Late PIE, due to the
conservation of laryngeals in Anatolian, and their complete disappearance in Late PIE dialects.
Some still reconstruct for Late PIE an uncertain laryngeal (or vowel) *H or *, in some phonetic
environments, otherwise difficult to explain, prior to its full loss in early PIE dialects.
4. Proto-Indo-European vowel apophony or Ablaut is indeed normal in MIE, but
dialectal Ablaut must be corrected when loan-translated. Examples of these Lat.
confessus (cf. Lat. fassus sum), from EIE b
h
-; Lat. facilis/difficilis, from EIE d
h
-; Lat.
sali/insili/insult, etc.
NOTE. Such Ablaut is linked to languages with musical accent, as Latin. In Italic, the tone was
always on the first syllable; Latin reorganized this system, and after Roman grammarians
paenultima rule, Classic Latin accent felt on the second to last syllable if long, on the third to last
syllable, or antepaenultima, if short (hence Lat. pudcus but mdicus), thus triggering off different
inner vocalic timbres or Ablauts. Other Italic dialects, as Oscan or Umbrian, didnt suffered such
apophony; cf. Osc. anterstata , Lat. interstitae; Umb. antakres, Lat. integris; Umb. procanurent,
Lat. procinuerint, etc. Germanic also knew such tone variations. More on Latin phonotactic
development at <http://www.cunyphonologyforum.net/SYLLPAPERS/Senhandoutnew.pdf>.
5. In Romance languages, Theme is used instead of Stem. Therefore, Theme Vowel and
Thematic refer to the Stem endings, usually to the e/o endings. In the Indo-European
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
languages, Thematic roots are those roots that have a theme vowel; a vowel sound that
is always present between the root of the word and the attached inflections. Athematic
roots lack a theme vowel, and attach their inflections directly to the root itself.
NOTE. The distinction between thematic and athematic roots is especially apparent in the Greek
verb; they fall into two classes that are marked by quite different personal endings. Thematic verbs
are also called - (-) verbs in Greek; athematic verbs are - (-mi) verbs, after the first person
singular present tense ending that each of them uses. The entire conjugation seems to differ quite
markedly between the two sets of verbs, but the differences are really the result of the thematic
vowel reacting with the verb endings. In Greek, athematic verbs are a closed class of inherited
forms from the parent IE language. Marked contrasts between thematic and athematic forms also
appear in Lithuanian, Sanskrit, and Old Church Slavonic. In Latin, almost all verbs are thematic; a
handful of surviving athematic forms exist, but they are considered irregular verbs.
The thematic and athematic distinction also applies to nouns; many of the old IE languages
distinguish between vowel stems and consonant stems in the declension of nouns. In Latin, the
first, second, fourth, and fifth declensions are vowel stems characterized by a, o, u and e,
respectively; the third declension contains both consonant stems and i stems, whose declensions
came to closely resemble one another in Latin. Greek, Sanskrit, and other older IE languages also
distinguish between vowel and consonant stems, as did Old English.
6. PIE *d+t, *t+t, *dh+t MIE st; PIE *d+d, *t+d, *dh+d MIE sd; PIE *d+dh,
*t+dh, *dh+dh MIE sdh; because of the common intermediate phases found in Proto-
Greek, cf. Gk. st, sth (as pistis, oisqa), and Europes Indo-European, cf. Lat. est, come,
and O.H.G. examples. For an earlier stage of this phonetic output, compare O.Ind.
seh<*sazdhi, sit!, and not *satthi (cf. O.Ind. deh, Av. dazdi).
NOTE. It has been proposed an old PIE TTTsT (where T = dental stop), i.e. that the cluster of
two dental stops had a dental fricative s inserted between them. It is based on some findings in
Hittite, where cluster tst is spelled as z (pronounced as ts), as in PIH *h1ed-ti, he eats *h1etsti
Hitt. ezzi. Confirmation from early intermediate and common (Late PIE) *-st- are found e.g. in
O.Ind. mastis, measure, from *med-tis, or Av. -hasta-, from *sed-ts. This evolution was
probably overshadowed by other Aryan developments, see Appendix II.
7. The Feminine Late PIE *-j/-, old Abstract Collective PIH *-ih
2
, gives EIE -j a/-.
While both were still interchangeable in the common North-West IE (as the different
dialectal outputs show), we prefer to use the ending -j a for feminines, and - for neuters;
as, smghsl, thousand (neuter), but trja, three (fem.).
Preface
21
The following abbreviations apply in this book:
IE : Indo-European
IE II : Middle PIE or PIH
PIH : Proto-Indo-Hittite
IE III : Late PIE
PIE : Proto-Indo-European
EIE : Europes Indo-European
MIE : Modern Indo-European
PGk : Proto-Greek
Gk. : (Ancient) Greek
Phryg. : Phrygian
Thr. : Thracian
Dac. : Dacian
Ven. : Venetic
Lus. : Lusitanian
A.Mac. : Ancient Macedonian
Illy. : Illyrian
Alb. : Albanian
Ita. : Proto-Italic
Osc. : Oscan
Umb. : Umbrian
Lat. : Latin
O.Lat. : Archaic Latin
V.Lat. : Vulgar Latin
L.Lat. : Late Latin
Med.Lat. : Mediaeval Latin
Mod.Lat. : Modern Latin
O.Fr. : Old French
Prov : Provenzal
Gl.-Pt. : Galician-Portuguese
Gal. : Galician
Pt. : Portuguese
Cat. : Catalan
Fr. : French
It. : Italian
Spa. : Spanish
Rom. : Romanian
PAn : Proto-Anatolian
CA : Common Anatolian
Hitt. : Hittite
Luw. : Luwian
Lyc. : Lycian
Pal. : Palaic
Lyd. : Lydian
PII : Proto-Indo-Iranian
Ind. : Proto-Indo-Aryan
O.Ind. : Old Indian
Skr. : Sanskrit
Hind. : Hindustani
Hi. : Hindi
Ur. : Urdu
Ira. : Proto-Iranian
Av. : Avestan
O.Pers. : Old Persian
Pers. : Persian
Kur. : Kurdish
Oss. : Ossetian
Kam. : Kamviri
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
PGmc. : Pre-Proto-Germanic
Gmc. : Proto-Germanic
Goth. : Gothic
Frank. : Frankish
Sca. : Scandinavian (N. Gmc.)
O.N. : Old Norse
O.Ice. : Old Icelandic
O.S. : Old Swedish
Nor. : Norwegian
Swe. : Swedish
Da. : Danish
Ice. : Icelandic
Fae. : Faeroese
W.Gmc. : West Germanic
O.E. : Old English (W.Saxon,
Mercian)
O.Fris. : Old Frisian
O.H.G. : Old High German
M.L.G. : Middle Low German
M.H.G. : Middle High German
M.Du. : Middle Dutch
Eng : English
Ger. : German
L.Ger. : Low German
Fris. : Frisian
Du. : Dutch
Yidd. : Yiddish
BSl. : Balto-Slavic
Bal. : Proto-Baltic
O.Lith. : Old Lithuanian
O.Pruss. : Old Prussian
Lith. : Lithuanian
Ltv. : Latvian
Sla. : Proto-Slavic
O.C.S. : Old Church Slavonic
O.Russ. : Old Russian
O.Pol. : Old Polish
Russ. : Russian
Pol. : Polish
Cz. : Czech
Slo. : Slovenian
Slk. : Slovak
Ukr. : Ukrainian
Bel. : Belarusian
Bul. : Bulgarian
Sr.-Cr. : Serbo-Croatian
Cel. : Proto-Celtic
Gaul. : Gaulish
O.Ir. : Old Irish
Sco. : Scottish Gaelic
Ir. : Irish Gaelic
Bret. : Breton
Cor. : Cornish
O.Welsh : Old Welsh
1. INTRODUCTION
1.1. THE INDO-EUROPEAN LANGUAGE FAMILY
1.1.1. The Indo-European
languages are a family of
several hundred modern
languages and dialects,
including most of the
major languages of
Europe, as well as many in
Asia. Contemporary
languages in this family
include English, German,
French, Spanish, Portuguese,
Hindustani (i.e., Hindi and Urdu among other modern dialects), Persian and Russian. It
is the largest family of languages in the world today, being spoken by approximately half
the worlds population as mother tongue. Furthermore, the majority of the other half
speaks at least one of them as second language.
1.1.2. Romans didnt perceive similarities between Latin and Celtic dialects, but they
found obvious correspondences with Greek. After Grammarian Sextus Pompeius Festus:
Such findings are not striking, though, as Rome was believed to have been originally
funded by Trojan hero Aeneas and, consequently, Latin was derived from Old Greek.
1.1.3. Florentine merchant Filippo Sassetti travelled to the Indian subcontinent, and
was among the first European observers to study the ancient Indian language, Sanskrit.
Writing in 1585, he noted some word similarities between Sanskrit and Italian, e.g.
deva/dio, God, sarpa/serpe, snake, sapta/sette, seven, ashta/otto, eight,
nava/nove, nine. This observation is today credited to have foreshadowed the later
discovery of the Indo-European language family.
Suppum antiqui dicebant, quem nunc supinum dicimus ex Graeco, videlicet pro
adspiratione ponentes <s> litteram, ut idem dicunt, et nos silvas; item sex, et
septem.
In dark, countries with a majority of Indo-European
speakers; in light color, countries with Indo-
European-speaking minorities.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
1.1.4. The first proposal of the possibility of a common origin for some of these
languages came from Dutch linguist and scholar Marcus Zuerius van Boxhorn in 1647.
He discovered the similarities among Indo-European languages, and supposed the
existence of a primitive common language which he called Scythian. He included in his
hypothesis Dutch, Greek, Latin, Persian, and German, adding later Slavic, Celtic and
Baltic languages. He excluded languages such as Hebrew from his hypothesis. However,
the suggestions of van Boxhorn did not become widely known and did not stimulate
further research.
1.1.5. On 1686, German linguist Andreas Jger published De Lingua Vetustissima
Europae, where he identified an remote language, possibly spreading from the Caucasus,
from which Latin, Greek, Slavic, Scythian (i.e. Persian) and Celtic (or Celto-Germanic)
were derived, namely Scytho-Celtic.
1.1.6. The hypothesis re-appeared in 1786 when Sir William Jones first lectured on
similarities between four of the oldest languages known in his time: Latin, Greek,
Sanskrit and Persian:
The Sanskrit language, whatever be its antiquity, is of a wonderful structure; more
perfect than the Greek, more copious than the Latin, and more exquisitely refined than
either, yet bearing to both of them a stronger affinity, both in the roots of verbs and
the forms of grammar, than could possibly have been produced by accident; so strong
indeed, that no philologer could examine them all three, without believing them to
have sprung from some common source, which, perhaps, no longer exists: there is a
similar reason, though not quite so forcible, for supposing that both the Gothic and the
Celtic, though blended with a very different idiom, had the same origin with the
Sanskrit; and the old Persian might be added to the same family
1.1.7. Danish Scholar Rasmus Rask was the first to point out the connection between
Old Norwegian and Gothic on the one hand, and Lithuanian, Slavonic, Greek and Latin
on the other. Systematic comparison of these and other old languages conducted by the
young German linguist Franz Bopp supported the theory, and his Comparative
Grammar, appearing between 1833 and 1852, counts as the starting-point of Indo-
European studies as an academic discipline.
NOTE. The term Indo-European itself now current in English literature, was coined in 1813 by
the British scholar Sir Thomas Young, although at that time there was no consensus as to the
naming of the recently discovered language family. Among the names suggested were indo-
1. Introduction
25
germanique (C. Malte-Brun, 1810), Indoeuropean (Th. Young, 1813), japetisk (Rasmus C. Rask,
1815), indisch-teutsch (F. Schmitthenner, 1826), sanskritisch (Wilhelm von Humboldt, 1827),
indokeltisch (A. F. Pott, 1840), arioeuropeo (G. I. Ascoli, 1854), Aryan (F. M. Mller, 1861),
aryaque (H. Chave, 1867), etc.
In English, Indo-German was used by J. C. Prichard in 1826 although he preferred Indo-
European. In French, use of indo-europen was established by A. Pictet (1836). In German
literature, Indo-Europisch was used by Franz Bopp since 1835, while the term Indo-Germanisch
had already been introduced by Julius von Klapproth in 1823, intending to include the
northernmost and the southernmost of the familys branches, as it were as an abbreviation of the
full listing of involved languages that had been common in earlier literature, opening the doors to
ensuing fruitless discussions whether it should not be Indo-Celtic, or even Tocharo-Celtic.
1.1.8. There are certain common linguistic ancestors of modern IE languages, and some
of them are well-attested dead languages (or language systems), such as Latin for
modern Romance languages French, Spanish, Portuguese, Italian, Romanian or
Catalan , Sanskrit for some modern Indo-Aryan languages, or Greek for Modern Greek.
Furthermore, there are some still older IE languages, from which these old formal
languages were derived and later systematized. They are, following the above examples,
Archaic or Old Latin, Archaic or Vedic Sanskrit and Archaic or Old Greek, attested in
older compositions or inscriptions, or inferred through the study of oral traditions and
even foreign texts, like the Indo-Aryan superstrate of the Mitanni.
And there are also some old related dialects, which help us reconstruct proto-
languages, such as Osco-Umbrian for an older Proto-Italic (and with Proto-Celtic, Proto-
Germanic and Proto-Balto-Slavic for Europes Indo-European), Indo-Aryan dialects for
Proto-Indo-Aryan (and with Proto-Iranian for Proto-Indo-Iranian) or Mycenaean for an
older Proto-Greek.
NOTE. Mallory and Adams (2006) argue, about (Late) Proto-Indo-European: How real are our
reconstructions? This question has divided linguists on philosophical grounds.
o There are those who argue that we are not really engaged in reconstructing a past language but
rather creating abstract formulas that describe the systematic relationship between sounds in the
daughter languages.
o Others argue that our reconstructions are vague approximations of the proto-language; they can
never be exact because the proto-language itself should have had different dialects (yet we
reconstruct only single proto-forms) and our reconstructions are not set to any specific time.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
o Finally, there are those who have expressed some statistical confidence in the method of
reconstruction. Robert Hall, for example, claimed that when examining a test control case,
reconstructing proto-Romance from the Romance languages (and obviously knowing beforehand
what its ancestor, Latin, looked like), he could reconstruct the phonology at 95% confidence, and
the grammar at 80%. Obviously, with the much greater time depth of Proto-Indo-European, we
might well wonder how much our confidence is likely to decrease.
Most historical linguists today would probably argue that [laryngeal PIE] reconstruction results
in approximations. A time traveller, armed with this book and seeking to make him- or herself
understood would probably engender frequent moments of puzzlement, not a little laughter, but
occasional instances of lucidity.
1.2. TRADITIONAL VIEWS
1.2.1. In the beginnings of the Indo-European or Indo-Germanic studies using the
comparative grammar, the Indo-European proto-language was reconstructed as a
unitary language. For Rask, Bopp and other Indo-European scholars, it was a search for
the Indo-European. Such a language was supposedly spoken in a certain region between
Europe and Asia and at one point in
time between ten thousand and
four thousand years ago, depending
on the individual theories , and it
spread thereafter and evolved into
different languages which in turn
had different dialects.
1.2.2. The Stammbaumtheorie or
Genealogical Tree Theory states that
languages split up in other
languages, each of them in turn split
up in others, and so on, like the
branches of a tree. For example, a
well known old theory about Indo-
European is that, from the PIE
language, two main groups of
Modern tree
diagram of the IE
languages by Eric
Hamp (1990),
Mallory & Adams
(2007).
1. Introduction
27
dialects known as Centum and Satem separated so called because of their
pronunciation of PIE *kmtm, hundred, in Latin and Avestan. From these groups
others split up, as Centum Proto-Germanic, Proto-Italic or Proto-Celtic, and Satem
Proto-Balto-Slavic, Proto-Indo-Iranian.
NOTE. The Centum and Satem isogloss is one of the oldest known phonological differences of IE
languages, and is still used by many to classify them in two main dialectal groups postulating the
existence of proto-Centum and a proto-Satem , disregarding their relevant morphological and
syntactical differences. The isogloss is based on a simple vocabulary comparison; as, from PIE
*kmtm (possibly earlier *dkmtm, from *dekm, ten), Satem: O.Ind. atm, Av. satm, Lith.
imtas, O.C.S. sto, or Centum: Gk. , Lat. centum, Goth. hund, O.Ir. cet, etc.
It remains the most used model for understanding the Indo-European language
reconstruction, since it was proposed by A. Schleicher (Compendium, 1866). The
problem with its simplicity is that the branching of the different groups is portrayed as a
series of clean breaks with no connection between branches after they have split, as if
each dialectal group marched away from the rest. Such sharp splits are possible, but
assuming that all splits within Proto-Indo-European were like this is not very plausible,
and any linguist surveying the current Indo-European languages would note dialectal
variations running through some but not all areas, often linking adjacent groups who
may belong to different languages (Mallory & Adams, 2006).
1.2.3. The Wellentheorie or Waves
Theory, of J. Schmidt, states that
one language is created from
another by the spread of
innovations, the way water waves
spread when a stone hits the water
surface. The lines that define the
extension of the innovations are
called isoglosses. The convergence
of different isoglosses over a
common territory signals the existence of a new language or dialect. Where isoglosses
from different languages coincide, transition zones are formed.
Wave model of some of the interrelationships
of the IE languages, Mallory & Adams (2007).
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
NOTE. These old models for our understanding of language reconstructions are based on the
hypothesis that there was one common and static Proto-Indo-European language, and that all
features of modern Indo-European languages can be explained in such unitary schemes, by
classifying them either as innovations or as archaisms of one old, rigid proto-language. After
Mallory and Adams (The Oxford Introduction to Proto-Indo-European and the Proto-Indo-
European World, 2006), their criteria of inclusion, why we are looking at any particular one, and
not another one, are no more solid than those that define family trees. The key element here is
what linguistic features actually help determine for us whether two languages are more related or
less related to one another. A decision in this area can be extraordinary difficult because we must
be able to distinguish between features that may have been present throughout the entire Indo-
European world (Indoeuropeia has been employed to describe this concept) and have dropped out
in some but not others against those features that are innovations in only some of the different
groups. The historical linguist is principally looking for shared innovations, i.e. are there traces of
corresponding developments between two or more language groups that would indicate that they
shared a common line of development different from other language groups? Only by finding
shared innovations can one feel confident that the grouping of individual Indo-European linguistic
groups into larger units or branches of the tree is real.
1.2.4. Because of the difficulties found in the modelling of Proto-Indo-European
branches and daughter languages into the traditional, unitary Diverging Tree
framework, i.e. a uniform Proto-Indo-European language with its branches, a new model
called Converging Association of Languages was proposed, in which languages that are
in contact (not necessarily related to each other) exchange linguistic elements and rules,
thus developing and acquiring from each other. Most linguists have rejected it as an
implausible explanation of the irregularities found in the old, static concept of PIE.
NOTE. Among the prominent advocates is N.S. Trubetzkoy (Urheimat, 1939): The term
language family does not presuppose the common descent of a quantity of languages from a
single original language. We consider a language family a group of languages, in which a
considerable quantity of lexical and morphological elements exhibit regular equivalences () it is
not necessary for one to suppose common descent, since such regularity may also originate
through borrowings between neighboring unrelated languages () It is just as conceivable that the
ancestors of the Indo-European language branches were originally different from each other, but
though constant contact, mutual influence, and borrowings, approached each other, without
however ever becoming identical to one another (Meier-Brgger, 2003).
1. Introduction
29
Agreeing with Neumann (1996), Meier-Brgger (2003) states that that the various Indo-
European languages have developed from a prior unified language is certain. Questionable is,
however, the concrete how of this process of differentiation, and that this thesis of a
converging association of languages may immediately be dismissed, given that all Indo-European
languages are based upon the same Proto-Indo-European flexion morphology. As H. Rix makes
clear, it is precisely this morphological congruence that speaks against the language association
model, and for the diverging tree model, even if the traditional language tree models were unable
to explain the newest findings.
1.3. THE THEORY OF THE THREE STAGES
1.3.1. Even some of the first Indo-Europeanists had noted in their works the possibility
of older origins for the reconstructed (Late) Proto-Indo-European, although they didnt
dare to describe those possible older stages of the language.
NOTE. The development
of this theory of three
linguistic stages can be
traced back to the very
origins of Indo-European
studies, firstly as a diffused
idea of a non-static PIE
language, and later widely
accepted as a dynamic
dialectal evolution, already
in the 20th century, after
the decipherment of the
Anatolian scripts. Most
linguists accept that Proto-Indo-European must be the product of a long historical development.
On this question, H. Rix (Modussystem, 1986) asserts Whereby comparative reconstruction is
based upon a group of similar forms in a number of languages, internal reconstruction takes its
point of departure from irregularities or inhomogeneities of the system of a single language ()
The fundamental supposition of language-internal reconstruction is that such an irregularity or
inhomogeneity in the grammar of a language is the result of a diachronic process, in which an
older pattern,, or homogeneity is eclipsed, but not fully suppressed. According to Meier-Brgger
(2003), Rix works back from Late Proto-Indo-European Phase B (reconstructible Proto-Indo-
European) using deducible information about an Early Proto-Indo-European Phase A, and gathers
Expansion of Proto-Indo-European and its dialects ca.
4000BC-500AD, according to the Kurgan hypothesis.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
in his work related evidence on the Proto-Indo-European verbal system. On that question, see
also G.E. Dunkel Early, Middle, Late Indo-European: Doing it My Way (1997), Adrados,
Bernab, Mendoza, Manual de Lingstica Indoeuropea (1998); etc.
1.3.2. Today, a widespread Three-Stage Theory divide PIE (internal) language evolution
into three main historic layers or stages, including a description of branches and
languages as clean breaks from a common source (e.g. PAn from IE II) or from
intermediate dialect continua (e.g. Germanic and Balto-Slavic from EIE), or classifying
similarities into continued linguistic contact (e.g. Italic and Celtic, or Balto-Slavic and
Indo-Iranian):
1) Early PIE (also IE I, for some Pre-Proto-Indo-European) is the hypothetical
ancestor of Middle PIE, and probably the oldest stage of the language that comparative
grammar could help reconstruct using internal reconstruction. There is, however, no
common position as to how it was like or when and where it was spoken.
2) The second stage corresponds to a time before the separation of Proto-Anatolian
from the common linguistic community where it should have coexisted (as a Pre-
Anatolian dialect) with Pre-IE III. That stage of the language is called Middle PIE or IE
II, also Indo-Hittite, identified with early Kurgan cultures in the Kurgan Hypothesis.
NOTE. On the place of Anatolian among IE languages, the question is whether it separated first
as a language branch from PIE, and to what extent was it thus spared developments common to
the remaining (Late) Proto-Indo-European language group. There is growing consensus in favor of
its early split from Indo-European (Heading, among others, Indo-Hittite); see N. Oettinger Indo-
Hittite Hypothesen und Wortbildung (1986), A. Lehrman Indo-Hittite Revisited (1996), H.
Craig Melchert The Dialectal Position of Anatolian within IE in IE Subgrouping (1998), etc.
For Kortlandt (1989): Since the beginnings of the Yamnaya, Globular Amphora, Corded Ware,
and Afanasievo cultures can all be dated between 3600 and 3000 BC, I am inclined to date Proto-
Indo-European to the middle of the fourth millennium, and to recognize Proto-Indo-Hittite as a
language which may have been spoken a millennium earlier.
3) The common immediate ancestor of most of the earliest known IE proto-languages
more or less the same static PIE searched for since the start of Indo-European studies,
before Hittite was deciphered is usually called Late PIE, also IE III, often simply
Proto-Indo-European, often dated some time ca. 3500-3000 BC using linguistic or
archaeological models, or both.
1. Introduction
31
NOTE. According to Mallory &
Adams (2006), Generally, we find
some form of triangulation based on
the earliest attested Indo-European
languages, i.e. Hittite, Mycenaean
Greek, and Indo-Aryan, each of these
positioned somewhere between c.
2000 and 1500 BC. Given the kind of
changes linguists know to have
occurred in the attested histories of
Greek or Indo-Aryan, etc., the linguist
compares the difference wrought by
such changes with the degree of
difference between the earliest
attested Hittite, Mycenaean Greek,
and Sanskrit and reconstructed Proto-Indo-European. The order of magnitude for these estimates
(or guesstimates) tends to be something on the order of 1,500-2,000 years. In other words,
employing some form of gut intuition (based on experience which is often grounded on the known
separation of the Romance or Germanic languages), linguists tend to put Proto-Indo-European
sometime around 3000 BC plus or minus a millennium () the earliest we are going to be able to
set Proto-Indo-European is about the fifth millennium BC if we want it to reflect the
archaeological reality of Eurasia. We have already seen that individual Indo-European groups are
attested by c. 2000 BC. One might then place a notional date of c. 4500-2500 BC on Proto-Indo-
European. The linguist will note that the presumed dates for the existence of Proto-Indo-European
arrived at by this method are congruent with those established by linguists informed estimation.
The two dating techniques, linguistic and archeological, are at least independent and congruent
with one another.
Likewise, Meier-Brgger (2003) thinks that No precise statement concerning the exact time
period of the Proto-Indo-European linguistic community is possible. One may only state that the
ancient Indo-European languages that we know, which date from the 2
nd
millennium BC, already
exhibit characteristics of their respective linguistic groups in their earliest occurrences, thus
allowing one to presume the existence of a separate and long pre-history () The period of 5000-
3000 BC is suggested as a possible timeframe of a [common] Proto-Indo-European language.
However, on the early historic and prehistoric finds, B. Hnsel (1994) states that Linguistic
development may be described in steps that, although logically comprehensible, are not precisely
Yamna culture ca. 3000 BC, roughly the time
when Late PIE and Proto-Anatolian were spoken,
according to the Kurgan framework.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
analyzable without a timescale. The archaeologist pursues certain areas of cultural development,
the logic of which (if one exists) remains a mystery to him, or is only accessible in a few aspects of
its complex causality. On the other hand, he is provided with concrete ideas with regard to time,
as vague as these may be, and works with a concept of culture that the Indo-European linguist
cannot attain. For the archaeologist, culture is understood in the sense of a sociological definition
() The archaeological concept of culture is composed of so many components, that by its very
nature its contours must remain blurred. But languages are quite different. Of course there are
connections; no one can imagine cultural connections without any possibility of verbal
communication. But it is too much to ask that archaeologists equate their concept of culture,
which is open and incorporates references on various levels, to the single dimension of linguistic
community. Archaeology and linguistics are so fundamentally different that, while points of
agreement may be expected, parallels and congruency may not. The advantage of linguistic
research is its ability to precisely distinguish between individual languages and the regularity of
developments. The strength of archaeology is its precision in developing timelines. What one can
do, the other cannot. They could complement each other beautifully, if only there were enough
commonality.
1.3.3. Another division has to be made, so that the dialectal evolution and this revival
project is properly understood. Late PIE had at least two main inner dialectal branches,
the Southern (or IE IIIa) and the Northern (or IE IIIb) ones. As far as we know, while
speakers of Southern or Graeco-Aryan dialects (like Pre-Greek and Pre-Indo-Iranian
Indo-European) spread in different directions with the first Late PIE migrations ca.
3000-2500 BC, speakers of Northern dialects remained still in close contact for centuries
in Europe, but for those (like Pre-Tocharian IE speakers) who migrated to Asia.
NOTE. Within the group of IE languages, some individual languages are more closely associated
with one another owing to morphological or lexical similarities. The cause for this, as a rule, is a
prehistoric geographic proximity (perhaps even constituting single linguistic community) or a
common preliminary linguistic phase, a middle mother-language phase, which would however
then be posterior to the period of the [common PIE] mother language (Meier-Brgger, 2003).
About Tocharian, Adrados (1998) says even if archaic in some respects (its Centum character,
subjunctive, etc.) it has shares common features with Balto-Slavic, among other languages: they
must be old isoglosses, shared before it separated and migrated to the East. It is, therefore, an IE
III B [=Northern] language. It shows great innovations, too, something normal in a language that
evolved isolated.
1. Introduction
33
On the Southern (Graeco-Aryan or Indo-Greek) PIE dialect, see Gamkrelidze, Ivanov (1995) Indo-
European and the Indo-Europeans, and Clackson (1994) The Linguistic Relationship Between
Armenian and Greek, Adrados (1998), etc. Even in Mallory & Adams (2006): Many have argued
that Greek, Armenian, and Indo-Iranian share a number of innovations that suggest that there
should have been some form of linguistic continuum between their predecessors; An Irish-Indic
cognate looks a damn sight stronger than a Greek-Iranian and linguists have long noted that there
are a whole series of words that seem to be confined largely to Greek and Indo-Iranian.
1.3.4. The so-called North-West, Residual, or Europes Indo-European language, also
Old European or Proto-European, linguistic ancestor of Celtic, Italic, Germanic, Balto-
Slavic and other (fragmentary or unattested) European dialects, is believed to have
formed the last common IE dialect continuum in Northern Europe during the centuries
on either side of 2500 BC. It was therefore a language coeval with Common Anatolian,
Proto-Indo-Iranian and Proto-Greek, able to fill in the time gap between the dates
estimated for North-West IE proto-languages and the Central and Eastern ones.
NOTE 1. On the so-called Nort-West Indo-European language or dialect continuum, see N.
Oettinger Grundstzliche berlegungen zum Nordwest-Indogermanischen (1997), and Zum
nordwestindogermanischen Lexikon (1999); M. E. Huld Indo-Europeanization of Northern
Europe (1996); Adrados, Bernab, Mendoza, Manual de Lingstica Indoeuropea (1998); etc.
Furthermore, regarding modern IE studies, even without accepting a common, independent
North-West IE language, There are so many of these words that are confined within these five
language groups (Celtic, Italic, Germanic, Baltic, and Slavic) that most linguists would regard
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
cognates found exclusively between any two or among all of these groups as specifically North-
West Indo-European and not demonstrably Proto-Indo-European. To accept a series of cognates
as reflections of a PIE word requires that the evidence come from further afield than a series of
contiguous language groups in Europe, Mallory & Adams (2006).
NOTE 2. About the Dark Ages, the time gap between the reconstructed North-West Indo-
European proto-languages (dated ca. 1500 BC) and the other IE proto-languages reconstructed
(dated ca. 2500 BC), Mallory & Adams (2006): If one reviews discussion of the dates by which the
various Indo-European groups first emerged, we find an interesting and somewhat disturbing
phenomenon. By c. 2000 BC we have traces of Anatolian, and hence linguists are willing to place
the emergence of Proto-Anatolian to c. 2500 BC or considerably earlier. We have already
differentiated Indo-Aryan in the Mitanni treaty by c.1500 BC so undifferentiated Proto-Indo-
Iranian must be earlier, and dates on the order of 2500-2000 BC are often suggested. Mycenaean
Greek, the language of the Linear B tablets, is known by c.1300 BC if not somewhat earlier and is
different enough from its Bronze Age contemporaries (Indo-Iranian or Anatolian) and from
reconstructed PIE to predispose a linguist to place a date of c. 2000 BC or earlier for Proto-Greek
itself. So where we have written documentation from the Bronze Age, we tend to assign the proto-
languages to an earlier period of the Bronze Age, i.e. earlier than at least 2000 BC if not 2500 BC.
When we turn to western and northern Europe, however, both our attestation of the different
groups and the estimates of their proto-languages tend to be shallower. The Germanic languages,
for example, are all derived from Proto-Germanic. Now the earliest runic inscriptions are so close
to reconstructed Proto-Germanic that there is a tendency to date the Germanic proto-language to
about 500 BC. Similarly, if we examine the earliest Celtic inscriptional evidence, be it Continental
or even the much more recent Irish ogam stones, these inscriptions are not that far removed from
the reconstructed Proto-Celtic and again we tend to have dates suggested on the order of 1000 BC.
The Slavic languages only began differentiating from one another during the historical period, and
Proto-Slavic is generally set to about the beginning of the Christian era while Proto-Baltic and
Proto-Balto-Slavic (assuming its existence) are probably envisaged as a second millennium BC
phenomenon. In short, where the Indo-European groups are more recently attested, we tend to
find that they are also regarded as having differentiated at a more recent time, i.e. between c.1500
and 500 BC. One explanation for the relatively short time depths of the attested northern and
western Indo-European groups is that these groups are the only survivors of a long process of
linguistic assimilation that has occurred as small demographic and linguistic groups moved,
interacted, and merged. We can see precisely such a process in action in the historic period as
Latin assimilated and replaced all the other Italic languages, Umbrian, Oscan, etc., and then went
on to assimilate and replace much of the Celtic languages. Also within the historic period Slavic
1. Introduction
35
assimilated and replaced such other Indo-European languages as Thracian, and Koine Greek
replaced nearly all other varieties of Greek. If we had only contemporary data to work with, we
would have to conclude that both Proto-Italic (now equivalent to Proto-Romance) and Proto-
Greek flourished around the beginning of the Christian era. These extinction events in the history
of Italic and Greek had the effect of resetting the time depth of the proto-language. This process
must have been repeated time and again in the prehistoric period.
Similarly, Kortlandt (1989) detected what seems to be a general tendency to date proto-
languages farther back in time than is warranted by the linguistic evidence. When we reconstruct
Proto-Romance, we arrive at a linguistic stage which is approximately two centuries later than the
language of Caesar and Cicero (cf. Agard 1984: 47-60 for the phonological differences). When we
start from the extralinguistic evidence and identify the origins of Romance with the beginnings of
Rome, we arrive at the eighth century BC, which is almost a millennium too early. The point is that
we must identify the formation of Romance with the imperfect learning of Latin by a large number
of people during the expansion of the Roman empire.
1.3.4. Apart from the shared phonology and vocabulary, Europes Indo-European shows
other common features, as a trend to reduce the noun inflection system, shared
innovations in the verbal system, the -r endings of the middle or middle-passive voice,
etc. The southern dialects, which spread in different directions and evolved without
forming a continuum, show therefore a differentiated phonology and even vocabulary,
but common developments like the augment in -, etc.
IE dialects ca. 500 BC.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
NOTE. Some European dialects like Balto-Slavic and, to some extent, Italic (or Latin, if
differentiated from Osco-Umbrian), either because of general PIE innovative or archaic trends
that only they maintained, or because of their original situation within the prehistoric dialectal
territories in relation with the origin of innovations or just because they remained in contact
with Southern Indo-European dialects after the first PIE split (e.g. through the Scythian or general
Iranian expansions) show features usually identified with Indo-Iranian, as an 8-case noun
declension and phonetic satemization, while having morphological features clearly common to
Germanic and Celtic dialects, like their verbal system. Images show IE languages ca. 500 AD
(top) and 1500 AD (under these lines).
1. Introduction
37
1.4. THE PROTO-INDO-EUROPEAN URHEIMAT OR HOMELAND
1.4.1. The search for the Urheimat or Homeland of the prehistoric Proto-Indo-
Europeans has developed as an archaeological quest along with the linguistic research
looking for the reconstruction of the proto-language.
NOTE. According to A. Scherers Die Urheimat der Indogermanen (1968), summing up the
views of various authors from the years 1892-1963, still followed by mainstream Indo-European
studies today, Based upon the localization of later languages such as Greek, Anatolian, and Indo-
Iranian, a swathe of land in southern Russia north of the Black Sea is often proposed as the native
area of the speakers of Proto-Indo-European (Meier-Brgger, 2003).
1.4.2. The Kurgan hypothesis was
introduced by Marija Gimbutas in
1956 in order to combine
archaeology with linguistics in
locating the origins of the Proto-
Indo-Europeans. She named the set
of cultures in question Kurgan
after their distinctive burial mounds
and traced their diffusion into
Eastern and Northern Europe.
1.4.3. According to her hypothesis, PIE speakers were probably a nomadic tribe of the
Pontic-Caspian steppe that expanded in successive stages of the Kurgan culture and
three successive waves of expansion during the 3
rd
millennium BC:
Kurgan I, Dnieper/Volga region, earlier half of the 4
th
millennium BC. Apparently
evolving from cultures of the Volga basin, subgroups include the Samara and
Seroglazovo cultures.
Kurgan IIIII, latter half of the 4
th
millennium BC. Includes the Sredny Stog culture
and the Maykop culture of the northern Caucasus. Stone circles, early two-wheeled
chariots, anthropomorphic stone stelae of deities.
Kurgan IV or Pit Grave culture, first half of the 3
rd
millennium BC, encompassing the
entire steppe region from the Ural to Romania.
Photo of a Kurgan ( Archaeology Magazine).
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
o Wave 1, predating Kurgan I, expansion from the lower Volga to the Dnieper,
leading to coexistence of Kurgan I and the Cucuteni culture. Repercussions of the
migrations extend as far as the Balkans and along the Danube to the Vina and
Lengyel cultures in Hungary.
o Wave 2, mid 4
th
millennium BC, originating in the Maykop culture and resulting
in advances of kurganized hybrid cultures into northern Europe around 3000
BC Globular Amphora culture, Baden culture, and ultimately Corded Ware
culture.
o Wave 3, 3000-2800 BC, expansion of the Pit Grave culture beyond the steppes;
appearance of characteristic pit graves as far as the areas of modern Romania,
Bulgaria and eastern Hungary.
Hypothetical Homeland or Urheimat of the first PIE speakers, from 4500 BC
onwards. The Yamna (Pit Grave) culture lasted from ca. 3600 till 2200 BC. In this
time the first wagons appeared. People were buried with their legs flexed, a
position which remained typical for the Indo-Europeans for a long time. The
burials were covered with a mound, a kurgan. During this period, from 3600 till
3000 IE II split up into Pre-IE III and Pre-Proto-Anatolian. From ca.3000 B.C on,
Late PIE dialects began to differentiate and spread by 2500 westward (Europes
Indo-European), southward (Proto-Greek) and eastward (Proto-Aryan, Pre-Proto-
Tocharian).
1. Introduction
39
NOTE. On the Kurgan hypothesis, Mallory & Adams (2006) say that [t]he opposite method to a
retrospective approach is a prospective approach where one starts with a given archaeological
phenomenon and tracks its expansion. This approach is largely driven by a theory connected with
the mechanism by which the Indo-European languages must have expanded. Here the trajectory
need not be the type of family tree that an archaeologist might draw up but rather some other
major social phenomenon that can move between cultures. For example, in both the nineteenth
century and then again in the later twentieth century, it was proposed that Indo-European
expansions were associated with the spread of agriculture. The underlying assumption here is that
only the expansion of a new more productive economy and attendant population expansion can
explain the widespread expansion of a language family the size of the Indo-European. This theory
is most closely associated with a model that derives the Indo-Europeans from Anatolia about the
seventh millennium BC from whence they spread into south-eastern Europe and then across
Europe in a Neolithic wave of advance.
A later alternative mechanism is the spread of more pastoral societies who exploited the horse
(and later the chariot) and carried a new language across Europe and Asia from the fourth
millennium bc onwards. The underlying assumption here is that the vector of Indo-European
language spread depended on a new, more aggressive social organization coupled with a more
mobile economy and superior transportation technology. As this theory sets the homeland in the
steppelands north of the Black and Caspian seas among different cultures that employed barrows
for their burials (Russian kurgan), it is generally termed the Kurgan theory.
Although the difference between
the Wave of Advance and Kurgan
theories is quite marked, they both
share the same explanation for the
expansion of the Indo-Iranians in
Asia (and there are no fundamental
differences in either of their
difficulties in explaining the
Tocharians), i.e. the expansion of
mobile pastoralists eastwards and
then southwards into Iran and India.
Moreover, there is recognition by
supporters of the Neolithic theory that the wave of advance did not reach the peripheries of
Europe (central and western Mediterranean, Atlantic and northern Europe) but that these regions
adopted agriculture from their neighbours rather than being replaced by them.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
I. LINGUISTIC RECONSTRUCTION AND ARCHAEOLOGICAL TIMELINE
The Manual de Lingstica Indoeuropea (Adrados, Bernab, Mendoza, 1998) makes a
summary of main linguistic facts, supported by archaeological finds: Remember the
recent date of the cristalization of European languages. Old European [=North-West
Indo-European], from which they derive, is an already evolved language, with opposition
masculine/feminine, and must be located in time ca. 2000 BC or before. Also, one must
take into account the following data: the existence of Tocharian, related to IE IIIb
[=Northern PIE], but far away to the East, in the Chinese Turkestan; the presence of IE
IIIb [=Graeco-Aryan] languages to the South of the Carpathian Mountains, no doubt
already in the 3
rd
millennium (the ancestors of Thracian, Iranian, Greek speakers);
differentiation of Hittite and Luwian, within the Anatolian group, already ca. 2000 BC, in
the documents of Kltepe, what means that Common Anatolian must be much older.
NOTE. Without taking on account archaeological theories, linguistic data reveals that:
a) IE IIIb, located in Europe and in the Chinese Turkestan, must come from an
intermediate zone, with expansion into both directions.
b) IE IIIa, which occupied the space between Greece and the Northwest of India,
communicating both Paeninsulas through the languages of the Balkans, Ukraine
and Northern Caucasus, the Turkestan and Iran, must also come from some
intermediate location. Being a different linguistic group, it cannot come from
Europe or the Russian Steppe, where Ural-Altaic languages existed.
c) Both groups have been in contact secondarily, taking on account the different
recent isoglosses in the contact zone.
d) The more archaic Anatolian must have been isolated from the more evolved IE; and
that in some region with easy communication with Anatolia.
() Only the Steppe North of the Caucasus, the Volga river and beyond can combine all
possibilities mentioned: there are pathways that go down into Anatolia and Iran
through the Caucasus, through the East of the Caspian Sea, the Gorgan plains, and they
can migrate from there to the Chinese Turkestan, or to Europe, where two ways exist:
to the North and to the South of the Carpathian mountains.
NOTE. For Kortlandt (1989), too, Starting from the linguistic evidence () The best candidate
for the original IE homeland is the territory of the Sredny Stog culture in the eastern Ukraine.
1. Introduction
41
Northern horde Tocharian
Southern Horde
Northern Horde
Southern Horde
These linguistic data, presented in a diagram, are supported by strong archaeological arguments:
they have been defended by Gimbutas 1985 against Gamkrelidze-Ivanov () This diagram
proposes three phases. In the first one, IE II [=Middle PIE] became isolated, and from it Anatolian
emerged, being first relegated to the North of the Caucasus, and then crossing into the South:
there must Common Anatolian be located. Note that there is no significant temporal difference
with the other groups; it happens also that the first IE wave into Europe was older. It is
somewhere to the North of the people that later went to Anatolia that happened the great
revolution that developed IE III [=Late PIE], the common language.
The following phases refer to that common language. The first is that one that saw both IE III B
[=Northern IE] (to the North) and IE III A [=Southern IE] (to the South), the former being
fragmented in two groups, one that headed West and one that migrated to the East. That is a proof
that somewhere in the European Russia a common language III B emerged; to the South, in
Ukraine or in the Turkestan, IE III A.
Diagram of the expansion and relationships of IE languages, Adrados (1979).
Anat.
West IE Bal.-Sla.
Gk.-Thrac. Arm. Ind.-Ira.
Germanic Bal.-Sla.
Indo-Iranian
Cel. Ita.
Thr.
Arm.
Gk.
I.-I.
Phase 1
IE II
Phase 2
IE IIIb
IE IIIa
Phase 3
IE IIIb
IE IIIa
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
The second phase continues the movements of both branches, that launched waves to the South,
but that were in contact in some moments, arising isoglosses that unite certain languages of the IE
IIIa group (first Greek, later Iranian, etc.) with those of the rearguard of IE IIIb (especially Baltic
and Slavic, also Italic and Germanic).
II. ARCHEOGENETICS AND INDO-EUROPEAN MIGRATIONS
Cavalli-Sforza and
Alberto Piazza argue
that Renfrew (v.i.) and
Gimbutas reinforce
rather than contradict
each other, stating
that genetically
speaking, peoples of
the Kurgan steppe
descended at least in
part from people of
the Middle Eastern
Neolithic who immigrated there from Turkey.
NOTE. The genetic record cannot yield any direct information as to the language spoken by these
groups. The current interpretation of genetic data suggests a strong genetic continuity in Europe;
specifically, studies of mtDNA by Bryan Sykes show that about 80% of the genetic stock of
Europeans originated in the Paleolithic.
Spencer Wells suggests that the origin, distribution and age of the R1a1 haplotype
points to an ancient migration, possibly corresponding to the spread by the Kurgan
people in their expansion across the Eurasian steppe around 3000 BC, stating that there
is nothing to contradict this model, although the genetic patterns do not provide clear
support either.
NOTE. R1a1 is most prevalent in Poland, Russia, and Ukraine, and is also observed in Pakistan,
India and central Asia. R1a1 is largely confined east of the Vistula gene barrier and drops
considerably to the west. The spread of Y-chromosome DNA haplogroup R1a1 has been associated
with the spread of the Indo-European languages too. The mutations that characterize haplogroup
R1a occurred ~10,000 years bp.
Distribution of haplotypes R1b (light color) for Eurasiatic
Paleolithic and R1a (dark color) for Yamna expansion; black
represents other haplogroups.
1. Introduction
43
Haplogroup R1a1, whose lineage is thought to have originated in the Eurasian Steppes north of
the Black and Caspian Seas, is therefore associated with the Kurgan culture, as well as with the
postglacial Ahrensburg culture which has been suggested to have spread the gene originally.
The present-day
population of R1b
haplotype, with
extremely high peaks
in Western Europe
and measured up to
the eastern confines
of Central Asia, are
believed to be the
descendants of a
refugium in the
Iberian peninsula (Portugal and Spain) at the Last Glacial Maximum, where the
haplogroup may have achieved genetic homogeneity. As conditions eased with the
Allerd Oscillation in about 12000 BC, descendants of this group migrated and
eventually recolonised all of Western Europe, leading to the dominant position of R1b in
variant degrees from Iberia to Scandinavia, so evident in haplogroup maps.
NOTE. High concentrations of Mesolithic or late Paleolithic YDNA haplogroups of types R1b
(typically well above 35%) and I (up to 25%), are thought to derive ultimately of the robust
Eurasiatic Cro
Magnoid homo sapiens
of the Aurignacian
culture, and the
subsequent gracile
leptodolichomorphous
people of the
Gravettian culture that
entered Europe from
the Middle East
20,000 to 25,000
years ago, respectively.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
III. THE KURGAN HYPOTHESIS AND THE THREE-STAGE THEORY
ARCHAEOLOGY (Kurgan Hypothesis) LI NGUI STI CS (Three-Stage Theory)
ca. 4500-4000 BC. Sredny Stog, Dnieper-Donets
and Sarama cultures, domestication of the horse.
Early PIE spoken, probably somewhere in the
Pontic-Caspian Steppe.
ca. 4000-3500 BC. The Yamna culture, the kurgan
builders, emerges in the steppe, and the Maykop
culture in northern Caucasus.
Middle PIE. Pre-IE III and Pre-Proto-Anatolian
dialects evolve in different communities but
presumably still in contact
ca. 3500-3000 BC. Yamna culture at its peak:
stone idols, two-wheeled proto-chariots, animal
husbandry, permanent settlements and hillforts,
subsisting on agriculture and fishing, along rivers.
Contact of the Yamna culture with late Neolithic
Europe cultures results in kurganized Globular
Amphora and Baden cultures. Maykop culture
shows earliest evidence of the beginning Bronze
Age; bronze weapons and artifacts introduced.
Proto-Anatolian becomes isolated south of the
Caucasus, and has no more contacts with the
linguistic innovations of the common Late PIE
language.
ca. 3000-2500 BC. The Yamna culture extends
over the entire Pontic steppe. The Corded Ware
culture extends from the Rhine to the Volga,
corresponding to the latest phase of IE unity.
Different cultures disintegrate, still in loose
contact, enabling the spread of technology.
Late PIE evolves into dialects, at least a
Southern and a Northern one. Dialectal
communities remain still in contact, enabling
the spread of phonetic and morphological
innovations, and loan words. PAn, spoken in
Asia Minor, evolves into Common Anatolian.
ca. 2500-2000 BC. The Bronze Age reaches
Central Europe with the Beaker culture of
Northern Indo-Europeans. Indo-Iranians settle
north of the Caspian in the Sintashta-Petrovka and
later the Andronovo culture.
The breakup of the southern IE dialects is
complete. Proto-Greek spoken in the Balkans;
Proto-Indo-Iranian in Central Asia; North-West
Indo-European in Northern Europe; Common
Anatolian dialects in Anatolia.
ca. 2000-1500 BC. The chariot is invented, leading
to the split and rapid spread of Iranians and other
peoples from the Andronovo culture and the
Bactria-Margiana Complex over much of Central
Asia, Northern India, Iran and Eastern Anatolia.
Greek Darg Ages and flourishing of the Hittite
Empire. Pre-Celtic Unetice culture.
Indo-Iranian splits up in two main dialects,
Indo-Aryan and Iranian. European proto-
dialects like Germanic, Celtic, Italic, and Balto-
Slavic differentiate from each other. Anatolian
languages like Hittite and Luwian are written
down; Indo-Iranian attested through Mitanni;
a Greek dialect, Mycenaean, is already spoken.
ca. 1500-1000 BC. The Nordic Bronze Age sees the
rise of the Germanic Urnfield and the Celtic
Hallstatt cultures in Central Europe, introducing
the Iron Age. Italic peoples move to the Italian
Peninsula. Rigveda is composed. The Hittite
Kingdoms and the Mycenaean civilization decline.
Germanic, Celtic, Italic, Baltic and Slavic are
already different proto-languages, developing
in turn different dialects. Iranian and other
related southern dialects expand through
military conquest, and Indo-Aryan spreads in
the form of its sacred language, Sanskrit.
ca. 1000-500 BC. Northern Europe enters the Pre-
Roman Iron Age. Early Indo-European Kingdoms
and Empires in Eurasia. In Europe, Classical
Antiquity begins with the flourishing of the Greek
peoples. Foundation of Rome.
Celtic dialects spread over Western Europe,
German dialects to the south of Jutland. Italic
languages in the Italian Peninsula. Greek and
Old Italic alphabets appear. Late Anatolian
dialects. Cimmerian, Scythian and Sarmatian in
Asia, Palaeo-Balkan languages in the Balkans.
1. Introduction
45
1.5. OTHER LINGUISTIC AND ARCHAEOLOGICAL THEORIES
1.5.1. A common development of new hypotheses has been to revise the Three-Stage
assumption. It is actually not something new, but the come back to more traditional
views, reinterpreting the new findings of the Hittite scripts, trying to insert Anatolian
into the old, static PIE concept.
1.5.2. The most known new alternative theory concerning PIE is the Glottalic theory. It
assumes that Proto-Indo-European was pronounced more or less like Armenian, i.e.
instead of PIE *p, *b, *b
h
, the pronunciation would have been *p, *p, *b, and the same
with the other two voiceless-voiced-voiced aspirated series of consonants. The IE
Urheimat would have been then located in the surroundings of Anatolia, especially near
Lake Urmia, in northern Iran, hence the archaism of Anatolian dialects and the glottalics
found in Armenian.
NOTE. Those linguistic and archaeological findings are supported by Th. Gamkredlize-V. Ivanov
(1990: The early history of Indo-European languages, Scientific American, where early Indo-
European vocabulary deemed of southern regions is examined, and similarities with Semitic and
Kartvelian languages are also brought to light. This theory has been criticized by Meid (1989)
1.5.3. Alternative theories include:
I. The European Homeland thesis maintains that the common origin of the IE
languages lies in Europe. These theses are more or less driven by Archeological. A.
Husler (1981, 1986, 1992) continues to defend the hypothesis that places Indo-
European origins in Europe, stating that all the known differentiation emerged in the
continuum from the Rhin to the Urals.
NOTE. It has been traditionally located in 1) Lithuania and the surrounding areas, by R.G.
Latham (1851) and Th. Poesche (1878: Die Arier. Ein Beitrag zur historischen Anthropologie,
Jena); 2) Scandinavia, by K.Penka (1883: Origines ariacae, Viena); 3) Central Europe, by G.
Kossinna (1902: Die Indogermanische Frage archologisch beantwortet, Zeitschrift fr
Ethnologie, 34, pp. 161-222), P.Giles (1922: The Aryans, New York), and by linguist/archaeologist
G. Childe (1926: The Aryans. A Study of Indo-European Origins, London).
a. The Old European or Alteuropisch Theory compares some old European
vocabulary (especially river names), which would be older than the spread of Late PIE
dialects through Northern Europe. It points out the possibility of an older, pre-IE III
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
spread of IE, either of IE II or I or maybe some other Pre-IE dialect. It is usually related
to the PCT and Renfrews NDT.
b. The Paleolithic Continuity Theory posits that the advent of IE languages should be
linked to the arrival of Homo sapiens in Europe and Asia from Africa in the Upper
Paleolithic. The PCT proposes a continued presence of Pre-IE and non-IE peoples and
languages in Europe from Paleolithic times and allowing for minor invasions and
infiltrations of local scope, mainly during the last three millennia.
NOTE. There are some research papers concerning the PCT available at
<http://www.continuitas.com/>. Also, the PCT could in turn be connected with Frederik
Kortlandts Indo-Uralic and Altaic studies <http://kortlandt.nl/publications/> although they
could also be inserted in Gimbutas early framework.
On the temporal relationship question, Mallory & Adams (2006): How early a solution is
admitted depends on individual decisions regarding the temporally most diagnostic vocabulary.
That the vocabulary is clearly one reflecting at least a Neolithic economy and technology, i.e.
domesticated plants and animals, ceramics, means that it cannot be set anywhere on this planet
prior to c. 8000 BC. Although there are still those who propose solutions dating back to the
Palaeolithic, these cannot be reconciled with the cultural vocabulary of the Indo-European
languages. The later vocabulary of Proto-Indo- European hinges on such items as wheeled
vehicles, the plough, wool, which are attested in Proto-Indo-European, including Anatolian. It is
unlikely then that words for these items entered the Proto-Indo-European lexicon prior to about
4000 BC. This is not necessarily a date for the expansion of Indo-European since the area of
Proto-Indo-European speech could have already been in motion by then and new items with their
words might still have passed through the continuum undetected, i.e. treated as inheritances
rather than borrowings. All that can be concluded is that if one wishes to propose a homeland
earlier than about 4000 bc, the harder it is to explain these items of vocabulary.
c. The PCT is, in turn, related to the theories of a Neolithic revolution causing the
peacefully spreading of an older pre-IE language into Europe from Asia Minor from
around 7000 BC, with the advance of farming. It proposes that the dispersal
(discontinuity) of Proto-Indo-Europeans originated in Neolithic Anatolia.
NOTE. Reacting to criticism, Renfrew by 1999 revised his proposal to the effect of taking a
pronounced Indo-Hittite position. Renfrews revised views place only Pre-Proto-Indo-European in
7th millennium BC Anatolia, proposing as the homeland of Proto-Indo-European proper the
Balkans around 5000 BC, explicitly identified as the Old European culture proposed by
Gimbutas.
1. Introduction
47
As of 2005, Colin Renfrew seems to support the PCT designs and the usefulness of the
Paleolithic assumptions. He co-authored a paper concluding: Our finding lends weight to a
proposed Paleolithic ancestry for modern Europeans The above quotation coming as results of
archaeogenetic research on mtDNA where 150 x greater N1a frequency was found. The first
European farmers are descended from a European population who were present in Europe since
the Paleolithic and not coming as a wave of Neolithic migration as proposed in Renfrews NDT.
Talking about these new (old) theories, Adrados (1998) makes an interesting remark about the
relevance that is wrongly given to each new personal archaeological revolutionary theory:
[The hypothesis of Colin Renfrew (1987)] is based on ideas about the diffusion of agriculture from
Asia to Europe in [the 5
th
millennium Neolithic Asia Minor], diffusion that would be united to that
of Indo-Europeans; it doesnt pay attention at all to linguistic data. The [hypothesis of
Gamkrelidze-Ivanov (1980, etc.)], which places the Homeland in the contact zone between
Caucasian and Semitic peoples, south of the Caucasus, is based on real or supposed lexical loans; it
disregards morphological data altogether, too. Criticism of these ideas to which people have paid
too much attention are found, among others, in Meid (1989), Villar (1991), etc..
II. Another hypothesis, contrary to the European ones, also mainly driven today by a
nationalistic view, traces back the origin of PIE to Vedic Sanskrit, postulating that it is
very pure, and that the origin of common Proto-Indo-European can thus be traced back
to the Indus Valley Civilization of ca. 3000 BC.
NOTE. Such Pan-Sanskritism was common among early Indo-Europeanists, as Schlegel, Young,
A. Pictet (1877: Les origines indoeuropens, Paris) or Schmidt (who preferred Babylonia), but are
now mainly supported by those who consider Sanskrit almost equal to Late Proto-Indo-European.
For more on this, see S. Misra (1992: The Aryan Problem: A Linguistic Approach, Delhi), Elsts
Update on the Aryan Invasion Debate (1999), followed up by S.G. Talageris The Rigveda: A
Historical Analysis (2000), both part of Indigenous Indo-Aryan viewpoint by N. Kazanas, the
Out of India theory, with a framework dating back to the times of the Indus Valley Civilization.
III. The Black Sea deluge theory dates the origin of the expansion of IE dialects in the
genesis of the Sea of Azov, ca. 5600 BC, which would in turn be related to the deluge
myth, which would have remained in oral tails until its description in the biblical story of
Noahs Ark, the Hindu Puranic story of Manu, through Deucalion in Greek mythology or
Utnapishtim in the Epic of Gilgamesh. This date is generally considered as rather early
for the PIE spread under frameworks which include the Urheimat near the Black Sea.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
NOTE. W.Ryan and W.Pitman published evidence that a massive flood through the Bosporus
occurred about 5600 BC, when the rising Mediterranean spilled over a rocky sill at the Bosporus.
The event flooded 155,000 km of land and significantly expanded the Black Sea shoreline to the
north and west. This has been connected with the fact that some Early Modern scholars based on
Genesis 10:5 had assumed that the Japhetite languages (instead of the Semitic ones) are rather
the direct descendants of the Adamic language, having separated before the confusion of tongues,
by which also Hebrew was affected. That was claimed by Blessed Anne Catherine Emmerich (18
th
c.), who stated in her private revelations that the purest descendants of the Adamic language were
the main Proto-Indo-European dialects, v.i.
1.6. RELATIONSHIP TO OTHER LANGUAGES
1.6.1. Many higher-level relationships between PIE and other language families have
been proposed. But these speculative connections are highly controversial. Perhaps the
most widely accepted proposal is of an Indo-Uralic family, encompassing PIE and Proto-
Uralic, a language from which Hunarian, Finnish, Estonian, Saami and a number of
other languages belong. The evidence usually cited in favor of this is the proximity of the
proposed Urheimaten for both of them, the typological similarity between the two
languages, and a number of apparent shared morphemes.
NOTE. Other proposals, further back in time (and correspondingly less accepted), model PIE as
a branch of Indo-Uralic with a Caucasian substratum; link PIE and Uralic with Altaic and certain
other families in Asia, such as Korean, Japanese, Chukotko-Kamchatkan and Eskimo-Aleut
(representative proposals are Nostratic and Joseph Greenbergs Eurasiatic); etc.
1.6.2. Indo-Uralic or Uralo-Indo-European is therefore a hypothetical language family
consisting of Indo-European and Uralic (i.e. Finno-Ugric and Samoyedic). Most linguists
still consider this theory speculative and its evidence insufficient to conclusively prove
genetic affiliation.
NOTE. Dutch linguist Frederik Kortlandt supports a model of Indo-Uralic in which its speakers
lived north of the Caspian Sea, and Proto-Indo-Europeans began as a group that branched off
westward from there to come into geographic proximity with the Northwest Caucasian languages,
absorbing a Northwest Caucasian lexical blending before moving farther westward to a region
north of the Black Sea where their language settled into canonical Proto-Indo-European.
1. Introduction
49
The problem with lexical evidence is to weed out words due to borrowing, because Uralic
languages have been in contact with Indo-European languages for millennia, and consequently
borrowed many words from them.
1.6.3. The most common arguments in favour of a relationship between Early PIE and
Uralic are based on seemingly common elements of morphology, such as:
Meaning Early PIE Proto-Uralic
I, me *me, me (Acc.), *mene, my (Gen.)
*mun, *mina, I
you (sg) *tu (Nom.), *twe (Acc.), *tewe your (Gen.)
*tun, *tina
1
st
P. singular *-m
*-m
1
st
P. plural *-me
*-me
2
nd
P. singular *-s (active), *-tHa (perfect)
*-t
2
nd
P. plural *-te
*-te
Demonstrative *so, this, he/she (animate nom)
*a (3
rd
person singular)
Interr. pron.
(An.)
*k
w
i-, who?, what?; *k
w
o-, who?, what?
*ken, who?, *ku-,
who?
Relative pronoun *jo-
*-ja (nomen agentis)
Accusative *-m
*-m
Ablative/partitive *-od
*-ta
Nom./Acc. plural *-es (Nom. pl.), *-m-s (Acc. pl.)
*-t
Oblique plural *-i (pronominal pl., cf. we-i- we, to-i- those)
*-i
Dual *-H
*-k
Stative *-s- (aorist); *-es-, *-t (stative substantive)
*-ta
Negative particle *nei, *ne
*ei- [negative verb] , *ne
to give *deh3-
*toHe-
to wet,water *wed-, to wet, *wodr-, water
*weti, water
water *mesg-, dip under water, dive
*muke-, wash
to assign,
name
*nem-, to assign, to allot, *h1nomn-, name
*nimi, name
metal *h2weseh2-, gold
*wake, some metal
trade *mei-, exchange
*miHe-, give, sell
fish *(s)k
w
alo-, large fish
*kala, fish
sister-in-law *galou-, husband's sister
*kl, sister-in-law
much *polu-, much
*palj, thick, much
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
1.7. INDO-EUROPEAN DIALECTS
SCHLEICHERS FABLE: FROM PIE TO MODERN ENGLISH
The so-called Schleicher's fable is a poem composed in PIE, published by August Schleicher in
1868, originally named The Sheep and the Horses. It is written here in the different
reconstructible IE dialects for comparison.
The immediate parent dialect of each proto-language is enclosed in parentheses.
Languages of Europe. The black line divides the zones traditionally (or politically)
considered inside the European subcontinent. Northern dialects are all but Greek and
Kurdish (Iranian); Armenian is usually considered a Graeco-Aryan dialect, while
Albanian is usually classified as a Northern one. Numbered inside the map, non-Indo-
European languages: 1) Uralic languages; 2) Turkic languages; 3) Basque; 4) Maltese;
5) Caucasian languages.
1. Introduction
51
A Common PIE version (ca. 3500 BC?): H3owis h1ekws-k
w
e. H3owis, k
w
esjo wlh1neh2 ne h1est,
h1ekwoms spekt, h1oinom g
w
rh3m wog
h
om wg
h
ontm, h1oinom-k
w
e megeh2m b
h
orom,
h1oinom-k
w
e d
h
h1g
h
monm h1oh1ku b
h
rontm. H3owis nu h1kwob
h
jos weuk
w
t: Krd h2g
h
nutoi
h1mo, h1ekwoms h2gontm wih1rm wdntei. H1ekws tu weuk
w
nt: Klud
h
, h3owi! krd
h2g
h
nutoi nsmi wdntb
h
jos: h2ner, potis, h3owjom-r wlh1neh2m sweb
h
ei g
wh
ermom westrom
k
w
rneuti. H3owjom-k
w
e wlhneh2 ne h1esti. Tod kkluwos h3owis h2egrom b
h
ugt.
Common Anatolian (PAn), 2500 BC Europes IE (IE IIIb), ca. 2500 BC
Howis ekws-k
w
e. Owis ekws-k
w
e.
Howis, k
w
esjo wlneh ne est, Owis, k
w
esjo wln ne est,
ekwons spekt, ekwons spekt,
oikom g
w
rrm wogom wgontm, oinom g
w
rm wog
h
om wg
h
ontm,
oikom-k
w
e megehm borom, oinom-k
w
e megm b
h
orom,
oikom-k
w
e dgomonm oku brontm. oinom-k
w
e d
h
g
h
omonm k b
h
rontm.
Howis nu kwobos wk
w
t: Owis nu kwob
h
os weuk
w
t:
Krdi xgnutor moi, Krdi g
h
nutoi moi,
ekwons xgontm wirm wdnt. ekwons gontm wrm wdntei.
Ekws tu weuk
w
nt: Klud, howi! Ekws tu weuk
w
nt: Klud
h
i, owi!
krdi hegnutor nsm wdntbos: krdi g
h
nutoi nsmi wdntb
h
jos:
hner, potis, howjom-r wlnehm ner, potis, owjom-r wlnm
sweb g
w
ermom wstrom k
w
rndi. seb
h
ei g
wh
ormom westrom k
w
rneuti.
Howjom-k
w
e wlneh ne esti. Owjom-k
w
e wln ne esti.
Tod kkluwos howis hegrom bugt. Tod kkluwos owis agrom b
h
ugt.
Proto-Aryan (IE IIIa), ca. 2500 BC Proto-Greek (IE IIIa), ca. 2500 BC
Awis aws-ka. Owis ekwoi-k
w
e.
Awis, kasja wrn na st, wis, k
w
eho wln ne st,
akwans spat, ekwons spekt,
aikam grm wag
h
am wg
h
antm, oiwom k
wh
rum wok
h
om wek
h
ontm,
aikam-ka mag
h
am b
h
aram, oiwom-k
w
e megm p
h
orom,
aikam-ka d
h
g
h
manm u b
h
arantm. oiwom-k
w
e k
h
t
h
monm ku p
h
rontm.
Awis nu wab
h
jas waukat: wis nu kwop
h
os weuk
w
et:
rdi g
h
nutai mai, Krdi k
h
nutoi moi,
awans aantam wrm wdntai. ekwons gontm wrm wdntei.
ws tu wawkant: rud
h
avi! kwoi tu wewek
w
ont: Klut
h
i, owi!
rd g
h
nutai nsmi wdntb
h
jas: krd gnutoi nsmi wdntp
h
os:
nar, patis, awjam-r wrnm anr, potis, owjom-r wlnm
swab
h
i g
h
armam wastram krnauti. sep
h
ei k
wh
ermom westrom k
w
rneuti.
Awjam-ka wrn na asti. Owjom-k
w
e wln ne esti.
Tat ruwas awis aram b
h
ugat. Tot kkluwos owis agrom p
h
uget.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Proto-Celtic (ca. 1000 BC) Proto-Italic (ca. 1000 BC)
Owis ek
w
oi-k
w
e. Owis ekwoi-k
w
e.
Owis, k
w
esjo wln ne est, Owis, k
w
esjo wln ne est,
ek
w
s spekt, ekws spekt,
oinom barm woxom wxontam, oinom grwm woxom wxontem,
oinom-k
w
e megam borom, oinom-k
w
e megam orom,
oinom-k
w
e dxoniom ku berontam. oinom-k
w
e xomonem ku erontem.
Owis nu k
w
obos weuk
w
t: Owis nu kwoos weuk
w
t:
Kridi xnutor mai, Kordi axnutor mei,
ek
w
s gontom wrm wdanti. ekws gontom wrm wdentei.
Ek
w
oi tu wewk
w
nt: Kludi, owi! Ekwoi tu wewk
w
nt: Klui, owi!
kridi xnutor ansmi wdantbjos: kordi axnutor ensmi wdentos:
ner, otis, owjom-ar wlnm ner, potis, owjom-or wlnm
sebi g
w
ormom westrom k
w
arneuti. seei g
h
ormom westrom k
w
orneuti.
Owjom-k
w
e wln ne esti. Owjom-k
w
e wln ne esti.
Tod kklowos owis agrom bugt. Tud kkluwos owis agrom ugt.
Pre-Proto-Germanic (ca. 1000 BC) Proto-Balto-Slavic (ca. 1000 BC)
Awiz exwaz-x
w
e. Awis ews-ke.
Awiz, h
w
es wuln ne est, Awis, kesja wiln ne est,
ehwanz spext, ewas spet,
ainan karn wagan wganun, ainan grun waan wantun,
ainan-x
w
e mekon baran, ainan-ke mean baran,
ainan-x
w
e gmanan xu branun. ainan-ke umanan u brantun
Awiz nu xwamaz weux
w
: Awis nu ewamas wjaukt:
Hurti gnuai mei, irdi gnutei mei,
exwanz kanun wern wtan. ewans gantun wirn wduntei.
Exwaz tu wewx
w
ant: Hludi, awi! Ews tu wjaukunt: ludi, awi!
hurti knuai unsm wtunmaz: irdi gnutei insmi wdntmas:
ner, faiz, awjan-aur wulnn ner, patis, awjam-ir wilnn
sib warman wesran h
w
urneui. sebi gormom westran kirnjaut.
Awjan-x
w
e wuln ne isti. wjam-ke wiln ne esti.
at hxluwaz awiz akran buk. Ta luwas awis agram bugt.
Translation: The Sheep and the Horses. A sheep that had no wool saw horses, one pulling
a heavy wagon, one carrying a big load, and one carrying a man quickly. The sheep said to
the horses: My heart pains me, seeing a man driving horses. The horses said: Listen,
sheep, our hearts pain us when we see this: a man, the master, makes the wool of the sheep
into a warm garment for himself. And the sheep has no wool. Having heard this, the sheep
fled into the plain.
1. Introduction
53
1.7.1. NORTHERN INDO-EUROPEAN DIALECTS
I. NORTH-WEST OR EUROPES INDO-EUROPEAN
The North-West Indo-European dialect continuum, also Europes Indo-European, was
spoken in the European Subcontinent in the centuries on either side of 2500 BC,
evolving into the Pre-Celtic, Pre-Italic, Pre-Latin (probably within Pre-Italic), Pre-
Germanic, Pre-Baltic, Pre-Slavic (or Pre-Balto-Slavic) IE dialects, among others. Its
original common location is usually traced back to some place to the East of the Rhine, to
the North of the Alps and the Carpathian Mountains, to the South of Scandinavia and to
the East of the Eastern European Lowlands or Russian Plain, not beyond Moscow.
The Corded Ware complex
of cultures traditionally
represents for many scholars
the arrival of the first
speakers of Northern Dialects
in central Europe, coming
from the Yamna culture. The
complex dates from about
3200-2300 BC. The Globular
Amphorae culture may be
slightly earlier, but the
relation between these
cultures remains unclear.
NOTE. According to Adrados (1998), [o]ne has to distinguish, in this huge geographical space,
different locations. We have already talked about the situation of Germans to the West, and by
their side, Celtic, Latin and Italic speakers; Balts and Slavs to the East, the former to the North of
the later. See, among others, works by Bonfante (1983, 1984), about the old location of Baltic and
Slavic-speaking communities. Isoglosses of different chronology let us partially reconstruct the
language history. Note that the output obtained with Phonetics and Morphology match up
essentially those of Porzig, who worked with Lexica.
Europe ca. 3200-2300 BC. The Germanic homeland is
usually traced back to Jutland and southern
Scandinavia; present-day Germany was the
homeland for Celtic and Italic; the Eastern zone
corresponds to Balto-Slavic. Beekes (1995).
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Kortlandt (1989), also considers that [i]t is possible that the speakers of Italo-Celtic must be
assigned to the Globular Amphora culture, and that Germanic grew out of a later component of the
Corded Ware horizon () The Indo-Europeans who remained after the migrations became
speakers of Balto-Slavic. If the speakers of the other satem languages can be assigned to the
Yamnaya horizon and the western Indo-Europeans to the Corded Ware horizon, it is attractive to
assign the ancestors of the Balts and the Slavs to the Middle Dnieper culture [an eastern extension
of the Corded Ware culture, of northern Ukraine and Belarus]. If the origin of this culture is to be
sought in the Sredny Stog, Yamnaya and Late Tripolye cultures and this phase is followed by a
middle period where the classic Corded Ware amphorae and beakers appear (Mallory 1989: 248),
the course of events corresponds nicely with the development of a satem language which was
drawn into the western Indo-European sphere of influence. Similarly, Adrados (1980) about the
dialectal situation of Slavic (under a linguistic point of view): To a layer of archaisms, shared or
not with other languages () Slavic added different innovations, some common to Baltic. Some of
them are shared with Germanic, as the oblique cases in -m and feminine participle; others with
Indo-Iranian, so satemization, Ruki sound law (more present in Slavic than in Baltic) () Most
probably, those common characteristics come from a recent time, from secondary contacts
between IE III B [=Northern IE] (whose rearguard was formed by Balto-Slavs) and A [=Southern
IE] (in a time when Greeks were not in contact anymore, they had already migrated to Greece).
On the archaeological quest for the Urheimat, Mallory & Adams (2006) make a complete
summary of the different frameworks and models used. About the Retrospective Method, still
favoured by many linguists, it is the method where one examines those archaeological cultures
that must have been associated with different Indo-European language groups and attempts to
work backwards to the proto-culture. The unit of analysis here is the so-called archaeological
culture, a classification device employed by archaeologists to deal with similar and geographically
confined material culture and behaviour () Many of the language groups of Europe, i.e. Celtic,
Germanic, Baltic, and Slavic, may possibly be traced back to the Corded Ware horizon of northern,
central, and eastern Europe that flourished c. 3200-2300 BC. Some would say that the Iron Age
cultures of Italy might also be derived from this cultural tradition. For this reason the Corded
Ware culture is frequently discussed as a prime candidate for early Indo-European.
Italic (with Latin), Celtic and Germanic are usually classified within a common West
Indo-European nucleus. Balto-Slavic, on the other hand, is usually placed somewhere
outside that West IE core, but always in close contact with it, as a North-West Indo-
European dialect. Linguists have pointed out language contacts of Italic with Celtic,
1. Introduction
55
Celtic with Germanic, and Germanic with Balto-Slavic. Southern dialectal isoglosses
affect Balto-Slavic and Tocharian, and only partially Germanic and Latin.
NOTE 1. Celtic too shares isoglosses with Southern dialects, according to Meier-Brgger (2003):
Celtic contacts with eastern Indo-Europe are ancient. Compare the case, among others, of relative
pronouns, which in Celtic, contrarily to the Italic *k
w
o-/*k
w
i-, is represented by *Hio-, a
characteristic that it shares with Greek, Phrygian, Indo-Iranian and Slavic. Even though
classifications of early proto-languages may vary depending on different criteria, they all have a
known common origin, which is generally easier to reconstruct than their dialectal groupings. For
example, if we had only some texts of Old French, Old Spanish and Old Portuguese, Mediaeval
Italian and Modern Romanian and Catalan, then Vulgar Latin (ca. 200 AD) i.e. the features of
the common language spoken by all Romance speakers, not the older, artificial, literary Classical
Latin (ca. 100 BC) still less Old Latin (ca. 700 BC) could be easily reconstructed, but the dialectal
groups not. In fact, the actual groupings of the Romance languages are controversial, even
knowing well enough Archaic, Classic and Vulgar Latin, and the history of Romance languages.
Hence the difficulties in reconstructing and grouping individual North-West IE dialects, but the
certainty in reconstructing a common North-West or Europes Indo-European language using raw
linguistics, better explained if combined with archaeological data.
NOTE 2. On the inclusion of Pre-Latin IE within West Indo-Europe, against it there are some
archaeological and linguistic theories (see Szemernyi, Colin Renfrew; v.s. for J.P. Mallory);
Polom (1983) & Schmidt (1984) say innovations common to Celtic and Germanic (later than
those common to Celtic, Latin and Germanic), come from a time when Latin peoples had already
migrated to the Italian peninsula. On the unity of Proto-Italic and Proto-Latin, Adrados (1998):
dubious is the old unity scheme, no doubt only partial, between Latin and Osco-Umbrian, which
has been rejected by famous Italian linguists, relating every coincidence to recent contacts. I am
not so sure about that, as the common innovations are big; cf. Beeler 1966, who doesnt however
dispel the doubts. Obviously, according to the decision taken, there are different historical
consequences. If one thinks that both linguistic groups come from the North, through the Alps (cf.
Tovar 1950), from the end of the 2
nd
millennium, a previous unity can be proposed. But authors
like Devoto (1962) or Szemernyi (1962) made Latin peoples come from the East, through Apulia.
There has been a continued archaeological and (especially) linguistic support by mainstream IE
studies to the derivation of Italic (and Latin) from a West Indo-European core, even after critics to
the old Italo-Celtic concept (C. Watkins Italo-Celtic Revisited, 1963, K.H. Schmidt Latein und
Keltisch, 1986); see Porzig (1954), Dressler (1971), Tovar (1970), Pisani (1974), Lehmann (1974),
Bonfante (1983, 1984), Beekes (1995), Adrados, Bernab, Mendoza (1998), etc.; on the
archaeological question, see Ghirshman (1977), Thomas (1984), Gimbutas (1985), Harall (1995),
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Evolution of the reconstructed laryngeals of Proto-Indo-European in Europes Indo-
European include these vowel colourizations and compensatory lengthenings:
PIE *H
1
, the neutral laryngeal: *h
1
aa, *h
1
ee, *h
1
oo; *ah
1
, *eh
1
, *oh
1
.
PIE *H
2
, the a-colouring laryngeal: *h
2
aa, *h
2
ea, *h
2
oa; *ah
2
, *eh
2
.
PIE *H
3
, the o-colouring laryngeal: h
3
eo, h
3
oo; eh
3
, oh
3
.
Often, but not always, interconsonantal H a; as, *ph
2
tr patr (cf. PII pitr).
PIH *rHr, *lHl, *nHn, *mHm; also, iH, uH.
PIH *H before consonants EIE ; cf. PIE *h
1
dent-, EIE dentis (cf. PGk
odnts), tooth; PIE *h
2
str-, EIE str (cf. PGk astr), etc.
NOTE. The question is often made the other way round in IE studies, i.e. according to these
vowels reconstructed for North-West Indo-European, Proto-Greek and Proto-Indo-Iranian, which
combination of laryngeal+vowel or vowel+laryngeal could make them all fit into a common
mother-language? For clarity purposes, Common PIE is taken in this book as example for the
phonology of early dialects, but enough certainty in vocalism (for language revival purposes) is to
be found only in EIE, PGk and PII; exact regularity or congruence of a common Proto-Indo-
European phonology is neither necessary nor searched for, as there are many variations in the
laryngeal theories proposed by scholars, who reconstruct from just one (Szemernyi) to eight
(Puhvel) or nine (Adrados); a general reconstruction of three laryngeals is used here for its
simplicity and wide acceptance today. For more on this see Appendix II.3, The Laryngeal Theory.
A. GERMANIC
The Germanic languages
form one of the branches
of the Indo-European
language family. The
largest Germanic
languages are English and
German, with ca. 340 and
some 120 million native
speakers, respectively.
Other significant languages
include Low Germanic dialects (like Dutch) and the Scandinavian languages.
Spread of Germanic languages today.
1. Introduction
57
Their common ancestor is Proto-Germanic, probably still spoken in the mid-1
st
millennium B.C. in Iron Age Northern Europe, since its separation from an earlier Pre-
Proto-Germanic, a dialect of Europes Indo-European branch dated ca. 1500-500 BC.
The succession of archaeological horizons suggests that before their language
differentiated into the individual Germanic branches the Proto-Germanic speakers lived
in southern Scandinavia and along the coast from the Netherlands in the west to the
Vistula in the east around 750 BC. Early Germanic dialects enter history with the
Germanic peoples who settled in northern Europe along the borders of the Roman
Empire from the 2
nd
century.
NOTE. A few surviving
inscriptions in a runic script
from Scandinavia dated to ca.
200 are thought to represent a
later stage of Proto-Norse;
according to Bernard Comrie, it
represents a Late Common
Germanic which followed the
Proto-Germanic stage. Several
historical linguists have pointed
towards the apparent material
and social continuity connecting the cultures of the Nordic Bronze Age (1800-500 BCE) and the
Pre-Roman Iron Age (500 BCE - 1 CE) as having implications in regard to the stability and later
development of the Germanic language group. Lehmann (1977) writes Possibly the most
important conclusion based on archeological evidence with relevance for linguistic purposes is the
assumption of 'one huge cultural area' which was undisturbed for approximately a thousand years,
roughly from 1500-500 BC Such a conclusion in a stable culture permits inferences concerning
linguistic stability, which are important for an interpretation of the Germanic linguistic data.
Also, on setting the upper boundary of a comprehensive description of Proto-Germanic grammar,
Lehmann (2005) wrote: a grammar of Proto-Germanic must be a description of the language
from approximately 2500 BC to the beginning of the common era.
The earliest evidence of the Germanic branch is recorded from names in the 1
st
century
by Tacitus, and in a single instance in the 2
nd
century BC, on the Negau helmet. From
roughly the 2
nd
century AD, some speakers of early Germanic dialects developed the
Elder Futhark. Early runic inscriptions are also largely limited to personal names, and
Expansion of Germanic tribes 1200 BC - 1 AD.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
difficult to interpret. The Gothic language was written in the Gothic alphabet developed
by Bishop Ulfilas for his translation of the Bible in the 4
th
century. Later, Christian
priests and monks who spoke and read Latin in addition to their native Germanic tongue
began writing the Germanic languages with slightly modified Latin letters, but in
Scandinavia, runic alphabets remained in common use throughout the Viking Age.
The so-called Grimms law is a set of statements describing the inherited Europes
Indo-European stops as they developed in Pre-Proto-Germanic. As it is presently
formulated, Grimms Law
consists of three parts,
which must be thought of
as three consecutive
phases in the sense of a
chain shift:
PIE voiceless stops change into PGmc. voiceless fricatives: pf, t, kx, k
w
x
w
.
PIE voiced stops become PGmc. voiceless stops: bp, dt, gk, g
w
k
w
.
PIE voiced aspirated stops lose their aspiration and change into plain voiced
stops: b
h
b, d
h
d, g
h
g, g
wh
g
w
,g,w.
Verners Law addresses a category of exceptions, stating that unvoiced fricatives are
voiced when preceded by an unaccented syllable: PGmc. sz, fv, ; as, EIE b
h
ratr
PGmc. brr, brother, but EIE mtr PGmc. mr mother.
NOTE 1. W. P. Lehmann (1961) considered that Jacob Grimms First Germanic Sound Shift, or
Grimms Law and Verner's Law, which pertained mainly to consonants and were considered for a
good many decades to have generated Proto-Germanic, were Pre-Proto-Germanic, and that the
upper boundary was the fixing of the accent, or stress, on the root syllable of a word, typically the
first. Proto-Indo-European had featured a moveable pitch accent comprising an alternation of
high and low tones as well as stress of position determined by a set of rules based on the lengths
of the word's syllables.
The fixation of the stress led to sound changes in unstressed syllables. For Lehmann, the lower
boundary was the dropping of final -a or -e in unstressed syllables; for example, PIE woid- >,
Goth. wait, knows (the > and < signs in linguistics indicate a genetic descent). Antonsen (1965)
agreed with Lehmann about the upper boundary but later found runic evidence that the -a was not
Negau helmet. It reads (from right to left):
harikastiteiva\\\ip, Harigast the priest.
1. Introduction
59
dropped: Gmc. kwakraz ... wraita, I wakraz ... wrote (this). He says: We must therefore
search for a new lower boundary for Proto-Germanic.
NOTE 2. Sometimes the shift produced allophones (consonants that were pronounced
differently) depending on the context of the original. With regard to original PIE k and k
w
, Trask
(2000) says that the resulting PGmc. x and x
w
were reduced to h and h
w
in word-initial position.
Consonants were lengthened or prolonged under some circumstances, appearing in some
daughter languages as geminated graphemes. Kraehenmann (2003) states that Proto-Germanic
already had long consonants, but they contrasted with short ones only word-medially. Moreover,
they were not very frequent and occurred only intervocally almost exclusively after short vowels.
The phonemes b, d, g and g
w
, says Ringe (2006) were stops in some environments and fricatives
in others.
Effects of the aforementioned sound laws include the following examples:
pf: EIE pods foot, PGmc. fts; cf. Goth. ftus, O.N. ftr, O.E. ft, O.H.G. fuoz.
t,: EIE tritjs third, PGmc. rijaz; cf. Goth. ridja, O.N. rie, OE. ridda,
O.H.G. dritto.
kx,h: EIE kwon dog, PGmc. xunaz; cf. Goth. hunds, O.N. hundr, O.E. hund,
O.H.G. hunt.
k
w
x
w
,h
w
: EIE k
w
os what, who, Gmc. h
w
oz; cf. Goth. hwas, O.N. hverr, O.S. hwe,
O.E. hw, O.Fris. hwa, O.H.G. hwr.
bp: EIE werb throw, Gmc. werp; cf. Goth. wairpan, O.S. werpan, O.N. verpa,
O.E. weorpan, M.L.G., Du. werpen, Ger. werfen.
dt: EIE dekm ten, Gmc. tehun; cf. Goth. taihun, O.S. tehan, O.N. tiu, O.Fris.
tian, O.Du. ten, O.H.G. zehan.
gk: EIE gelu ice, Gmc. kaldaz; cf. Goth. kalds, O.N. kaldr, O.E. cald, O.H.G. kalt.
g
w
kw: EIE g
w
ws alive, Gmc. k
w
i(k)waz; cf. Goth. k
w
ius, O.N. kvikr, O.E. cwic,
O.H.G. quec.
b
h
b: EIE b
h
rtr brother, Gmc. brr; cf. Goth. brar, O.N. brir, O.E.
bror, O.H.G. bruoder.
d
h
d: EIE d
h
woris door, Gmc. duriz; cf. Goth. dar, O.N. dyrr, O.E duru, O.H.G.
turi.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
g
h
g: EIE g
h
ansis goose, Gmc. gansiz; cf. Goth gansus, O.N. gs, O.E. gs,
O.H.G. gans.
g
wh
gw/g/w: EIE g
wh
ormos warm, Gmc. warmaz; cf. O.N. varmr, O.E. wearm,
O.H.G. warm. EIE g
wh
ondos fight, Gmc. gandaz; cf. Goth. guns, O.N. gandr,
O.E. g, O.H.G. gund.
A known exception is that the
voiceless stops did not become
fricatives if they were preceded
by PIE s., i.e. sp, st, sk, sk
w
.
Similarly, PIE t did not become
a fricative if it was preceded by
p, k, or k
w
. This is sometimes
treated separately under the
Germanic spirant law.
EIE vowels: a,oa; EIE
,. PGmc. had then short i,
u, e, a, and long , , , , ?
NOTE 1. Similar mergers
happened in the Slavic languages,
but in the opposite direction. At the
time of the merge, the vowels
probably were [] and [:] before their timbres differentiated into maybe [] and [:].
NOTE 2. PGmc. and are also transcribed as
1
and
2
;
2
is uncertain as a phoneme, and only
reconstructed from a small number of words; it is posited by the comparative method because
whereas all probable instances of inherited EIE (PGmc. *
1
) are distributed in Gothic as and
the other Germanic languages as , all the Germanic languages agree on some occasions of (e.g.
PGmc. h
2
r Goth.,O.E.,O.N. hr, here). Krahe treats
2
(secondary ) as identical with . It
probably continues EIE ei or i, and it may have been in the process of transition from a
diphthong to a long simple vowel in the Proto-Germanic period. Gothic makes no orthographic
and therefore presumably no phonetic distinction between
1
and
2
. The existence of two Proto-
Germanic [e:]-like phonemes is supported by the existence of two e-like Elder Futhark runes,
Ehwaz and Eihwaz.
Germanic dialects in Europe. The line
dividesWestern from Northern dialects.
1. Introduction
61
B. LATIN
The Romance languages,
a major branch of the
Indo-European language
family, comprise all
languages that descended
from Latin, the language
of the Roman Empire.
Romance languages have
some 800 million native
speakers worldwide, mainly
in the Americas, Europe, and
Africa, as well as in many smaller regions scattered through the world. The largest
languages are Spanish and Portuguese, with about 400 and 200 million mother tongue
speakers respectively, most of them outside Europe. Within Europe, French (with 80
million) and Italian (70 million) are the largest ones. All Romance languages descend
from Vulgar Latin, the language of soldiers, settlers, and slaves of the Roman Empire,
which was substantially different from the Classical Latin of the Roman literati. Between
200 BC and 100 AD, the expansion of the Empire, coupled with administrative and
educational policies of Rome, made Vulgar Latin the dominant native language over a
wide area spanning from the Iberian Peninsula to the Western coast of the Black Sea.
During the Empires decadence and after its collapse and fragmentation in the 5
th
century, Vulgar Latin evolved independently within each
local area, and eventually diverged into dozens of
distinct languages. The oversea empires established by
Spain, Portugal and France after the 15
th
century then
spread Romance to the other continents to such
an extent that about two thirds of all Romance
speakers are now outside Europe.
Latin is usually classified, along with Faliscan, as an
Italic dialect. The Italic speakers were not native to
The Duenos (O.Lat. duenus, Lat.
buenus) Inscription in Old Latin,
ca. 6
th
century BC.
Regions where Romance languages are spoken, either
as mother tongue or as second language.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Italy, but migrated into the Italian Peninsula in the course of the 2
nd
millennium BC, and
were apparently related to the Celtic tribes that roamed over a large part of Western
Europe at the time. Archaeologically, the Apennine culture of inhumations enters the
Italian Peninsula from ca. 1350 BC, east to west; the Iron Age reaches Italy from ca. 1100
BC, with the Villanovan culture (cremating), intruding north to south. Before the Italic
arrival, Italy was populated primarily by non-Indo-European groups (perhaps including
the Etruscans). The first settlement on the Palatine hill dates to ca. 750 BC, settlements
on the Quirinal to 720 BC, both related to the Founding of Rome. As Rome extended its
political dominion over Italy, Latin became dominant over the other Italic languages,
which ceased to be spoken perhaps sometime in the 1
st
century AD.
Italic is usually divided into:
Sabellic, including:
o Oscan, spoken in south-
central Italy.
o Umbrian group:
Umbrian.
Volscian.
Aequian.
Marsian.
South Picene.
Latino-Faliscan, including:
o Faliscan, spoken in the area
around Falerii Veteres, north
of the city of Rome.
o Latin, spoken in west-central
Italy. The Roman conquests
eventually spread it
throughout the Roman
Empire and beyond.
The ancient Venetic language, as revealed by its inscriptions (including complete
sentences), was also closely related to the Italic languages and is sometimes even
Iron Age Italy, ca 800 BC. In central Italy,
Italic languages. In southern and north-
western Italy, other Indo-European languages.
Venetic, Sicanian and Sicel were possibly IE.
1. Introduction
63
classified as Italic. However, since it also shares similarities with other Western Indo-
European branches (particularly Germanic), some linguists prefer to consider it an
independent IE language.
Phonetic changes from EIE to Latin include: b
h
f/b, d
h
f/b, g
h
h/f, g
w
w/g,
k
w
kw/k, pp/kw.
The Italic languages are first attested in writing from
Umbrian and Faliscan inscriptions dating to the 7
th
century BC. The alphabets used are based on the Old
Italic alphabet, which is itself based on the Greek
alphabet. The Italic languages themselves show minor
influence from the Etruscan and somewhat more from
the Ancient Greek languages.
Oscan had much in common with Latin, though there
are also some differences, and many common word-
groups in Latin were represented by different forms;
as, Lat. uolo, uelle, uolui, and other such forms from
PIE wel-, will, were represented by words derived
from g
h
er-, desire, cf. Osc. herest, he wants, desires
as opposed to Lat. uult (id.). Lat. locus, place was
absent and represented by Osc. slaagid.
In phonology, Oscan also shows a different evolution,
as EIE k
w
Osc. p instead of Lat. kw (cf. Osc. pis, Lat.
quis); EIE g
w
Osc. b instead of Latin w; EIE medial b
h
,
d
h
Osc. f, in contrast to Lat. b or d (cf. Osc. mefiai, Lat. mediae); etc.
NOTE. A specimen of Faliscan appears written round the edge of a picture on a patera: foied
vino pipafo, cra carefo, which in Old Latin would have been hodie vinom bibabo, cras carebo,
translated as today I will drink wine; tomorrow I won't have any (R. S. Conway, Italic
Dialects). Among other distinctive features, it shows the retention of medial f which in Latin
became b, and evolution of EIE g
h
f (fo-, contrast Lat. ho-).
Hence the reconstructed changes of North-West Indo-European into Proto-Italic:
Voiced labiovelars unround or lenite: g
w
g/w, g
wh
g
h
.
Forum inscription in Latin,
written boustrophedon.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Voiced aspirates become first unvoiced, then fricativize: b
h
p
h
f; d
h
t
h
;
g
h
k
h
x.
NOTE. About PIE intervocalic g
h
Ita. x, linguists (see Joseph & Wallace 1991) generally
propose that it evolves as Faliscan g or k, while in Latin it becomes glottal h, without a change of
manner of articulation. Picard (1993) rejects that proposal citing abstract phonetic principles,
which Chela-Flores (1999) argues citing examples of Spanish phonology.
EIE s Ita. before r (cf. Ita. kererom, Lat. cerebrum); unchanged elsewhere.
Up to 8 cases are found; apart from the 6 cases of Classic Latin (i.e. N-V-A-G-D-Ab),
there was a Locative (cf. Lat. proxumae viciniae, dom, carthagini; Osc. aasai, Lat. in
r etc.) and an Instrumental (cf. Columna Rostrata Lat. pugnandod, marid, naualid,
etc; Osc. cadeis amnud, Lat. inimicitiae causae; Osc. preiuatud, Lat. prut, etc.).
About forms different from original Genitives and Datives, compare Genitive (Lapis
Satricanus:) Popliosio Valesiosio (the type in - is also very old, Segomaros -i), and Dative
(Praeneste Fibula:) numasioi, (Lucius Cornelius Scipio Epitaph:) quoiei.
C. CELTIC
The Celtic languages
are the languages
descended from Proto-
Celtic, or Common
Celtic, an Indo-
European proto-
language.
During the 1
st
millennium BC,
especially between the 5
th
and 2
nd
centuries BC they
were spoken across
Europe, from the southwest
of the Iberian Peninsula
and the North Sea, up the Rhine and down the Danube to the Black Sea and the Upper
Diachronic distribution of Celtic peoples: maximal
expansion (ca. 200 BC) and modern Celtic nations and
Celtic-speaking territories.
1. Introduction
65
Balkan Peninsula, and into Asia Minor (Galatia). Today, Celtic languages are now limited
to a few enclaves in the British Isles and on the peninsula of Brittany in France.
The distinction of Celtic into different sub-families probably occurred about 1000 BC.
The early Celts are commonly associated with the archaeological Urnfield culture, the La
Tne culture, and the Hallstatt culture.
Some scholars distinguish Continental and Insular Celtic, arguing that the differences
between the Goidelic and Brythonic languages arose after these split off from the
Continental Celtic languages. Other scholars distinguish P-Celtic from Q-Celtic, putting
most of the Continental Celtic languages in the former group except for Celtiberian,
which is Q-Celtic.
NOTE. There are two
competing schemata of
categorization. One
scheme, argued for by
Schmidt (1988) among
others, links Gaulish
with Brythonic in a P-
Celtic node, leaving
Goidelic as Q-Celtic. The
difference between P
and Q languages is the
treatment of EIE k
w
,
which became *p in the
P-Celtic languages but
*k in Goidelic. An
example is the Cel. verbal root k
w
rin- to buy, which became Welsh pryn-, but O.Ir. cren-.
The other scheme links Goidelic and Brythonic together as an Insular Celtic branch, while
Gaulish and Celtiberian are referred to as Continental Celtic. According to this theory, the P-
Celtic sound change of [k
w
] to [p] occurred independently or regionally. The proponents of the
Insular Celtic hypothesis point to other shared innovations among Insular Celtic languages,
including inflected prepositions, VSO word order, and the lenition of intervocalic [m] to [], a
nasalized voiced bilabial fricative (an extremely rare sound), etc. There is, however, no assumption
that the Continental Celtic languages descend from a common Proto-Continental Celtic ancestor.
Hallstatt core territory (ca. 800 BC) and its influence (ca. 500
BC); and La Tne culture (ca. 450) and its influence (ca. 50
BC). Some major Celtic tribes have been labeled.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Rather, the Insular/Continental schemata usually consider Celtiberian the first branch to split
from Proto-Celtic, and the remaining group would later have split into Gaulish and Insular Celtic.
Known PIE evolutions into Proto-Celtic include:
Consonants: p h in initial and intervocalic positions. Cel. sxs, txt
NOTE. EIE p was lost in Proto-Celtic, apparently going through the stages (perhaps in Lus.
porcos, v.i.) and h (perhaps attested by the toponym Hercynia if this is of Celtic origin) before
being lost completely word-initially and between vowels. EIE sp- became Old Irish s and
Brythonic f; while Schrijver (1995) argues there was an intermediate stage s- (in which
remained an independent phoneme until after Proto-Insular Celtic had diverged into Goidelic and
Brythonic), McCone (1996) finds it more economical to believe that sp- remained unchanged in
PC, that is, the change p to did not happen when s preceded.
Aspirated: d
h
d, b
h
b, g
h
x, g
wh
g
w
; but g
w
b.
Vowels: , (in final syllable); ; EIE u-w Cel. o-w.
Diphthongs: iai, iei, ioi; uau, u,uou.
Sonorants: lla, li (before stops); r ar, ri (before stops); m am; n an.
Italo-Celtic refers to the hypothesis that Italic and Celtic dialects are descended from a
common ancestor, Proto-Italo-Celtic, at a stage post-dating Proto-Indo-European. Since
both Proto-Celtic and Proto-Italic date to the early Iron Age (say, the centuries on either
side of 1000 BC), a probable time frame for the assumed period of language contact
would be the late Bronze Age, the early to mid 2
nd
millennium BC. Such grouping is
supported among others by Meillet (1890), and Kortlandt (2007).
NOTE. One argument for Italo-Celtic was the thematic Genitive in i (dominus, domini ). Both in
Italic (Popliosio Valesiosio, Lapis Satricanus) and in Celtic (Lepontic, Celtiberian -o), however,
traces of PIE gentivie -osj o have been discovered, so that the spread of the i-Genitive could have
occurred in the two groups independently, or by areal diffusion. The community of - in Italic and
Celtic may be then attributable to early contact, rather than to an original unity. The i-Genitive has
been compared to the so-called Cvi formation in Sanskrit, but that too is probably a comparatively
late development.
Other arguments include that both Celtic and Italic have collapsed the PIE Aorist and Perfect
into a single past tense, and the -subjunctive, because both Italic and Celtic have a subjunctive
descended from an earlier optative in --. Such an optative is not known from other languages, but
the suffix occurs in Balto-Slavic and Tocharian past tense formations, and possibly in Hitt. -ahh-.
1. Introduction
67
D. SLAVIC
The Slavic languages (also called Slavonic languages), a group of closely related
languages of the Slavic peoples and a subgroup of the Indo-European language family,
have speakers in most of Eastern Europe, in much of the Balkans, in parts of Central
Europe, and in the northern part of Asia. The largest languages are Russian and Polish,
with 165 and some 47 million speakers, respectively. The oldest Slavic literary language
was Old Church Slavonic, which later evolved into Church Slavonic.
There is much debate whether Pre-Proto-Slavic branched off directly from Europes
Indo-European in 2000 BC, or whether it passed through a common Proto-Balto-Slavic
stage which had necessarily split apart before 1000 BC in its two main sub-branches.
The original homeland of the
speakers of Proto-Slavic remains
controversial too. The most ancient
recognizably Slavic hydronyms (river
names) are to be found in northern
and western Ukraine and southern
Belarus. It has also been noted that
Proto-Slavic seemingly lacked a
maritime vocabulary.
The Proto-Slavic language secession
from a common Proto-Balto-Slavic is
estimated on archaeological and
glottochronological criteria to have
occurred between 1500-1000 BC.
Common Slavic is usually
reconstructible to around 600 AD.
By the 7
th
century, Common Slavic had broken apart into large dialectal zones.
Linguistic differentiation received impetus from the dispersion of the Slavic peoples over
a large territory which in Central Europe exceeded the current extent of Slavic-
speaking territories. Written documents of the 9
th
, 10
th
& 11
th
centuries already show
some local linguistic features.
Historical distribution of the Slavic
languages. The larger shaded area is the
Prague-Penkov-Kolochin complex of cultures
of the 6th to 7th centuries, likely
corresponding to the spread of Slavic-
speaking tribes of the time. The smaller
shaded area indicates the core area of Slavic
river names, dated ca. 500 AD.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
NOTE. For example the Freising monuments show a language which contains some phonetic
and lexical elements peculiar to Slovenian
dialects (e.g. rhotacism, the word krilatec).
In the second half of the ninth
century, the dialect spoken north of
Thessaloniki became the basis for the
first written Slavic language, created by
the brothers Cyril and Methodius who
translated portions of the Bible and
other church books. The language they
recorded is known as Old Church
Slavonic. Old Church Slavonic is not
identical to Proto-Slavic, having been
recorded at least two centuries after the
breakup of Proto-Slavic, and it shows
features that clearly distinguish it from
Proto-Slavic. However, it is still
reasonably close, and the mutual
intelligibility between Old Church
Slavonic and other Slavic dialects of
those days was proved by Cyrils and
Methodius mission to Great Moravia and Pannonia. There, their early South Slavic
dialect used for the translations was clearly understandable to the local population which
spoke an early West Slavic dialect.
As part of the preparation for the mission, the Glagolitic alphabet was created in 862
and the most important prayers and liturgical books, including the Aprakos Evangeliar
a Gospel Book lectionary containing only feast-day and Sunday readings , the Psalter,
and Acts of the Apostles, were translated. The language and the alphabet were taught at
the Great Moravian Academy (O.C.S. Vekomoravsk uilite) and were used for
government and religious documents and books. In 885, the use of the O.C.S. in Great
Moravia was prohibited by the Pope in favour of Latin. Students of the two apostles, who
were expelled from Great Moravia in 886, brought the Glagolitic alphabet and the Old
Page from Codex Zographensis (10
th
11
th
c. AD) in Old Church Slavonic.
1. Introduction
69
Church Slavonic language to the Bulgarian Empire, where it was taught and Cyrillic
alphabet developed in the Preslav Literary School.
Vowel changes from North-West Indo-European to Proto-Slavic:
EIE , ei Sla. i
1
; EIE i *i Sla. ; EIE u *u Sla. ; EIE Sla. y.
EIE e Sla. e; EIE Sla.
1
;
EIE en, em Sla. ; EIE an, on; am, om *an; *am Sla. .
EIE a, o *a Sla. O; EIE , * Sla. a; EIE ai, oi *ai Sla.
2
.
reduced *ai (*i/*ui) Sla. i
2
; EIE au, ou *au Sla. u.
NOTE 1. Apart from this simplified equivalences, other evolutions appear (see Kortlandts From
Proto-Indo-European to Slavic at <http://www.kortlandt.nl/publications/art066e.pdf>):
o The vowels i2, 2 developed later than i1, 1. In Late Proto-Slavic there were no differences in
pronunciation between i1 and i2 as well as between 1 and 2. They had caused, however, different
changes of preceding velars, see below.
o Late Proto-Slavic yers , < earlier i, u developed also from reduced EIE e, o respectively.
The reduction was probably a morphologic process rather than phonetic.
o We can observe similar reduction of into * (and finally y) in some endings, especially in
closed syllables.
o The development of the Sla. i2 was also a morphologic phenomenon, originating only in
some endings.
o Another source of the Proto-Slavic y is * in Germanic loanwords the borrowings took
place when Proto-Slavic no longer had in native words, as EIE had already changed into *.
o EIE a (from PIE ) disappeared without traces when in a non-initial syllable.
o EIE eu probably developed into *jau in Early Proto-Slavic (or during the Balto-Slavic
epoch), and eventually into Proto-Slavic ju.
o According to some authors, EIE long diphthongs i, i, i, u, u, u had twofold
development in Early Proto-Slavic, namely they shortened in endings into simple *ei, *ai, *oi,
*eu, *au, *ou but they lost their second element elsewhere and changed into *, *, * with
further development like above.
NOTE 2. Other vocalic changes from Proto-Slavic include *jo, *j, *jy changed into *je, *j, *ji;
*o, *, *y also changed into *e, *, *i after *c, *, *s which developed as the result of the 3
rd
palatalization; *e, * changed into *o, *a after *, *, *, * in some contexts or words; a similar
change of * into *a after *j seems to have occurred in Proto-Slavic but next it can have been
modified by analogy.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
On the origin of Proto-Slavic consonants, the following relationships are found:
EIE p Sla. p; EIE b, b
h
Sla. b.
EIE t Sla. t; EIE d, d
h
Sla. d.
EIE k, k
w
Sla. K (palatalized *k
j
Sla. s); EIE g, g
h
, g
w
, g
wh
Sla. g (palatalized
(*g
j
, *g
jh
Sla. z)
EIE s Sla. s; before a voiced consonant EIE [z] Sla. z; before a vowel when
after r, u, k, i, probably also after l Sla. x.
EIE word-final m Sla. n (<BSl. *n).
EIE m Sla. im, um; EIE n Sla. in, un; EIE l Sla. il, ul; EIE r Sla. ir, ur.
EIE w Sla. v (<BSl. *w); EIE j Sla. j.
In some words the Proto-Slavic x developed from other PIE phonemes, like kH, ks, sk.
E. BALTIC
The Baltic languages are a
group of related languages
belonging to the IE
language family, spoken in
areas extending east and
southeast of the Baltic Sea
in Northern Europe.
The language group is
often divided into two sub-
groups: Western Baltic,
containing only extinct
languages as Prussian or
Galindan, and Eastern
Baltic, containing both
extinct and the two living
languages in the group,
Lithuanian and Latvian.
1. Introduction
71
While related, Lithuanian, Latvian, and particularly Old Prussian differ substantially
from each other and are not mutually intelligible.
The oldest Baltic linguistic record is the Elbinger lexicon of the beginning of the 14
th
century AD. IT contains 802 Old Prussian equivalents of Old Middle German words. The
oldest Baltic text is Old Prussian as well; it comes from the middle of the 14
th
century AD
and includes only eleven words. The first Old Lithuanian and Old Latvian texts come
from the 16
th
century and appear already in book form, and were translations of a
catechism and the Lords Prayer.
Baltic and Slavic share so many similarities that many linguists, following the lead of
such notable Indo-Europeanists as August Schleicher and Oswald Szemernyi, take these
to indicate that the two groups separated from a common ancestor, the Proto-Balto-
Slavic language, dated ca. 1500-500 BC, depending on the different guesstimates.
NOTE 1. For those guesstimates, Classical glottochronology conducted by Czech Slavist M.
ejka in 1974 dates the Balto-Slavic split to -910340 BCE, Sergei Starostin in 1994 dates it to
1210 BCE, and recalibrated glottochronology conducted by Novotn & Blaek dates it to 1400-
1340 BCE. This agrees well with Trziniec-Komarov culture, localized from Silesia to Central
Ukraine and dated to the period 15001200 BCE.
NOTE 2. Until Meillets Dialectes indo-europens of 1908, Balto-Slavic unity was undisputed
among linguists as he notes himself at the beginning of the Le Balto-Slave chapter, Lunit
linguistique balto-slave est lune de celles que personne ne conteste. Meillets critique of Balto-
Slavic confined itself to the seven characteristics listed by Karl Brugmann in 1903, attempting to
show that no single one of these is sufficient to prove genetic unity. Szemernyi in his 1957 re-
examination of Meillets results concludes that the Balts and Slavs did, in fact, share a period of
common language and life, and were probably separated due to the incursion of Germanic tribes
along the Vistula and the Dnepr roughly at the beginning of the Common Era.
A new theory was proposed in the 1960s by V. Ivanov and V. Toporov: that the Balto-
Slavic proto-language split from the start into West Baltic, East Baltic and Proto-Slavic.
In their framework, Proto-Slavic is a peripheral and innovative Balto-Slavic dialect which
suddenly expanded, due to a conjunction of historical circumstances. Onomastic
evidence shows that Baltic languages were once spoken in much wider territory than the
one they cover today, and were later replaced by Slavic.
NOTE. The most important of these common Balto-Slavic isoglosses are:
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
o Winters law: lengthening of a short vowel before a voiced plosive, usually in a closed syllable.
o Identical reflexes of PIE syllabic sonorants, usually developing i and u before them.
Kuryowicz thought that *uR reflexes arose after PIE velars, and also notable is also older
opinion of J.Endzelns and R. Trautmann according to whom *uR reflexes are the result of
zero-grade of morphemes that had EIE o PBSl. *a in normal-grade. Matasovi (2008)
proposes following internal rules after EIE syllabic R BSl. *R: 1) **i in a final syllable;
2) **u after velars and before nasals; 3) **i otherwise.
o Hirts law: retraction of PIE accent to the preceding syllable closed by a laryngeal.
o Rise of the Balto-Slavic acute before PIE laryngeals in a closed syllable.
o Replacement of PIE genitive singular of thematic nouns with ablative.
o Formation of past tense in *- (cf. Lith. pret. dv, he gave, O.C.S. imperfect b, he was)
o Generalization of the IE neuter to- stem to the nominative singular of masculine and
feminine demonstratives instead of IE so- pronoun, so, s, tod BSl. tos, t, tod.
o Formation of definite adjectives with a construction of adjective and relative pronoun; cf.
Lith. gersis, the good, vs. gras, good; O.C.S dobrj, the good, vs. dobr, good.
Common Balto-Slavic innovations include several other prominent, but non-exclusive isoglosses,
such as the satemization, Ruki, change of PIE o BSl. *a (shared with Germanic, Indo-Iranian
and Anatolian) and the loss of labialization in PIE labiovelars (shared with Indo-Iranian,
Armenian and Tocharian). Among Balto-Slavic archaisms notable is the retention of traces of an
older PIE accent. Ruki is the term for a sound law which is followed especially in BSl. and Aryan
dialects. The name of the term comes from the sounds which cause the phonetic change, i.e. PIE s
/ r, u, K, i (it associates with a Slavic word which means hands or arms). A sibilant [s] is
retracted to [] after i,u,r, and after velars (i.e. k which may have developed from earlier k, g, g
h
).
Due to the character of the retraction, it was probably an apical sibilant (as in Spanish), rather
than the dorsal of English. The first phase (s ) seems to be universal, the later retroflexion (in
Sanskrit and probably in Proto-Slavic as well) is due to levelling of the sibilant system, and so is
the third phase - the retraction to velar [x] in Slavic and also in some Middle Indian languages,
with parallels in e.g. Spanish. This rule was first formulated for IE by Holger Pedersen.
Baltic and Slavic show a remarkable amount of correspondence in vocabulary too; there are at
least 100 words exclusive to BSl., either being a common innovation or sharing the same semantic
development from a PIE root; as, BSl. *lip, tilia Lith. lepa, O.Prus. lpa, Ltv. lipa; Sla.
*lipa; BSl. *rank, hand Lith. rank, O.Prus. rnkan, Ltv. roka; Sla. *rk (cf. O.C.S. rka).
BSl. *galw, head Lith. galv, O.Prus. galwo, Ltv. galva; Sla. *golv (cf. O.C.S. glava).
1. Introduction
73
F. FRAGMENTARY DIALECTS
MESSAPIAN
Messapian (also known as Messapic) is an extinct Indo-European language of south-
eastern Italy, once spoken in the regions of Apulia and Calabria. It was spoken by the
three Iapygian tribes of the region: the Messapians, the Daunii and the Peucetii. The
language, a centum dialect, has been preserved in about 260 inscriptions dating from the
6
th
to the 1
st
century BC. It became extinct after the Roman Empire conquered the region
and assimilated the inhabitants.
There is a hypothesis that Messapian was an Illyrian language. The Illyrian languages
were spoken mainly on the other side of the Adriatic Sea. The link between Messapian
and Illyrian is based mostly on personal names found on tomb inscriptions and on
classical references, since hardly any traces of the Illyrian language are left.
NOTE. Some phonetic characteristics of the language may be regarded as quite certain:
o PIE short *oa, as in the last syllable of the genitive kalatoras.
o PIE final *mn, as in aran.
o PIE *njnn, as in the Messapian praenomen Dazohonnes vs. the Illyrian praenomen
Dazonius; the Messapian genitive Dazohonnihi vs. Illyrian genitive Dasonii, etc.
o PIE *tjtth, as in the Messapian praenomen Dazetthes vs. Illyrian Dazetius; the Messapian
genitive Dazetthihi vs. the Illyrian genitive Dazetii; from a Dazet- stem common in Illyrian
and Messapian.
o PIE *sjss, as in Messapian Vallasso for Vallasio, a derivative from the shorter name Valla.
o The loss of final *-d, as in tepise, and probably of final *-t, as in -des, perhaps meaning set,
from PIE *d
h
e-, set, put.
o The change of voiced aspirates in Proto-Indo-European to plain voiced consonants: PIE
*d
h
d, as in Messapian anda (< PIE *en-d
h
a- < PIE *en-, in, compare Gk. entha); and PIE
*b
h
b, as in Messapian beran (< PIE *b
h
er-, to bear).
o PIE *au before (at least some) consonants: Bsta, from Bausta.
o The form penkaheh which Torp very probably identifies with the Oscan stem pompaio a
derivative of the Proto-Indo-European numeral *penk
w
e, five.
o If this last identification be correct it would show, that in Messapian (just as in Venetic and
Ligurian) the original labiovelars (*k
w
,* g
w
, *g
wh
) were retained as gutturals and not converted
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
into labials. The change of o to a is exceedingly interesting, being associated with the
northern branches of Indo-European such as Gothic, Albanian and Lithuanian, and not
appearing in any other southern dialect hitherto known. The Greek Aphrodite appears in the
form Aprodita (Dat. Sg., fem.).
o The use of double consonants which has been already pointed out in the Messapian
inscriptions has been very acutely connected by Deecke with the tradition that the same
practice was introduced at Rome by the poet Ennius who came from the Messapian town
Rudiae (Festus, p. 293 M).
VENETIC
Venetic is an Indo-European language that was spoken in ancient times in the Veneto
region of Italy, between the Po River delta and the southern fringe of the Alps. It was a
Centum dialect.
The language is attested by over 300 short inscriptions dating between the 6
th
century
BC and 1
st
century. Its speakers are identified with the ancient people called Veneti by the
Romans and Enetoi by the Greek. The inscriptions use a variety of the Northern Italic
alphabet, similar to the Old Italic alphabet. It became extinct around the 1
st
century when
the local inhabitants were assimilated into the Roman sphere.
NOTE. The exact relationship of Venetic to other Indo-European languages is still being
investigated, but the majority of scholars agree that Venetic, aside from Liburnian, was closest to
the Italic languages. Venetic may also have been related to the Illyrian languages, though the
theory that Illyrian and Venetic were closely related is debated by current scholarship.
Interesting parallels with Germanic have also been noted, especially in pronominal forms:
Ven. ego, I, acc. mego, me; Goth. ik, acc. mik; but cf. Lat. ego, acc. me.
Ven. sselboisselboi, to oneself; O.H.G. selb selbo; but cf. Lat. sibi ipsi.
Venetic had about six or even seven noun cases and four conjugations (similar to Latin). About
60 words are known, but some were borrowed from Latin (liber.tos. < libertus) or Etruscan. Many
of them show a clear Indo-European origin, such as Ven. vhraterei (< PIE *b
h
reh2terei), to the
brother.
In Venetic, PIE stops *b
h
f, *d
h
f, *g
h
h, in word-initial position (as in Latin and Osco-
Umbrian), but to *b
h
b, *d
h
d, *g
h
g, in word-internal intervocalic position, as in Latin. For
Venetic, at least the developments of *b
h
and *d
h
are clearly attested. Faliscan and Osco-Umbrian
preserve internal *b
h
f,* d
h
f, *g
h
h.
1. Introduction
75
There are also indications of the developments of PIE initial *g
w
w-, PIE *k
w
kv and PIE initial
*g
wh
f in Venetic, all of which are parallel to Latin, as well as the regressive assimilation of PIE
sequence p...k
w
... k
w
...k
w
... (e.g. penk
w
e *k
w
enk
w
e, five, perk
w
u *k
w
erk
w
u, oak), a
feature also found in Italic and Celtic (Lejeune 1974).
LIGURIAN
The Ligurian language was spoken in pre-Roman times and into the Roman era by an
ancient people of north-western Italy and south-eastern France known as the Ligures.
Very little is known about this language (mainly place names and personal names
remain) which is generally believed to have been Indo-European; it appears to have
adopted significantly from other IE languages, primarily Celtic (Gaulish) and Latin.
Strabo states As for the Alps... Many tribes (thn) occupy these mountains, all Celtic
(Keltik) except the Ligurians; but while these Ligurians belong to a different people
(hetero-ethneis), still they are similar to the Celts in their modes of life (bois).
LIBURNIAN
The Liburnian language is an extinct IE language which was spoken by the ancient
Liburnians in the region of Liburnia (south of the Istrian peninsula) in classical times. It
is usually classified as a Centum language. It appears to have been on the same Indo-
European branch as the Venetic language; indeed, the Liburnian tongue may well have
been a Venetic dialect.
NOTE. No writings in Liburnian are known, though. The grouping of Liburnian with Venetic is
based on the Liburnian onomastics. In particular, Liburnian anthroponyms show strong Venetic
affinities, with many common or similar names and a number of common roots, such as Vols-,
Volt-, and Host- (<PIE *g
h
os-ti-, stranger, guest, host). Liburnian and Venetic names also share
suffixes in common, such as -icus and -ocus.
These features set Liburnian and Venetic apart from the Illyrian onomastic province, though this
does not preclude the possibility that Venetic-Liburnian and Illyrian may have been closely
related, belonging to the same Indo-European branch. In fact, a number of linguists argue that
this is the case, based on similar phonetic features and names in common between Venetic-
Liburnian on the one hand and Illyrian on the other.
Liburnia was conquered by the Romans in 35 BC, and its language was eventually
replaced by Latin, undergoing language death probably very early in the Common Era.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
LUSITANIAN
Lusitanian or
Lusatian (so
named after the
Lusitani or
Lusitanians) was
a Paleohispanic
IE language
known by only
five inscriptions
and numerous
toponyms and
theonyms. The
language was
spoken before the
Roman conquest
of Lusitania, in
the territory
inhabited by Lusitanian tribes, from Douro to the Tagus river in the western area of the
Iberian Peninsula, where they were established already before the 6
th
c. BC.
Their language is usually considered a Pre-Celtic (possibly Italo-Celtic) IE dialect, and
it is sometimes associated with the language of the Vettones and with the linguistic
substratum of the Gallaeci and Astures, based on archaeological findings and
descriptions of ancient historians.
NOTE. The affiliation of the Lusitanian language within a Pre-Celtic (or Italo-Celtic) IE group is
still debated. There are those who endorse that it is a Celtic language, a theory largely based upon
the historical fact that the only Indo-European tribes that are known to have existed in Portugal at
that time were Celtic tribes. The apparent Celtic character of most of the lexicon anthroponyms
and toponyms may also support a Celtic affiliation. There is a substantial problem in the Celtic
theory however: the preservation of PIE initial *p-, as in Lusitanian pater or porcom, meaning
father and pig, respectively. The Celtic languages had lost that initial *p- in their evolution;
compare Lat. pater, Gaul. ater, and Lat. porcum, O.Ir. orc. However, it does not necessarily
Iberian Peninsula
ca. 200 BC.
1. Introduction
77
preclude the possibility of Lusitanian being Celtic, because of the supposed evolution of PIE initial
*p * *h Cel. , so it might have been an early Proto-Celtic (or Italo-Celtic) dialect that split
off before the loss of p-, or when p- had become * - (before shifting to h- and then being lost); the
letter p of the Latin alphabet could have been used to represent either sound.
F. Villar and R. Pedrero relate Lusitanian with the Italic languages. The theory is based on
parallels in the names of deities, as Lat. Consus, Lus. Cossue, Lat. Seia, Lus. Segia, or Marrucinian
Iovia, Lus. Iovea(i), etc. and other lexical items, as Umb. gomia, Lus. comaiam, with some other
grammatical elements.
II. NORTHERN INDO-EUROPEAN IN ASIA: TOCHARIAN
Tocharian or Tokharian is one of the most obscure branches of the group of Indo-
European languages. The name of the language is taken from people known to the Greek
historians (Ptolemy VI, 11, 6) as the Tocharians (Greek , Tokharoi).
NOTE. These are sometimes identified with the Yuezhi and the Kushans, while the term
Tokharistan usually refers to 1
st
millennium Bactria. A Turkic text refers to the Turfanian language
(Tocharian A) as twqry. F. W. K. Mller has associated this with the name of the Bactrian
Tokharoi. In Tocharian, the language is referred to as arish-kna and the Tocharians as arya.
Tocharian consisted of two languages; Tocharian A (Turfanian, Arsi, or East Tocharian)
and Tocharian B (Kuchean or West Tocharian). These languages were spoken roughly
from the 6
th
to 9
th
century centuries; before they became extinct, their speakers were
absorbed into the expanding Uyghur tribes. Both languages were once spoken in the
Tarim Basin in Central Asia, now the Xinjiang Autonomous Region of China.
Note. Properly speaking, based on the tentative interpretation of twqry as related to Tokharoi,
only Tocharian A may be referred to as Tocharian, while Tocharian B could be called Kuchean (its
native name may have been kuie), but since their grammars are usually treated together in
scholarly works, the terms A and B have proven useful.
Tocharian is documented in manuscript fragments, mostly from the 8
th
century (with a
few earlier ones) that were written on palm leaves, wooden tablets and Chinese paper,
preserved by the extremely dry climate of the Tarim Basin. Samples of the language have
been discovered at sites in Kucha and Karasahr, including many mural inscriptions.
Tocharian A and B were not intercomprehensible. The common Proto-Tocharian
language must have preceded the attested languages by several centuries, probably
dating to the 1
st
millennium BC.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
1.7.2. SOUTHERN INDO-EUROPEAN DIALECTS
I. GREEK
Greek is an Indo-European branch with a documented history of 3,500 years. Today,
Modern Greek is spoken by 15 million people in Greece, Cyprus, the former Yugoslavia
(especially in the FYROM), Bulgaria, Albania and Turkey.
The major dialect groups of the Ancient Greek period can be assumed to have
developed not later than 1120 BC, at the time of the Dorian invasions, and their first
appearances as precise alphabetic writing began in the 8th century BC. The ancient
Greeks themselves considered there to be three major divisions of the Greek people, into
Dorians, Aeolians, and Ionians (including Athenians), each with their own defining and
distinctive dialects. Allowing for their oversight of Arcadian, an obscure mountain
dialect, and Cyprian, far from the center of Greek scholarship, this division of people and
Ancient Greek dialects by 400 BC
after R.D. Woodard (2008).
1. Introduction
79
language is quite similar to the results of modern archaeological and linguistic
investigation.
Greek has been spoken in the Balkan Peninsula
since 2000 BC. The earliest evidence of this is
found in the Linear B tablets dating from 1500 BC.
The later Greek alphabet is unrelated to Linear B,
and was derived from the Phoenician alphabet;
with minor modifications, it is still used today.
Mycenaean is the most ancient attested form of
the Greek branch, spoken on mainland Greece and
on Crete in the 16
th
to 11
th
centuries BC, before the
Dorian invasion. It is preserved in inscriptions in
Linear B, a script invented on Crete before the 14
th
century BC. Most instances of these inscriptions are
on clay tablets found in Knossos and in Pylos. The
language is named after Mycenae, the first of the palaces to be excavated.
The tablets remained long undeciphered, and every conceivable language was
suggested for them, until Michael Ventris deciphered the script in 1952 and proved the
language to be an early form of Greek. The texts on the tablets are mostly lists and
inventories. No prose narrative survives, much less myth or poetry. Still, much may be
glimpsed from these records about the people who produced them, and about the
Mycenaean period at the eve of the so-called Greek Dark Ages.
Unlike later varieties of Greek, Mycenaean probably had seven grammatical cases, the
nominative, the genitive, the accusative, the dative, the instrumental, the locative, and
the vocative. The instrumental and the locative however gradually fell out of use.
NOTE. For the Locative in *-ei, compare di-da-ka-re, didaskalei, e-pi-ko-e, Epikhei, etc (in
Greek there are syntactic compounds like puloi-gens, born in Pylos); also, for remains of an
Ablative case in *-d, compare (months names) ka-ra-e-ri-jo-me-no, wo-de-wi-jo-me-no, etc.
Proto-Greek, a southern PIE dialect, was spoken in the late 3
rd
millennium BC, roughly
at the same time as North-West Indo-European and Proto-Indo-Iranian, most probably
in the Balkans. It was probably the ancestor of Phrygian too, and possibly that of Ancient
Linear B has roughly 200 signs,
divided into syllabic signs with
phonetic values and logograms
with semantic values.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Macedonian, Dacian, Thracian, and arguably Armenian. The unity of Proto-Greek
probably ended as Hellenic migrants, speaking the predecessor of the Mycenaean
language, entered the Greek paeninsula around the 21st century BC. They were then
separated from the Dorian Greeks, who entered the peninsula roughly one millennium
later, speaking a dialect that in some respects had remained more archaic.
NOTE. For Pelasgian and other Greek substrates as IE, some have cited different phonological
developments in words like (tumbos < PIE *d
h
mb
h
os) or (purgos < PIE *b
h
rg
h
os).
Proto-Greek was affected by a late Satemization trend, evidenced by the (post-
Mycenaean) change of labiovelars into dentals before e (e.g. k
w
e te and).
The primary sound changes from (laryngeal) PIE to Proto-Greek include:
Aspiration of PIE intervocalic *s PGk h.
NOTE. The loss of PIE prevocalic *s- was not completed entirely, famously evidenced by sus
sow, dasus dense; sun with, sometimes considered contaminated with PIE *kom (cf. Latin
cum) to Homeric / Old Attic ksun, is possibly a consequence of Gk. psi-substrate (See Villar).
De-voicing of voiced aspirates: *b
h
p
h
, *d
h
t
h
, *g
h
k
h
, *g
wh
k
wh
.
Dissimilation of aspirates (Grassmanns law), possibly post-Mycenaean.
PIE word-initial *j- (not *Hj-) is strengthened to PGk dj- (later Gk. -).
Vocalization of laryngeals between vowels and initially before consonants, i.e. *h
1
e,
*h
2
a, *h
3
o.
NOTE. The evolution of Proto-Greek should be considered with the background of an early
Palaeo-Balkan Sprachbund that makes it difficult to delineate exact boundaries between
individual languages. The characteristically Greek representation of word-initial laryngeals by
prosthetic vowels is shared by the Armenian language, which also shares other phonological and
morphological peculiarities of Greek, vide infra.
The sequence CRHC (where C = consonant, R = resonant, H = laryngeal) becomes
PIE CRh
1
C PGk CRC; PIE CRh
2
C PGk CRC; PIE CRh
3
C PGk CRC.
The sequence PIE CRHV (where V = vowel) becomes PGk CaRV.
NOTE. It has also been proposed by Sihler (2000) that Vk
w
uk
w
; cf. PIE *nok
w
ts, night PGk
nuk
w
ts Gk. nuks/nuxt-; cf. also *k
w
ek
w
los, circle PGk k
w
uk
w
los Gk. kuklos; etc.
Later sound changes between Proto-Greek and the attested Mycenaean include:
1. Introduction
81
o Loss of final stop consonants; final mn.
o Syllabic mam, and nan, before resonants; otherwise both were nasalized
m/na.
o loss of s in consonant clusters, with supplementary lengthening, e.g. esmimi.
o creation of secondary s from clusters, ntjansa. Assibilation tisi only in
southern dialects.
o Mycenaean i-vocalism and replacement of double-consonance -kw- for -k
w
k
w
-.
NOTE. On the problematic case of common Greek (
h
ippos), horse, derived from PIE and
PGk ekwos, Meier-Brgger (2003): the i-vocalism of which is best understood as an inheritance
from the Mycenaean period. At that time, e in a particular phonetic situation must have been
pronounced in a more closed manner, cf. di-pa i.e. dipas neuter lidded container fror drinking vs.
the later (since Homer): Risch (1981), O. Panagl (1989). That the i-form extended to the
entire Greek region may be explained in that the word, very central during Mycenaean rule of the
entire region (2
nd
millennium BC), spread and suppressed the e-form that had certainly been
present at one time. On the -pp-: The original double-consonance -ku- was likely replaced by -
k
w
k
w
- in the pre-Mycenaean period, and again, in turn by -pp- after the disappearance of the
labiovelars. Suggestions of an ancient -k
w
k
w
- are already given by the Mycenaean form as i-qo (a
possible *i-ko-wo does not appear) and the noted double-consonance in alphabetic Greek. The
aspiration of the word at the beginning remains a riddle.
Other features common to the earliest Greek dialects include:
The PIE dative, instrumental and locative cases were syncretized into a single dative.
Dialectal nominative plural in -oi, -ai fully replaces Late PIE common *-s, *-s.
The superlative on -tatos (<PIE *-t-to-s) becomes productive.
The peculiar oblique stem gunaik- women, attested from the Thebes tablets is
probably Proto-Greek; it appears, at least as gunai- also in Armenian.
The pronouns houtos, ekeinos and autos are created. Use of ho, h, ton as articles is
post-Mycenaean.
The first person middle verbal desinences -mai, -mn replace -ai, -a. The third
singular pherei is an analogical innovation, replacing the expected PIE *b
h
reti, i.e.
Dor. *phereti, Ion. *pheresi.
The future tense is created, including a future passive, as well as an aorist passive.
The suffix -ka- is attached to some perfects and aorists.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Infinitives in -ehen, -enai and -men are also common to Greek dialects.
II. ARMENIAN
Armenian is an Indo-European
language spoken in the Armenian
Republic , as well as in the region
of Nagorno-Karabakh, and also
used by ethnic Armenians in the
Diaspora.
Armenian has been traditionally
regarded as a close relative of
Phrygian, apparently closely
related to Greek, sharing major
isoglosses with it. The Graeco-
Armenian hypothesis proposed a
close relationship to the Greek
language, putting both in the
larger context of Paleo-Balkans languages notably including Phrygian, which is widely
accepted as an Indo-European language particularly close to Greek, and sometimes
Ancient Macedonian , consistent with Herodotus recording of the Armenians as
descending from colonists of the Phrygians.
NOTE. That traditional linguistic theory, proposed by Pedersen (1924), establishes a close
relationship between both original communities, Greek and Armenian, departing from a common
subdialect of IE IIIa (Southern Dialect of Late PIE). That vision, accepted for a long time, was
rejected by Clackson (1994) in The linguistic relationship between Armenian and Greek, which,
supporting the Graeco-Aryan linguistic hypothesis, dismisses that the coincidences between
Armenian and Greek represent more than those found in the comparison between any other IE
language pair. Those findings are supported by Kortlandt in Armeniaca (2003), in which he
proposes an old Central IE continuum Daco-Albanian / Graeco-Phrygian / Thraco-Armenian.
Adrados (1998), considers an older Southern continuum Graeco-[Daco-]Thraco-Phrygian /
Armenian / Indo-Iranian. Olteanu (2009) proposes a Graeco-Daco-Thracian language.
The earliest testimony of the Armenian language dates to the 5
th
century AD, the Bible
translation of Mesrob Mashtots. The earlier history of the language is unclear and the
Distribution of ethnic Armenians in the 20
th
c.
1. Introduction
83
subject of much speculation. It is clear that Armenian is an Indo-European language, but
its development is opaque.
NOTE. Proto-Armenian sound-laws are
varied and eccentric, such as IE *dw- yielding
Arm. k-, and in many cases still uncertain. In
fact, that phonetic development is usually
seen as *dw- to erk-, based on PIE numeral
*dwo-, two, a reconstruction Kortlandt
(ibidem) dismisses, exposing alternative
etymologies for the usual examples.
PIE voiceless stops are aspirated in
Proto-Armenian, a circumstance that
gave rise to the Glottalic theory, which
postulates that this aspiration may have
been sub-phonematic already in Proto-
Indo-European. In certain contexts, these
aspirated stops are further reduced to w,
h or zero in Armenian so e.g. PIE *pots,
into Arm. otn, Gk. pous, foot; PIE *treis, Arm. erek, Gk. treis, three.
The reconstruction of Proto-Armenian being very uncertain, there is no general
consensus on the date range when it might have been alive. If Herodotus is correct in
deriving Armenians from Phrygian stock, the Armenian-Phrygian split would probably
date to between roughly the 12
th
and 7
th
centuries BC, but the individual sound-laws
leading to Proto-Armenian may have occurred at any time preceding the 5
th
century AD.
The various layers of Persian and Greek loanwords were likely acquired over the course
of centuries, during Urartian (pre-6
th
century BC) Achaemenid (6
th
to 4
th
c. BC; Old
Persian), Hellenistic (4
th
to 2
nd
c. BC Koine Greek) and Parthian (2
nd
c. BC to 3
rd
c. AD;
Middle Persian) times.
Grammatically, early forms of Armenian had much in common with classical Greek and
Latin, but the modern language (like Modern Greek) has undergone many
transformations. Interestingly enough, it shares with Italic dialects the secondary IE
suffix *-tjn, extended from *-ti-, cf. Arm (tyoun).
Armenian manuscript, ca. 5
th
-6
th
c.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
III. INDO-IRANIAN
The Indo-Iranian or Aryan language group constitutes the easternmost extant branch
of the Indo-European family of languages. It consists of two main language groups, Indo-
Aryan and Iranian, and probably Nuristani; Dardic is usually classified within the Indic
subgroup.
The contemporary Indo-Iranian languages form therefore the second largest sub-
branch of Indo-European (after North-West Indo-European), with more than one billion
speakers in total, stretching from Europe (Romani) and the Caucasus (Ossetian) to East
India (Bengali and Assamese). The largest in terms of native speakers are Hindustani
(Hindi and Urdu, ca. 540 million), Bengali (ca. 200 million), Punjabi (ca. 100 million),
Marathi and Persian (ca. 70 million each), Gujarati (ca. 45 million), Pashto (40 million),
Oriya (ca. 30 million), Kurdish and Sindhi (ca. 20 million each).
Proto-Indo-Iranians are commonly identified with the bearers of the Andronovo
culture and their homeland with an area of the Eurasian steppe that borders the Ural
River on the west, the Tian Shan on the east where the Indo-Iranians took over the
area occupied by the earlier Afanasevo culture , and Transoxiana and the Hindu Kush
Map of the
Sintashta-
Petrovka
culture
(red), its
expansion
into the
Andronovo
culture
during the
2
nd
millennium
BC, showing
the overlap
with the
BMAC in the
south. The
location of
the earliest
chariots is
shown in
purple.
1. Introduction
85
on the south. Historical linguists broadly estimate that a continuum of Indo-Iranian
languages probably began to diverge by 2000 BC, preceding both the Vedic and Iranian
cultures. A Two-wave model of Indo-Iranian expansion have been proposed (see Burrow
1973 and Parpola 1999), strongly associated with the chariot.
Aryans spread into the Caucasus, the Iranian plateau, and South Asia, as well as into
Mesopotamia and Syria, introducing the horse and chariot culture to this part of the
world. Sumerian texts from EDIIIb Ngirsu (2500-2350 BC) already mention the chariot'
(gigir) and Ur III texts (2150-2000 BC) mention the horse (anshe-zi-zi). They left
linguistic remains in a Hittite horse-training manual written by one Kikkuli the
Mitannian. Other evidence is found in references to the names of Mitanni rulers and the
gods they swore by in treaties; these remains are found in the archives of the Mitanni's
neighbors, and the time period for this is about 1500 BC.
The standard model for the entry of the Indo-European languages into South Asia is
that the First Wave went over the Hindu Kush, either into the headwaters of the Indus
and later the Ganges. The earliest stratum of Vedic Sanskrit, preserved only in the
Rigveda, is assigned to roughly 1500 BC. From the Indus, the Indo-Aryan languages
spread from ca. 1500 BC to ca. 500 BC, over the northern and central parts of the
subcontinent, sparing the extreme south. The Indo-Aryans in these areas established
several powerful kingdoms and principalities in the region, from eastern Afghanistan to
the doorstep of Bengal.
The Second Wave is interpreted as the Iranian wave. The Iranians would take over all of
Central Asia, Iran, and for a considerable period, dominate the European steppe (the
modern Ukraine) and intrude north into Russia and west into central and eastern Europe
well into historic times and as late as the Common Era. The first Iranians to reach the
Black Sea may have been the Cimmerians in the 8th century BC, although their linguistic
affiliation is uncertain. They were followed by the Scythians, who are considered a
western branch of the Central Asian Sakas, and the Sarmatian tribes.
The Medes, Parthians and Persians begin to appear on the Persian plateau from ca. 800
BC, and the Achaemenids replaced Elamite rule from 559 BC. Around the first
millennium of the Common Era, the Iranian Pashtuns and Baloch began to settle on the
eastern edge of the Iranian plateau, on the mountainous frontier of northwestern
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
Pakistan in what is now the North-West Frontier Province and Balochistan, displacing
the earlier Indo-Aryans from the area.
The main changes separating Proto-Indo-Iranian from Late PIE include:
Early Satemization trend:
o Loss of PIE labiovelars into PII plain velars: *k
w
k , *g
w
g, *g
wh
g
h
.
o Palatalization of PII velars in certain phonetic environments: *k, *g, *g
h
h
.
Loss of laryngeals: *HVa, *VH. Interconsonantal *H i, cf. *ph
2
tr PII pitr.
NOTE. A common exception is the Brugmanns law. For those linguists who consider the
laryngeal loss to have occurred already in Late PIE, Aryan vocalism is described as a collapse of
PIE ablauting vowels into a single PII vowel; i.e. *e,*oa; *,*.
Grassmanns law, Bartholomaes law, and the Ruki sound law were complete in PII.
NOTE. For a detailed description of those Indo-Iranian sound laws and the satemization
process, see Appendix II. For Ruki sound law, v.s. Baltic in 1.7.1.
Sonorants are generally stable in PII, but for the confusion *l/*r, which in the oldest
Rigveda and in Avestan gives a general PIE *l PII r, as well as lr.
Among the sound changes from Proto-Indo-Iranian to Indo-Aryan is the loss of the
voiced sibilant *z; among those to Iranian is the de-aspiration of PIE voiced aspirates.
A. IRANIAN
Current distribution of
Iranian dialects.
1. Introduction
87
The Iranian languages are a branch of the Indo-Iranian subfamily, with an estimated
150-200 million native speakers today, the largest being Persian (ca. 60 million),
Kurdish (ca. 25 million), Pashto (ca. 25 million) and Balochi (ca. 7 million).
Proto-Iranian dates to some time after the Proto-Indo-Iranian breakup, or the early
second millennium BC, as the Old Iranian languages began to break off and evolve
separately as the various Iranian tribes migrated and settled in vast areas of southeastern
Europe, the Iranian plateau, and Central Asia. The oldest Iranian language known,
Avestan, is mainly attested through the Avesta, a collection of sacred texts connected to
the Zoroastrian religion.
Linguistically, the Old Iranian languages are divided into two major families, the
Eastern and Western group, and several subclasses. The so-called Eastern group includes
Scythian, even though the Scyths lived in a region extending further west than the
Western group. The northwestern branch included Median, and Parthian, while the
southwestern branch included Old Persian.
B. INDO-ARYAN
The Indo-Aryan or
Indic languages are a
branch of the Indo-
Iranian subfamily with
a total number of
native speakers of more
than 900 million. The
largest languages in
terms of native
speakers are
Hindustani (about 540
million), Bangali
(about 200 million),
Punjabi (about 100
million), Marathi
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
(about 90 million), Gujarati (about 45 million), Nepali (about 40 million), Oriya (about
30 million), Sindhi (about 20 million) and Assamese (about 14 million).
The earliest evidence of the group is from Vedic Sanskrit, the language used in the
ancient preserved texts of the Indian subcontinent, the foundational canon of Hinduism
known as the Vedas. The Indo-Aryan superstrate in Mitanni is of similar age as the
Rigveda, but the only evidence is a number of loanwords.
In the 4
th
c. BC, the Sanskrit language was codified and standardised by the
grammarian Panini, called Classical Sanskrit by convention. Outside the learned sphere
of Sanskrit, vernacular dialects (Prakrits) continued to evolve and, in medieval times,
diversified into various Middle Indic dialects.
C. NURISTANI
The recent view is to classify Nuristani as an independent branch of the Indo-Iranian
language family, instead of the the Indic or Iranian group. In any event, it would seem
they arrived in their present homeland at a very early date, and never entered the
western Punjab of Pakistan.
1.7.3. OTHER INDO-EUROPEAN DIALECTS OF EUROPE
I. ALBANIAN
Albanian is spoken by over 8
million people primarily in
Albania, Kosovo, and the
Former Yugoslav Republic of
Macedonia, but also by
smaller numbers of ethnic
Albanians in other parts of the
Balkans, along the eastern
coast of Italy and in Sicily, as
well other emigrant groups.
The Albanian language has
no living close relatives among the
Albanian dialects Gheg, Tosk. Communities of
Arbresh- and Arvanitika-speakers
1. Introduction
89
modern IE languages. There is no consensus over its origin and dialectal classification,
although some scholars derive it from Illyrian, and others claim that it derives from
Thracian.
While it is considered established that the Albanians originated in the Balkans, the
exact location from which they spread out is hard to pinpoint. Despite varied claims, the
Albanians probably came from farther north and inland than would suggest the present
borders of Albania, with a homeland concentrated in the mountains.
NOTE. Given the overwhelming amount of shepherding and mountaineering vocabulary as well
as the extensive influence of Latin, it is more likely the Albanians come from north of the Jireek
line, on the Latin-speaking side, perhaps in part from the late Roman province of Dardania from
the western Balkans. However, archaeology has more convincingly pointed to the early Byzantine
province of Praevitana (modern northern Albania) which shows an area where a primarily
shepherding, transhumance population of Illyrians retained their culture.
The period in which Proto-Albanian and Latin interacted was protracted and drawn out
over six centuries, 1
st
c. AD to 6
th
or 7
th
c. AD. This is born out into roughly three layers of
borrowings, the largest number belonging to the second layer. The first, with the fewest
borrowings, was a time of less important interaction. The final period, probably
preceding the Slavic or Germanic invasions, also has a notably smaller amount of
borrowings. Each layer is characterized by a different treatment of most vowels, the first
layer having several that follow the evolution of Early Proto-Albanian into Albanian; later
layers reflect vowel changes endemic to Late Latin and presumably Proto-Romance.
Other formative changes include the syncretism of several noun case endings, especially
in the plural, as well as a large scale palatalization.
A brief period followed, between 7
th
c. AD and 9
th
c. AD, that was marked by heavy
borrowings from Southern Slavic, some of which predate the oa shift common to the
modern forms of this language group. Starting in the latter 9
th
c. AD, a period followed of
protracted contact with the Proto-Romanians, or Vlachs, though lexical borrowing seems
to have been mostly one sided from Albanian into Romanian. Such a borrowing
indicates that the Romanians migrated from an area where the majority was Slavic (i.e.
Middle Bulgarian) to an area with a majority of Albanian speakers, i.e. Dardania, where
Vlachs are recorded in the 10
th
c. AD. This fact places the Albanians at a rather early date
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
in the Western or Central Balkans, most likely in the region of Kosovo and Northern
Albania.
References to the existence of Albanian as a distinct language survive from 14
th
c. AD,
but without recording any specific words. The oldest surviving documents written in
Albanian are the Formula e Pagzimit (Baptismal formula), Unte paghesont premenit
tAtit e tBirit e tSpirit Senit, I baptize thee in the name of the Father, and the Son, and
the Holy Spirit, recorded by Pal Engjelli, Bishop of Durres in 1462 in the Gheg dialect,
and some New Testament verses from that period.
II. PALEO-BALKAN LANGUAGES
A. PHRYGIAN
The Phrygian language was the IE
language spoken by the Phrygians, a
people that settled in Asia Minor
during the Bronze Age. It survived
probably into the 6
th
century AD,
when it was replaced by Greek
Ancient historians and myths
sometimes did associate Phrygian
with Thracian and maybe even
Armenian, on grounds of classical
sources. Herodotus recorded the Macedonian account that Phrygians migrated into Asia
Minor from Thrace (7.73). Later in the text (7.73), Herodotus states that the Armenians
were colonists of the Phrygians, still considered the same in the time of Xerxes I. The
earliest mention of Phrygian in Greek sources, in the Homeric Hymn to Aphrodite,
depicts it as different from Trojan: in the hymn, Aphrodite, disguising herself as a mortal
to seduce the Trojan prince Anchises, tells him:
Otreus of famous name is my father, if so be you have heard of him, and he reigns
over all Phrygia rich in fortresses. But I know your speech well beside my own, for a
Trojan nurse brought me up at home. Of Trojan, unfortunately, nothing is known.
Phrygian region and expanded Kingdom.
1. Introduction
91
Phrygian is attested by two corpora, one,
Palaeo-Phrygian, from around 800 BC and later,
and another after a period of several centuries,
Neo-Phrygian, from around the beginning of the
Common Era. The Palaeo-Phrygian corpus is
further divided geographically into inscriptions
of Midas-city, Gordion, Central, Bithynia, Pteria,
Tyana, Daskyleion, Bayindir, and various
(documents divers). The Mysian inscriptions
show a language classified as a separate Phrygian
dialect, written in an alphabet with an additional
letter, the Mysian s. We can reconstruct some
words with the help of some inscriptions written
with a script similar to the Greek one.
Ancient historians and myths sometimes did
associate Phrygian with Thracian and maybe
even Armenian, on grounds of classical sources.
Herodotus recorded the Macedonian account that
Phrygians migrated into Asia Minor from Thrace (7.73). Later in the text (7.73),
Herodotus states that the Armenians were colonists of the Phrygians, still considered the
same in the time of Xerxes I. The earliest mention of Phrygian in Greek sources, in the
Homeric Hymn to Aphrodite, depicts it as different from Trojan: in the hymn,
Aphrodite, disguising herself as a mortal to seduce the Trojan prince Anchises, tells him
Otreus of famous name is my father, if so be you have heard of him, and he reigns
over all Phrygia rich in fortresses. But I know your speech well beside my own, for a
Trojan nurse brought me up at home. Of Trojan, unfortunately, nothing is known.
Its structure, what can be recovered from it, was typically Indo-European, with nouns
declined for case (at least four), gender (three) and number (singular and plural), while
the verbs are conjugated for tense, voice, mood, person and number.
Phrygian seems to exhibit an augment, like Greek and Armenian, as in Phryg. eberet,
probably corresponding to PIE *-b
h
er-e-t (cf. Gk. epheret).
Phrygian inscription in
Midas City.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
A sizable body of Phrygian words are theoretically known; however, the meaning and
etymologies and even correct forms of many Phrygian words (mostly extracted from
inscriptions) are still being debated.
Phrygian words with possible PIE origin and Graeco-Armenian cognates include:
Phryg. bekos, bread, from PIE *b
h
eh
3
g-; cf. Gk. p
h
g, to roast.
Phryg. bedu, water, from PIE *wed-; cf. Arm. get, river.
Phryg. anar, husband, man, PIE *h
2
ner-, man; cf. Gk. aner-, man, husband.
Phryg. belte, swamp, from PIE root *b
h
el-, to gleam; cf. Gk. baltos, swamp.
Phryg. brater, brother, from PIE *b
h
reh
2
ter-; cf. Gk. p
h
rter-.
Phryg. ad-daket, does, causes, from PIE stem *d
h
-k-; cf. Gk. et
h
ka.
Phryg. germe, warm, from PIE *g
wh
er-mo-; cf. Gk. t
h
ermos.
Phryg. gdan, earth, from PIE *d
h
g
h
om-; cf. Gk. k
h
t
h
n.
NOTE. For more information on similarities between Greek and Phrygian, see
Neumann Phrygisch und Griechisch (1988).
B. ILLYRIAN
The Illyrian languages are a
group of Indo-European
languages that were spoken in
the western part of the
Balkans in former times by
ethnic groups identified as
Illyrians: Delmatae, Pannoni,
Illyrioi, Autariates, Taulanti.
The main source of
authoritative information
about the Illyrian language
consists of a handful of
Illyrian words cited in classical sources, and numerous examples of Illyrian
anthroponyms, ethnonyms, toponyms and hydronyms. Some sound-changes and other
Roman provinces in the Balkans, 2
nd
century AD: A.
Spalatum (Split); 1. Raetia; 2. Noricum; 3. Pannonia;
4. Illyricum; 5. Dacia; 6. Moesia; 7. Tracia.
1. Introduction
93
language features are deduced from what remains of the Illyrian languages, but because
no writings in Illyrian are known, there is not sufficient evidence to clarify its place
within the Indo-European language family aside from its probable Centum nature.
NOTE. A grouping of Illyrian with the Messapian language has been proposed for about a
century, but remains an unproven hypothesis. The theory is based on classical sources,
archaeology, as well as onomastic considerations. Messapian material culture bears a number of
similarities to Illyrian material culture. Some Messapian anthroponyms have close Illyrian
equivalents. A relation to the Venetic language and Liburnian language, once spoken in
northeastern Italy and Liburnia respectively, is also proposed. A grouping of Illyrian with the
Thracian and Dacian language in a Thraco-Illyrian group or branch, an idea popular in the first
half of the 20
th
century, is now generally rejected due to a lack of sustaining evidence, and due to
what may be evidence to the contrary. Also, the hypothesis that the modern Albanian language is a
surviving Illyrian language remains very controversial among linguists.
B. THRACIAN
Excluding Dacian, whose status as a Thracian language is disputed, Thracian was
spoken in what is now southern Bulgaria, parts of Serbia, the Republic of Macedonia,
Northern Greece especially prior to Ancient Macedonian expansion , throughout
Thrace (including European Turkey) and in parts of Bithynia (North-Western Asiatic
Turkey). Most of the Thracians were eventually Hellenized (in the province of Thrace) or
Romanized (in Moesia, Dacia, etc.), with the last remnants surviving in remote areas
until the 5
th
century AD.
NOTE. As an extinct language with only a few short inscriptions attributed to it (v.i.), there is
little known about the Thracian language, but a number of features are agreed upon. A number of
probable Thracian words are found in inscriptions most of them written with Greek script on
buildings, coins, and other artifacts. Some Greek lexical elements may derive from Thracian, such
as balios, dappled (< PIE *b
h
el-, to shine, Pokorny also cites Illyrian as possible source),
bounos, hill, mound, etc.
C. DACIAN
The Dacian language was an Indo-European language spoken by the ancient people of
Dacia. It is often considered to have been either a northern variant of the Thracian
language, or closely related to it.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
There are almost no written documents in Dacian. Dacian used to be one of the major
languages of South-Eastern Europe, stretching from what is now Eastern Hungary to the
Black Sea shore. Based on archaeological findings, the origins of the Dacian culture are
believed to be in Moldavia, being identified as an evolution of the Iron Age Basarabi
culture.
It is unclear
exactly when the
Dacian language
became extinct,
or even whether
it has a living
descendant. The
initial Roman
conquest of part
of Dacia did not
put an end to the
language, as free
Dacian tribes
such as the Carpi may have continued to speak Dacian in Moldavia and adjacent regions
as late as the 6
th
or 7
th
century AD, still capable of leaving some influences in the forming
of Slavic languages.
According to the hypothesis of Hasdeu (1901), a branch of Dacian continued as the
Albanian language. A refined version of that hypothesis considers Albanian to be a Daco-
Moesian Dialect that split off before 300 BC, and that Dacian became extinct.
NOTE. The arguments for this early split before 300 BC include:
o Inherited Albanian words (e.g. PIE *mtr Alb. motr) shows the evolution PIE * Alb. o,
but all the Latin loans in Albanian having an (<PIE *) shows Lat. Alb. a. Therefore, the
transformation happened and ended before the Roman arrival in the Balkans.
o Romanian substratum words shared with Albanian show a Rom. a that corresponds to Alb. o
when the source for both sounds is an original common (cf. mazre/modhull<*mdzula,
Theoretical scenario: the
Albanians as a migrant
Dacian people
1. Introduction
95
pea; ra/ros<*rtja: duck); therefore, when these words had the same common form in
Pre-Romanian and Proto-Albanian the transformation PIE * Alb. o had not started yet.
The correlation between these two facts could indicate that the split between Pre-Romanian (the
Dacians later Romanized) and Proto-Albanian happened before the Roman arrival in the Balkans.
E. PAIONIAN
The Paionian language is the poorly attested language of the ancient Paionians, whose
kingdom once stretched north of Macedon into Dardania and in earlier times into
southwestern Thrace.
Classical sources usually considered the Paionians distinct from Thracians or Illyrians,
comprising their own ethnicity and language. Athenaeus seemingly connected the
Paionian tongue to the Mysian language, itself barely attested. If correct, this could mean
that Paionian was an Anatolian language. On the other hand, the Paionians were
sometimes regarded as descendants of Phrygians, which may put Paionian on the same
linguistic branch as the Phrygian language.
NOTE. Modern linguists are uncertain on the classification of Paionian, due to the extreme
scarcity of materials we have on this language. However, it seems that Paionian was an
independent IE dialect. It shows a/o distinction and does not appear to have undergone
Satemization. The Indo-European voiced aspirates became plain voiced consonants, i.e. *b
h
b,
*d
h
d, *g
h
g, *g
wh
g
w
; as in Illyrian, Thracian, Macedonian and Phrygian (but unlike Greek).
F. ANCIENT MACEDONIAN
The Ancient Macedonian language was the tongue of the Ancient Macedonians. It was
spoken in Macedon during the 1
st
millennium BC. Marginalized from the 5
th
century BC,
it was gradually replaced by the common Greek dialect of the Hellenistic Era. It was
probably spoken predominantly in the inland regions away from the coast. It is as yet
undetermined whether the language was a dialect of Greek, a sibling language to Greek,
or an Indo-European language which is a close cousin to Greek and also related to
Thracian and Phrygian languages.
Knowledge of the language is very limited because there are no surviving texts that are
indisputably written in the language, though a body of authentic Macedonian words has
been assembled from ancient sources, mainly from coin inscriptions, and from the 5
th
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
century lexicon of Hesychius of Alexandria, amounting to about 150 words and 200
proper names. Most of these are confidently identifiable as Greek, but some of them are
not easily reconciled with standard Greek phonology. The 6,000 surviving Macedonian
inscriptions are in the Greek Attic dialect.
The Pella curse tablet, a text written in a distinct Doric Greek idiom, found in Pella in
1986, dated to between mid to early 4
th
century BC, has been forwarded as an argument
that the Ancient Macedonian language was a dialect of North-Western Greek. Before the
discovery it was proposed that the Macedonian dialect was an early form of Greek,
spoken alongside Doric proper at that time.
NOTE. Olivier Masson thinks that in contrast with earlier views which made of it an Aeolic
dialect (O.Hoffmann compared Thessalian) we must by now think of a link with North-West Greek
(Locrian, Aetolian, Phocidian, Epirote). This view is supported by the recent discovery at Pella of a
curse tablet which may well be the first Macedonian text attested (...); the text includes an adverb
opoka which is not Thessalian. Also, James L. ONeil states that the curse tablet from Pella
shows word forms which are clearly Doric, but a different form of Doric from any of the west
Greek dialects of areas adjoining Macedon. Three other, very brief, fourth century inscriptions are
also indubitably Doric. These show that a Doric dialect was spoken in Macedon, as we would
expect from the West Greek forms of Greek names found in Macedon. And yet later Macedonian
inscriptions are in Koine avoiding both Doric forms and the Macedonian voicing of consonants.
The native Macedonian dialect had become unsuitable for written documents.
From the few words that survive, a notable sound-law may be ascertained, that PIE
voiced aspirates *d
h
, *b
h
, *g
h
, appear as (=d[
h
]), (=b[
h
]), (=g[
h
]), in contrast to
Greek dialects, which unvoiced them to (=t
h
), (=p
h
), (=k
h
).
NOTE. Since these languages are all known via the Greek alphabet, which has no signs for voiced
aspirates, it is unclear whether de-aspiration had really taken place, or whether the supposed
voiced stops , , were just picked as the closest matches to express voiced aspirates PIE *b
h
, *d
h
,
The Pella katadesmos, is a katadesmos (a curse, or magic spell) inscribed on a lead
scroll, probably dating to between 380 and 350 BC. It was found in Pella in 1986
1. Introduction
97
*g
h
. As to Macedonian , , = Greek , , , Claude Brixhe[ (1996) suggests that it may have been
a later development: The letters may already have designated not voiced stops, i.e. [b, d, g], but
voiced fricatives, i.e. [, , ], due to a voicing of the voiceless fricatives [, , x] (= Classical Attic
[p
h
, t
h
, k
h
]). Brian Joseph (2001) sums up that The slender evidence is open to different
interpretations, so that no definitive answer is really possible, but cautions that most likely,
Ancient Macedonian was not simply an Ancient Greek dialect on a par with Attic or Aeolic. In this
sense, some authors also call it a deviant Greek dialect.
PIE *d
h
enh
2
-, to leave, A.Mac. (ans), death; cf. Attic (t
h
natos).
PIE *h
2
aid
h
- A.Mac.* (araia), bright weather, Attic (ait
h
ra).
PIE *b
h
asko- A.Mac. (skioi), fasces. Compare also for A.Mac.
(arotes) or (arowes), Attic (op
h
rs), eyebrows; for Mac.
(erenk), Attic (P
h
erenk), bearing victory.
o According to Herodotus (ca. 440 BC), the Macedonians claimed that the Phryges
were called Brygoi (<PIE *b
h
rugo-) before they migrated from Thrace to Anatolia
ca. 1200 BC.
o In Aristophanes The Birds, the form (kelpyris), red-cap bird, shows
a voiced stop instead of a standard Greek unvoiced aspirate, i.e. Macedonian
() (keal) vs. Greek (kep
h
al), head.
If A.Mac. (otn), pig, is related to PIE *g
w
ou-, cow, this would indicate that
the labiovelars were either intact (hence *g
w
otn), or merged with the velars, unlike
the usual Gk. (bos).
NOTE. Such deviations, however, are not unknown within Greek dialects; compare Dor. -
(glep-) for common Gk. - (blep-), as well as Dor. (glchn) and Ion.
(glchn) for Gk. (blchn).
Examples suggest that voiced velar stops were devoiced, especially word-initially: PIE
*genu- A.Mac. (knadoi), jaws; PIE *gomb
h
- A.Mac.
(kmbous), molars.
o Compared to Greek words, there is A.Mac. (arkn) vs. Attic (args);
the Macedonian toponym Akesamenai, from the Pierian name Akesamenos if
Akesa- is cognate to Greek agassomai, agamai, to astonish; cf. also the Thracian
name Agassamenos.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
1.7.4. ANATOLIAN LANGUAGES
The Anatolian languages are a group
of extinct Indo-European languages,
which were spoken in Anatolia for
millennia, the best attested of them
being the Hittite language.
The Anatolian branch is generally
considered the earliest to split off the
Proto-Indo-European language, from
a stage referred to either as Middle
PIE or Proto-Indo-Hittite (PIH),
typically a date ca. 4500-3500 BC is
assumed. Within a Kurgan
framework, there are two possibilities
of how early Anatolian speakers could
have reached Anatolia: from the
north via the Caucasus, and from the
west, via the Balkans.
NOTE. The term Indo-Hittite is somewhat imprecise, as the prefix Indo- does not refer to the
Indo-Aryan branch in particular, but is iconic for Indo-European (as in Indo-Uralic), and the -
Hittite part refers to the Anatolian language family as a whole.
Attested dialects of the Anatolian branch are:
Hittite (nesili), attested from ca. 1800 BC to 1100 BC, official language of the Hittite
Empire.
Luwian (luwili), close relative of Hittite spoken in Arzawa, to the southwest of the
core Hittite area.
Palaic, spoken in north-central Anatolia, extinct around the 13
th
century BC, known
only fragmentarily from quoted prayers in Hittite texts.
Lycian, spoken in Lycia in the Iron Age, most likely a descendant of Luwian, extinct in
ca. the 1
st
century BC. A fragmentary language, it is also a likely candidate for the
language spoken by Trojans.
The approximate extent of the Hittite Old
Kingdom under Hantili I (ca. 1590 BC) in
dar kest. Maximal extent of the Hittite Empire
ca. 1300 BC is shown in dar k col or , the
Egyptian sphere of influence in l i ght col or .
1. Introduction
99
Lydian, spoken in Lydia, extinct in ca. the 1
st
century BC, fragmentary.
Carian, spoken in Caria, fragmentarily attested from graffiti by Carian mercenaries in
Egypt from ca. the 7
th
century BC, extinct ca. in the 3
rd
century BC.
Pisidian and Sidetic (Pamphylian),
fragmentary.
Milyan, known from a single inscription.
There were likely other languages of the
Anatolian branch that have left no written
records, such as the languages of Mysia,
Cappadocia and Paphlagonia.
Anatolia was heavily Hellenized following the
conquests of Alexander the Great, and it is
generally thought that by the 1
st
century BC the
native languages of the area were extinct.
Hittite proper is known from cuneiform tablets
and inscriptions erected by the Hittite kings and
written in an adapted form of Old Assyrian
cuneiform orthography. Owing to the predominantly syllabic nature of the script, it is
difficult to ascertain the precise phonetic qualities of some Hittite sounds.
NOTE. The script known as Hieroglyphic Hittite has now been shown to have been used for
writing the closely related Luwian language, rather than Hittite proper. The later languages Lycian
and Lydian are also attested in Hittite territory.
The Hittite language has traditionally been stratified partly on linguistic and partly on
paleographic grounds into Old Hittite, Middle Hittite and New or Neo-Hittite,
corresponding to the Old, Middle and New Kingdoms of the Hittite Empire, ca. 1750-
1500 BC, 1500-1430 BC and 1430-1180 BC, respectively.
Luwian was spoken by population groups in Arzawa, to the west or southwest of the
core Hittite area. In the oldest texts, e.g. the Hittite Code, the Luwian-speaking areas
including Arzawa and Kizzuwatna were called Luwia. From this homeland, Luwian
speakers gradually spread through Anatolia and became a contributing factor to the
Hittite pictographic writing
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
downfall, after circa 1180
BC, of the Hittite Empire,
where it was already widely
spoken. Luwian was also
the language spoken in the
Neo-Hittite states of Syria,
such as Milid and
Carchemish, as well as in
the central Anatolian
kingdom of Tabal that
flourished around 900 BC.
Luwian has been preserved
in two forms, named after
the writing systems used: Cuneiform Luwian and Hieroglyphic Luwian.
For the most part, the immediate ancestor of the known Anatolian languages, Common
Anatolian (the Late Proto-Anatolian dialect spoken ca. 2500) has been reconstructed on
the basis of Hittite. However, the usage of Hittite cuneiform writing system limits the
enterprise of understanding and reconstructing Anatolian phonology, partly due to the
deficiency of the adopted Akkadian cuneiform syllabary to represent Hittite sounds, and
partly due to the Hittite scribal practices.
NOTE 1. This especially pertains to what appears to be confusion of voiceless and voiced dental
stops, where signs -dV- and -tV- are employed interchangeably different attestations of the same
word. Furthermore, in the syllables of the structure VC only the signs with voiceless stops are
generally used. Distribution of spellings with single and geminated consonants in the oldest extant
monuments indicates that the reflexes of PIE voiceless stops were spelled as double consonants
and the reflexes of Proto-Indo-European voiced stops as single consonants. This regularity is the
most consistent in in the case of dental stops in older texts; later monuments often show irregular
variation of this rule.
NOTE 2. For a defence of Etruscan as an IE language, classified within the Anatolian branch, see
Adrados (2005) at <http://emerita.revistas.csic.es/index.php/emerita/article/viewArticle/52>.
Known changes from Middle PIE into Common Anatolian include:
Voiced aspirates merged with voiced stops: *d
h
d, *b
h
b, *g
h
g.
Luwian use according to inscriptions found
1. Introduction
101
Voiceless stops become voiced after accented long-vowel or diphthong: PIH *wk-
CA wg-(cf. Hitt. wk-, ask for); PIH *d
h
eh
1
ti, putting CA ddi (cf. Luw.
taac- votive offering).
Conditioned allophone PIH *tj- CA tsj-, as Hittite still shows.
PIH *h
1
is lost in CA, but for *eh
1
, appearing as Hitt., Pal. , Luw., Lyc., Lyd. ;
word-initial *h
2
x, non-initial *h
2
h; *h
3
h.
NOTE 1. Melchert proposes that CA x (voiceless fricative) is lenited to h (voiced fricative)
under the same conditions as voiceless stops. Also, word-initial *h3 is assumed by some scholars to
have been lost already in CA.
NOTE 2. There is an important assimilation of laryngeals within CA: a sequence -VRHV-
becomes -VRRV-; cf. PIH *sperh1V- Hitt. isparr-, kick flat; PIH *sun-h3-V- Hitt. sunna-,
fill, Pal. sunnuttil-, outpouring; etc.
PIH sonorants are generally stable in CA. Only word-initial *r has been
eliminated. Word-initial *je- shows a trend to become CA e-, but the trend is not
complete in CA, as Hittite shows.
Diphthong evolved as PIH *ei CA long ; PIH *eu CA . PIE *oi, *ai, *ou, *au,
appear also in CA.
NOTE. Common Anatolian preserves PIE vowel system basically intact. Some cite the merger of
PIH *o and (controversial) *a as a Common Anatolian innovation, but according to Melchert that
merger was secondary shared innovation in Hittite, Palaic and Luwian, but not in Lycian. Also, the
lengthening of accented short vowels in open syllables cannot be of Common Anatolian, and
neither can lengthening in accented closed syllables.
The CA nominal system shows an archaic productive declension in *-i, *-u. There
are only two grammatical genders, animate and inanimate.
Hittite verbs are inflected according to two general verbal classes, the mi- and the
hi-conjugation.
NOTE. Rose (2006) lists 132 hi-verbs and interprets the hi/mi oppositions as vestiges of a
system of grammatical voice, i.e. centripetal voice vs. centrifugal voice. Additionally, the Hittite
verbal system displays two voices (active and mediopassive), two moods (indicative and
imperative), and two tenses (present and preterite), two infinitive forms, one verbal substantive, a
supine, and a participle.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
1.8. MODERN INDO-EUROPEAN
1.8.1. Modern Indo-European (MIE) is therefore a set of grammatical rules including
its writing system, noun declension, verbal conjugation and syntax , designed to
systematize the reconstructed Proto-Indo-European dialect North-West Indo-European
described (v.s.) as the last IE dialect continuum (spoken in Europe for some centuries
within the time frame 3000-2000 BC) to adapt it to modern communication needs.
Because such PIE dialects were spoken by prehistoric societies, no genuine sample texts
are available, and thus comparative linguistics in spite of its 200 years history is not
(and will not be) in the position to reconstruct exactly their formal languages (the one
used by learned people at the time), but only approximately how the spoken, vulgar
languages were like, i.e. the proto-languages that later evolved into the different attested
Indo-European dialects and languages.
NOTE. Reconstructed languages like Modern Hebrew, Modern Cornish, Modern Coptic, Modern
Prussian or Modern Indo-European may be revived in their communities without being as easy, as
logical, as neutral or as philosophical as the million artificial languages that exist today, and
whose main aim is to be supposedly better, or easier, or more neutral than other artificial or
natural languages they want to substitute. Whatever the sociological, psychological, political or
practical reasons behind the success of such difficult and non-neutral natural languages instead
of universal ones, what is certain is that if somebody learns Hebrew, Cornish, Coptic, Prussian or
Indo-European (or Latin, Gothic, Greek, Sanskrit, etc.), whatever the changes in the morphology,
syntax or vocabulary that could follow (because of, say, better or purer or easier language
systems recommended by their language regulators), the language learnt will still be the same, and
the effort made wont be lost in any possible case. That cannot be said of personal inventions.
1.8.2. We deemed it worth it to use the Proto-Indo-European reconstruction for the
revival of a complete modern language system, because of the obvious need for a
common language within the EU, to substitute the current deficient linguistic policy.
This language system, called European or European language (eurpjm), is mainly
based on the features of the European or Northwestern IE dialects, whose speakers as
we have already seen remained in close contact for some centuries after the first Late
PIE migrations, and have influenced each other in the last millennia within Europe.
NOTE. As Indo-Europeanist F. Lpez-Menchero (2008) puts it, there are three (Late) Proto-
Indo-European languages which might be distinguished today:
1. Introduction
103
1) The actual Proto-Indo-European language and its early dialects, spoken by prehistoric peoples
of Eurasia in the Neolithic and Bronze Age, some millennia ago;
2) the reconstructed Late Proto-Indo-European language system, which has been studied by IE
scholars using the linguistic, archaeological and historical data available, and which is (and will
remain) imperfect by nature, based on more or less certain hypotheses and schools of thought; and
3) the modern Indo-European language systems (European, Hellenic, Aryan) which, being based
on the later, and trying to come near to the former, are neither one nor the other, but modern
languages systematized to be used in the modern world.
NOTE 2. In that sense, some critics have considered the so-called Indo-European language
revival to be different from (and thus not comparable to) other language revivals, like as they
put it Hebrew or Cornish, because of the obvious differences that will exist between that ancient
North-West Indo-European language and the Modern Indo-European or European language. It is
important to note that, even though there is a general belief that Modern Hebrew and Ancient
Hebrew are the same languages, among Israeli scholars there have been continued calls for the
Modern Hebrew language to be called Israeli Hebrew or just (preferably) Israeli, due to the
strong divergences that exist and further develop with its use between the modern language
spoken in Israel and its theoretical basis, the Ancient Hebrew of the Tanakh, its contents (and
language variations) having being compiled probably between 450-200 BC, i.e when the language
was being substituted by Aramaic. On that interesting question, Prof. Ghilad Zuckermann
considers that Israelis are brainwashed to believe they speak the same language as the prophet
Isaiah, a purely Semitic language, but this is false. It's time we acknowledge that Israeli is very
different from the Hebrew of the past. He points out to the abiding influence of modern Indo-
European dialects especially Yiddish, Russian and Polish , in vocabulary, syntax and phonetics,
as imported by Israel's founders. The same could certainly be said of Cornish and other language
revivals, and even of some death languages with a continued use, like the Modern Latin language
used by the Catholic Church, which is not comparable to the Classical Latin used by Cicero, not to
talk about the real, Vulgar Latin used by the different peoples who lived in the Roman Empire.
1.8.3. Late Proto-Indo-European features that are common to early PIE dialects
(mainly North-West IE, Proto-Greek and Proto-Indo-Iranian), like nominal and verbal
inflection, morphology and syntax, make it possible for PIE to be proposed as
Dachsprache for an Indo-European International Auxiliary Language project.
Obviously, French, German, Spanish, Hindustani, Chinese, and other natural and
artificial languages proposed to substitute English dominance, are only supported by
their cultural or social communities, whereas IE native speakers make up the majority of
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
the worlds population, being thus the most democratic choice for a language spoken
within international organizations and between the different existing nations.
NOTE 1. Because Modern Indo-European (a revived North-West IE proto-language) has other
sister dialects that were spoken by coeval prehistoric communities, languages like Modern
Hellenic (a revived Proto-Greek) and Modern Aryan (a revived Proto-Indo-Iranian) can also be
used in the regions where they are currently spoken in the form of their surviving dialects, as those
proto-languages were not much more different from North-West IE than Swedish from Danish, or
Spanish from Portuguese. They might also serve as linguae francae for closely related languages
or neighbouring regions, i.e. Aryan for Asia, Hellenic for and Armenian-speaking territories.
Anatolianism (Turkish Anadoluculuk) asserts that Turks descend from the indigenous
population of ancient Anatolia, based on historical and genetic views. Supported by Turkish
intellectuals in the 20
th
century, it became essential to the process of nation-building in Turkey,
but was substituted by the Pan-Turkic nationalism Mustafa Kemal Atatrk had discouraged before
his death. If accepted again, Turks could embrace their historical culture by adopting Modern
Anatolian (a revived Common Anatolian, cousin dialect of EIE, PGk. and PII) as a modern
second language for Turkey, which shares close historical and cultural ties with Europe and Asia.
NOTE 2. The terms Ausbausprache-Abstandsprache-Dachsprache were coined by Heinz Kloss
(1967), and they are designed to capture the idea that there are two separate and largely
independent sets of criteria and arguments for calling a variety an independent language rather
than a dialect: the one based on its social functions, and the other based on its objective
structural properties. A variety is called an ausbau language if it is used autonomously with
respect to other related languages. This typically means that it has its own standardized form
independent of neighbouring standard languages, like (in this hypothetical future) Modern Indo-
European in Europe and the Americas, Modern Aryan in Asia. This often involves being taught in
schools, and being used as a written language in a wide variety of functions, possibly including that
of an official national language. In contrast, varieties that are abstand languages are those that are
only spoken and typically only used in private contexts.
Dachsprache means a language form that serves as standard language for different dialects, even
though these dialects may be so different that mutual intelligibility is not possible on the basilectal
level between all dialects, particularly those separated by significant geographical distance. So e.g.
the Rumantsch Grischun developed by Heinrich Schmid (1982) as such a Dachsprache for a
number of quite different Romansh language forms spoken in parts of Switzerland; or the Euskara
Batua, Standard Basque, and the Southern Quechua literary standard, both developed as
standard languages for dialect continua that had historically been thought of as discrete languages
1. Introduction
105
with many dialects and no official" dialect. Standard German and standard Italian to some extent
function in the same way. Perhaps the most widely used Dachsprache is Modern Standard Arabic,
which links together the speakers of many different, often mutually unintelligible Arabic dialects.
Hence a Standard Indo-European, which might take rules from Late Proto-Indo-European
reconstruction and the Modern Indo-European rules presented here, would be the wide
Dachsprache necessary to encompass (i.e. to serve as linguistic umbrella for) the modern revival of
early PIE dialects.
NOTE 3. Our proposal is different from the Hebrew language revival, but we think that:
a) The reconstruction of a common Late PIE (laryngeal?) phonology, nominal or verbal
inflection system results at best mainly in abstract formulae or vague approximations following
the dissertation of Mallory & Adams (v.s. 1.1.8) ; they are very useful for a Standard Indo-
European Dachsprache, but the reconstruction unfortunately does not have enough certainty to be
used for a common, modern revived language. Reconstructions of early PIE dialects, on the other
hand, result in approximations with strong statistical confidence, offering a practical system for
common West European, Greek and Indo-Iranian phonetics and inflection system, but they lack
enough data on their oldest morphology, syntax and vocabulary, which were obscured by later
innovations. Therefore, reconstructions of Late PIE and early PIE dialects complement each other.
b) Where Zionism had only some formal writings, with limited vocabulary, of an ancient
language already dead centuries before their latest sacred texts were compiled (ca. 200 BC), and
their people expelled from Israel (in 70 AD), Pro-Europeanism and Indo-Europeanism have PIE
and its early dialects (EIE, PGk and PII) with a continued history of use in Eurasia and hundreds
of living dialects, and other very old dead dialects attested, so that their modern revival can be
considered less artificial. Thus, even if Europeans had tablets dating from 2000 BC in some
dialectal predominant formal EIE language (say, from Pre-Germanic or Pre-Celtic IE), the current
North-West Indo-European reconstruction should probably still be used as the main source for
Indo-European language revival in the European Union. Just taking a look at Mycenaean
inscriptions and its difficult phonetic decipherment is enough to realize how little EIE
reconstruction would change if writings were found.
c) The common culture and religion was probably the basis for the Hebrew language revival in
Israel. Proto-Indo-European, whilst the mother tongue of some prehistoric tribe with an own
culture and religion, spread into different peoples, with different cultures and religions. There was
never a concept of Indo-European community after the migrations. However, Indo-European
languages are spoken today by the majority of the population in the world and especially within
Europe , and its early dialects spread into two main communities, EIE and PGk in Europe, PII in
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
South Asia. It is therefore possible to speak them as natural, cultural and national common
languages, what may be a significant advantage of IE as IAL over any other natural language.
Also, blessed Anne Catherine Emmerich
(1774-1824), a Roman Catholic Augustinian
nun, stigmatic, mystic, visionary and ecstatic,
had revelations about the Confusion of Tongues
(Genesis 11:1-9): Upon Heber who, as we have
said, took no part in the work, God cast His
eyes; and amid the general disorder and
corruption, He set him and his posterity apart
as a holy nation. God gave him also a new and
holy language possessed by no other nation,
that thereby his race should be cut off from
communication with all others. This language
was the pure Hebrew, or Chaldaic [=NW
Semitic]. The first tongue, the mother tongue,
spoken by Adam, Sem, and Noe, was different, and it is now extant only in isolated dialects. Its
first pure offshoots are the Zend, the sacred tongue of India, and the language of the Bactrians
[=early Indo-Iranian dialects]. In those languages, words may be found exactly similar to the
Low German of my native place [=any modern EIE regional dialect]. The book that I see in
modern Ctesiphon [=Greek colony, site of church councils of the Church of the East], on the
Tigris, is written in that language [=a PGk dialect]. Her visions receive particular veneration
from Traditional Catholics, and this one refers clearly to the three isolated dialects (PII, EIE and
PGk) derived directly from the confounded divine tongue which some have since identified as
the common Proto-Indo-European language spoken until ca. 3000-2500 BC according to
Biblical chronology and archaeological findings of the great temple towers (ziggurats) of ancient
Sumer. For the Catholic Catechism, the Genesis uses figurative language, but affirms a primeval
event, see <http://www.catholic.com/library/Adam_Eve_and_Evolution.asp>.
1.8.4. Modern Indo-European words to complete the lexicon of North-West Indo-
European, in case that no common PIE form is found, are to be loan-translated from
present-day North-West IE languages. Common loan words from sister dialects can also
be loan-translated or taken as (proto-language) loan words.
NOTE 1. Even though the vocabulary reconstructible for early PIE languages is indeed wider
than the common Proto-Indo-European lexicon, a remark of Mallory & Adams (2006) regarding
Gustave Dor's Confusion of Tongues
1. Introduction
107
reconstructible Common PIE words is interesting: Yet we know that our reconstructed lexicon
falls far short of the full language, e.g. we can reconstruct eye and eyebrow but not eyelash. We
can most easily gain an impression of what may be missing when we consider modern ethno-
botanical studies. In Proto-Indo-European we can offer about thirty-two plant names and an
additional twenty-six tree names. In contrast, Brent Berlin examined the languages of ten
traditional farming societies and found that the average number of botanical taxa reported in each
language was 520. If we were to treat such comparisons at face value this would suggest that we
are recovering only about 11 per cent of the probable botanical lexicon known to the Proto-Indo-
Europeans. Or compare, for example, the fact that we can reconstruct only a few terms relating to
the horse in Proto-Indo-European; in English this semantic field includes horse, pony, nag, steed,
prancer, dobbin, charger, courser, colt, foal, fielly, gelding, hack, jade, crock, plug, and many more
terms, including the many specific terms describing the colour of the horse, e.g. bay, chestnut,
sorrel, pinto. There is no reason to suspect that PIE did not behave similarly.
NOTE 2. For examples of loan translations from modern EIE languages, cf. from Latin
aquaeduct (Lat. aquaeductus MIE aqsduktos) or universe (Lat. uniuersus<*oin(i)-uors-o-
<*oino-wt-to- MIE oinwstos turned into one); from English, like software (from Gmc.
samu-, war MIE somtwor); from French, like ambassador (from Cel. amb(i)actos MIE
ambhagtos public servant); or chamber (from O.Lat. camera, from PGk. kamr, vault
MIE kamar); from Russian, like bolshevik (MIE belijwikos); etc.
Modern loan words from sister or cousin IE dialects can be either loan-translated or directly
taken as loan words, depending on the nature of the individual words:
o Loan words should be taken directly in MIE from forms which are found only in one proto-
language or restricted to southern dialects; as e.g. Gk. photo, which should be taken directly
as loan word phwotos, from PGk phawots, gen. phawots, as Gk. (<), , in
compound phawotogphj, photography, derived from IE root bh-, shine, which could
be loan-translated as MIE *bhwotos, from *bhawotogbhj , but without having a
meaning for extended bha-wes-, still less for bha-wot-, in North-West Indo-European or
even Proto-Indo-European, as it is only found in Ancient Greek dialects. Compare also MIE
skhol, from Lat. schola, taken from Gk. (<PGk. skhol), spare time, leisure,
tranquility, borrowed from Greek with the meaning school, which was in O.Gk.
(scholeon), translated as PGk. skholehjom (<PIE *-esjo-m), from IE root segh-, which could
also be loan-translated as MIE *sghol or even more purely (and artificially) *sgholesj om,
none of them being Proto-Indo-European or common Indo-European terms. Examples from
Indo-Iranian include waskranas, bazaar, from O.Ira. vahacarana, sale-traffic,
bazaar, which could also be translated as proper MIE *wesqlenos, from PIE roots wes-
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
and qel-; or katrangam, chess, from Skr. caturagam (which entered Europe from Pers.
shatranj) a bahuvrihi compound, meaning having four limbs or parts, which in epic poetry
often means army, possibly shortened from katurangabalam, Skr. caturagabalam, lit.
four-member force, an army comprising of four parts, could be loan-translated as MIE
*qatrangom and *qatur angobel om, from roots qetw-, ang- and bel-.
o Loan words and loan translations might also coexist in specialized terms; as, from PIE
*h1rud
h
rs, red, PGk eruthrs, in common loan eruthrkutos, erythrocyte, proper MIE
rudhrs, in rudhr (senos) ktj, red (blood) cell; cf. also MIE ms, muss, mouse,
muscle, PGk ms, muhs, in loan muhokutos, myocyte, for muskosjo ktj, muscle cell.
1.8.5. The adjective eurpjs, m. European, comes from the Greek noun Eurp.
NOTE. Gk. Eurp is from unknown origin, even though it was linked with Homers epithet for
Zeus euruop, from *hur-oqeh2 far-seeing, broad, or *h1ur-woqeh2 far-sounding (Heath,
2005). Latinate adj. europaeus, which was borrowed by most European languages, comes from
Gk. adj. eurpaos, in turn from PGk eurpai-js < PIE *eurpeh2-js MIE eurp-js. For the
evolution PIH *-eh2jo- PGk *-aijo-, cf. adjective formation in Gk. agor-agoraos, Ruigh (1967).
The name of the European language system is eurpjm, inanimate, because of the
oldest IE dialects, those which had an independent name for languages used the neuter.
NOTE. Compare Gk. n.pl. (hellnik), Skr. n.sg. (sasktam), O.H.G. diutisc,
O.Prus. prsiskan, etc.; cf. also in Tacitus Lat. ucbulum latnum.
In most IE languages, the language is also referred to as language defined by an
adjective, whose gender follows the general rule of concordance; as in MIE f. eurpj
dghw, European language.
NOTE. Cf. Lat. latna lingua, Gk. , O.H.G. diutiska sprhha (Ger. Deutsche
Sprache), O.Prus. prsiskai bil, O.C.S. (slovnskyi jzyk), etc.
1.8.6. Because the term Indo-European is common today to refer to the reconstructed
language, we decided to use that traditional name to describe the Proto-European
language, as a way to familiarize the reader with the European language system as a
natural, dead language, and to distinguish it clearly from other language inventions.
NOTE. However, when speaking in European, sindhueurpjm, Indo-European,
prmosindhueurpjm, Proto-Indo-European, Eurps sindhueurpjm, Europes
Indo-European, should refer to the theoretical linguistic concepts, to the ancient reconstructed
dialects, while eurpjm, European, should be preferred for the modern language, just like
Israeli is probably the most suited name to refer to Modern Hebrew.
2. LETTERS AND SOUNDS
2.1 THE ALPHABETS OF MODERN INDO-EUROPEAN
2.1.1. Indo-European does not have an old writing system to be revived with. In the
regions where PIE speakers dwelled four thousand years ago, caves and stones probably
still keep some ancient pictographic writings, composed of logograms (graphemes) that
represent a morpheme or a whole word, as did Egyptian hieroglyphic logographs.
2.1.2. The Indo-European dialects have adopted different alphabets during the last
millennia, and all of them should be usable today although the main alphabet for
todays European Union is clearly the Latin one. This is a summary table of Proto-Indo-
European phonemes and their regular corresponding letters in MIE alphabets: Greek,
Latin, Cyrillic, Perso-Arabic and (alphasyllabary) Devangar.
A. VOWELS AND VOCALIC ALLOPHONES
Phoneme Greek Latin Persian Armenian Cyrillic Devan.
[a] A a
[e] E e
E e
[o] O o
[u]
[e]
[o]
[i] I i
[i]
[u] U u
[o]
[r] R r
()
[l] L l
()
[m] M m
[n] N n
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
NOTE. The underdot diacritic might be used to mark the sonorants, as , , , ; usually,
however, sonorants appear between consonants, so it is not necessary to mark them, v.i.
B. CONSONANTS AND CONSONANTAL SOUNDS
Phoneme Greek Latin Persian Armenian Cyrillic Devan.
[p] P p
[b] B b
[b
h
] Bh bh
[t] T t
[t
h
] Th th
[d] D d
[d
h
] Dh dh
[k] K k
[k
h
] Kh kh
[g] G g
[g
h
] Gh gh
[k
w
]
Q q
[g
w
]
C c
[g
wh
]
Ch ch
[h]
H h
[j] (J j) J j
/
, /
[w] W w
(W w)
[r] R r
[l] L l
[m] M m
[n] N n
[s] S s
2.1.2. The Latin Alphabet used for Modern Indo-European is similar to the English,
which is in turn borrowed from the Late Latin abecedarium. We also consider some
digraphs part of the alphabet, as they represent original Proto-Indo-European sounds, in
contrast to those digraphs used mainly for transcriptions of loan words.
2. Letters and Sounds
111
NOTE. The Latin alphabet was borrowed in very early times from the Greek alphabet and did not
at first contain the letter G. The letters Y and Z were introduced still later, about 50 BC.
The names of the consonants in Indo-European are as follows - B, be (pronounced
bay); Bh, bhe (b
h
ay); C, ce (g
w
ay); Ch, che (g
wh
ay); D, de (day); Dh, dhe (d
h
ay); F,
ef; G, ge (gay); Gh, ghe (g
h
ay); H, ha; K, ka; L, el; M, em; N, en; P, pe; Q, qu; R,
er; S, es; T, te; V, ve; W, wa; X, eks; Z, zet.
2.1.3. The Latin character C originally meant [g], a value always retained in the
abbreviations C. (for Gaius) and Cn. (for Gnaeus). That was probably due to Etruscan
influence, which copied it from Greek , Gamma, just as later Cyrillic , Ge.
NOTE 1. In early Latin C came also to be used for [k], and K disappeared except before in a few
words, as Kal. (Kalendae), Karthago. Thus there was no distinction in writing between the sounds
[g] and [k]. This defect was later remedied by forming (from C, the original [g]-letter) a new
character G. Y and Z were introduced from the Greek about 50 B.C., and occur mainly in loan
words in Modern Indo-European.
NOTE 2. In Modern Indo-European, C is used (taking its oldest value) to represent the Indo-
European labiovelar [g
w
] in PIE words, while keeping its different European values [k], [ts], [s],
[], [], etc. when writing proper names in the different modern IE languages.
2.1.4. The Latin [w] semivowel developed into Romance [v]; therefore V no longer
adequately represented [w] and the Latin alphabet had to develop an alternative letter.
Modern Indo-European uses V mainly for loan words, representing [v], while W is left
for the consonantal sound [w].
NOTE. V originally denoted the vowel sound [u] (Eng. oo), and F stood for the sound of
consonant [w] (from Gk. , called digamma). When F acquired the value of our [f], V came to be
used for consonant [w] as well as for the vowel [u].
2.1.5. The letter I stood for the vowel [i], and was also used in Latin (as in Modern
Greek) for its consonant sound [j]. J was originally developed as a swash character to end
some Roman numerals in place of I; both I and J represented [i], [i], and [j]. In MIE, J
represents the semivowel [j]. In the Latin script, Y is used to represent the vowel [y] in
foreign words.
NOTE. That [j] value is retained in English J only in foreign words, as Hallelujah or Jehovah.
Because Romance languages developed new sounds (from former [j] and []) that came to be
represented as I and J, English J (from French J), as well as Spanish, Portuguese or Italian J have
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
sound values quite different from [j]. Romanisation of the sound [j] from different writing systems
(like Devanagari) as Y which originally represented in Latin script the Greek vowel [y] , due to
its modern value in English, French or Spanish, has spread a common representation of [j] as Y in
Indo-European studies, while J is used to represent other sounds.
2.1.6. The consonant cluster [ks] was in Ancient Greece written as X (Chi) in Western
Greek, (Xi) in Eastern Greek dialects. In the end, X was standardized as [k
h
] ([x] in
modern Greek), while represented [ks]. In the Latin script, the X stands for [ks], as in
English or Latin, whereas in the Cyrillic alphabet it stands for [h] (and aspiration), as
well as for [x] in foreign words.
NOTE. The Etruscans took over X from Old Western Greek, therefore it stood for [ks] in
Etruscan and then in Latin, and also in most languages which today use an alphabet derived from
the Roman, including English. Cyrillic X was taken with its standard Greek value [x], but is also
used as [h] in those languages that need it; as, Macedonian, and Bulgarian and Serbian dialects.
2.1.7. As in Ancient and Classic Greek, in the Greek alphabet X stands for [k
h
], for
[p
h
], and for [t
h
].
NOTE. Because of its use in Modern Greek, they also represent (mainly foreign) [x], [f] and [].
2.1.8. represents [] in the Greek alphabet, because was originally used in most
Greek dialects to represent the sound [h], and it is therefore used with this value in IE
writings, as well as to mark aspirated phonemes.
NOTE. For more on the problem of historical Eta and its representation in the modern Greek
alphabet, see <http://www.tlg.uci.edu/~opoudjis/unicode/unicode_aitch.html>.
2.2. CLASSIFICATION OF SOUNDS
2.2.1. The Vowels are short [a], [e], [i], [o], [u], written a, e, i, o, u, and long [], [],
[], [], [], written , , , , , respectively. The other sounds are Consonants.
The Modern Indo-European Diphthongs proper are [ei], [oi], [ai], written ei, oi, ai,
and [eu], [ou], [au], written eu, ou, au. In these diphthongs both vowel sounds are
heard, one following the other in the same syllable.
NOTE. For the so-called long diphthongs [i], [i], [i], written i, i, i, and [u], [u], [u],
written u, u, u, which remained only in Indo-Iranian, Greek and partly in Baltic languages,
Schulze (1885) interpreted a regular correspondence of the type i//, which came respectively
from the full grade of the long diphthong, the full grade before consonant (where the second
2. Letters and Sounds
113
element was lost), and the zero-grade (a contraction of schwa with the semivowel). Martinet
(1953) proposed that laryngeals were behind those long diphthongs.
In any case, in the languages in which they are retained, long diphthongs have not a longer
duration than normal diphthongs; phonologically they are equivalent, as Vedic and Greek metric
shows. After Adrados (1995), The difference, therefore, is not on the duration of the group, but on
the relative duration of their components; in other words, for example ei and i have the same
phonological duration (they are long, as opposed to a brief vowel), but in ei both elements have
approximately the same duration, whereas in i the duration of i is perceptibly shorter than e.
Because of that, the name long first element diphthongs is more appropriate to refer to these
phonemes. Cf. Allen (1976) for an analysis of these diphthongs.
Strictly speaking, phoneticians do not consider the so-called rising diphthongs, [je],
[jo], [ja], [j], [j], [j], nor [we], [wo], [wa], [w], [w], [w], as diphthongs proper,
but rather sequences of glide and vowel.
NOTE. Whilst most Indo-Europeanists differentiate between sequences of approximant and
vowel (rising diphthongs) from true falling diphthongs in their transcriptions, i.e. writing [je]
(from [i]+[e]) but [ei] or [ei] (from [e]+[i]), some use a different approach, considering all of them
combinations of vowel plus glide or glide plus vowel, i.e. writing [je] and [ej], or [ie] and [ei].
Therefore, there are no real triphthongs. The formations usually called triphthongs are
[jei], [joi], [jai], [jeu], [jou], [jau], as well as [wei], [woi], [wai], [weu], [wou], [wau]; and
none can be named strictly triphthong, as there is a consonantal sound [j] or [w] followed
by a diphthong. The rest of possible formations are made up of a diphthong and a vowel.
2.2.2. Consonants are either voiced (sonant) or voiceless (surd). Voiced consonants are
pronounced with vocal cords vibration, as opposed to voiceless consonants, where the
vocal cords are relaxed.
a. The voiced consonants are [b], [d], [g], [g
w
], [l], [r] and [], [m], [n], [z], [j], [w].
b. The voiceless consonants are [p], [t], [k], [k
w
], [s].
c. The digraphs bh, dh, gh and ch represent the Indo-European voiced aspirates
proper, i.e. [b
h
], [d
h
], [g
h
], [g
wh
], whereas ph, th, and kh represent voiceless aspirates
[p
h
], [t
h
], [k
h
], mostly confined to words of Greek origin, as well as foreign [], [] and
[x], respectively.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
d. The consonants [r], [l], [m], [n], and the semivowels [j] and [w], can function both as
consonants and vowels, i.e. they can serve as syllabic border or center.
NOTE. There is a clear difference between the vocalic allophones of the semivowels and those of
the sonants, though: the first, [i] and [u], are very stable as syllabic center, while [r], [l], [m], [n],
arent, as they cannot be pronounced more opened. Hence the big differences in their evolution,
depending on the individual dialects.
2.2.3. The Mutes are classified as follows:
voiceless voiced aspirated
labials p b b
h
dentals t d d
h
velars k g g
h
labiovelars k
w
g
w
g
wh
Labialized velars or Labiovelars [k
w
] (written q), [g
w
] (written c), [g
wh
] (written ch), are
pronounced like [k], [g], [g
h
] respectively, but with rounded lips.
NOTE 1. German Neogrammarians reconstructed a fourth series of phonemes, the voiceless
aspirates *p
h
, *t
h
, *k
h
, to explain some irregularities in the outputs of the voiceless row. Most Indo-
Europeanists reject this fourth independent row of phonemes, and findings of Indo-Iranian,
Armenian and Greek have been explained as 1) expressive in origin, 2) contact of a voiceless with a
laryngeal phoneme, and 3) effect of a prior s. For support of the fourth row, cf. Szemernyi (1985).
NOTE 2. The modern mainstream Proto-Indo-European reconstruction, that accepts only these
two rows of velars as the most logical PIE phonetic system, has been confronted with the question
of the actual existence of the groups [kw], [gw], and [g
h
w], different from (and similar or identical
in their dialectal outputs to) labialized [k
w
], [g
w
], and [g
wh
]. A distinction between both is often
found, though, whether an independent row of palatalized velars is accepted or not; as, kwn,
dog, ekwos, horse, ghwer-, wild, kweidos, white, kwet-, cook (cf. O.Ind. kwathati), tekw-,
run, etc. which might be found reconstructed as *qn, *eqos, *cher-, etc. For a defence of such
unified forms, see e.g. Jussi Halla-aho at <http://ethesis.helsinki.fi/julkaisut/hum/slavi/vk/halla-
aho/problems.pdf>.
2.2.4. The so-called Liquids are l, which represents the alveolar lateral approximant [l],
an r, pronounced in PIE and in most modern IE languages (at least occasionally) as
alveolar trill [r], today often allophonic with an alveolar tap [], particularly in unstressed
positions. These sounds are voiced.
2. Letters and Sounds
115
NOTE. About Indo-European r and l, cf. Banczerowski (1968).
For foreign words, the group rh represents an [h] sound coming just after the [],
mainly in words of Greek origin. Other groups include rr, the alveolar trill [r], and its
aspirated counterpart rrh. The palatal lateral approximant [] shall be represented as lj.
2.2.5. The Nasals are labial [m], written m, and dental [n], written n. These are voiced.
The velar nasal [] as ng in English sing could have existed in IE as allophone of [n]
before velars.
NOTE. Erhart (1970) reconstructs three nasals, N, M1 and M2, this one a fricative seminasal
with which he explains the results of alternating m and w in some suffixes and roots; as, -ment-/-
went-, men-/wen-, etc. It was left unexplained, though, under which conditions did it change.
The palatal nasal [] of foreign words (similar to the [n] sound in English onion or
canyon) is represented by the pair nj.
2.2.6. The Fricatives are voiceless [s] and voiced [z], z being usually the output of s
before voiced consonants.
NOTE. [z] was already heard in Late Proto-Indo-European, as a different pronunciation
(allophone) of [s] before voiced consonants, as can be clearly seen in PIE nizdos (for nisdos),
nest, which comes from PIE roots ni-, down, and zero-grade -sd- of sed, sit. Because of that it is
preferred to write s for [z] in MIE.
It is also possible to write voiceless and voiced pairs from foreign words: labiodentals, f
and v; dorsal voiceless h and [x], written kh; and postalveolar [] as sh and [] as zh.
Possible groups include ks, ts, dz, tsh (for [t]), dzh (for [d]), etc.
2.2.7. The Semivowels are usually written j, and w. These are voiced.
NOTE. Some authors make a distinction between consonantal [j], [w], and vocalic [i], [u].
Actually, however, both appear as TIT and EYE (where T = consontant, E = vowel, I = i,u, Y = j,w),
and never as TYT or EIE. Against it, see Schmitt-Brandt (1967) and Szemernyi (1985) and
Mayrhofer (1986).
2.2.8. Gemination appears in phonemes whose duration is long enough to be perceived
their implosion and explosion, both audible as distributed in two syllables. They
existed in PIE: in stops, as apps, atts (and tt), dad, pappj, eat, or kakkj,
shit; in nasals, as anna-, amm (and mamm), mother, mum; in liquids, as bull,
buble; and in the sibilant, as kuss, kiss.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
NOTE. They appear mostly in words of expressive origin, children vocabulary, onomatopoeia,
etc., which makes it more likely that PIE inherited gemination as an expressive resource, different
from its central phonological system; a resource that was retained for a long time by most IE
languages as a recurrent possibility.
2.2.10. A synoptic table of the Proto-Indo-European phonetic system:
Labials Coronals Palatovelars Velars Labiovelars Gutturals
Voiceless
p t k
j
k k
w
Aspirated
p
h
t
h
k
h
Voiced
b d g
j
g g
w
Aspirated
b
h
d
h
g
jh
g
h
g
wh
Nasals
m n
Fricatives s , z h, *H
Liquids r , l
Approximant w j
NOTE 1. The existence of a distinctive row of PIE satemizable velars, the so-called palatovelars,
has been the subject of much debate over the last century of IE studies. Today a majority of
modern scholars support only two types of velars in Late PIE generally Velars and Labiovelars,
although other solutions have been proposed, see Appendix II.2.
Palatovelars could be found in PII, though, and are to be represented with , , h h.
The support of German Neogrammarians to the palatals in Proto-Indo-European, as well as its
acceptance in Brugmanns Grundri and Pokornys Wrterbuch, extended the distinction to many
(mainly etymological) works, which didnt deal with the phonological reconstruction problem
directly. As Adrados (2005) puts it, about the standard [=Brugmannian] theories nowadays,
Indo-Europeanists keep working on a unitary and flat PIE, that of Brugmanns reconstruction. A
reconstruction prior to the decypherment of Hittite and the study of Anatolian! This is but other
proof of the terrible conservadurism that has seized the scientific discipline that is or must be
Indo-European linguistics: it moves forward in the study of individual languages, but the general
theory is paralised. It is sad when our students go to Germany and come back brainwashed.
NOTE 2. The cover symbol *H, traditionally *, stands for the uncertain Late PIE ouput of the
(for Middle PIE) reconstructed laryngeal phonemes h1, h2, h3, which had evolved differently
already by the time when Late PIE and Proto-Anatolian were independent languages. There is no
2. Letters and Sounds
117
consensus as to what these phonemes were like, or how many of them (if any) survived into Late
PIE, but it is widely accepted that PIH *h2 was probably uvular or pharyngeal, and that *h3 was
labialized. Commonly cited possibilities are , ,
w
and x, ~, x
w
. See Appendix II.3.
2.3. SOUNDS OF THE LETTERS
2.3.1 The following pronunciation scheme is substantially that used by the common
Europes Indo-European speakers in ca. 2500 BC, when the laryngeal phonemes had
already disappeared, having coloured following vowels, and lengthened preceding ones.
NOTE. MIE cannot permit dialectal phonetic differences, whether vocalic or consonantal like
Grimms Law effects in PGmc. consonants, already seen , because a homogeneous pronunciation
system is especially needed when targeting a comprehensible common language. Some differences
exist in sister dialects Hellenic, Aryan and Anatolian, though.
2.3.2. Vowels:
as in father a as in idea
as in they e as in met
as in meet i as in chip
as in note o as in pot
as in rude u as in put
NOTE 1. Following the mainstream laryngeals theory, Proto-Indo-Hittite knew only two vowels,
*e and *o, while the other commonly reconstructed vowels were earlier combinations with
laryngeals. Thus, short vowels PIE a < *h2e; e < *(h1)e; o < *h3e, *(h1)o; long vowels < *eh2; <
*eh1; < *eh3, *oh. The output of *h2o in Late PIE was either a or o, after the different schools.
Short and long vowels and were just variants of the semivowels PIH *j and *w.
NOTE 2. The sonants may have been lengthened too (usually from older sequences of sonant +
laryngeal, or because of compensatory lengthenings), especially in the conjugation of verbs, giving
thus [r], [l], [m], [n], written as r, l, m, n. The semivowels can also have a prolonged
pronunciation, giving allophones ij and uw. For more details on this see 2.7.2.
2.3.3. Falling Diphthongs and equivalents in English:
i as in vein u e (met) + u (put)
i as in oil u as ow in know
i as in Cairo u as ou in out
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
2.3.4. Consonants:
1. b, d, h, l, m, n, are pronounced as in English. n
might also be pronounced as guttural [ ] when it is
followed by another guttural, as in Eng. sing or bank.
3. p, k, t are plain as in Romance, Slavic or Greek
languages, not aspirated as in English; t is never
pronounced as sh, as in English oration or creation.
4. g always as in get. It had two dialectal
pronunciations, the common simple velar and the
eastern (later generalized in PII) palatovelar.
Compare the initial consonants in garlic and gear,
whispering the two words, and it will be observed
that before e and i the g is sounded farther forward in
the mouth (more palatal) than before a or o. That is
what we represent as , similar to , pronounced as k
in key, compared to c in cold.
5. c is pronounced similar to [g] but with rounded
lips. Compare the initial consonant in good with get
to feel the different articulation. The voiceless q is
similar to [k] but pronounced with rounded lips; as c
in cool, compared to c in car.
6. bh, dh, gh, ch are uncertain in sound, but the
recommended pronunciation is that of the
Hindustns voiced aspirated stops bh, dh, gh, as
they are examples of living voiced aspirates in an
Indo-European language (see note to the left).
7. The voiceless aspirated ph, kh, th, frequently of
Hellenic origin, are pronounced very nearly like
English word-initial p, k, t, as in pen, ten, Ken. Their
sound is also described as equivalent to p+h, t+h,
There are several ways
to generate breathy-
voiced sounds:
1. To hold the vocal
cords apart, so that they
are lax as they are for
[h], but to increase the
volume of airflow so that
they vibrate loosely.
2. To bring the vocal
cords closer together
along their entire length
than in voiceless [h], but
not as close as in
modally voiced sounds
such as vowels. This
results in an airflow
intermediate between [h]
and vowels, and is the
case with English
intervocalic [h].
3. To constrict the
glottis, but separate the
arytenoid cartilages that
control one end. This
results in the vocal cords
being drawn together for
voicing in the back, but
separated to allow the
passage of large volumes
of air in the front. This is
the situation with
Hindustani.
2. Letters and Sounds
119
k+h, i.e. to the corresponding mutes with a following breath, as in loop-hole, hot-house,
block-house.
8. j as the sound of y in yes, never the common English [d], as j in join; w as w in will.
9. Indo-European r was probably slightly trilled with the tip of the tongue (still
common today in many IE languages), as in Scottish English curd. Another
pronunciation is common today among modern IE languages, and was possibly heard in
PIE, the alveolar tap [], pronounced like the intervocalic t or d in American or Australian
English, as in better.
10. s is voiceless as in sin, but there are situations in which it is voiced, depending on
the surrounding phonemes. Like the aforementioned [r], modern speakers will probably
pronounce [s] in slightly different ways, but this should not usually lead to
misunderstandings, as there are no proper IE roots with original [z] or [], even though
the former appeared in some phonetic environments, v.s.
11. Doubled letters, like ll, mm, tt, etc., should be so pronounced that both members of
the combination are distinctly articulated.
12. Regarding foreign sounds:
o kh might represent [x], whether strong, with ach-laut, such as kh in Russian
Khrushenko, or ch Chanukah, or soft, with ich-laut, such as ch in German
Kirche or Lichtenstein. Also, th might be pronounced as English th in thing,
and dh as th in this.
o z, v, f, sh, are pronounced as in English.
o zh is pronounced as s in English leisure.
o tsh corresponds to English ch in chain, and tzh to j in jump.
2.4. SYLLABLES
2.4.1. In many modern languages, there are as many syllables in a word as there are
separate vowels and diphthongs. This is not exactly so in Modern Indo-European. It
follows, indeed, this rule too:
swe-sr, sister, skrei-bh, write, ne-w, new, ju-gm, yoke.
NOTE. The semivowels are always written j and w. So in trejes, three, newos, new, dghws
[dngh-ws], languages, etc.
A GRAMMAR OF MODERN INDO-EUROPEAN
Indo-European Language Association <http://dnghu.org/>
2.4.2. Indo-European has also consonant-only syllables. It is possible to hear similar
sound sequences in English cattle or bottom, in German Haben, in Czech hlt, Serbian
srpski, etc. In this kind of syllables, it is the vocalic sonant [r], [l], [m], or [n]
constrained allophones of [r], [l], [m], [n] , the one which functions as syllabic centre,
instead of a vowel proper:
k-di, heart, w-qos, wolf, de-k, ten, n-m, name.
NOTE 1. Words derived from these groups, represented TRT (where T = consonant, R = sonant),
are unstable and tend to add auxiliary vowels before or after the sonants, i.e. TRT or TRT.
Because of that, their evolutions differ greatly in modern IE languages. For example, dghw,
language, evolved as [d
n-g
h
wa:] into PGmc. tung(w), and later English tongue or German
Zunge, while in archaic Latin it was pronounced dingwa, and then the initial d became l in Classic
Latin, written lingua, which is in turn the origin of Modern English words linguistic and
language. For wqos (cf. Ved. vkas < PII wkas), it evolved either as [w