Professional Documents
Culture Documents
by Igor A. Tonoyan-Belyayev
July 2018
Saint-Petersburg, Russia
§1. The present paper is an attempt to build a linguistically based model for
both South Asian origin of all Indo-Europeans and for the process of gradual
expansion of the Indo-European languages outside South Asia during Late
Neolithic, Bronze, and Iron Ages. The core set of non-linguistic evidence in
favour of the OIT (Out-of-India) or OSAT (Out-of-South-Asia) theory itself is
discussed elsewhere.
Beforehand, only one point should be noted. As of 2018, we have first
documentally corroborated evidence of Indo-Europeans in Eastern Anatolia
since c.2000-1800 BC (Hittite words in Assyrian texts); then, we have
documents in Anatolian languages from ca.1600 BC on, Asia Minor; finally, we
have both Mycenaean Greek documents in Greece and Crete from ca.1500-
1400 BC and the remnants of Aryan-like names, titles, and special terms in
Hurrite and Hittite texts from ca.1500-1400 BC. This is what is already well-
known and well-established. This shows that no simple fabrication basing
upon the idea of "Caucasian" or "Europeoid" race being present already in
Neolithic Europe "means" that Indo-European languages tacitly equated
with a definite race type (we know which one) were also present there, can
be approved as a matter of fact. Nothing, except simple wish, can speak in
favour of any early Indo-European presence in both Western and Eastern
Europe (including the ideas of the so-called Kurgan hypothesis which can in
no way be taken as based on facts due to the lack of any linguistic evidence in
the absence of documents of language). On the other hand, we have two still
controversial evidences: the idea of the earliest Indo-European presence or
contact with Southern Mesopotamia (as Whittaker, myself, and others have
tried to show), possibly via the maritime trade with the Mature Harappan
civilization (which is illustrated with some words of possible Indo-European
origin in Sumerian); and the fact that the Mature Harappan script is still
undeciphered, and as it represents the major among the greatest civilizations
of the Bronze Age, taken together with the facts of archaeology (no traces of
Aryan invasion after during or after the Mature Harappan epoch, 2600-1900
BC), the character of Old Indo-Aryan literary tradition (an abnormally huge
corpus containing texts in different stages or sub-stages of Old Indic), and so
on, it can appear to be either the earliest Indo-European script (thousand
years earlier than that of Hittites and Mycenaeans) or the script of a separate
archaic branch of Indo-European, that is, Proto-Indic. This idea is usually
tacitly or mockingly refused by most Indo-European scholars so far,
but there is nothing non-verifiable (or, non-falsifiable) here, so it is
principally a scientific problem still requiring elaboration of the
means of its solution, instead of hushing it up.
§2. The system of consonants as a starting point. Lexical isoglosses and
lexicostatistics themselves are very useful, but being considered in short lists
(like that of Swadesh) they give nearly nothing (especially, without precise and
deep analysis of diachronic shift of meanings, sets of synonyms, and sets of
derivatives from the same root). Grammatical indicators and syntactical
structures give more. But phonetics and phonology is something which shows
both internal divergence of the group of related dialects and some systematic
substratum/adstratum influences. Anyway, they all should be subject to
complex analysis, but we still need something coherent but relatively simple
to begin with.
All Indo-European dialects can be roughly divided into four groups with
respect to their systems of consonants:
1) the system of only one series of plosives (no voiced/unvoiced and no
aspirate/non-aspirate pairs), which is represented by Anatolian and Tocharian
(with some further traces of previous weak~strong opposition).
2) the system of two series of plosives, voiced and unvoiced, which
is represented by Baltic, Slavic, Nuristani, many late Eastern Iranian dialects,
and for the most part by Celtic (though with some traces of a third series, cf.
Italic in the next group below).
3) the system of three primordial series of plosives, itself forming
two subgroups – of [3A] Greek (with unvoiced aspirates in place of voiced
ones) and of [3B] Germano-Armenian (with shift and with simple voiced
plosives in place of voiced aspirates) types, respectively. Italic, with its
remnants of the third series, occupies a somewhat intermediate position, with
its "h" and "f" of Greek-like type, while "d" and "b" of Germano-Armenian type.
4) the system with secondarily developed "fourth" unvoiced
aspirate series, again forming two subgroups – of [4A] Indic (with four series
of plosives, two non-aspirate and two aspirate ones) and of [4B] Dardo-
Southern/Western-Iranian type (with new unvoiced aspirates present, but
older aspirates merged with simple voiced plosives).
It is difficult so far to classify minor Indo-European branches with
respect to this scheme, but the whole set of main branches is enough to begin
complex analysis.
Even when looked at superficially, this list shows, according to the
general rule of archaic periphery and innovative center, that the center
(and hence, the point of departure) of Indo-European expansion occupied the
territory of later dialects of the fourth group. Of course, superficial
observation cannot be accepted as a strong argument as is. It should be
further studied. What is "surprisingly" obvious here is that the old scheme of
centum~satǝm branching is not that important as the systematic analysis of
the whole system of plosives, as centum and satǝm forms may be present in th
same language, while two different systems of plosives themselves cannot be
present in one and the same language. Moreover, the satǝm~centum division
as well as the problem of labiovelars vs. palatals are much more sophisticated
and need more case study not allowing real generalizations which are
attempted to do.
Going back to the rule of center and periphery, we can also notice that
Anatolian and Tocharian, accepted as split off first basing upon other reasons,
here are also shown as the first group to be separated and thus preserving
archaic model of plosive system (two series of weaks and strongs), though
gradually dying out.
Going on, we also see that Nuristani, Balto-Slavic, mostly Celtic and
Eastern Iranian, though superficially showing a different system, in fact
represent the following stage of evolution where the archaic binary opposition
was mostly rethought than fully lost or expanded; of course, this process was
complex, and dialects of the remaining two groups significantly influenced it,
but the these groups show long stability as to preserving a binary
opposition, but not a ternary one. This makes us suppose that areally they
represent the second wave periphery. It also speaks in favour of the idea
that original Proto-Indo-European did not possessed any aspirate series,
whether voiced or unvoiced, all this being the product of subsequent regional
evolution; the series of voiced aspirates of Indic and unvoiced aspirates
in Greek (and mix in Italic) was thus originally likely a strong series,
then rethought as a voiced one, and only thereafter as an aspirated one. But
then, Anatolian, Tocharian, Baltic, Slavic, Nuristani and the like together
show, that it is highly possible that the distinction between voiced and
unvoiced non-aspirates was just a kind of positional evolution of former
allophones (two frequent positions, namely, before a pause and after the s-
mobile, originally fully neutralized this distinction). Before we continue, let's
look at a minute example of some other kind, that is, a lexical isogloss.
Special relation between Iranian (especially, Avestan and Steppe Iranian) and
Slavic is again too striking to be just a case of happening. In case of Armenian
and Indic, on the one hand, looking somewhat similar to Iranian and Slavic (as
regards the presence of palatal affricates), they, on the other hand, preserve
much more archaic common feature (being common with centum languages,
that is three series of plosives), the two facts together looking controversial
only for those who just abstractly postulate some Urheimat far from
South Asia, it gets an explanation with the OIT being quite satisfactory just
showing shift of the center of innovation, from South Asia to Iran and Central
Asia. Now let us get back to the problem of plosives in general (the idea that
"labiovelars" were originally simple uvulars, with no etymological, separate
labial element, is discussed elsewhere).
Now, before and during the first wave, the system of plosives included only
pairs of weaks vs. strongs, i.e. k~kk, q~qq, t~tt, p~pp, indifferent towards
voicedness and aspirations. In the periphery uvulars became mostly
labiovelars thus obtaining additional quasi-articulation, while this was not the
case in the Stem zone. By the end of this period it was
voicedness/unvoicedness which appeared to be the first allophonic feature
suffering some kind of phonematization.
During the second wave, the former opposition accepted the form of
k/g~kk/gg, k/kw/g/gw~kk/kkw/g/ggw, t/d~tt/dd, p/b~pp/bb (b* being the
special case suffering confusion with v and, more seldom, m); this picture can
be still partly from the graphic form of Hittite. It is during this period that
aspiration began to become a perceptible allophonic feature, while first cases
of losing s-mobile began to take place producing additional types of alteration.
Namely, those simples which had no preceding s, became voiced simples,
while those having it, became unvoiced (this is only a statistical rule, with
numerous exceptions). It is during this period, that together with voicedness
development, the future Indo-European "root structure" appeared,
anticipating such future cases as Greek pythmēn, Indic budh-, yielding further
the so-called "aspiration shifts" which inherited from the previous
simplification of the preceding strong in two sequent syllables when the latter
of them was not standing before a pause or s (cf. Greek thriks, trikhos, Indic
-dhuk[s], -duh-, etc.). During this period, all instances of plosives already had
their own voicedness or unvoicedness features, the process of selection (on
the basis of former weaks vs. strongs) already finished, while the emergence
of aspiration within the remaining Stem* continued. It is from here on that the
process of reinterpretation of velars and uvulars began within the Stem
area finally leading to the development of different palatalization and, much
later, spirantization effects (those later-spreading dialects which "covered"
zones of Proto-Tocharian and Proto-Anatolian waves, developed labiovelar-like
phonemes as well, while those never reaching their borders did not do so,
developing only "split palatals" instead).
The third wave may be largely treated as a later sub-wave of the former
so far, as minute details are yet not being discussed here. By the end of its
period, in the Stem zone aspiration became phonological thus yielding three
classes of stops, and with the spread of the next wave the previously
peripheral dialects obtained some very rara cases of specific "ternary plosive
adstratum" though without the secondary development of any third series, but
with some oscillation of already selected voicedness/unvoicedness and with
some secondary spirantization leading to the development of previously non-
existing phonemes (cf. Italic f, h, Slavic x, etc.; also, later z).
The fourth wave demonstrates some interesting features. Whereas its
far-western edge yields unvoiced aspirates in place of former strongs, both
western Armenian and northern Germanic suffer some special effect of
"consonantal shift", according to which mostly only former strongs retained
their voicedness, while simple voiced lost their voicedness, simple unvoiced
acquiring secondary unvoiced aspiration. All this leads to the supposition that
in the previous period when aspiration only appeared as an allophonic feature
it always yielded unvoiced aspirateness, but this developed in Germano-
Armenian in former positionally-free (non-clustered) original weaks, while in
Graeco-Aryan in former, original strongs. Greek and Phrygian free aspirates'
unvoicedness might be explained by special shift (archaic voiced strongs
directly to unvoiced aspirates) or via secondary loss of voicedness (the latter
is the mainstream explanation although it leaves unexplained the fact that
simple "deaspirated" phonemes are voiceless, too). In addition, Indic shows
some traces of previous unvoicedness of its aspirates, in some rare
oscillations. Anyway, by the end of this fourth wave Indic zone gradually
became periphery, the center having moved to Iran and southern Central
Asia (we've discussed it in brief above, in context of the process of
satemization or, to be more precise, "post-palatalization spirantization"). By
the end of this period, there were areas to the west (Graeco-Armeno-Thraco-
Phrygian, partly overcovering the previously Italo-Celtic zone, whence
adstratum-like third series in these groups having developed), to the east!
(Indo-Aryan), and to the north and later north-west (Germanic), where fully
developed three-series systems were present. In the area of new center, i.e.
Greater Iran and most part of southern Central Asia, there remain zone of still
two series ("second wave's legacy"), until the last exodus from the Indus valley
took place after the end of the Mature Harappan period, when newly
developed fourth series spread over the new center zone, thus yielding
secondary unvoiced aspirates and fricatives only in Iranian and Dardic
languages (the traces of this exodus are on the north-western route, and this
process was later partly redirected backwards, yielding the late loss of voiced
aspiration in New Indo-Aryan Punjabi language, being in its core purely Indo-
Aryan).
The fifth wave may be called "equestrian explanse", as it is during
this Late-to-Post-Harappan period that proto-Iranians covered an extremely
large territory to the north and to the west during the relatively short period
of historic time. It is since this time that the word with IE stem *ek'wo began
to mean only horse, but not any transportation/onerary/race animal (including
bulls [the most archaic version, where *ekwo was a masculine counterpart to
*gwōu, "a cow", only later assuming epicene-to-masculine gender locally],
asses, and various types of "not so perfect" equids), as it was before in various
Indo-European dialects (and as the Rigveda, being prior to this period,
frequently shows by its contexts where the word aśva is used). This
automatically implies that all the previous waves were very slow, using mainly
oxen and imperfect equids as packing animals. And this also implies that until
the Iranian expanse into the Steppe, all other previously separated Northern
Route dialect-speakers lived much closer to South Asia (not so in case of
Western Route, as there existed an intensive trade route between modern
Pakistan and Western Anatolia from prehistoric times; I prefer to call this
Western Route the Ancient [Indo-European spreading] Route of Silver
and Lapis Lazuli, the former component being explained with the help of
rajata-isogloss along this path, but not in the north, cf. Indic rajata, Avestan
ǝrǝzata, Armenian artsath, Latin argentum, Celtic argat, Anatolian and Greek
being not that susceptible and thus having only argyros for "silver" from the
same root, while Hittite ḫarki- for just "white").