Professional Documents
Culture Documents
Phasal Polarity Systems in East Bantu
Phasal Polarity Systems in East Bantu
Althea Löfgren
Department of Linguistics
Independent Project for the Degree of Bachelor 15 HEC
General Linguistics
Spring 2018
Supervisor: Ljuba Veselinova
Examiner: Francesca Di Garbo
Phasal Polarity Systems in East Bantu
Abstract
This study explores a category of expressions akin to not yet, already, still and no longer, called
Phasal Polarity (PhP) expressions. They encode the domains of phasal values, polarity and
speaker expectations and have previously been described in European languages (van der
Auwera 1998) and in a small, genealogically diverse sample (van Baar 1997). Using refer-
ence grammars as the primary source of information, the aim of this crosslinguistic study is to
describe PhP expressions in a sample of East Bantu languages. It is found that the distribution
and behaviour of PhP expressions in East Bantu differ from both European languages and the
genetically diverse sample of van Baar. The markers are found to be morphologically diverse
and of varied crosslinguistic frequency. Furthermore, the verbal morphotax indicates that the
markers are, or are in the process of, being incorporated into the tense-aspect systems of their
respective language.
Keywords
aspect, Bantu languages, phasal polarity, tense
Sammanfattning
Denna studie utforskar en kategori av utryck besläktade med inte än, redan, fortfarande och
inte längre, som kallas Phasal Polarity (PhP) expressions. De uttrycker fas, polaritet och ta-
larörväntningar och har tidigare beskrivits i en studie av europeiska språk (van der Auwera
1998) och i en småskalig, genetiskt mångfaldig studie (van Baar 1997). Med referensgram-
matikor som primärkälla ämnar denna studie undersöka PhP expressions i ett urval av språk
i underfamiljen Östbantu. Resultaten visar att PhP expressions i Östbantu har annorlunda
distribution och användning jämört med europeiska språk och van Baars urval. Markör-
erna varierar avseende morfologi och tvärspråklig frekvens. Verbens morfotax indikerar att
markörerna har, eller håller på att, inkorporeras i respektive språks -system.
Nyckelord
aspekt, Bantuspråk, phasal polarity, tempus
Table of Contents
1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2 Background . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2.1 Phasal Polarity expressions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1
2.1.1 PhP expressions and phasal values . . . . . . . . . . . . . . . . . . . . 2
2.1.2 PhP expressions and polarity . . . . . . . . . . . . . . . . . . . . . . . 2
2.1.3 PhP expressions and speaker expectations . . . . . . . . . . . . . . . . 4
2.2 Previous crosslinguistic studies . . . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.2.1 Van der Auwera’s European study . . . . . . . . . . . . . . . . . . . . 5
2.2.2 Van Baar’s crosslinguistic study . . . . . . . . . . . . . . . . . . . . . . 5
2.2.3 Kramer’s PhP parameters . . . . . . . . . . . . . . . . . . . . . . . . . 5
2.3 Bantu languages . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
2.3.1 Classification and characteristics . . . . . . . . . . . . . . . . . . . . . 6
2.3.2 Tense and aspect in Bantu languages . . . . . . . . . . . . . . . . . . . 7
2.3.3 Negation in Bantu languages . . . . . . . . . . . . . . . . . . . . . . . 8
2.3.4 Grammaticalization in Bantu languages . . . . . . . . . . . . . . . . . 9
2.4 Aims and research questions . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
2.5 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3 Method . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.1 Sample . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 10
3.2 Working definitions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 12
3.3 Data collection . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
3.4 Analysis . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 14
4 Results . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 15
4.1 Frequency of occurrence and form of PhP expressions in Bantu . . . . . . . . 15
4.2 Morphological status . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
4.3 Semantics . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 23
4.4 Summary . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 25
5 Further correlations and discussion . . . . . . . . . . . . . . . . . . . . . . . . . . . 26
5.1 Crosslinguistic frequency and universals . . . . . . . . . . . . . . . . . . . . . 26
5.2 Co-occurrence of PhP expressions and more general markers . . . . . . . . . . 26
5.2.1 Co-occurrence with standard negation . . . . . . . . . . . . . . . . . . 26
5.2.2 Co-occurrence of and continuous aspect . . . . . . . . . . . . . . 30
5.2.3 Co-occurrence of and perfect aspect . . . . . . . . . . . . . . 31
5.3 A small scale historical comparative analysis . . . . . . . . . . . . . . . . . . . 32
6 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 35
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 38
A Appendix 1 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 42
B Appendix 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 44
C Appendix 3 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 46
D Appendix 4 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 48
E Appendix 5 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 50
F Appendix 6 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 52
List of abbreviations
Presentation conventions
Metalanguage with small capitals will be used to talk about the PhP expressions on an abstract
level when no specific language is discussed (i.e. , etc.).
Many Bantu languages have grammatical tone. This feature is not consistently indicated in
the reference grammars. This is therefore only illustrated when tone is relevant to distinguish
one PhP expression from another marker. Otherwise, tone is left out of the discussion.
All examples are given as found in the original source. An unglossed morpheme of un-
known meaning is glossed with a question mark. The ISO-code of each language is given in
square brackets in all examples.
1 Introduction
Languages have different ways of categorising a very abstract part of the human existence
– time. This central part of cognition can be expressed in different ways depending on how
one categorises the world. If the point of time of an action is important, the language in
question can express this in different temporal distinctions, such as past, present and future
tense. If the action is viewed as a whole, it can be expressed as aspectual distinctions. The
start, middle and end of an event can also be encoded, mainly through adverbials expressing
phasal values. Some languages have grammaticalized phasal values, in addition to tense and
aspect. Whether the event is taking place at the time of the utterance or not, is also important
and expressed through polarity, i.e. positive and negative values, e.g. the existence or non-
existence of a particular situation. All of the categories above can be expressed either lexically
or grammatically.
There is a category of expressions that combine phasal and polarity values, briefly outlined
above: a semantic domain typically referred to as Phasal Polarity, and the ways of its encoding,
namely Phasal Polarity expressions (henceforth PhP). PhP items are morphologically diverse
markers that facilitate the encoding of primarily three domains: phasal values, polarity and
speaker expectations. Phasal expressions received some attention in the late 1990s, but not
much has been done since then (van der Auwera 1998; van Baar 1997). Thus the domain
remains fairly unexplored.
Two ongoing projects are currently investigating this phenomenon. One of them is Phasal
Polarity in African languages at Hamburg University, led by Raija Kramer. The other one is
Expectations shaping grammar: searching for the link, between tense - aspect and negation at
the Department of Linguistics at Stockholm University, led by Ljuba Veselinova. My study
contributes to both of these projects by collecting and analyzing data from grammars. While
this method is insufficient for a detailed study of this phenomenon, the information found in
grammars is an important part of any crosslinguistic study. This study serves as a stepping
stone for future research in an otherwise little explored area.
2 Background
In this chapter an overview of the theoretical background will be provided. First, a basic
description of PhP expressions will be given (2.1). Second, the findings of earlier studies will
be presented (2.2) and third, some relevant features of Bantu languages will be described (2.3).
1
According to van Baar (1997), the four adverbials in the example are inherently telic/atelic
and therefore possess phasal properties. PhP expressions are also affected by the existence or
non-existence of a situation at the moment of speech. Therefore they also have polarity values.
Many other expressions may appear to be PhP items, but at a closer look they lack either the
phasal values or polarity values that are essential to the objects of this study. Consider the
examples below.
(2) Expressions similar to PhP items (van Baar 1997: 1) [emphasis added]
a. John will not visit me again.
b. John will not be discussing this matter any further.
Van Baar explains that the above examples have iterative or additive meaning but are not
phasal in the same way as PhP expressions are. To be able to classify an expression as a PhP
item, it is essential that they are related by temporal phases as well as polarity.
2
Figure 1: Duality in English (van Baar 1997: 20)
yet happened. This relationship works the same way even with the expressions that are not
negated, i.e. the equivalent of would be [not [ p)] and [ [not p]].
If a situation has not not yet happened, it has started and is already happening. If a situation
is no longer not happening, it has also started and is now already happening.
Languages make use of different combinations of internal and external negation to create
their PhP systems. The examples below show how Spanish uses internal negation to create
and .
The Spanish PhP system consists of an affirmative and a negative ya () and todavía
() thus making use of internal negation to construct both and (van
Baar 1997: 22).
The following examples show how Classical Nahuatl uses external negation to create
and .
(5) Example of in Classical Nahuatl [nci] (Andrews 2003: 76, 92, 175):
oc yōliyah
still they.were.living
‘’they are still alive’
(6) Example of in Classical Nahuatl [nci] (Andrews 2003: 76, 92, 175):
ayoc āc
.still he/she.is.present
‘’he is no longer here’
In this system external negation is used to create the negative PhP expressions, negating
to create an expression for , ayoc. This means that is equal to [not
[ p]].
3
2.1.3 PhP expressions and speaker expectations
Languages make use of different strategies to express if a situation conforms or deviates from
the shared norms in a context. Heine, Claudi and Hünnemeyer (1991) describe these expres-
sions as counterexpectation (CE) markers, like too and only illustrated in the example below.
The main function of a CE marker is, according to Heine et al. , to compare a statement
to what is being presupposed or assumed. In the case of PhP expressions, all of them are to
be viewed as CE markers, but behave differently depending on the context as illustrated in
van der Auwera (1998). Van der Auwera proposes a theory where he views PhP expressions
from the perspective of speaker expectations instead of negation, called the Double Alternative
Hypothesis.
Van der Auwera explains that both the surrounding circumstances and the expectations of
the speakers involved will affect which PhP expressions are being used. There are three sce-
narios that influence the use of PhP items. The first scenario is neutral, where the expectations
of the speaker is met (Scenario 1). The second scenario is an event realized earlier than the
speaker expected (Scenario 2). The third scenario is an event realized later than the speaker
expected (Scenario 3). The examples below are inspired by van Baar (1997: 27). To illustrate
the different scenarios the examples share the same background:
• 7 a.m. Anna and Mary are meeting at the bus stop to exchange notes.
• 8 a.m. Mary will leave for school.
• 9 a.m. Mary will arrive at school.
In the neutral scenario neither of the girls are surprised with the outcome of the meeting
because they are aware of the background information. Even if Anna oversleeps and arrives
at the bus stop at 9 a.m., she would be aware that Mary would have already left and say: ’(I
know) Mary is already at school.’ (Scenario 1).
On the other hand, if Anna arrived on time, but Mary remembered that her morning class
starts at 8 a.m. and not 9 a.m. and therefore left at the agreed meeting time, Anna’s response
would be one of surprise: ’(Oh no!) Mary is already at school.’ (Scenario 2).
Likewise, if Mary was late for the meeting and thus was late for school, Anna’s statement
would be: ’Mary is finally at school.’ (Scenario 3). The second and third scenarios are therefore
opposites, in the first one Mary arrives at school surprisingly early and in the second she is
surprisingly late.
In language specific cases all PhP expressions will not be compatible with all three scenar-
ios. According to van Baar (1997: 29) an English example would show that is only
compatible with scenario one and two while scenario three demands a different expression:
finally. Telicity also affects the compatibility with some scenarios, as is the case with and
. Van Baar argues that their lack of a clear finishing point make them incompatible
with the third scenario since their inherent lateness presupposes an event that has already
taken place.
4
2.2 Previous crosslinguistic studies
As already mentioned, the field of Phasal Polarity is understudied. A brief interest was shown
in the area during the 1990’s, that gave rise to the studies described below and in 2.1.2 and
2.1.3, but since then the interest has subsided. It is not until now that linguists have started to
show interest in the field with two large, ongoing projects that started recently (see chapter
1).
5
Telicity in PhP expressions includes, as discussed in 2.1, both telic and atelic aspects.
Kramer adds to this the concept of earliness/lateness from The Double Alternative Hypothesis,
but also states that the telic PhP items can be insensitive to this concept and, thus, be general.
The parameter wordhood is concerned with the independence, or grammaticalization, of
PhP expressions. Kramer describes PhP items as diverse, ranging from independent auxiliaries
with verbal properties to bound affixes. Since PhP expressions are so varied in their form,
they cannot be identified based on the wordhood parameter alone, but need to be viewed as
structurally diverse, specialised markers that form a paradigm in a language.
The concept of expressibility is discussed in the previous section in the form of universals
based on earlier findings. Kramer discusses this parameter more generally and states that PhP
systems may or may not contain specialised items for all four expressions, no specialised items
at all or any variation in between.
The paradigmaticity parameter focuses on the symmetrical properties of the PhP system
as a whole. Kramer describes this parameter as containing both an internal and external per-
spective. From an internal point of view, paradigmaticity is concerned with whether a PhP
expression has an alternative item with the opposite polarity. From an external perspective
this parameter focuses on whether a system has a one-to-one relationship between PhP items
and non-PhP items.
6
Table 1: Verbal template (Güldeman 1999: 546)
Verbal template
SLOT FUNCTION
preinitial /polarity
initial subject
postinitial /polarity
preradical object
radical verbal root
prefinal derivation/
final
postfinal clause type/object/polarity
(8) Example of near future tense in Logooli [rag] (Nurse 2003: 130) [own glossing]:
kʊ-ra≠gʊr-a
1.-buy-
‘’We will buy’ [Near future]
The near future in Logooli is expressed with the marker ra after the subject marker.
(9) Example of middle future tense in Logooli [rag] (Nurse 2003: 130) [own glossing]:
na-kʊ≠gʊr-i
.1-buy-.
‘’We will buy’ [Middle future]
The middle future in the example is found in the subjunctive, but whether this is obligatory
or not is uncertain. It is, however, expressed with the marker na before the subject marker.
7
(10) Example of far future tense in Logooli [rag] (Nurse 2003: 130) [own glossing]:
kʊ-rika≠gʊr-a
1.-buy-
‘’We will buy’ [Far future]
The far future tense is marked by rika, and like the near future marker, this marker occurs
after the subject.
(11) Example of uncertain future tense in Logooli [rag] (Nurse 2003: 130) [own glossing]:
kʊ-ri≠gʊr-a
1.-buy-
‘’We will buy’ [Uncertain future]
The uncertain future tense, illustrated in example (11), is constructed with the marker ri.
The same pattern is found in the past tenses, the only difference being that Bantu languages
usually have a higher number of past tenses than future tenses, according to Nurse (2003).
Nurse claims an average of one to three future tenses and one to four past tenses in the Bantu-
family.
In example (12) and (13) the imperfect and progressive aspect in Haya are illustrated.
The imperfect aspect is marked by high tone on either the final vowel a or the verbal stem
gur. The progressive aspect on the other hand has a marker ni that occurs before the subject
marker giving the verb the aspectual meaning.
(14) Example of preinitial negation in Standard Swahili [swh] (Güldeman 1999: 551):
ha-tu-tak-i
1-want-
‘’Wir wollen nicht’ [we don’t want]
8
Example (14) illustrates preinitial negation in Standard Swahili. The preinitial slot contains
the negative marker ha, followed by the subject marker, the verbal stem and lastly the final
vowel marking the present tense.
In contrast, Hungan uses periphrastic negation illustrated in example (15). The auxiliary
verb ’fail’ is marked with a subject marker and a progressive marker and has the main verb as
an infinitive complement.
With this cycle, Givón wants to illustrate that the syntax of a language is indicative of
possible morphological patterns. For example, if an adverb becomes grammaticalized the word
order in a language will affect whether it becomes a prefix or a suffix. If adverbs occur before
the verb, they will become prefixes and if they occur after the verb they will become suffixes.
Traugott (2011) proposes three basic mechanisms of grammaticalization: Reanalysis, ex-
tension and repetition. Reanalysis is change in the structure of an utterance that does not
have any apparent effect on the surface structure. For example changes in word order or
morpheme boundaries. Extension is to be understood as a broader term that can include ex-
tension of a wide range of domains, such as semantic extension which expands the meaning
of an utterance. The syntax and pragmatics of an utterance can also undergo extension. The
repetition of an utterance also contributes to the grammaticalization process. Repetition can
lead to phonetic reduction and loss of referential content.
A pattern of grammaticalization can be found in African languages using the mechanisms
proposed by Traugott. Heine (2011) describes how the African languages develop their com-
plex -systems by a constant addition of new forms of tense and aspect-markers. He pro-
poses the grammaticalization chain in example (17) to describe this process.
1
The literature on this subject is extensive. I only reference the work which is of immediate relevance to this
study.
9
1. V1 takes a non-finite verb as its complement (a).
2. V1 loses some of its semantic and morphosyntactic properties, and gains some gram-
matical properties (a).
3. V1 is now an auxiliary and V2 is the main verb (b).
The result of this grammaticalization chain is an auxiliary with inflection typically associ-
ated with a main verb and a main verb that is non-finite. In extreme cases V1 will become even
further grammaticalized into a clitic or an affix. These types of constructions are commonly
found in this study, as will become apparent in chapter 4.
2.5 Summary
PhP expressions are morphologically diverse markers with opposite phasal values and polarity
that encode speaker expectations and together form a system within a language. Earlier studies
have found that a sample of European languages differs from a crosslinguistic sample in what
the most common PhP items are. Van der Auwera’s European study finds that is
the most common PhP expression, while van Baar claims that is the most frequent. The
ambition of this study is to add to the theoretical framework of PhP expressions by describing
Phasal Polarity in a sample of East Bantu languages. The Bantu family was chosen because
there is evidence of PhP expressions in the Bantu literature and the family is absent in all
previous studies.
3 Method
The sample (3.1), working definitions (3.2), data collection (3.3) and methods of analysis (3.4)
are presented in this chapter.
3.1 Sample
The 46 languages2 selected for this study are presented in alphabetical order in Table 2. The
sample languages are mainly spoken in Zambia, Zimbabwe, Tanzania, Mozambique and South
Africa and belong to 15 different branches of the East Bantu sub-family. In Table 2 these
different genealogical groups are called clusters. The clusters are presented in appendix 6.
2
A map illustrating the distribution of the sample can be found here: http://www.arcgis.com/home/
webmap/viewer.html?webmap=8daad53ce7d349159b13f1aee5d1b116
10
East Bantu was chosen because a) it was deemed a good place to start investigating this phe-
nomenon, since no previous studies on Phasal Polarity has been conducted in this area and b)
it has the best descriptive literature of the African languages. This gives the study a bibliog-
raphy bias, but could not be avoided since this study relies solely on descriptive grammars for
data collection.
Table 2: Sample
e sample
Language ISO code Country Cluster G. zone
Bemba bem Zambia Sabi M42
Bena (Tanzania) bez Tanzania S. T. Highlands Bantu G63
Chopi cce Mozambique S. Bantu-Makua S61
Chuwabu chw Mozambique N. Mozambique Bantu P34
Gitonga toh Mozambique S. Bantu-Makua S62
Ila ilb Zimbabwe Botatwe M63
Kalanga kck Botswana Shona S16
Kaonde kqn Zambia Central Western Bantu L41
Kgalagadi xkv Botswana Sotho-Tswana S 31d
Koti eko Mozambique N. Mozambique Bantu P311
Lala-Bisa leb Zambia Sabi M51
Lamba lam Zambia Sabi M54
Lenje leh Zambia Botatwe M61
Makhuwa vmw Mozambique N. Mozambique Bantu P31
Makonde kde Tanzania Rufiji-Ruvuma P23
Makwe ymk Tanzania Rufiji-Ruvuma G402
Manyika mxc Zimbabwe Shona S13a
Matumbi mgw Tanzania Rufiji-Ruvuma P13
Mwani wmw Mozambique Coastal NEC Bantu G403
Ndali ndh Malawi Corridor Bantu M301
Ndengereko ndg Tanzania Rufiji-Ruvuma P11
Ngoni ngo Mozambique S. T. Highlands Bantu N12
Nsenga nse Mozambique Sabi N41
Nyakyusa-Ngonde nyy Malawi Corridor Bantu M31
Nyamwezi nym Tanzania Sukuma-Nyamwezi F22
Nyanja nya Mozambique Tumbuka-Sena-Nyanja N31a
Pedi nso South Africa Sotho-Tswana S32a
Sena (Malawi) swk Malawi Tumbuka-Sena-Nyanja N441
Shona sna Zimbabwe Shona S10
11
Continuation of Table 2
Language ISO code Country Language family G. zone
Soli sby Zambia Botatwe M62
Southern Sotho sot South Africa Sotho-Tswana S33
Sumayela Ndebele nbl South Africa Nguni-Tsonga S408
Swati ssw Swaziland Nguni-Tsonga S43
Tawara twl Mozambique Shona S11
Tewe twx Mozambique Shona S13b
Tonga (Zambia) toi Zambia Botatwe M64
Totela ttl Zambia Botatwe K41
Tsonga tso Mozambique Nguni-Tsonga S53
Tswa tsc Mozambique Nguni-Tsonga S51
Tswana tsn Botswana Sotho-Tswana S31a
Tumbuka tum Malawi Tumbuka-Sena-Nyanja N21
Venda ven Zimbabwe Sotho-Makua-Venda S21
Xhosa xho South Africa Nguni-Tsonga S42
Yao yao Mozambique Rufiji-Ruvuma P21
Zimbabwean Ndebele nde Zimbabwe Nguni-Tsonga S44
Zulu zul South Africa Nguni-Tsonga S42
Total: 46
As mentioned in 2.3, Bantu languages are notoriously hard to classify; they are also un-
derdescribed. This leads to the main limitation of the sample. To be able to capture as much
diversity as possible, ideally both the genealogical and geographical distribution would have
been considered when constructing the sample (Croft 2002: 21). This has, however, proven
difficult due to the number of languages that lack descriptive grammars and the sample has
thus been limited to a convenience sample. Nonetheless, great care was taken in selecting
languages with as even a distribution as possible. Several languages belonging to genealogical
clusters where also chosen to enable small scale historical comparative analysis.
12
• Scenario 2: A surprisingly early scenario where the speaker expected the event to
occur later.
• Scenario 3: A surprisingly late scenario where the speaker expected the event to
occur sooner.
(18) Example of in Hausa [hau] (van Baar 1997: 84):
datra úwa nóron man
doctor still he.is
‘’The doctor isn’t here yet’
In example (18) the negative continuativity of is apparent because of the fact that
there is a period of time, ongoing at the time of the utterance, where the doctor is not there,
but that the speaker expects the doctor to come.
(19) Example of in French [fra] (van Baar 1997: 75) [own glossing]:
L’arbre fleurit déjà
the=tree blossom..3 already
‘’The tree is blossoming already (it didn’t bloom before)’
In example (19) it is apparent that is an inchoative expression since it marks the
point where the tree starts blooming, which is now and not before. This example is neutral
and belongs to scenario 1.
Example (20) shows that is continuative since the speaker had to sign before, and still
has to sign at the moment of the utterance. Therefore the situation is ongoing.
Example (21) shows that is terminative since it marks the point when Peter has
stopped driving.
13
3.3 Data collection
The PhP expressions where collected according to the working definitions described in 3.2 and
are presented in appendices 1–4. Since interaction and correlation with tense, aspect and other
grammatical categories have been discussed in earlier studies, information about the perfect
and continuous tenses and standard negation were also collected to enrich the discussion.
A study that relies as heavily on reference grammars as this one is bound to encounter
problems associated with reliability. The ideal grammar is a both current and detailed descrip-
tion of a language from which the relevant data can be extracted. However, such a grammar is
not always possible to find, especially in the case of Bantu languages, as many grammars are
superficial or outdated. There are two main concerns when using an outdated or superficial
grammar. The problem most relevant to this study is the fact that an old grammar might be
missing features simply due to the lack of knowledge of a certain domain at that time. As
Phasal Polarity is a relatively new field, there is a considerable risk of it being overlooked in
a grammar from the early 1900’s. The second problem is that the presentation conventions
might also have changed over the last century, as well as the languages themselves, making
both the data and analysis unreliable. The above-mentioned problems are difficult to solve be-
cause of the lack of secondary sources. While it is important to mention the insufficiency of a
study exclusively based on grammars, it is equally important to note that it is also a necessary
stepping stone for further research.
3.4 Analysis
Glossed examples are a prerequisite for any type of linguistic analysis. As straightforward
as this might sound, it proved problematic early on in the study, as many grammars did not
contain glossed examples. Below follows a typical example.
It is apparent from the example that no morpheme boundaries can be identified and that the
meanings of said morphemes are unclear. This problem was solved by glossing the examples
containing the relevant markers by comparing them to the chapters in the grammar describing
the verb structure, or verbal template, and the relevant affixes. The examples were glossed
according to the Leipzig glossing rules, with one exception (Comrie et al. 2008). Since the
exact meaning of the verbal stem could not always be established, it was left in the glossing
as .
The lack of glossed examples posed yet another problem. When working with the agglu-
tinative Bantu language family, bound markers are expected to be found. However, because
of the lack of glossing, unless the bondedness was mentioned explicitly in the text, the or-
thography of the examples was used as the main source when deciding whether a morpheme
was bound or free. This makes the conclusions drawn regarding the bondedness of the PhP
expressions unreliable. Below follows a typical example.
14
Example (23) possibly shows morpheme boundaries by inserting spaces between each of
them, making all parts of this structure seem like unbound markers. It might be the case that
they are actually bound, but that the presentation in the grammar make them look like free
forms. This is not discussed or mentioned anywhere in the grammar, forcing the bondedness
criteria to be based solely on the orthography.
Kramer’s parameters are mainly applicable to the semantic content of PhP expressions.
Again, due to the limited data in the sources, information about the semantic content has
been difficult to come by. Therefore Kramer’s parameters are not used when discussing the
findings in this study.
In his study van Baar (1997) used questionnaires to investigate PhP expressions. This
method, of course, provides more in depth information than a study using reference gram-
mars as the only source. However, within the scope of this study, I have not been able to use
questionnaires.
It is important to note that under the the present circumstances, the results can demon-
strate tendencies within the sample, but should not be used to make any wider claims.
4 Results
In this chapter the results of the study are presented following the order of the research ques-
tions.
PhP expressions are very diverse in regards to their form, but some genealogical tendencies
can be discerned (see. 5.3). For a detailed description of all PhP expressions, see appendices
1-4. Table 3 shows the crosslinguistic frequency of PhP expressions among the 46 sample
languages. This includes all expressions that correspond to the definitions in 3.2.
The most common PhP item is , which occurs in 78 percent of the languages.
The second most common marker is with a 59 percent occurrence rate. and
are much less common, as they only occur in 35 and 26 percent of the sample
respectively. Ordering the markers from most common to least common the following East
Bantuversal hierarchy (inspired by van der Auwera’s Euroversals) can be proposed:
15
(24) The East Bantuversal accessibility hierarchy for PhP
> > >
For a more detailed description of the distribution of PhP expressions in each language,
see appendices 1-4. The PhP systems found in the sample are illustrated in Table 4.
PhP systems
PhPs per language Function No. of lgs ISO
Four 6 [chw][sot]
[ssw][tsn]
[ven][zul]
5 [bem][cce]
Three [ilb][xkv]
[xho]
5 [kck][nso]
[twx][ttl]
[tso]
1 [nbl]
8 [lam][leh]
Two [wmw][nyy]
[nya][twl]
[tsc][yao]
2 [eko][nde]
1 [kqn]
11 [toh][vmw]
One [kde][ymk]
[mxc][mgw]
[ngo][swk]
[sna][sby]
[toi]
1 [bez]
1 [ndg]
None - 5 [leb][ndh]
[nse][nym]
[tum]
Altogether, ten types of systems could be found, although many of these only have one
or two occurrences. As shown in Table 4, six languages have all four PhP expressions and
16
five have none at all. In total, eleven languages have three PhP items each, but in different
constellations. Five of these languages have excluded , five have excluded ,
and one language has excluded . The two most common PhP item to exclude in a
system with three expressions are and . For a more detailed overview of
the PhP systems, see Appendix 5.
Eleven languages were found to have systems with two PhP expressions. Eight of these
systems contain and . Two languages have systems with and ,
and one language has a system with and . The most common two-item system
in the sample is, by far, one with and .
Systems with only one PhP expression were found in thirteen languages. This is the most
common PhP system. Eleven of these have , one has and one has . The
languages with only one item seem to favour over all other PhP expressions, since
this is the most common PhP system found.
Makhuwa is one of the languages with only , illustrated in figure 2. The language
specific PhP system is illustrated with plus signs next to the existing markers.
(25) Example of in Makhuwa [vmw] (Van der Wal 2009: 106):
mí nki-ná-n-koh-é
. .1-1-ask-
‘’I haven’t asked him yet’
The PhP system in Makhuwa only has the PhP expression , which is expressed
with a prefix, ná, affixed to the negated verb.
The second most common arrangement, with eight occurrences in the sample, is a two-
marker system containing and . Figure 3 illustrates this kind of system found in
Nyakyusa-Ngonde.
Examples (26) and (27) show the PhP expressions and in Nyakyusa-Ngonde.
17
(26) Example of in Nyakyusa-Ngonde [nyy] (Persohn 2017: 186):
tʊ-kaalɪ tʊ-kʊ-job-a
1 1-speak-
‘’we still speak / we are still speaking’
Nyakyusa-Ngonde has a PhP system with two PhP items, and . These are
both expressed with an auxiliary, kaalɪ, that takes two different complements. This auxiliary
is used to express that a situation still holds from a later point in time. When it is marked by a
subject marker, forming a periphrastic construction with an inflected verb as its complement,
it takes on the meaning of .
When it takes an infinitive complement, like in example (27), the meaning changes to that
of . Therefore, unlike in the example for Spanish in 2.1.2, it is the complement that
decides the meaning of kaalɪ, not negation.
Six languages have all PhP expressions. An example of a complete PhP system is illustrated
by data from Southern Sotho in Figure 4.
The PhP system in Southern Sotho uses three different markers to express all four PhP
items. One marker is used to express , another is used exclusively for , and the
third marker is used both in constructing and . Examples (28)–(31) il-
lustrate the PhP items in Southern Sotho. No language with a complete system possesses a
grammar with glossed examples, therefore the glossing of each example was completed ac-
cording to 3.3.
(28) Example of in Southern Sotho [sot] (Doke & Mofokeng 1957: 213) [own gloss-
ing]:
ha-kē-es’o-rek-ē
1.-buy-.
‘’I do not yet buy; I have not yet bought’
is expressed with the marker es’o used in conjunction with standard negation in
a bound construction.
18
(29) Example of in Southern Sotho [sot] (Paroz 1946: 96) [own glossing]:
kē sa rut-a
1 teach-
‘’I still teach’
Unlike es’o, is expressed with an auxiliary, sa, which is used together with the main
verb.
(30) Example of in Southern Sotho [sot] (Paroz 1946: 95) [own glossing]:
kē-se kē-rut-a
1 1-teach-
‘’I already teach’
(31) Example of in Southern Sotho [sot] (Paroz 1946: 95) [own glossing]:
kē-se kē-sa-rut-e
1 1-teach-.
‘’I teach no more’
What is described as another auxiliary, se, is used to express both and .
In this construction the subject marker is found on both the auxiliary and the main verb. In
the case of , there is a negation marker sa, that is described as a dependent negation
marker.
There is also a group of ten languages of which half have excluded and half have
excluded . The PhP system in Chopi has all PhP expressions except , in
a system with both periphrastic and bound constructions. Examples (32)–(34) show the PhP
expressions in Chopi.
(32) Example of in Chopi [cce] (Smyth 1902: 95) [own glossing]:
ni-sanga-von-a
1.-see-
‘’I have not yet seen’
is expressed with the marker sanga, directly before the stem of the verb.
(33) Example of in Chopi [cce] (Smyth 1902: 38) [own glossing]:
se-ni-von-ile
1-see-
‘’I have already seen’
19
is expressed with a prefix, se, and the perfect tense.
(34) Example of in Chopi [cce] (Smyth 1902: 95) [own glossing]:
ni-nga-di ni-ci-von-a
1 1-see-
‘’I still see’
The construction meaning is more complex and contains markers that are not ex-
plained. In Chopi, is expressed using a periphrastic construction with an auxiliary, di,
marked by both the subject marker and a marker of unknown meaning, nga, which is described
as a ’tense marker’. The main verb is inflected with both a subject marker and a consecutive
tense marker. Except for the fact that this utterance means , exactly how the consecutive
tense marker and the ”tense marker” nga affect the meaning is unknown.
In Totela three PhP items could be found. This system uses two markers to express the
three different PhP items, illustrated in examples (35)–(37).
20
4.2 Morphological status
Research question number three is answered in this section:
Table 5 illustrates the morphological status of all PhP items. The largest category in Table
5 is labeled ’Unclear’. This means that the morphological status is not mentioned explicitly in
the reference grammar. Because of the problems regarding the orthography and bondedness
described in 3.3 the results presented here should be considered tentative at best.
occurs as a prefix six times and an auxiliary six times. The other occurrences
are few, for example there are two examples of particles and one suffix. occurs as
an adverb, auxiliary and prefix three times each, and only once as a particle. There are six
examples each of being an auxiliary or a prefix, and two of it being a particle. The
other categories are only found once. occurs four times as a prefix and once as
an auxiliary. Below, the most common types of the PhP expressions are given in examples
(38)–(44).
(38) Example of in Matumbi [mgw] (Odden 1996: 66) [own glossing]:
ni̧-na-kalaanga li̧
1.-fry
‘’I haven’t yet fried’
As shown in Table 5, most commonly occurs as either a prefix or an auxiliary. The
prefix na in example (38) is described as a negative persistive marker with no known positive
counterpart. This prefix occurs with the negative marker li̧.
21
(39) Example of in Makwe [ymk] (Devos 2008: 341):
ináamba kuúlya
:1-be:yet. 15.eat.
‘’I have not eaten yet’
The auxiliary námba in example (39) is a polarity-item described as meaning ’be (not) yet’.
It always co-occurs with negation and takes an infinitive as complement.
(40) Example of in Zimbabwean Ndebele [nde] (Bowern et al. 2002: 186):
se-si-dl-ile
1-eat-
‘’We’ve already eaten’
Just as in the case with , the most common types of markers are prefixes
and auxiliaries, two categories that tie with the adverb. Example (40) shows as a
prefix in Zimbabwean Ndebele. It occurs word-initially with the verb in the perfect tense.
(41) Example of in Koti [eko] (Schadeberg & Mucanheia 2000: 147) [own glossing]:
ka-áz-’ó-o-sííl-a
1-hear-
‘’I have already heard it (before)’
(42) Example of in Bena [bez] (Morrison 2011: 269) [own glossing]:
ndi-pí-gon-a
1-walk-
‘’I’m still walking’
(43) Example of in Mwani [wmw] (Floor 2010: 15) [own glossing]:
n-ingarí ku-fyóm-a
1 -read-
‘’I am still reading’
In Mwani, is an auxiliary in the form of ingarí, and is inflected with a subject marker
and takes an infinitive as a complement. is most often described as a prefix, as
in example (44), where the PhP expression consists of the prefix sá together with standard
negation.
(44) Example of in Sumayela Ndebele [nbl] (Ziervogel 1959: 88) [own glossing]:
a-tí-sá-dl-i
3-eat-.
‘’They no longer eat’
22
Table 6: Bondedness
The table above describes the bondedness of the PhP expressions. Half of the -
markers are bound, 39 percent are used in a periphrastic construction and eleven percent are
unbound morphemes. The marker is also mainly bound, at 63 percent, followed by a 15
percent section of periphrastic constructions and 22 percent free forms. follows
the same structure. Of the examples found, 58 percent are bound, 25 percent are used in a
periphrastic construction and 17 percent are unbound. The one marker that differs from this
pattern is , which has 56 percent free forms, 19 percent periphrastic markers and 25
percent bound items.
According to section 2.4.4, which explains the grammaticalization process, the bonded-
ness of the PhP expressions indicates the level of grammaticalization. is therefore the
most grammaticalized marker, followed by and . The high percentage of
periphrastic constructions for and indicate that they are currently undergoing
grammaticalization (see example (16)).
In short, the answer to research question number three is that the PhP expressions are
of very varied morphological status, however, some tendencies can be discerned. The most
common categories for , and is auxiliary and prefix. is
commonly found to be a prefix, an auxiliary or an adverb. , and all
have a high tendency to be grammaticalized, while does not.
4.3 Semantics
Research questions number four and five are treated in this section:
The function of the PhP expressions were investigated with great difficulty. The lack of
detail in many grammars often provides the reader with no more than an English translation
of the marker, without discussing its usage and meaning further. In three cases was
described as a CE-marker (Guerois 2015; Schadeberg & Mucanheia 2000; Van der Wal 2009),
or to have ’expectant implications’ (Stevick 1960; Canonici 1996). In one case was
described as an affirmative CE-marker, and in another it was described as inceptive (inchoa-
tive) (Schadeberg & Mucanheia 2000; Canonici 1996). is described on four occasions
as marking an action which has previously been going on and still continues at the time of
23
speech (Gadelii n.d.; Ziervogel 1952; Kiso 2012; Persohn 2017). is not described as
anything other than a negated form of .
Even though the function is not mentioned explicitly in the grammars, the markers po-
sition in the verbal template can be used to speculate about its status in the grammar of the
relevant language. Table 8 shows in what slot in the verbal template (2.4.1) the PhP expressions
occur.
The unbound or periphrastic constructions are not applicable, hence the last category. The
majority of the bound -markers are found in the postinitial slot, directly preceded by
the subject marker. The postinitial slot is described by Güldeman (1999) as being reserved for
-marker. The bound -markers show the same tendency, with 16 examples found in
the postinitial slot. has seven markers in the postinitial slot. is, again,
deviant with only one marker in the postinitial slot, and three in the prenitial slot directly
preceding the subject marker. The preinitial slot is described by Güldeman (1999) as being
reserved for and polarity. There are only two cases where the PhP expression occurs in
the postfinal slot, which is reserved for clause type/object/polarity.
Lack of detail in many grammars also led to difficulties in answering question number
five. What can be said about the restrictions put upon PhP expressions by other categories
is not much. It is, however, clear that they often occur alone, without any other tense or
aspect marker. Although not restricting them, there are a few exceptions where there is a co-
occurrence of both a PhP-marker and a -marker. Table 7 illustrates which PhP expressions
occur with which -markers in each language. Table 8 shows the co-occurrence of PhP
expressions and . Due to its size Table 8 is located on the next page.
Seven out of 36 -markers occur together with a -marker. Five out of these are
past tense markers. Six out of 16 -markers occur with the perfect tense. occurs
four times in all tenses and once only with the continuous tense. shows similar
tendencies with two occurrences in all tenses and one in the perfect tense.
24
Table 8: Phasal Polarity and
The answer to question number four is that the majority of the bound PhP expressions
seem to function like -markers since they occur in the postinitial slot otherwise reserved
for -markers. It is also very rare for all PhP expressions to occur with another -marker.
This shows that they are fully incorporated in the -system of the language.
To answer research question number five: Most often, PhP expressions can not occur to-
gether with other -markers, but the few exceptions show a tendency towards a specific
correlation between a PhP item and a -category.
4.4 Summary
In section 4.1, the East Bantuversal accessibility hierarchy for PhP was proposed to illustrate
what PhP expressions are the most common in East Bantu. The most common expression is
, followed by , with and being far less common. The PhP
systems in the samples proved to be diverse but the two most common systems contained
either only or and .
The morphological status of the markers proved to be diverse as well. The majority of
the , and -markers are either prefixes or auxiliaries. devi-
ates in the sense that it was found to be an adverb as well as a prefix and an auxiliary. The
bondedness of the markers indicate that the majority of , and are
grammaticalized in the languages. is found to be mostly unbound which indicates it
is not grammaticalized.
The majority of the bound markers occur in the postinitial slot, otherwise reserved for
/polarity, in the verbal template proposed in 2.4.1. The majority of the markers also do
not co-occur with any other -markers. This indicates that they are incorporated in the -
system, and function in a way similar to other -markers.
25
5 Further correlations and discussion
This section is organized as follows. The results presented in the previous section and the
results of previous studies are discussed (5.1). The co-occurrence of PhP expressions and stan-
dard negation (5.2.1), continuous aspect (5.2.2), perfect aspect (5.2.3) is discussed, together with
other relevant findings made when comparing the data.
(45) Example of in Yao [yao] (Sanderson 1922: 187) [own glossing]:
ni-gini-ni-ich-e
1.1-come-
‘’I had not yet come’
26
Table 9: Co-occurrence of negative PhP items and standard negation
Yao constructs the expression with the marker using gini. This prefix occurs
between two subject markers. The interesting deviation in this construction is the final vowel.
In other words, both negative and positive tenses, Sanderson describes the final vowel as a.
The other two cases where the final vowel is e is in the immediate future tense and in the
subjunctive mood. The future tense includes an additional prefix, but the subjunctive mood is
only characterised by the change of the final vowel from a to e.
This raises the question whether the subjunctive has anything to do with the PhP expres-
sion. That would mean that the counterexpectational part of links the construction
to the subjunctive, that has similar semantic content. Hypothetically, it is imaginable that this
expression is derived from the subjunctive like this: ’I may come later´ meaning ’I have not
yet come (but I expect to later)’.
Example (46) gives the structure of in Chopi.
(46) Example of in Chopi [cce] (Smyth 1902: 95) [own glossing]:
ni-sanga-von-a
1.-see-
‘’I have not yet seen’
The prefix sanga is described briefly by Smyth and Matthews (1902: 95) as meaning
, but no further explanation is given. Standard negation in Chopi consists of a prefix, a,
and a final vowel i, which does not occur in the example. In this case, seems to be
completely grammaticalized into a single prefix. However, upon examining the section in the
grammar describing the perfect, a negative prefix was found: nga. This form is also used to
negate the subjunctive, in both cases together with the negative prefix a.
If this negative prefix is part of the marker described as meaning , there is a number
of interesting implications. It is possible that in Chopi is derived from either the
perfect or the subjunctive, with the marker sa originally meaning , a -marker, being
negated by the subjunctive or perfective negative prefix nga. Again, it is possible that
has been derived from a construction originally in the subjunctive mood.
Example (47) gives the structure of in Nyakyusa-Ngonde.
(47) Example of in Nyakyusa-Ngonde [nyy] (Persohn 2017: 187):
jɪ-kaalɪ ʊ-kʊ-py-a
9 15()-be(come).burnt-
‘’It’s not yet done.’
Persohn (2017) describes the marker kaalɪ as a persistive marker that can mean either
or depending on the complement. Possible reasons why the infinitive complement
27
changes the meaning of the marker to are not discussed. This type of construction is,
however, common as noted by both Bernander (2017: 263) and Veselinova (pc).
Table 10 gives the bases of derivation of .
2
0
Not derived 34
Although usually occurs with standard negation it does not seem to be derived
from any other PhP expressions, as shown in Table 10. There are only two examples of
being derived from , and no examples of it being derived from .
Table 11 gives the bases of derivation of . does not behave like
and is for the most part derived from .
9
1
Not derived 2
Comparing and , it soon became evident that was most often
derived from . This was the case in nine out of twelve languages. Two languages showed
no similarities between the two PhP expressions, and one was derived from . Example
(48)-(49) showcase the most common way this was done.
(48) Example of in Pedi [nso] (Ziervogel 1954: 70) [own glossing]:
ki-sa-khov-a
1-sit-
‘’I still sit, I am still sitting’
(49) Example of in Pedi [nso] (Ziervogel 1954: 70) [own glossing]:
a-ki-sa-khov-i
1-sit-.
‘’I no longer sit’
Comparing and in Pedi is rather straightforward. The positive construc-
tion contains the subject marker, the prefix sa and the stem. The negated version starts with
the negative prefix a, and ends with a negative final vowel i. This is how all expressions of
are derived, except in Kalanga, that has a more complex structure. Examples (50)–(52)
illustrate the way Kalanga expresses and .
28
(50) Example of in Kalanga [kck] (Chebanne & Schmidt 2010: 101) [own glossing]:
ndi-tja-lim-a
1-plough-
‘’I’m still ploughing’
There are clear similarities between example (50) and (48). Both contain the subject marker,
followed by the PhP item and the verbal stem. However, unlike all other languages with a
marker, the construction in Kalanga is limited to the past and future progressive
tenses. It is also a periphrastic construction, a characteristic only shared with Tswana, whose
marker is not derived from any other PhP expression.
(51) Example of in Kalanga [kck] (Chebanne & Schmidt 2010: 101) [own gloss-
ing]:
a-ndi-towo-be ndi-tjowo-be ndi-lim-a
1. 1. 1-plough-
‘’I shall no longer be ploughing’ [future tense]
Example (51) illustrates the structure of in the future progressive tense. This
construction consists of two inflected auxiliaries and a main verb. The auxiliaries, be (to be),
are both marked with the subject marker, and the first is also negated with both the negative
prefix a and the future negative marker towo. The second auxiliary has the marker, tjowo,
which described as a contraction of tja () and nowo (). The main verb is, again, marked
for person, but with no other inflection.
(52) Example of in Kalanga [kck] (Chebanne & Schmidt 2010: 101) [own gloss-
ing]:
nda-ka-be ndi-si-nga-tja-lim-a
1 1-plough-
‘’I shall no longer be ploughing’ [past tense]
Example (52) illustrates the structure of in the past progressive tense. This
construction differs a lot from the future progressive, as it is not negated and only has one
auxiliary. The auxiliary, be, is inflected for person and marked with the past tense marker
ka. The main verb repeats the subject marker, in direct conjunction to what is described as
a participial marker, si. This also changes the final vowel from a, to e, the same as in the
participial mood. The next marker is nga, which is described as a conditional, or eventuality
marker. This marker is used to express the conditional, potential, permissive, and the optative
(Chebanne & Schmidt 2010: 128).
Unlike example (51), there is no negation marker in this construction. Exactly in what
way the participial and conditional markers contribute to changing to is not
explained. There is, however, another example of a negative PhP expression with no explicit
negation. Compare example (52) to example (26) of in Nyakyusa-Ngonde. This is also
a periphrastic construction where the main verb is an infinitive. Although not abundant, these
examples show that negation is not always needed to construct a negative PhP expression.
Olsson (2013) investigates perfective grammatical markers, iamitives, in Southeast asian
languges. These markers can also be used as PhP expressions. He concludes that the relation-
ship between negation and iamitives is that a negated in his data becomes .
Furthermore, when is negated Olsson claims it becomes . It would seem that the
Southeast Asian languages use the the continuative to derive the continuative ,
and the inchoative to derive the terminative . As illustrated above, Bantu
languages exhibit a different pattern: is derived from more often than from
, and is, for the most part, not derived from another PhP item. Southeast
Asia and Southeast Africa seem to have different lexicalization patterns.
29
5.2.2 Co-occurrence of and continuous aspect
Table 12 shows that coexists with the continuous aspect in 23 languages. 16 languages
only have a continuous tense and four languages have without a continuous tense. Three
languages showed no evidence of either.
Both 23
Only 4
Only 16
Neither 3
Example (53) and (54) illustrate a language with both continuous aspect and grammatical-
ized , namely Totela.
(53) Example of the continuous tense in Totela [ttl] (Crane 2011: 317):
ndi-li-kweesi ndi-la-yend-a
1.. 1-walk-
‘’I am walking’
Totela uses a periphrastic construction to express the continuous tense, with the auxiliary
kweesi inflected for person and a present tense marker, li. The complement is marked with the
subject marker and the non-completive marker la. Another, infinitive complement, can also
be used to express continuous aspect.
What is interesting in Totela is that and the continuous tense show different levels of
grammaticalization. The bound marker chi () is more grammaticalized than the construc-
tion used to express the continuous tense. This may indicate that the addition of a continuous
tense is a more recent evolution in the language.
Table 13 illustrates the bondedness of compared to the continuous tense. shows
12 bound constructions, six unbound and three periphrastic constructions. Two constructions
are mixed and can be expressed with both an auxiliary and a bound morpheme. The contin-
uous tense has a few more bound constructions, with 17, and five periphrastic constructions.
Only one is unbound. Over all, seems to be slightly less bound with its six unbound
morphemes, but overall the difference is small.
30
Table 13: Bondedness of and the continuous tense
Cont.
Bound 14 17
Periphrastic 5 5
Unbound 6 1
Mixed 2 0
Both 12
Only 4
Only 24
Neither 6
Total: 46
Upon closer examination, the languages with both an -marker and the perfect
tense show that is currently undergoing grammaticalization as described in section
2.3.4. In these languages there is a division into three categories based on the bondedness of
the marker. First, there are the free forms and auxiliaries, then there are the bound markers
that co-occur with the perfect tense, and finally there are the bound markers which occur on
their own. The examples below show in different stages of grammaticalization. From
these examples it is clear how is becoming more grammaticalized in some languages.
(55) Example of in Tswana [tsn] (Cole 1962: 130) [own glossing]:
kè-sétsè ké-kwàl-á
1 1-write-
‘’I am already writing’
31
Example (55) illustrates as an auxiliary inflected for person, together with the
main verb that also has a subject marker.
(56) Example of in Zimbabwean Ndebele [nde] (Bowern et al. 2002: 51):
se-si-dl-ile
1-eat-
‘’We’ve already eaten’
Example (56) is particularly interesting because of the following: First, occurs
together with the perfect in a bound construction and does not seem to occur in any other
tenses. This is fully possible since and the perfect have similar semantic properties,
causing an overlap (Dahl & Wälchli 2016). Second, Bowern et al. (2002) explain that
was originally preceded by another subject marker, making the total of subject markers two
as in example (55). The initial subject marker has now been deleted.
(57) Example of in Zulu [zul] (Canonici 1996: 103) [own glossing and translation]:
se-ngi-thand-a
1-love-
‘’I already love’
Example (57) also has an initial -marker, just like the two previous examples.
Canonici (1996) explains that this marker was originally an auxiliary that had its own sub-
ject marker, but is now a prefix. He also mentions that it can co-occur with the perfect, but
that it is not obligatory.
Based on these examples the following grammaticalization pattern can be hypothesised:
It becomes apparent from this hypothesis that the languages in the sample, and especially
those in the example, are in different stages of this development. The example languages even
share a similar -marker: se or sétsè. Other auxiliaries with the same meaning is the
Sumayela Ndebele sele and se in Southern Sotho (example (23)). It seems possible that an
increased usage and/or a development from an auxiliary to a bound form could reduce sétsè
and sele to se, as it is the more common form, meaning this could have already happened in
the other languages.
32
Table 15: Clusters
Cluster No. of lgs No. of PhPs
Botatwe 5 5 1 3 1 10
Sabi 4 2 1 2 0 4
Shona 5 5 0 3 1 9
Sotho-Tswana 4 4 3 4 3 14
Rufiji-Ruvuma 5 4 1 1 0 6
Nguni-Tsonga 7 6 5 6 4 21
N. M. Bantu 3 3 2 1 1 7
T-S-N 3 1 0 1 0 2
Total: 36 30 13 21 10
the clusters with three or more languages were used in the analysis. This means that 10 lan-
guages were excluded.
Table 15 gives an overview of the PhP items found in the clusters. The clusters are: Bo-
tatwe, Sabi, Shona, Sotho-Tswana, Rufiji-Ruvuma, Nguni-Tsonga, North Mozambique Bantu
and Tumbuka-Sena-Nyanja. Nguni-Tsonga and Sotho-Tswana are the two clusters with the
highest number of complete systems, and the overall highest concentration of PhP items with
21 and 14 items each. It seems like the southern part of the sample, below central Zimbabwe,
is where PhP expressions are most common. This is especially true if Chopi and Venda are
added, since they also have complete PhP systems and are geographically located close to the
two other branches.
The more northern part of the sample (Botatwe, Sabi, Shona) and the more north-eastern
part (Rufiji-Ruvuma, North Mozambique Bantu) all have a lower frequency of PhP, but still
have a high frequency of . It seems like is favoured in these parts of the
sample.
Table 16 illustrates the three clusters where has a similar form. A total of 16 out
of 27 expressions share these similarities, adding up to 59 percent of the sample. The com-
mon denominator seems to be a fricative, usually s, followed by the vowel a. Two of the
branches are more closely related, raising the possibility of this being a genealogical feature.
Both Nguni-Tsonga and Sotho-Tswana are part of the Southern Bantu-Makua sub-family, but
Shona branches of earlier. Shona, however, is geographically located just north of the other
two clusters, so there is a possibility of contact-related spread of this form of .
Table 17 illustrates the three branches in which share a similar form. The com-
mon denominator here seems to be na. In total, 11 of 36 items, or 31 percent, share a common
form; a much lower number than for . Two of these clusters are not as closely related as
Sotho-Tswana and Nguni-Tsonga either. Botatwe and Rufji-Ruvuma branch off directly under
33
Table 16: Proto-
Proto-
the East Bantu sub-family, while North Mozambique Bantu is part of the Southern Bantu-
Makua branch. Even though they are much less closely related, they still share this common
marker. This could be because they are fairly close together geograpically. The Rufiji-Ruvuma
and North Mozambique Bantu clusters are located along the coast of Tanzania and Mozam-
bique. Botatwe is mainly spoken in Zambia, very far away from the two other clusters, and is
therefore the odd one out. In both Botatwe and North Mozambique Bantu is mainly
bound, while in Rufiji-Ruvuma the expression is bound in half of the languages and used in a
periphrastic construction in half. This suggests that the form na is newer in the Rufiji-Ruvuma
cluster than in the other clusters.
A historical comparative analysis relies mainly on sound changes. Without more diachronic
data, any historical comparative analysis will be difficult. Nonetheless, looking at the fren-
quencies of the markers it is possible to hypothesise a proto-form of both and ,
although not on the Proto-Bantu level. The most frequent form of in the Nguni-Tsonga,
Sotho-Tswana and Shona branches is sa. The most frequent form of in the Botatwe,
34
Table 17: Proto-
Proto-
North Mozambique Bantu and Rufiji-Ruvuma branches is na. Therefore the proto-form of
could be sa, and for it could be na or naamba. As stated in 2.3.4. the gram-
maticalization process can result in phonetic reduction. Therefore the longest form, naamba,
might also be the oldest. But without futher documentaion, it is difficult to draw any certain
conclusions. In any case, both and appear to be inherited rather than borrowed
features in these clusters.
6 Conclusions
A summary of the answers to the research questions, the characteristics of PhP expressions
and Bantu-specific tendencies, as well as recommendations for further research is presented
in this section.
35
The forms of the PhP expressions vary. For a description of each PhP item, see appendices
1–4.
The PhP systems found in the sample are diverse. Six languages have a system con-
taining all PhP expressions and five have none. Eleven languages have systems contain-
ing different combinations of three items. The most common three-item systems are
// and // . Another eleven languages have systems
with two items. The most common two item system is /. Thirteen languages
only have one PhP item, of which twelve are .
The morphological status of the markers was not mentioned in most grammars. However,
in the cases where this was mentioned, , and were most commonly
found to be prefixes or auxiliaries. was found to be a prefix or an auxiliary as often
as an adverb.
The bound PhP items show tendencies of being incorporated in the -system of their re-
spective languages, hence functioning like other -markers. No other functions were found.
The majority of the PhP expressions did not occur with any other -markers, indicating
that they might be incompatible. No other restrictions were found.
was the most common marker found in the sample. It is most often a prefix or
an auxiliary and was found to be a bound marker in half of the occurrences. The bound
-markers occurred in the postinitial slot in the verbal template. was not found to
be derived from another PhP expression.
was the second most common marker found in the sample. It is most often a prefix
or an auxiliary and was found to be a bound marker in more than half of the occurrences. The
bound -markers occurred, like , in the postinitial slot in the verbal template.
was not commonly found in the sample. This marker was found to have varied
morphological status and was often a free form. Therefore the verbal template was inapplicable
in the majority of cases. A third of the -markers co-occurred with the perfect.
was not commonly found in the sample. The majority of the -
markers were found to be derived from . Therefore, they share the same characteristics.
Three tendencies indicate that PhP expressions are a stable category in the East-Bantu
family. First, the languages in the sample show a very strong tendency to incorporate
and to a certain degree in the -system. Second, there is also a probability that
and are diachronically stable in specific clusters. Third, the languages show a tendency
to grammaticalize PhP expressions.
To better describe the semantics and usage of PhP expressions, future studies would ben-
efit from using a questionnaire like van Baar (1997). This method would avoid two problems
36
associated with grammars. It would increase the chance of finding complete systems, as the
questionnaire would be focused solely on PhP expressions, unlike grammars. This method
would also provide data that reflects the current usage of PhP items in the target language, as
opposed to the many grammars used in this study that are out-dated.
37
References
Andrews, J.R. 2003. Introduction to Classical Nahuatl. University of Oklahoma Press.
Botne, R.D.O., & Schafer, L. 2008. e Chindali Language of Malawi: A grammatical sketch of
Cindali, Malawian variety. Lightning Rod Press. American Philosophical Society.
Bowern, C., Lotridge, V., & Alling, E. 2002. Ndebele. Languages of the world: Materials. Lincom
Europa.
Canonici, N.N. 1996. Zulu Grammatical Structure. Zulu Language and Literature, University
of Natal.
Carter, H., & Kahari, G.P. 1979. Kuverenga Chishóna: An Introductory Shona Reader with Gram-
matical Sketch. School of Oriental and African Studies, University of London.
Carter, H., & Kahari, G.P. 1986. Shona Language Course. Shona Language Course, nos. v. 1–3.
African Studies Program, University of Wisconsin.
Carter, H., & Kashoki, M.E. 2002. An outline of Chitonga grammar. Bookworld Publishers.
Chebanne, A.M., & Schmidt, D. 2010. Kalanga: Summary Grammar. CASAS book series.
Centre for Advanced Studies of African Society (CASAS).
Comrie, Bernard, Haspelmath, Martin, & Bickel, Balthasar. 2008. e Leipzig glossing rules:
Conventions for interlinear morpheme-by-morpheme glosses.
Crane, T.M. 2011. Beyond Time: Temporal and Extra-temporal Functions of Tense and Aspect
Marking in Totela, a Bantu Language of Zambia. Ph.D. thesis, University of California.
Dahl, Östen, & Wälchli, Bernhard. 2016. Perfects and iamitives : two gram types in one gram-
matical space. Letras de Hoje, 51(3), 325–348.
Dembetembe, N.C. 1987. A Linguistic Study of the Verb in Korekore. Supplement to Zambezia.
University of Zimbabwe, Publications Office.
Doke, C.M. 1938. Text Book of Lamba Grammar. Witwatersrand University Press.
Doke, C.M., & Mofokeng, S.M. 1957. Textbook of Southern Sotho grammar. SOLINET/ASERL
Cooperative Microfilming Project. Longmans, Green.
38
Gadelii, K.E. A grammatical sketch of Xitshwa. August, 1998.
Garrido, J. 1992. Expectations in spanish and german adverbs of change. 26(01), 357–402.
Givón, T. 1971. Historical Syntax and Synchronic Morphology: An Archeologist’s Field Trip. Vol.
7. The Chicago Linguistic Society. Pages 394–415.
Grimes, B.F. (ed). 2000. Languages of the world: ethnologue. SIL International.
Grollemund, Rebecca, Branford, Simon, Bostoen, Koen, Meade, Andrew, Venditti, Chris, &
Pagel, Mark. 2015. Bantu expansion shows that habitat alters the route and pace of human
dispersals. Proceedings of the National Academy of Sciences.
Guerois, Rozenn. 2015. A grammar of Cuwabo (Bantu P34, Mozambique). Ph.D. thesis.
2015LYO20032.
Güldeman, Tom. 1999. The Gensis of Verbal Negation in Bantu and its Dependency on Func-
tional Features of Clause Types. Pages 545–587 of: Hombert, & Hyman (eds), Bantu Historical
Linguistics. Calif: CSLI Publications.
Heine, Bernd, Claudi, Ulrike, & Hünnemeyer, Friederike. 1991. Grammaticalization : a concep-
tual framework. Chicago: University of Chicago Press.
Kirsch, B., Skorge, S., & Magona, S. 1999. Xhosa. Teach yourself books. NTC Publishing Group.
Kiso, A. 2012. Tense and Aspect in Chichewa, Citumbuka and Cisena: A Description and Com-
parison of the Tense-aspect Systems in ree Southeastern Bantu Languages. Ph.D. thesis,
Department of Linguistics, Stockholm University.
Lanham, L.W. 1955. A study of giTonga of Inhambane. Bantu linguistic studies. Witwatersrand
University Press.
Löbner, Sebastian. 1989. ’Schon – erst – noch’: An integrated analysis. Linguistics and Philos-
ophy, 12(2), 167–212.
Lukusa, S.T.M., & Monaka, K.C. 2008. Shekgalagari grammar: a descriptive analysis of the
language and its vocabulary. CASAS book series. Centre for Advanced Studies of African
Society (CASAS).
Madan, A.C. 1906. Wisa Handbook: A Short Introduction to the Wisa Dialect of North-East
Rhodesia. Clarendon Press.
Madan, A.C. 1908. Lenje Handbook: A Short Introduction to the Lenje Dialect Spoken in North-
west Rhodesia. Clarendon Press.
39
Maganga, C., & Schadeberg, T.C. 1992. Kinyamwezi: Grammar, Texts, Vocabulary. East African
languages and dialects. R. Köppe.
Maho, Jouni. 2003. A classification of the Bantu languages: an update of Guthrie’s referential
system. In: Nurse, & Philipson (eds), e Bantu languages. Routledge.
Miti, L.M. 2001. A Linguistic Analysis of Cinsenga: A Bantu Language Spoken in Zambia and
Malawi. CASAS book series. Centre for Advanced Studies of African Society (CASAS).
Ngonyani, D.S. 2003. A Grammar of Chingoni. Languages of the world / Materials: Materials.
Lincom.
Nurse, D. 2003. Aspect and tense in Bantu languages. In: Nurse, D., Philippson G. (ed), e
Bantu Languages. Routledge.
Odden, D.A. 1996. e Phonology and Morphology of Kimatuumbi. Phonology of the world’s
languages. Clarendon Press.
Olsson, Bruno. 2013. Iamitives : Perfects in Southeast Asia and beyond. M.Phil. thesis, Stockholm
University, General Linguistics.
Paroz, R.A. 1946. Elements of Southern Sotho. Basutoland, Morija Sesuto Book Depot.
Persohn, B. 2017. e verb in Nyakyusa: A focus on tense, aspect and modality. Contemporary
African Linguistics. Language Science Press.
Plungian, Vladimir A. 1999. A typology of phasal meanings. Pages 311–321 of: Abraham,
& Kulikov (eds), Tense-aspect, transitivity and causativity: Essays in honour of Vladimir 24
Nedjalkov. John Benjamins Publishing.
Schadeberg, T.C., & Mucanheia, F.U. 2000. Ekoti: e Maka Or Swahili Language of Angoche.
East African languages and dialects. Köppe.
Smyth, B., Matthews J. 1902. A vocabulary with a short grammar of Xilenge. Society For
Promoting Christian Knowledge.
40
van Baar, Theodorus Martinus. 1997. Eurotyp: 3: Phasal Polarity. Amsterdam: IFOTT.
van der Auwera, J. 1998. Phasal Adverbials in the Languages of Europe. Mouton de Gruyter.
Pages 25–145.
van der Merwe, D. F., Schapera I. 1943. A comparative study of Kgalagadi, Kwena, and other
Sotho dialects. University of Cape Town.
Van der Wal, J. 2009. Word order and information structure in Makhuwa-Enahara. LOT inter-
national series. LOT.
van Eeden, B.J.C. 1936. Notes on the Grammar of Soli. Nasionale Pers, Beperk.
Wright, J.L. 2007. An Outline of Kikaonde Grammar. Language in Zambia. Bookworld Pub-
lishers and UNZA Press.
Ziervogel, D. 1954. e Eastern Sotho: a tribal, historical, and linguistic survey (with ethno-
graphic notes) of the Pai, Kutswe, and Pulana Bantu tribes in the Pilgrim’s Rest District of
the Transvaal Province, Union of South Africa. SOLINET/ASERL Cooperative Microfilming
Project. J.L. Van Schaik.
41
A Appendix 1
Language ISO-code Form of Source
Bemba bem ta-?-la-?-a Hoch (n.d.: 21)
Bena (Tanzania) bez Morrison (2011)
Chopi cce SM-sanga-STEM-FV Smyth (1902: 39)
Chuwabu chw NEG.SM-ná-STEM Guerois (2015: 383)
Gitonga toh NEG-SM-si-INF-STEM-NEG Lanham (1955: 173)
Ila ilb NEG.SM na INF STEM-FV Smith (1907: 165)
Kalanga kck a-SM-thu + verb in the past Chebanne & Schmidt
consecutive (2010: 107)
Kaonde kqn Wright (2007)
Kgalagadi xkv NEG ntse SM STEM-NEG van der Merwe (1943:
71)
Koti eko NEG-SM-ná-STEM-e Schadeberg & Mucan-
heia (2000: 122)
Lala-Bisa leb Madan (1906)
Lamba lam NEG.SM-nga-STEM-FV Doke (1938: 284)
NEG-SM-nga-STEM-FV
Lenje leh NEG-SM-ná-STEM Madan (1908: 38)
Makhuwa vmw NEG.SM-ná-STEM-e Van der Wal (2009: 106)
Makonde kde SM-ka-nàa(va) INF-STEM- Kraal (2005: 242)
FV SM-ka-nàamba INF-
STEM-FV
Makwe ymk NEG.SM-náamba INF.cl15- Devos (2008: 341)
STEM-FV
Manyika mxc NEG-SM-PST.NEG-ti + verb Stevick (1960: 171)
in the recent past participle
Matumbi mgw SM-ná-STEM-FV NEG Odden (1996: 66)
Mwani wmw NEG-námba INF-STEM-FV Floor (2010: 15)
Ndali ndh Botne & Schafer (2008)
Ndengereko ndg Ström (2013)
Ngoni ngo NEG.SM-kona NEG-INF- Ngonyani (2003: 87)
STEM
Nsenga nse Miti (2001)
Nyakyusa-Ngonde nyy SM-kaali AUG-INF-STEM- Persohn (2017: 187)
FV
42
continued
Language ISO-code Form of Source
Nyamwezi nym Maganga & Schadeberg
(1992)
Nyanja nya NEG-SM-RECPST-STEM- Kiso (2012: 157)
SBJV-be
Pedi nso NEG-SM-ki SM-STEM-FV Ziervogel (1954)
Sena (Malawi) swk Kiso (2012)
Shona sna NEG-SM-satí Carter & Kahari (1986:
37)
Soli sby PST-NEG.PST-na-SM-STEM van Eeden (1936: 25)
PST-SM-NEG-na-SM-STEM
Southern Sotho sot NEG-SM-es’o-STEM-FV Doke & Mofokeng
(1957: 213)
Sumayela Ndebele nbl Ziervogel (1959)
Swati ssw SM-se SM-NEG-STEM-FV Ziervogel (1952: 133)
Tawara twl NEG-SM-cha-AUX SM-INF- Dembetembe (1987:
STEM-FV 101)
Tewe twx NEG-sa-ti + verb in the past Carter & Kahari (1979:
participle 41)
Tonga toi NEG-SM-níngá-STEM-FV Carter & Kashoki
NEG-SM-ná-STEM-FV (2002: 61)
Totela ttl NEG-SM-ini INF-STEM-FV Crane (2011: 337)
Tsonga tso NEG SM si STEM-FV Ouwehand (1978: 108)
Tswa tsc NEG SM.CONT ha STEM- Gadelii (n.d.: 22)
NEG
Tswana tsn NEG-SM-ísé + verb in the ha- Cole (1962: 130)
bitual subjunctive
Tumbuka tum Kiso (2012)
Venda ven NEG-SM-athu-STEM-FV Poulos (1990: 344)
Xhosa xho NEG-SM-ka-STEM-FV Kirsch et al. (1999: 205)
Yao yao SM-kana-SM-STEM-e Sanderson (1922: 56)
ngana-SM-STEM-FV
Zimbabwean Ndebele nde NEG-SM-zake SM-STEM- Bowern et al. (2002: 45)
SUBJ
Zulu zul NEG-SM-ka-STEM-NEG Canonici (1996: 105)
43
B Appendix 2
Language ISO-code Form of Source
Bemba bem apo pene Hoch(n.d.: 51)
Bena (Tanzania) bez Morrison (2011)
Chopi cce se-SM-STEM-PRF Smyth (1902: 38)
Chuwabu chw já SM-PRF-STEM Guérios (2015: 363)
Gitonga toh Lanham (1955 )
Ila ilb kle Smith (1907: 213 )
Kalanga kck Chebanne & Schmidt (2010)
Kaonde kqn kala Wright (2007: 31)
Kgalagadi xkv la sala v.d Merwe and Schapera (1943: 84)
Koti eko SM-áz INF-STEM-FV Schadeberg (2000: 147)
Lala-Bisa leb
Lamba lam Doke (1938)
Lenje leh Madan (1908)
Makhuwa vmw van der Wal (2009)
Makonde kde Kraal (2005)
Makwe ymk Devos (2008)
Manyika mxc Stevick and Machiwana (1960)
Matumbi mgw Odden (1996)
Mwani wmw Floor (2010)
Ndali ndh Botne (2008)
Ndengereko ndg pyaa Ström (2013: 251)
Ngoni ngo Ngonyani (2003)
Nsenga nse Miti (2001)
Nyakyusa-Ngonde nyy Persohn (2017)
Nyamwezi nym Maganga and Schadeberg (1992)
Nyanja nya Kiso (2012)
Pedi nso Ziervogel (1954)
Sena (Malawi) swk Kiso (2012)
Shona sna Carter (1986)
Soli sby van Eeden (1936)
Southern Sotho sot SM se SM STEM-a Paroz (1946: 95)
Sumayela Ndebele nbl sele, biyo Sekhosana (2009: 330)
44
continued
Swati ssw (SM)-se SM-STEM-FV Ziervogel (1952:
132 )
Tawara twl Dembetembe
(1987)
Tewe twx Carter and Ka-
hari (1979)
Tonga (Zambia) toi Carter (2002)
Totela ttl Crane (2011)
Tsonga tso Ouwehand (1965)
Tswa tsc Gadelii (1999)
Tswana tsn SM-sétsè verb in the present participle Cole and Mokaila
(1962: 130)
Tumbuka tum Kiso (2012: 235-
236)
Venda ven SM-PRF-no-STEM-FV Poulos (1990:
342)
Xhosa xho sele/sel/se Kirsch and
Scorge (1999:
123)
Yao yao Sanderson: 1922
Zimbabwean Ndebele nde se-SM-STEM-PERF Bowern et al.
(2002 : 51)
Zulu zul se-SM-STEM-a Canonici (1995:
95)
45
C Appendix 3
Language ISO code Form of Source
Bemba bem ci Hoch(n.d.: 29)
Bena (Tanzania) bez SM-pí-STEM-FV pele SM-i- Morrison (2011: 271-
STEM-FV 272)
Chopi cce SM-NEG-di SM-CONS- Smyth (1902: 38)
STEM
Chuwabu chw naváno Guérios (2015: 323)
Gitonga toh Lanham (1955 )
Ila ilb SM chi STEM Smith (1907: 140, 148)
Kalanga kck SM-tja-STEM-a Chebanne and Schmidt
(2010: 106)
Kaonde kqn SM-(ka)-ki-STEM Wright (2007: 32)
Kgalagadi xkv SM sha STEM-CONT Lukusa & Monaka
(2008: 145)
Koti eko Schadeberg (2000 )
Lala-Bisa leb Madan (1906)
Lamba lam SM-ci-STEM Doke (1938: 269)
Lenje leh SM-chi-STEM SM-achi- Madan (1908: 35)
STEM
Makhuwa vmw van der Wal (2009)
Makonde kde Kraal (2005)
Makwe ymk Devos (2008)
Manyika mxc Stevick and Machi-
wana (1960)
Matumbi mgw Odden (1996)
Mwani wmw SM-ingari INF-STEM-FV Floor (2010: 15)
Ndali ndh Botne (2008)
Ndengereko ndg Ström (2013)
Ngoni ngo Ngonyani (2003)
Nsenga nse Miti (2001)
Nyakyusa-Ngonde nyy SM-kaali SM-PRS-STEM-FV Persohn (2017: 184)
Nyamwezi nym Maganga and Schade-
berg (1992)
Nyanja nya SM-PST.PFV-STEM-FV-be Kiso (2012: 150)
Pedi nso Ziervogel (1954)
46
continued
Language ISO code Form of Source
Sena (Malawi) swk Kiso (2012)
Shona sna Carter (1986)
Soli sby van Eeden (1936)
Southern Sotho sot SM sa STEM-a Paroz (1946: 96)
Sumayela Ndebele nbl SM-sá-STEM-FV Ziervogel (1959: 88)
Swati ssw SM-sa-STEM-FV Ziervogel (1952: 97)
Tawara twl SM-sa-STEM-FV Dembetembe (1987:
133-134)
Tewe twx SM-cha-STEM-a Carter and Kahari
(1979: 28)
Tonga (Zambia) toi Carter (2002)
Totela ttl SM-chi-STEM-FV Crane (2011: 325)
Tsonga tso SM ha TM STEM Ouwehand (1965: 107-
108)
Tswa tsc SM ha STEM Gadelii (1999: 22)
Tswana tsn SM-sà-STEM. SM-sántse SM- Cole and Mokaila
STEM-PART (1962: 121)
Tumbuka tum Kiso (2012)
Venda ven SM-kha-di-STEM-FV Poulos (1990: 334)
Xhosa xho SM-sa-STEM Kirsch and Scorge
(1999: 37-38)
Yao yao SM-AUX ?-FUT-STEM- Sanderson (1922: 88-
FUT.FV 89)
Zimbabwean Ndebele nde Bowern et al. (2002)
Zulu zul SM-sa-(OM)-STEM-a Canonici (1995: 101)
47
D Appendix 4
Language ISO code Form of Source
Bemba bem Hoch(n.d)
Bena (Tanzania) bez Morrison (2011)
Chopi cce Smyth (1902)
Chuwabu chw já NEG-SM-STEM-PRF Guérios (2015: 363)
Gitonga toh Lanham (1955)
Ila ilb Smith (1907)
Kalanga kck 1. NEG-SM-NEG.FUT-AUX Chebanne and Schmidt
SM-STILL.FUT-AUX SM- (2010: 111, 120)
STEM-FV 2. PST-SM-PRF-
PST-AUX SM-PART-COND-
STILL-STEM-PART.FV
Kaonde kqn Wright (2007)
Kgalagadi xkv v.d Merwe and
Schapera (1943)
Koti eko Schadeberg (2000)
Lala-Bisa leb Madan (1906)
Lamba lam Doke (1938)
Lenje leh Madan (1908)
Makhuwa vmw van der Wal (2009)
Makonde kde Kraal (2005)
Makwe ymk Devos (2008)
Manyika mxc Stevick and Machi-
wana (1960)
Matumbi mgw Odden (1996)
Mwani wmw Floor (2010)
Ndali ndh Botne (2008)
Ndengereko ndg Ström (2013)
Ngoni ngo Ngonyani (2003)
Nsenga nse Miti (2001)
Nyakyusa-Ngonde nyy Persohn (2017)
Nyamwezi nym Maganga and Schade-
berg (1992)
Nyanja nya Kiso (2012)
Pedi nso NEG-SM-sa-STEM-NEG.FV Ziervogel (1954: 70)
48
continued
Language ISO code Form of Source
Sena (Malawi) swk Kiso (2012)
Shona sna Carter (1986)
Soli sby van Eeden (1936)
Southern Sotho sot NEG SM sa STEM-a Paroz (1946: 9)5
Sumayela Ndebele nbl NEG-SM-sá-STEM-NEG.FV Ziervogel (1959: 88)
Swati ssw NEG-SM-sa-STEM-NEG.FV Ziervogel (1952: 102 )
Tawara twl Dembetembe (1987 )
Tewe twx NEG-SM-cha-STEM-FV Carter and Kahari
(1979: 28)
Tonga (Zambia) toi Carter (2002)
Totela ttl NEG-SM-chi-(TM)-STEM- Crane (2011: 327 )
FV
Tsonga tso NEG SM ha TM STEM- Ouwehand (1965: 107-
NEG.FV 108)
Tswa tsc Gadelii (1999)
Tswana tsn NEG-SM-tlhòlé SM-STEM- Cole and Mokaila
PART (1962: 130)
Tumbuka tum Kiso (2012)
Venda ven NEG-SM-tsha-STEM-FV Poulos (1990: 338)
Xhosa xho sele/sel/se Kirsch and Scorge
(1999: 123)
Yao yao Sanderson (1922)
Zimbabwean Ndebele nde Bowern et al. (2002)
Zulu zul NEG-SM-sa-(OM)-STEM- Canonici (1995: 101)
NEG.FV
49
E Appendix 5
PhP Systems
Language ISO-code
Bemba bem Yes Yes Yes -
Bena (Tanzania) bez - - Yes -
Chopi cce Yes Yes Yes -
Chuwabu chw Yes Yes Yes Yes
Gitonga toh - - -
Ila ilb Yes Yes Yes -
Kalanga kck Yes - Yes Yes
Kaonde kqn - Yes Yes -
Kgalagadi xkv Yes Yes Yes -
Koti eko Yes Yes - -
Lala-Bisa leb - - - -
Lamba lam Yes - Yes -
Lenje leh Yes - Yes -
Makhuwa vmw Yes - - -
Makonde kde Yes - - -
Makwe ymk Yes - - -
Manyika mxc Yes - - -
Matumbi mgw Yes - - -
Mwani wmw Yes - Yes -
Ndali ndh - - - -
Ndengereko ndg - Yes - -
Ngoni ngo Yes - - -
Nsenga nse - - - -
Nyakyusa-Ngonde nyy Yes - Yes -
Nyamwezi nym - - - -
Nyanja nya Yes - Yes -
Pedi nso Yes - Yes Yes
50
PhP Systems continued
Language ISO-code
Sena (Malawi) swk - - - -
Shona sna Yes - - -
Soli sby Yes - - -
Southern Sotho sot Yes Yes Yes Yes
Sumayela Ndebele nbl - Yes Yes Yes
Swati ssw Yes Yes Yes Yes
Tawara twl Yes - Yes -
Tewe twx Yes - Yes Yes
Tonga (Zambia) toi Yes - - -
Totela ttl Yes - Yes Yes
Tsonga tso Yes - Yes Yes
Tswa tsc Yes - Yes -
Tswana tsn Yes Yes Yes Yes
Tumbuka tum - - - -
Venda ven Yes Yes Yes Yes
Xhosa xho Yes Yes Yes -
Yao yao Yes - Yes -
Zimbabwean Ndebele nde Yes Yes - -
Zulu zul Yes Yes Yes Yes
51
F Appendix 6
Clusters
Language ISO-code Country Cluster
Bemba bem Zambia Sabi
Lala-Bisa bez Zambia Sabi
Lamba cce Zambia Sabi
Nsenga chw Mozambique Sabi
Ila toh Zimbabwe Botatwe
Lenje ilb Zambia Botatwe
Soli kck Zambia Botatwe
Tonga (Zambia) kqn Zambia Botatwe
Totela xkv Zambia Botatwe
Kalanga eko Botswana Shona
Manyika leb Zimbabwe Shona
Shona lam Zimbabwe Shona
Tawara leh Mozambique Shona
Tewe vmw Mozambique Shona
Kgalagadi kde Botswana Sotho-Tswana
Pedi ymk South Africa Sotho-Tswana
Southern Sotho mxc South Africa Sotho-Tswana
Tswana mgw Botswana Sotho-Tswana
Makonde wmw Tanzania/Mozambique Rufiji-Ruvuma
Makwe ndh Tanzania Rufiji-Ruvuma
Matumbi ndg Tanzania Rufiji-Ruvuma
Ndengereko ngo Tanzania Rufiji-Ruvuma
Yao nse Mozambique Rufiji-Ruvuma
Sumayela Ndebele nyy South Africa Nguni-Tsonga
Swati nym Swaziland Nguni-Tsonga
Tsonga nya Mozambique Nguni-Tsonga
Tswa nso Mozambique Nguni-Tsonga
Xhosa swk South Africa Nguni-Tsonga
Zimbabwean Ndebele sna Zimbabwe Nguni-Tsonga
Zulu sby South Africa Nguni-Tsonga
52
Clusters continued
Language ISO-code Country Cluster
Chuwabu sot Mozambique North Mozambique Bantu
Koti nbl Mozambique North Mozambique Bantu
Makhuwa ssw Mozambique North Mozambique Bantu
Ndali twl Malawi Corridor Bantu
Nyakyusa-Ngonde twx Malawi Corridor Bantu
Tumbuka toi Malawi Tumbuka-Sena-Nyanja
Nyanja ttl Mozambique Tumbuka-Sena-Nyanja
Sena (Malawi) tso Malawi Tumbuka-Sena-Nyanja
Chopi tsc Mozambique Southern Bantu-Makua
Gitonga tsn Mozambique Southern Bantu-Makua
Bena (Tanzania) tum Tanzania Southern Tanzania Highlands Bantu
Ngoni ven Mozambique Southern Tanzania Highlands Bantu
Kaonde xho Zambia Central Western Bantu, Luban
Mwani yao Mozambique Coastal NEC Bntu
Nyamwezi nde Tanzania Sukuma-Nyamwezi
Venda zul Zimbabwe Sotho-Makua-Venda
53
Stockholms universitet/Stockholm University
SE-106 91 Stockholm
Telefon 08 - 16 20 00
www.su.se