Professional Documents
Culture Documents
Research Online
University of Wollongong Thesis Collection University of Wollongong Thesis Collections
2012
Recommended Citation
Buchan, Heather, Phonetic variation in Gurindji Kriol and northern Australian English: a longitudinal study of fricatives in maternal
speech, thesis, Faculty of Education, University of Wollongong, 2012. http://ro.uow.edu.au/theses/3789
A thesis submitted in fulfilment of the requirements for the award of the degree
Doctor of Philosophy
from
University of Wollongong
by
Faculty of Education
2012
iii
Abstract
sound structures (phonology) of the speech input. Speech to children often contains
phonological modifications, and across languages the speech input to which children
are exposed contains phonetic variation associated with linguistic and socioindexical
speech (e.g. Davis & Lindblom, 2001; Fernald et al., 1989; Kuhl et al., 1997; Snow,
directed speech (e.g. Bernstein-Ratner, 1984a, 1984b; Lee, Davis & MacNeilage,
2008; Lee & Davis, 2010). There are, however, few studies examining how and when
phonetic variation in speech to children changes after infancy, as they get older. This
lexical forms and phonology from both the traditional language Gurindji and Kriol, an
consonant reduction in casual speech. There is, however, little prior research on
phonetic variation that children are exposed to in Australian English. The purpose of
speech in Gurindji Kriol and northern Australian English as children aged from
issues in phonetic transcription when the transcribers are non-native speakers of the
language.
interactions in Gurindji Kriol created by Felicity Meakins between 2003 and 2007 as
part of the Aboriginal Child Language project (ACLA-1, Simpson & Wigglesworth,
2008). Speakers were three Gurindji Kriol speaking women recorded at three
timepoints, when the focus children were approximately 1;6, 2;0 and 2;6. We
potentially be pronounced with a fricative, based on words that had fricatives in their
English cognates. Results showed that words containing stop-fricative variation were
more likely to be open-class than closed-class, with the variable segment most
all tokens with potential fricatives, the likelihood of fricatives in word-initial position
significantly increased when children were 2;6. In tokens of words found to contain
mothers’ speech at child age 2;6. Analyses took into account phonological
naturalistic family interactions was recorded and phonetically transcribed for this
study. Speakers were five mothers who were native speakers of Australian English,
and were recorded when their children were approximately 1;6, 2;0 and 2;6. We
analysed deletion in word-initial /h/ and word-final /v/, common processes in casual
overall deletion over time within a stable set of lexical items. Between child ages 1;6
and 2;0 deletion proportionately increased in mothers’ speech, while between 2;0 and
with native speakers of Gurindji Kriol. Native speakers have implicit knowledge of
ambiguities arising from perceptual bias, and for furthering our interpretation of
phonetic variation in Gurindji Kriol. We used visual analogue scales to elicit native
Gurindji Kriol. Native speaker judgements on the scales were then compared to the
both agreements and discrepancies between native and non-native speakers for
take into account variable phonetic detail in the input. Pedagogical implications are
also discussed (in Chapter 7) in terms of how teachers can use information about
Gurindji children’s home language to augment their language and literacy teaching.
vi
Acknowledgements
individuals who have made this thesis possible and my PhD experience one that I will
cherish.
opportunities she has afforded me and for her endless support and patience. Caroline
encouraged me to pursue my own research ideas while also contributing her extensive
knowledge and expertise, and I thank her for her contribution to this thesis and to my
whose involvement and insight helped to develop my ideas and my writing. I am very
owe thanks to Felicity for sharing with us her corpus of Gurindji Kriol and allowing
us to build on her extensive previous work in this project. I would also like to
1), which was the basis of Felicity’s work on the Gurindji Kriol corpus, and I am
thankful to the ACLA PIs Jane Simpson and Gillian Wigglesworth for supporting the
research in this thesis. I also acknowledge the Australian Research Council for
financial assistance with my stipend, field work and conference trips (ARC Discovery
language”, 2009-2012, C.I. Caroline Jones) and I thank Caroline for supporting my
This research could not exist without the many people who assisted in my
fieldwork, especially the speakers and their families, community leaders and
thank Gurindji Kriol speakers Cecelia Edwards, Anne-Maree Reynolds and Samantha
Smiler for giving us permission to use the recordings of their families that Felicity
Meakins had made. I am also incredibly grateful to the community research assistants
in Kalkaringi: Trisha Morris, Kirsty Smiler, Jessica Vincent, Samantha Smiler, Anne-
Maree Reynolds, Lisa Smiler, Leanne Smiler, and Rosemary Johnson. They taught
me a lot, both for the research and on a personal level, and I treasure the friendships
we have made.
project; their advice and direction was invaluable. I am very grateful to Leah Leaman
community visits. I am very appreciative of our evening chats in the garden and
Leah’s extensive knowledge of the community and her readiness to share this with
me. I wish to thank all the people in Kalkaringi and Daguragu who made my trips
I am very grateful to the Katherine mothers Cathy, Eva, Kellie, Kim and Erin
(pseudonyms used at participants’ discretion) and their children and families, who let
me into their homes and their lives. They always greeted me with a smile and were
patient when I had to fiddle around with equipment. It was truly a pleasure to work
with them and I feel privileged to have watched their kids grow. I am also grateful to
all the organisations and businesses in Katherine who helped me to set up the project
and find participants. I wish to particularly thank the childcare services Good
viii
Beginnings and Little Mangoes. Also I thank the linguists and others in Katherine
I would like to thank the project steering committee at Diwurruwurru Jaru (the
Katherine Language Centre) for their advice and recommendations and for letting me
sit and work in the Language Centre library in my first trip to Katherine. I particularly
thank Ruth Joshua for teaching about Kriol and kinship systems. I am also grateful to
Cerise King for her voluntary cultural mentoring. Cerises’s insight and advice was
always helpful.
Kim Cayzer, Rachel Groves, Sarah Cutfield, and Alison Hannah. I also thank Yvan
Rose and the team at Memorial University for their support with the phonetic
transcription and analysis program Phon and for always providing helpful (and
My Building 23 (and 22) buddies have made this whole process fun and
supportive, and I feel so lucky to have formed many lasting friendships. Thank you to
Rosie Welch and a certain energy drink, Jonnell, Narumi, Sophie, Charles, Kyle,
Ashley Sisco for the chocolate activities, prime coffee buddy Alex Miller, Sam
McMahon for feeding me and her special contributions to the office stationary, and
Kay Prcevich who has been an amazing officemate for the entire time.
Finally, I wish to thank all my friends and family who have supported me
along the way. I especially thank Flick for being such a great friend, Alissa for the
cloudy apple nights, MV for the mojo, and Yan for her optimism and friendship. My
parents Rosemary and Grant Buchan, my sisters, grandparents and other family
members have always been supportive and I give them my heartfelt thanks.
ix
Table of Contents
List of Tables
Table 3.1 Gurindji Kriol speaking mothers and age of the focus children........... 52
Table 3.2 Total amounts of recording and number of mothers’ utterances in the
Table 3.3 Characteristics of Katherine English speaking mothers and the focus
children ................................................................................................. 57
Table 3.4 Total amounts of recording and number of mothers’ utterances in the
Table 5.2 Total and mean lengths of recording sessions, hh:mm:ss................... 142
Table 5.3 Raw frequencies of word-initial /h/ deletion out of possible contexts by
Table 5.4 Logistic regression for word-initial /h/ deletion: Child age and
Table 5.5 Raw frequencies of word-final /v/ deletion out of possible contexts by
Table 5.6 Logistic regression for word-final /v/ deletion: child age and
Table 5.7 Lexical items containing word-initial /h/ and word-final /v/ deletion at
Table 5.8 /h/ deletion in male and female pronouns by mothers of boys (n=2) and
Table 6.2 Number of each type of judgements in the Visual Analogue Scales
(N = 83)............................................................................................... 183
List of Figures
Figure 1.1 Map of the Victoria River District and its communities
(Meakins, 2011)...................................................................................... 8
Figure 3.4 Using the segmentation tool to create a record that is linked to the
Media .................................................................................................... 71
Figure 4.1 Proportions of closed-class and open-class words within lexical form
categories .............................................................................................. 96
Figure 4.2 Proportions of word positions of potential fricatives within lexical form
categories .............................................................................................. 97
Figure 4.6 Proportions of word categories at each child age ............................... 116
ages in variable words, other words, and the total sample ................. 117
Figure 5.1 Spectrogram of utterance "There we go, we'll just leave him there,
Figure 5.2 Percentage word-initial /h/ deletion out of word-initial /h/ contexts by
Figure 5.3 Percentage word-final /v/ deletion out of word-final /v/ contexts by child
Figure 5.4 Most frequent lexical items containing word-initial /h/ deletion at each
Figure 6.1 Example Visual Analogue Scale for [b] and [v]................................. 187
Publications
This thesis, Phonetic variation in Gurindji Kriol and northern Australian English: A
longitudinal study of fricatives in maternal speech, includes three chapters, and one part
of one chapter, that have been written as the following four journal articles. Two of
these articles have been submitted to peer-reviewed journals. Of these, one of the
articles has been published, and one is under review:
Chapter 4
Buchan, H., Jones, C., & Meakins, F. (2012). Fricative variation in maternal Gurindji
Kriol. Manuscript in preparation. To be submitted to Australian Journal of Linguistics.
Chapter 5
Buchan, H., & Jones, C. (2013). Phonological reduction in maternal speech in northern
Australian English: Change over time. Journal of Child Language. doi:
10.1017/S0305000913000123
Chapter 6
Buchan, H., Jones, C., & Meakins, F. (2012). Investigating native speaker judgements
of fricative variation in Gurindji Kriol using visual analogue scales. Manuscript in
preparation. To be submitted to Language Resources and Evaluation.
As the primary supervisor, I, Dr Caroline Jones, declare that the greater part of the work
in each article listed above is that of the candidate, Heather Buchan. In each of the co-
authored manuscripts above, Heather contributed to the study design, was primarily
responsible for data analysis and data interpretation, and led the writing.
In terms of data collection and transcription, Heather was primarily responsible for data
collection for Chapters 5 (English speech corpus) and 6 (Perceptual judgements). The
thesis also draws on a subset of a pre-existing corpus of Gurindji Kriol, recorded and
orthographically transcribed by Dr Felicity Meakins (2003-07, as part of the Aboriginal
Child Language project through University of Melbourne, funded by Australian
Research Council DP0343189). Transcripts from the pre-existing Meakins corpus were
converted to Phon format by Caroline and Heather. Phonetic transcription was added to
specific words in the transcripts by Heather and a casual research assistant, with
occasional assistance from Caroline. This formed the data for Chapter 4 and the
materials for the perceptual study in Chapter 6. Two casual research assistants
phonetically transcribed 10% of the data in Chapters 4 and 5 as a reliability check.
Gurindji Research Assistants contributed to data collection for Chapter 6 during
Heather’s visits to Kalkaringi, Northern Territory.
Regarding data analysis and writing, Heather performed all the data analyses including
statistical analyses. Heather wrote the first draft of each manuscript and was then
responsible for responding to the suggestions of her co-authors in subsequent versions
of each manuscript. The co-authors, Dr Caroline Jones (Chapters 4-6) and Dr Felicity
Meakins (Chapters 4 and 6), assisted in literature review, study design, data analysis,
CHAPTER ONE
Introduction
CHAPTER 1 2
Learning the sound structure of the home language from speech input is a
critical process in children’s language acquisition. The input provides children with
words and across word boundaries, and phonetic variability in sounds and words. The
long been recognised in the literature, with systematic studies on the phonology of
(see Snow & Ferguson, 1977). Since this early research on speech input, our
understanding of the amount and types of information that children encode from the
input has developed. Current acquisition theories recognise the critical role of the input
and posit mechanisms for how detailed phonetic features of input can shape children’s
phonological representations. However, while it has been well established that there are
little research has been done on how and when phonological variability in the input
changes as children get older – that is, as child-directed speech develops into adult-
directed speech. The overall aim of this thesis is to investigate how phonological
variation in the input to children changes from later infancy into early childhood. The
data used in these studies were primarily speech recordings transcribed phonetically
using the IPA (International Phonetic Alphabet), and a subsidiary aim of this thesis was
Gurindji Kriol.
the input children are exposed to in two language varieties of northern Australia,
Gurindji Kriol and northern Australian English. This Introduction chapter provides brief
CHAPTER 1 3
essential background to the three empirical studies that comprise the thesis. The first
and second sections set out the context of previous research into Gurindji Kriol and
thesis’, defines key terms and describes the analytical approach taken here. In the
closing section of Chapter 1, I lay out the structure of the thesis and present the
Gurindji Kriol is the main language of the Gurindji people in northern Australia.
grammatical structures of two languages, Gurindji and Kriol (Meakins, 2011, pp. 2, 12-
Victoria River District in the Northern Territory, Australia (see map in Figure 1.1).
word forms are derived from English although the grammar and semantics reflect
regional Aboriginal languages (Hudson, 1985). The grammar of Gurindji Kriol has been
studied in detail by Meakins (2009, 2010, 2011). Prior to this project there are no
detailed accounts of the phonetics and phonology of Gurindji Kriol, though the
phonological inventory has been described briefly (Meakins, in press; Meakins, 2007).
This thesis is part of a larger project on the phonology of Gurindji Kriol1 (Jones, the CI,
is the primary supervisor of this thesis). Studies in the larger project have examined
Gurindji Kriol’s vowel system (Jones, Meakins & Buchan, 2011; Jones, Meakins &
1
Australian Research Council Discovery Grant DP0985395, Phonological development
in child speakers of mixed language (CI Caroline Jones, 2009-2012)
CHAPTER 1 4
Muawiyath, 2012), voicing variation in stop consonants (Jones & Meakins, 2012a), and
The Gurindji Kriol studies in the current thesis use data from a longitudinal
and orthographically transcribed by Felicity Meakins between 2003 and 2007 as part of
the Aboriginal Child Language (ACLA-1) project (Simpson & Wigglesworth, 2008;
Meakins, 2011). For the current research, we built on this corpus to add phonetic
fricatives.
language is Gurindji Kriol, but they are also exposed to traditional Gurindji from older
speakers and to Kriol, which is spoken with Kriol-speaking visitors. The neighbouring
Aboriginal language Warlpiri is also spoken by some community members and visitors.
Children are taught in English at the school, but they do not generally hear English in
the preschool years, except in fun or role-playing (e.g., doctors and nurses). Children
are thus exposed to some code-switching into English, as well as code-switching into
other language varieties, chiefly Kriol, Gurindji, and Warlpiri, depending on their
family and connections. See Meakins (2008; 2011, pp. 58-70) for further detail on the
Gurindji Kriol is a mixed language that contains a structural split between the noun
phrase system, supplied mostly by Gurindji, and the verb phrase system, supplied
mostly by Kriol (Meakins, 2011, pp. 12-13). Importantly, both Gurindji and Kriol make
CHAPTER 1 5
contains phonology from both Gurindji and Kriol. Due to the originally English
derivation of most Kriol words, Gurindji Kriol also has some phonological similarities
the high phonological variability in Kriol (Hudson, 1985; Sandefur, 1979; Sandefur,
1991). In Sandefur’s (1979) description of Kriol spoken in the Roper River region (to
the east of the current study sites) he suggested that the high variation in Kriol
phonology is due to the way in which it changed over time. When speakers of
Aboriginal languages first came into contact with English, English words were
view, for example, fricatives and affricates were pronounced as stops and consonant
clusters were reduced. (Traditional Aboriginal languages of the region do not have
phonemic fricatives, and have restricted consonant cluster types, especially in syllable-
onset position.) Once Kriol developed, speakers continued to have contact with English
and traditional Aboriginal languages, and words were pronounced with some English
Sandefur (1979, 1991) suggests that this resulted in the high phonological variability in
contact language varieties opens up research questions about the amount and type of
variability that children are exposed to in Gurindji Kriol and the phonological structures
that they are learning from their home language in the preschool years. From an
Kriol and the sound structures children learn in their early years can provide
information to assist formal teaching of both Aboriginal languages and English. For
CHAPTER 1 6
example, sounds that are rare in Gurindji Kriol, such as some English fricatives, may
require explicit teaching, while sounds that are the same or similar in both language
varieties are generally easier to learn and can be used as a basis for teaching the harder
sounds.
environments, as well as speaker variables and the situational and stylistic context. It is
variety such as Australian English, as a reference point and to further our interpretation
of the variation that occurs in Gurindji Kriol. This thesis includes research on a type of
phonetic variation in a variety of Australian English from the town Katherine, Northern
Territory, which is in the same geographic region as Kalkaringi and is the major
documented in detail (e.g. Cox, 2006; Cox, 2012; Cox & Palethorpe, 2007; Harrington,
Cox & Evans, 1997; Mitchell & Delbridge, 1965; Tollfree, 2001; Yallop, 2003). As
well as differing from other varieties of English in some lexical items, the phonology
differs in the pronunciation of vowels and in some prosodic patterns (Harrington et al.,
1997). Australia is a multicultural country and there are dialect differences among the
many different cultural backgrounds, and there have also been some suggestions of
regional differences (Cox & Palethorpe, 2003; Loakes, Hajek & Fletcher, 2010). There
CHAPTER 1 7
is, however, a historical perception of relatively low regional variation (i.e. dialect
spoken in capital cities in the 1980s. The Sydney Urban Dialect Survey (Horvath, 1985)
South Wales. Findings revealed differences in both vowel and consonant variation
according to the speaker’s gender, age, socioeconomic status and ethnicity (Horvath,
speakers from Brisbane, Queensland and found differences in some types of phonetic
Thus I recorded a longitudinal corpus of Australian English maternal speech for this
thesis, in order to examine phonetic variation in local English as a reference point for
variety from northern Australia from the same geographical region as Gurindji people,
features as special to Gurindji Kriol when they are in common with regional Australian
English.
2
Though see AusTalk (https://austalk.edu.au/) - a project currently in the process of
collecting a large corpus of Australian English adult speech from all Australian states
and territories. Also see the Australian Voices website (Cox & Palethorpe, 2010:
http://clas.mq.edu.au/australian-voices/australian-voices) for more information and
audio samples of Australian English.
CHAPTER 1 8
Figure 1.1. Map of the Victoria River District and its communities (Meakins, 2011, p.
xxi).
segmental level. Here the terms segments and phones are used to refer to units of speech
sounds that are perceptually discrete, at least to literate adults, and possess distinct
articulatory and acoustic properties. This includes both phonemic and allophonic
CHAPTER 1 9
phones. Phonetic variation in speech input may index linguistic variables as well as
social variables. The latter is termed sociophonetic variation and includes variability in
phonetic forms that is related to speaker identities and social groups in the speech
are consonants that are produced by constricting the space between two articulators so
that the airstream is partially restricted, resulting in a turbulent airflow (Ladefoged &
Maddieson, 1996). English contains voiceless and voiced fricatives at four places of
articulation: labio-dental e.g. fat, vat [f, v], dental e.g. bath, bathe [θ, ð], alveolar e.g.
sip, zip [s, z], and palato-alveolar e.g. mission, vision [ʃ, ʒ]. In addition, [h] (who) is
traditionally considered a voiceless glottal fricative, although its status is not the same
as a typical consonant and has also been described as a voiceless vowel in some
coordination – if the airflow is fully obstructed then the sound becomes a stop, affricate,
or a fricated stop, and if there is too much space between the two articulators then it
relatively difficult for children to produce and tend to be acquired later than stops and
not occur phonemically in traditional Gurindji. (In fast Gurindji, fricatives occur
pronounced with an intervocalic voiced velar fricative.) The fricatives known from
labiodental [f], alveolar [s] and palatal [ʃ]. There is voicing variation, and fricatives are
also used in variation with stops (Meakins, in press). There is no previous research that
distributions and nature of fricatives that children are exposed to in their home language
fricatives and the acquisition of highly variable phonology in general. This would
provide information for language professionals serving the community such as teachers
exposed to at the segmental level, and how it changes over time as children age from
approximately 1;6 to 2;0 to 2;6. The thesis includes three research studies. The first
variation within and between lexical forms, and the relationships between sounds in
examined change in fricative variation in maternal speech over time as children aged.
fricatives in this variety of Australian English are related to casual speech and different
variants may carry social meanings. We examined how mothers’ use of casual phonetic
Studies one and two are based on analyses of phonetic transcription data. The
transcribers were native speakers of Australian English speakers, but there may be
differences in how Gurindji Kriol phones are perceived between native and non-native
CHAPTER 1 11
involving categorical judgements, and the transcriber’s perceptual judgement about the
category a sound belongs to depends to some extent on their own language background
(e.g. Coussé, Gillis, Kloots & Swerts, 2004; Strange & Jenkins, 1978). Depending on
the questions being investigated, native speaker phonetic transcriptions are generally
considered valid because perceptual judgements are made based on the sound categories
of the speech community (Edwards & Beckman, 2008). In the current context, however,
it was not practical to train speakers to the technical level required to do phonetic
Therefore in the third study of this thesis we used Visual Analogue Scales (VAS) to
elicit native speaker judgements of transcribed segments and compared how native
Results were used to further our interpretation of Gurindji Kriol fricative variation.
The overall aim of this thesis was to investigate variation in fricative production
in mothers’ speech in Gurindji Kriol and northern Australian English as children aged
from approximately 1;6 to 2;0 to 2;6. The overarching research questions of the
The three studies each make up a chapter of this thesis (Chapters 4, 5 and 6)
written in journal article style3 and are followed by a general conclusions chapter
(Chapter 7). In the next two chapters I discuss the literature and conceptual framework
3
Because this thesis is written in the journal article format of the University of
Wollongong, whereby some chapters are styled as journal articles, there is necessarily a
small amount of repetition in some sections, particularly the methodology.
CHAPTER TWO
Empirical and Theoretical Context of Children’s
input changes as children get older, in the age range 1;6 to 2;6, in two relatively
empirical studies (Chapters 4, 5, 6), this chapter provides a review of the literature in
child-directed speech and how it is conceptualised in the current studies and the
rationale and research questions of each empirical study in this thesis. Third, a
framework of this research. The chapter ends with summaries of the three research
occurs in male and female adult speakers and is widespread across many (if not all)
cultures and language varieties (Byrant & Barrett, 2007; Fernald et al., 1989; Grieser &
Kuhl, 1988). Child-directed speech contains simpler lexical and syntactic structures
than adult-directed speech (Ferguson, 1977), as well as a slower tempo, longer pauses,
more variability in pitch contours, and higher fundamental frequency (Fernald et al.,
1989; Katz, Cohn & Moore, 1996; Snow, 1977). Infants prefer to listen to infant-
CHAPTER 2 15
directed over adult-directed speech (Cooper & Aslin, 1990; Fernald, 1985), regardless
of the sex or language of the speaker (Fernald & Morikawa, 1993; Werker & McLeod,
1989), likely due to the positive affect conveyed in infant-directed speech (Singh,
Morgan & Best, 2002). Speaking in a register that young children prefer to listen to has
the effect of guiding their attention to the speech signal (Dominey & Dodane, 2004) and
linguistic functions of child-directed speech remain unclear and are likely to vary
depending on the type of modification made, the language variety and cultural context,
the kind of interaction taking place, and the child’s age and receptive and productive
Luksaneeyanawin, 2002; Stern, Spieker, Barnett & MacKain, 1983; Soderstrom, 2007).
study of baby-talk, Ferguson (1977) talked about baby-talk as a “simplified register” (p.
210) that contains ‘simplifying’ and ‘clarifying’ processes. These processes include use
of limited sounds and syllable forms with a tendency toward reduplicated consonant-
vowel (CV) forms (e.g., mama, dada), and substituting articulatory easy sounds for
articulatory complex ones (e.g. replacing rhotics with glides). Ferguson (1977)
language abilities to make the input easier to understand. One function of baby-talk as a
simplified speech register may be “a response to the need for improved communication
when one of the participants has only a limited ability to use language normally”
Later studies on child-directed speech in the 1980s also used the terms
greater dispersion and less overlap between vowel categories as measured by formants
glottalisation, and palatalisation (Bernstein Ratner, 1984b; Shockey & Bond, 1980).
interadult conversational speech (see Cole & Jakimik, 1980, for a review). In this
The variability related to the above processes has also been conceptualised as
casual speech and refers to when articulators do not reach the target, resulting in lenition
away from the extremes and toward the centre of vowel space (Browman & Goldstein,
1990). At the other end of the continuum, hyperarticulation characterises careful speech
(Picheny, Durlach & Braida, 1986) where segments are distinctly articulated (de Jong,
Beckman & Edwards, 1993) and vowels are more separated in acoustic space (Davis &
Lindblom, 2001). The continuum represents the dynamic between the perceived needs
of the listener and ease of articulation for the speaker (Lindblom, 1990). In the
with style-shifting, referring to the process whereby speakers accommodate their speech
to the situation and perceived needs of the audience (Labov, 1984). While social
CHAPTER 2 17
variation refers to variation due to interspeaker factors such as age, gender and
1984). These stylistic factors include the speaker’s attention to their own speech, as well
as the audience and the social meaning the speaker wishes to construct with the
audience. Situational factors such as the topic and setting also affect variation as the
speaker dynamically responds to the situation (Bell, 1984; Eckert, 2001). In the
accommodation theory, which explains the social and psychological factors that
influence how speakers vary their speech to either converge with or diverge from their
audience’s speech style (Ball, Giles & Hewstone, 1985; Giles, Coupland & Coupland,
1991). ‘Style’ and ‘register’ are often used interchangeably, though ‘register’ is also
used as a broader term encompassing both situational and stylistic variation (Finegan &
Biber, 2001). Child-directed speech may therefore be a speech register, but a better
a style continuum4.
linguistic needs of the child. Considerable research has examined the phonological
modifications caregivers make in their speech during the child’s first year of life (e.g.
Cristia, 2010; Ferguson, 1977; Fernald, 1989; Fernald et al., 1989; Fernald &
Morikawa, 1993; Grieser & Kuhl, 1988; Kirchhoff & Schimmel, 2005; Kitamura et al.,
2002; Kuhl et al., 1997; Lee & Davis, 2010; Lee, Davis & MacNeilage, 2008).
4
Though see Wassink, Wright and Franklin (2007) for evidence that variation in infant-
directed speech resembles that of style-shifting and may relate to audience design.
CHAPTER 2 18
adult speech to children after infancy and how and when these change as children get
older.
The studies that have examined phonological features of speech to older children
(e.g. Foulkes, Docherty & Watt, 2005; Ko, 2012; Liu, Tsao & Kuhl, 2009; Smith,
Durham & Fortune, 2007) are discussed in more detail in the literature reviews of
Chapters 4 and 5. Overall, these studies suggest two things. First, a combination of
social and stylistic factors affects phonological variation in speech to children as they
get older. Second, the change from child-directed to adult-directed speech may not
necessarily be a gradual linear shift in style. The input that children are exposed to in
their early years is dynamic, likely changing over time as the perceived needs of the
speech’, as these are common terms in the literature and the ones generally used by the
studies discussed above. However throughout this thesis I use the term ‘maternal
speech’, particularly when describing the speech data recorded and analysed for this
research. This is because ‘maternal speech’ is a more general term which encompasses
all of the mother’s speech to which children are exposed, rather than just speech
directed specifically to them. ‘Maternal speech’ also covers both infant- and child-
directed speech, which refer to the same concept with the use of ‘infant’ or ‘child’
depending on the age of the children in the study. In the current research children were
aged between approximately 1;6 and 2;6, which is just after infancy and into early
childhood. Much of the literature on the relationship between input and phonological
acquisition has focused on infant-directed speech, but research also shows that
important role after infancy. My rationale for using the concept of maternal speech and
the supporting evidence is discussed next. This is followed by a brief discussion of the
Child-directed speech is not the only type of input that children are exposed to,
as evidenced by van de Weijer (2002) who recorded all the speech that one infant heard
during her waking hours from age 6 to 9 months. He transcribed and analysed in depth
three weeks of these recordings (the first, middle, and last weeks) and found that only
14 per cent of utterances were spoken by an adult directly to the infant. The infant had a
sister 2 years older than her, and 60 per cent of the speech transcribed was between an
adult and the older sister. Inter-adult speech made up 19 per cent of the total, which is
still a higher proportion than the infant-directed speech. Most of the infant’s activities
during the recording period were with one or both parents, at home with the older sister
and an adult babysitter, and at a daycare centre with the older sister, other children, and
the daycare centre staff. Although this study only involves one infant, it makes sense
that the types and amount of input to which children are exposed would be highly
Studies have also shown that other types of input may be highly relevant for
children’s language learning, as children can encode and store in long-term memory at
least some aspects of speech that are merely overheard. This was demonstrated by Au,
Knightly, Jun and Oh (2002), who found that participants who overheard informal
adulthood than participants who had not been exposed to Spanish in childhood. The
(with the most exposure occurring between birth and age 6;0) but were spoken to
directly in Spanish only to a minimal extent, i.e. no more than occasional words or short
phrases (Au et al., 2002). The finding that overhearing a language in childhood had a
positive impact on adult learners’ Spanish accents has two implications. First, phonetic
information can be learned implicitly and retrieved later in life for further language
development (i.e. learning a second language). Second, children may attend to and
encode information in speech they are exposed to above and beyond that directed
specifically to them – that is, child-directed speech is not the only type of speech input
Akhtar (2005), who examined the ability of children (mean age 2;1) to learn novel
words in third-party conversations between two adults while being distracted with an
activity. Akhtar (2005) found that the presence of a distraction did not significantly
affect children’s learning of a new word, demonstrating that at age 2;1 children can not
only learn words through overhearing adult conversation, but can continue to learn
language in their early years, but children are also exposed to other sources of speech
input (e.g. from overhearing inter-adult speech) that have been known to contribute to
should include speech spoken within earshot even if it is not directed specifically to the
child. In a review of the nature of speech input to infants, Soderstrom (2007) noted the
problems with defining what exactly constitutes the input, and contended that
‘maternal’ speech from the female primary caregiver plays a key role in children’s early
In the current research, maternal speech is the focus of analysis. Maternal speech
is defined as all speech spoken by the mother around as well as to the target child. This
is not to say that the speech of fathers (and aunts, uncles, grandparents and others) is not
also important, but with the relatively small numbers of speakers used in corpus based
studies we cannot reliably analyse interspeaker differences, and so must control for
characteristics of speakers such as sex that may influence the variables being
Australian Aboriginal societies. The Meakins (2011) corpus that was built on for these
studies contains mostly women speakers, in part because it would have been culturally
inappropriate for female researchers to work with young men who speak Gurindji Kriol.
Both the Meakins (2011) Gurindji Kriol corpus and the Australian English
corpus collected for this research are longitudinal and comprise maternal speech to
children as they age between approximately 1;6 and 2;6. There is currently relatively
little research on how maternal speech changes over time between these ages, despite
research showing that input continues to play a role in phonological and lexical
development in children of these ages and beyond. The following section discusses this
research and provides the rationale for investigating phonetic characteristics of input to
There are many studies on speech input and language development that focus on
infant-directed speech in children’s first year of life. During this first year infants
acquire information about sound patterns in the ambient language. Specifically, between
ages 0;6 and 0;9 they develop sensitivity to prosodic patterns and permissible segment
sequences in the language being acquired (Jusczyk, Cutler & Redanz, 1993; Jusczyk,
Luce & Charles-Luce, 1994). Infants display sensitivity to vowel contrasts specific to
CHAPTER 2 22
their native language at age 0;6 (Kuhl, Williams, Lacerda, Stevens & Lindblom, 1992),
and by age 0;9 they prefer to listen to words that are consistent with phonetic,
phonotactic and prosodic patterns in their native language (Jusczyk, Friederici, Wessels,
Svenkerud & Jusczyk, 1993). The speech input during infancy plays a critical role in
developing sensitivities to phonetic contrasts and a knowledge base of how sounds are
phonological development. These studies suggest a feedback loop between the lexicon
and phonological processing, meaning that as the lexicon develops over time, children’s
phonological knowledge will develop as well. Edwards, Beckman and Munson (2004)
children aged 3;2 to 8;10. Nonwords contained segment sequences that have both high
and low phonotactic probabilities in the native language (English). It was found that
size, and that vocabulary size mediated the effect of frequency on nonword repetition
scores (Edwards et al., 2004). The first result suggests that a larger vocabulary supports
and reproduce new word forms, possibly because they are able to make more accurate
segments are not as well represented in the lexicon, which means children have less top-
down information to draw on when trying to produce them. That vocabulary size
CHAPTER 2 23
mediated the effect to some extent supports the argument that a larger vocabulary
2004), as there is more data in the lexicon to draw on for the creation of an articulatory
representation for the new nonword. Vocabulary growth therefore helps to refine
phonological knowledge.
the /ɹ-w/ contrast using a forced-choice identification task for children aged 3;0, 4;0 and
5;0 and adults. They found that the phonemic boundary between /ɹ/ and /w/ was closer
to the /ɹ/ endpoint for children relative to adults, and that it increasingly shifted toward
the adult-like boundary with child age. That is, older children judged more stimuli on
the [ɹ-w] continuum as belonging to the /ɹ/ category. Older children were also more
judgements (Slawinksi & Fitzgerald, 1998). Further, the children aged 5;0 made adult-
like judgements of the /w/ category, but not of the /ɹ/ category. These results show that
Munson, Swenson and Manthei (2005) also showed that lexical and
children of mean ages 4;3 and 7;2 and tested their ability to repeat real words and
words in the lexicon). Munson et al. (2005) found that neighbourhood density affected
real word repetition in the older children but not the younger children, and suggest that
because the younger children have smaller lexicons there is less competition from
phonetically similar words in lexical processing. In the nonword repetition tasks they
CHAPTER 2 24
found evidence of phonological facilitation that increased with age. The older children
probability phonotactic segments (Munson et al., 2005). Together these findings show
that both lexical competition, perhaps due to neighbourhood density, and phonological
throughout childhood and into adolescence (Hazan & Barrett, 2000). Gradient stimuli
were presented across four different phonemic contrasts to children aged between 6;0
and 12;6 and to an adult control group (mean age 29;8) in a forced-choice identification
task. To test children’s ability to integrate acoustic cues the researchers also
manipulated cues to the contrast in the stimuli so that in some conditions only one
acoustic cue was varied, while in “combined-cue” conditions multiple cues were
systematically varied. Hazan and Barrett (2000) found a significant effect of age even
after controlling for possible attention effects (by excluding participants who did not
difference between the youngest (6;0-7;6) and oldest (11;6-12;6) groups of children,
and that the largest difference was between all child groups and adults. Thus, although
the older children were significantly more consistent than the younger children in
categorising phonemic contrasts, they were still significantly below the adult level.
phonological development that set the basis for future word learning, it is also clear that
it is a gradual process continuing well into at least early adolescence. Further research is
still needed on the nature of the speech input beyond infancy. Even throughout
adulthood new input is presented that may refine phonological representations (for
CHAPTER 2 25
example, perception of a new accent may introduce phonetic variation that an adult has
not heard before but must still somehow process). Periods of large vocabulary growth
al., 2004), and children’s representations receive further refinement through the
The sound structures in speech input to children play a central role in children’s
language development. Learning to perceive and produce words in the native language
requires children to extract sound segments from the input and organise them in
memory (e.g. Jusczyk et al., 1993; Jusczyk et al., 1994; Kuhl et al., 1992; Stoel-
Gammon, 2011; Swingley, 2007, 2009; Werker, Fennell, Corcoran, Stager, 2002).
There is a large literature on the phonology of speech to children during infancy (e.g.
Cristià, 2010, 2011; Davis & Lindblom, 2001; Fernald et al., 1989; Grieser & Kuhl,
1988; Katz et al., 1996; Kirchhoff & Schimmel, 2005; Kuhl et al., 1997; Snow, 1977;
Stern et al., 1983), as this is an important period for acquiring sound contrasts and
phonotactics of the native language (Aslin, Saffran & Newport, 1998; Coady & Aslin,
2004; Jusczyk et al., 1993; Jusczyk et al., 1994; Kuhl et al., 1992). Speech input to
children consists of more than specific infant-directed and child-directed speech, and
their knowledge of phonological processes in the native language develops and they
start to consistently produce whole words (e.g. Nelson, 1973; Swingley, 2007), and
children after infancy compared with adult-directed speech. There are, however, few
longitudinal studies on phonetic variation in the input to children after infancy that
show how and when variation in child-directed speech develops as children get older.
to children in two language varieties in northern Australia, Gurindji Kriol and northern
Australian English. To realise this purpose there are two specific aims:
Gurindji Kriol and northern Australian English as their children age from
In this section I discuss the rationale for the three empirical research studies in
this thesis, that is, our motivation for investigating fricatives in Gurindji Kriol and
northern Australian English maternal speech and for eliciting native speaker judgements
in Gurindji Kriol maternal speech. The aim of this first study is to investigate the
variation in fricatives that Gurindji children are exposed to in their home language
before they go to school, where English is the main language. Gurindji Kriol is a
relatively new mixed language and the Kriol-derived words have inherently high
Kriol for several reasons. Traditional Gurindji does not contain phonemic fricatives,
though there are allophonic fricatives in some phonological contexts. Kriol, on the other
articulation. Kriol contains some fricatives due to its English influence, and these are
highly variable with stops as Kriol words often have alternate phonetic forms (Sandefur,
languages. Children tend to acquire fricatives later than stops in production, as fricatives
require more precise motor control to constrict the degree of airflow between
articulators and are therefore more difficult to produce (Dinnsen, 1992, pp.196-199;
speech to adults (Lee, Davis & MacNeilage, 2008; Lee & Davis, 2010), likely due to
differences in lexical choice (Daland, 2013). Thus across languages there appears to be
variation in children’s production of fricatives as they acquire them over time, as well as
factors, fricative variation in speech input to children has implications for our
CHAPTER 2 28
production and perception. Language varieties that do not contain the same extent of
phonetic variability as Gurindji Kriol still contain variation as a result of, for example,
phonological context effects, lexical factors, speech rate and speech style. Across
phonetic variation in the input can be perceived, encoded and incorporated into
further our understanding of the patterns of variation to which children may be exposed,
how these patterns change as children develop, and the range of factors that potentially
mothers’ speech in northern Australian English, and how it changes over time as
children age. The English variety we analysed is from the town of Katherine in northern
Australia, which is in the same geographic region as the communities where Gurindji
English does not have the same inherent high phonetic variation as Gurindji Kriol.
Instead, we examined possible stylistic variation, i.e. variation due to factors such as the
audience and formality of the speech style (for further discussion see §2.1 Literature
Review pp.16-17), taking into account phonological and lexical effects. In this study we
analysed the reduction processes word-initial /h/ deletion and word-final /v/ deletion.
Both these processes involve fricatives and in some contexts the reduction indicates
speech /h/ and /v/ deletion may be affected by child age, if mothers (consciously or not)
modify phonetic realisations of word forms as their children’s receptive and productive
language develops.
Phonetic variation is high and it is difficult to untangle the many contributing factors to
in addition to the variability inherent in the language variety, some of the variation may
have been due to mothers modifying their speech as children got older. In Study 2 we
further explored the possibility that mothers may differentially modify their speech as
children aged in another language variety, Katherine English. The reduction in fricatives
fricative variation in Gurindji Kriol. Study 1 and 2 together provide us with a bigger
picture of patterns of phonetic variation in the input children are exposed to between
ages 1;6 and 2;6 and potential influences on variation. These findings deepen our
The data for studies 1 and 2 consist of IPA phonetic transcription of naturalistic
speech recordings. Phonetic transcription is a useful clinical and analytical tool and is
subject to some perceptual biases. For the Australian English data we addressed this
5
The study of Gurindji Kriol fricatives is based on phonetic transcription and not
acoustic analysis, in part because the recordings are naturalistic and therefore contain
background noise and amplitude variation that would make spectral analysis of frication
and stop bursts relatively difficult to use.
CHAPTER 2 30
independent transcriber who is a native speaker of Australian English (e.g. Shriberg &
Lof, 1991; Edwards & Beckman, 2008). For Gurindji Kriol we also checked segmental
phonetic transcription with native speakers of Gurindji Kriol. Although native speakers
have implicit linguistic knowledge that can improve our (as non-native speakers)
understanding of their language, it was not practical to train native speakers of Gurindji
Kriol to the high technical level required for IPA phonetic transcription. This is our
broad rationale for conducting the third study included the thesis.
The aim of Study 3 was to develop and test a method for eliciting native speaker
perceptual judgements of fricative variation in Gurindji Kriol, and to use the results in
visual analogue scales, which have previously been used to investigate variation in
Schellinger, Beckman & Meyer, 2010; Schellinger, 2008; Urberg-Carlson, Munson &
Kaiser, 2009). To our knowledge, Study 3 is the first study to use visual analogue scales
described language. We found that the method was largely successful and offer
In conjunction with the results from Study 1, findings from the visual analogue
scale study provide information about the places of articulation and word positions
where fricative variation occurs in Gurindji Kriol, and where it is less likely to occur.
Comparing native speaker judgements on the visual analogue scales with non-native
background on perception of fricative variability and whether these may have affected
the findings of Study 1. Further, Study 3 gives insight into transcriber discrepancies and
ambiguities, that is, when the two Australian English phonetic transcribers disagreed or
were unsure of a transcription. Overall, the third study in this thesis allowed us to
phonetic analyses and provided further information for our interpretation of data based
This thesis contains three research articles. The overarching research questions
of each study are presented here along with the specific sub-questions.
• What is the nature and extent of fricative variation within common Gurindji
• What are the relationships between fricatives in English cognates and their
• How does fricative variation within and between Gurindji Kriol lexical
• How do rates of word-initial /h/ deletion and word-final /v/ deletion change
• Are there lexical effects on word-initial /h/ and word-final /v/ deletion? If so,
• What is the effect of speech rate on word-initial /h/ and word-final /v/
deletion?
without phonetic training, and how do native speaker perceptions add to phonetic
transcription?
• How can Visual Analogue Scales be used in the field to record native
2.4 Significance
There has been little previous research on phonetic variation in speech input to
children of the age group in this project, 1;6 to 2;6, and to the author’s knowledge this is
the first study of maternal speech in regional Australian English and is among the first
studies of phonology in Gurindji Kriol (see also Jones, Meakins & Buchan, 2011; Jones,
Meakins & Muawiyath, 2012; Jones & Meakins, 2012a, 2012b). The project contributes
to our understanding of phonetic variation that children are exposed to between ages 1;6
and 2;6, the phonetic modifications that mothers make in their speech to children at
these ages, how variation in the input changes as children get older, and linguistic and
sociolinguistic factors that influence phonetic variation in maternal speech. Findings are
discussed in terms of theoretical models that explain how variation in speech input is
processed by the listener/speaker and acquired by children. This thesis also provides
CHAPTER 2 33
avenues for further research on fricatives in Kriol varieties spoken in other parts of
Australia.
in maternal Gurindji Kriol, which may have future use in the development of measures
to assess and compare the phonological skills of child speakers of mixed languages and
Kriol spoken in northern Australia. Findings can be used to inform teachers of the likely
maternal speech. Study 3 (Chapter 6) documents a novel methodology for using Visual
Analogue Scales in the field to elicit native speaker perceptual judgements of phones.
mothers modify their speech differentially as children age, and offers some speculation
as to why this may occur. This is discussed further in the discussion sections of Studies
1 and 2. Further, by examining a situation where children are acquiring language under
section I describe and discuss the theoretical framework underpinning the empirical
that is the theory that individual perceived auditory events leave episodic traces in long-
CHAPTER 2 34
term memory that are used in categorisation tasks. At a phonological level this means
that each perceived phonetic episode, or exemplar, is encoded into long-term memory
with fine-grained detail intact. A key feature of this theory is that variation in the speech
input is retained in memory rather than filtered out as noise, thus variation contributes to
acquiring by analysing and describing the speech to which they are exposed. This
section contains a description of general exemplar theory and a brief review of the
these models are applied to children’s acquisition of phonology and a discussion of the
this thesis as they provide an account of how phonetic variation and associated
information, both linguistic and socioindexical, may be processed and used in the
categorisation tasks required for speech perception and production. The purpose of this
thesis is to examine variation in the input and potential contributing factors rather than
discussion here is of the theoretical rationale that underlies the current research and will
be limited to the general features of exemplar models relevant to this research without
making claims as to the precise mechanics of the various proposed models in the
literature.
information about concepts and memory of specific perceived events, a distinction that
is long-standing in the psychology literature. One of the earliest arguments for the
Tulving (1972), who defined episodic memory as a processing system for temporally
dated perceptual events and semantic memory as organised knowledge about general
concepts and their rules, meanings, and interrelations. Perceptual episodes can be stored
in episodic memory purely on the basis of the perceived features of episodes, while
semantic memory involves the cognitive referents and more general abstracted
information of these episodes (Tulving, 1972). In some literature the terms ‘episodes’
and ‘exemplars’ are synonymous; however, in other memory literature ‘episodes’ refers
Wheeler, Stuss & Tulving, 1997). The more general term ‘exemplar’ will therefore be
concepts are processed and stored in long-term memory. The more traditional
abstractionist theories, such as Tulving’s (1972) model, posit separate memory systems
therefore postulate abstract forms of lexical representations that are separate from the
perceptual events (e.g. Becker, 1980; Forster, 1976; Morton, 1979), with phonological
Behrmann & Bub, 1992; Coltheart, Curtis, Atkins & Haller, 1993) or in parallel with
lexical access (connectionist models, e.g. Seidenberg & McClelland, 1990). On the
other hand, exemplar theories propose that representations in long-term memory are of
specific individual episodes and abstractions emerge from this store of exemplars (for
Tenpenny, 1995).
CHAPTER 2 36
Exemplar theories are supported by the robust finding that listeners retain
information about phonetic detail after long delays, which suggest that fine acoustic
details of speech are in fact represented in long-term memory (Church & Schacter,
1994; Goldinger 1996, 1998; Goldinger & Azuma, 2004; Nygaard & Pisoni, 1998;
Palmeri, Goldinger & Pisoni, 1993; Pisoni, 1997). Exemplars are organised in clusters
exemplars associated with a category label. Exemplars can be distributed along any
dimension and so the same exemplar or attribute can contribute to many categories, for
categories such as vowel phone and socioindexical categories such as the speaker’s
gender. This is a strength of exemplar models, as it means that information from speech
input can be stored in multiple ways and different kinds of information can be extracted
and used for different tasks (Hawkins, 2003). Exemplars can therefore be thought of as
individual data points in memory, with categories emerging from clusters of these data
points. The ‘dataset’ can be drawn on to make generalisations about the language as
required.
Different models propose different mechanisms for exactly how exemplars are
encoded and stored. Depending on the model, a new perceptual event may be organised
emerge from categories, and each encoded perceptual event may either form a new
events are all stored as exemplars then each new event forms an extra case in the
exemplar dataset and will contribute to its associated categories, while in activation
models each new event contributes to the saliency of an existing exemplar. The
important point here is that either way, every encoded perceptual event slightly modifies
CHAPTER 2 37
the exemplar store and categories in long-term memory. Another important feature that
2003). I will return to this in more detail later on in this section when Pierrehumbert’s
context of general exemplar theory such a loop allows for produced speech to replicate
fine-grained phonetic detail from perceived speech and to feed back into the exemplar
processing has accumulated since at least the early nineties. One phenomenon that
priming. In lexical decision tasks repetition priming refers to the facilitation effect of a
brief earlier presentation of a word on the speed and accuracy of identification of words
in a later task. Thus, the first presentation of the stimulus ‘primes’ the second
presentation for identification. Priming effects have been found in a variety of tasks that
involve lexical retrieval (for a review see Tenpenny, 1995). Abstractionist models
representation, which then goes back to base level after a short period of time. On the
other hand, Tenpenny’s (1995) review of repetition priming in word identification tasks
showed consistent findings of a priming effect after relatively long delays, and
furthermore that the effect is enhanced when phonetic detail is matched in both
presentations. These effects suggest that not only are individual perceptual episodes (in
this case the prime) encoded into long-term memory, but also that phonetic detail
effects in recognition tasks, that is that recognition memory of speech is improved when
the prime and the stimulus to be identified (the target) are spoken by the same voice
(Church & Schacter, 1994; Nygaard & Pisoni, 1998; Palmeri, Goldinger & Pisoni,
1993). Again these effects have been found with long delays between the prime and the
The studies mentioned above involve typical adult participants, who have had
sufficient time to develop an exemplar store and form generalisations for use in
where perceptual memory traces are distributed along various dimensions, as discussed
earlier in this section, and another level involving broader, less fine-grained
generalisations that can be processed ‘on demand’ as required (Beckman et al., 2007).
Evidence for these two encoding systems comes from research on children’s
phonological processing (Munson, Edwards & Beckman, 2005; Munson, Kurtz &
Pierrehumbert’s (2003) model of how children acquire the phonology of their native
language.
correspondences, and as these levels are hierarchical they develop in that order
(Pierrehumbert, 2003). In this model categories are initiated through statistical learning
of speech segments, i.e. conditional probabilities are computed from encoded perceptual
CHAPTER 2 39
events (for more detailed accounts of infants’ statistical learning of speech segments see
Anderson, Morgan & White, 2003; Maye Weiss & Aslin, 2008; Werker & Tees, 1999).
Once categories have been initially formed, information from generalisations can be
as categories are continually refined from both bottom-up processes (i.e. perceptual
different levels of abstraction, at least some of which can feed back down into lower
representation; rather these theories have framed the current studies as they offer
concise explanations for processes explored in the thesis. The main features of exemplar
Rather than being filtered out at the encoding stage, fine-grained phonetic detail is
production-perception feedback loop from which a child can acquire the phonetic
memory, which means changes in the input over time influence the distributions
categories based on similarity across that category dimension rather than assigned
and the associated linguistic and socioindexical generalisations that may form from
the theoretical background and rationale for investigating phonetic variation in the input
children are exposed to in their home language. The overall purpose of this thesis is to
varieties that contain differing phonological variability. The next section contains
summaries of the three research studies contained in the thesis that investigate phonetic
CHAPTER 2 41
variation in maternal Gurindji Kriol and northern Australian English. This is followed
This thesis has been written in the journal article style format approved by the
context, methodology and conclusion chapters (Chapters 1, 2, 3 and 7) that have been
written in traditional thesis chapter format. Each research article has its own abstract,
background, methodology, results and discussion sections. The research chapters of the
Gurindji Kriol maternal speech and analyse the factors that potentially contribute to
factors and child age on phonetic variation in maternal speech and how they may
Sociolinguistic factors discussed include the language situation and social status of
languages spoken in the community, the potential effect of child age, and the situational
context.
In the methodology are details of the Gurindji Kriol corpus (Meakins, 2011), the
procedure of converting existing files to use in Phon (Rose, Hedlund, Byrne, Wareham
& MacWhinney, 2007) to add phonetic transcription and analysis, and the procedure for
defined based on fricatives in English cognate words. Words in the dataset containing a
‘potential fricative’ were categorised into three groups: Always pronounced with a
fricative, never pronounced with a fricative, and pronounced variably with a stop or
fricatives.
Words that contained stop-fricative variation were more likely to be open-class than
closed-class, and the variable segments were most frequently word-initial and at labio-
Kriol maternal speech and fricatives in the English cognate words, in initial, medial and
final word positions. Analysis 3 reports the results of logistic regression models that
were ran to investigate the effects of child age, phonological environment, and speaker
in all tokens and in tokens of variable words, in each word position. In initial position in
all tokens and medial position in tokens of variable words, fricatives the likelihood of
fricatives increased in mothers’ speech as children aged between approximate ages 1;6,
2;0 and 2;6. Analysis 4 describes in more detail the effects of child age on stop-fricative
lexical choice, as mothers used more words that can be pronounced variably when
children were older than when they were younger. These results provide support for the
concept of fine-tuning in speech to children, that is, that phonetic variation may change
over time in mothers’ speech as mothers make modifications to the child’s age and
language abilities.
CHAPTER 2 43
maternal speech and how it changes over time as children get older. The literature
review for this study includes previous research on the differences in phonological
suggests that in English variation in maternal speech must change over time as mothers
shift from using child- to adult-directed speech. The literature review also explores the
theory that mothers may not only fine-tune their speech to children’s linguistic
Thus the literature on sociophonetic variation in mothers’ speech is also examined here.
Finally in the literature review there is a brief discussion of the previous research on
phonetic variation in Australian English and casual speech processes that may occur in
regional varieties.
The methods section details how the Katherine English maternal speech
recordings were made and transcribed, and how searches and analyses were performed
The results section is divided into the three analyses conducted for this study,
with a brief discussion following each analysis. Analysis 1 examined the incidence of
phonological reduction in maternal speech over time, focusing on two specific reduction
processes, word-initial /h/ deletion and word-final /v/ deletion. Findings showed an
inverted-v effect: deletion in mothers’ speech was more likely when children were 2;0
than 1;6, but was unexpectedly less likely at 2;6 than 2;0. This change over time effect
was significant for /h/ but not /v/ deletion. Analysis 2 examined the local effects on
these reduction processes and it was found that changes in deletion rates occurred in
effect of speech rate on deletion and found deletion did not appear to be driven by
speech rate, as there was no significant difference in speech rate between child ages 1;6
and 2;0, and between 2;0 and 2;6 speech rate increased while deletion decreased.
The discussion considers possible explanations for the inverted-v effect found
for word-initial /h/ and word-final /v/ deletion in Katherine English maternal speech as
children aged from 1;6 to 2;0 to 2;6. Analyses 2 and 3 show that this effect is not likely
may serve different functions at different stages of linguistic and social development,
This study arose primarily from methodological questions about how we can
elicit native speaker perceptual judgements of Gurindji Kriol fricatives and use the
review discusses the use of visual analogue scales in speech perception research and
how they can be used to get continuous judgements of phonetic variation. Also
discussed here are potential biases and other limitations of phonetic transcription, in
as well as the applied context of the language situation where Gurindji Kriol is spoken
Kriol; 2) The development of visual analogue scales, including how tokens were
selected and dichotomies determined for the scales, and how the scales were constructed
CHAPTER 2 45
Gurindji Kriol who participated in this study; and 4) Administration of the visual
analogue scales to Gurindji Kriol native speakers. Native speaker responses on the
Results showed both agreements and disagreements between native speakers and
non-native transcribers on different types of judgements in initial, medial and final word
towards the stop where transcribers had judged the segment as a fricative. On medial
stop-fricative judgements native speakers sometimes judged the target segment as a stop
and sometimes responded along the continuous scale. For voicing judgements native
speakers tended to judged Gurindji Kriol segments along the continuous scale in each
word position.
and lexical expectation. Findings open up further questions and hypotheses about
This chapter provided the empirical and theoretical background of the research
presented in this thesis. First I reviewed the literature on child-directed speech and
contextualised the concept of maternal speech used here. I also discussed the literature
the input to which children are exposed. The following sections contain the aims,
research questions and subquestions of the three empirical studies that make up
Chapters 4, 5 and 6, and the significance and outcomes of this thesis. I then discussed
theoretical context and rationale for the current research. The final section of this
chapter provided summaries of the three research studies described in the thesis. The
next Chapter (Chapter 3) describes the methodologies used in these studies. The
whole, and to detail the collection and processing of the Gurindji Kriol and Katherine
English corpora. The methodology chapter also provides a general overview of the third
CHAPTER THREE
Methodology
CHAPTER 3 48
The methodology for the thesis reflects its nature as a corpus study of maternal
speech involving fieldwork in regional and remote locations, working with speakers of
the two language varieties, Gurindji Kriol and Katherine English. Chapters 4, 5 and 6
describe separate research studies that share the overarching aim to examine phonetic
variation in maternal speech in Gurindji Kriol and northern Australian English. The
specific methods used for each study are described in each of these chapters.
This chapter begins with the timeframes of data collection and a brief
description of the activities carried out on the fieldwork for this research. This is
followed by four sections that give further detail of the general methods used in this
thesis: 1) The Gurindji Kriol corpus collected by Meakins (2011) and our addition of
Katherine English corpus that was created for this project, 3) Working with speakers of
Kriol, and elicit their implicit linguistic knowledge of Gurindji Kriol phonology using
Visual Analogue Scales, 4) Using the program Phon to phonetically transcribe and
analyse naturalistic corpora of maternal speech. All data were analysed quantitatively
and the details of analyses are specified in the research papers in Chapters 4 to 6.
Data were collected in a series of four field trips to Katherine and Kalkaringi
between September 2009 and June 2011. Figure 3.2 shows the timeline and the main
activities conducted in each trip. In the first three trips, approximately six months apart,
I recorded families for the Katherine English corpus when children were aged
approximately 1;6, 2;0 and 2;6. In trip 3 I also did some phonological awareness
the fourth trip I revisited the Katherine English families to update them on the research
and check with them my description of Katherine English for Chapter 4. I also worked
with the community Research Assistants in Kalkaringi on the perceptual tasks described
The field trips also involved building relationships with the local childcare service
providers, consulting with a steering committee and Kalkaringi community leaders and
working with cultural mentors and the project liaison. Discussions of informed consent
for all participants are ongoing and continued at each trip throughout the project.
A final field trip was undertaken at the end of the project (October 2012). In this
final trip we provided more feedback to the participants and communities about the
results of the research in the form of presentations, booklets, and classroom materials
3.2.1 Design
The Gurindji Kriol recordings used in this project were a subsample selected
from an existing audiovisual database created by Meakins (2011) for the Aboriginal
Australian Aboriginal languages and a contact variety of English, with mixing between
language varieties and speech styles (Simpson & Wigglesworth, 2008). Meakins (2011)
created the database of Gurindji Kriol for the Kalkaringi section of the ACLA-1 project
6
The ACLA-1 project was funded by Australian Research Council grant DP0343189
through the University of Melbourne (C.I. Gillian Wigglesworth, Jane Simpson and
Patrick McConvell). http://linguistics.unimelb.edu.au/research/projects/ACLA/
CHAPTER 3 50
and for her PhD thesis on the development and function of case morphology in Gurindji
Kriol (Meakins, 2007). The map in Figure 1.1 (from Meakins, 2011, p. xxi) is of the
Victoria River District and shows the Kalkaringi and Daguragu communities in relation
to the nearest town, Katherine, and the Northern Territory capital city, Darwin.
family interactions. These recordings were of Gurindji Kriol speaking women and their
older Gurindji family members, and were made between 2003 and 2007 at
approximately 6-month intervals. For the current research we selected a subsample from
this database of three speakers at three timepoints approximately six months apart, when
the target children were aged approximately 1;6, 2;0 and 2;6. We converted the
program Phon (Rose et al., 2007), phonetically transcribed the tokens of interest and
their phonological environments using the International Phonetic Alphabet (IPA), and
exported the relevant data into SPSS for further analysis. In this section I describe the
subsample from the Meakins (2011) corpus of Gurindji Kriol used in the current
research as well as our procedure for processing and transcribing the data and
consulting community members about the project and our use of the data.
CHAPTER 3 51
3.2.2 Participants
Participants in the subsample used in these studies were three Gurindji mothers
and their children who are speakers of Gurindji Kriol. The three women were mothers
of focus children in ACLA-1 and agreed to participate in the current project. These
speakers were selected because there was longitudinal conversational data when their
children were aged on average 1;5, 1;11 and 2;5, which are the approximate child ages
under investigation in this thesis, and because they were available in the community for
consultation and discussions about our use of their recordings and transcripts for this
research. The three mothers were recontacted and, in the presence of an interpreter,
were invited to give informed consent for us to use their speech data collected by
The children’s ages in the recordings selected for this research are displayed in
Table 3.1. From this point forward I refer to the three timepoints of data collection in
the current subsample as Stages 1, 2 and 3, though note these do not correspond to the
field trips in the Meakins dataset (in which there are trips both preceding and following
this subsample). The recording length and number of mothers’ utterances at each stage
Table 3.1
Gurindji Kriol Speaking Mothers and Age of the Focus Children
Child’s age
Mother Child’s gender
Stage 1 Stage 2 Stage 3
Total amounts of recording and number of mothers’ utterances in the Gurindji Kriol subsample at each stage
3.2.3 Procedure
The sample selected for the current research comprised audiovisual recordings
(2011) was sometimes present in recording sessions and sometimes left the area to
allow speakers to interact as naturally as possible (see Meakins, 2011, pp. 45-54 for
toys provided by Meakins, which included dolls, picture books, drawing and puppets.
Recordings also took place outside at the river, which included activities such as
that recorded at 44Ghz 16 bit, to a minidisc recorder, which was carried by the speaker
in a bumbag. A video camera with a shotgun microphone was also used to record
sessions.
transcript files from CHAT format to Phon (Rose et al., 2007) with the use of
conversion tools created for this purpose7. We then phonetically transcribed in IPA the
of common IPA transcriptions of Gurindji Kriol words that was imported into Phon.
7
Chatter converts files from CHAT to xml formats and vice-versa
(http://www.talkbank.org/software/chatter.html), and Phontalk converts files from xml
to Phon formats and vice-versa (http://phon.ling.mun.ca/phontrac/wiki/PhonTalk). Both
tools were created for the Phon project (Rose et al., 2007), which is a part of CHILDES
(MacWhinney, 2000), and are freely available.
CHAPTER 3 55
Phon’s search function was then used to export the transcription data into SPSS for
further analysis.
There were several ethical considerations involved with building on the data in
the Gurindji Kriol corpus for the current project. These included consultation with
community leaders and the speakers about the direction of the project, ongoing
permission from participants to use recordings and transcripts, who to work with as
discussed with speakers during each visit to the community in the presence of an
interpreter, and throughout the project speakers were given opportunities to discuss and
update their decisions about how their data would be used and stored. The consent form
for Daguragu mothers in Appendix A shows the options participants had for our use and
storage of their data. Participants generally consented to data storage at other facilities
(e.g. the language centre in Katherine, at national archives) if future researchers have to
get their permission to access the data. Participants also allowed us to print parts of
transcripts in academic publications and to use video clips and photos for conferences
and the project website, with varying degrees of permissions. The speakers did not wish
to use pseudonyms in academic reports – these women are proud of their language
variety and agreed to their names being associated with work on it. Speakers,
interpreters, community leaders and other community members involved in the project
were all paid for their time and gave informed consent to be a part of the research,
where they were able to discuss and choose the levels of anonymity they wished for the
3.3.1 Design
naturalistic northern Australian English maternal speech, recorded in homes and local
outdoor settings in family interaction. I recorded this corpus between July 2009 and
December 2010 (see Figure 3.1 for data collection process) in the town Katherine,
Northern Territory, which is about 300km south of Darwin and about 500km northeast
of Kalkaringi and Daguragu (Figure 1.1). This corpus was designed to be comparable to
the Gurindji Kriol maternal speech sample described above (section 1.5.1), and
same child ages between 1;6 and 2;6. The Katherine English corpus comprises over 30
the target children were aged approximately 1;6, 2;0 and 2;6. The mothers’ speech was
phonetically transcribed and analysed in Phon (Rose et al., 2007). Table 3.3 displays the
length of recording and number of mothers’ utterances of each speaker at each stage of
recording.
3.3.2 Participants
Participants were recruited through the Katherine childcare centres and other
local businesses and were paid for their time. The five mothers who participated were
monolingual Australian English speakers who lived and had grown up in Katherine,
with the exception of one mother (Erin) who grew up in a small regional town
four of the mothers had a child aged 1;5-1;7 at Stage 1 of data collection (the target
child); the fifth mother’s child (Cullen) was 2;0 at Stage 1. The characteristics of the
mothers and the target children are displayed in Table 3.4, followed by a brief
CHAPTER 3 57
description of each dyad and the other people present in recording sessions. Mothers
were given the choice of having pseudonyms for themselves and their families in
academic manuscripts; thus pseudonyms are used throughout this thesis for the mothers
Table 3.3
Bachelor
Cathy Lucy female only child 1;5 1;11 2;5
degree
Secondary youngest
Kim George male 1;5 1;11 2;5
school of three
Secondary
Kellie Tyson male only child 1;7 2;1 2;7
school
Bachelor oldest of
Erin Grace female 1;6 2;0 2;8*
degree three
Secondary oldest of
Eva Cullen male 2;0 2;6 3;0
school two
* The age difference between Stages 2 and 3 is greater for Grace than the other
children because Erin and Grace’s Stage 3 sessions were recorded later due to
participant circumstances
Cathy and Lucy. Lucy was Cathy’s only child. Lucy’s father occasionally
participated in recordings, but he was not usually home during recording and sessions
Kim and George. George had two older siblings, a boy and girl aged 6;3 and
3;0 respectively at Stage 1. One or both of the siblings participated in most of the
sessions. Kim was asked to focus on George in the sessions, so generally the siblings
CHAPTER 3 58
played together while Kim played with George. The children’s father participated
occasionally, but usually he was not home when recordings took place.
Kellie and Tyson. Tyson was an only child. Tyson’s grandmother (Kellie’s
mother) was present in most of the sessions. Other adult family members were
Erin and Grace. Grace had two younger brothers: one aged 0;5 at Stage 1 and
another born between Stages 2 and 3. Sessions were generally held when the younger
children were sleeping and the majority of interactions were between Erin and Grace.
Two adult males who work for Erin and her husband were occasionally present in some
of the sessions.
Eva and Cullen. Cullen was an only child at the first two stages, and then had a
younger brother born between Stages 2 and 3. Cullen’s father participated sometimes
when he was home during sessions. Cullen refused to wear the microphone in all
sessions, accordingly there is only audio of Eva’s speech at each stage. As Cullen was
six months older than the other children, Eva’s data were not used for the longitudinal
analyses of changes between the three child ages to ensure a simple and complete
dataset.
Table 3.4
Total amounts of recording and number of mothers’ utterances in the Katherine English corpus at each stage
3.3.3 Procedure
Mothers were asked to interact with their children as they normally would and to
focus on the target child if siblings were present. Settings for the recordings were the
family home, both inside and in the backyard, and at a local park, with care taken to
choose locations with minimal background noise. Mothers were told that the project
was about their everyday speech to children and were debriefed on the details after the
final recording session. I made all the Katherine English recordings, and spent time
before and after sessions building rapport with the families and familiarising the
children with the recording equipment. I was usually present in the sessions operating
the video camera, although sometimes when the participants appeared nervous or very
aware of my presence I secured the camera to the tripod and left the area.
Activities. The content of the recordings is the mothers and children engaging in
free play. They played with their own toys and games as well as those brought by the
researcher, and generally the same types of activities were carried out across families
and stages of recording: playing with coloured blocks, toy cars, dolls, puzzles and
playdough, free drawing and painting, and playing on playground equipment such as
swingsets and slides. Originally an effort was made to keep the materials as similar as
possible to those in the Gurindji Kriol recordings; however, participants were more at
ease choosing their own activities and materials. This was therefore allowed as a more
naturalistic speech sample was elicited when participants were comfortable, and as we
were interested in the speech the children were typically exposed to it was valid for
them to engage in their typical activities. Between one and three sessions were
conducted with each family at each stage, generally between half an hour and an hour in
length, depending on the participants’ schedules and the children’s compliance (refer to
receiver with XLR connection to a hard disk recorder (Edirol R-44). The mother and
child wore a microphone each and recorded on separate audio tracks. The mother’s
track was used for transcription. The microphone was positioned upwards at the top
centre of the mother’s shirt, and the children wore adjustable aprons with the transmitter
clipped on the back and the microphone at the top centre. The video camera (Sony
AVCHD HDR XR-500V) filmed in high definition and had an external microphone
Figure 3.2. Photo of a typical recording session. The mother and child (2;0) are both
wearing lapel microphones and play together in the garden at the child’s grandmother’s
house. The grandmother participates in the background and the researcher is present
behind the camera.
CHAPTER 3 62
that informed how the research was conducted, so I will briefly discuss these and how
Participants were recruited through Katherine childcare services and other local
businesses. This recruitment procedure allowed me as the researcher to consult with the
community and to build relationships with the local organisations that have an interest
in the research. Before starting the Katherine English corpus I attended several childcare
centres in operation and observed adult childcare providers interacting with children
aged about 1;0 to 5;0, and spoke casually with parents and staff about the experiences of
caring for children in Katherine. I also attended staff meetings of an organisation that
Katherine and gained awareness of common issues faced by families in the region.
These organisations also provided a channel for disseminating our findings to the
community.
recordings of the families’ everyday lives, it was critical to ensure the mothers had an
in-depth understanding of what we were doing with the recordings and transcripts, how
they would be stored and who could access them, and that we complied with the
families’ wishes. I discussed the information sheet and consent forms (Appendix A) in
detail with mothers before the first session and during each of the subsequent fieldtrips
to check if they had changed their minds on the consent options. These included various
options for the storage of data, its current and future use, and levels of de-identification
for each of the data formats (audio, video, photos and transcripts). Some mothers chose
to give full permission for all options of our use and storage of all forms of their data
CHAPTER 3 63
without any anonymity, for themselves and their children. The other mothers chose de-
identified options for online data storage and access (i.e. on the TalkBank website), and
Families were sent DVDs compiled of the video recordings overlaid with the
audio from the lapel microphones (with assistance from a technical research assistant on
the project), as these were the raw data we processed and analysed for the research.
Mothers could then check if there was anything they would like deleted from the stored
conversation (usually the recording was stopped or the mother’s microphone muted for
phone calls but it had kept recording in this case). Participation was voluntary and
mothers were reminded at the start of each session that they could stop the recording at
any time. They were paid $25 an hour for their time and were aware they would be paid
The mothers’ audio tracks were transcribed in the phonetic transcription and
analysis program Phon (Rose et al., 2007). Phon was used to segment the tracks into
utterances based on conversational turn-taking and pause boundaries (see Table 3.4 for
the number of utterances per mother at each child age). Phonetic transcription of the
audio recordings was performed by me, a native speaker of Australian English from
regional Tasmania. I transcribed mothers’ actual pronunciation in the ‘IPA Actual’ tier
in Phon as well as the standard Australian English forms in the ‘IPA Target’ tier. This
allowed us to compare the mothers’ pronunciation with the standard forms and to search
for phonological processes that involve systematic differences between the two, such as
elision (the process investigated in Chapter 4). Further detail of the procedure of using
CHAPTER 3 64
Phon (Rose et al., 2007) for this purpose is in Buchan (2011), reprinted in the last
Harrington, Cox and Evans (1997) for phonetic transcription of Australian English,
subset of the data for reliability analyses, which showed over 95% agreement.
Disagreements were resolved through consensus discussions between myself and the
second transcriber, and where consensus could not be reached a third transcriber, the
primary supervisor of this thesis (trained in phonetics and phonology and a researcher in
Transcription was an ongoing process and was done in several settings (field
locations and at the university), and the same set of high quality headphones
(Sennheiser HD 280 Pro) was used for all transcription with the second and third
transcribers using identical sets. Further detail of the quality of the audio recordings and
project. Community RAs were seven young women in Kalkaringi and Daguragu who
were native speakers of Gurindji Kriol and recommended by the traditional owners and
the community project liaison. Two of the three Gurindji Kriol speakers mentioned in
the above description of the Gurindji Kriol subsample were also community RAs. The
third speaker was away from the community during the times the research for this thesis
CHAPTER 3 65
was carried out, but we were able to contact her in Katherine and when she visited
Kalkaringi to discuss consent and to ask her permission to use recordings and
transcripts of her. The community RAs were involved in a variety of tasks. They
familiarised me with the community, the social context and their perspective on the
language situation, and taught me some Gurindji Kriol as well as some traditional
training and Visual Analogue Scale tasks described in Chapter 4, and through these
The study in Chapter 5 involved us developing a new method for using the
Visual Analogue Scales with native speakers of Gurindji Kriol to add to our phonetic
transcription of Gurindji Kriol, and so the method is described in some detail in that
research article.
3.5.1 Overview
powerful search function, can be used for a wide range of investigations in phonetics
and phonology. Phon is compatible with other language processing programs and is not
8
This section is a refereed published paper: Buchan, H. (2011). Phon: Free software for
phonological transcription and analysis. Language Documentation and Conservation, 5
81-87. Acknowledgements: I would like to thank my supervisors Caroline Jones
(University of Wollongong) and Felicity Meakins (University of Queensland) for their
valuable suggestions. I am also very grateful to Yvan Rose (Memorial University) for
his helpful feedback.
CHAPTER 3 66
just limited to English, making it a useful tool for documenting and analysing the
The program has a user-friendly interface that makes transcription and analysis a
feature opens in a new view panel and so can be displayed as needed. Phon supports
displayed together for consensus discussions and transcription validation. The media
player supports both audio and video files, and media files can be segmented to link to
sections of the transcription. Each transcript section is called a record, and once linked
to the media the waveform of each record can be displayed. Phon uses Unicode fonts,
and there are built-in dictionaries for some languages which facilitates transcription of
the target phonological form. Although the built-in dictionaries are only of major
languages, there is also a dictionary utility that allows user-defined dictionary input that
will show the IPA transcription of target phonological forms. There is also an automatic
syllabification feature that can be adapted for other languages. Perhaps most impressive
are the program’s search capabilities. The search function permits systematic searches
of phones and meta-phones, and the program developers have created PhonEx (Phon
Expressions), a language that allows for complex searches using common linguistic
terms. PhonEx is logical and easy to learn from the manual, especially for researchers
very useful program for describing and analysing phonetic features and phonological
Another very useful feature of Phon is that transcripts can be imported and
exported in CSV and XML formats. This makes it easy to archive transcripts, and to use
Phon in conjunction with other language documentation software. For example users
can download the program Chatter, which is freely available online, to convert CHAT
(the CHILDES transcription format) transcripts into XML, and then use another freely
available script, xml2phon, to import these into Phon for phonetic and phonological
analysis9.
as the IPA dictionary function can substantially speed up transcription time. However,
transcripts can also be converted to be compatible with programs like Transcriber and
ELAN. This means if a corpus has previously been transcribed in one of these programs
it can then be imported into Phon for phonetic transcription and analysis10.
Microsoft Excel and Word, OpenOffice Calc and Writer, as well as more general PDF,
HTML and CSV formats, the latter especially suited for transporting Phon data into
other applications and statistical analysis programs. In this way Phon’s unique search
Phon was developed by a team led by Yvan Rose and Greg Hedlund as a part of
the PhonBank project, a recent expansion of the CHILDES project (Child Language
system facilitates data sharing across the field of child language research. CHILDES is
several subfields of human and animal communication research. Within this larger
of languages. PhonBank data are compatible with the Phon and CLAN software
programs.
Good support is available for Phon users. A PDF version of the manual is
available online and in the program’s help menu. Users can ask for help and post feature
requests in an online forum11 that is checked regularly by the developers, and there is
Phon has very recently been released in a new version (v1.4). This version is a
substantial improvement on Phon 1.3, particularly in the layout and navigation. User
feature requests have been incorporated into v1.4, and existing functions have been
updated. There is a simple tool within the program for converting project files from v1.3
Projects are organised hierarchically, with each project made up of one or more
corpora, and each corpus containing one or more sessions. At this stage the user needs
to decide whether to work in the Editor’s mode or the Blind mode, the former being the
default. Transcription done in Editor’s mode will not appear in the Blind mode, so it is
important to make the decision about which mode to work in at the beginning of the
project. The Blind mode enables blind transcription from multiple transcribers and
should be checked at the start of the project if you would like to compare the phonetic
11
http://phon.ling.mun.ca/phontrac/discussion
12
phon-subscribe@googlegroups.com
CHAPTER 3 69
it is a way to keep track of changes made through consulting with native speakers.
Transcription is done in the Session Editor. The layout here can be organised
according to the user’s needs at the time. The displays (view panels) for different
functions (e.g., media segmentation, automatic IPA lookup, search, search results) can
be opened and moved around and docked or undocked as needed. Customisation of the
layout is one of the strong points of Phon, as you only need to display the functions
relevant to the specific task at hand (e.g. you can have a different configuration of view
example setup for transcribing in Phon. The Record List on the left makes it easy to
navigate between records, which are portions of the transcript that are useful for
research, typically consisting of single utterances. The Session Information and Record
Data in the middle display the participants’ details and the record transcript,
respectively. The IPA Lookup on the right displays the dictionary IPA transcription of
3.5.3 Media
The media player in Phon supports a range of audio and video formats and can
handle high quality recordings (I use 24 bit 48 kHz audio files, which work well in this
version). Media files can be added by locating the file in the Session Information view
panel, and played by opening the player from the Media menu. Media can be segmented
to create records (though data records can also be created in the absence of recorded
media.) The segmentation feature is simple to use. After entering in the participant
information, the user can decide to segment media from the beginning of the recorded
media file or from the end of the last record segmented. To segment, simply use either
shown in Figure 3.4. This will automatically create a record for that participant that
timeframe. I found the default 3000ms window to work generally well for naturalistic
interactions between mothers and children. Segment times can also be easily adjusted in
the segment tier of each record or by moving the boundaries in the waveform. The
segmentation tool is also useful in that it allows the user to export individual media
clips as separate files. This is good for storing soundbites, and is also very helpful when
audio in the segment is unclear, as the user can then quickly open the individual
Figure 3.4. Using the segmentation tool to create a record that is linked to the media.
3.5.4 Transcription
Once a record has been created, the associated utterance(s) can be transcribed.
The Record Data window in the example in Figure 3.5 shows the default transcript
tiers. The Tier Management function allows the user to re-order and show or hide tiers,
and to create new tiers. There is an IPA chart, available from the Tools menu, for
At this stage the user should decide on how to group words. Word groups in
Phon refer to user-defined boundaries and are useful for defining boundaries of a lexical
word, e.g., should, or a prosodic word, e.g. oughta. Word groups are represented by the
square brackets and can be created in the Orthography tier, which then automatically
creates groups in the IPA Target tier (transcription of the model or standard utterance
form) and the IPA Actual tier (transcription of the produced utterance form), and any
user-defined tiers that have been created to align with word groups. This feature is a
way in which the transcript structure can be adapted to specific requirements of the
research as it allows utterances to be segmented at any level, so groups can be made for
The IPA target refers to the model phonological form, and the search function in
Phon can make systematic comparisons between target and actual phonological forms.
This is a very useful feature that can be applied, for example, to compare phonemic and
phonetic transcriptions, standard and colloquial pronunciations, adult forms with actual
inserted into the transcript. Dictionaries are already built-in for some major languages
(American English, Catalan, German, French, Icelandic, Dutch, Italian and Spanish),
and can be imported for other languages. A fieldworker could therefore create a word
list with IPA transcriptions of the most frequent words in a corpus and build it into Phon
as a basic dictionary. The phonological word forms can then be displayed in the
dictionary look-up view panel and inserted into the transcript simply by clicking on
each word form. If a word in the dictionary look-up has more than one IPA entry then
the arrow keys can be used to scroll through the possible alternatives (for example you
can choose between [ət] and [t] for the English word it). Combined with the waveform
feature, which allows the user to isolate segments in an utterance, this feature is an
efficient way for a fieldworker to check transcriptions with native speakers – the user
can play a segment and go through potential phonological forms for that segment
simultaneously.
Phon also has tiers for target and actual Syllabification and Alignment, which
offer syllable-level data annotation as well as systematic phone alignment between the
IPA Target and IPA Actual tiers (see Figure 3.5 for an example). Syllable information is
automatically labelled and displayed graphically in the transcript with boxes around
each syllable and different colours to represent syllable constituents. The automatic
selecting the correct label from a drop-down list. The syllabification feature does not
require the use of stress symbols in transcription, and provides a quick and easy way of
seeing how utterances are segmented into syllables and the alignment between the IPA
3.5.5 Search
The search function in Phon is excellent and allows for advanced queries. While
this is just an overview of the main search features, searching in Phon can be adapted to
be as complex and detailed as necessary. Searches can be performed both within and
across transcripts, and results can be filtered by speaker. Tiers can be queried
individually using the Data Tiers search (e.g., to search for all instances of a particular
word in the Orthography tier), or multiple tiers can be queried at once (e.g., to compare
CHAPTER 3 74
segments in the IPA Target and IPA Actual tiers). Phon also allows for searches of
stress patterns and syllable types, i.e. consonant, vowel and glide combinations. The
results can be exported in the variety of formats listed above, making it easier for
further analysis. For example, CSV report exports are useful for data post-processing in
The search function is so good because it has been specifically designed for
phonological analysis. There are tools for searching vowel and consonant harmony and
consonant metathesis, in which the user can select the type of harmony and list the
features and directionality of the process. As well as using plain text and regular
expressions, complex queries can also be made using PhonEx, a language built for Phon
using common linguistic terms. The structure of and terms in PhonEx are described in
detail in the manual and it does not take long to learn how to create queries. PhonEx
allows search expressions to be as broad or specific as needed. For example you can
search for all instances of word-initial [k], all vowels in unstressed syllables, dorsal
sonorants in primary stressed syllables in onset position, one or more phones that are
not in coda position and occur at the end of a word group, etc. The output of searches
report with the record numbers for each instance (if you search within a session you can
even click in the results box to bring up the records in which each instance occurs). The
advanced search system is what really sets Phon apart from other phonetic transcription
software. It is not as complicated as it may appear at first, results are easy to navigate,
and it allows for a very wide range of complex searches depending on the researcher’s
needs.
CHAPTER 3 75
3.5.6 Conclusion
and phonetic data. Transcribing in Phon is straightforward, and the media segmentation
and dictionary IPA look-up features help to speed up the transcription process. The
language variety in great detail. Phon would be a good asset to a great variety of
customisable layout make Phon a very handy tool for fieldworkers interested in
program developers
Open Source: Yes, Phon is licensed under the GNU General Public License
http://phon.ling.mun.ca/phontrac/wiki/Downloads
CHAPTER 3 76
Documentation: http://phon.ling.mun.ca/phontrac/wiki/
This chapter provided an overview of the methodology used for the empirical
research in this thesis. It included descriptions of how the project was conducted, the
Gurindji Kriol and Katherine English corpora, and how data were processed, transcribed
and analysed in the program Phon (Rose et al., 2007). There were three main aims of
the field trips to Kalkaringi and Katherine in the Northern Territory, Australia, that I
undertook between 2009 and 2011: 1) Recontact three Gurindji Kriol mothers whose
families were part of the Meakins (2011) corpus and discuss our use of their audiovisual
recordings and transcripts for this project, and consult with Kalkaringi community
leaders and traditional owners about the project; 2) Create a comparable longitudinal
Assistants, and use Visual Analogue Scales to elicit their native speaker judgements of
Phonological analyses of corpus data conducted at the level in this research are
possible due to advanced computer programs for phonetic transcription and analysis.
For this project we used the software Phon (Rose et al., 2007), which is a component of
CHILDES and compatible with the CHAT transcription format (MacWhinney, 2000).
CHAPTER 3 77
The last section in this chapter (previously published, see Buchan [2011]) provided a
review of Phon and a guide for using it in the context of this research.
The next three chapters describe three empirical research studies and are written
children at approximately the same ages, 1;6, 2;0 and 2;6. Study 3 (Chapter 6) offers a
4.1 Abstract
manners of articulation and lack phonemic fricatives (Butcher, 2006). In Kriol varieties
fricative production is highly variable and many Kriol words that have fricatives in their
English cognates can be pronounced variably with a fricative or stop (Sandefur, 1991).
Speakers therefore often have multiple phonetic forms of words to choose from, and in
child-directed speech adults may either consciously or unconsciously choose forms that
a mixed language variety in northern Australia that contains lexical forms from both the
traditional language Gurindji and Kriol. We phonetically transcribed and analysed data
from the Meakins (2011) longitudinal Gurindji Kriol ACLA corpus to examine stop-
fricative variation in Kriol-derived words. Speakers were three Gurindji women at three
timepoints, when their children were aged approximately 1;6, 2;0 and 2;6. To quantify
transcripts for words that had fricatives in the English cognates. Words were categorised
and analysed according to whether they always contained a fricative, were never
regression models were performed for word-initial, -medial and -final positions to
investigate the relative effects of child age and phonological environment on stop-
fricative variation.
Results showed that words containing stop-fricative variation were more likely
to be open-class (88%) than closed-class, and the variable segments were most
CHAPTER 4 80
frequently word-initial (49%), then word-medial (32%) and word-final (19%). The most
frequent places of articulation were labio-dental (44%) and alveolar (32%). Cross-
derived Gurindji Kriol words showed variability across place, manner and voicing, and
differences by word position. Logistic regression results indicated that in initial and
medial positions fricative variation in mothers’ speech changed with child age, taking
into account phonological environments and inter-speaker variation. Across all words,
word-initial fricatives in mothers’ speech were significantly more likely at child age 2;6
significantly more likely at 2;6 than 2;0 and 1;6. Results suggest stop-fricative variation
changed in Gurindji Kriol maternal speech as children aged, partly due to changes in
lexical choice. Findings are discussed in relation to lenition processes and fine-tuning in
child-directed speech.
CHAPTER 4 81
4.2 Background
Gurindji Kriol is the main language spoken by children and adults under the age
structures and lexical items from the traditional Aboriginal language Gurindji, and from
p.12). The grammar of Gurindji Kriol has been examined in detail (Meakins, 2009,
2010, 2011) and the phonological inventory has been briefly described (Meakins, 2007;
speech13, with the overarching aim of investigating the phonology of the speech input
fricative variation in family interactions and examined possible factors that may affect
fricative variation and whether it changes over time in mothers’ speech as their children
Most Australian languages do not have a manner contrast between stops and
fricatives and lack phonemic fricatives (Butcher, 2006). Lenis stops will sometimes
have fricative allophones, particularly voiced stops following long vowels, and in
languages that do contain phonemic fricatives they are generally voiced and contrast
with voiceless stops (Dixon, 2002). Phonetic fricatives are often dorsal and/or labial,
and common word positions are initial and medial (Dixon, 2002). Traditional Gurindji
does not contain a stop-fricative contrast and has allophonic fricatives (e.g. the
13
The Gurindji Kriol corpus analysed in this study was made by Meakins (2004-07) for
the Aboriginal Child Language (ACLA-1) project, funded by Australian Research
Council grant DP034189 through the University of Melbourne (C.I. Gillian
Wigglesworth, Jane Simpson and Patrick McConvell).
http://linguistics.unimelb.edu.au/research/projects/ACLA/
CHAPTER 4 82
fricative). The nature and extent of fricative production in Gurindji and Gurindji-
derived words in Gurindji Kriol has not been investigated. The current study focuses on
the fricatives in Gurindji Kriol that are in Kriol-derived words. The production of these
fricatives is highly variable. They are known to occur at three places of articulation –
bilabial, alveolar and palatal – and are used in variation with stops (Meakins, in press).
or other Australian contact language varieties. In the present study we focus our
influenced by lexical and phonetic environments, and also how variation in mothers’
speech changes as their children get older. Evidence from the child-directed speech
literature (mostly on standard language varieties) suggests that mothers make segmental
modifications in their speech to children, and that modifications may change as children
get older and their receptive and productive language develops. Cross-linguistically
fricatives tend to be acquired by children later than stops or glides, and it has been
suggested that this is due to the greater articulatory difficulty involved in producing
fricatives (Dinnsen, 1992, pp.196-199; Kent, 1992, pp.75-76). Thus if mothers make
there may be an effect of child age on fricative variation in mothers’ speech. Qualitative
reports from Gurindji women suggest that fine-tuning may contribute to phonetic
variation in their speech to children. They have reported using more stops than
fricatives when speaking to children as fricatives are considered more difficult sounds
(Jones & Meakins, 2012b). There are, however, no previous quantitative descriptions of
CHAPTER 4 83
fricatives in Gurindji Kriol and it is unknown if these impressions are actually realised
in mothers’ speech and at what child ages the effect may appear.
Kriol, which is followed by more detail on how child age may affect phonetic variation
in maternal speech. We then discuss linguistic factors that may contribute to fricative
It developed from code-switching between Gurindji and Kriol in the 1970s (McConvell
& Meakins, 2005; McConvell, 2008; Meakins, pp.109-53) and contains a structural split
between the noun phrase system, which is mostly from Gurindji, and the verb phrase
system, mostly from Kriol (Meakins, 2011, pp.12-13). The lexicon contains nouns and
verbs derived from both languages and Gurindji Kriol contains phonology of both
Gurindji and Kriol (Meakins, in press). Gurindji Kriol is the main language of Gurindji
some older people, and Kriol is spoken with Kriol-speaking visitors to the community.
The neighbouring language Warlpiri is also spoken by some people. English is the
language of the school, government services and media and is spoken with English-
phrases that often occur in the school context (Meakins, 2008; Meakins, 2011, p.65),
but Gurindji Kriol is the children’s home language and they generally do not hear much
English in the years before school. See Meakins (2008; 2011, pp.58-70) for more
The current study is part of a larger project investigating the phonetics and
phonology of maternal Gurindji Kriol. The aim of the project is to investigate the sound
CHAPTER 4 84
patterns that children are exposed to in their home language Gurindji Kriol, and
between 2004 and 2007 as part of ACLA-1 (see footnote 1). Acoustic analyses of
mothers’ speech from this corpus and analyses of minimal pair contrasts in Kriol-
derived words suggests Gurindji Kriol may have a 5 vowel system, /ɪ, ɛ, ɐ, ɔ, ʊ/ (Jones,
Meakins & Buchan, 2011; Jones, Meakins & Muawiyath, 2012), and voicing variation
in stop consonants depending on word position and place of articulation (Jones &
Meakins, 2012a). Fricatives in Gurindji Kriol have not previously been studied in detail.
Research on the phonology of Kriol has found that fricative production is highly
variable. Sandefur (1991) suggests that when Kriol was developing, speakers produced
English-derived words with traditional language phonology, which did not contain
fricatives, and as speakers remained in contact with English some Kriol words started to
be pronounced with the original English sounds. He also suggests that the newer
than replacing them, which is why Kriol words often have several pronunciations
(Sandefur, 1991) and accounts for the high variability in fricatives, which are present in
Impressionistically, Kriol-derived words in Gurindji Kriol are also highly variable, and
Meakins (2007, p.358) notes that there is variation in manner of articulation between
including situational contexts and linguistic factors such as word position and
phonological environment. The settings of the current dataset are family interactions, as
the broad aim is to investigate the phonology of the input that children acquiring
CHAPTER 4 85
Gurindji Kriol are exposed to. Thus the data in the current study is maternal speech in
Gurindji Kriol. Maternal speech is defined here as mothers’ speech spoken in the
presence of their children and includes child-directed speech, that is speech directed
specifically to the child (see Soderstrom, 2007, for a discussion of maternal speech as
in pitch contours, longer pauses, and higher fundamental frequency (Snow, 1977, p.36;
Fernald et al., 1989; Katz, Cohn & Moore, 1996). Cross-linguistic evidence suggests
their speech change as children get older. For example, in Mandarin acoustic
exaggeration in vowels, i.e. larger vowel space, longer durations, and increased tone
range, was greater in adult speech to younger children (aged 0;7-1;0) than to older
children (aged 5;0), and vowel exaggeration in speech to the older children was still
greater than in adult-directed speech (Liu, Tsau & Kuhl, 2009). Similarly in a regional
variety of English from northern England, segmental phonetic variants associated with
speech between child ages 2;0 to 4;0 (Foulkes, Docherty & Watt, 2005). Phonological
reduction in mothers’ speech related to casual speech processes has also been found to
change with a child's age in northern Australian English, with mothers proportionately
CHAPTER 4 86
increasing segmental deletion as children aged 1;6 to 2;0 and then decreasing deletion
between 2;0 and 2;6 (Buchan & Jones, in press), suggesting mothers may fine-tune their
Mothers’ fricative production may also vary between child- and adult-directed
manner of articulation have been found in English (Lee, Davis & MacNeilage, 2010)
and Korean (Lee & Davis, 2008). English child-directed speech to children aged 1;0
contained proportionately more stops and glides, and fewer fricatives, affricates, liquids
and nasals than adult-directed speech, partly due to different lexical items used in each
speech compared with interadult speech. These results were interpreted to suggest that
mothers may ‘match’ their speech to their child’s language capabilities, scaffolding
children according to the sounds they are acquiring (Lee et al., 2010). The difference in
lexical items used in child-directed and adult-directed speech further suggests that the
Fricatives are more difficult to produce and tend to be acquired later than stops
and glides (Dinnsen, 1992, pp.196-199; Kent, 1992, pp.75-76), therefore if mothers
modify their speech according to the sounds children are acquiring (Lee & Davis, 2010)
we would expect a relationship between child age and fricative variation in mothers’
it occurs both within and between lexical forms. Speakers may have a choice of
CHAPTER 4 87
alternate phonetic forms and opportunity to avoid fricatives in speech to children and/or
The current study is an investigation of input children are exposed to, thus the
data consists of maternal speech, which is a broader term than child-directed speech
(Soderstrom, 2007) that includes mothers’ speech in family interactions when the child
is present rather than only speech directed to the child. Studies on English suggest that a
directed specifically to them (van de Weijer, 2002) and that exposure to third party
children are part of large extended family groups and the speech input they are exposed
to consists of general family interactions. Child age is one factor that may contribute to
fricative variation in maternal Gurindji Kriol, but variability is also high in interadult
speech and in this study we also examined linguistic factors that potentially affect
fricative variation.
changes into a fricative (e.g. [t] [θ]), may play a role in this type of variation.
Lenition is a sound change process involving consonant ‘weakening’, but exactly what
constitutes weakening is debated in the literature (e.g. Bauer, 2008; Honeybone, 2008;
trajectories of sounds from strong to weak (e.g. Escure, 1977; Hock, 1986), and focus
on the acoustic and articulatory features of the sound change such as sonority (i.e.
between articulators.
CHAPTER 4 88
effort whereby the phonetic target is not reached (e.g. Kirchner, 2004). In spirantization
the reduced effort leads to frication of stops, that is, allowing turbulent airflow through
the articulators rather than stopping it (Bauer, 2008). Kirchner (2004, p.316) suggests
certain contexts are more susceptible to lenition, in particular fast/casual speech and
articulatory effort, which is in turn influenced by the speech context and phonological
environment.
considered ‘weaker’ sounds than stops in that there is more airflow between the
articulators, they are also more difficult to articulate in that producing fricatives requires
more precision and muscular control than stops (Ladefoged & Maddieson, 1996, p.137).
children’s acquisition as they are generally acquired later than ‘easier’ manners of
articulation may vary with the position of the segment within a word. Sole’s (2010)
aerodynamic study of English speakers found that coda fricatives were articulatorily
more difficult to produce than onset fricatives, and suggests this may account for word-
final fricative weakening or elision of syllable-final fricatives that is found across many
languages.
(2008) found that the immediate phonological environment, that is segments either side
of the lenited phone, influenced consonant lenition. He posited lenition may be driven
by the speaker modifying the perceptual saliency of the segment and those surrounding
CHAPTER 4 89
it. A perceptual explanation of lenition was also examined by Kaplan (2011), who
suggests one of the speaker’s goals in lenition is minimal perceptual salience of the
sound change. Kaplan (2011) found that intervocalically the distinction between voiced
and voiceless stops was more perceptually salient than the distinction between voiced
stops and voiced fricatives, suggesting listeners may be less sensitive to intervocalic
stops being lenited to fricatives. The magnitude of the effect, however, was smaller for
labials than dorsals and coronals, despite labial being a common place of articulation for
frication across languages (Kaplan, 2011). Perceptual factors may therefore account for
some but not all lenition processes, and both perceptual and articulatory factors may
(Wilkinson, 1991) and Djapu (Morphy, 1983). In a recent study of Gaalpu, Chong
(2011) found phonological context was a factor in the conditioning of lenition. Suffix-
initial stops were lenited to semi-vowels preceding a vowel and following a liquid,
semi-vowel or vowel.
potentially interacting factors. In addition to lenition processes, the input that children
are exposed to is dynamic and may also contain variability as a result of mothers
productive and receptive acquisition. The aim of this study is to quantitatively describe
stop-fricative variation in Gurindji Kriol maternal speech and to examine the nature of
variation, particularly how it changes over time in mothers’ speech as their children age.
The current study investigated three research questions: 1) What is the nature and extent
CHAPTER 4 90
of fricative variation within Gurindji Kriol lexical forms in maternal speech? 2) Within
lexical forms containing stop-fricative variation, what are the relationships between
fricatives in English cognates and their pronunciation in Gurindji Kriol words? 3) How
does fricative variation within and between Gurindji Kriol lexical forms change in
mothers’ speech as children get older, taking into account phonological environment
4.3 Method
4.3.1 Design
interactions in Gurindji Kriol collected by Meakins (2011, pp. 45-47) for the Aboriginal
Child Language (ACLA-1) project. From the subsample we selected all tokens of words
that could contain a fricative and analysed the IPA transcription of these tokens and
or known phonological targets in Kriol words and therefore no model forms to which
basis for quantifying where fricatives could potentially occur in Kriol based words in
Gurindji Kriol. From hereon we use the term ‘potential fricative’ to refer to segments in
Gurindji Kriol words that are pronounced as fricatives in the English cognates and can
approach allowed us to examine not only the environments where fricatives and stop-
fricative variation occurred, but also the environments where fricatives did not occur,
i.e. where there were only stops despite the English cognate containing a fricative,
providing further insight into the factors that may be influencing variation. The current
CHAPTER 4 91
dataset contains tokens of ‘potential fricatives’. The dependent variable was manner of
4.3.2 Speakers
Speakers in the subsample used in this study were three young Gurindji mothers
who are native speakers of Gurindji Kriol. We selected recordings from the larger
audiovisual database (Meakins, 2011, pp.45-47) that were made at three timepoints:
when the mothers’ children were aged on average 1;6, 2;0 and 2;6. The children were
4.3.3 Recordings
engaging in free play. Speakers wore a lapel microphone that recorded at 44 kHz 16 bit
to a minidisc recorder that was worn by the speaker. Sessions were also recorded with a
video camera that had a shotgun microphone. Further detail on the corpus and recording
4.3.4 Analysis
Byrne, Wareham & MacWhinney, 2007) using IPA. Phonetic transcription was done by
the first and second authors and two research assistants, all native speakers of
Australian English trained in phonetic transcription. We then queried Phon for the
lexical items whose English cognate words contain fricatives and exported all tokens of
these words spoken by the three mothers along with the immediate phonological
A total of 2,070 tokens were exported from Phon. Words containing [h] in their
English cognates were then excluded (N=474), as in Australian languages [h] behaves
CHAPTER 4 92
differently to other fricatives and would have skewed the results: in the current dataset
only 9 of the potential [h] tokens (0.02%) were transcribed as being pronounced with a
[h]. For the purpose of analysing variation between stops and fricatives we also
excluded cases where the potential fricative was pronounced as a glide (N=92), most of
unstressed words, for example dat ‘the, <that’ was sometimes pronounced [jɐt] when
the previous word ended in a vowel. A further 2 tokens were excluded because the
potential fricative segment was unclear. After the exclusions, 1,502 total tokens were
analysed.
4.3.5 Coding
analysis. Data included the IPA transcription, corresponding sound in the English
segments from adjacent words (i.e. the phones preceding word-initial potential
fricatives and the phones following word-final ones). Binary logistic regression
analyses were performed separately for each word position, initial, medial and final. For
all three models the actual pronunciation of the potential fricatives was dichotomized
coded separately for each word position according to their sound class.
cognate. These are all cases of consonant clusters in the cognate where the ‘potential
absent. Where this occurred the whole cluster was represented as the cognate segment.
This was done so that differences in consonant clusters between the cognate and
CHAPTER 4 93
Gurindji Kriol pronunciation could be examined while the actual pronunciation of the
Some potential fricative phones were also coded as ‘absent’ for the purpose of
comparison with fricatives in English cognates. Because target forms in Gurindji Kriol
are unknown, ‘absent’ in this case does not refer to elided segments but rather phones
that are not present in relation to the corresponding segment in the English cognate,
consonant in the Gurindji Kriol form. For example the word-initial cluster [st] is
generally pronounced as a cluster in some Gurindji Kriol words, e.g. stori ‘story’ [stɔɹɪ]
and as the alveolar stop in other words, e.g. dop ‘stop’ [dɔp]. In the latter case the
potential fricative initial [s] was coded as absent for comparison purposes.
these tokens. Tokens in the reliability analysis were a stratified sample of the dataset,
randomly selected from each word position and place of articulation of potential
fricatives. The second transcriber listened to the whole utterance and transcribed in IPA
the token indicated on a spreadsheet, as well as the preceding phone for word-initial
potential fricatives and the following phone for word-final ones. Percentage agreement
between the two transcriptions was calculated for features of the potential fricatives and
fricatives is in Table 4.1. There was 92% agreement (N=185/202) on whether a phone
transcriber had noted uncertainty about whether the phone was a stop or a fricative. The
CHAPTER 4 94
other manner of articulation discrepancies were between palatal stops and affricates,
intervocalic stops and glides, and word-initial deletions, which generally reflected
agreement’ (63%) refers to when both transcribers used the same IPA symbol for the
potential fricative, that is both transcribers agreed completely on all features of the
93% agreement on the preceding phone and 98% on the following. Phonological
environment disagreements generally involved pause boundaries, that is, whether the
Table 4.1
Number and percentage of feature agreement on ‘potential fricative’ segments between
two transcribers
%
N agree N disagree
agreement
Full feature agreement 128 74 63%
palatal stop-affricate - 20 -
stop-glide-deletion - 12 -
Unclear token - 3 -
CHAPTER 4 95
4.4 Results
To examine the nature and extent of fricative variation within Gurindji Kriol
lexical forms, we analysed the Kriol-derived words in the sample that contained
fricative variation in manner of articulation (i.e. variability between fricatives and stops
or affricates) and compared them to the words where the potential fricative was not
pronounced with stop-fricative variation. Words were categorised into four groups
variably as a stop/affricate or a fricative, 4) only one token in the dataset and variability
could not be determined. The latter group (N=57) were excluded from the current
analysis. The list of words in each of the three groups analysed is in the appendix.
Tokens that were English code-switches were included in these analyses as they form
part of the linguistic input children are exposed to. Words involving code-switches are
in bold in the appendix, which shows that code-switched words containing potential
Words containing multiple potential fricatives were coded for each word-
position. For example dij ‘this’ was coded twice, for word-initial and -final positions.
There were 28 types of ‘all stop’ words where the potential fricative was always
pronounced as a stop or affricate in the sample, 41 types of ‘all fricative’ words where it
was always pronounced as a fricative, and 58 types of ‘variable’ words containing stop-
fricative variation. These three lexical form categories were then analysed by word
class, word position and place of articulation of the potential fricative, and by overall
word frequency. In these categories some cells had few (< 5) observations, therefore
inferential statistics were not run and these results are presented descriptively.
CHAPTER 4 96
Word class. Words were broadly classified into open-class and closed-classed
words. Figure 4.1 shows the proportions of open and closed class words within the ‘all
stops’, ‘all fricatives’ and ‘stop-fricative variation’ words. In words where the potential
fricative was always a stop or affricate, just over half of them were closed-class words
(54%), such as dat ‘the <that’ and ‘jarran ‘that one’, while words where it was always a
fricative were nearly all open-class (95%), such as faitim ‘fight’ and stirimraun ‘stir’.
The majority of words containing stop-fricative variation were open-class words (88%)
100%
90%
Percentage pronounced in each word class
80%
70%
60%
50%
40% closed-class
open-class
30%
20%
10%
0%
all stops all fricatives stop-fric
variation
Pronunciation of 'potential fricative' within lexical forms
Figure 4.1. Proportions of closed-class and open-class words within lexical form
categories
CHAPTER 4 97
Word position. Figure 4.2 shows the proportions of the word position of the
fricative, and words where there was stop-fricative variation. In words that were never
pronounced with fricatives, i.e. the potential fricative was always a stop or affricate,
32% of potential fricatives were word-initial, 39% word-medial, and 29% word-final. In
words that were always pronounced with a fricative, the fricative was most frequently
words containing stop-fricative variation, 49% of the potential fricatives were in word-
initial position, 32% were word-medial, and 19% were word-final. Thus stop-fricative
variation within lexical forms occurs in all word positions, though it appears to be most
100%
90%
Percentage pronounced in each word position
80%
70%
60%
50%
final
40% medial
30% initial
20%
10%
0%
all stops all fricatives stop-fric
variation
Pronunciation of 'potential fricative' within lexical forms
Figure 4.2. Proportions of word positions of potential fricatives within lexical form
categories
CHAPTER 4 98
in the English cognates across the three categories of lexical items are shown in Figure
4.3. This description is of the place of the fricative in the cognates, as several of the
variable words involved place of articulation variation. More detail on the place of
where the potential fricative was always pronounced as a stop or affricate, the cognate
place of articulation was most frequently dental (46%) e.g. the medial stop in jamting
‘something’, and labio-dental (36%) e.g. abim ‘have’, with some alveolars (18%) e.g.
mawuji ‘mouse’ and no palato-alveolars. In words where the potential fricative was
always pronounced as a fricative, the fricatives in the cognates were most frequently
alveolars (67%) e.g. stori ‘story’, followed by labio-dentals (21%) e.g. fait ‘fight’, with
few palato-alveolars (10%) e.g. shade ‘shade’ and just one dental (2%), three, which is
variation was also frequent in sounds that in cognates were alveolars (32%) e.g. medial
consonant in dijei ‘this way’, and also occurred in palato-alveolars (17%) e.g. medial
consonant in machine and dentals (7%), e.g. medial consonant in najan ‘another one’.
Note that fricatives in Gurindji Kriol are not always at the same place of articulation as
Overall, stop-fricative variation within words occurred most frequently when the
fricative in the cognate was labio-dental and least frequently when it was dental.
Cognate dentals, which were present due to their occurrence in words like that and this,
were most likely to be pronounced invariably as a stop or affricate within Gurindji Kriol
words, while alveolars were more likely to always be pronounced as a fricative within
words and palato-alveolars were either always a fricative or varied with a stop.
CHAPTER 4 99
100%
80%
70%
60%
50% palato-alveolar
40% alveolar
dental
30%
labio-dental
20%
10%
0%
all stops all fricatives stop-fric
variation
Pronunciation of 'potential fricative' within lexical forms
variation was occurring and the nature of these forms. The lexical forms analysed were
selected based on the occurrence of fricatives in their English cognates. In the following
analyses we use the term ‘variable words’ to refer to the Gurindji Kriol words that
contained stop-fricative variation in the current dataset. ‘All stops’ refers to the words
that have fricatives in their English cognates but were always pronounced with a stop of
affricate in the data, and ‘all fricatives’ are the words that were always pronounced with
a fricative. In the next analysis we examined all tokens of words containing stop-
fricative variation, i.e. the ‘variable words’, and analysed the relationships between the
The following analyses are of tokens the ‘variable words’, where the potential
fricative was pronounced variably as a fricative or stop in the sample, and their
relationships with the corresponding fricatives in the English cognate words. Cross-
tabulations are presented of the fricatives in English cognates and the corresponding
segments in Gurindji Kriol in word-initial, -medial and -final positions. To reiterate the
terminology used for the purpose of this study: the Gurindji Kriol phones that are the
focus of investigation are referred to as ‘potential fricatives’, and these are in Kriol-
derived words whose English cognates contain fricatives. The corresponding segments
of Gurindji Kriol potential fricatives in the English cognate words are referred to here as
‘cognate fricatives’, and are part of the analyses as a basis for calculating proportional
variation.
N=156/366), for example the initial consonant in bijin ‘fish, <fishing’. They were
pronounced variably as labio-dental fricatives (N=63) and bilabial stops [p, b] (N=91).
fricative [s] in English cognate words (28%, N=101/366), for example ji ‘see’. These
were pronounced as palatal stops [c, ɟ] (41%, N=41/101), affricates [ʧ, ʤ] (12%,
N=12/101) and were also often pronounced as the fricative [s] in Gurindji Kriol words
(36%, N=36). ‘Absent’ [s] is represented by the symbol [Ø] (N=9), and word-initially
refers to segments that are the cluster [st] in English words pronounced as a stop
CHAPTER 4 101
without the fricative in Gurindji Kriol words (e.g. Gurindji Kriol top 'to be', English
cognate stop).
closed-class words, particularly demonstratives e.g. dat ‘the, <that’, jeya ‘there’, which
have high frequencies in conversational speech. As Table 4.1 shows, dental fricatives in
these words are pronounced with a palatal stop [c, ɟ], the voiced alveolar stop [d], or an
affricate [ʧ, ʤ]. Word-initial dental fricatives in English cognates were rarely
palato-alveolar fricative [ʃ] in cognate words is also pronounced with a range of phones
in corresponding Gurindji Kriol words: as palatal stops and affricates (N=9/19) and as
Kriol. The most frequent pronounced fricatives were [f] (N=44/113), [s] (N=36/113)
and [v] (N=19/113). Palatal fricatives also occurred (N=11), and there were some rare
cognate words are pronounced with a voiced palatal stop [ɟ] in corresponding Gurindji
Kriol variable words (N=62/143). These included words where the cognate fricative was
alveolar, e.g. dijei (this way), dental e.g. nojing (nothing) and palatal e.g. wajing
Gurindji Kriol potential fricatives were [s] (27%, N=39/143), [ð] (24%, N=34/143), and
[ʃ] (19%, N=27/143). The alveolar cognate fricative [s] was pronounced variably as a
stop (N=22/39) and a fricative (N=14/39). Dental cognate fricatives [θ, ð] were mostly
CHAPTER 4 102
pronounced as stops (N=36/49) in the corresponding Gurindji Kriol words, and when it
was a fricative it was mostly voiced palatal [ʒ], e.g. nojing (nothing) [nɐʒ ŋ]. The
voiceless palatal fricative [ʃ] in cognates was pronounced most frequently as the voiced
palatal fricative [ʒ] (56%, N=15/27), for example majin (machine) [mɐʒ n]. Labio-
dental fricatives in cognate words, for example libim (leave) were more often
variable words.
Kriol words containing stop-fricative variation were pronounced as fricatives, with the
shown in Table 4.4. The most frequent cognate fricative, word-final alveolar [s] e.g.
pleis (place) was most frequently pronounced in Gurindji Kriol as a voiceless palatal
stop [c] (48%, N=21/44), followed by other stops or affricates (N=18), and then
alveolar fricatives [s, z] (N=5). A similar pattern was found for cognate palatal
fricatives [ʃ], e.g. word-final consonant in binij ‘finish’, which was also most often
pronounced with a palatal stop [c, ɟ] in Gurindji Kriol (75%, N=18/24), with a few cases
fricatives [f, v] in English cognates were usually pronounced with bilabial stops in
Gurindji Kriol variable words (78%, N=18/23). There were only 4 cases of final [st]
clusters, but these were pronounced variably as stops [c] in bej ‘first’ and bust-im-ap
than in initial and medial positions, and they were also less frequently pronounced as
Gurindji Kriol words that contain stop-fricative and the corresponding fricatives in their
English cognate words. Results gave further insight into the characteristics of segments
that vary in the words described in Analysis 1. Findings of the first two analyses open
fricative or a stop. To address this question, in the Analysis 3 we examined the relative
The next analyses were performed separately on tokens of words that contain
stop-fricative variation and on all tokens in the dataset. In both cases the potential
fricatives were coded according to whether they were pronounced as a fricative or other
sound. To provide information on the characteristics of all potential fricatives (not just
those in variable words as in Analysis 2), Figure 4.4 shows the proportions of potential
fricatives pronounced as fricatives and other phones. The figure displays the differences
in word positions, particularly that fricatives were more likely word-initially. It also
shows that alveolar and palato-alveolar fricatives, and to a lesser extent voiceless labio-
Gurindji Kriol than other places of articulation. Conversely, dental fricatives in English
cognates were the least likely to be fricatives in Gurindji Kriol words, though as shown
Frequencies of word-initial fricatives in English cognates and corresponding phones in Gurindji Kriol variable words
s - 29 12 - 36 - - - 10 - 9 - 2 1 2 - - - - 101
ð - 22 31 - - - 5 - 2 1 - - - - - 2 - - 1 64
θ - - - - - - 14 - 1 9 - - - - - - - 2 - 26
ʃ - 2 3 - - - - - - - - 8 4 2 - - - - - 19
v 3 - - - - 2 - - - - - - - - - - - - - 5
Total 62 53 46 44 36 29 19 19 13 10 9 8 6 3 2 2 2 2 1 366
Table 4.3
Frequencies of word-medial fricatives in English cognates and corresponding phones in Gurindji Kriol variable words
ð 23 9 - - - - - - - 2 - - - - 34
ʃ 10 15 - 2 - - - - - - - - - - 27
f - - 7 - - 4 1 - 3 - - - - - 15
θ 13 1 - - - - - - - - - - 1 - 15
v - - 8 - - - 3 - - - 2 - - - 13
Total 62 33 15 6 6 4 4 3 3 2 2 1 1 1 143
Table 4.4
Frequencies of word-final fricatives in English cognates and corresponding phones in Gurindji Kriol variable words
ʃ 14 - - 4 - - - 2 3 - - - - - 1 24
v - 3 - - 7 - - - - 1 - - - 1 - 12
f - 8 - - - - - - - 2 - 1 - - - 11
ðz 3 - - - - - - - - - 1 - - - - 4
st 2 - - - - 1 - - - - - - 1 - - 4
ð 1 - - - - - - - - - 1 - - - - 2
Total 41 11 9 8 7 5 4 3 3 3 3 1 1 1 1 103
100%
90%
80%
70%
60%
50%
40%
stop
30% fricative
20%
10%
0%
initial
initial
initial
initial
initial
initial
initial
medial
medial
medial
medial
medial
medial
medial
s z f v ð θ ʃ
Phone in English cognates
Figure 4.4. Relationships between fricatives in English cognates and corresponding pronunciation in Gurindji Kriol words.
CHAPTER 4 108
in maternal Gurindji Kriol. The relative contributions of child age and phonological
regression models for each word position. The outcome variable was pronunciation of
‘potential fricatives’ in Gurindji Kriol (fricative, stops) and the predictor variables were
approximate child age (1;6, 2;0, 2;6) and phonological environment. For each word
position analyses were first performed on all tokens in the dataset to analyse the global
fricatives or stops. The same model was then run just on the tokens in the words that
were found to contain stop-fricative variation in the current dataset (see Analysis 1), to
investigate whether within these words there was an effect of child age on the form used
(i.e. with the fricative or stop). Cell counts were too low to include speaker as a
predictor, thus speaker was included as a random factor in all analyses and inter-speaker
phone preceding word-initial potential fricatives. These were classified into fricatives,
stops, nasals, liquids, vowels, and utterance boundary or pause. There were no
preceding glides in this data. Two cases were excluded from the following analysis
All tokens. Table 4.5 shows the results of the logistic regression for word-initial
position. Results indicate a significant effect of child age 2;6, Wald χ2 (1) = 6.897,
higher at child age 2;6 than 1;6 (odds ratio 0.630). There was also a significant effect of
phonological environment when the preceding phone was a fricative, Wald χ2 (1) =
CHAPTER 4 109
5.210, p=0.022, vowel, Wald χ2 (1) = 17.075, p<0.001, and pause or utterance boundary,
Wald χ2 (1) = 6.109, p=0.013. The likelihood of a potential fricative being pronounced
as a fricative rather than a stop increased when the preceding phone was a fricative
(odds ratio 0.185), vowel (odds ratio 0.385), and pause or utterance boundary (odds
Table 4.5
Logistic regression results for word-initial fricative variation in all words
95% C. I. for
Exp (B)
B s.e. Wald df p Exp(B)
Lower Upper
Child Age
Preceding Phone
Pause or
-0.597 0.242 6.109 1 0.013 0.551 0.343 0.884
Boundary
The reference group for Child Age is 1;6 and for Preceding Phone is Stop
Tokens in variable words. A logistic regression model with the same variables
as used in the above model was also performed on word-initial potential fricatives that
only occurred in variable words. Results are shown in Table 4.6. The were no
CHAPTER 4 110
significant effects of child age, although the pattern was consistent with the analysis
above as the odds ratios were in the same direction (compared to age 1;6 odds ratio
1.585 for age 2;0, 0.713 for age 2;6). Results for phonological environment were also
consistent; however the only significant effect was when the preceding phone was a
vowel, Wald χ2 (1) = 6.185, p=0.013. Compared to when the preceding phone was a
stop, a fricative was more likely when the preceding phone was a vowel (odds ratio
0.363).
Table 4.6
Logistic regression results for word-initial fricative variation in variable words
95% C. I. for
Exp (B)
B s.e. Wald df p Exp(B)
Lower Upper
Child Age
Preceding Phone
Pause or
-0.560 0.434 1.665 1 0.197 0.571 0.244 1.337
Boundary
The reference group for Child Age is 1;6 and for Preceding Phone is Stop
CHAPTER 4 111
categorised into four groups: intervocalic (V_V) e.g. pijinglain 'fishingline', consonant
cluster preceded by a vowel (V_C) e.g. laswan 'last', consonant cluster followed by a
vowel (C_V) e.g. insaid 'inside', and triple consonant cluster (C_C) e.g. dragonflai
'dragonfly'. To test the hypothesis that word-medial fricatives would be more frequent
in intervocalic environments, V_C was used as the reference group, as fricatives and
other phones occurred almost equally in this environment (N= 26 fricatives, 27 other).
All tokens. The results for the word-medial logistic regression are shown in
Table 4.7. There were no significant effects of child age or phonological environment
Table 4.7
Logistic regression results for word-medial fricative variation in all words
95% C. I. for
Exp (B)
B s.e. Wald df p Exp(B)
Lower Upper
Child Age
Environment
Tokens in variable words. The results for the word-medial logistic regression of
tokens in variable words are shown in Table 4.8. Results indicate a significant effect of
child ages 2;0 and 2;6 compared to age 1;6, Wald χ2 (1) = 5.646, p=0.018 and Wald χ2
(1) = 4.530, p=0.033, respectively. The likelihood of a fricative occurring over a stop
medially in variable words was higher at child age 2;0 (odds ratio 0.250) and 2;6 (odds
ratio 0.368) than 1;6. There was no significant effect of phonological environment
medially in tokens of variable words (note there were not enough C_C cases in variable
Table 4.8
Logistic regression results for word-medial fricative variation in variable words
95% C. I. for
Exp (B)
B s.e. Wald df p Exp(B)
Lower Upper
Child Age
Environment
phone following word-final potential fricatives. These were classified into fricatives,
stops, nasals, glides and liquids, vowels, and utterance boundary or pause.
CHAPTER 4 113
All tokens. Table 4.9 shows the results of the logistic regression for word-final
for nasals, Wald χ2 (1) = 3.593, p=0.058, indicating fricatives decreased in likelihood
word-finally when the following phone was a nasal (odds ratio 3.094).
Table 4.9
Logistic regression results for word-final fricative variation in all words
95% C. I. for
Exp (B)
B s.e. Wald df p Exp(B)
Lower Upper
Child Age
Following Phone
Liquid
-1.635 0.968 2.855 1 0.091 0.195 0.029 1.299
or Glide
Pause or
0.132 0.476 0.077 1 0.782 1.141 0.448 2.902
Boundary
The reference group for Child Age is 1;6 and for Following Phone is Stop
Tokens in variable words. Table 4.10 shows the results of the logistic regression
for word-final position. There were no significant effects of child age or phonological
Table 4.10
Logistic regression results for word-final fricative variation in variable words
95% C. I. for
Exp (B)
B s.e. Wald df p Exp(B)
Lower Upper
Child Age
Following Phone
Liquid
-1.904 1.589 1.435 1 0.231 0.149 0.007 3.356
or Glide
Pause or
-0.529 0.757 0.488 1 0.485 0.589 0.134 2.599
Boundary
The reference group for Child Age is 1;6 and for Following Phone is Stop
as fricatives for each speaker across child ages are displayed in Figure 4.5. This
included all tokens and indicates individual speaker variability in fricative production.
The same analysis performed just on tokens of words containing stop-fricative variation
showed the same pattern for each speaker, indicating the interspeaker differences in
Figure 4.7 are the same for both within-word and between-word stop-fricative variation.
The increase in fricatives when children are 2;6 appears to be driven by SS and CE, as
AR’s fricatives decrease from 2;0 to 2;6 (although note there were fewer tokens across
all ages for AR than the other two speakers). From child ages 1;6 to 2;0 CE’s use of
CHAPTER 4 115
fricatives decreases, while SS’s remains about the same and AR’s increases. At age 2;6
less.
0.40
0.35
Proportion of fricatives to stops
0.30
0.25
0.20 SS
0.15 CE
AR
0.10
0.05
0.00
1;6 2;0 2;6
Average child age
accounting for phonological environment and word position. Results suggested that
word-initially there is an effect of child age on all potential fricatives as well as just in
those in words that contain stop-fricative variation in the current dataset. Word-medially
the child age effect was only found in words containing variation. In the fourth and final
analysis we further investigated the change in variation across child age by examining
1 across child ages, displayed in Figure 4.6. Between child ages 1;6 and 2;0 the words
with stop-fricative variation (‘variable words’) and the words invariably pronounced
with fricatives (‘all fricatives’) proportionately decreased, while words where the
potential fricative was always pronounced with a stop or affricate (‘all stops’)
proportionately increased. Between child ages 2;0 and 2;6 the ‘all fricatives’ words
remained proportionately low, while the proportions of the ‘all stops’ words decreased
and the ‘variable words’ increased. Thus mothers were using relatively more words
containing stop-fricative variation when children were 2;6 than when they were 2;0 or
1;6.
0.70
0.60
Proportions of each word category
0.50
0.40
all stops
0.30
all fricatives
variable words
0.20
0.10
0.00
1;6 2;0 2;6
Average child age
Figure 4.7 further illuminates the change in variation as children aged, showing
tokens of words containing stop-fricative variation, tokens of all other words, and all
tokens together (note the y-axis is of a different scale to that in Figure 4.6). Between
child ages 1;6 and 2;0 the overall proportions of fricatives decreased, however the
Taken with the proportional decrease in the frequency of variable words between 1;6
and 2;0 shown in Figure 4.5, this indicates that although mothers use relatively fewer
variable words when children are 2;0 compared with 1;6, they are more likely to
0.40
0.35
Proportion of fricatives to stops
0.30
0.25
tokens of variable
0.20 words
tokens of other words
0.15
all tokens
0.10
0.05
0.00
1;6 2;0 2;6
Average child age
Between child ages 2;0 and 2;6, Figure 4.7 shows a proportional increase in
fricatives in both variable words and in other words. Thus when children are 2;6,
mothers are both more likely to use words containing stop-fricative variation and are
more likely to pronounce those words with fricatives compared to when children are 2;0
or 1;6. Fricatives in other words are also relatively more frequent at age 2;6 than 2;0 and
1;6. This is not due to an increase in words that are always pronounced with fricatives,
as Figure 4.6 indicates these words remain proportionately low across all three time
points. Rather, it is due to an increase in words pronounced with fricatives that were not
coded as either variable words or ‘all fricatives’ words as there was only one token in
4.5 Discussion
from the variable lexical forms of Kriol-based words, children are exposed to stop-
fricative variability that changes differentially in mothers’ speech as children age from
approximately 1;6 to 2;0 to 2;6, due to changes in mothers’ choice of both lexical forms
within lexical forms and the nature of this variation. Words with multiple tokens in the
dataset were grouped according to whether the potential fricative was always a fricative,
showed that each group of words had different characteristics. Overall the ‘all stops’
words tended to be relatively high frequency with a near even split between open and
closed class words. The ‘potential fricative’ (the corresponding segment in the English
cognate word is a fricative) in these words was fairly evenly distributed across word
CHAPTER 4 119
positions and the highest proportion of them were dentals, followed by labio-dentals
and then palato-alveolars. The ‘all-stops’ group of words contained most of the Kriol-
derived closed-class words that have fricatives in their English cognates, including the
high frequency articles and demonstratives. It remains unclear, however, exactly which
factors or interactions contribute to lexical items always being pronounced with a stop
rather than a fricative, as it is not possible to isolate each factor. For example, dentals
are rare in open-class words but have very high frequency due to the high frequency of
frequency. It is interesting to note that some of these words contained two potential
fricatives where one was always pronounced as a stop and the other was variable, for
example the initial and medial consonants in dijan 'this': the word-initial phone was
always pronounced with a stop, while the word-medial phone was variable between a
stop and a fricative. However this did not occur the other way round (i.e. the initial
phone variable and the medial phone always a stop) so it could also be a word-position
effect. Whether potential fricatives are always pronounced with stops or contain stop-
fricative variation is also likely related to the length of time that they have been in the
Gurindji Kriol lexicon and the time it takes for conventional pronunciations of words to
develop.
In contrast, words where fricatives were not variable with stops or affricates
were nearly all open-class words. Fricatives occurred in all word positions, most
frequently initial and least frequently final. They occurred in all places of articulation,
although the majority were alveolar. Word-initial [s] thus occurred in a relatively large
proportion of the ‘all fricatives’ words. Most of the words in this group (see appendix)
were nouns and included some consonant clusters, e.g. flai ‘fly’, verbs e.g. stirimraun
CHAPTER 4 120
‘stir’, recent borrowings e.g. fens ‘fence’, and some English code-switches e.g. swim
‘swim’.
Words containing stop-fricative variation tended to fall between the ‘all stops’
and ‘all fricatives’ words in most categories. Variable words were mostly open-class
words, though there were more closed-class words in the variable category than the all
fricatives category. The proportions of word-position of the potential fricative for the
words containing variation were also in-between those for the ‘all stops’ and ‘all
For place of articulation there were more labio-dentals and palato-alveolars in the words
containing variation than in the non-variable (all stops or all fricatives) words. Words in
the variable group contained relatively common nouns such as bijinlain ‘fishingline’
and jinek ‘snake’, verbs, e.g. libim ‘leave’, lijan ‘listen’, and closed-class words, e.g.
Impressionistically it is possible that these findings are associated with the time
at which words were adopted into Gurindji Kriol, with target phone in early adopted
words more likely to always pronounced with a stop, in the more recent words to
always be a fricative, and in the variable words to be in-between, but the data does not
cognates, accounting for word position. Analysis 2 findings suggested that while some
words may be pronounced with the English manner of articulation it is not necessarily
same phone in the cognate word that is produced. For example the medial sound in
pronounced variably with a palatal stop [nɐɟæn], but when it is a fricative it is palato-
alveolar [nɐʒæn] rather than dental. These findings open up avenues of further research
In analyses three and four we analysed the effect of child age on fricative
interspeaker variation, the effect of child age was greatest in word-initial position, and
the effect was also significant word-medially in tokens of words that contain stop-
real effect, for example mothers modifying word-initial segments due to the greater
perceptual saliency of segments in this position for acquisition (Bent, Bradlow & Smith,
2007; Vihman, Nakai, DePaolis & Hallé, 2004), or simply an analysis artefact, as there
were many more tokens with potential fricatives in word-initial position than in medial
or final positions. With a much larger sample it may be possible to randomly sample a
comparable number of target segments from each word position, however, in the current
corpus this was not possible and we would still not be able to make direct comparisons
according to the children’s age14. This was partly due to lexical choice. Tokens of words
proportionately low across the three child ages, which was not surprising given these
14
The apparent effect of change over time may be within lexical items or due to the
mothers' use of different items as the children get older. This is an analysis which we
will include in the published version of the article. Interested readers are encouraged to
look up and should cite the published version of this paper as a journal article, rather
than this chapter.
CHAPTER 4 122
words had a lower than average frequency. In contrast, tokens of words containing no
fricatives and stop-fricative variation did not change much in proportion between child
ages 1;6 and 2;0, with a greater proportion of ‘all stops’ words in mothers’ speech at
both ages, while between ages 2;0 and 2;6 the proportion of ‘all stops’ words decreased
as the words containing variation increased. Thus compared to age 1;6 and 2;0, when
children were 2;6 mothers were using proportionately more words that could be
pronounced variably with a stop or a fricative. The change in lexical choice may be
The total proportions of fricatives to stops in mothers’ speech also changed over
time but followed a different pattern to the change in use of words containing stop-
fricative variation, suggesting the effect of child age is only partly due to changes in the
types of words mothers used. At 1;6 the proportions of fricatives to stops was about the
same in tokens of variable words and other words. Between child ages 2;0 and 2;6 the
words. In the other words the increase was likely due to the inclusion of words that only
had one token in the dataset as many of these were low frequency words that were
pronounced with a fricative. The increase of fricatives in the variable words indicates
that not only were mothers using more words containing stop-fricative variation at 2;6
than at the earlier child ages, but also that they were more likely to use a fricative in
differential segmental modifications according to child age and the change between
approximate ages 1;6 to 2;6 may be in response to the child’s own language
CHAPTER 4 123
variation in child-directed speech gradually changes over time to become like adult-
directed speech, possibly to introduce children to the variation typical in adult speech in
the community (Foulkes et al., 2005; Smith, Durham & Fortune, 2009). Recent studies
have found variation in mothers’ speech may change differentially over time, with
mothers possibly fine-tuning their speech to the child’s age and language abilities
(Buchan & Jones, in press; Ko, 2012). This is a plausible explanation for the child age
effect in the current study, which found that word-initially and -medially the likelihood
of mothers using fricatives over stops increased as children got older. These findings
support previous studies on English and Korean child-directed speech (Lee & Davis,
2008; Lee et al., 2010), which found proportionately more stops and fewer fricatives in
fricatives tend to be acquired later by children than stops, as fricatives are relatively
difficulty to articulate and require precise motor coordination (Dinnsen, 1992; Kent,
1992). Thus when children were younger mothers may have chosen (consciously or not)
to increase their use of phonetic forms containing fricatives as children got older and
The interpretations here are tentative, as there are limitations to this study and
further research is needed to find out what is causing these effects. The current findings
are only based on data from three speakers and there was interspeaker variation,
although this is consistent with the child-directed speech literature showing relatively
large individual differences in language development (e.g. Ko, 2012). It would be useful
for future studies to compare segmental distributions in adult-directed speech with those
involving family groups; however, it would provide further insight into whether the
CHAPTER 4 124
effect of change over time in mothers’ speech found here represents a change toward
adult-directed speech, thus serving the function of exposing children to the type of
Australian English15
15
This chapter has been published as a journal article. To cite, please look up and use the following
citation:
Buchan, H. & Jones, C. (2013). Phonological reduction in maternal speech in northern Australian
English: Change over time. Journal of Child Language, doi: 10.1017/S0305000913000123.
All material in this chapter (Chapter 5) is copyright of Cambridge University Press.
CHAPTER 5 126
5.1 Abstract
child interactions when children (N=4) were 1;6, 2;0 and 2;6. The mothers’ speech was
older, it was predicted that deletion of word-initial /h/ and word-final /v/, processes
common in non-citation speech, would increase over time. Instead results showed a
proportionately increased between 1;6 and 2;0 and decreased between 2;0 and 2;6.
Further analysis indicated that increased deletion was not accounted for by changes in
speech rate, which only marginally increased over time. Findings suggest mothers fine-
tune differentially over time as children’s receptive and productive language knowledge
develops.
CHAPTER 5 127
5.2 Background
language, and their phonological development includes learning the systematic phonetic
variation that is a part of everyday speech. Variation in the speech input that children
are often exposed to, known as infant- or child-directed speech, differs from that in
interadult speech (adult speech directed to other adults) in lexical and syntactic
structures as well as in phonetic features such as tempo, pitch and frequency (Fernald,
children becomes more like interadult speech as children get older. For example it has
been found cross-linguistically that mothers tend to use hyperarticulated vowels when
talking to their infants aged 0;2-0;5 compared with their speech to other adults (Kuhl,
children were prelinguistic, aged 0;7-1;0, and at age 5;0-5;9 (Liu, Tsao & Kuhl, 2009).
Further, vowels were more hyperarticulated to children at 0;7-1;0 than at 5;0-5;9, in that
they were longer, had a higher pitch, and the vowel space was larger so vowels were
more distinct from one another, suggesting mothers modify their speech differently as
Few studies have examined segmental variation and changes in maternal speech
after infancy and even fewer have used longitudinal data. Much of the literature on
speech input to children has focussed on the first year of life, as this is a crucial period
for children’s language development (Jusczyk, Cutler & Redanz, 1993; Jusczyk,
Friederici, Wessels, Svenkerud & Jusczyk, 1993; Jusczyk, Luce & Charles-Luce, 1994;
CHAPTER 5 128
Kuhl, Williams, Lacerda, Stevens & Lindblom, 1992). Research has found differences
in segmental distributions between mothers’ speech to children aged 1;0 and to adults
(Lee & Davis, 2010; Lee, Davis & MacNeilage, 2008), indicating segment patterns in
maternal speech change sometime after the first year of life to eventually develop into
adult-directed speech.
develops into adult-directed and at what ages the changes take place. Early research on
segmental differences between child- and adult-directed speech suggests that the period
between child ages 1;0 and 2;6 is a time of extensive variability in the input, and
changes around these ages. Findings of these early studies were mixed in regards to the
type of variation and the nature of the phonological differences in speech directed to
children and to adults. However, the age of the children varied between studies and
relatively large age ranges were used within studies. Bernstein-Ratner (1984a, 1984b)
for example found more vowel lengthening, which is a feature of hyperarticulation (Liu
reduced or deleted consonants, so these studies suggested mothers modify their speech
Other early research, however, found that reduction was more frequent in speech
to children than speech to adults (Shockey & Bond, 1980), and that isolated segments
were less intelligible in child-directed than interadult speech (Bard & Anderson, 1983).
A discussion of these early studies by Cruttenden (1994) argued that children’s own
found in speech to children who were prelinguistic or starting to produce single words
while more reduction and less intelligibility than interadult speech was found in speech
to older children, generally over 2;0. A recent study on speaking rate in child-directed
speech also found that age 2;0 is associated with a shift in mothers’ speech. Ko (2012)
analysed mothers’ speech rate in longitudinal data from 25 speakers in the CHILDES
that in general a non-linear function fit the data significantly better than a linear model,
and that the break-point tended to occur around age 2;0 to 2;5. Results showed large
individual differences in the direction of the pattern; however the non-linear shift in
mothers’ speech around child age 2;0 was consistently significant. It is possible that the
mixed findings in the early child-directed speech studies are due to linear or non-linear
changes in mothers’ speech over time, as mothers change their speech depending on
Fortune (2007) found that caregivers’ use of variable phonetic forms changed
constraints appears to vary depending on the type of variable being acquired (Labov,
1989; Roberts, 1997; Youssef, 1991). Kerswill and Williams (2000) suggest that
children gradually acquire stylistic variation over time as they gain sociolinguistic
competence, and Labov (2001) suggests that stylistic variation in caregiver speech plays
may serve different functions at different child ages and stages of language
CHAPTER 5 130
development. For example exaggerated prosodic features in mothers’ speech peak in the
first six months (Stern, Spieker & Barnett, 1983), and this may function to help infants
attend to the speech signal and develop sensitivity to affect expressed in speech
(Cruttenden, 1994). Similarly the apparent peak in hyperarticulation when children start
(Cruttenden, 1994). This explanation relates to the concept of fine-tuning: that mothers
modify their speech according to their perceptions of their child’s receptive and
productive language.
that mothers may also fine-tune to children’s understanding of social categories indexed
in speech, such as age, sex, region and socioeconomic status. Variation in phonetic
Docherty, 2006), and in speech to children this type of variation introduces children to
phonetic forms associated with speakers’ identities and social groups in the speech
community. The few studies that have examined sociophonetic variation in caregiver
speech suggest that it does change as children age. Smith, Durham and Fortune (2009)
analysed mothers’ speech to children aged 2;11 to 3;11 in a Scottish dialect of English.
common in non-citation inter-adult speech across English dialects that has been studied
widely as a sociolinguistic marker (for a review see Tagliamonte & Temple, 2005).
Findings showed that rates of t/d deletion tended to be higher in speech to the older than
Sociophonetic variation associated with regional vernacular has also been shown
to change over time in mothers’ speech to children after infancy. Foulkes, Docherty and
CHAPTER 5 131
between 2;0 and 4;0 in a non-standard variety of English from Tyneside in northeastern
Pronunciation of the segment as [t] was considered standard, while [] and glottals in
these contexts were considered non-standard variants that are characteristic of the
region and may not be held in high regard outside the speech community. Foulkes et al.
(2005) found that overall, child-directed speech contained a higher proportion of the
standard [t] variant compared with interadult speech. Standard [t] was relatively less
frequent in speech to older than younger children while the non-standard variants were
relatively more frequent in speech to the older children. It is suggested that phonetic
variation in mothers’ speech changes over time to gradually familiarise children with
the alternate phonetic forms common in interadult non-citation speech, and that
& Docherty, 2006; Foulkes et al., 2005). These studies on sociophonetic variation in
speech to children indicate that mothers may fine-tune their speech to their perceptions
such as Pierrehumbert’s (2003) model. According to this approach all tokens contain
concurrently for each exemplar and associated with frequency distributions (Foulkes &
Docherty, 2006; Pierrehumbert, 2003). Munson, Edwards and Beckman (in press)
CHAPTER 5 132
suggest that learning to map sociophonetic variation involves developing another level
of representation for socioindexical information. This level may provide another source
in the speech signal (Munson et al., in press). The strength of representations may
depend on the frequency and recency of exemplars (Pierrehumbert, 2003), that is the
stored memories of tokens, and so variable input over time during early childhood could
connected speech, depending on linguistic factors and social economic status (SES)
(Chevrot, Dugua & Fayol, 2009; Chevrot, Nardy & Barbu, 2011). SES differences in
children’s productions of liaison consonants reflect those in adult speech and increase as
children age from 2;0 to 6;0 (Chevrot et al., 2011). Chevrot et al. (2009) suggest that
children acquire variation by first encoding multiple exemplars of the variable lexical
forms in the input and then abstracting information from the exemplar store to form
schemas.
variation associated with speech acts and social interactions. Phonetic variation can be
associated with style shifting, which is learned throughout childhood and related to
caregiver usage (Kerswill & Williams, 2000). Mothers’ use of either standard or local
English, with more vernacular variants used in the informal contexts of play and routine
speech than the formal contexts of discipline and teaching (Smith et al., 2007).
in life. Phonetic modifications made in maternal speech change over time as child-
directed speech develops into interadult speech, but there is relatively little literature on
how and when this happens as children get older. There is some evidence that infant-
and child-directed speech has different functions depending on the age and linguistic
development of the child, and this may drive changes in variation in the input over time
productive language. The few studies that have examined variation in maternal speech
to children after infancy suggest mothers may also shape modifications in their speech
to their perceptions of children’s knowledge of socioindexical factors, and there are also
mothers’ speech to children from ages 1;6 to 2;6 in a regional variety of Australian
English. To get a clearer understanding of when mothers might start using more
processes in non-citation speech, this study started with children of a younger age than
those in the Foulkes et al. (2005) study, in which children were aged 2;0 to 4;0, and the
Smith et al. (2009) study, in which children were aged between 2;11 and 3;11. In the
although regional dialects have generally not been studied in detail. Standard Australian
English differs from other standard English varieties in realisation of many of the
Cox & Evans, 1997). In a recent description of the main phonological features of
Australian English, Standard Australian English is described as, ‘the dominant dialect
CHAPTER 5 134
used by the vast majority of speakers’ (Cox & Palethorpe, 2007, p. 341) in Australia
today. Being a multicultural country there are many other dialects spoken, which have
Australian English’ categories (Cox & Palethorpe, 2007). There may also be varieties
differing on other dimensions such as geographic region, but this has not been studied
in depth owing to a historical perception that there is relatively low regional variation in
Australian English.
region and other socioeconomic factors (Cox & Palethorpe, 2010). Relatively less
research into Australian English has focused on consonants (Ingram, 1989; Tollfree,
processes. Horvath’s (1985) study included ‘h dropping’ (/h/ deletion) and Ingram’s
status and gender; connected speech processes (assimilation, elision) are also socially
correlated and were one of five variables differentiating the ‘classical’ accent classes in
Australian English: broad, general, and cultivated (Mitchell & Delbridge, 1965).
117 speakers in Sydney, New South Wales. Data were from the Sydney Urban Dialect
four main ‘sociolects’ based on sex, age, socioeconomic status, and ethnicity, and
CHAPTER 5 135
analysed the frequency of six consonant processes to compare the incidence of these
processes in the ‘sociolect’ subgroups. One consonant process she examined was word-
Along with other consonant processes considered typical of ‘broad’ Australian English
speakers, word-initial /h/ deletion was generally more frequent in males, teenagers, and
connected speech process is the articulatory simplification that often occurs in fast non-
citation speech, leading to phonetic assimilation and reduction (2). The frequency of
connected speech processes account for much of the phonological differences between
citation and non-citation speech, and vary with speech rate, register, and ‘broadness’
(Ingram, 1989).
One consonant process that Ingram (1989) analysed was cluster reduction, and it
was noted that cluster reduction largely affected word-final /nt/, /nd/ and /st/ clusters.
Ingram (1989) found that the frequency of consonant cluster reduction was relatively
high in both working class and middle class adolescents, and there was no significant
difference between the two groups. There was however a significant difference between
working class and middle class adolescents in the frequency of other speech processes.
Deletion of /h/ occurred significantly more often in speech of the working class than the
middle class participants, and was most common in third person pronoun forms. The
CHAPTER 5 136
working class group were also significantly more likely to delete schwas (e.g. ‘bout for
about) and word-initial interdental fricatives (e.g. ‘em for them). These findings show
speakers. In the present study we consider word-initial /h/ deletion and the speech
English. In the absence of sociolinguistic evidence for this dialect, we refer to these as
about 317 kilometres south of Darwin, the Northern Territory capital. The English
spoken in Katherine, which we term Katherine English, was sampled for the current
study as a variety of regional Australian English. This language variety has been the
subject of just one previous study (Jones, Meakins, & Buchan, 2011), and no
sociolinguistic setting. Terms that speakers used to describe Katherine English included
‘ocker’ (slang for a stereotypically uncultivated Australian), Speakers also say – we lack
objective data – that their words tend to be simpler and not as long, and that the speech
of Southerners (people from outside the Northern Territory or city people) is ‘fuller’ in
that they use longer words, more words, and fewer shortened versions. Overall speakers
speech.
CHAPTER 5 137
The aim of this study was to examine how phonetic variation related to non-
children age. We collected and analysed a corpus of naturalistic speech from mothers
when their children were aged 1;6, 2;0 and 2;6. This study is part of a larger project on
Katherine English and an Aboriginal contact language variety in the region, Gurindji
Kriol. Gurindji Kriol contains extensive fricative variation in Kriol-derived words, i.e.
of English historical origin (Meakins, 2011). Because fricative variation is the focus of
the larger project, in this study we examined variants in fricatives that are related to
processes in non-citation speech. Word position effects are also being investigated in
the Aboriginal language variety, and so in this study we analysed a word-initial and a
word-final fricative.
The phonetic variants we chose to measure were word-initial /h/ deletion (3a, b)
based on the previous findings from Horvath (1985) and Ingram (1989), and word-final
/v/ deletion (4a, b), which are regarded as processes that occur in non-citation speech
standard form
containing deletion
We analysed deletion across function and content words. Previous research has
found that phonological information is important for classifying word class, and that
phonological reduction is more likely to occur in function than content words (Cutler,
1993; Monaghan, Chater & Christiansen, 2005). It was therefore expected that overall
In this study maternal speech is the main source of speech input for the target
children and is defined as all speech spoken by the mother around the target child. This
is based on Soderstrom’s (2007) definition of speech input to children and studies that
have found children learn from overhearing interadult speech as well as speech directed
specifically to them (Akhtar, 2005; Au, Knightly, Jun & Oh, 2002; Martinez-Sussman,
gradually changes to be more like interadult speech over time (Foulkes et al., 2005;
Smith et al., 2009), it was predicted that mothers would be more likely to use the
5.3 Method
5.3.1 Participants
Four mothers and their children participated in the recording sessions, which
were conducted at three stages six months apart when the children were aged
approximately 1;6, 2;0 and 2;6. Exact ages are shown in Table 5.1. A fifth mother
participated but her child was six months older than the others so her data are not
included in this study, as she was only recorded at two of the timepoints under
Northern Territory, Australia. Three mothers had grown up in Katherine, while the
CHAPTER 5 139
The highest educational attainment was a Bachelor degree (two mothers) and secondary
school (two mothers). Mothers were recruited through local childcare centres and other
Two of the target children had siblings, and there was also some participation in
the sessions from other family members and visitors. Mothers were instructed to focus
on the target child, so mothers’ interactions with other people were generally minimal.
Table 5.1 shows the target children’s sex, their ages at each stage and the mothers’
education levels, and is followed by a brief description of the dyads and other people
present in recording sessions. Only the mothers’ speech was analysed for this study.
Cathy and Lucy. Lucy was Cathy’s only child. Lucy’s father occasionally
participated in recordings, but in the majority of the sessions Cathy and Lucy were the
only participants.
Kim and George. George had two older siblings, a boy and girl aged 6;3 and
3;0 respectively at Stage 1. One or both of the siblings participated in most of the
sessions. Kim was asked to focus on George in the sessions, so generally the siblings
played together while Kim played with George. The children’s father participated
occasionally.
Kellie and Tyson. Tyson was an only child. Tyson’s grandmother was present
in most of the sessions. Other adult family members were sometimes in the vicinity but
Erin and Grace. Grace had two younger brothers: one aged 0;5 at Stage 1, and
another born between Stages 2 and 3. Sessions were held when the younger children
were mostly sleeping, so the majority of interactions were between Erin and Grace. Two
CHAPTER 5 140
adult males who work for Erin and her husband were occasionally present in some of
the sessions.
Table 5.1
Participant Characteristics
Bachelor only
Cathy* Lucy* female 1;5 1;11 2;5
degree child
Secondary youngest
Kim George male 1;5 1;11 2;5
school of three
Secondary only
Kellie Tyson male 1;7 2;1 2;7
school child
Bachelor oldest of
Erin* Grace* female 1;6 2;0 2;8†
degree three
* pseudonyms used at mother’s discretion
† the age difference between Stage 2 and 3 is greater for Grace than the other target
children because these sessions were recorded later due to participant circumstances
5.3.2 Procedure
for the analysis of sound patterns that the children are typically exposed to. Audiovisual
recordings were made in a natural setting, with each mother and child recorded at their
home or at a local park, with care taken to choose quiet environments with minimal
background noise. All the recordings were made by the first author, who spent time
before and after each recording session talking with the participants about both the
research project and unrelated subjects to build rapport. Mothers were aware that the
project was about their speech, and were debriefed on the details after the final
recording session.
CHAPTER 5 141
Before each recording mothers were asked to do free play with their children
and reminded that they could stop the session at any time. Participants played with
materials brought by the researcher and their own toys and games. The most common
activities across children and stages were drawing and painting, and playing with
coloured blocks, toy cars, dolls, and playdough. Other activities included swingsets and
other playground equipment, and general outside activities such as feeding pets.
The mother and target child each wore a wireless, cardioid, condenser lapel
microphone (Sennheiser ME104) that recorded high quality audio (at 48kHz, 24 bit) via
bodypack transmitter to a receiver with XLR connection to a hard disk recorder (Edirol
R-44), in separate audio tracks for mother and child. The mother’s audio track was used
for transcription. The acoustic quality of the recordings was high, and with the cardioid
lapel microphone positioned upwards at the top centre of the mother’s shirt her speech
was recorded clearly on her audio track. As recordings were made in busy naturalistic
environments, occasionally a background noise was too loud to clearly hear the speech.
In these cases speech was marked as unclear in the transcripts and was not analysed. A
example was recorded outdoors in a park by a river and has a dynamic range of 52 dB.
The researcher also filmed sessions with a video camera (Sony AVCHD HDR XR-
conducted with each mother at each stage over one to three weeks, to produce in total
between 8 and 11 hours of recording time at each stage as detailed in Table 5.2. The
Most sessions went for half an hour to an hour in length, and usually ended when the
Figure 5.1. Spectrogram of utterance "There we go, we'll just leave him there, shall
we?"
Table 5.2
Mother child age 1;6 child age 2;0 child age 2;6 Total
Mothers’ speech was transcribed from the audio using the phonological analysis
and transcription software Phon (Rose, Hedlund, Byrne, Wareham & MacWhinney,
2007). Phonetic transcriptions were made of the mothers’ actual pronunciation as well
as the standard Australian English form. Transcription was done by the first author, a
recording at Stage 1 and one hour at Stage 2, made up of a random selection from each
session. The second transcriber was a native Australian English speaker who has a
and 96.88% at Stage 2. The majority of disagreements were about vowel quality, in
particular between the schwa and full vowels (making up 45.93% of total disagreements
at Stage 1 and 49.24% at Stage 2). The two transcribers held consensus discussions to
make decisions about how to transcribe disagreements. Where consensus could not be
reached, a third transcriber (the second author) was consulted to make a decision.
5.3.3 Analysis
The search tool in Phon was used to perform frequency counts of fricatives by
word position. Consonant deletion was quantified by searching for differences in the
standard and actual IPA transcriptions. The transcripts were then searched manually to
determine whether the differences between fricatives in the IPA Actual and IPA Target
tiers were due to speech errors or the mother using a different lexical form (e.g. the
child-directed speech form horsie for horse would show up as a difference in the Target
and Actual word-medial and word-final [s]), or whether it was an apparent systematic
process in the sample. The exclamation hey was excluded from the /h/ deletion analyses.
CHAPTER 5 144
To test whether the frequency of specific speech processes increased over time, the
processes word-initial /h/ deletion and word-final /v/ deletion were analysed further in
SPSS using binary logistic regression. To explore local effects on the processes we
analysed the number and types of lexical items in which deletion occurred, in both
function and content words, using frequency counts at each child age.
over time were related to changes in speech rate. Speech rate in words per second was
analysed for the entire sample at each timepoint, following the procedure outlined and
validated by Ko (2012). A more detailed analysis of speech rate was also performed on
a subset of the data by calculating syllables per second in a random selection of sixty
utterances from each mother at each child age: thirty in which deletion occurred and
5.4 Results
initial /h/ deletion in maternal speech would increase at each time point. Table 5.3
shows the frequencies of word-initial /h/ deletion out of possible contexts at each
average child age, and percentages of deletion for each speaker are in Figure 5.2.
Descriptively there was a proportional increase in deletion between child ages 1;6
(24.65%) and 2;0 (35.11%), and a proportional decrease between 2;0 and 2;6 (24.62%).
CHAPTER 5 145
Table 5.3
Raw Frequencies of Word-Initial /h/ Deletion Out of Possible Contexts by Each Speaker
Figure 5.2. Percentage word-initial /h/ deletion out of word-initial /h/ contexts by child
To test the hypothesis that word-initial /h/ deletion is influenced by child age we
performed a binary logistic regression. The outcome variable was word-initial /h/
deletion and the predictor variables were child age and speaker. Child age (1;6, 2;0 and
2;6) was entered as a repeated contrast making age 2;0 the reference group. Table 5.4
shows the results of logistic regression analysis. The model has good fit to the data χ2
(11)=295.36, p<.001, and accounts for some of the variance (Nagelkerke R2 = 0.10).
Results indicate a significant effect of child age, Wald χ2 (2) = 16.08, p <.001. The odds
of /h/ deletion are lower at child age 1;6 than 2;0 (odds ratio 0.67), and higher at 2;0
than 2;6 (odds ratio 1.42). The hypothesis that deletion would increase as child got older
CHAPTER 5 147
was supported between child ages 1;6 and 2;0, while the decrease in deletion between
Table 5.4
Logistic regression for word-initial /h/ deletion: Child age and speaker
Lower Upper
Child Age
Speaker
The reference group for Child Age is 2;0 (repeated contrast) and for Speaker is Kellie.
There is also a significant effect of speaker on /h/ deletion, Wald χ2 (3) = 191.21,
p <.001. The speaker Kellie was the reference group (total deletion 35.57%), and results
CHAPTER 5 148
show that the odds of deletion were lower for Erin (odds ratio 0.26) and Cathy (odds
ratio 0.34). There was no significant difference in the odds of deletion for Kim (odds
Interactions between child age and speaker were tested in the logistic regression
model to test whether the main effect of age was found across all speakers. Results
show a significant interaction effect, Wald χ2 (6) = 46.80, p <.001. As shown in Table
5.4 there were three significant interactions: Cathy by age 1;6, Kim by age 1;6, and Kim
by age 2;6. Figure 5.2 shows that Kim displays a v-pattern of deletion over time, a
decrease in deletion from 1;6 to 2;0 followed by an increase from 2;0 to 2;6, while
Kellie, Cathy and Erin display an inverted v-pattern, that is an increase from 1;6 to 2;0
followed by a decrease from 2;0 to 2;6. Between child ages 1;6 and 2;0 /h/ deletion was
significantly more likely to increase for the reference group Kellie than for Kim (odds
ratio 3.39). Between child ages 2;0 and 2;6 the odds of /h/ deletion were significantly
more likely to decrease for Kellie than Kim (odds ratio 0.32). The interaction of Cathy
by child age 1;6 shows that the increase in the odds of /h/ deletion between ages 1;6 and
2;0 was significantly higher for Kellie than Cathy (odds ratio 1.88).
Word-final /v/ deletion. It was hypothesised that the speech process word-final
/v/ deletion would proportionately increase in mothers’ speech at each time point. Table
5.5 shows the frequencies of word-final /v/ deletion out of possible contexts at each
average child age. The percentage of word-final /v/ deletion for each speaker is shown
in Figure 5.3. This process was not as frequent as word-initial /h/ deletion, so there were
fewer possible contexts and fewer tokens in which the process occurred16. Overall there
was a slight increase in word-final /v/ deletion between child ages 1;6 and 2;0, from
16
Searches were performed in Phon for word-final [v] contexts that precede a word-initial labio-dental
fricative [v, f] in the following word. This environment is extremely rare in the corpus, with one case at
child age 1;6 (Cathy), none at 2;0, and four at 2;6 (1 Cathy, 3 Kim).
CHAPTER 5 149
16.15% to 21.95%, and a decrease between ages 2;0 and 2;6 to 16.74%. This inverted v-
pattern resembles that for word-initial /h/ deletion and was partially consistent with the
hypothesis: word-final /v/ deletion changed in mothers’ speech over time but not in the
Table 5.5
Raw Frequencies of Word-Final /v/ Deletion Out of Possible Contexts by Each Speaker
Figure 5.3. Percentage word-final /v/ deletion out of word-final /v/ contexts by child
To test the hypothesis that word-final /v/ deletion is influenced by child age we
performed another binary logistic regression. Table 5.6 shows the results of the analysis
using child age and speaker as predictors of word-final /v/ deletion. The model fit to the
Results indicate no significant effect of child age, Wald χ2 (2) = 2.18, p = .336. The
hypothesis that word-final /v/ deletion would increase as children got older was not
supported. In this model the effect of speaker approached significance, Wald χ2 (3) =
7.75, p = .052. The speaker with the highest total /v/ deletion was the reference group
(Kim, total /v/ deletion 23.70%) and results show that the odds of deletion were
Table 5.6
Logistic Regression for Word-Final /v/ Deletion: Child Age and Speaker
Lower Upper
Child Age
Speaker
The reference group for Child Age is 2;0 (repeated contrast) and for Speaker is Kim.
There was no significant interaction effect, Wald χ2 (6) = 7.48, p = .279. Figure
5.3 suggests, however, that the pattern of /v/ deletion over time depends on speaker.
Kellie, Cathy and Erin display an inverted v-pattern as they are less likely to delete at
child age 2;0 than at 1;6 or 2;6, while Kim displays the opposite pattern as she is more
likely to delete at 2;0 than 1;6 or 2;6. The interspeaker patterns for word-final /v/
CHAPTER 5 152
deletion over time resemble those found for word-initial /h/ deletion and show that
mothers’ speech would increase over time to reflect a gradual change from child-
directed speech to interadult speech. The hypothesis was partially supported: word-
initial /h/ and word-final /v/ deletion were more likely to occur in mothers’ speech when
children were 2;0 than 1;6, but were unexpectedly less likely at 2;6 than 2;0. The
change over time was significant for /h/ deletion but not for /v/ deletion, likely because
word-final /v/ contexts were rarer in the corpus. Results also show interspeaker
developmental paths of speech rate in child-directed speech. In this study the presence
of siblings may have contributed to interspeaker variation, as the mother who did not
have the inverted-v pattern of deletion also had two older children who were present in
many of the sessions, while with the other three mothers the target child was generally
the only child in the recordings. The inverted-v effect was also stronger in one of the
speakers, Kellie, than in the other two speakers who displayed the pattern. Future
interspeaker variation: a possible contributing factor is the gender of the child, as Erin
and Cathy, who were mothers of girls, displayed similar patterns of a relatively shallow
inverted-v while Kim and Kellie were mothers of boys and displayed greater variation.
It was found that the two mothers with less formal education, Kim and Kellie,
displayed the highest rates of /h/ deletion at all timepoints. We cannot draw any
conclusions here due to the small number of speakers, but it would be interesting to
light of previous studies that have found consonant deletion in clusters is more likely in
mothers’ speech to children aged 3;11 than 2;11 (Smith et al., 2009), and that non-
standard variants become more frequent as children age from 2;0 to 4;0 (Foulkes et al.,
2005). This could be due to the different indexical natures of the speech variables
investigated in these studies, i.e. cluster reduction in the Smith et al. (2009) study, [t]
variants in Foulkes et al. (2005), and /h/ and /v/ deletion in the current study. A possible
interpretation of our findings in Analysis 1 is that /h/ and /v/ deletion are non-citation
speech processes in Katherine English, and mothers increase their use of these processes
between child ages 1;6 and 2;0 and decrease them as children age from 2;0 to 2;6,
change over time in the distribution of lexical items in which deletion occurred, and/or
in the distribution of function words, which are more susceptible to reduction processes
(Cutler, 1993) than content words. It raises the question, are mothers changing their
changing over time, resulting in an apparent effect of change in deletion as children get
question of how widespread /h/ and /v/ deletion are across lexical items and word class.
final /v/ deletion in all the lexical items in which it occurred in the corpus.
CHAPTER 5 154
Deletion in lexical items. Table 5.7 shows the proportion of word-initial /h/ and
word-final /v/ deletion across ages for each word that displayed the process. Words are
column in Table 5.7). Word-initial /h/ deletion occurred across a range of words at all
timepoints, while word-final /v/ deletion only occurred in of, have and give. The /h/
items are separated by word class to examine the distributions of deletion in function
and content words. At all child ages deletion was more frequent in function than content
words (30.18% and 5.91% total deletion respectively). The inverted-v pattern of
deletion over time is apparent in the function words for /h/ deletion. In content words /h/
deletion decreased slightly between timepoints, and /v/ deletion only occurred in three
lexical items. Note that in both /h/ deletion in content words and /v/ deletion the N for
deleted consonants in most lexical items is very low. In particular /h/ deletion was
frequent in pronouns, the adverb here, and the verb have (as a main verb and an
auxiliary verb). The words displaying the highest frequencies of deletion are relatively
frequent overall at each time point, indicating that changes in rates of deletion are not
due to the mothers introducing or removing words from their speech as children get
older.
Within Table 5.7 it is clear that there are approximately seven words that are
high frequency and involve considerable word-initial /h/ deletion and that the inverted-
pattern of deletion over time is mostly driven by function words rather than content
words. The seven words are the same words at each child age and account for 94.53%
of /h/ deletion at age 1;6, 94.09% at age 2;0, and 94.57% at age 2;6. It is therefore of
interest to examine how /h/ deletion changes over time for these words. Figure 5.4
shows the percentage of /h/ deletion over time for each word. Five of the seven words
CHAPTER 5 155
display an increase in deletion between 1;6 and 2;0 and a decrease between 2;0 and 2;6
and are therefore contributing to the inverted v-pattern of deletion found in Analysis 1.
The two words that do not follow this pattern are him and have. Deletion in him is the
highest of all words, between 80.00% and 83.17%, and the lack of change may be due
The three words containing word-final /v/ deletion are of, have and give.
Deletion in of is relatively stable over time, between 30.00% and 31.45% while /v/
deletion in have displays the inverted v-pattern. Deletion in give displays the opposite
pattern, although note that deletion in give has a very low number of occurrences at
each time point and must be interpreted with caution. The inverted-v pattern of word-
final /v/ deletion therefore appears to be driven primarily by the lexical item have.
Figure 5.4. Most frequent lexical items containing word-initial /h/ deletion at each
timepoint
CHAPTER 5 156
Table 5.7
Lexical Items Containing Word-Initial /h/ and Word-Final /v/ Deletion at Each Stage
1;6 2;0 2;6 TOTAL
n % n % n % n deleted/ %
deleted/ deletion deleted/ deletion deleted/ deletion contexts deletion
contexts contexts contexts
/h/ deletion: function words
he 52/178 29.21 73/144 50.69 55/113 48.67 180/435 41.38
him 84/101 83.17 55/65 84.62 41/51 80.39 180/217 82.95
have 25/145 17.24 37/158 23.42 53/208 25.48 115/511 22.50
he’s 23/121 19.01 67/154 43.51 22/78 28.21 112/353 31.73
here 21/260 8.08 52/259 20.08 30/250 12.00 103/769 13.39
his 28/44 63.64 39/52 75.00 35/71 49.30 102/167 61.08
her 15/35 42.86 49/92 53.26 8/36 22.22 72/163 44.17
has 0/9 0.00 5/21 23.81 4/8 50.00 9/38 23.68
haven’t 2/11 18.18 4/10 40.00 2/10 20.00 8/31 25.81
himself 3/4 75.00 3/3 100.00 0/4 0.00 6/11 54.55
he’ll 2/6 33.33 1/7 14.29 0/1 0.00 3/14 21.43
how 0/46 0.00 2/67 2.99 1/81 1.23 3/194 1.55
had 1/12 8.33 0/13 0.00 1/24 4.17 2/49 4.08
hasn’t 0/2 0.00 2/4 50.00 0/1 0.00 2/7 28.57
herself - - 2/3 66.67 - - 2/3 66.67
who - - - - 1/20 5.00 1/20 5.00
Total 256/974 26.28 391/1052 37.17 253/956 26.46 900/2982 30.18
/h/ deletion: content words
hang 1/30 3.33 1/32 3.13 1/33 3.03 3/95 3.16
head 1/14 7.14 0/21 0.00 2/17 11.76 3/52 5.77
hop 2/37 5.41 1/4 25.00 0/20 0.00 3/61 4.92
Heather 2/3 66.67 0/11 0.00 0/7 0.00 2/21 9.52
hit 0/5 0.00 2/5 40.00 - - 2/10 20.00
hold - - - - 2/15 13.33 2/15 13.33
Total 6/89 6.74 4/73 5.48 5/92 5.43 15/254 5.91
/v/ deletion
of 35/112 31.25 39/124 31.45 66/220 30.00 140/456 30.70
have 8/145 5.52 31/158 19.62 7/208 3.37 46/511 9.00
give 4/34 11.76 2/46 4.35 2/20 10.00 8/100 8.00
Total 47/291 16.15 72/328 21.95 75/448 16.74 194/1067 18.18
CHAPTER 5 157
Gender differences in lexical items. Gender differences in the use of male and
female pronouns were also analysed. Table 5.8 shows the number of tokens and
percentage of deletion in male and female pronouns used by caregivers of boys (N=2)
and of girls (N=2). Relevant male pronouns are much more frequent overall than female
pronouns. Male pronouns in mothers’ speech to boys had a higher rate of deletion than
in speech to girls (52.30% and 40.06% deletion respectively). However, for male
pronouns, the inverted-v pattern of /h/ deletion occurred in speech to both boys and
girls. Female pronouns were more frequent in speech to girls but had a higher rate of
deletion in speech to boys (58.49%) than to girls (38.05%). The inverted-v pattern of /h/
deletion occurred in female pronouns in speech to girls, while in speech to boys the
opposite pattern occurred in female pronouns (though note the small number of tokens
Table 5.8
/h/ Deletion in Male and Female Pronouns by Mothers of Boys (N=2) and Girls (N=2)
/v/ deletion as children aged from 1;6 to 2;0 to 2;6, and Analysis 2 was performed to
examine /h/ and /v/ deletion in lexically specific contexts. Results showed changes in
deletion occurred across a specific set of function words. The words containing the
highest frequencies of deletion were function words and were the same across each
child age. These words accounted for most of the deletion (94%) at each timepoint. This
finding suggests overall changes in deletion were not due to mothers introducing new
conversational speech function words tend to be unstressed and contain reduced vowel
forms compared with content words (Cutler, 1993), so it is unsurprising that consonants
are also reduced, or in this case deleted, in function words. Function words are also high
frequency in child-directed speech relative to content words (Shi, Werker & Cutler,
2006), and the finding that phonetic variation is high in these words has implications for
proportionately more /h/ deletion in mothers’ speech to boys than to girls in both male
and female pronouns. The inverted-v pattern of deletion over time was found in male
pronouns in all speakers, but in female pronouns it occurred in speech to girls but not
boys. With only four speakers these findings cannot be generalised, but they do appear
to support the Tyneside English study (Foulkes et al., 2005) which found that caregiver
speech to boys contained more vernacular variants of [t] than speech to girls, which
contained more standard variants. Foulkes et al. (2005) suggested that these gender
gender identities, as in adult speech males tend to use more non-standard phonological
CHAPTER 5 159
variants than females. Horvath’s (1985) study on adult Australian English speakers
from Sydney did find that /h/ deletion generally occurred more in male than female
speech so the gender identity explanation could apply to the current findings, although
A second possible explanation for the finding of change in deletion over time is
that it is a function of speech rate, and the rate of deletion changes because mothers
modify their rate of speech as children age. If this were the case we could expect to find
an association between more deletion and faster speech, and that mothers’ speech rate
across child age follows the same pattern as deletion, i.e. an increase between ages 1;6
utterance, following the method used and validated by Ko (2012). Transcripts were
segmented by utterance in Phon, where utterances were units of speech with pause
boundaries at each end, and speech rate was calculated from the segmentation time
laughter) were omitted and utterances were excluded when the duration was longer than
10 seconds (N=12), as these were likely to contain pauses that had not been segmented,
and where they contained unintelligible speech and phonological fragments (N=396).
This left 5,856 utterances at age 1;6, 5,613 at 2;0 and 6,685 at 2;6.
speech rate at child age 2;6 was significantly higher than 1;6 (M=0.20, 95% CI [0.15,
0.25]) p<.001), and 2;0 (M=0.18, 95% CI [0.13, 0.23]), p<.001. There was no
significant difference between 1;6 and 2;0 (M=0.02, 95% CI [-0.03, 0.07]), p=.557.
CHAPTER 5 160
Thus, mothers’ speech rate did not significantly change as children aged from 1;6 to
A more detailed analysis of speech rate in syllables per second was performed
on a subset of 682 utterances in which word-initial /h/ and word-final /v/ deletion
occurred. These are referred to here as deletion utterances (deletion occurred) and non-
deletion utterances (containing word-initial /h/ or word-final /v/ contexts but with no
deletion occurring). Overall there was little difference in mean speech rates between
utterances containing deletion and no deletion, M = 3.26 and 3.06 syllables per second
respectively. An independent samples t-test confirmed that the difference was not
timepoint (child age 1;6 M = 2.88, child age 2;0 M = 3.11 child age 2;6 M = 3.43). A
one-way ANOVA showed a significant difference between child age for adults’ speech
rate, F(2, 680) = 9.58, p<.001, adjusted R2 = 0.025, indicating child age contributed to
only 2.5% of the variance in speech rate. A REGWQ post-hoc test indicated that speech
rate was significantly faster at child age 2;6 than at 1;6 (p<.001) and 2;0 (p = .031). The
difference between 1;6 and 2;0 was not significant (p = .156). Thus the results from the
more fine-grained analysis of the subset were the same as those found in the analysis of
speech rate in the whole sample: mothers’ speech rate did not significantly change as
children aged from 1;6 to 2;0, and got slightly faster between 2;0 and 2;6.
The present study is the first to examine phonetic variation in maternal speech in
analysed in mothers’ speech when their children were aged 1;6, 2;0 and 2;6. Based on
the few studies that have investigated phonological variation in input to children after
CHAPTER 5 161
the first year of life, it was expected that variants containing consonant deletion would
become more frequent in mothers’ speech as their children got older. Overall findings
suggest that word-initial /h/ and word-final /v/ deletion are phonological processes in
Katherine English that occur across a set of lexical items. Mothers increase deletion as
children age from 1;6 to 2;0 and decrease deletion as children age from 2;0 to 2;6, and
these changes do not appear to be driven by speech rate. Results also showed
interspeaker variation: the inverted-v effect over time appeared in three of the four
speakers and was stronger in one of the speakers who displayed the pattern relative to
the other two. The effect is therefore a preliminary finding and will need to be
The increase in the proportion of /h/ and /v/ deletion between child ages 1;6 and
2;0 is consistent with literature suggesting mothers use more non-citation speech
processes and correspondingly fewer standard phonetic variants as children get older.
There was, however, an unexpected decrease in both /h/ and /v/ deletion in mothers’
speech between 2;0 and 2;6, although the results for /v/ deletion were not statistically
speech input to children exposes them to socioindexical variation, and change over time
is due to mothers’ gradually shifting their speech style to become more like interadult
speech (Foulkes et al., 2005). While this may explain the current finding of an increase
in deletion between 1;6 and 2;0, the question remains of why did deletion
proportionately decrease in maternal speech between 2;0 and 2;6? The current findings
suggest that mothers change their speech style as children get older, using more non-
citation speech processes between 1;6 and 2;0 and then shifting back to using fewer
however, it was important to consider whether changes in deletion over time were due
When we examined the distribution of /h/ and /v/ deletion across lexical items
we found that deletion occurred in the same set of words at each timepoint, and that
most of these words displayed the inverted-v pattern of deletion over time. The results
provide evidence that /h/ and /v/ deletion are phonological processes occurring in a set
of words, or one particular item (have) in the case of /v/ deletion, and not simply due to
changing distributions of words with word-initial /h/ and word-final /v/ contexts.
Interestingly this analysis also showed that deletion was most frequent in
function words rather than content words, and that the significant changes in /h/ deletion
over child age were due to changes in function words. This is consistent with the
prosodic level function words are frequently unstressed and at the segmental level they
are likely to be realised with weak vowels, usually schwa (Cutler, 1993). The current
study suggests that in addition to vowel reduction, function words contain high
variation in consonants, relative to content words. Function words also have a high
token frequency relative to content words in English, in both interadult and child-
directed speech (see Shi et al. [2006] for calculations of type-token ratios in function
number of perceived tokens may result in a child having more distributional information
about the word. It is, however, not known whether the frequency of tokens in the input
frequency words in the input may in fact be less likely to be perceived and thus
CHAPTER 5 163
represented in memory because they are more phonetically reduced more often and
contain fewer phonological cues than low frequency words (Cutler, 1993; Monaghan et
al., 2005). Low frequency words might also have an advantage in novelty and saliency
longitudinally children’s perceptions and use of phonological cues in high and low
frequency items to investigate any relationship with the current finding of variation in
speech over time by analysing the mean speech rate of each speaker at each time point.
It is well-known that speech rate is related to segmental deletion, with deletion more
likely in faster speech (e.g. Koreman, 2006). It is therefore possible that the inverted-v
pattern of deletion found in the current study was a result of mothers using faster speech
at child age 2;0 than at 1;6 or 2;6. Speech rate may also play a subtler role as a
component of style. Both high speech rate and segmental deletion are features of
between production ease for the speaker and perception ease for the listener (Lindblom,
1990). If change in speech rate was accounting for the change in deletion found in the
current study then we expected speech rate would display a similar inverted-v pattern
over the three timepoints. Instead we found that overall speech rate did not significantly
differ between1;6 and 2;0 and increased between 2;0 and 2;6, although the latter effect
was small at less than half a syllable per second difference. Further, there was no
significant difference in speech rate between utterances containing /h/ and /v/ deletion
and utterances where /h/ and /v/ were fully realised. These findings indicate changes in
deletion over time were not due to changes in speech rate, as deletion and speech rate
17
We thank an anonymous reviewer for raising these issues for us.
CHAPTER 5 164
both displayed different patterns of change over the three timepoints. The lack of
difference in speech rate between utterances with and without /h/ and /v/ deletion
provided further evidence that in this sample deletion is not, or at least not primarily,
It appears that the change in deletion over time found in this study was not an
mothers’ speech rate. We thus return to the possibility that rates of /h/ and /v/ deletion in
maternal speech change with the age of the child. There are at least two plausible
explanations for this: 1) that mothers are modelling forms differentially as children age,
and 2) that deletion rates vary with types of speech acts (e.g. questions, recasts, and
repetitions, which could be frequent as children become more verbal) and social
interactions (e.g. play, teaching, behaviour management) and the distributions of these
shift with child age. Note that these are not mutually exclusive explanations and both
variation over time as children develop knowledge of the phonology of their home
language as well as across the different speech acts and social contexts in which
in regional Australian English, it is not clear whether /h/ and /v/ deletion contain
research on this is needed to draw any strong conclusions on the indexical information
indexical variables to those in the current study. They found a slow linear increase in
mothers’ use of non-standard variants over time as their speech gradually became more
like interadult speech (Foulkes et al., 2005; Smith et al., 2009). However, the mixed
CHAPTER 5 165
speech did suggest that the different findings may be due to the different ages of the
children in the studies (Cruttenden, 1994). Vowel clarification was found mostly in
1984a), while speech to older children was found to contain more consonant reduction
and unintelligible segments than speech to adults (Bard & Anderson, 1983; Shockey &
Bond, 1980). The non-linear change over time found in the present study is consistent
with Cruttenden’s (1994) argument that the apparent mixed findings of these early
ages. The current finding of an inverted-v pattern in deletion may indicate that mothers
start producing multi-word utterances (by around 2 years of age), and then revert to
clarification (by 2;6). Further research is needed to investigate whether the current
finding of the inverted-v pattern over time is specific to the phonological processes of
/h/ and /v/ deletion or whether it is also found in other phonological variables in speech
times then we might expect mothers to fine-tune differentially as their child’s language
develops. Mothers’ use of more standard clarified forms may be particularly important
memories, and use of deleted or reduced forms may be more important when children
are learning the alternate phonetic forms common in non-citation interadult speech and
the socioindexical associations. Full standard forms may then become important again
as children start producing multi-word utterances and mothers become aware of their
child’s pronunciation. This may explain the decrease in deletion after 2;0, as this is
CHAPTER 5 166
when most of the children start producing multi-word utterances and could indicate that
of speech acts and social interactions, and that there was less deletion in more formal
sociolinguistic markers and speech style, with standard phonetic variants more frequent
in formal contexts and the local vernacular variants more frequent in informal contexts
(Smith et al., 2007). It would be of interest to do further research with the current corpus
to investigate whether /h/ and /v/ deletion is related to speech style, interaction contexts
and sentence types, and whether the distributions of these change in mothers’ speech as
speech over time; further research is needed to understand why mothers make
differential segmental modifications as their children age. The current study is limited
by the relatively small number of participants, and studies with more speakers are
phonological variation. These findings open several avenues for further investigation.
For example future studies could examine the phonological modifications mothers make
when they are specifically modelling speech for children, such as through repetition and
explicit teaching which may involve hyperarticulation, and how this changes as children
also investigate the relationship between deletion and speech rate in maternal speech, as
the current finding that speech rate increases as deletion decreases between 2;0 and 2;6
timepoints to examine how speech rate and deletion change over time and whether
CHAPTER 5 167
Further timepoints and more speakers are also needed in order to understand
modifications in maternal speech over time, and again how these may relate to the
is needed to test different theoretical explanations for the pattern of adult speech found
variation at different ages would also be relevant to help elucidate how their
CHAPTER SIX
6.1 Abstract
perceptual biases. Native speakers bring implicit linguistic knowledge that can be useful
for understanding the sound system of the language, and for checking the reliability of
native speakers to the technical level required for IPA transcription. In this study we
investigated the use of Visual Analogue Scales (VAS) for eliciting perceptual
Tokens for this study were selected from words that potentially contained a fricative or
affricate across a range of places of articulation in initial, medial and final word
positions. Tokens were selected where the non-native transcribers had both agreed on
the target segment and disagreed and/or marked the target segment as ambiguous. 73
visual analogue scales were constructed in total for the target segments in each token.
Community research assistants (RAs) who were native speakers of Gurindji Kriol
transcriptions for each token. Results showed both agreements and discrepancies
between RAs and transcribers on different judgements in each word position. These
and suggest several hypotheses for further research on phonetic variation in Gurindji
Kriol.
CHAPTER 6 171
6.2 Introduction
Phonetic transcription is a useful tool clinically and in the field, however it does
have limitations, particularly when the transcribers are non-native speakers of the
language being analysed. Native speakers have implicit linguistic knowledge that is
beneficial to analyses and interpretation of phonetic variation in the language, but often
in field linguistics it is not practical to train speakers in IPA to the level required to
judgements of speech sounds, which may limit our understanding of phonetic variation
that is potentially continuous. The purpose of this study was to gain a deeper
Visual analogue scales have been used in previous studies to elicit continuous
perceptual judgements on speech segments for the purpose of examining fine phonetic
detail. For example this method has been used to investigate adults’ sensitivity to
within-category variation in stops and fricatives (Skorniakova & Ito, 2011) and stop
voicing contrasts (Kong & Edwards, 2011), as well as adult perceptions of covert
contrasts and phonetic accuracy in children’s speech (Julien, Munson & Edwards, 2012;
Munson, Edwards, Schellinger, Beckman & Meyer, 2010; Schellinger, 2008; Urberg-
Carlson, Munson & Kaiser, 2009). In the current study we explore the use of visual
spoken by Gurindji people in northern Australia. The purpose of eliciting native speaker
overarching aim of the project is to examine phonetic variation in the input that Gurindji
children are exposed to. One particular kind of variation we are investigating is in
fricatives, which contain variability in voicing, manner and place of articulation. The
data for the larger project came from an existing longitudinal corpus of conversational
Gurindji Kriol maternal speech (Meakins, 2011, pp. 45-47), to which we added phonetic
English speaking linguists because it was not practical to train native speakers of
Gurindji Kriol to the technical level required for IPA transcription. Consideration of
native speaker phonological judgements was an important part of our methodology and
theoretical understanding of variation in Gurindji Kriol, and this formed the basis of the
present study.
studies that have used visual analogue scales to measure perception of speech sounds.
including transcriber language background. We finish up this section with more detail
about the current applied context and the rationale for examining fricative variation in
Gurindji Kriol.
6.3 Background
listeners to indicate their perception of a speech stimulus on a horizontal line with two
different sounds at the endpoints. Differentiation of the endpoint sounds depends on the
CHAPTER 6 173
process being investigated; for example the endpoints can differ acoustically within a
phonological category to test listener sensitivity to within-category detail (e.g. Kong &
Edwards, 2011; Skorniakova & Ito, 2011), or they can represent different phonological
along a continuous scale. Urberg-Carlson et al. (2008) found that adult listener ratings
of children’s productions using visual analogue scales were generally well correlated
with acoustic measures, and Munson et al. (2010) found naïve listeners reliably
productions and substitutions by responding on the scales for /s/-/θ/ and /d/-/ɡ/ contrasts.
It is important to note that phonetic training and experience is still advantageous to the
than naïve listeners in their visual analogue scale judgements of children’s productions
voicing changes. Kong and Edwards (2011) used visual analogue scales and eye-
tracking to examine changes to stop consonant voicing, and found that adult listeners
voicing has also been analysed using the scales in a goodness-rating task, which found
listeners made intermediate judgements of stop and fricative VOT contrasts when
CHAPTER 6 174
stimuli were acoustically intermediate, though overall fricatives were perceived more
Visual analogue scales were used in the current study for two main reasons: first,
they allow for continuous judgements of speech sounds, which may provide further
insight into variation of fricatives in Gurindji Kriol; second, they are effective in
eliciting judgements of speech sounds from naïve listeners who do not necessarily have
high literacy or technical knowledge of IPA transcription. Using visual analogue scales
with native speakers may add another dimension of information to that gained through
events and involves the transcriber identifying speech segments and assigning each
perceived phone a category using IPA notation. It has long been recognised that there
are many different factors that can compromise validity and reliability in phonetic
transcription. Early studies in the field have shown that validity can be affected by the
influence perceptual expectation effects (Oller & Eilers, 1975). The effect is not
necessarily negative – Oller and Eilers (1975) found that transcription accuracy was
generally improved when the transcriber knew the ‘correct’ meaning of the utterances
and suggested that knowing the target form (i.e. what to expect perceptually) may help
although fine phonetic detail can be represented in IPA with the use of diacritics. If
Kriol, then transcribing using IPA categories could be somewhat artificial. The potential
turn leading to the question of whether native speakers categorise these segments more
categories are applied, both within and between transcribers. It is commonly measured
transcriptions of five teams of trained transcribers and described a heuristic of four main
or characteristics of the speaker. These include the amount and type of speech errors
made (Kearns & Simmons, 1988), intelligibility (Shriberg & Lof, 1991), and canonicity
of speech sounds (Ramsdell et al., 2007). 2) Analysis factors, such as the level of detail
in the transcript, specific agreement criteria and type of agreement (Cucchiarini, 1996;
Shriberg & Lof, 1991). 3) Linguistic contexts, including structural, grammatical and
stress forms (Shriberg & Lof, 1991), stylistic factors (Goddijn & Binnenpoorte, 2003),
and phonological units such as class, features, sounds and diacritics (Shriberg & Lof,
1991).
CHAPTER 6 176
Although there are a range of potentially interacting factors that can affect
reliability and validity, phonetic transcription remains a useful tool clinically and in the
person transcribes a portion of the recording and the percentage of segments agreed on
is calculated. Cucchiarini (1996) points out that this approach assumes that all
disagreements are equal, i.e. a disagreement based on one feature is treated the same as
are treated – whether they are excluded or discussed among transcribers to reach a
Beckman, 2008). In addition, high reliability does not indicate high validity. Even if
that has high intertranscriber reliability but that is not necessarily an accurate
al., 2004; Hardison, 2003; Strange & Jenkins, 1978). Native speaker transcriptions are
therefore considered valid, as phones are judged using the phonological categories of
the speech community (Edwards & Beckman, 2008). Where phonological contrasts of a
of speech segments and thus also phonetic transcription, for example linguistic context
(Hardison, 2003), sentence context (Warren & Warren, 1970), and listener expectations
(Oller & Eilers, 1975; Ralston & Johnson, 1990). The listener’s phonological categories,
determined by his or her language background, also influence these factors, so language
background potentially has both direct and indirect biases on phonetic transcription.
experience can affect phonetic transcription (Coussé et al., 2004), and that it takes
training and practice to become sensitive to non-native contrasts (Pollock & Hinton,
2001). For example, Pruitt et al. (2006) compared native American English speakers’
and native Japanese speakers’ perceptions of dental and retroflex stop consonants in
Hindi. These sounds do not phonemically contrast in either language; however Japanese
has a similar contrast between /d/ and flapped /r/, which is sometimes produced as a
retroflex. it was found that Japanese speakers identified the Hindi contrast more
accurately than American English speakers (Pruitt et al., 2006). These findings suggest
that distinguishing non-native contrasts may be easier when the categories are
vocalisations across eight transcribers who were native American English speakers.
Agreement was significantly higher (r=0.64) for sounds found in American English
(including allophones) than for sounds outside the phonetic range of American English
(r=0.44). Thus reliability was lower for transcriptions of non-native than native
Language background can also affect transcription through lexical expectation effects,
as knowing the target word causes the listener to perceive the phonetic features they
expect to hear, even in some contexts where those features are not acoustically present
it does have limitations and should be supplemented with other types of analysis where
possible. Visual analogue scales have been used in the child language literature to gain
covert contrasts. In the current study we tested whether visual analogue scales could be
used to record native speaker judgements of fricatives in Gurindji Kriol, and examined
how these data could further our analyses and interpretation of our phonetic
transcription data.
language, emerging in the 1970s from code-switching (McConvell & Meakins, 2005).
Gurindji Kriol speakers are fluent and literate in varying degrees in English, which is
the language of the local school and administration in the community. Many speakers
are also fluent in Kriol and other Aboriginal languages spoken in the region.
Gurindji Kriol is derived from Kriol, 35% from Gurindji, and the remaining 28.4% are
CHAPTER 6 179
synonymous forms from both languages (Meakins, 2011, p.19)18. There are both
traditional Australian languages, in that the consonant system contains many contrasting
places of articulation and relatively few contrasting manners (Butcher, 2006). For
example there are palatal and retroflex places of articulation for stops, but no contrast
between stops and fricatives. There also does not appear to be evidence of a voicing
contrast in Gurindji Kriol stops (Jones & Meakins, 2012a). Traditional Gurindji does
not contain phonemic fricatives, although stops may be realised as fricatives in some
contexts due to lenition processes. In Gurindji Kriol fricatives are highly variable in
Kriol-derived words, both within fricatives (voicing and place of articulation variation)
and with stops. In Kriol-derived words most fricatives are labio-dental and alveolar, and
final position (Buchan et al. in preparation). These observations are consistent with the
consonant inventory of Gurindji Kriol proposed in Jones and Meakins (2012a), adapted
18
This count is based on a 200 word Swadesh list.
CHAPTER 6 180
Table 6.1
Proposed Gurindji Kriol consonant inventory (Jones & Meakins, 2012b; Meakins, in
press)
Fricative f s ʃ
Nasal m n ɳ ɲ ŋ
Lateral l ɭ ʎ
Rhotic r ɹ ɻ
Glide j w
The phonetic transcribers of the data in this study are native Australian
English speakers with varying degrees of first-hand experience with Gurindji Kriol. In
the current study one transcriber (the first author) has visited the community several
times, and the second transcriber was not familiar with Gurindji Kriol before starting
community Research Assistants employed to work on the project are native Gurindji
Kriol speakers and are also fluent English speakers. The community RAs do not have
variable but relatively low at the phoneme level, that is, the ability to segment phones in
words. The specific focus for this study is on the transcription of fricatives in maternal
Gurindji Kriol.
CHAPTER 6 181
The aim of the current study was to investigate native speaker perceptions of
6.4 Method
maternal Gurindji Kriol made by Meakins (2003-2007) for the Aboriginal Child
Language Acquisition (ACLA-1) project (Meakins, 2011, pp. 45-47). For the current
project a subset of the Meakins corpus was selected. These were recordings of three
mothers, SS, AR and CE, at three time-points when their children were aged around 1;6,
and a Research Assistant who has formal training in phonetic transcription. Both
transcribers are native Australian English speakers. The first author has had direct
the phonetic transcription and analysis program Phon (Rose, Hedlund, Byrne, Wareham
& MacWhinney, 2007). Phon was used to search the orthography of all transcripts for
words that may be pronounced with a fricative in Gurindji Kriol, that is Kriol words that
have a fricative in the English cognate word. The orthographic transcription of Gurindji
CHAPTER 6 182
Kriol is not phonetic. The orthography was developed over a long period of time in
close consultation with community members. It uses English letters and does not have
one-to-one correspondence with Gurindji Kriol phonology. Thus the orthography was
searched by the authors, who had prior knowledge of Kriol words and their English
meanings, rather than searching for letters or letter combinations in the orthographic
transcription. All tokens of these words were transcribed phonetically by the first
author and the Research Assistant. These transcriptions were used to select the tokens
Token selection. A sample of transcribed tokens was selected for checking with
Gurindji Kriol native speakers. The sample comprised tokens that contained the target
alveolar, palato-alveolar) in each word position (initial, medial, final). Also included
were word-initial and word-medial affricates, and word-initial /h/. Two tokens in each
of these conditions were randomly selected for each of the three speakers: one where
the two initial non-native transcribers agreed on the phonetic transcription, and one
where there was a discrepancy. The tokens selected for checking were rated for
difficulty on a scale of 1 (easy) to 3 (hard). Some positional segments are rarer than
others and were not found for some speakers, for example word-final th. In total 73
transcription involved ambiguity between two sound categories, for example [v] and [b].
Visual Analogue Scales were set up so that native speakers could judge where the target
segment lies on a continuum between two sound categories. A scale was created for
each target segment of the 73 tokens with each one having two sound categories as
CHAPTER 6 183
anchors, one at each extreme of the scale. For the tokens where the two non-native
transcribers disagreed on the phonetic transcription of the target segment these anchors
were the IPA transcriptions of each on-native transcriber. For example, for a token of
the word binij 'that's it! <finish' one non-native transcriber phonetically transcribed the
word-final consonant as the palatal stop [c] while the other transcribed the same
segment as the palato-alveolar fricative [ʃ]. This case was a stop-fricative judgement
type where the anchors on the Visual Analogue Scale were [c] and [ʃ]. Where the non-
native transcribers agreed on the transcription, the alternative anchor was based on the
sound in the English cognate or the nearest sound to that transcribed, selected by one of
the non-native transcribers. Listeners also had the option of saying that the token
represented another sound (i.e. not one of the anchors). The judgement types created for
Table 6.2
Table 6.2 shows that the largest number of judgements were stop – fricative,
followed by fricative voicing judgements. Included in the ‘other’ category were place of
articulation judgements, sound pairings that differed by more than one feature (e.g. [ʃ] –
CHAPTER 6 184
[ɟ]), and the word-initial [h] judgements. With the latter the judgement was whether [h]
was present or not; in these scales the alternative anchor to [h] was the first vowel sound
in the corresponding token. The positioning of each type of sound on either the left or
the right of the scale was counter-balanced to avoid any right-side bias. For example in
the twelve affricate-stop judgements the affricate appeared six times on the left of the
scale and six times on the right. The specific judgements and frequencies for each
judgement type are shown in Table 6.3 for word-initial sounds, Table 6.4 for word-
medial and Table 6.5 for word-final. See Figures 6.2, 6.3 and 6.4 for the lexical items
Table 6.3
affricate-
stop-fricative voicing affricate-stop other
fricative
d–ð 2 z–s 2
ʃ–c 2
ʃ–ɟ 1
t–s 1
θ–c 1
θ–t 1
Total 11 7 6 2 7
CHAPTER 6 185
Table 6.4
ɟ–ð 2 ʃ–s 2
d–ð 1
ʃ–ɟ 2
v–b 5
Total 12 3 6 3
Table 6.5
stop-fricative voicing
judgement n judgement n
b–v 1 f–v 2
c–ʃ 5 z–s 5
ɟ–ʃ 1
d–ð 1
f–p 2
Total 10 7
CHAPTER 6 186
Scale order. First, tokens were grouped into three blocks by word position.
Tokens with the target segment in word-initial position appeared first, as word-initial
sounds were easier for RAs to isolate. The second block contained word-final target
segments, and the third word-medial. Within each block the scales were ordered so that
the same sound pairings were together. Within pairings they were then ordered by the
of each IPA target phone were downloaded from the UCLA phonetics website
(Ladefoged, last accessed 12/12/2011) and embedded into the PowerPoint slides as
anchors. Each of the 73 scales were on separate slides. Each sound was also associated
trained to isolate sounds in words by identifying each sound with a colour. Using
colours in combination with the audio files also helped the researcher avoid both
spelling the sounds and producing the sounds in her Australian English accent. Sounds
could instead be referred to by the associated colour, e.g. [b] was ‘the yellow sound’,
represent the potential continuum between sound categories. Each scale had a light 7-
scale is shown in Figure 6.1. The anchors on this scale were [b] (yellow) and [v] (red).
The audio files for these sounds were played by clicking on the speaker symbols.
CHAPTER 6 187
Figure 6.1. Example Visual Analogue Scale for [b] and [v]
and the community project liaison, and were paid as research assistants. Two of the
Research Assistants, Kirsty and Trisha, were young women (age 20 and 22 years
respectively) and the third, Rosemary, was an older woman who works as an interpreter
in the community. Rosemary had taken several interpreter training courses and has
Materials. A set of 108 picture cards measuring 3”x4” were used as stimuli. The
set contained commonly used words and most had Gurindji counterparts. English words
for the pictures contained CVC minimal pairs, words with consonant digraphs, words
with consonant clusters, words with vowel digraphs, and multisyllabic words. Other
CHAPTER 6 188
materials used were a set of small coloured blocks. There were about one hundred
blocks in total of five different colours that were used to represent segments in words.
Procedure. Training was conducted over two sessions, each three to four hours
syllable segmentation, sound isolation, and sound matching tasks. In Session 2 the
sound isolation tasks were extended using coloured blocks to represent different sounds
in words. For example bag was represented as red-green-yellow and bat as red-green-
blue. Using minimal CVC pairs this task helped RAs to separate sounds from spelling
and to attend to and discriminate different sounds. Difficulty progressed from English
CVC words to words with consonant clusters and vowel diagraphs, and Gurindji words.
These tasks were precursors to the Visual Analogue Scale task, where sounds from
separate trip following the one in which the phonological awareness training was
conducted. Three young Gurindji women were paid as RAs to do the task although they
did not undertake the training. Rosemary, the interpreter who participated in the training,
did not participate in the task. Thus in total five young women who were native
speakers of Gurindji Kriol participated in the visual analogue scale task. The women
who participated in the phonological awareness training assisted the women who did
not; although it is a limitation of the procedure that not everybody did the training.
Procedure. The PowerPoint slides were printed in colour and bound into
booklets, with one scale per page. The Research Assistants were given a booklet each
while the researcher had the PowerPoint version open on a laptop with external speakers.
Before each word position block, Research Assistants were reminded how to isolate
CHAPTER 6 189
sounds in that word position, as they were taught in the earlier phonological awareness
training. RAs who participated in the training helped the three who did not. For each
scale the audio files of the anchors were played. To parallel the IPA transcription
context, the utterance containing the corresponding token was then played after RAs
were told which word to listen for in the utterance. Anchors and utterances were played
as many times as needed. RAs were instructed to mark on the scale in their booklets
where they thought the target segment lay between the two sounds on the scale. They
were asked to tell the researcher if they thought the target segment was something
different to the two sounds in the scale, in which case the researcher noted the
alternative sound. RAs were also asked to say if they thought a scale was too hard or if
RAs did the task over several sessions in groups of two to five. Often there was
discussion among the group and some group decisions were made, however RAs also
Analysis. RA responses were measured from the 1-7 scale to two decimal places.
Box and whisker plots were made to display collated responses to each judgement type
in each word position. These were then overlaid with the scale anchors and the non-
native transcriber transcriptions to compare the RAs’ continuous responses on the scales
6.5 Results
Data were split by word position and analysed separately for initial, medial and
final positions. Based on discussions during the visual analogue scale judgment sessions
with the RAs, mid-point responses are interpreted to mean that RAs perceived the sound
as midway between the two anchors; ‘other’ and ‘don’t know’ responses were recorded
separately. Figures 6.2, 6.3 and 6.4 were constructed to show the RA responses and the
CHAPTER 6 190
box and whisker plots in the figures represent the pooled RA responses to each
judgement pair, and the dots represent the categorical judgement of each sound as
transcribed in IPA by the non-native listeners. Green dots show that both non-native
listeners agreed on the transcription, and purple dots represent non-native transcriber
6.5.1 Word-Initial
Word-initial judgements are shown in Figure 6.2. The first eight judgements
along the x-axis are the stop-fricative judgements, and the next three are voicing,
followed by four stop-affricate judgements, and then [h] present or absent and one
fricative place judgement. For the first two pairings, [b, v] and [c, s], there was close
agreement among RAs who all judged the sounds as stops, and disagreement between
RA judgements and the non-native transcriptions. The purple dots indicate there was
also disagreement between the two non-native transcribers on both of the fricative
transcriptions, [v] and [s] (baldan ‘fall’ and jeya ‘there’, respectively). For [d, ð] RA
judgements agreed with those of the non-native transcribers. For [c, ʃ] RAs tended to
agree with non-native transcribers on the palatal stop although their responses were
continuous, indicating that RAs did not judge the sound as being entirely like the stop
endpoint. RAs also gave continuous responses for the [ʃ, ɟ] sound. RAs responded
closer to the stop than the fricative as they also did for the initial sound in jouim; non-
The next judgement involved the [st, t] judgement. There was non-native
transcriber disagreement about whether the initial sound was [st] in the word top ‘be,
<stop’. RAs judged it as a stop with no fricative. For [θ, c] there was agreement
between RAs and non-native transcribers in favour of the stop. For [θ, t] in ting
CHAPTER 6 191
‘whatchamacallit, <thing’, again there was high agreement among RAs that it was a
The next three word-initial judgements involve voicing distinctions. For the [f,
v] pairing RAs gave both categorical (indicated by the whiskers extending to the
endpoints) and continuous responses. In judging dental fricative voicing, [θ, ð], non-
native transcribers disagreed and RAs all gave mid-point responses. For [s, z] non-
native transcribers both judged the fricatives as voiceless, while RAs judged them as
voiced.
RAs perceived stops for the voiced stop-affricate judgements [ɟ, ʤ] with one
midpoint response, disagreeing with non-native transcribers who perceived the English-
based affricate. For the voiceless stop-affricate judgement [c, ʧ] in jinek ‘snake’, RAs
judged the sound as the palatal stop and there was disagreement between non-native
transcribers. There was general agreement between RAs and transcribers on the
fricative-affricate judgements [ʧ, s] and [ʃ, ʤ] that these sounds were affricates.
perceived [h] as present in three tokens and absent in the other three tokens. With one
endpoint as [hV] and one as the following vowel in the word with no [h], RAs all
responded strongly towards the [h] endpoint. The final word-initial judgement was [θ, s]
insight on the transcriptions that were ambiguous and/or where non-native transcribers
disagreed. Where these sounds involved stop-fricative distinctions, the native speaking
RAs generally judged them as stops. For two of the three voicing pairings RAs tended
to give continuous responses. Judgements involving palatal stops and affricates were
CHAPTER 6 192
Initial [s] was always judged as being present by the RAs while non-native transcribers
English code-switches, [z, s] circle and [θ, s] three, in which RAs disagreed with the
non-native transcribers. Both non-native transcribers agreed with each other on these
sounds.
6.5.2 Word-Medial
sounds arranged by judgement type. The first five pairings along the horizontal axis are
the last two involve place differences in fricatives. Again the green dots indicate the
sound perceived by non-native transcribers when they were in agreement with each
other, and the purple dots where there was ambiguity or transcribers disagreed with
each other.
whether the sound was a palatal stop [ɟ] or palato-alveolar fricative [ʒ]. RAs judged the
sound as a palatal stop. The next pairing shows agreement that the sound was a stop
between RAs and both non-native transcribers on the [ɟ, ð] judgement. For the [d, ð]
pairing, non-native transcribers perceived one sound as fricative and the other as stop,
with transcribers disagreeing on the stop in the first consonant in ajasaid ‘other side’.
RAs judged these sounds as the fricative [ð] and between the fricative and the mid-point
of the scale. For the [ʃ, ɟ] pairing the average RA response was around the mid-point,
with some responses at either end, agreeing with non-native transcribers who also
judged these sounds in both categories. The [v, b] responses show that all four of the
sounds were perceived by non-native transcribers as the stop [b], but non-native
CHAPTER 6 193
transcribers disagreed on two of them, gibit ‘give’ and libim ‘leave’. RAs judged them
between the stop and the fricative with the mean response at the mid-point.
The voicing pairing [z, s] shows that non-native transcribers agreed that one
sound was voiced and the other was voiceless. RAs judged both sounds on the
continuous scale between the endpoints. For the affricate-palatal stop pairing, non-
native transcribers perceived three as [ɟ] and three as [ʤ], with some disagreement.
RAs judged most of these as closer to the palatal stop and one as the affricate, indicated
by the long whisker extending out to [ʤ]. The voiceless place judgement [ʃ, s] shows
that non-native transcribers perceived the sound as [ʃ] and had some disagreement. RAs
judged it between both sounds but closer to [s] than [ʃ]. The results for [z, ɟ] indicate
that non-native transcribers perceived an ambiguous palatal stop for the second
Overall word-medially RAs generally agreed with the judgements of the non-
native transcribers. The main differences between RAs and non-native transcribers were
in the stop-fricative judgements [d, ð] and [b, v]. These sounds were transcribed by non-
native transcribers as stops, though for half of (1 out of 2 [d, ð] and 2 out 4 [b, v]) there
were disagreements among non-native transcribers between the stop and fricative. On
the continuous scale RAs judged some of these sounds toward the fricative. There was
also a difference between the non-native transcribers’ judgements and RAs’ judgements
for the place pairing [ʃ, s]. Non-native transcribers perceived the sound as [ʃ], but RAs
responded at the [s] endpoint and on the continuum between the two points.
6.5.3 Word-Final
The results for word-final sounds are shown in Figure 6.4. The first four pairings
involve stop-fricative judgements and the last two voicing. Sounds in the first
listeners and RAs generally perceived a stop. The sounds in the [c, ʃ] judgements for the
final consonants in binij 'that's it!, <finish' were perceived as fricative by non-native
transcribers with some inter-transcriber disagreement. RAs judged these between the
fricative [ʃ] and the mid-point. For the [ɟ, ʃ] judgement RAs perceived a fricative, as the
non-native transcribers did. For the [f, p] judgements RAs perceived stops, as the non-
The next pairings involved voicing judgements. For sounds in [f, v] judgements
non-native transcribers perceived [v], and on the scale RAs made judgements between
the voiced and voiceless endpoints with a mean around the midpoint. For the [z, s]
pairing RAs judged the sounds between the two endpoints but generally closer to the
voiceless fricative [s]. The whisker extending out to [z] corresponds with the sound that
Overall for the word-final sounds there was generally agreement between non-
native transcribers and RAs. The main discrepancy was for [f, v], which involved an
English code-switch five and a borrowing giraffe. While the final sound in these words
was perceived by non-native transcribers as the voiced fricative [v], there was inter-
transcriber disagreement for one token, five, which was an English code-switch, and
6.6 Discussion
In this study native speakers of Gurindji Kriol used visual analogue scales to
make judgements of Gurindji Kriol sounds that had been transcribed phonetically as
Kriol were made by native Australian English speakers, one of them with direct
native speakers and nonnative transcribers on Gurindji Kriol segments, native speaker
judgements were then compared with the transcriber judgements in initial, medial and
final word positions. The transcribers are referred to here as ‘non-native transcribers’, as
The aim was twofold: 1) to investigate how visual analogue scales can be used
types of judgements and word positions. We found that the methodology used here was
successful at eliciting judgements of sounds from native speakers who had no technical
IPA training, with some limitations that could be addressed in future studies. From the
native speaker responses we gained further insight into transcription ambiguities. Word-
by native speakers, and ambiguities between palatal stops and palato-alveolar affricates
ambiguities were often judged toward fricatives by native speakers when transcribers
had judged them as stops. In all word positions voicing judgements were often judged
along the continuous scale by native speakers. Results also indicated agreement
CHAPTER 6 199
between native speakers and non-native transcribers for many judgements, particularly
Visual analogue scales have been used previously to measure adult perceptions
of covert contrast, accuracy, and ambiguous sounds in child speech (Julien & Munson,
2012; Munson et al., 2010; Schellinger, 2008; Uberg-Carlson et al., 2009), judgements
Skorniakova & Ito, 2011). The previous research suggested visual analogue scales
2012; Munson, Schellinger, Urberg-Carlson, 2012). The current study indicates visual
analogue scales can additionally be used in an adult L2 context for the purpose of
checking phonetic transcription of non-native transcribers (i.e. not native speakers of the
language being transcribed) with judgements of native speakers who have not been
trained in the technical aspects of transcription. The visual analogue scales enabled us to
elicit and analyse the implicit linguistic knowledge of native speakers, which provided
non-native transcribers.
technical level required for IPA transcription. Further, in a language like Gurindji Kriol
the sound system is highly variable and the sound categories of native speakers are
unknown. Using continuous scales for sound judgements therefore provided additional
The two major advantages of the current design were its ease of use with the
native speaker RAs and that the visual analogue scales allowed for both continuous and
categorical judgements. That is, RAs could respond along the scale for a continuous
judgement or at the endpoints for categorical judgements. There were also some
limitations, however. In particular the endpoints on the scales and the time demands
required by this design should be carefully considered for future research. Here we will
discuss the study results and specific information that native speaker scale responses
provided about phonetic transcription and fricative variation in Gurindji Kriol. We then
consider the limitations of the current design, particularly the time demands in relation
to the extra information that was gained from carrying out this study.
RAs were native speakers of Gurindji Kriol and were also bilingual in English.
Phonetic transcribers were native Australian English speakers who were trained in
phonetic transcription and had varying levels of experience with Gurindji Kriol – one
transcriber was not familiar with Gurindji Kriol prior to this study, the other (the first
author) had some experience listening to the language gained throughout the research.
Language background may have affected the results differentially for RAs and
transcribers, and different levels of familiarity with Gurindji Kriol could explain some
categories (Coussé et al., 2004; Hardison, 2003; Strange & Jenkins, 1978). Effects
would be seen in native speaker and transcriber judgements of sounds that occur in one
language and not in the other. The sounds in the current study that occur in Gurindji
Kriol and are not present in Australian English are the voiced and voiceless palatal stops,
CHAPTER 6 201
[ɟ ,c]. These sounds occur in traditional Gurindji words, and in Gurindji Kriol are also
pronounced in some Kriol-derived words in variation with fricatives, e.g. the initial
sound in jinek ‘snake’ is sometimes pronounced with a palatal stop [ɟ] and sometimes a
voiceless alveolar fricative [s]. When palatal stops are pronounced with frication they
sound very similar to the palato-alveolar affricates [ʧ, ʤ], and there were several inter-
The results show that in initial and medial positions there were differences
between RAs and transcribers in judgements between palatal stops and affricates (there
there were six sounds transcribed as the affricates [ʧ] or [ʤ], and the mean RA
response was a categorical palatal stop [c] or [ɟ]. There was one mid-point RA response
for a [ɟ, ʤ] judgement suggesting the Gurindji Kriol sound was perceived as in-
between the two endpoints, and this was in a token of jeya ‘there’ that transcribers had
disagreed on. Word-medially there were three [ʤ, ɟ] sounds transcribed as the palatal
stop and three as the affricate. Again the mean RA response was closer to the palatal
stop endpoint, although more of the scale was used than for the word-initial judgements
of this type. Overall these findings suggest it may be difficult to distinguish between
palatal stops and palato-alveolar affricates and that listeners tend to judge them in terms
of their native language phonology. Future research could involve acoustic analyses of
Some of the native speaker judgements were unexpected because they differed
from non-native transcriber judgements, with the RA responses tending toward the
phonology of English, in which they are bilingual. For example initial and medial
interdental fricatives [θ, ð] are present in English but do not occur in Gurindji and are
extremely rare in Gurindji Kriol. RAs judged an initial sound as a voiceless dental
CHAPTER 6 202
fricative [θ] rather than the alveolar stop [t], and two medial sounds toward the voiced
dental fricative [ð] rather than the alveolar stop [d]. In these cases language background
may have had indirect effects, as RAs and transcribers may have been differentially
phonological awareness training and visual analogue scale procedure were designed so
that RAs would not rely on either English or Gurindji Kriol spelling to complete the
task. However, the words involved in these judgements (e.g. smuth-wan ‘smooth’) are
phonologically similar to the corresponding words in English, in which the RAs are
bilingual. Their knowledge of English spelling may have interfered with perceptual
The word-initial [h] judgements also appear to have been affected by language
background. Non-native transcribers had judged half the [h] words as being pronounced
with an initial [h] and the other half as having no pronounced [h], while native speakers
judged all the words as being pronounced with an initial [h]. Word-initial [h] appears to
occur variably in Gurindji Kriol and may be a form of hypercorrection when it occurs in
vowel-initial Kriol-derived words, e.g. [hæpʊl] apple. The current results may be a
reflection of the hypercorrection of initial [h] in production. It is also possible that [h] is
a covert contrast in Gurindji Kriol, that is, a phonetic contrast that is only apparent at a
subphonemic level. If there were a covert contrast with [h] in Gurindji Kriol this would
mean it may be more readily perceived by native speakers than non-native speakers.
perceptual studies would be useful to examine the variable occurrence of [h] in Gurindji
Kriol.
Non-native speaker transcribers and native speaker RAs may have both been
influenced by lexical expectation effects. For example for the initial sound in the
CHAPTER 6 203
English code-switch word circle transcribers agreed on the voiceless alveolar fricative
[s], as that is the expected pronunciation in English. RAs however judged this sound
toward the voiced alveolar fricative endpoint [z]. On the other hand there were words
like giraffe, another English code-switch, in which transcribers judged the final
consonant as the voiced fricative [v] and RAs judged the voiceless fricative [f], which is
the expected sound from the spelling and usual English pronunciation. To elucidate
these effects further research and acoustic analyses of controlled recordings would be
beneficial; however these results indicate a starting point for the types of perceptual
disagreements. While both transcribers were trained in phonetic transcription, one had
no prior experience with Gurindji Kriol and the other had some familiarity with the
language and had learned some of it directly from native speakers. As Oller and Eilers
(1975) found, language background effects are not necessarily negative and familiarity
with the language being transcribed can improve the accuracy of transcription. Knowing
the ‘target’ meaning of words and utterances can help direct transcriber attention toward
the relevant phonetic detail (Oller & Eilers, 1975). Half the tokens in the current study
were chosen for the visual analogue scale judgements because there was transcriber
the current data are higher than in the full dataset. At least some of these disagreements
are likely due to familiarity with Gurindji Kriol, which changes even throughout the
transcribers may explain some of the judgement differences in the current data.
Disagreements between the two transcribers may be due to their different levels of
CHAPTER 6 204
are shown in the different judgements of native and non-native speakers between palatal
stops and palato-alveolar affricates. Interference from English spelling and lexical
expectation effects may account for some differences between RA and transcriber
judgements where RAs unexpectedly judged sounds as ‘toward’ endpoints that were
differ overall from responses to medial and final sounds. In general, RA judgements of
initial sounds showed more inter-RA agreement but also more discrepancies between
RAs and transcribers than medial and final word positions. In the phonological
awareness training, RAs found the initial sounds easier to isolate and identify than the
medial and final sounds, which is consistent with previous research as in early literacy
development, initial sounds are easier to segment and reflect on (Gillon, 2004).
Additionally, the consonants in the endpoints were in onset positions (CV), which may
have made for an easier comparison with the word-initial judgements as these were also
For word-medial sounds, the mean RA responses were often along the scale (i.e.
not categorical judgements at the endpoints) for stop-fricative judgements such as [d, ð],
[ʃ, ɟ] and [v, b]. This may be due to lenition occurring in medial position, particularly as
relative to other contexts (Kirchner, 2004). Future research using acoustic analyses and
more controlled perceptual testing would be useful to determine whether these sounds,
CHAPTER 6 205
or some of them, are produced along a continuum, that is stops may be produced with
varying amounts of frication until they are indistinguishable from fricatives. Further
studies using acoustic analyses would have to involve controlled recordings of speech
formal speech. The current data was from naturalistic conversational recordings, thus
transcription and other perceptual judgements are more reliable than acoustic analyses
due to background noise, overlapping speech, low recording levels and clipping.
Overall, the current study using the continuous visual analogue scales with
native speakers of Gurindji Kriol helps to explain some of the ambiguities and inter-
transcriber discrepancies in the phonetic transcription that are likely due to differences
in language background. Some sounds, such as fricative voicing and word-medial stop-
fricative continuancy, may vary along a continuum while others such as the presence of
Kriol. Further research involving acoustic analyses of controlled recordings that also
control for phonetic environments and perceptual experiments are needed to investigate
It is also important to note that the RA judgements, and thus our interpretations
of them, are restricted by the endpoints provided in the visual analogue scales. A major
advantage of the scales is that they allowed for continuous judgements; however, this
was a trade-off in that there were only two categorical endpoints and these were decided
by a transcriber. RAs did have the option of responding with ‘other’ if they thought the
Gurindji Kriol sound was something different to the endpoints provided, and sometimes
they did use that response. The endpoints may still have biased RAs toward making an
assumption that the Gurindji Kriol sound ‘belonged’ to one of the two endpoints. The
endpoints may have been problematic in other respects as they required certain
CHAPTER 6 206
decisions to be made in advance by the researchers. These are discussed in further detail
6.6.3 Limitations
There are two main limitations to the current design and, by way of
recommendation for future research, it is important to make decisions about these early
on. One relates to the endpoints used in the scales – what to use as endpoints, how
listeners may represent them in relation to the sounds being rated, and how they may
affect the results. The second important consideration is the limited number of RAs in
the current study and that not all of them undertook the phonological awareness training.
We then consider whether the quality of information gained from the visual analogue
transcription.
The endpoints used in the current study were IPA segments in CV contexts from
the UCLA phonetics website and were spoken by an American male speaker trained in
phonetics. In the scale tasks the native speakers heard the consonants of interest in CV
endpoints, and the Gurindji Kriol sound in the context of the whole utterance. This
allowed for comparison with the transcriber IPA judgements, as transcribers also judged
the sounds in the context of the whole word and utterance. Native speakers and
transcribers were therefore all exposed to linguistic context and sentence cues. The
rationale for using the UCLA IPA segments as endpoints was that in Gurindji Kriol
fricatives are highly variable (Buchan et al., 2012) and the native speaker target forms
are unknown. To use whole words or any kind of Gurindji Kriol categories as endpoints
would raise a new set of issues around how speakers represent those categories and may
The UCLA segments were used because they are clear typical examples of IPA
contexts in the UCLA segments, which may have limited the study as the endpoints did
not indicate the actual range of the Gurindji Kriol sounds that were judged. Rather, the
awareness training was designed to address some of the issues with the endpoints by
focusing on sound segmentation and isolating sounds in different word positions. One
aim of the training was to develop RAs’ phonological awareness skills so that they
could compare the consonants in the endpoints to Gurindji Kriol sounds in initial,
medial and final word positions in the tokens. Knowing more about the variation in
sounds and how they are typically pronounced in the language being studied would
allow for a closer approximation of the sounds used in the endpoints of a visual
A further limitation of the current study is that a relatively small number of RAs
participated, and not all of them completed the phonological awareness training that was
conducted in a previous visit. The native speakers who underwent the training assisted
those who did not do it, but these differences in the phonological awareness training
research using the current method would benefit from recruiting more RAs and
retaining the same people throughout the study, that is for both the training and the
6.6.4 Conclusion
Visual analogue scales are useful tools for eliciting linguistic knowledge about
phonology from native speakers who do not have technical training in phonetic
provided insight into inter-transcriber disagreements and alerted us to sounds that may
vary over a continuum rather than being categorical, such as voicing, and possible
covert contrasts.
Kriol in the field was generally successful. Native speakers have implicit linguistic
knowledge of their language, and the visual analogue scales allowed us to elicit their
knowledge of the sound system to compare with non-native transcriber IPA judgements.
Results suggested that ambiguities and discrepancies in phonetic transcriptions may not
always be due to random error but may be more systematic, with factors such as
Conclusion
CHAPTER 7 210
structures in the input forms the basis of much of their linguistic knowledge. According
language phonology is an ongoing process and the input continually influences the
phonological variation in speech input to infants (e.g. Cristià, 2010, 2011; Davis &
Lindblom, 2001; Fernald et al., 1989; Grieser & Kuhl, 1988; Katz, Cohn & Moore,
1996; Kirchhoff & Schimmel, 2005; Kuhl et al., 1997; Snow, 1977; Stern, Spieker,
Barnett & MacKain, 1983). The phonology of the input continues to change after the
first year of life as child-directed speech develops into adult-directed speech. There are,
however, relatively few longitudinal studies of how and when this happens.
The purpose of this thesis was twofold. The main aim was to investigate how
two language varieties in northern Australia – Gurindji Kriol and Australian English. A
fricatives in Kriol-derived words. The Australian English corpus collected and analysed
for this thesis was from Katherine, a regional town in the same geographic region as
children aged from approximately 1;6 to 2;0 to 2;6. This is considered an important
the native language continues to develop and their receptive and productive
This thesis consisted of three empirical research studies to address three main
research questions: 1) Study 1, Chapter 4 – What is the nature and extent of variation in
English maternal speech? 3) Study 3, Chapter 6 – How can we elicit native speaker
(VAS), and what information does this add to non-native transcribers’ phonetic
transcriptions? In this chapter I discuss the main findings and conclusions across all
three studies, the implications and limitations of this research, and directions for future
research.
The overall results of the studies in this thesis show that children are exposed to
varieties, and that variation changes in maternal speech over child ages 1;6, 2;0 and 2;6.
Fricative variation is complex and different types were found in each language variety,
derived words were highly variable, at least some of the variability was systematic.
Manner of articulation variation between stops and fricatives was more likely to occur
in word-initial position (rather than medial or final), at labio-dental and alveolar places
of articulation, and in lexical items that were open- rather than closed-class. In all
CHAPTER 7 212
tokens that could potentially be pronounced with a fricative (defined as words whose
were significantly more likely in mothers’ speech when children were approximately
2;6 than when they were 1;6. In tokens of words that were found to contain stop-
fricative variation, word-initial fricatives (with the exception of /h/, which did not occur
frequently enough to be analysed) were again more likely at 2;6 than 1;6. In these
tokens fricatives were also significantly more likely in medial position at child age 2;0
and 2;6 than at 1;6. These results took into account both phonological environment and
older partly due to lexical choice, as mothers also proportionately increased their use of
words that could be pronounced with fricatives as children aged. Findings can be
interpreted in terms of fine-tuning, the concept that mothers make adjustments to their
may serve to gradually introduce children to the variable phonetic forms that are
maternal speech was investigated. We analysed word-initial /h/ and word-final /v/
are frequent in casual speech. It was found that deletion occurred in a stable set of
lexical items that were mostly function words. In mothers’ speech deletion
proportionately increased between child ages 1;6 and 2;0, and decreased between 2;0
and 2;6. The effect was significant for /h/ deletion and approached significance for /v/,
which had fewer occurrences in the corpus. Further analysis showed that the effect of
child age on deletion was not likely to be caused by changes in speech rate. The non-
CHAPTER 7 213
linear change over time was unexpected, as the few previous studies that have examined
linear change from child-directed to adult-directed speech style (Foulkes, Docherty &
Watt, 2005; Smith, Durham & Fortune, 2009). Possible explanations for the Australian
English maternal speech findings also relate to fine-tuning, as mothers may have
There are several mechanisms that could lead to a non-linear change in deletion,
in particular changes in the distributions of speech acts and types of social interactions,
which are associated with different speech styles along a continuum of casual to careful
speech. For example, Smith, Durham and Fortune (2007) found that in the informal
speech contexts play and routine, child-directed speech was more likely to contain a
non-standard phonetic variant than in the more formal contexts of teaching and
continuum, such as questions, recasts and repetitions, may become more frequent in
frequencies of different types of interactions between mothers and children, e.g. play,
explicit teaching, behaviour management, may also shift over time with children’s
social and cognitive development. Further research is needed to examine whether there
are stylistic changes over time in maternal speech and how it relates to changes in
phonetic variability. As with Gurindji Kriol, in northern Australian English one function
of changing phonetic variability in input to children over time may be to introduce them
VAS and compared these to IPA transcriptions made by non-native transcribers. This
CHAPTER 7 214
methodology provided further insight into variation in Gurindji Kriol, and elucidated
speaker and non-native transcriber may have been due to several effects on perception
perceptual hypercorrection. Findings also showed agreement between native and non-
native speakers on many segments, particularly those in medial and final word
positions.
7.2 Implications
The findings of this thesis show that fricatives in speech input to children are
variable and change over time in Gurindji Kriol and northern Australian English. More
generally, the findings indicate that theoretical models of phonological acquisition need
to account for a variable and dynamic input. As discussed in the theoretical framework
in section 2.4, exemplar-based theories are well positioned to do this as they propose
mechanisms for how children process and store variable phonetic detail and associated
2003; Pierrehumbert, 2003). The current studies were not designed to test the various
theoretical models, rather exemplar-based models form the theoretical rationale for
describing variation in the input. Exemplar models account for phonetic variation in the
input including phonetic variation related to fine-tuning, which was explored in Studies
categories in memory, and phonetic variants in the input influence the distributions of
communities where similar language varieties are spoken. Although Gurindji Kriol is
the home language of children in Kalkaringi, English is the language of the school, and
children are taught in English when they start going to school. Prior to the larger project
that this research is a part of there were no detailed systematic descriptions of the
phonology of Gurindji Kriol and thus there was little information for teachers to use as
a basis for teaching English language and literacy. Information about sounds in Gurindji
Kriol and the similarities and differences to Australian English could help guide
teachers’ explicit teaching of sounds and words in English. In particular the current
findings suggest that Gurindji children may benefit from explicit teaching of
English between fricatives and stops in different word positions. For example the results
of Study 1 suggest that children are exposed to fricatives in word-initial position, with
the majority being labio-dental and alveolar in open-class words. English also contains
word-initial labio-dental dental and alveolar fricatives, but in contrast to Gurindji Kriol
English also has a relatively high frequency of interdental word-initial fricatives, largely
due to their presence in English closed-class words (the, this, that, then, there, etc.). The
corresponding Kriol-derived words in Gurindji Kriol were nearly all pronounced with
word-initial stops, and children may need special instruction to help them perceive and
medial fricatives in Kriol-derived words alternate frequently with stops, which means
CHAPTER 7 216
words may sometimes sound quite similar to the corresponding English word and
the medial consonant in neba ‘never’). If teachers hear children producing fricatives
they may assume that the children have a command of English phonology, without
necessarily realising that they are producing Gurindji Kriol phonology where stops and
The rarest position for fricatives in Kriol-derived words was word-final. Again,
may also need further teaching of English morphology as part of learning word-final
fricatives. The results from the Visual Analogue Scales with adults in Study 3 suggested
Further research is needed to examine voicing in Gurindji Kriol fricatives; however, the
present research does suggest variability in voicing. This is important when it comes to
Overall, the findings of these studies have practical implications for teachers at
the school in Kalkaringi and neighbouring areas. They can also be used in the same way
for teaching traditional Gurindji, for example starting with the sounds that children have
already learned from Gurindji Kriol input and then later teaching the words that sound
relatively different. The different distributions of fricatives in each word position have
pedagogical implications, as it is important for teachers to know that learners will likely
be more familiar with some fricatives in particular positions. For example, relative to
speech. Being familiar with the initial [s] sound from their home language may make it
CHAPTER 7 217
easier to learn [s] in medial and final positions in English words if teachers can make
In the feedback trip to Katherine and Kalkaringi conducted at the end of the
providing information and sound charts to teachers about Gurindji Kriol phonology
described in the current studies and studies in the larger project. Information included
brief descriptions of Gurindji Kriol fricatives and stop-fricative variation from the
studies in this thesis, as well as stop consonant voicing variation (Jones & Meakins,
2012a), the vowel system (Jones, Meakins & Buchan, 2011; Jones, Meakins &
Muawiyath, 2012), and baby-talk phonology (Jones & Meakins, 2012b). The focus of
the information provided to teachers was on similarities and differences between sounds
in Gurindji Kriol and Australian English, with the aim of developing teachers’
knowledge about Gurindji mothers’ speech to children, sounds that children may have
difficulty with, and sounds that they are likely to be familiar with.
7.3 Limitations
As the limitations of each study have been identified in the discussion sections
in Chapters 4, 5 and 6, the wider limitations of the thesis as a whole will now be
discussed. First, both the Katherine English corpus and the subsample of the Gurindji
Kriol corpus analysed for these studies consisted of a small number of speakers (four
Katherine English speaking mothers, three Gurindji Kriol speaking mothers). Small
speaker numbers is a common issue in many child speech studies involving corpus
analyses, due to the large demands on time and resources that it takes to create a corpus.
A relatively small number of speakers was built into the current design as it allowed for
in-depth analysis of naturalistic data, that is, analysis of occurrences of segments and
results and further research is needed to determine how widespread the current findings
are.
that the data were not specifically child-directed but included all speech spoken by
mothers in the presence of the target children. This was again by design, based on
previous conceptualisations of input to children in the literature (see Chapter 2 pp. 19-
children and other speech in their presence that would not be revealed by the analyses in
this thesis.
Another broader limitation inherent in the design of this study is that the data
were from phonetic transcription, which is primarily a perceptual activity and may be
subject to perceptual biases. Potential problems with phonetic transcription data are
phonetic transcription as a data analysis tool formed part of the rationale for Study 3
transcribed segments with native speaker judgments. As well as checking the phonetic
reliability checks were also undertaken for the transcription data analysed in Studies 1
and 2. Future research could involve acoustic analyses of Gurindji Kriol speech
variation than transcription based analyses will allow. Acoustic analyses of fricatives
was not feasible for the data in these studies, as they are from naturalistic recordings
that contain varying amounts of background noise and were not controlled for context.
Also, the Gurindji Kriol conversational data were originally recorded for the purpose of
morphological analyses rather than detailed acoustic processing and some audio
CHAPTER 7 219
recordings of Gurindji Kriol were made on mini-discs, which compress audio data.
appropriate methods for analysing fricatives in the current data than acoustic analyses.
language, and the Katherine English corpus is the first phonetically transcribed
findings of this research therefore open up several broad avenues for future research.
Investigating fricatives more broadly and using acoustic analyses and perceptual testing
could be done to investigate whether or not there are any fricative contrasts in Gurindji
lenition, socioindexical and stylistic factors) to fricative variation. These findings can
also be used as a basis for researching fricatives in Kriol and other Australian contact
language varieties.
analyse children’s receptive and productive language longitudinally over the preschool
years. Such research would provide information about the developmental trajectories of
children’s acquisition of phonetic variation in the input, and the specific effects that
change over time in maternal speech has on children’s language acquisition. We have
longitudinal data of the Australian English speaking children’s productions as the target
children in the Katherine English corpus did wear microphones for most of the sessions.
Future research on this data may provide further insight on the relationships between
Another question for future research on both language varieties is how much
variation is related to speech styles and the types of interactions? We suggested changes
Also raised in the Katherine English study was a question about sociolinguistic
about social groups and/or speech styles. Research to identify these variables would
Australian English spoken in different regions and social groups. Studies of change over
speech would allow for comparisons with previous studies in other English varieties (in
particular Foulkes, Docherty and Watt’s (2005) study of Tyneside English child-
directed speech), and further insight into children’s acquisition of phonetic variation
The methodological study in Chapter 6 also highlights areas for further research.
Findings indicated that Visual Analogue Scales could be used successfully in the field
technical training in transcription. More generally it suggests that visual analogue scales
studies could investigate the application of this to other contexts. For example in adult
L2 language learning visual analogue scales could be used to compare learner and
native speaker judgements for the purpose of providing specific feedback to L2 learners
CHAPTER 7 221
contexts.
The pedagogical implications of the research also open avenues for further
research. The larger study included analyses of phonetic variation at acoustic and
segmental levels of vowels and consonants, which applied to a teaching context is a lot
which sounds to prioritise and the contrasts that are important for intelligibility and
minimal pairs and their lexical frequency, and the frequency of particular phones within
minimal pair words (Brown, 1988; King, 1967; Munro & Derwing, 2006; Surendran &
Niyogi, 2006). Phones that have high frequencies and distinguish a number of minimal
pairs have high functional load (such as many vowels in English), while low frequency
phones that distinguish relatively few pairs (such as [ʃ, ʒ] in English) have low
functional load. Speech errors involving high functional load sounds affect
involving low functional load sounds (Munro & Derwing, 2006). Empirical information
about functional load would therefore be useful to teachers, who could use it to
thesis is an analysis of the sounds children are likely to be familiar with through
exposure to speech in their home language. Future studies of children’s speech would
CHAPTER 7 222
provide information about the sounds they have difficulty producing, which could then
importance of the difficult sounds (Brown, 1988). Further questions raised in the
Katherine English study (Chapter 5) and the Visual Analogues Scales study (Chapter 6)
could also be studied from a functional load approach. For example, are phonological
reduction and other casual speech processes more likely to occur in low than high
functional load segments? What are the relationships between functional load, phonetic
7.5 Conclusion
that children are exposed to in two language varieties in northern Australia, Gurindji
Kriol and northern Australian English. Further, variation changes in mothers’ speech as
children age between approximately 1;6, 2;0 and 2;6 in some processes and
into the language input to children after infancy, and of systematic descriptions of
variation in contact language varieties. Results indicate the need for models of language
References
Anderson, J. L., Morgan, J. L., & White, K. S. (2003). A statistical basis for speech sound
Au, T. K., Knightly, L. M., Jun, S. A., & Oh, S. (2002). Overhearing a language during
Ball, P., Giles, H., & Hewstone, M. (1985). Interpersonal accommodation and situational
London: Erlbaum.
Bard, E. G., & Anderson, A. (1983). The unintelligibility of speech to children. Journal of
Beckman, M. E., Munson, B., & Edwards, J. (2007). Vocabulary growth and the
Hualde (Eds.), Papers in laboratory phonology (Vol. 9, pp. 241-264). New York:
Mouton de Gruyter.
Behrmann, M., & Bub, D. (1992). Surface dyslexia and dysgraphia: Dual routes, single
REFERENCES 224
Bell, A. (1984). Language style as audience design. Language in Society, 13(2), 145-204.
Bent, T., Bradlow, A., & Smith, B. L. (2007). Segmental errors in different word positions
and their effects on intelligibility of non-native speech: All's well that begins well.
Browman, C. P., & Goldstein, L. M. (1987). Tiers in articulatory phonology, with some
laboratory phonology I: Between the grammar and physics of speech (pp. 341-
Brown, A. (1988). Functional load and the teaching of pronunciation. TESOL Quarterly,
22(4), 593-606.
Buchan, H. (2011). Phon: Free software for phonological transcription and analysis.
Buchan, H., & Jones, C. (2013). Phonological reduction in maternal speech in northern
10.1017/S0305000913000123
REFERENCES 225
Buchan, H., Jones, C., & Meakins, F. (2012). Fricative variation in maternal Gurindji
Speech production: models, phonetic processes and techniques (pp. 187-210). New
Chevrot, J. P., Dugua, C., & Fayol, M. (2009). Liaison acquisition, word segmentation and
557-596.
Chevrot, J. P., Nardy, A., & Barbu, S. (2011). Developmental dynamics of SES-related
89(3), 183-213.
Cole, R., & Jakimik, J. (1980). A model of speech perception. In R. Cole (Ed.), Perception
Associates.
REFERENCES 226
Coltheart, M., Curtis, B., Atkins, P., & Haller, M. (1993). Models of reading aloud: Dual-
100(4), 589-608.
Cooper, R. P., & Aslin, R. N. (1990). Preference for infant-directed speech in the first
Coussé, E., Gillis, S., Kloots, H., & Swerts, M. (2004). The influence of the labeller’s
Cox, F. (2006). Australian English pronunciation into the 21st century. Prospect: The
Cox, F., & Palethorpe, S. (2010). Australian Voices, Macquarie University. Retrieved from
http://clas.mq.edu.au/australian-voices/australian-voices.
Cox, F. M., & Palethorpe, S. (2004). The border effect: Vowel differences across the
Linguistics Society.
Cox, F. M., & Palethorpe, S. (2007). Illustrations of the IPA: Australian English. Journal
Melbourne, Australia.
Cristià, A. (2011). Fine-grained variation in caregivers' /s/ predicts their infants' /s/
Cruttenden, A. (1994). Phonetic and prosodic aspects of baby talk. In C. Gallaway & B. J.
Cutler, A. (1993). Phonological cues to open- and closed-class words in the processing of
Daland, R. (2013). Variation in the input: A case study of manner class frequencies.
Davis, B. L., & Lindblom, B. (2001). Phonetic variability in Baby Talk and development
cognitive abilities in early infancy (pp. 135-171). Mahwah, New Jersey: Lawrence
Erlbaum.
de Jong, K. J., Beckman, M. E., & Edwards, J. (1993). The interplay between prosodic
Dominey, P. F., & Dodane, C. (2004). Indeterminacy in language acquisition: The role of
child directed speech and joint attention. Journal of Neurolinguistics, 17(2), 121-
145.
Eckert, P. (2001). Style and social meaning. In P. Eckert & J. Rickford (Eds.), Style and
Edwards, J., Beckman, M. E., & Munson, B. (2004). The interaction between vocabulary
Edwards, J., & Munson, B. (2012). Transcribing the speech of multilingual children with
Fernald, A., & Morikawa, H. (1993). Common themes and cultural variations in Japanese
Fernald, A., Taeschner, T., Dunn, J., Papousek, M., de Boysson-Bardies, B., & Fukui, I.
Finegan, F. E., & Biber, D. (2001). Register variation and social dialect variation: The
Forster, K. I. (1976). Accessing the mental lexicon. In F. J. Wales & E. Walker (Eds.), New
Foulkes, P., & Docherty, G. (2006). The social life of phonetics and phonology. Journal of
Foulkes, P., Docherty, G., & Watt, D. (2005). Phonological variation in child-directed
Giles, H., Coupland, N., & Coupland, J. (1991). Accommodation theory: Communication,
Press.
Guilford Press.
REFERENCES 230
Goddijn, S., & Binnenpoorte, D. (2003). Assessing manually corrected broad phonetic
Goldinger, S. D. (1996). Words and voices: Episodic traces in spoken word identification
Goldinger, S. D., & Azuma, T. (2004). Episodic memory reflected in printed word naming.
Grieser, D. L., & Kuhl, P. K. (1988). Maternal speech to infants in a tonal language:
24(1), 14-20.
Harrington, J., Cox, F. M., & Evans, Z. (1997). An acoustic phonetic study of broad,
Hawkins, S. (2003). Roles and representations of systematic fine phonetic detail in speech
Hazan, V., & Barrett, S. (2000). The development of phonemic categorization in children
Ségéral (Eds.), Lenition and fortition (pp. 9-93). Berlin: Mouton de Gruyter.
Hudson, J. (1985). Grammatical and semantic aspects of Fitzroy Valley Kriol. Darwin:
Jones, C., & Meakins, F. (2012a). Variability in stop consonant voicing in Gurindji Kriol,
Jones, C., & Meakins, F. (2012b). The form and function of janyarrp: Gurindji Kriol child-
Jones, C., Meakins, F., & Buchan, H. (2011). Comparing vowels in Gurindji Kriol and
305-326.
Jones, C., Meakins, F., & Muawiyath, S. (to appear). Learning vowel categories from
Julien, H., & Munson, B. (to appear). Modifying speech to children based on their
Jusczyk, P. W., Cutler, A., & Redanz, N. J. (1993). Infants' preference for the predominant
Jusczyk, P. W., Friederici, A. D., Wessels, J. M. I., Svenkerud, V. Y., & Jusczyk, A. M.
REFERENCES 232
(1993). Infants' sensitivity to the sound patterns of native language words. Journal
Jusczyk, P. W., Luce, P. A., & Charles-Luce, J. (1994). Infants' sensitivity to phonotactic
patterns in the native language. Journal of Memory and Language, 33(5), 630-645.
Kaplan, A. (2011). Perceptual pressures on lenition. Language and Speech, 54(3), 285-305.
Katz, G. S., Cohn, J. F., & Moore, C. A. (1996). A combination of vocal f0 dynamic and
Kearns, K. P., & Simmons, N. N. (1988). Interobserver reliability and perceptual ratings:
More than meets the ear. Journal of Speech and Hearing Research, 31(4), 131-136.
Kempster, G. B., Gerratt, B. R., Abbott, K. V., Barkmeier-Kraemer, J., & Hillman, R. E.
18(2), 124-132.
Kerswill, P., & Williams, A. (2000). Creating a New Town koine: Children and language
Kirchhoff, K., & Schimmel, S. (2005). Statistical properties of infant-directed versus adult-
directed speech: Insights from speech recognition. Journal of the Acoustical Society
REFERENCES 233
University Press.
function of infant age and sex in a tonal and non-tonal language. Infant Behaviour
122(8), 841-857.
Kong, E. J., & Edwards, J. (2011). Individual differences in speech perception: Evidence
Koreman, J. (2006). Perceived speech rate: The effects of articulation rate and speaking
582-596.
Kuhl, P. K., Andruski, J. E., Chistovich, I. A., Chistovich, L. A., Kozhevnikova, E. V.,
Kuhl, P. K., Williams, K. A., Lacerda, F., Stevens, K. N., & Lindblom, B. (1992).
Labov, W. (1984). Field methods in the project on linguistic change and variation. In J.
REFERENCES 234
Baugh & J. Sherzer (Eds.), Language in use: Readings in sociolinguistics (pp. 28-
Labov, W. (1989). The child as a linguistic historian. Language Variation and Change 1,
85-97.
Blackwell.
Ladefoged, P., & Maddieson, I. (1996). The sounds of the world's languages. Oxford &
Lee, S., Davis, B., & MacNeilage, P. F. (2008). Segmental properties of input to infants: A
Lee, S., & Davis, B. L. (2010). Segmental distribution patterns of English infant- and
Hardcastle & A. Marchal (Eds.), Speech production and speech modelling (pp.
Liu, H. M., Tsao, F. M., & Kuhl, P. K. (2009). Age-related changes in acoustic
Loakes, D., Hajek, J., & Fletcher, J. (2010). The /el/-/æ/ sound change in Australian
Selected papers from the 2009 conference of the Australian Linguistic Society.
Louko, L. J., & Edwards, M. L. (2001). Issues in collecting and transcribing speech
MacWhinney, B. (2000). The CHILDES project: Tools for analyzing talk (3rd ed.).
Martinez-Sussmann, C., Akhtar, N., Diesendruck, G., & Markson, L. (2011). Orienting to
Maye, J., Weiss, D. J., & Aslin, R. N. (2008). Statistical phonetic learning in infants:
from Australia and their implications. Journal of Language Contact, THEMA II,
187-212.
McConvell, P., & Meakins, F. (2005). Gurindji Kriol: A mixed language emerges from
Meakins, F. (2007). Case marking in contact: The development and function of case
Meakins, F. (2009). The case of the shifty ergative marker: A pragmatic shift in the
(Eds.), The role of semantics and pragmatics in the development of case (pp. 59-
Huber (Eds.), Atlas of pidgin and creole structures (Vol III, pp. 131-139). Oxford:
Monaghan, P., Chater, N., & Christiansen, M. H. (2005). The differential role of
96(2), 143-182.
Munro, M. J., & Derwing, T. M. (2006). The functional load principle in ESL
Munson, B., Edwards, J., & Beckman, M. E. (2005). Relationships between nonword
61-78.
Munson, B., Edwards, J., Schellinger, S. K., Beckman, M. E., & Meyer, M. K. (2010).
245-260.
Munson, B., Johnson, J., & Edwards, J. (2012). The role of experience in the perception of
Munson, B., Kurtz, B. A., & Windsor, J. (2005). The influence of vocabulary size,
Munson, B., Swenson, C. L., & Manthei, S. C. (2005). Lexical and phonological
Nelson, K. (1973). Structure and strategy in learning to talk. Monographs of the Society for
Oller, D. K., & Eilers, R. E. (1975). Phonetic expectation and transcription validity.
Palmeri, T. J., Goldinger, S. D., & Pisoni, D. B. (1993). Episodic encoding of voice
Picheny, M. A., Durlach, N. I., & Braider, L. D. (1986). Speaking clearly for the hard of
Pruitt, J. S., Jenkins, J. J., & Strange, W. (2006). Training the perception of Hindi dental
and retroflex stops by native speakers of American English and Japanese. Journal
Ralston, J. V., & Johnson, K. (1990). Context effects as a function of perceptual set.
Ramsdell, H. L., Oller, K. D., & Ethington, C. A. (2007). Predicting phonetic transcription
REFERENCES 239
Rose, Y., Hedlund, G. J., Byrne, R., Wareham, T., & MacWhinney, B. (2007). Phon 1.2: A
the Association for Computational Linguistics (pp. 17-24). Stroudsburg, PA: ACL.
Rost, G. C., & McMurray, B. (2010). Finding the signal by adding noise: The role of
Linguistics.
Wisconsin-Madison.
Shi, R., Werker, J. F., & Cutler, A. (2006). Recognition and representation of function
Shriberg, L. D., & Lof, G. L. (1991). Reliability studies in broad and narrow phonetic
Press.
Singh, L., Morgan, J. L., & Best, C. T. (2002). Infants' listening preferences: Baby talk or
Skorniakova, O., & Ito, K. (2011). Sub-phonemic variation during lexical identification:
Evidence from visual analogue scale (VAS) goodness ratings. Proceedings of the
Smith, J., Durham, M., & Fortune, L. (2007). "Mam, my trousers is fa'in doon!":
Smith, J., Durham, M., & Fortune, L. (2009). Universal and dialect-specific pathways of
acquisition: Caregivers, children and t/d deletion. Language Variation and Change,
21(1), 69-95.
Snow, C. E. (1977). Mothers' speech research: From input to interaction. In C. E. Snow &
C. Ferguson (Eds.), Talking to children: Language input and acquisition (pp. 31-
Snow, C. E., & Ferguson, C. A. (Eds.). (1977). Talking to children: Language input and
Soderstrom, M. (2007). Beyond babytalk: Re-evaluating the nature and content of speech
Stern, D. N., Spieker, S., Barnett, R. K., & MacKain, K. (1983). The prosody of maternal
speech: Infant age and context related changes. Journal of Child Language, 10(1),
1-15.
speech. In R. D. Walk & H. L. Pick (Eds.), Perception and experience. New York:
Plenum Press.
Surendran, D., & Niyogi, P. (2006). Quantifying the functional load of phonemic
Benjamins.
Tagliamonte, S. A., & Temple, R. (2005). New perspectives on an ol' variable: (t,d) in
REFERENCES 242
Tollfree, L. (2001). Variation and change in Australian English consonants. In D. Blair &
P. Collins (Eds.), Varieties of English around the world: English in Australia (pp.
Tulving, E. (1972). Episodic and semantic memory. In E. Tulving & W. Donaldson (Eds.),
Urberg-Carlson, K., Munson, B., & Kaiser, E. (2009). Gradient measures of children's
speech production: Visual analog scale and equal appearing interval scale measures
2529.
van de Weijer, J. (2002). How much does an infant hear in a day? In J. Costa & M. J.
Vihman, M. M., Nakai, S., DePaolis, R. A., & Hallé, P. (2004). The role of accentual
336-353.
Warren, R. M., & Warren, R. P. (1970). Auditory illusions and confusions. Scientific
Wassink, A. B., Wright, R. A., & Franklin, A. D. (2007). Intraspeaker variability in vowel
Wedel, A. B. (2006). Exemplar models, evolution and language change. The Linguistic
Werker, J. F., Fennell, C. T., Corcoran, K. M., & Stager, C. L. (2002). Infants' ability to
learn phonetically similar words: Effects of age and vocabulary size. Infancy, 3(1),
1-30.
Werker, J. F., & McLeod, P. J. (1989). Infant preference for both male and female infant-
Werker, J. F., & Tees, R. C. (1999). Influences on infant speech processing: Toward a new
Wheeler, M. A., Stuss, D. T., & Tulving, E. (1997). Toward a theory of episodic memory:
331-354.
Williams, A., & Kerswill, P. (1999). Dialect levelling: Continuity vs. change in Milton
Keynes, Reading and Hull. In P. Foulkes & G. J. Docherty (Eds.), Urban voices
15 July 2009
Dear
You are invited to participate in a study being conducted by Dr Caroline Jones and
Heather Buchan (University of Wollongong). Between 2004 and 2007 you participated
in the ACLA research project with Felicity Meakins who made video and audio
recordings of you talking to your children and other people around you. We are inviting
you to help us write down the sounds in these recordings for a new project. This project
is to find out about the sounds you use talking to children in Daguragu and Katherine.
INVESTIGATORS
Dr Caroline Jones Heather Buchan
Faculty of Education Faculty of Education
carjones@uow.edu.au hb307@uow.edu.au
If you agree, we would like to record these sessions on video and audio. We would also
ask you about how we should use and store recordings of you talking about the videos.
If you choose to take part you will be paid at $30.70 an hour before tax. We do not think
there are risks for you. You can choose if you want to do it or not. You can stop
whenever you like. You don’t have to tell us why and nothing bad will happen. Just let
us know, and we can delete the recordings of you talking about the videos.
This project is funded by a grant from the Australian Research Council. There is no
direct benefit to you. The project can be used to help teachers and community members
with teaching traditional languages and English.
No-one will be able to identify you or your child from the results of the study. Only the
researchers will have access to this information. If you agree, we might use some photos
or recordings when we present the information (for example at a conference).
All information will be stored securely at the University of Wollongong for at least 5
years and only the researchers will have access to it. You can choose to store your
information at NCI and/or AIATSIS, or not.
-NCI (National Computational Infrastructure) used to be called APAC
(Australian Partnership for Advanced Computing). It stores the information you gave
to Felicity Meakins for the ACLA (Aboriginal Child Language Acquisition) project.
-AIATSIS is a national collection of audio and visual records of
Australian Aboriginal and Torres Strait Islander cultures and histories.
If you would like to check that you are OK with the information or recordings from the
study, you need to contact Dr Caroline Jones on (02) 4221 4905.
When you have read this information sheet, Dr Caroline Jones and Heather Buchan are
free to talk about any questions. We will be happy to talk to you at any time in the
project if you think of something later, or if any problems come up.
1. I have read the participation information sheet and have been able to ask the
researchers any questions.
2. I have discussed with the researchers how my data is going to be used and stored
and I am happy with how the study has been explained to me.
4. I understand that this study does not have any known risk for me, and I have
read the information sheet and asked any questions about risks.
5. I understand that I will watch and listen to audio and visual recordings of me
speaking to my children. These recordings are some of the ones that were made
by Felicity Meakins between 2004 and 2007 for the ACLA project. I understand
that I will help to write down my speech in the recordings.
6. I agree to be audio and video recorded while I help the researchers write it down.
The researchers will delete anything I want them to.
8. If I have any concerns or complaints about way the research is or has been
conducted I can contact the Ethics Officer, Human Research Ethics Committee,
Office of Research, University of Wollongong on 4221 4457.
Signed Date
....................................................................... ......./....../......
Name (please print)
.......................................................................
On this page you can tell us how the recordings can be used, and
who can access the recordings. Please circle what you agree to:
Storing the videos and making sure you can get copies:
I would / would not like a video file of the recordings to be kept at Diwurruwurru-
jaru (Katherine regional Aboriginal language centre).
Tick one:
anyone can access only me and my family
anyone with my permission only me
I do / do not allow for the data I give to this study to be stored at NCI with the
data I gave to the ACLA project.
In five years time, I would / would not like to have recordings stored at AIATSIS.
I do / do not give my permission for the researchers of this project to print parts of
transcripts in books / journals and other printed reports.
names must be changed
I do / do not give my permission for the researchers to show short clips of video at
conferences and meetings.
no names in the audio
faces in video blurred
I do / do not give my permission for the researchers to print photos for academic
publications and reports / show photos at conferences and meetings / put photos
on website.
any
check with me first
Signed
....................................................................... ......./....../......
Name (please print) date
.......................................................................
15 July 2009
You are invited to participate in a study being conducted by Dr Caroline Jones and
Heather Buchan (Education, University of Wollongong). The purpose of this project is
to look at sound patterns in adults’ everyday speech to children in regional northern
Australia.
INVESTIGATORS
Dr Caroline Jones Heather Buchan
Faculty of Education Faculty of Education
carjones@uow.edu.au hb307@uow.edu.au
We will ask to come back to make recordings every 6 months for a year and a half. We
would like to make 3 to 4 hours of recordings each time. We will transcribe and do
acoustic analyses of speech in the recordings and analyse the sound patterns.
We would also like to talk with you about how you want recordings of you and your
child to be used and stored, and record these discussions.
If you choose to take part you will be paid $25 an hour to reimburse you for your time.
We do not think there are risks for you or your child. You are free to decide if you want
to be involved in this project or not and you can stop participating at any time. You do
not need to give a reason for changing your mind, and there won’t be any negative
effects for you if you choose to stop participating even if the study has started. All you
have to do is let the researcher know, and any information we already have about you
will be destroyed.
This project is funded by a grant from the Australian Research Council. Your
participation will not directly benefit you. Participating will help us describe the sound
patterns of speech to children in regional Australian English, and this information can
be used to help teachers in northern Australia with teaching language to children.
All information will be stored securely at the University of Wollongong for at least 5
years and only the researchers will have access to it. In the attached consent form we
ask you to tell us how you would like your data to be used and stored.
If you would like to check that you are OK with the information or recordings from the
study, you need to contact Dr Caroline Jones on (02) 4221 4905.
When you have read this information sheet, Dr Caroline Jones and Heather Buchan are
free to talk about any questions you might have. They will be happy to talk to you at
any time in the project if you think of something later, or if any problems come up.
9. I have read the participation information sheet and have been able to ask the
researchers any questions I may have had.
10. I have discussed with the researchers how my data is going to be used and stored
and I am happy with how the study has been explained to me.
12. I understand that this study does not have any known risk for me, and I have
read the information sheet and asked any questions I may have about the risks.
14. I understand that the researchers will contact me and ask to make recordings
every 6 months for a year and a half, and each time they will make 3 to 4 hours
of recordings.
15. I understand that I will be asked regularly to look at the recordings to make sure
they are okay. I may also be asked to clarify to the researchers what I am saying
in some of these recordings.
16. I understand that all of my work with the researchers will be video and audio
recorded.
17. If I have any concerns or complaints regarding the way the research is or has
been conducted I can contact the Ethics Officer, Human Research Ethics
Committee, Office of Research, University of Wollongong on (02) 4221 4457.
To choose how and where we will keep the recordings, please circle whether you do
or do not give permission for these things:
Storing the videos and making sure you can get copies:
What material would you like stored on TalkBank? (Please tick all that apply)
audio
transcript
video
Do you want to be anonymous? (Please tick all that apply)
no, you can use my and my child’s real name
yes, by changing last names only
yes, by changing first names and last names
yes, by removing names from audio recordings
yes, by blurring faces in video recordings
Printing parts of the transcripts and showing videos and photos at conferences and
language meetings:
I do / do not give my permission for the researchers of this project to print parts of transcripts in
books / journals and other printed reports.
only with pseudonyms
I do / do not give my permission for the researchers to show short extracts of video at
conferences and meetings.
only if the audio contains no name
only if the video blurs faces
I do / do not give my permission for the researchers to print photos for academic publications
and reports / show photos at conferences and meetings / display photos on a project website
any
ask me about specific photos before using them
----------------------------------------------------------------------------------------------------------
Signed
....................................................................... ......./....../......
Name (please print) date
.......................................................................
15 July 2009
Your child is invited to participate in a study being conducted by Dr Caroline Jones and
Heather Buchan (Education, University of Wollongong). The purpose of this project is
to look at sound patterns in adults’ everyday speech to children in regional northern
Australia.
INVESTIGATORS
Dr Caroline Jones Heather Buchan
Faculty of Education Faculty of Education
carjones@uow.edu.au hb307@uow.edu.au
At the start of each recording session we will explain to your child in a simple way what
we are doing and why, and we will ask your child if he or she would like to take part.
You are welcome to help us with this.
We will ask to come back to make recordings every 6 months for a year and a half. We
would like to make 3 to 4 hours of recordings each time. We will transcribe and do
acoustic analyses of speech in the recordings and analyse the sound patterns.
We do not think there are risks for your child being involved in this study. You are free
to decide if you want your child to be involved in this project or not, and you can take
them out of the study at any time. You do not need to give a reason for changing your
mind, and there won’t be any negative effects for you if you choose to withdraw your
child even if the study has started. All you have to do is let the researcher know, and any
information we already have about your child will be destroyed.
This project is funded by a grant from the Australian Research Council. Participation
will not directly benefit your child. Participating will help us describe the sound patterns
of speech to children in regional Australian English, and this information can be used to
help teachers in northern Australia with teaching language to children.
All information will be stored securely at the University of Wollongong for at least 5
years and only the researchers will have access to it. In the attached consent form we
ask you to tell us how you would like your child’s data to be used and stored.
If you would like to check that you are OK with the information or recordings from the
study, you need to contact Dr Caroline Jones on (02) 4221 4905.
When you have read this information sheet, Dr Caroline Jones and Heather Buchan are
free to talk about any questions you might have. We will be happy to talk to you at any
time in the project if you think of something later, or if any problems come up.
1. I have read the participation information sheet and have been able to ask the
researchers any questions I may have had.
2. I have discussed with the researchers how my child’s data is going to be used
and stored and I am happy with how the study has been explained to me.
4. I understand that this study does not have any known risk for my child, and I
have read the information sheet and asked any questions I may have about the
risks.
6. I understand that the researchers will contact me and ask to make recordings
every 6 months for a year and a half, and each time they will make 3 to 4 hours
of recordings.
7. I understand that I will be asked regularly to look at the recordings to make sure
they are okay. I may also be asked to clarify to the researchers what my child or
I are saying in some of these recordings.
8. I understand that the video and audio recorder will be recording all the time the
researchers are with my child and myself.
9. If I have any concerns or complaints regarding the way the research is or has
been conducted I can contact the Ethics Officer, Human Research Ethics
Committee, Office of Research, University of Wollongong on (02) 4221 4457.
To choose how and where we will keep the recordings, please circle whether
you do or do not give permission for these things as your child’s legal guardian:
Storing the videos and making sure you can get copies:
I do / do not give my permission for audio recordings to be stored on the TalkBank database,
where it will be available to other researchers once this project has finished.
What material would you like stored on TalkBank? (Please tick all that apply)
audio
transcript
video
Do you want your child to be anonymous? (Please tick all that apply)
no, you can use my and my child’s real name
yes, by changing last names only
yes, by changing first names and last names
yes, by removing names from audio recordings
yes, by blurring faces in video recordings
Printing parts of the transcripts and showing videos and photos at conferences and
language meetings:
I do / do not give my permission for the researchers of this project to print parts of transcripts in
books / journals and other printed reports.
only with pseudonyms
I do / do not give my permission for the researchers to show short extracts of video at
conferences and meetings.
only if the audio contains no name
only if the video blurs faces
I do / do not give my permission for the researchers to print photos for academic publications
and reports / show photos at conferences and meetings / display photos on a project website
any
ask me about specific photos before using them
----------------------------------------------------------------------------------------------------------
APPENDIX B
List of Gurindji Kriol Words Containing
Potential Fricatives
APPENDIX B 262
List of words containing ‘potential fricatives’ that were variable, always a fricative, and
always a stop. Words in bold are English code-switches. Underlined words are recent
borrowings.
gibit V so CONJ
hawuj N sori V
ji V stat-im V
jidan V stik N
jinek N sting V
jingat V stirimraun V
jouim V stori N
juka N swim V
jutim V telafon N
libim V
lijin V
machine N
mub V
najan N
nes N
nojing EXCLAM
pleis N
pujim V
raifol N
said DIR
stop V
ting N
waip N
wajim V
APPENDIX B 264
yestadei N