Professional Documents
Culture Documents
doi:10.1093/applin/amy053
ß The Author(s) (2018). Published by Oxford University Press. All rights reserved.
For permissions, please email: journals.permissions@oup.com
2 EXPLORING THE INTELLIGIBILITY THRESHOLD
4. RESEARCH QUESTIONS
The current study is guided by the following research questions:
1 To what extent do unique phonological features affect listeners’ compre-
hension scores and intelligibility scores?
2 How might these phonological features be used to establish a threshold of
intelligibility for a variety of English accents?
Phonological features were operationalized using both segmental and supra-
segmental measures. We aimed to identify a threshold of intelligibility for
TOEFL-type listening comprehension passages produced with varied accents
by examining which features of speech typically associated with intelligibility
contribute to the comprehension of test passages by typical TOEFL test-takers.
5. METHOD
5.1 Creation of test materials
5.1.1 Speakers
Eighteen speakers, three from each of six distinct groups, were recruited. Two
groups represent traditionally standard English accents, General American
(GA) and British Received Pronunciation (RP); two groups represent
O. KANG, R. I. THOMSON, AND M. MORAN 7
Fluency features refer to articulation rate, mean lenth of run, pauses, and
hesitation markers.
Initially, a principal component analysis (PCA) was conducted as part of the
dimension reduction process so that the number of normed phonological vari-
ables could be reduced for the final analysis as seen in Kang et al. (2018a).
Pearson correlation coefficients were computed to ensure that the final set of
selected phonological features were not highly correlated among each other.
Linear LMEMs were then performed to examine how the overall phonological
features could affect the listening test scores. LMEM was chosen as a primary
analysis due to its robust power and flexibility by including both random and
fixed effects (Faraway 2005). For this mixed model, we treated the reduced
phonological features as covariates and took speaker and listeners for random
effects. Listening comprehension scores and intelligibility scores served as de-
pendent variables. These variables were also checked for normal distribution.
The skewness and kurtosis test results ranged from 1.4 to 2.4 with SDs of
0.14–0.74. Given that acceptable values for psychometric purposes are be-
tween 2 and +2 (George and Mallery 2010), these variables were considered
to be normally distributed.
14 EXPLORING THE INTELLIGIBILITY THRESHOLD
6. RESULTS
First, we conducted PCA analyses to reduce the number of phonological vari-
ables. We provide a summary of the results below, but more detailed informa-
tion regarding this preliminary step of our analysis can be found in Kang et al.
(2018a).
The PCA was computed three times independently for each of the three
categories (segmental features, prosodic features, and fluency features) to
maintain the transparent nature of each category, which enhanced the inter-
pretation of composite variables. In the first computation, the principal com-
ponent consisted of the following six features: consonant deletions, syllable
reductions, consonant cluster divergence, high functional vowel substitutions,
low functional vowel substitutions, and high functional consonant substitu-
tions. All six features extracted had positive coefficients; accordingly, we com-
bined all six features and created one super-feature called ‘consonant and
vowel divergence’. Pace, word stress divergence, and falling tone choices
were discriminant from other variables displaying the same direction with
negative component loadings. On the other hand, number of tone units,
rising tone choices, and rhythm revealed the opposite characteristics with posi-
tive coefficients but strongly correlated with the Primary Component 1.
Accordingly, we labeled the first three features (pace, word stress divergence,
and falling tone choices) as ‘impeding prosodic markers’ and the other three
features (number of tone units, rising tone choices, and rhythm) as ‘enhancing
prosodic markers’.
Articulation rate, number of silent pauses, and unexpected pause ratio re-
vealed a positive relationship with the Primary Component 1, whereas mean
length of run (MLR) and expected pause ratio showed a negative association.
As a result, the first four features were grouped and labeled as ‘impeding flu-
ency markers’ as the increment of such variables hindered the listeners from
understanding the lectures. The other two features were composited together
and labeled as ‘enhancing fluency markers’ because the increase of these fea-
tures (i.e. MLR and expected pause ratio) would enhance the listeners’ listen-
ing comprehension.
In sum, for the LMEM analysis, the following five independent variables
were created as predictors for the response variables: (i) vowel and consonant
O. KANG, R. I. THOMSON, AND M. MORAN 15
Intercept 714 5.505 0.607 9.065 .000 11.073 1.438 7.698 .000
Vowel/consonant 714 10.2 1.630 6.255 .000 49.801 3.862 12.895 .000
divergence
Impeding prosody 714 0.218 0.007 1.643 .101 1.761 0.170 5.590 .000
Enhancing prosody 714 0.034 0.030 2.447 .015 0.002 0.071 0.084 .933
Impeding fluency 714 0.018 0.013 2.638 .008 0.028 0.315 1.701 .089
Enhancing fluency 714 0.0321 0.014 1.088 .277 0.444 0.033 6.214 .000
Indian 1
Indian 2
Indian 3
Chinese 1
Chinese 2
Chinese 3
South African 1
South African 2
South African 3
Spanish 1
Spanish 2
Spanish 3
American 1
American 2
American 3
Brish 1
Brish 2
Brish 3
American 1 Pace (the average number of prominent syllables per run) = 2.72
Falling tone = 61 per cent; level tone = 0 per cent; rising tone =
39 per cent
American 2 Pace (the average number of prominent syllables per run) = 2.37
Falling tone = 39 per cent; level tone =2 per cent; rising tone =
59 per cent
American 3 Pace (the average number of prominent syllables per run) = 2.48
Falling tone = 59 per cent; level tone = 6 per cent; rising tone =
35 per cent
British 1 Pace (the average number of prominent syllables per run) = 2.53
Falling tone = 51 per cent; level tone = 0 per cent; rising tone =
49 per cent
/O/!/ow/ (1)
Absence of flap (various times)
Absence of postvocalic r (various times)
/A/!/ O/ (3)
/æ/!/A/ (various times)
Addition of /h/ (1)
Minor word stress (1)
British 2 Pace (the average number of prominent syllables per run) = 2.15
Falling tone = 54 per cent; level tone = 2 per cent; rising tone =
44 per cent
Word stress (1)
/æ/!/A/ (various times)
British 3 Pace (the average number of prominent syllables per run) = 2.04
Falling tone = 48 per cent; level tone = 4 per cent; rising tone =
28 per cent
Absence of postvocalic r (various times)
/O/!/ow/ (1)
Absence of flap (various times)
/ A/!/ O/ (1)
South African 3 Pace (the average number of prominent syllables per run) = 2.41
Falling tone = 52 per cent; level tone = 12 per cent; rising tone =
36 per cent
Word stress (2)
Absence of flap (various times)
Absence of postvocalic r (various times)
Trilled r (various times)
O. KANG, R. I. THOMSON, AND M. MORAN 21
Indian 3 Pace (the average number of prominent syllables per run) = 2.81
Falling tone = 63 per cent; level tone = 0 per cent; rising tone =
37 per cent
/s/!/z/ (1)
/w/!/v/ (1)
Word stress (1)
Absence of the flap (various times)
Spanish 3 Pace (the average number of prominent syllables per run) = 2.30
Falling ton e = 57 per cent; level tone = 3 per cent; rising tone =
40 per cent
/I/!/iy/ (1)
/z/!/s/ (2)
/ ô/!/d/ (1)
/ A/!/ow/ (1)
Chinese 3 Pace (the average number of prominent syllables per run) = 2.23
Falling tone = 52 per cent; level tone = 0 per cent; rising tone =
44 per cent
/I/ /! /iy/ (1)
/"/! /iy/ (1)
Note. The number or percentage after each feature signifies number of occurrences.
the maximum possible divergence cases for each phonological category. For
example, to set up a divergence rate for vowels, we examined vowel diver-
gence occurrences across all 10 speakers. A British Speaker #1 had five diver-
gences, a Mexican Speaker #3 two divergences, a Chinese Speaker #3 two
divergences, and so forth. Five divergences for vowels found from a British
speaker appeared to be the highest (most frequent) value found from all
speakers.
Speakers achieving divergence rates for vowels in content words below 4.1
per cent, divergence rates for consonants in content words below 2.5 per cent,
divergence rates for lexical stress below 1.6 per cent (i.e. no more than 2 stress
divergences out of 120 content words), and divergence rates for number of
prominent words in content words (pace = 2.72 per cent) can be considered
intelligible particularly in the assessment context of the current study. For
vowels and consonants, the divergences from high functional loads may not
exceed 1.6 per cent (i.e. 2 errors out of 120 words). For tone choices, falling
tones should be used no more than 63 per cent, but rising tones more than 37
per cent.
22 EXPLORING THE INTELLIGIBILITY THRESHOLD
8. CONCLUSION
We believe that the most significant impact of the present study is that it
provides a rational, systematic approach to selecting speakers with a variety
of English accents for inclusion as model speakers in English listening tests. The
resulting approach to delineating a threshold of intelligibility, while prelimin-
ary, has the potential to take much of the guesswork out of selecting speakers
O. KANG, R. I. THOMSON, AND M. MORAN 25
NOTE
1 It is important to note that work within present discussion, we are only con-
the World Englishes paradigm includes cerned with differences in pronunci-
attention to variation in lexis, gram- ation, not with these other dimensions
mar, and pragmatics, in addition to pro- of difference.
nunciation. For the purposes of the
FUNDING
This research was funded by the Educational Testing Service (ETS) under a
Committee of Examiners and the TOEFL research grant. ETS does not discount
or endorse the methodology, results, implications, or opinions presented by
the researchers.