Professional Documents
Culture Documents
net/publication/272187739
CITATIONS READS
43 722
5 authors, including:
11 PUBLICATIONS 211 CITATIONS
Michigan State University
173 PUBLICATIONS 2,264 CITATIONS
SEE PROFILE
SEE PROFILE
Ingo R Titze
University of Utah
421 PUBLICATIONS 20,004 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Ingo R Titze on 18 April 2016.
Research Article
Purpose: Although there is a long history of use of semi- Results: Voice Handicap Index (Jacobson et al., 1997)
occluded vocal tract gestures in voice therapy, including scores showed significant improvement for both treatment
phonation through thin tubes or straws, the efficacy of groups relative to the no-treatment group. Comparison
phonation through tubes has not been established. This of the effect sizes suggests FRT therapy is noninferior to
study compares results from a therapy program on the basis VFE in terms of reduction in Voice Handicap Index scores.
of phonation through a flow-resistant tube (FRT) with Vocal Significant reductions in Roughness on the Consensus
Function Exercises (VFE), an established set of exercises Auditory-Perceptual Evaluation of Voice (Kempster, Gerratt,
that utilize oral semi-occlusions. Verdolini Abbott, Barkmeier-Kraemer, & Hillman, 2009) were
Method: Twenty subjects (16 women, 4 men) with found for the FRT subjects, with no other significant voice
dysphonia and/or vocal fatigue were randomly assigned quality findings.
to 1 of 4 treatment conditions: (a) immediate FRT therapy, Conclusions: VFE and FRT therapy may improve voice
(b) immediate VFE therapy, (c) delayed FRT therapy, or quality of life in some individuals with dysphonia. FRT
(d) delayed VFE therapy. Subjects receiving delayed therapy therapy was noninferior to VFE in improving voice quality of
served as a no-treatment control group. life in this study.
V
oice disorders affect the ability to communicate at clinical efficacy of these exercises has not been studied to
work and in recreational activities. The lifetime date.
prevalence of self-reported voice problems in adults Semi-occluded voice exercises are rooted in a long
has been found to be almost 30%, with a point prevalence tradition of use in training vocal performers. For example,
from 6.6% to 7.5% (Cohen, 2010; Roy, Merrill, Gray, & some exercises involve using the hand during phonation to
Smith, 2005). Treatments for voice disorders include medi- partially cover the mouth (Aderhold, 1963) or completely
cation, surgery, and behavioral therapy. Voice therapy cover the mouth (Coffin, 1987), thus creating either a semi-
is frequently used either as a primary treatment or as an occlusion or a complete occlusion for a brief moment. Os-
adjunct to surgical intervention. Many successful voice cillatory SOVT exercises, such as lip trills, tongue trills, and
therapy techniques and programs are based on exercises raspberries (labio-lingual trills), have been used in training
that semi-occlude the vocal tract (SOVT). SOVT exercises of the acting voice as well as the singing voice (Linklater,
such as phonating through straws and tubes are becoming 1976; Nix, 1999). Engel (1927) described the use of SOVT
increasingly common in clinical practice. However, the for acting voice, suggesting that narrowing the mouth with
the tongue tip against the alveolar ridge produces efficient
voicing. This technique was further developed by Lessac
a
National Center for Voice and Speech, University of Utah, (1997) in the resonance exercises producing a vocal quality
Salt Lake City called y-buzz, which utilizes oral narrowing of the vocal
b
University of Washington, Seattle tract to create buzzy sensations in the face due to heightened
c
Michigan State University, East Lansing
d acoustic pressures in the narrowed region.
University of Iowa, Iowa City
Several well-known voice therapy programs utilize
Correspondence to Mara R. Kapsner-Smith: mkapsner@uw.edu
SOVT exercises as key components of the therapy. For ex-
Editor: Jody Kreiman
ample, Lessac-Madsen Resonant Voice Therapy (LMRVT)
Associate Editor: Bruce Gerratt
draws from performing voice techniques as described above,
Received September 9, 2013
Revision received April 7, 2014
Accepted December 28, 2014 Disclosure: The authors have declared that no competing interests existed at the time
DOI: 10.1044/2015_JSLHR-S-13-0231 of publication.
Journal of Speech, Language, and Hearing Research • Vol. 58 • 535–549 • June 2015 • Copyright © 2015 American Speech-Language-Hearing Association 535
and relies upon semi-occluded consonants such as frica- others remain in a very exploratory stage. We will review
tives and the nasals /m/, /n/, and /ŋ/ as key training gestures the evidence supporting several commonly used treatment
and embedded cues in connected speech (Orbelo, Li, & approaches here.
Verdolini Abbott, 2014; Verdolini, 2000; Verdolini-Marston,
Burke, Lassac, Glaze, & Caldwell, 1995). Resonant voice
has been defined clinically as “easy to produce and buzzy in Resonant Voice Therapy
the facial tissues” and scientifically as “a reinforcement of Several studies have sought to define resonant voice,
the source by the vocal tract” (Titze & Verdolini Abbott, which is often judged perceptually by clinicians and coaches.
2012). In a similar way, the Accent Method (AM) utilizes Perceptually, resonant voice has been defined as voice in-
consonants such as voiced fricatives that provide oral semi- volving perceptible anterior oral vibrations that feels easy to
occlusions in rhythmic vocalizations, moving progressively produce (Verdolini-Marston et al., 1995). Biomechanically,
from nonspeech to connected speech exercises (Kotby & this voicing pattern has been shown to be associated with
Fex, 1998). Vocal Function Exercises (VFE), a therapy a barely ad/abducted vocal fold configuration (Verdolini,
technique based upon the work of Briess (1957, 1959), are a Druker, Palmer, & Samawi, 1998), resulting in a closed
series of nonspeech daily exercises that incorporate semi- quotient between .5 and .6. The vocal folds are neither
occlusion at the lips as the primary training gesture (Stemple, hyperadducted nor hypoadducted (Peterson, Verdolini-
1993, 2005). In addition, there is a long history of using Marston, Barkmeier, & Hoffman, 1994). Acoustically, there
straws or tubes to extend the vocal tract and provide resis- is evidence that adjustment of the relationship between
tance during phonation (Gundermann, 1977; Habermann, harmonics and formants can lead to maximized acoustic
1980; Laukkanen, 1992; Sovijärvi, 1966; Spiess, 1904; Tapani, output (Barrichelo-Lindstrom & Behlau, 2009; Smith,
1992). A variation on this gesture is phonation through a Finnegan, & Karnell, 2005). Resonant voice may in fact
tube submerged in water (Simberg & Laine, 2007). rely on maximizing source-filter interaction (Titze, 2004),
SOVT exercises have been validated by several thereby lowering PTP through increased inertance of
theoretical investigations. In a modeling study (Titze & the vocal tract air column (Titze, 2001), which is initially
Laukkanen, 2007), semi-occlusion of the vocal tract at trained through the use of semi-occluded consonant sounds.
the lips paired with narrowing of the epilaryngeal tube in- At least three published studies have examined the
creased inertive reactance in the range of 200–1000 Hz, re- efficacy of resonant voice therapy. First, Verdolini-Marston
inforcing vocal fold vibration and increasing vocal economy et al. (1995) conducted a small-scale, preliminary study
(defined as maximum flow declination rate divided by using a prospective, randomized controlled design with
maximum area declination rate). Lengthening of the vocal blinding. The authors examined college-aged women with
tract, as achieved with phonation through a thin tube, fur- nodules who were treated with either resonant voice ther-
ther increases vocal tract inertance (see Figure 1) (Titze & apy or confidential voice therapy in a two-week intensive
Verdolini Abbott, 2012). Vocal tract inertance has previously therapy program. Treatment benefits were demonstrated
been shown to have desirable effect. Titze (1988) showed across auditory-perceptual measures, laryngoscopy find-
that acoustically, an inertive vocal tract reduces phonation ings, and self-ratings of vocal effort. An interesting finding
threshold pressure (PTP). Vocal tract inertance also skews was that treatment benefits were shown to be dependent
the flow pulse to increase maximum flow declination rate upon compliance, but not therapy type. Limitations of this
(Rothenberg, 1981; Titze, 2006a), thus increasing the inten- study included, as the authors noted, the inability to use
sity of higher harmonics in the acoustic spectrum. SOVT parametric statistics to determine the comparative magni-
may also provide a correction for so-called pressed voice. tude of treatment benefits across treatment types, although
The oral pressure produced by a semi-occlusion behind the nonparametric, binomial statistics did produce meaningful
lips acts on the superior surface of the vocal folds to keep results. Second, in an RCT (Roy et al., 2003), 64 teachers
them separated, thereby helping to maintain a rectangular with dysphonia were randomly assigned to one of three
glottal shape (Titze, 2014). Squared-up vocal folds, where treatment groups: voice amplification, resonant voice ther-
the medial surfaces of the vocal folds are parallel or nearly apy, or respiratory muscle therapy. Treatment benefits
parallel, have the lowest PTP, requiring less vocal fold were observed in the voice amplification and resonant voice
adduction. This has been shown repeatedly with physical therapy groups, based upon reduction in Voice Handicap
models and with computational models (Chan, Titze, & Index (VHI) scores (Jacobson et al., 1997) and subjects’
Titze, 1997; Titze, 1988). The rationale and underpinnings self-ratings of voice symptom severity. In a third study,
of voice therapy with both a frontal semi-occlusion (the 24 teachers were treated in small groups using a protocol
lips) and a rear semi-occlusion (in the epilarynx tube) have based on LMRVT, without a control group (Chen, Hsiao,
also been outlined in terms of maximum power output (Titze, Hsiao, Chung, & Chiang, 2007). Diagnoses included four sub-
2006b). jects with muscle tension dysphonia, six subjects with
SOVT therapy programs and exercises vary in their vocal fold nodules, and 14 subjects with “chronic corditis.”
complexity and claims about efficacy, physiologic under- Significant improvements were noted after therapy in
pinnings, and ease of application in a clinical setting. auditory-perceptual measures, laryngostroboscopy ratings,
Although some treatment programs have had beneficial speech fundamental frequency (F0), F0 range, PTP, and
effects documented in randomized controlled trials (RCTs), VHI-physical domain scores. Data are forthcoming from
536 Journal of Speech, Language, and Hearing Research • Vol. 58 • 535–549 • June 2015
Figure 1. (a) Idealized vocal tract shapes with (b) corresponding inertograms. From top to bottom: decoupled epilarynx tube;
epilarynx tube with uniform vocal tract; oral semi-occlusion; oral semi-occlusion paired with epilarynx narrowing; oral semi-
occlusion and lengthened vocal tract paired with epilarynx narrowing (reprinted with permission; Titze & Verdolini Abbott, 2012).
several prospective, randomized clinical trials of LMRVT, nodules, and significant improvements in some aerodynamic
one of which shows longitudinal improvements in VHI measures in group pre/post comparisons. Likewise, improve-
scores continuing over a 1-year follow-up period after treat- ments in auditory perceptual and acoustic measures were
ment (K. Verdolini Abbott, personal communication, found in an uncontrolled, unmasked study of 10 subjects with
September 12, 2014). functional dysphonia (Fex, Fex, Shiromoto, & Hirano, 1994).
In the largest study to date of AM therapy, Bassiouny (1998)
conducted a randomized, controlled, double-masked trial
AM of AM therapy with 42 subjects with dysphonia, including
functional dysphonia and dysphonia associated with vocal
Several studies of varying quality have investigated
fold lesions and vocal fold immobility. Subjects were ran-
the efficacy of AM of voice therapy. AM therapy utilizes
domly assigned to 20 sessions of AM therapy including
consonants such as voiced fricatives that provide oral semi-
vocal hygiene advice plus the accent exercises, or 10 sessions
occlusions in rhythmic vocalizations, moving progressively
of vocal hygiene only. The AM group improved more in
from nonspeech to connected speech exercises (Kotby &
auditory perceptual, acoustic, and aerodynamic measures
Fex, 1998). Smith and Thyme (1976) measured acoustic
than the vocal hygiene group. The AM group also showed
changes in a pre/post uncontrolled, unmasked study of
improvements in stroboscopy parameters, which were not
30 students without dysphonia who participated in 10 sessions
found in the vocal hygiene group.
of AM therapy. The authors reported improvements in vari-
ous spectrographic parameters. Kotby, El-Sady, Basiouny,
Abou-Rass, and Hegazi (1991) studied the effects of AM ther-
apy on 28 subjects with dysphonias of a variety of origins VFE
(functional dysphonia, vocal fold lesions, and vocal fold immo- A number of well-designed studies have documented
bility) in an uncontrolled, unmasked study. After 20 therapy the efficacy of VFE, a set of pitch range and duration exer-
sessions, positive changes in voice performance were reported cises for voice that use SOVT postures. Two controlled
by 89.3% of subjects. In addition, the authors reported im- studies have shown changes in flow rate, phonatory volume,
proved auditory perceptual ratings of voice quality in over maximum phonation time, and pitch range in vocally
half of subjects, reduced nodule size in all six subjects with healthy subjects, pre- to postvoice training with VFE (Stemple,
538 Journal of Speech, Language, and Hearing Research • Vol. 58 • 535–549 • June 2015
phonation, attributed to possible decreased muscular tension (recognition schema). Schema theory predicts several vari-
(Sampaio, Oliveira, & Behlau, 2008), although in another ables should influence learning, including the distribution
study, F0 did not change after the same duration of exercise of practice and provision of knowledge of results. Most
(Costa, Costa, Oliveira, & Behlau, 2011). Guzman et al. germane to the present discussion, schema theory also
(2013) found increased spectral prominence of the singer’s/ predicts that generalization from trained to untrained
speaker’s formant cluster after 5 min of tube phonation in a exemplars of a movement should be enhanced by variable
single subject. Perceptual measures of voice quality have practice, due to an enrichment of data in the recall schema
suggested positive effects of tube phonation. Several studies “space” that it induces (Schmidt, 1975). Considerable data
reported improved perceptual judgments of voice quality are consistent with this prediction (for review, see Titze &
immediately after 1–5 min of phonation through tubes (Enflo, Verdolini Abbott, 2012). However, there is not uniform
et al., 2013; Guzman et al., 2013; Sampaio et al., 2008) and agreement about the reasons for it. According to one alter-
after exercising with three sets of 15 voiced tongue trills native view, the variable practice effect can, paradoxically,
(Schwarz & Cielo, 2009). However, Costa et al. (2011) again be explained by the specificity of practice principle. In this
reported no change in perceptual judgments of voice quality view, the reason that variable practice enhances generali-
after straw phonation, but concluded that their exercise dos- zation is not because some rule-based cognitive schema
age may have been insufficient to see effects. In their study, is enhanced by it. Rather, generalization is enhanced by
subjects with vocal fold lesions did improve in vocal self- variable practice because if a person practices a task in a
assessment ratings after straw phonation. large number of ways, when he or she encounters a novel
Some of the variability in the above findings may be version of it in the future, the chances increase that that
attributed to differences in the semi-occlusions used. Semi- novel version will in some way approximate at least one ex-
occlusions vary in the intraoral pressures they create, and emplar encountered in the past. Thus, the learner can use a
these pressures vary across subjects (Maxfield, Titze, Hunter, prior cognitive or neural trace to guide movement in the
& Kapsner-Smith, 2014). Even in the case of phonation present context (for related discussion see, for example,
through tubes, which is more controlled than some other Kumaran & McClelland, 2012). The specificity of practice
SOVT exercises, the diameter of the tube will have an impact principle would suggest that the benefits of nonspeech
on the pressure-flow relationship and the resistance created SOVT training for speech will be enhanced by training its
at the lips, therefore changing the back pressure created corollaries in actual speech.
during phonation (Titze et al., 2002). A certain amount of In contrast to approaches to motor learning that em-
narrowing/resistance may be necessary to create desirable phasize cognitive processes, the dynamical systems theory
changes; indeed, in the CT imaging and acoustic analysis of motor control de-emphasizes the role of cognition. In this
conducted by Guzman et al. (2013) described above, greater approach, the focus is on the emergence of behavior from
beneficial changes were induced by phonation through a self-organization of independent biological subsystems in
narrow stirring straw than through a wider glass tube. response to interactions across subject, task, and environment
(Glazier, Davids, & Bartlett, 2003). Central representations
of movement are considered largely superfluous. From a
Speech Versus Nonspeech Exercises dynamical systems perspective, changes in motor behavior
The voice therapy approaches described utilize non- are not the result of central cognitive processes, but rather
speech exercises, speech exercises, or a combination of both. are induced by external conditions and peripheral responses
VFE and tube phonation, for example, rely on nonspeech (e.g., altered voicing physiology in response to physical
semi-occlusions and do not incorporate direct training of conditions imposed by SOVT). Some carryover to speech
speech production. LMRVT begins with nonspeech explor- could still occur, analogous to general physical preparation
atory exercises such as humming and pitch glides, but pro- in sports literature—and indeed the idea of “taking your
gresses quickly to semi-occlusions embedded in speech. larynx to the gym” has been proposed as a purpose of these
There is some controversy over how motor learning may exercises. As an alternative to preserve the principle of task
occur in nonspeech versus speech exercises for voice. Views specificity as conceptualized in cognitively based motor
on the potential for learning and carryover into communi- learning approaches, semi-occluded nonspeech exercises
cation depend in part on one’s theoretical perspective. may be designed to bear a closer resemblance to speech,
For example, in schema theory, a key concept has to such as incorporating variations in pitch and loudness and
do with generalized motor programs (GMPs), which are speech-like prosodic contours.
proposed to be central representations of movement. The
suggestion is that GMPs are developed with practice and
once acquired, are parameterized for each trial within a Purpose of Study
class of movements to govern movement (e.g., Schmidt, To date, no studies have been performed investigating
1975). A schema is a hypothetical three-dimensional cogni- the effects of a therapeutic tube phonation protocol, though
tive space that relates initial conditions for movement, these exercises have a long history of use in voice studios
parameters applied to the GMP, and movement outcomes and clinics. There is some evidence that tube phonation has
(recall schema) or that relates initial conditions, sensory immediate positive effects such as improved voice quality,
consequences of movement, and movement outcomes decreased muscular activation, and decreased effort or
540 Journal of Speech, Language, and Hearing Research • Vol. 58 • 535–549 • June 2015
Figure 2. Passage of subjects through the trial, according to the Consolidated Standards of Reporting Trials (CONSORT;
Schulz, Altman, & Moher, 2010). FRT = flow-resistant tube; VFE = Vocal Function Exercises.
women, below middle C for men) as long as possible on one long and 0.4 cm in diameter. Subjects were instructed to
breath, 10 repetitions; (b) stretching exercise, slow upward allow airflow only through the straw (not through the nose
pitch glide performed on /no/ with semi-occlusion at the lips or around the straw), to use an abdomino-thoracic (non-
(creating a buzzing sensation), 10 repetitions; (c) contract- clavicular) breathing pattern, and to maintain a relaxed
ing exercise, slow downward pitch glide performed on /no/ upper body posture. Exercises were performed in full voice,
with semi-occlusion at the lips, 10 repetitions; and (d) low- though the semi-occluded sound is perceived as reduced in
impact adductory power exercise, /o/ with semi-occlusion loudness. The exercises performed during therapy sessions
at the lips performed five times each on middle C, D, E, F, were (a) 10 repetitions of a pitch glide up and back down;
G (one octave lower for men), for as long as possible on (b) 10 repetitions of an accent exercise, creating about five
each breath. All exercises were performed as softly as possi- to seven “hills” of sound by varying the pitch and loudness
ble with the voice still engaged. For the home program, sub- of the voice using increased breath support (vs. adduction);
jects completed two repetitions each of exercises a–c, and (c) singing through the straw with melody but without
two repetitions on each note for exercise d. Subjects were articulation, a total of 10 short songs (e.g., one verse of
instructed to complete the home exercises four times daily. “Mary Had a Little Lamb”) or equivalent longer ones; and
The frequency of the VFE home program was increased (d) “reading” through the straw, emphasizing prosody
from the published standard (two practice sessions daily) in (intonation and stress) while producing voice without artic-
order to match the FRT program in terms of frequency of ulation, approximately five medium-length paragraphs
practice and total time spent on voice exercises. Subjects (around five to 10 sentences) per session. For the home pro-
were given an MP3 player with audio instructions for the gram, subjects were instructed to complete 1 min of each
home exercise program. They were also given a log sheet for exercise, four times daily (four 4-min practice sessions).
home practice. Subjects were given an MP3 player with audio instructions
The FRT exercise program consisted of four exercises for the home exercise program. They were also given a log
performed while phonating through a stirring straw 14.1 cm sheet for home practice.
542 Journal of Speech, Language, and Hearing Research • Vol. 58 • 535–549 • June 2015
Table 1. Control total Voice Handicap Index (VHI) change scores.
voice. Thus, total VHI change scores were tabulated in the
following score ranges: increased, decreased 0–12, 13–17,
Subject Pre-VHI Post-VHI Change
and ≥18.
F19 45 46 1.
F20 16 18 2.
Results F21 66 36 −30.
F23 25 33 8.
All subjects tracked their home practice on a daily F25 26 27 1.
log sheet. Subjects in the FRT group averaged 14.4 min per F26 36 46 10.
day of home exercise, and subjects in the VFE group aver- F27 52 46 −6.
aged 14.5 min per day. Therapy session length was tracked F28 25 23 −2.
F29 38 37 −1.
by the clinician. FRT sessions averaged 42 min in length, M16 71 68 −3.
whereas VFE sessions averaged 51 min in length. Sixteen Average 40 38 −2.
of the 20 subjects completed all six sessions of therapy. SD 18.4 14.4 11.0
Two subjects in each group missed one to three sessions
due to illness or time conflicts.
control condition. Analyses were completed for the parame-
ters Overall Severity, Roughness, Breathiness, and Strain.
VHI After adjusting the p values and CIs for four multiple
A mixed-effects linear regression was used to test for comparisons, given four CAPE-V subscales, separately
significant differences in VHI scores between each treat- within each treatment group, FRT therapy had significant
ment group (VFE and FRT) and the no-treatment control improvement in Roughness relative to control (change
phase. Both treatment groups showed significantly more coefficient = −10.2, 95% CI [−20.1, −0.26], p = .040), but
improvement in the total VHI score than the control condi- VFE did not achieve significant improvement (change
tion (FRT: p < .001; VFE: p = .048). The change coefficient coefficient = −7.8, 95% CI [−17.5, 1.8], p = .17). In both
for FRT therapy was −12.6 (95% CI [17.8, −7.4]), and for groups, results were not significant for Overall Severity
VFE was −5.4 (95% CI [−10.8, 0.04]), where a negative (FRT change coefficient = −6.4, 95% CI [−15.9, 3.1],
number represents a better pre- to posttreatment outcome p = .37; VFE change coefficient = −7.4, 95% CI [−17.0,
than the control condition. 2.3], p = .23), Strain (FRT change coefficient = −8.8, 95%
In a postregression comparison of slopes, the change CI [−20.5, 2.9], p = .24; VFE change coefficient = −8.6,
for FRT therapy was in the direction of a greater improved 95% CI [−20.2, 3.0], p = .26) or Breathiness (FRT change
outcome than VFE (difference = −7.2, 95% CI [−14.5, 0.1], coefficient = −3.3, 95% CI [−10.7, 4.1], p = .68; VFE change
p = .054). The lower bound of this confidence interval, coefficient = −4.2, 95% CI [−11.8, 3.4], p = .68).
−14.5, represents 14.5 points more reduction in total VHI Interrater agreement was assessed by calculating the
scores with FRT therapy than with VFE. The upper bound probability of agreement (±21.5 mm) between each possible
of this confidence interval, 0.1, represents 0.1 points less pair of raters for each voice sample (probability of chance
reduction in total VHI scores with FRT therapy than with agreement p = .39). The probability of interrater agreement
VFE. was p = .83 for Overall Severity, p = .75 for Roughness,
Individual total VHI change scores were calculated p = .81 for Breathiness, and p = .59 for Strain.
for all conditions (control, FRT, VFE), and were tallied in Intrarater agreement was assessed by calculating the
score range categories (increased, decreased 0–12, 13–17, probability that a rater agreed with him/herself (±21.5 mm)
and ≥18). Individual scores, group averages and standard in repeated ratings of the same voice sample (probability
deviations, and change scores are presented in Tables 1, 2,
and 3. In the control phase, five of 10 subjects’ scores in-
creased, four subjects’ scores decreased by 0–12 points, and Table 2. Flow-resistant tube total Voice Handicap Index (VHI)
one subject’s score decreased by ≥18 points. In the FRT change scores.
group, one of 10 subjects’ scores increased, four subjects’
scores decreased by 0–12 points, two subjects’ scores de- Subject Pre-VHI Post-VHI Change
creased by 13–17 points, and three subjects’ scores decreased F03 60 4 −56.
by ≥18 points. In the VFE group, three of 10 subjects’ scores F16 23 25 2.
increased, two subjects’ scores decreased by 0–12 points, F19 46 37 −9.
one subject’s score decreased by 13–17 points, and four sub- F20 18 8 −10.
F23 33 27 −6.
jects’ scores decreased by ≥18 points. F24 39 17 −22.
F26 46 30 −16.
F28 23 15 −8.
Consensus Auditory Perceptual Evaluation of Voice M13 60 45 −15.
A mixed-effects linear regression was used to test for M15 46 8 −38.
Average 39.4 21.6 −17.8
significant differences in masked CAPE-V scores between SD 15.0 13.5 17.2
each treatment group (VFE and FRT) and the no-treatment
544 Journal of Speech, Language, and Hearing Research • Vol. 58 • 535–549 • June 2015
CAPE-V instrument, and further study is needed to deter- tube is very narrow, as utilized in the present study (Guzman
mine whether small but statistically significant differences et al., 2013). These combined effects—optimization of glottal
such as these are meaningful. Results for Strain were not shape, epilaryngeal narrowing, and increased inertance of
significant, despite the fact that the majority of subjects re- the vocal tract—are likely to be facilitated by tube phonation.
ported decreased vocal effort and fatigue after treatment. Subjects may rely on sensory cues such as facial vibrations
Results for Strain should be interpreted with caution, to identify and habituate ideal vocal tract configurations.
given that the probability of interrater agreement was only Further study of the physiologic effects of FRT therapy is
p = .59. In studies of auditory perceptual assessment of needed to confirm the mechanisms that underlie the treat-
voice, Strain is consistently the least reliable dimension, and ment effects observed in this study.
in fact may require cues available to the speaker but not The role of motor learning in nonspeech voice exer-
the listener in order to judge (e.g., kinesthetic; Eadie & cises is an interesting question that remains to be answered.
Kapsner-Smith, 2011). Although the current study does not address this question,
The efficacy of VFE for treating dysphonia and pres- we propose that the design of the FRT protocol used in this
byphonia has already been established in several studies study is not in conflict with principles of motor learning,
(Gillivan-Murphy et al., 2006; Gorman et al., 2008; Roy including task specificity, distributed practice, and provi-
et al., 2001; Sauder et al., 2010; Tay et al., 2012; Ziegler sion of knowledge of results. Phonation into a narrow tube
et al., 2013), and this study adds support to this evidence. while producing speech-like prosody (the “reading” task
Given the repeated findings that VFE are an effective voice in this study) is a task very similar to speech. It is simplified
therapy technique, our finding that a 0.4-cm–diameter by the removal of articulation, which reduces the number
FRT induced a comparable amount of change in VHI of potential vocal instabilities created by a constantly
scores as VFE in subjects with dysphonia provides evidence changing vocal tract shape. In addition, semi-occlusion and
for the efficacy of semi-occluding the vocal tract in vocal elongation of the vocal tract impose conditions that create
exercise. inertive reactance, a desirable condition for vocal fold
Some hypotheses regarding the physical mechanisms vibration. With approaches that do not impose as much
of voice improvement using SOVT exercises may be devel- control on the vocal tract configuration, such as VFE or
oped based upon previous modeling studies. As described humming in LMRVT, more trial and error is generally nec-
above, semi-occlusion of the vocal tract creates a posi- essary to achieve an accurate production. The tube in FRT
tive intraoral pressure that may facilitate an optimal, near- therapy might be thought of as akin to training wheels
rectangular shape of the glottis (Titze, 1988). Laukkanen on a bicycle that keep the rider upright—it allows the user
et al. (2008) found that vocal economy and glottal efficiency to experience the target behavior reliably before mastery
increase with an increased thyroarytenoid-to-cricothyroid is achieved. In terms of attention to movement effects
activation ratio in computer modeling, and observed in- (knowledge of results), subjects have access to sensory cues
creased TA activation using electromyography in one sub- including vibration and ease during FRT exercises, possibly
ject during SOVT exercises. Also, increased vocal tract related to the acoustic pressures of an inertive vocal tract
inertance facilitates self-sustained oscillation of the vocal and reduced PTP. As alluded to above, subjects may use
folds. For example, Titze and Laukkanen (2007) conducted those same sensory cues to facilitate establishing similarly
a modeling study in which they simulated phonation through efficient phonation during speech when the tube is removed.
a tube and the vowel /u/, while varying the degree of glottal To conclude, frequent, distributed practice is a key element
adduction and epilaryngeal tube area. They found that of the FRT program, in order to facilitate learning.
oral semi-occlusion increased inertive reactance in the 200– In a postregression comparison of change slopes, re-
1000 Hz range, but noted that the effect was strong only sults were in the direction of a greater improvement for
when the epilaryngeal tube was also narrowed. Furthermore, FRT therapy than VFE (7.2 more points of improvement),
they examined the effects on the economy of voice produc- though the 95% CI ranged from 14.5 more points of im-
tion, defined as maximum flow declination rate divided provement to 0.1 fewer points of improvement for FRT
by maximum area declination rate, and found that the therapy over VFE. Superiority testing with a larger group
greatest economy occurred when the epilaryngeal tube was of subjects may clarify whether significant differences in
narrowed, provided that adduction was sufficient. This ef- outcomes exist between the two treatments. Differences
fect for tube phonation was comparable to that for /u/, but such as ease of learning of the two treatment programs may
intraoral acoustic pressures during tube phonation were play a role in treatment outcomes. Although no objective
three times greater than those during production of /u/. They measures were taken regarding learning of the two exercise
concluded that this may provide an abundance of vibratory programs, clinician observations suggest the FRT therapy
sensations in the face that could provide feedback for learn- may be easier for subjects to master. Most subjects were
ing optimal vocal tract configuration for vocal economy. able to learn and independently perform the FRT exercises
Also, there is preliminary evidence from two single-subject adequately within one session. The time required for learn-
CT imaging studies to suggest that the area ratio between ing VFE varied between subjects, from one to several ses-
the pharyngeal inlet and epilaryngeal tube may in fact in- sions; in some cases nearly the full treatment period was
crease in response to phonation through tubes (Guzman needed for mastery of VFE. This difference may be explained
et al., 2013; Vampola et al., 2011), particularly when the by the fact that during FRT exercises, the semi-occlusion
546 Journal of Speech, Language, and Hearing Research • Vol. 58 • 535–549 • June 2015
Briess, F. B. (1959). Voice therapy: Part II. Essential treatment Habermann, G. (1980). Funktionelle Stimmstörungen und ihre
phases of specific laryngeal muscle dysfunction. Archives of Behandlung [Functional voice disorders and their treatment].
Otolaryngology–Head & Neck Surgery, 69, 61–69. Archives of Oto-Rhino-Laryngology, 227, 171–345.
Chan, R. W., Titze, I. R., & Titze, M. R. (1997). Further studies Harrell, F. E., Jr. (2001). Regression modeling strategies with appli-
of phonation threshold pressure in a physical model of the cations to linear models, logistic regression, and survival analysis.
vocal fold mucosa. The Journal of the Acoustical Society of New York, NY: Springer.
America, 101, 3722–3727. Jacobson, B. H., Johnson, A., Grywalski, C., Silbergleit, A.,
Chen, S. H., Hsiao, T.-Y., Hsiao, L.-C., Chung, Y.-M., & Chiang, Jacobson, G., Benninger, M. S., & Newman, C. W. (1997).
S.-C. (2007). Outcome of resonant voice therapy for female The Voice Handicap Index (VHI): Development and valida-
teachers with voice disorders: Perceptual, physiological, acous- tion. American Journal of Speech-Language Pathology, 6(3),
tic, aerodynamic, and functional measurements. Journal of 66–70.
Voice, 21, 415–425. Kempster, G. B., Gerratt, B. R., Verdolini Abbott, K., Barkmeier-
Coffin, B. (1987). Coffin’s sounds of singing: Principles and applica- Kraemer, J., & Hillman, R. E. (2009). Consensus auditory-
tions of vocal techniques with chromatic vowel chart. Lanham, perceptual evaluation of voice: Development of a standardized
MD: Scarecrow Press. clinical protocol. American Journal of Speech-Language
Cohen, S. M. (2010). Self-reported impact of dysphonia in a Pathology, 18, 124–132.
primary care population: An epidemiological study. The Kotby, M. N., El-Sady, S. R., Basiouny, S. E., Abou-Rass, Y. A.,
Laryngoscope, 120, 2022–2032. & Hegazi, M. A. (1991). Efficacy of the accent method of
Costa, C. B., Costa, L. H. C., Oliveira, G., & Behlau, M. (2011). voice therapy. Journal of Voice, 5, 316–320.
Immediate effects of the phonation into a straw exercise. Kotby, M. N., & Fex, B. (1998). The accent method: Behavior
Brazilian Journal of Otorhinolaryngology, 77, 461–465. readjustment voice therapy. Logopedics Phonatrics Vocology,
Eadie, T. L., & Kapsner-Smith, M. (2011). The effect of listener 23, 39–43.
experience and anchors on judgments of dysphonia. Journal of Kreiman, J., & Gerratt, B. R. (1998). Validity of rating scale mea-
Speech, Language, and Hearing Research, 54, 430–447. sures of voice quality. The Journal of the Acoustical Society
Enflo, L., Sundberg, J., Romedahl, C., & McAllister, A. (2013). of America, 104, 1598–1608.
Effects on vocal fold collision and phonation threshold pres- Kumaran, D., & McClelland, J. L. (2012). Generalization through
sure of resonance tube phonation with tube end in water. the recurrent interaction of episodic memories: A model of the
Journal of Speech, Language, and Hearing Research, 56, hippocampal system. Psychological Review, 119, 573–616.
1530–1538. Laukkanen, A. (1992). About the so called “resonance tubes” used
Engel, E. F. (1927). Stimmbildungslehre [Voice pedagogy]. Dresden, in Finnish voice training practice. Logopedics Phoniatrics
Germany: Weise. Vocology, 17, 151–161.
Fairbanks, G. (1960). Voice and articulation drillbook (2nd ed.). Laukkanen, A. M., Lindholm, P., & Vilkman, E. (1995a). On
New York, NY: HarperCollins. the effects of various vocal training methods on glottal resis-
Fex, B., Fex, S., Shiromoto, O., & Hirano, M. (1994). Acoustic tance and efficiency. Folia Phoniatrica et Logopaedica, 47,
analysis of functional dysphonia: Before and after voice ther- 324–330.
apy (accent method). Journal of Voice, 8, 163–167. Laukkanen, A. M., Lindholm, P., & Vilkman, E. (1995b). Phona-
Franic, D. M., Bramlett, R. E., & Bothe, A. C. (2005). Psychomet- tion into a tube as a voice training method: Acoustic and
ric evaluation of disease specific quality of life instruments in physiologic observations. Folia Phoniatrica et Logopaedica, 47,
voice disorders. Journal of Voice, 19, 300–315. 331–338.
Frison, L., & Pocock, S. J. (1992). Repeated measures in clinical Laukkanen, A. M., Lindholm, P., Vilkman, E., Haataja, K., &
trials: Analysis using mean summary statistics and its implica- Alku, P. (1996). A physiological and acoustic study on voiced
tions for design. Statistics in Medicine, 11, 1685–1704. bilabial fricative /b:/ as a vocal exercise. Journal of Voice, 10,
Gaskill, C. S., & Quinney, D. M. (2012). The effect of resonance 67–77.
tubes on glottal contact quotient with and without task instruc- Laukkanen, A. M., Takalo, R., Vilkman, E., Nummenranta, J., &
tion: A comparison of trained and untrained voices. Journal Lipponen, T. (1999). Simultaneous videofluorographic and
of Voice, 26, e79–e93. dual-channel electroglottographic registration of the vertical
Gillivan-Murphy, P., Drinnan, M. J., O’Dwyer, T. P., Ridha, H., laryngeal position in various phonatory tasks. Journal of
& Carding, P. (2006). The effectiveness of a voice treatment Voice, 13, 60–71.
approach for teachers with self-reported voice problems. Journal Laukkanen, A. M., Titze, I. R., Hoffman, H., & Finnegan, E. (2008).
of Voice, 20, 423–431. Effects of a semioccluded vocal tract on laryngeal muscle
Glazier, P. S., Davids, K., & Bartlett, R. M. (2003). Dynamical activity and glottal adduction in a single female subject. Folia
systems theory: A relevant framework for performance- Phoniatrica et Logopaedica, 60, 298–311.
oriented sports biomechanics research. Sportscience, 7. Retrieved Lessac, A. (1997). The use and training of the human voice: A bio-
from http://www.sportsci.org/jour/03/psg.htm dynamic approach to vocal life. Mountain View, CA: Mayfield.
Gorman, S., Weinrich, B., Lee, L., & Stemple, J. C. (2008). Aero- Linklater, K. (1976). Freeing the natural voice. New York, NY:
dynamic changes as a result of vocal function exercises in Drama Book Publishers.
elderly men. The Laryngoscope, 118, 1900–1903. Maxfield, L., Titze, I., Hunter, E., & Kapsner-Smith, M. (2014).
Gundermann, H. (1977). Die Behandlung der gestorten Sprech- Intraoral pressures produced by thirteen semi-occluded vocal
stimme [The treatment of the pathological speaking voice]. tract gestures. Logopedics Phonatrics Vocology. Advance on-
Stuttgart, NY: Fischer. line publication. doi:10.3109/14015439.2014.913074
Guzman, M., Laukkanen, A. M., Krupa, P., Horáček, J., Švec, Nguyen, D. D., & Kenny, D. T. (2009). Randomized controlled
J. G., & Geneid, A. (2013). Vocal tract and glottal function trial of vocal function exercises on muscle tension dysphonia in
during and after vocal exercising with resonance tube and Vietnamese female teachers. Journal of Otolaryngology–Head
straw. Journal of Voice, 27, 523.e19–523.e34. & Neck Surgery, 38, 261–278.
548 Journal of Speech, Language, and Hearing Research • Vol. 58 • 535–549 • June 2015
Verdolini, K. (2000). Resonant voice therapy. In J. C. Stemple of treatment for laryngeal nodules. Journal of Voice, 9,
(Ed.), Voice therapy: Clinical case studies (2nd ed., pp. 46–61). 74–85.
San Diego, CA: Singular. Vickers, A. J., & Altman, D. G. (2001). Statistics notes: Analysing
Verdolini, K., Druker, D. G., Palmer, P. M., & Samawi, H. controlled trials with baseline and follow up measurements.
(1998). Laryngeal adduction in resonant voice. Journal of British Medical Journal, 323, 1123–1124.
Voice, 12, 315–327. Ziegler, A., Verdolini Abbott, K., Johns, M., Klein, A., & Hapner, E. R.
Verdolini-Marston, K., Burke, M. K., Lassac, A., Glaze, L., & (2013). Preliminary data on two voice therapy interventions in the
Caldwell, E. (1995). A preliminary study of two methods treatment of presbyphonia. The Laryngoscope, 124, 1869–1876.
Appendix
Demographic Data for Subjects Enrolled in Study
Note. LPR = laryngopharyngeal reflux; L = left; TVF = true vocal fold; MTD = muscle tension dysphonia; GSW = gunshot wound; R = right.