Professional Documents
Culture Documents
net/publication/327194795
CITATIONS READS
4 610
6 authors, including:
Junichiro Wada
Tokyo Medical and Dental University
23 PUBLICATIONS 55 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Splinting and protecting remaining teeth with removable partial denture placement View project
All content following this page was uploaded by Junichiro Wada on 26 August 2018.
Hiroshi Matsuura c Noriyuki Wakabayashi a
a Sectionof Removable Partial Prosthodontics, Department of Masticatory Function Rehabilitation, Graduate School
of Medical and Dental Sciences, Tokyo Medical and Dental University (TMDU), Tokyo, Japan; b Dental Clinic for Sleep
Disorders (Apnea and Snoring), Oral and Maxillofacial Rehabilitation, University Hospital of Dentistry, Tokyo Medical
and Dental University (TMDU), Tokyo, Japan; c Graduate School of Management and Information of Innovation,
10 mm 10 mm
0.8 mm
0.8 mm 0.8 mm
a b
Fig. 1. Experimental major connectors. Photograph depicting experimental major connectors covering the mid-
dle area of the palate (M-bar) (a) and both anterior and posterior areas of the palate (AP-bar) (b). Schematic il-
lustrations represent cross-sectional views of the connectors.
lars. Subjects received written information regarding the study rae (/shi/: [∫i], /chi/: [t∫i], /hi/: [çi], and /ki/: [ki]) of 4 test words
purpose and protocol before they consented to participation. A (/i/shi/kawa/, /i/chi/ro/, /e/hi/meken/, and /o/ki/nawa/) were cho-
total of 17 subjects (7 female, 10 male; mean age, 35.0 years; range, sen as the target sounds (Table 1). These target sounds can be clas-
21–58 years) were eligible to participate. All experimental proce- sified into 3 groups on the basis of articulation position: (1) alveo-
dures upheld the principles of the 1975/1983 Helsinki declaration lar: [∫i] and [t∫i]; (2) palatal: [çi]; and (3) velar: [ki]. The sounds
and were approved by the Ethics Committee of the University (Au- can also be classified into 3 groups on the basis of the manner of
thorization No. 286). articulation: (1) fricative: [∫i] and [çi]; (2) affricate: [t∫i]; and (3)
plosive: [ki]. The target sounds included the vowel [i], which re-
Design of the Experimental Procedure quires the highest tongue position among Japanese vowels. These
As shown in Figure 1, the experimental major connectors were test words are meaningful Japanese words, which have both target
fabricated from cobalt-chromium (Cobaltan; Shofu Co., Kyoto, Ja- sounds and accents on the second morae.
pan) by using casts made from silicone impressions of the subjects’ Speech data of the test words were recorded under the follow-
maxillae. Two different designs were fabricated: a connector cov- ing conditions: without a connector (control); with M-bar; and
ering the middle area of the palate (M-bar) and a connector cover- with AP-bar. Each subject was required to pronounce a test word
ing both the anterior and posterior areas (AP-bar). The connectors 5 times under the above-mentioned conditions. The order of the
were fabricated with a maximum thickness of 0.8 mm and a width recording conditions was randomly arranged. Each subject was
of 10 mm at the narrowest part. The cross-sectional shape of the instructed not to vocalize the test words before recording, and the
connecter in the sagittal plane was moderately curvilinear. recordings were initiated immediately after the connectors were
stably inserted. During the measurements, the subjects were asked
Recording and Evaluation of Speech Data to maintain their pronunciation at a constant speed and volume,
In speech, consonants whose production requires contact be- as in their daily life. Speech data were recorded with a headset-type
tween tongue, palate, and alveolus were reported to be affected by microphone and a laptop computer (PAC9214LDEW; Toshiba
the placement of dentures [13–17, 20, 22, 26–28, 39, 40]. In addi- Co., Tokyo, Japan) with a speech evaluation system. Subjects were
tion, disorders of pronunciation have been reported to appear given rests after each recording and were not allowed to speak dur-
readily on the second mora [39, 40]. Accordingly, the second mo- ing the breaks.
202.242.187.110 - 8/27/2018 1:37:54 AM
Downloaded by: W. Junichiro - 239864
Power spectrum
FFT 64 ms
8 ms
GG $$ $I ##
Segment label
8 ms $I Segment label
a b
Fig. 2. Flow process of the speech evaluation system. a Acoustic (48 ms) were quantized with statistical pattern-matching process-
analysis was performed using a fast Fourier transform (FFT) that ing and the integrated phonetic segment was labeled every 8 ms.
transformed the speech data of 8 analytical frames (79 ms) into 64 b The pattern is extracted from the power spectrum every 8 ms.
ms of information (FFT parameters). Then, the multiple acoustic The extracted pattern is matched to a phonetic segment in the ref-
feature plane (MAFP) pattern, emphasizing the local changes in erence patterns and labeled as a combination of 2 letters.
frequency and time frame, was extracted. Finally, 6 MAFP patterns
[∫i] /shi/ alveolar fricative $$, SS, $I i/shi/kawa name of a Japanese golf player
[t∫i] /chi/ alveolar affricate CC, CI i/chi/ro name of a Japanese baseball player
[çi] /hi/ palatal fricative ##, HH, HI e/hi/meken one of the prefectures in Japan
[ki] /ki/ velar plosive KI o/ki/nawa one of the prefectures in Japan
The speech evaluation system (Voice Analyzer; Toshiba Digital acoustic feature plane patterns (48 ms) were quantized with statis-
Media Engineering Co., Tokyo, Japan) used speech recognition tical pattern-matching processing and the integrated phonetic seg-
based on 213 types of integrated phonetic segments that represent ment was labeled every 8 ms. Thus, this system performed micro-
phonetic features of transitions from one phoneme to another [27]. period (8 ms) evaluations on a frame-by-frame basis and was able
The flow process of this system is shown in Figure 2. The acoustic to record time-scale information (Fig. 2a). Integrated phonetic seg-
analysis, which had an analytical frame of 23 ms and a frame shift ment labels which indicated the correct pronunciation of the target
range of 8 ms, was performed by using a fast Fourier transform) part of the target sound were defined as “correct labels” (Table 1).
that transformed the speech data of 8 analytical frames (79 ms) into The number of correct labels varied with the type of test sound. This
64 ms of information (fast Fourier transform parameters). Then, system also performed macro-period quantitative sound evalua-
multiple acoustic feature plane patterns, emphasizing local changes tions by using the ratio between the numbers of correct and incor-
in frequency and time frame, were extracted. Finally, the 6 multiple rect integrated phonetic segment labels during the evaluation pe-
202.242.187.110 - 8/27/2018 1:37:54 AM
Downloaded by: W. Junichiro - 239864
I I I $ C $ $ $ $ C C $ C I I I
S S S $ C $ I $ $ C C $ Y 4 4 Q
Fig. 3. Method of calculating the mean appearance ratio of correct labels (MARC) (sample [∫i]). In this record-
ing, there are 10 segment labels in the target part. There are 6 correct labels of [∫i] ($$ and $I) and 4 incorrect
labels (CC and CY). Thus, the MARC of [∫i] is calculated to be 60%.
riod. Matsuura et al. [25] conducted a recall test to verify the reli- target sounds were calculated. The mean appearance ratio of correct
ability of this system with sound subjects. The recall and precision labels (MARC) was regarded as a representative value reflecting the
averages were 95.4 and 95.9%, respectively. The F-measure was accuracy of phonetic output of the target sounds. The mean appear-
95.7, which indicated that the speech evaluation system was a reli- ance ratio of incorrect labels (MARIC) was also calculated for statis-
able method [41]. The results of our studies with this system were tical analysis. For other target sounds, calculations of MARCs and
consistent with the results of previous studies with conventional MARICs were performed in the same manner.
methods, such as sound spectrography, palatography, and auditory
evaluations by speech pathologists [15–28]. Morphometry of Oral Cavity and Subgrouping
As shown in Figure 2b, the pattern was extracted from the pow- Maxillary stone models of the subjects were digitized with a
er spectrum and matched to a phonetic segment of the reference three-dimensional (3D) scanner (ARCTICA Scan; KaVo, Biber-
pattern. These results were labeled and displayed. As an example, ach, Germany). The digital data were saved in a surface tessellation
the method to extract the consonant part of the target sound [∫i] language (STL) format. Three parameters – representing the shape
from the speech data of the test word /i/shi/kawa/ is shown in Figure of the palate and dental arch; palate heights, narrowing of dental
3. The head border of the range of the [∫i] sound was the transition arches, and front spaces of the oral cavities – were then computed
from [i] to [∫i], and the foot was the part containing the transition for each STL datum by using 3D evaluation software (Geomagic
from [∫i] to [ka]. The vowel part of [∫i] was then removed, and the Studio 2014; 3D Systems, Rock Hill, SC, USA). Calculation meth-
remaining section was defined as the target part. Extractions of oth- ods for these parameters are shown in Figure 4. The palate height
er target sounds were performed in the same manner. The ratios of was defined as the distance from the midpoint between the mesio-
correct labels to all labels appearing in the consonant parts of the palatal cusps of both first molars to the palate (Fig. 4a). The ratio
202.242.187.110 - 8/27/2018 1:37:54 AM
Downloaded by: W. Junichiro - 239864
CL
WMD
Palate height
a b
PI
BP
PR
PL
DP
c d
Fig. 4. Calculation of the 3 morphological parameters. a Palate of the left side (PL) and right side (PR). d The front space was cal-
height was calculated as the length of the line (perpendicular to the culated as the ratio of front volume to total volume. The total vol-
reference plane [RP] from the midpoint between the mesiopalatal ume was defined as the space enclosed by the palate, RP, and distal
cusps of the left (CL) and right first molars (CR) to the palate. plane (DP; the perpendicular plane to RP including the distal ends
b Narrowing of the dental arch was calculated as the ratio of the of the left and right second molars). The bounding plane (BP) be-
interpremolar distance (DIP) to the mesiodistal width of the dental tween the front and rear spaces included the midpoint of the per-
arch (WMD). c The RP included the incisive papilla (PI) as well as pendicular line from PI to DP.
the palatal interdental papillae between the first and second molar
of the interpremolar distance to the mesiodistal width of the dental performed for each subgroup by using the Friedman test and pair-
arch was defined as narrowing of the dental arch (Fig. 4b). The to- wise comparisons. A multiple linear regression test was applied to
tal volume of the oral cavity was divided into front volume and rear evaluate the association of the 3 morphometric parameters to
volume. The ratio of front volume to total volume was defined as MARCs with each recording condition. For the comparison of
the front space (Fig. 4c, d). MARICs, the Mann-Whitney U test was performed among sub-
The study group was divided into 2 subgroups for each param- groups for the 3 parameters (SPSS version 22.0; SPSS Japan Inc.,
eter (Table 2). The boundary between the 2 subgroups was the Tokyo, Japan); p < 0.05 after Bonferroni correction was considered
median of all subjects. to be statistically significant.
Statistical Analysis
The Kolmogorov-Smirnov test was used for normality analysis.
Due to the absence of normality of some data, such as MARCs of Results
[∫i] and [ki], nonparametric statistical analyses were conducted.
Spearman’s rank correlation with Bonferroni correction was per-
formed to evaluate associations between morphometric parame- Correlations between Morphometric Parameters
ters. For each parameter, comparisons of MARCs among the 3 Front space was significantly correlated with palate
recording conditions (no connector, M-bar, and AP-bar) were height (ρ = –0.591; p = 0.039). Narrowing of the dental arch
202.242.187.110 - 8/27/2018 1:37:54 AM
Downloaded by: W. Junichiro - 239864
HP, height of palate; NDA, narrowing of dental arch; FS, front space. 1 Median of all subjects.
Table 3. Partial regression coefficients (β) in multiple regression analysis among morphometric parameter and
MARC with each recording condition
Height of palate 0.512* 0.422 0.244 –0.299 –0.076 0.101 0.167 –0.062
Narrowing of dental arch 0.143 –0.061 –0.124 –0.089 0.427 0.42 0.056 0.134
Front space –0.188 0.118 –0.09 –0.506 –0.343 0.108 0.001 –0.37
was not significantly correlated with palate height (ρ = the MARCs of all target sounds in all subjects; however,
–0.184; p = 1.00) or front space (ρ = –0.328; p = 0.594). placement of an AP-bar had a significant effect on the
MARCs of [t∫i] and [ki] (Fig. 5c). No significant effect
Correlations between MARCs and Parameters was found under any recording condition with respect to
The results of multiple linear regression tests are the MARCs of [çi] (Fig. 5b).
shown in Table 3. A significant correlation was found In contrast, the placement of an AP-bar had a signifi-
only between the change in MARCs of [∫i] with M-bar cant effect on the MARCs of [∫i] among high-palate (p =
and “front space” (β = 0.512, p = 0.036). 0.026), narrow dental arch (p = 0.004), and small front
space subgroups (p = 0.014) (Fig. 5a).
Subgroup Comparison of MARCs among 3 Recording
Conditions Comparison of MARICs
The MARCs under the 3 recording conditions, ac- Regardless of the recording condition, type of target
cording to morphological subgroup, are shown in Figure sound, or subgroup, various incorrect labels appeared
5. The placement of an M-bar had no significant effect on among all subjects. Regarding the MARICs of [ki], incor-
202.242.187.110 - 8/27/2018 1:37:54 AM
Downloaded by: W. Junichiro - 239864
* ** *
**
100
80
MARC of [∫i], %
60
40
20
0
Low High Narrow Wide Small Large
a Palate Dental arch Front space
100
80
MARC of [çi], %
60
40
20
0
Low High Narrow Wide Small Large
b Palate Dental arch Front space
* ** ** ** ** ** ** ** ** * ** **
100
*
80
60
MARC, %
40
20
0
Low High Low High Narrow Wide Narrow Wide Small Large Small Large
[t∫i] [ki] [t∫i] [ki] [t∫i] [ki]
Fig. 5. MARCs under 3 recording conditions, according to morphological subgroup. The MARCs under 3 record-
ing conditions (the white bar, grey bar, and dark bar represent control [Ctrl], with M-bar, and with AP-bar, re-
spectively) are shown by the subgroups. The MARCs of [∫i], [çi], and both of [t∫i] and [ki] are shown in a, b, and
c, respectively. The asterisks indicate a significant difference between recording conditions (* p < 0.05, ** p < 0.01).
202.242.187.110 - 8/27/2018 1:37:54 AM
Downloaded by: W. Junichiro - 239864
space patients.
8 Previous palatographical studies reported that vocal
tract leaks-in and closures in the oral cavity, both caused
4 by changes in the tongue contact, as well as disorders of
speech production, were typically encountered following
0 the placement of dentures [18–21]. The speech evaluation
M-bar AP-bar M-bar AP-bar system used in this study could assess leaks-in or closure
[gi] [çi] of the vocal tracts by comparing tongue contact areas be-
tween the target and emerging sounds, thus identifying
the incorrect labels [15, 27, 28]. In this study, the appear-
Fig. 6. Appearance ratios of incorrect labels of [ki]. The mean ap-
pearance ratios of incorrect labels (MARICs) of 2 labels of [ki] ([gi]
ance of the [gi] sound as an incorrect label for [ki] sig-
and [çi]) are shown by the recording conditions. The white bars nificantly increased with the placement of an M-bar in
represent the low-palate group, while the dark bars represent the low-palate subjects, compared with high-palate subjects.
high-palate group. The asterisk indicates a significant difference Since the voice onset time of the voiced plosive [gi] is
(* p < 0.05) between groups. shorter than that of the voiceless plosive [ki], the appear-
ance of [gi] indicated that the voice onset time of [ki] was
incidentally shortened [42, 43]. This suggests that the
rect labels representing the [gi] sound were significantly placement of a palatal bar covering the middle area in
higher with the placement of M-bars in low-palate subjects low-palate patients could readily cause closures of the vo-
(p = 0.027). The number of incorrect labels representing cal tracts. In contrast, the appearance of the [çi] sound as
the [çi] sound significantly increased with the placement the incorrect label for [ki] significantly increased with the
of an AP-bar in a high-palate subject (p = 0.027) (Fig. 6). placement of an AP-bar in high-palate subjects, in com-
parison to low-palate subjects. The tongue does not block
airflow through the vocal tract when the fricative [çi] is
Discussion produced. However, the vocal tract is blocked when the
plosive [ki] is produced. Thus, the appearance of [çi] in-
In this study, 2 alveolar ([∫i] and [t∫i]), 1 palatal ([çi]), dicated that the vocal tract for [ki] was incidentally leaked
and 1 velar ([ki]) sounds were considered the target by the placement of an AP-bar. This suggests that the
sounds. During articulation, the [∫i] and [çi] sounds do placement of a palatal bar covering both anterior and pos-
not require contact between the tongue and palate, where- terior areas for high-palate patients could readily cause a
as [t∫i] and [ki] sounds do require contact. To minimize leak in the vocal tract.
the influence of individual variations on the speech data, Despite various reported methods for evaluating the
the selected test words were proper familiar nouns for shape of the palate and dental arch, there is no consensus
these study subjects. In contrast to the auditory evalua- standard [44, 45]. In this study, palate height, narrowing
tion by speech pathologists, this selection did not affect of the dental arch, and the ratio of front area space were
evaluation by the speech recognition system. regarded as “palate and dental arch parameters,” which
The results of this study showed that [t∫i] and [ki] could be easily measured on maxillary stone models.
were significantly disturbed by the placement of an AP- Tanaka [35] reported that disorders of speech production
bar, with no significant effect on [çi] among all subjects. with CDs were more severe in small front space patients
This was consistent with the findings of previous studies than in large front space patients. Although we identified
[13–17, 20–22, 26–28, 39, 40]. In contrast, there were dis- a significant association between MARC of [∫i] with M-
tinctive trends in subgroup analysis only in the MARCs bar and front space on the basis of multiple linear regres-
of [∫i], which was readily disturbed by dentures (in a sion, our findings that placement of the experimental bar
202.242.187.110 - 8/27/2018 1:37:54 AM
Downloaded by: W. Junichiro - 239864