Professional Documents
Culture Documents
https://doi.org/10.1007/s00455-021-10316-1
ORIGINAL ARTICLE
Abstract
This study aimed to assess the validity and reliability of the Turkish translation of the Yale pharyngeal residue severity rat-
ing scale. The scale measures the severity of residue in the vallecula and pyriform sinus. The original scale was translated
into Turkish by two bilingual English-Turkish translators, and the Turkish version was translated back into English by two
qualified professional translators to assess accuracy. The evaluators were divided into two groups (training and no-training)
and two subgroups according to their experience. Intra-rater, inter-rater, and intra-class correlation coefficient measurements
were analyzed by calculating agreement rates, kappa, and p values. In the analysis of the reliability, intra-class correlation
coefficient values in the overall ratings for both the vallecula and the pyriform sinus were 0.9996 (95% CI 0.9992–0.9998)
and 0.9997 (95% CI 0.9995–0.9999), respectively (p < 0.01). High agreement (> 95%) and perfect Fleiss kappa values were
obtained for the vallecula and pyriform sinus ratings in the inter-rater initial assessments (κ = 0.959 and κ = 0.967, respec-
tively). Perfect kappa values were found in the intra-rater results for both the vallecula and pyriform sinus (α = 0.9959 and
κ = 0.9959, respectively). In the inter-rater secondary analysis, the vallecula and pyriform sinus kappa values were perfect
(κ = 0.959 and κ = 0.967, respectively). In the intra-rater analysis, perfect kappa values were obtained for the vallecula and
pyriform sinus in the no-training group and less-experience subgroup (κ = 0.9918 and κ = 1.0 for the vallecula, and κ = 1.0
and κ = 0.9902 for the pyriform sinus, respectively) In the inter-rater analysis, perfect kappa values were obtained for the
vallecula and pyriform sinus in the no-training group and less-experience subgroup (κ = 0.9507 and κ = 0.9606 for the val-
lecula, and κ = 0.9836 and κ = 1.0 for the pyriform sinus, respectively). The Turkish translation of the Yale pharyngeal residue
severity rating scale demonstrated high validity and reliability scores in determining pharyngeal residue location and value
in the fiberoptic endoscopic evaluation of swallowing.
Keywords Pharyngeal residue · Validity · Swallowing · Vallecula · Pyriform sinus · Therapy · Scale
Introduction
13
Vol.:(0123456789)
Yavuz Atar et al.: Turkish version of the Yale Pharyngeal Residue Scale
13
Yavuz Atar et al.: Turkish version of the Yale Pharyngeal Residue Scale
Vallecula
I None 0% No residue
II Trace 1–5% Trace coating of the mucosa
III Mild 5–25% Epiglottic ligament visible
IV Moderate 25–50% Epiglottic ligament covered
V Severe > 50% Filled to epiglottic rim
Pyriform sinus
I None 0% No residue
II Trace 1–5% Trace coating of the mucosa
III Mild 5–25% Up wall to quarter full
IV Moderate 25–50% Up wall to half full
V Severe > 50% Filled to aryepiglottic fold
Turkish version
Vallekula
I Yok 0% Rezidü yok
II Eser miktar 1–5% Mukozayı eser miktarda kaplamış
III Hafif 5–25% Epiglotik ligaman görülebilir
IV Orta 25–50% Epiglotik ligaman kaplanmış
V Şiddetli > 50% Epiglot kenarına kadar dolmuş
Sinüs piriformis
I Yok 0% Rezidü yok
II Eser miktar 1–5% Mukozayı eser miktarda kaplamış
III Hafif 5–25% Duvarın dörtte birine kadar dolu
IV Orta 25–50% Duvarın yarısına kadar dolu
V Şiddetli > 50% Ariepiglotik foldlara kadar dolmuş
13
Yavuz Atar et al.: Turkish version of the Yale Pharyngeal Residue Scale
Fig. 1 Vallecula images for each residue level: a none; b trace; c mild; d moderate; and e severe, the residue images (b–e) are property of Dr.
Paul Neubauer and limited permission has been obtained from Dr. Paul Neubauer for publication
Fig. 2 Pyriform sinus images for each residue level: a none; b trace; c mild; d moderate; and e severe, the residue images (b–e) are property of
Dr. Paul Neubauer and limited permission has been obtained from Dr. Paul Neubauer for publication
and Krippendorff’s alpha values were calculated, and safe anatomic part (vallecula and pyriform sinus), measurement
intervals and p values with 95% reliability were determined. (first and second), training status (present vs. absent), and
The inter-rater agreement analyses were performed for each level of experience (≥ 5 vs. ≤ 4 years) status. Participants
13
Yavuz Atar et al.: Turkish version of the Yale Pharyngeal Residue Scale
Turkish Version of the Scale At the end of the study, there were 156 ratings for each
group per session, comprising a total of 624 ratings by 24
We worked with a team that had previously been involved raters. For the 24 raters and the images, the average ICC
in various validation studies of translations from English to coefficient values for the vallecula (n = 312) and pyriform
Turkish. After the translation was completed by the transla- sinuses (n = 312) for the initial assessment were 0.9996 (95%
tors, our experts corrected the scale explanation sections CI 0.9992–0.9998) and 0.9997 (95% CI 0.9995–0.9999),
in terms of compliance with professional terminology. The respectively. A very strong positive correlation was obtained
scale was sent to the original author for final approval, and in both the vallecula (n = 312) and pyriform sinus (n = 312)
additional opinions were obtained for the explanations on the ICC coefficients in the initial validity assessment of the
scale. Due to the presence of two separate pyriform sinuses training and no-training groups. Very strong positive corre-
in normal anatomy and cultural differences, the definition lation results were obtained for both the vallecula and pyri-
of “aryepiglottic fold” was translated into the plural (-s). form sinus ICC coefficient values, according to the FEES
This change was included in the final version after obtaining experience of the study group (≥ 5 vs. ≤ 4 years; 0.9993,
approval from the original author. 0.9991, 0.9993, and 1.), respectively; Table 3). Overall,
the agreement values of intra-rater κ = 0.9959 and 99.68%
agreement were obtained for both the vallecula and pyri-
Table 2 Rater characteristics
form sinus. When the intra-rater reliability of the ratings
of training (n = 312) and no-training (n = 312) groups was
Raters n (%) Years of experience evaluated, the kappa values were 0.9996 and 0.9987 for the
Median (± IQR) Min.–Max vallecula, and 0.9994 and 0.9997 for the pyriform sinus,
respectively. According to the intra-rater training status,
Total 24 (100%) 5.5 (± 3.88) 01-Oct
over 95% agreement was obtained in all subgroups. In the
Experience ≤ 4 10 (42%) 3 (± 1.25) 01-Apr
intra-rater experience analysis, ratings obtained according
Experience ≥ 5 14 (58%) 6.5 (± 2.13) 05-Oct
to ≥ 5 years (n = 364) and ≤ 4 (n = 260) years of experience
No-training 12 (50%) 5.75 (± 3.88) 01-Sep
with FEES were evaluated, and the kappa values of 0.9993
Training 12 (50%) 5.25 (± 4.63) 01-Oct
and 0.9991 were found for the vallecula, and 0.9993 and
Professional status
1.0 were found for the pyriform sinus, respectively. For the
ENT specialist 14 (58%) 6.5 (± 2.13) 05-Oct
intra-rater reliability of FEES experience, an agreement of
ENT resident 10 (42%) 3 (± 1.25) 01-Apr
over 95% was obtained in all subgroups (Table 4).
13
Yavuz Atar et al.: Turkish version of the Yale Pharyngeal Residue Scale
In the first (initial) inter-rater evaluation, perfect kappa In the inter-rater secondary analysis, agreement of
values were obtained in the overall results, with a value of greater than 95% was found in all other groups and sub-
0.9672 for the vallecula and 0.9761 for the pyriform sinus. groups except for the vallecula of the no-training group
In the inter-rater evaluation, less than 95% agreement was (94.87%). When the kappa values were analyzed, excel-
obtained in all subgroups according to training status. lent kappa values were found in all groups (n = 312) and
While kappa values of 0.9836 for the vallecula and 0.9672 subgroups (n = 156; Table 6).
for the pyriform sinus were reported for the training group,
the perfect kappa values of 0.9507 and 0.9836 were found,
respectively, in the no-training group. According to the Discussion
level of FEES experience, greater than 95% agreement
was found in all subgroups. For those with ≥ 5 years of FEES is a reliable swallowing assessment tool [2, 20],
FEES experience, a kappa value of 0.9719 was found for and it can be used to show the location and size of PR
the vallecula, and 0.96 was found for the pyriform sinus. in conjunction with validated scales [2, 11, 20, 22]. The
For those with ≤ 4 years of experience, excellent kappa scales support the use of a common language with the
results were obtained for the vallecula (κ = 0.9606) and patients that are followed up after the diagnosis of swal-
for the pyriform sinus (κ = 1.00), respectively (Table 5). lowing disorder and allows for the identification of patient
follow-up parameters and responses to treatment [2, 7, 19,
13
Yavuz Atar et al.: Turkish version of the Yale Pharyngeal Residue Scale
20]. There may be validated and non-validated scales pre- the kappa comparison, and we specified the results with the
viously reported for PR, but the YPRSRS has been shown ICC method.
to have high validity and reliability in studies conducted It is important for a scale to be clear enough that it does
in different languages [2, 10–12, 20–24]. In the original not require special training for its use in providing accu-
study of YPRSRS, we found it easier to explain the scale rate results. Neubauer et al. [11] has shown that there is
through descriptive images, and we decided to validate it no significant difference as statistically between the groups
in Turkish. trained and not trained in the use of the YPRSRS. However,
The T-YPRSRS demonstrated high intra-class correlation Gerschke et al. [12], in the German version of the YPRSRS,
and perfect intra-rater and inter-rater reliability and validity obtained high kappa values in both vallecula groups and
after evaluation in all rater groups for both anatomical loca- found that the kappa value of the untrained group was sig-
tions (Tables 3, 4, 5, 6). We investigated the software they nificantly lower than that of the trained group based on the
used to ensure that our kappa results were as compatible as results of the Student’s t test. In our study, only in the inter-
possible with the previous two versions. The program used secondary analysis was the vallecula rating of the untrained
to calculate the Fleiss kappa values in the original study group slightly below the agreement value (94.87%), on the
was not reported. Therefore, we made our own calcula- other hand the other agreement values greater than 95%
tions with the German version of Stata 15.1. However, in (Table 6). Nevertheless, the ICC values we obtained in
the German version, Student’s t test was used to express the untrained group and the kappa results in the first and
13
Yavuz Atar et al.: Turkish version of the Yale Pharyngeal Residue Scale
secondary inter-rater analyses showed perfect values, and we Curtis et al. [8] reported that there might be differences in
concluded that the Turkish version of the scale was suitable the FEES results for foods of different consistency, opacity,
for clinical use without special training (Tables 3, 4, 5, 6). and color. Prospective studies involving FEES studies with
Neubauer et al. [11] obtained perfect kappa values in both foods of different consistencies to demonstrate the ability to
groups in the intra-rater results, but they found a significant use the scale in the future may be useful.
difference in the evaluations of the experienced group in the Using allusions, phrases, or binary words in scales that
pyriform sinus localization. However, the values were close are translated from one language to another may reduce
for the vallecula. In the study of the German version and our the correlation between the original and translated ver-
study, raters with less experience obtained very high kappa sions of the scale [15, 16]. Two of our experts conducted
values in both localizations. In our study, high kappa values a preliminary study to mitigate these issues due to cultural
were indicated for the less experienced group. This shows and linguistic differences. After obtaining the results, the
that the scale is a practical use tool that does not require high main study was initiated. Although our study was conducted
levels of experience in evaluating FEES images. with raters of 24 different experiences and characters, we
In our study, 26 randomized images were evaluated by cli- achieved perfect kappa results in the validation study we
nician raters (n = 24) with a FEES median experience level conducted with the final version, which is consistent with the
of 5.5 years. In the original study published in 2015, 25 ran- studies conducted for versions in other languages.
domized images were studied, and the average FEES experi-
ence of the raters (n = 20) was reported to be 8.3 years. In Limitations
the German version published in 2018, 30 non-randomized
images were studied with an average FEES experience level There are limitations to this study related to the cultural
of the raters (n = 28) was reported to be 4.5 years. Although adaptation and translation process specific to validation
the image randomization, rater number, and experience studies. Another limitation is that we worked with images
values were different, when the kappa values of the ver- selected by experts from the original image archive that the
sions of the same scale in the three different languages were YPRSRS was published with about six years ago. Thus, cli-
examined, excellent kappa values were obtained in both the nicians, including the raters, may have accessed the English
vallecula and pyriform sinus results of all studies [11, 12]. version of the original study and used it in FEES studies.
Except for the very good kappa value in the inter-rater pyri- In our study, we developed the rating scale with the fixed
form sinus test of the original study. (0.7511 ± 0.011). How- images selected by experts, as was the case with the previous
ever, excellent kappa values were found in all groups in both two versions, but it may be useful to carry out prospective
the German version translated three years after the original studies for simultaneous FEES scaling with dynamic.
study and in our study conducted 6 years after the original.
These high kappa values may be associated with the broader
recognition of the scale with the passing of time. Another Conclusions
reason may be that the widespread use of flexible endoscopy
has increased the use of images among ENT physicians. The T-YPRSRS demonstrated high validity and reliability
The YPRSRS is an ordinal scale containing five levels in its use to determine PR location and value in FEES. We
ranging from no residue to a severe level. Pisegna et al. believe that the Turkish version will contribute to FEES
[25] examined the relationship between PR measurements studies as a practical scale that has been validated and
and parameters in FEES application. They suggested that published.
the scale should include multiple categories, such as mild-
moderate-severe, when working with an ordinal scale. The Acknowledgements We would like to thank Mr. Paul D. Neubauer who
T-YPRSRS is an ordinal scale, it demonstrated high valid- shared the original images in his archive with us and also Mr. Marco
ity and reliability scores with its five levels ranging from Gerschke for his valuable contribution and assistance. We respectfully
commemorate the late Steeven B. Leder, one of the developers of the
no residue to a severe and also it had been anatomically scale.
defined. As a result of 624 ratings of the T-YPRSRS, it has
been found to be a useful version with an agreement value Funding This research did not receive any specific grant from funding
of over 95% among clinicians with 1–10 years of experi- agencies in the public, commercial, or not-for-profit sectors.
ence. In the recorded fixed FEES images used in our study,
opaque white-colored food with the consistency of pudding Declarations
was used. High agreement and perfect kappa values were
achieved in the evaluations of both untrained raters and inex- Conflict of interest The authors declare that they have no conflict of
interest.
perienced raters.
13
Yavuz Atar et al.: Turkish version of the Yale Pharyngeal Residue Scale
13