Professional Documents
Culture Documents
Manual Therapy
journal homepage: www.elsevier.com/math
Original article
a r t i c l e i n f o a b s t r a c t
Article history: Head posture (HP) is assessed as part of the clinical examination of patients with neck pain using
Received 31 October 2009 observation and qualitative descriptors. In research, HP is characterised through the measurement of
Received in revised form angles and distances between anatomical landmarks. This study investigated whether the assessment of
27 April 2010
HP as performed in clinical practice is reliable and valid. Ten physiotherapists assessed forward HP, head
Accepted 5 May 2010
extension and side-flexion from images of 40 individuals with and without previous experience of neck
pain using a four-category scale. The assessment was repeated twice with a 1-week gap. Physiothera-
Keywords:
pists’ ratings were then compared with angular measurements of the same components of HP. K values
Head posture
Observation
for intra-rater reliability varied between 0.22 and 0.81 for forward HP, between 0.19 and 0.69 for head
Reliability extension and between 0.38 and 0.67 for side-flexion. K values for inter-rater reliability were 0.02 for
Validity forward HP, 0.07 for head extension and 0.19 for side-flexion. Correlation coefficients between the ratings
and the angular measurements varied between 0.16 and 0.49 for forward HP, between 0.17 and 0.68
for head extension and between 0.04 and 0.37 for side-flexion. The assessment of HP by observation
and a four-category scale showed poor reliability and validity.
Ó 2010 Elsevier Ltd. All rights reserved.
1. Introduction 2007; Johnston et al., 2008) and greater neck muscle fatigability
(Falla et al., 2003).
Head posture (HP) assessment is recommended as part of the A recent systematic review of studies comparing HP in indi-
examination of patients with neck pain (NP) to aid diagnosis, viduals with NP and asymptomatic individuals was inconclusive
determine treatment strategies and monitor the progress of the due to conflicting outcomes and shortcomings in methodological
patient (American Physical Therapy Association, 2001; Kendall quality of the included studies (Silva et al., 2010).
et al., 2005; Petty, 2006; Lau et al., 2008; Magee, 2008; Yip et al., However, the more recent studies included in this systematic
2008). This is based on evidence that deviations in HP are associ- review are of better methodological quality and suggest that there
ated with decreasing length moment arms and increasing activity is a difference in HP between patients with NP and asymptomatic
of the neck extensors (Kumar et al., 2002; Przybyla et al., 2006), participants. For example, there were 10 studies that measured
increasing forces acting on anatomical structures (Bonney and forward HP in adults, of which 4 reported a difference and 6 did not.
Corlett, 2002) and decreasing range of movement of the neck Five out of these 6 studies were judged as using a small sample size
(Edmondston et al., 2005). Studies have shown that patients with and, therefore, are likely to be reporting a false negative. In contrast,
neck pain have decreased range of motion of the neck (Woodhouse 3 of the 4 studies that reported a difference in HP between patients
and Vasseljen, 2008), reduced neck muscle strength, neuromus- and asymptomatic participants used an a priori sample size
cular efficiency and endurance (Harris et al., 2005; Cagnie et al., calculation. Moreover, two of these studies (Lau et al., 2008; Yip
et al., 2008) measured the angle C7-tragus-horizontal as an indi-
cator of HP while participants were in static standing and reported
a mean difference of 5.1 and 6.7, respectively. These values are
* Corresponding author. Escola Superior de Saúde, Universidade de Aveiro, higher than the standard error of measurement calculated by Lau
Campo Universitário de Santiago, 3810-193 Aveiro, Portugal. Tel.: þ351
et al. (2008) in the same study (1.2 ) suggesting that the differ-
234401558x23899; fax: þ351 234401597.
E-mail address: asilva@ua.pt (A.G. Silva). ence in HP between patients with NP and asymptomatic partici-
1
www.leeds.ac.uk/pallium pants is clinically significant.
1356-689X/$ e see front matter Ó 2010 Elsevier Ltd. All rights reserved.
doi:10.1016/j.math.2010.05.002
A.G. Silva et al. / Manual Therapy 15 (2010) 490e495 491
2.1.2. Raters
Raters were 10 physiotherapists recruited using advertisements
and word of mouth from physiotherapists that had an association
with Aveiro’s University (Portugal) or Leeds Metropolitan Univer-
sity (UK). Eligibility criteria were having at least 3 years of clinical Fig. 1. Images from one participant in the study: (A) frontal image and (B) lateral
experience and to self-report that they routinely assess HP by image (participant consent is available on request). Note: in the study the images were
observation in their clinical practice. shown without masking the eyes.
492 A.G. Silva et al. / Manual Therapy 15 (2010) 490e495
Forty individuals had their image taken (21 women and 19 men; 3.4. Inter-rater reliability
mean SD age 39.50 14.18 years, age range 20e65 years;
mean SD weight 69.73 14.81 kg and height 164.35 9.14 cm). When presented with the same images on the first assessment and
Sixteen of these individuals reported that they had previous using a four-category scale, the 10 raters scored at least 3 different
experience of NP but no episode lasted longer than 3 days. Only 1 categories of assessment for 90% of the 40 images for forward HP, 75%
was in pain when the images were taken. for head extension and 40% for side-flexion (Table 2). No image
received the same rating from all raters for forward HP and extension
3.2. Raters and only 2.5% (n ¼ 1) received the same rating for side-flexion. The
multirater K calculated for the 10 raters was 0.02 for forward HP, 0.07
The 40 sets of images were assessed by 10 physiotherapists (5 for head extension and 0.19 for side-flexion, indicating that agree-
Portuguese and 5 British) ranging in age from 28 to 41 years and ment among the 10 raters was not very different from that expected
a mean of 12.7 years of clinical experience (range: 7e24). Two were by chance alone (Table 3). Results also show that Portuguese and
full-time clinicians, 2 were full-time University lecturers who British raters had similar levels of inter-rater reliability suggesting
taught HP assessment and 6 worked as both clinicians and that potential differences in the theoretical and clinical background of
University lecturers. Three raters reported to use the scale used in the physiotherapists did not affect the reliability levels. Furthermore,
this study in clinical practice (referred as raters 1, 3 and 6). previous experience using the scale to assess HP in clinical practice
did not contribute to higher levels of inter-rater reliability.
3.3. Intra-rater reliability
3.5. Angular measurements
When assessing the same image on 2 occasions separated by 7
days, 7/10 raters for forward HP and 9/10 raters for head extension The angle C7-tragus-horizontal is used as indicative of forward
and side-flexion scored the same category on the scale for the HP and decreasing values are indicative of a more forward HP. Mean
same image in both assessments for less than 80% of the 40 images (SD) angular values were 46.62 6.10 (range: 33.76e57.77 ). The
(Table 1). Raters used different number of scale categories and, as angle tragus-eye-horizontal is used as indicative of head extension
would be expected, raters using fewer categories tended to show and increasing values are indicative of a more extended head. Mean
higher percentages of agreement. This may help explain the (SD) angular values were 18.11 7.75 (range: 1.36 to 36.08 ).
Table 1
Percentage of agreement, K values, 95% confidence interval (95% CI) and standard error (SE) for intra-rater reliability.
Table 3
K values, 95% confidence interval (95% CI) and standard error (SE) for inter-rater reliability (calculated using data from the first assessment).
K CI SE K CI SE K CI SE
All raters 10 0.02 0e0.06 0.02 0.07 0.03e0.11 0.02 0.19 0.15e0.23 0.02
Portuguese 5 0.00 0e0.06 0.03 0.05 0e0.11 0.03 0.09 0.01e0.17 0.04
raters
British raters 5 0.00 0e0.08 0.04 0.08 0e0.18 0.05 0.38 0.28e0.48 0.05
Raters using the 3 0.02 0e0.16 0.07 0.11 0e0.25 0.07 0.03 0e0.17 0.07
scale in clinical
practice
The angle right eareleft ear-horizontal is used as indicative of side- observation to assess HP, in particular, considering that mean
flexion with 0 indicating perfect symmetry. Mean (SD) angular differences between individuals with and without NP for HP are
values for this angle were 1.56 3.03 (range: 0.02e7.6 ). likely to be small. A systematic review of studies comparing HP
between individuals with and without NP found that mean differ-
3.6. Relationship between the angular measurements and the ences between groups varied between 2.2 and 6.7 (Silva et al.,
severity scale categories 2010). This suggests that differences within and between patients
with NP are likely to be of similar magnitude, but the results of this
Matching the angular values against the respective category scored study challenge the ability of physiotherapists to accurately and
by the raters in the first assessment, revealed no clear correspondence consistently assess such differences.
between each one of the categories of the scale and an interval of Factors such as the subjectivity of the scale categories requiring
angular values for any of the 3 angles measured (Table 4). Mean a judgement from the rater and the possibility that the patterns of
angular values within each category tend to decrease from category 1 reference used when rating HP varied within and between raters
to category 4 for forward HP suggesting that individuals with more may have contributed to the low reliability. This is supported by the
forward HP were correctly scored a category of higher severity. findings of a previous study where the relevance attached to HP
However, minimum values were the same for the 4 categories and deviations was found to vary among physiotherapists (Silva et al.,
maximum values differ less than 2 among the 4 categories and 2008). It is also possible that the pattern being used changed for
standard deviation also showed that categories overlap considerably. the same rater between assessments or throughout the assessment
Mean angular values within each category tend to increase from as a result of comparing HP between the individuals in the images.
category 1 to category 4 for extension and side-flexion suggesting that Furthermore, the level of reliability among the raters that used the
individuals with a more extended and side-flexed head were correctly scale in clinical practice was not higher than the level of reliability
scored a category of higher severity. However, and as for forward HP, among those that did not, suggesting that lack of experience with
minimums, maximums, means and SDs show that categories overlap the scale was not responsible for the low levels of reliability.
considerably and, therefore, suggest that raters were unable to accu- In clinical practice, physiotherapists usually assess HP after
rately attribute a higher rating to more deviated HPs. taking the history of the patient, while in the present study they
Correlation coefficients between angles and ratings attributed were blind to participants’ status (patient with NP or asymptomatic).
by each one of the 10 raters varied between 0.16 and 0.49 for However, knowledge of the patient history does not seem to have an
forward HP, between 0.17 and 0.68 for head extension and impact on the level of reliability of other clinical tests used to assess
between 0.04 and 0.37 for side-flexion (Table 5). This indicates neck range of motion, tenderness, atrophy, sensitivity to pain and
weak to low correlation between angular measurements and the strength (Bertilson et al., 2001). A further consideration is whether
severity ratings for forward HP and side-flexion and weak to the fact that raters were not in the presence of participants but
moderate correlation for head extension. assessing photographs that only showed the upper half of the indi-
viduals being assessed could have affected the reliability and validity
4. Discussion of the assessment. Photographs were chosen because it was a means
to standardise HP so that the same posture was shown to all raters in
The assessment of forward HP, head extension and side-flexion the first and second assessments 1 week apart, and to allow the
by observation and a four-category scale seems to be neither reli- comparison between the observational assessment and the angular
able nor valid. This challenges the clinical usefulness of using measurements. In addition, photographs were also reported to be
Table 4
Descriptive statistics for angular values grouped according to the category of the scale attributed to each component of HP by all raters.
Tragus-eye-horizontal/extension
Number of ratings 160 142 82 16
Range of values 1.36 to 30.79 1.36 to 36.08 1.36 to 36.08 15.08e36.08
Mean SD 16.06 6.32 18.04 7.81 20.43 7.92 27.11 7.07
Table 5 increased the levels of reliability (van Genderen et al., 2003). Other
Correlation between the ratings attributed by the raters and the angular values used factors that could have negatively affected the reliability were the
as surrogate measures for each component of HP.
use of half body photographs and the use of HP assessment out of
Rater Forward HP Extension Side-flexion the clinical practice context. Physiotherapists use imaginary lines of
T p T p T p reference (e.g. line of gravity) to inform their assessment of HP
1 0.39 0.002* 0.68 0.000* 0.28 0.024* (Silva et al., 2009a) which cross anatomical points in the head,
2 0.37 0.003* 0.27 0.035* 0.04 0.730 thorax and lower limbs. However, in this study some reference
3 0.44 0.000* 0.42 0.000* 0.15 0.228 points were missing as only half body (head to waist) photographs
4 0.16 0.233 0.39 0.004* 0.04 0.779
were used which may have affected their ability to make judgments
5 0.48 0.000* 0.19 0.131 0.37 0.004*
6 0.41 0.001* 0.33 0.01* 0.02 0.857
on HP. In addition, HP is usually considered in the context of the
7 0.49 0.000* 0.19 0.152 0.09 0.467 whole examination and knowledge of whether HP influences pain
8 0.49 0.000* 0.53 0.000* 0.16 0.201 or pain influences HP is perhaps considered when assessing the
9 0.21 0.110 0.24 0.060 0.07 0.574 patient. It is possible though that this information can affect the
10 0.18 0.171 0.17 0.206 0.05 0.682
reliability of HP assessment.
T, Tau b correlation coefficient; *, statistically significant. The lack of reliability and validity of HP assessment by obser-
vation and the small size of the differences in HP between partic-
used in clinical practice and assessed by observation by 19.6% of the ipants with and without NP suggest that future studies should
respondents in a previous study (Silva et al., 2009a). explore the reliability and validity of quantitative procedures of HP
The low reliability found in the current study is in agreement assessment that could easily be used in clinical practice.
with the findings of Cleland et al. (2006), but contrast with the
findings of Griegel-Morris et al. (1992), Paternostro-Sluga et al. 5. Conclusion
(1995) and Eriksson et al. (2000). It could be expected that differ-
ences in the current study procedures such as the use of a scale The assessment of HP through observation and the use of a four-
with more categories, use of photographs, absence of a pre-training category severity scale seem to be neither reliable nor valid chal-
session might have contributed to the contrasting results. However, lenging its clinical usefulness.
in the study of Cleland et al. (2006) the assessment of forward HP
was dichotomized as present or absent, carried out in the presence Acknowledgements
of the patients and experienced physiotherapists received pre-
study training. Nevertheless, Cleland et al. (2006) also reported HP This work was funded via a PhD scholarship from the Founda-
assessment to be unreliable. Therefore, it is likely that factors such tion for Science and Technology (SFRH/BD/30735/2006), Portugal.
as the use of a low number of individuals (n ¼ 5 for inter-rater
reliability and n ¼ 10 for intra-rater reliability) (Griegel-Morris References
et al., 1992), low number of raters (n ¼ 2) (Eriksson et al., 2000)
and the use of statistics that do not correct for agreement expected Altaye M, Donner A, Eliasziw M. A general goodness-of-fit approach for inference
procedures concerning the kappa statistics. Stat Med 2001;20:2479e88.
by chance alone (Paternostro-Sluga et al., 1995) explain the higher American Physical Therapy Association. Guide to physical therapist practice. Phys
reliability found by these authors. Ther 2001;81:9e746.
When comparing the ratings against the angular measurements, Bertilson B, Grunnesjö D, Strender L. Reliability of clinical tests in the assessment of
patients with neck/shoulder problems e impact of history. Spine
some degree of overlapping between adjacent categories in terms 2001;28:2222e31.
of the corresponding angular values would have been expected due Bonney R, Corlett E. Head posture and loading of the cervical spine. Appl Ergon
to an increase in the difficulty of choosing between categories in 2002;33:415e7.
Cagnie B, Cools A, De Loose V, Cambier D, Danneels L. Reliability and normative
the transition area. However, the extension of overlapping is too big
database of the Zebris cervical range-of-motion system in healthy controls with
and is occurring not only between adjacent categories, suggesting preliminary validation in a group of patients with neck pain. J Manipulative
that physiotherapists rated similar HP deviations as different and Physiol Ther 2007;30:450e5.
Cleland J, Childs J, Fritz J, Whitman J. Interrater reliability of the history and physical
different HP deviations as similar. While the accuracy of the
examination in patients with mechanical neck pain. Arch Phys Med Rehabil
correlation coefficients between the angular values and the ratings 2006;87:1388e95.
might have been compromised as the limited number of scale Edmondston S, Henne S-E, Loh W, Østvold E. Influence of cranio-cervical posture on
categories lead to a high number of tied ranks, which are known to three-dimensional motion of the cervical spine. Man Ther 2005;10:44e51.
Eriksson E, Mokhtari M, Pourmotamed L, Holmdahl L, Eriksson H. Inter-rater reli-
affect the magnitude of the correlation coefficient (Pett, 1997), the ability in a resource-oriented physiotherapeutic examination. Physiother
considerable overlap between scale categories shown by the Theory Pract 2000;16:95e103.
descriptive statistics is in accordance with the low correlation Falla D, Rainoldi A, Merletti R, Jull G. Myoelectric manifestations of sternocleido-
mastoid and anterior scalene muscle fatigue in chronic neck pain patients. Clin
coefficients found. No previous studies were found that compared Neurophysiol 2003;2003(114):488e95.
the assessment of HP through observation and severity scales Griegel-Morris P, Larson K, Mueller-Klaus K, Oatis C. Incidence of common postural
against surrogate measures for HP. Furthermore, the mean angular abnormalities in the cervical, shoulder and thoracic regions and their association
with pain in two age groups of healthy subjects. Phys Ther 1992;72:425e31.
values and standard deviation in the present study were similar to Harris K, Heer D, Roy T, Santos D, Whitman J, Wainner R. Reliability of a measure-
those reported in previous studies for patients with NP. For ment of neck flexor muscle endurance. Phys Ther 2005;85:1349e55.
example, the mean values and standard deviation for the angle C7- Johnston V, Jull G, Souvlis T, Jimmieson N. Neck movement and muscle activity
characteristics in female office workers with neck pain. Spine 2008;33:555e63.
tragus-horizontal in the present study were 46.6 6.1, and were
Kendall F, McCreary E, Provance P, Rodgers M, Romani W. Muscles testing and func-
49.9 6.1 in a study of Yip et al. (2008) and 46.1 6.7 in tion, with posture and pain. Philadelphia: Lippincott Williams and Wilkins; 2005.
a previous study of our team (Silva et al., 2009b) for participants Kumar S, Narayan Y, Amell T, Ferrari R. Electromyography of superficial cervical
muscles with exertion in the sagittal, coronal and oblique planes. Eur Spine J
with NP. This suggests that the HPs assessed in the present study
2002;2002(11):27e37.
were similar to the HP of patients with NP. Lau H, Chiu T, Lam T. Clinical measurement of craniovertebral angle by electronic
A preparatory session was not conducted because authors did head posture instrument: a test of reliability and validity. Man Ther 2008;2009
not want to influence physiotherapists’ judgement prior to con- (14):363e8.
Magee D. Orthopaedic physical assessment. Philadelphia: Saunders; 2008.
ducting the studies. However, it may be argued that a preparatory Norkin C, White D. Measurement of joint motion: a guide to goniometry. Phila-
session aiming to standardise procedures of assessment could have delphia: FA Davis Company; 1995.
A.G. Silva et al. / Manual Therapy 15 (2010) 490e495 495
Paternostro-Sluga T, Preisinger E, Uher E, Resch K, Ernst E. How reproducible is the Silva AG, Punt D, Sharples P, Vilas Boas JP, Johnson M. Head posture and neck pain of
functional assessment of the spine? Eur J Phys Med Rehabilitat 1995;5:122e5. chronic nontraumatic origin: a comparison between patients and pain-free
Pett M. Nonparametric statistics for health care research. Statistics for small persons. Arch Phys Med Rehabilitat 2009b;90:669e74.
samples and unusual distributions. California: Sage Publications; 1997. Silva AG, Sharples P, Johnson M. A systematic review of studies comparing surrogate
Petty N. Neuromusculoskeletal examination and assessment. A handbook for measures for head posture in individuals with and without neck pain. Phys Ther
therapists. Bath: Churchill Livingstone; 2006. Rev 2010;15:12e22.
Przybyla A, Mohite A, Blease S, Adams M. How does posture affect lever arms of van Genderen F, De Bie R, Helders P, Van Meeteren N. Realiability research: towards
neck flexor and extensor muscles? J Biomech 2006;39:S49. a more clinically relevant approach. Phys Ther Rev 2003;8:169e76.
Raine S, Twomey L. Head and shoulder posture variations in 160 asymptomatic van Niekerk S, Louw Q, Vaughan C, Grimmer-Somers K, Schreve K. Photographic
women and men. Arch Phys Med Rehabilitat 1997;78:1215e23. measurement of upper-body sitting posture of high school students: a reli-
Silva AG, Punt D, Sharples P, Vilas Boas J, Johnson M. The Experiences of Portuguese ability and validity study. BMC Musculoskelet Disord 2008;9:113.
Physiotherapists When They Assess Head Posture for Patients with Neck Pain: Walter S, Eliasziw M, Donner A. Sample size and optimal designs for reliability
a Focus Group Study. Liverpool: British Pain Society; 2008. British Pain Society studies. Stat Med 1998;17:101e10.
meeting. Woodhouse A, Vasseljen O. Altered motor control patterns in whiplash and chronic
Silva AG, Punt D, Johnson M. A postal survey gathering information about physio- neck pain. BMC Musculoskelet Disord 2008;20:90e100.
therapists’ assessment of head posture for patients with chronic idiopathic neck Yip C, Chiu T, Poon A. The relationship between head posture and severity and
pain. Eur J Pain 2009a;13:S223. disability of patients with neck pain. Man Ther 2008;13:148e54.