Professional Documents
Culture Documents
Therapy Practice
Jules M Rothstein, Suzann K Campbell, John L
Echternach, Alan M Jette, Harry G Knecht, Steven J
Rose and on behalf of the Task Force on Standards for
Measurement in Physical Therapy
PHYS THER. 1991; 71:589-622.
The online version of this article, along with updated information and services, can be
found online at: http://ptjournal.apta.org/content/71/8/589
Jules M Rothstein, PhD, PT, is Professor and Head, Depart- Dedication. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .,591
.
ment of Physical Therapy (M/C 898), University of Illinois at
Chicago, and Chief of Physical Therapy Services, University Introduction. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ,592
..
of Illinois Hospital, 1919 W Taylor St, Chicago, IL 60612
Glossary of Terms Used in the Standards . . . . . . . . . . .,595
(USA).
Suzann K Campbell, PhD, PT,FAF'TA, is Professor, Depan- Standards for Primary Test Purveyors. . . . . . . . . . . . . . . ..598
ment of Physical Therapy, University of Illinois at Chicago.
Standards for Secondary Test Purveyors . . . . . . . . . . . . .,604
John L Ekhternach, EdD, PT, is Chairman, School of Commu-
nity Health Professions and Physical Therapy, Old Dominion Standards for Tertiary Test Purveyors. ............... ,609
University, Norfolk, VA 23539-0288.
Standards for Test Users . . . . . . . . . . . . . . . . . . . . . . . . . . . . .613
.
Alan M Jlette,PhD, PT, is Senior Research Scientist, New En-
gland Research Institute Inc, 9 Galen St, Watertown, MA Standards for Ensuring Integrity in
02172. Measurement Research. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ,620
.
One of the first acts of the Task Force members was to The Standards include a glossary. The glossary defines
examine the APA manual. The APA Standards are a pri- terms as they are used in the Standards. The glossary al-
mary source of information on measurement. The mem- lows readers to see how these terms were used by the
bers of the Task Force agreed, however, that while the authors of the Standards. The Task Force made every ef-
APA document contained a great deal of useful informa- fort to avoid creating new terms and to avoid using jar-
tion, it was not directly applicable to physical therapy. gon. The glossary is meant solely as a source for materials
Many of the measurement problems in physical therapy in the Standards; it is not a general measurement glossary.
are unique. Physical therapists use measurements that are Wherever possible, the terms used and defined are those
based on the behavioral, biological, and physical sciences. commonly found in the measurement literature. Clini-
The scope of measurements in physical therapy is extraor- cians may find many of the terms unfamiliar at first, but
dinary. Questionnaires are used by therapists, as are man- the Task Force believes that, through use of the glossary
ual muscle testing, developmental testing, postural evalua- and the primer, the Standards can be understood and
tions, instrumented muscle testing, movement analysis, used by all therapists. The Task Force also recognizes that
and a whole variety of other tests. Instruments vary from many physical therapists will have to make a commitment
paper and pencil, to the therapist's hands, to complex of time and effort to learn these new terms and to learn
computer-based machines with elaborate peripheral de- about measurement. In the future, these terms will be
vices. We concluded that clinicians needed standards writ- more commonplace in the clinical literature of physical
ten by physical therapists for physical therapy practition- therapy.
ers. The Task Force, therefore, set out to develop
Standards specifically for physical therapy. In developing The Standards are meant to foster the continued growth
these Standards, the Task Force was aware that most phys- of high-quality care in physical therapy. They are highly
ical therapists receive little or no training in the science specific in describing what should be done to ensure
of measurement. The Task Force agreed that the final meaningful and useful measurements. Part of the Stan-
document must be sufficiently comprehensive to cover dards provides long-overdue guidance to persons devel-
the vast expanse of physical therapy measurements and oping tests and to persons teaching about testing. No
that it nnust also be practical. longer will clinicians independently have to ask purveyors
of tests to supply vital information. The Standards specify
The process of developing standards began with the en- what the providers should provide.
tire Task Force considering philosophical and practical
issues cluring 2 days of often heated, and always thor- The Standards consist of five sections. The first three
ough, discussion. After the Task Force worked out basic sections specify what is expected of test purveyors.
concepts, the writing of the Standards was delegated to a Three categories of purveyors are described: primary
three-rnember Working Group (Jules M Rothstein, Task purveyors, who originate tests; secondary purveyors,
Force coordinator; John L Echternach; and Harry G who conduct research and advocate the use of tests;
Knecht;,. The Working Group developed a draft document and tertiary purveyors, who are teachers. The fourth
that was initially reviewed by the rest of the Task Force. section contains the Standards for Test Users-physical
The draft was revised. The present version has been re- therapists. The fifth section describes standards for en-
vised on feedback from the physical therapy community, suring integrity in measurement studies. This last sec-
as well as on feedback from other interested parties and tion is adapted from the APTA's Standards for Integrity
from experts on measurement. in Physical Therapy Research.
In the training of physical therapists, measurement has all The Task Force originally had hoped to generate a series
too often been equated with research. Concerns about of guidelines that would be few in number and "user
the quality of measurements are, at times mistakenly, friendly." Early versions of the Standards proved that this
thought to relate to research and not to practice. Because was impossible. Attempts to generate fewer sections and a
therapists need help with the basic science of measure- more multipurpose document resulted in a cumbersome
ment, a primer on measurement has been prepared to set of standards that was difficult to apply. Because this is
comp1r:ment the Standards. The purposes of the primer the first document of its type in physical therapy and be-
are to provide physical therapists with explanations of cause of the nature of the subject, we found that the Stan-
basic concepts and to explore issues related to mea- dards needed to be comprehensive and to contain de-
surement. Eugene Michels, who began work with the tailed specifications. We found that when we attempted to
Task Force as an APTA staff member, wrote the initial produce briefer versions of the Standards and when we
draft of the primer. The primer is a tool that can be attempted to use fewer measurement terms, our drafts
used to help understand not only the Standards, but were unclear and could not be used as references. The
also is:jues related to tests and measurements them- Standards, although they are advisory, may read like a
rule book. Such books are not easily read, nor are they
The glossary describes terms as they are used in the Stan- Criterion-based (criterion-related) validity: see
dards. The glossary is not meant to be all-inclusive, but validity
rather to provide definitions for the terms as they are
used in the Standards. For further information about the Data: synonymous with measurements (see
terms, related concepts, o r other terms used in measure- measurement)
ment, consult the Primer.
Derived measurement: a measurement of an attribute
Alternate-forms (parallel-forms) reliability: see that is obtained as the result of a mathematical operation
reliability applied to an existing measurement of some other at-
tribute; an example is the measurement of leg-length dif-
Assessment: measurement, quantification, o r placing a
value or label on something; assessment is often confused
ference, which is derived by subtracting one leg-lengthi,
measurement from another
-
with evaluation; an assessment results from the act of as-
sessing (see evaluation and examination) Evaluation: a judgment based on a measurement; often
confused with assessment and examination (see assess-
Attribute: a variable; a characteristic o r quality that is m e n t and examination); evaluations are judgments of
measured the value or worth of something
Classification (categorization): assignment of an indi- Fxamination: a test or a group of tests used for the pur-
vidual or an entity to a group; assignment is based on pose of obtaining measurements o r data (see assess-
rules; groups are defined so that they allow all pertinent m e n t and evaluation)
entities or individuals to belong to the defined groups
(classes or categories are exhaustive) and so that they False negatives: persons who test negatively for some
allow entities or individuals to belong to only one possi- attribute but who, in fact, have that attribute (see t r u e
ble group (classes or categories are mutually exclusive) negatives)
Clinical decision: a determination that relates to direct False positives: persons who test positively for some
patient care, indirect patient care, acceptance of patients attribute but who, in fact, do not have that attribute (see
for trea.tment, and whether patients should be referred to t r u e positives)
other practitioners (this definition is modified from that
presented by Charles Magistro at a conference on Clinical Instrument: a machine, a questionnaire, or any device
Decisic~nMaking held under APTA auspices in October that is used as part of, or as a test to obtain, measure-
1988 in Lake of the Ozarks, Missouri); a diagnosis that ments
leads a therapist to take an action is a form of a clinical
decision; clinical decisions result in actions; when direct Internal consistency: see reliability
supporting evidence for clinical decisions is lacking, such
decisions are based on clinical opinions Intertester reliability: see reliability
Clinical opinion: a belief or idea that a physical thera- Intratester reliability: see reliability
pist holds regarding a patient; this opinion may be based
on the use of tests and measurements, but is not directly Measure: the act of obtaining a measurement (datum)
supported by evidence relating to those tests and mea-
suremt:nts; clinical opinions are based on the therapist's Measurement: the numeral assigned to an object, event,
evaluation of available information; clinical decisions (ie, or person or the class (category) to which an object,
determinations that cause the therapist to take an action) event, or person is assigned according to rules
that are based on the therapist's synthesis of information
are based on the clinical opinions of that therapist Normalization: a process that yields a new or trans-
formed measurement that is mathematically derived to
Concurrent validity: see validity change the distribution of measurements; normalization
procedures are often used to change the distribution of
Construct: a concept developed for the purpose of mea- data to make the distribution more congruent with a bell-
surement; support for the construct is through logical shaped (or normal) curve
argumentation based on the theoretical and research evi-
dence (see construct validity listed under validity) Objective measurement: a measurement that is not
affected by some aspect of the person obtaining the rnea-
Construct validity: see validity surement; the opposite of a subjective measurement (see
subjective measurement); measurements cannot be
Specificgty of a test: an indication of how well a diag- T r u e positives: persons who test positively for some
nostic test identifies people who should have a negative attribute and who, in fact, have that attribute (see false
finding; the numerical representation of specificity is a positives)
ratio formed by dividing the number of persons with a
true-negative response on a test by the number of per- Validity: the degree to which a useful (meaningful) inter-
sons who should have had a negative response (ie, the pretation can be inferred from a measurement
number of persons who are known to have properties
that would indicate that they should test negative) Concurrent validity: a form of criterion-based valid-
ity in which an inferred interpretation is justified by
Standardization: a process by which a score is con- comparing a measurement with supporting evidence
verted (transformed) into a relative score by using indices that was obtained at approximately the same time as
of central tendency and variability; a commonly used stan- the measurement being validated
dardized score is the z score; the term "standardization"
is also used to describe the process of systematization of Construct validity: the conceptual (theoretical) basis
the methods used to obtain a measurement; the process for using a measurement to make an inferred interpre-
of standardization, however, does not ensure reliability, tation; evidence for construct validity is through logical
because reliability can only be determined through the argumentation based on theoretical and research evi-
collectic~nof data (see reliability) dence (see construct)
Subjective measurement: a measurement that is af- Content validity: a form of validity that deals with
fected by some aspect of the person obtaining the mea- the extent to which a measurement is judged to reflect
surement (contrasts with objective measurement); sub- the meaningful elements of a construct and not any
jectivity relates to the reliability of measurements, extraneous elements
especially the intertester reliability; the more subjective
the measurement, the less reliable the measurement; sub- Criterion-based (criterion-related) validity:
jectivity, like reliability, is measured along a continuum three forms of criterion-based validity exist: concur-
rent validity, predictive validity, and prescriptive va-
Tertiary purveyor: see purveyor lidity; the common element is that, with each of
these forms of validity, the correctness of an inferred
Test: a procedure or set of procedures that is used to interpretation can b e tested by comparing a mea-
obtain measurements (data); the procedures may require surement with either a different measurement o r
the use of instruments data obtained by other forms of testing
Test manual: a booklet or book prepared by a primary Predictive validity: a form of criterion-based validity
test purveyor to guide the process of obtaining a mea- in which an inferred interpretation is justified by com-
surement and to provide documentation and justification paring a measurement with supporting evidence that is
for the test obtained at a later point in time; examines the justifica-
tion of using a measurement to say something about
Test setting: the environment in which a test is given, future events or conditions
including the physical setting and the characteristics of
that setting Prescriptive validity: a form of criterion-based
validity in which the inferred interpretation of a
Test uset: one who chooses tests, interprets test scores, measurement is the determination of the form of
or makes decisions based on test scores (this definition is treatment a person is to receive; prescriptive validity
from Standardsfor Educational and Psychological Tests; is justified based on the successful outcome of the
Ammencan Psychological Association, Washington, DC, chosen treatment
1974, page 1)
Standards for Primary Test P4.1. Operational definitions of attributes that the test
Purveyors (indicated with a P) measures must be provided in the test manual.
The Standards in this section describe requirements for P4.2. Operational definitions of terms used to describe
primary purveyors of tests. The following is the definition the population for whom the test is intended
of a primary purveyor. must be provided in the test manual.
P r t m a r y p u w e y o r : a person who develops, pro- P4.3. Operational definitions of terms used to describe
motes, or requires the use of tests; this definition in- potential test users must be provided in the test 1
cludes persons within clinical institutions who require manual.
the use of specific tests; persons who conduct continu-
ing education courses in which a major component P4.4. Operational definitions of terms used to describe
involves the advocacy of the use of specific testing pro- components of the test o r test instruments must
cedures are primary purveyors; any person (or organi- be provided in the test manual.
zation) who promotes (advocates) the use of tests by
selling testing equipment, manuals, books, or similar P4.5. Operational definitions of any unique terms cre-
f
materials is a primary purveyor; in the case of books o r ated by the primary purveyor must be provided
articles that serve as test manuals, the primary purveyor in the test manual.
is the author; persons who sell instruments that may be
used for testing, but who d o not describe or advocate P4.6. Operational definitions of any terms used in a
specific testing procedures, are not purveyors (see noncustomary (unusual) manner by the primary %
purveyor, s e c o n d a r y purveyor, and t e r t h r y p u r - purveyor must be provided in the test manual.
veyor) I,
P5. Test manuals provided by primary purveyors must
Organhtion of the Standards for Primary Purveyors: include descriptions of the populations for whom the
Primary purveyors are obliged to provide documentation tests are designed. Descriptions of subjects for whom
of essential elements for the tests and measurements they the tests should not be used and descriptions of sub-
are promoting. Documentation should be in the form of a jects for whom the tests should be used with caution
test manual. Most of the Standards for primary purveyors must be included.
describe the elements that should be included in test man-
uals. Qualitative requirements for the information to be P6. Test manuals provided by primary purveyors must
included in the manuals are presented within sections that include descriptions of procedures that will ensure
describe what should be included in the test manuals. safe test administration. Safety procedures must be
enumerated and should include specific instructions
PI. Persons o r organizations should not become primary as to when the test should be terminated if a subject
test purveyors unless they are prepared to adhere to has an adverse response.
the Standards.
P7. Test manuals provided by primary purveyors must
P2. Primary purveyors of tests must provide test manuals. include descriptions of the qualifications and compe-
Books that contain major sections dealing with tests tencies needed by test users. These descriptions
and include materials that promote and advocate the should include statements regarding potential conse-
use of tests are considered test manuals, and all stan- quences of unqualified users administering the test.
dards for test manuals apply to these books. Primary
purveyors are responsible for the quality (accuracy) P8. Test manuals provided by primary purveyors should
of all information in their manuals and must make describe how potential test users can obtain the com-
every effort to ensure that information in the manuals petencies necessary to administer the tests.
is in compliance with the Standards (eg, research
studies cited are in accordance with the Standards). P9. Test manuals provided by primary purveyors should
include narrative chronological accounts of the devel-
P3. Test manuals provided by primary purveyors must opment of the tests, including descriptions of the
include descriptions of the theoretical bases of the development of any instruments associated with the
tests and measurements, including discussions of the tests.
evidence supporting the construct validity and the
content validity of the measurements. The purpose of P9.l. A description of the test developer(s) must be
the test must be clearly described. provided in the narrative account in the test
manual.
P13. Test manuals provided by primary purveyors must P14.2. Descriptions of where the reliability research
include evidence for all relevant forms of reliability was conducted must b e provided in the test
and related information for the measurements de- manual.
scribed in the test manual.
P14.3. Descriptions of the sample(s) studied in the
P13.1. Descriptions of how information related to reli- reliability research must be provided in the test
ability was collected must be provided in the manual.
test manual, and all relevant references to peer-
reviewed publications must be supplied. P14.3.1. Descriptions must b e provided in the test
manual of how the sample studied in the
P13.2. Evidence relating to reliability must b e reported reliability research was selected.
in the test manual in a way that describes the
errors associated with common uses of the P14.3.2. The number of subjects studied in the reli-
measurements. ability research must be specified in the q
test manual.
P13.2.1. Intratester reliability estimates (indices)
4
must be reported in the test manual. P14.3.3. Descriptions of relevant clinical characteris-
Within-day and between-day studies should tics of the sample studied in the reliability
have been conducted in a clinical context research must be provided in the test man-
consistent with the intended use of the ual. A discussion of how the sample is rep-
measurements. Intratester reliability should resentative of the population for whom the
be reported in the test manual for all forms test is intended should b e included in the
of measurements, including self- test manual.
administered tests. L
P14.4. Descriptions of persons who obtained the
P13.2.2. Intertester reliability estimates (indices) measurements in the reliability research
must b e reported in the test manual. In- (ie, those who were in the role of test users)
tertester reliability studies should have must be provided in the test manual. De-
been conducted in a clinical context scriptions of their qualifications, competen-
consistent with the intended use of the cies, and experiences with the test should be
measurements. included. Any special information o r training
given to test users prior to their obtaining
P13.2.3. Internal consistency coefficients (or factor the measurements in the study should b e
structures) must be reported in the test described in the test manual.
manual when there is a need to demon-
strate that items o r elements contributing P14.5. Descriptions of the methods and research
to a measurement reflect one basic phe- design used in the reliability studies must be
nomenon o r dimension. Studies of internal provided in the test manual. The specific
consistency should have been conducted in types of reliability that were investigated
a clinical context consistent with the in- must b e specified.
tended use of the measurements.
P14.6. Descriptions of the statistics used to derive reli-
P13.2.4. Parallel-forms (alternative-forms) reliability ability estimates and the rationale for their use
must be reported in the test manual if must b e provided in the test manual. When
more than one version of the test is being methodologically appropriate, reports of confi-
described. Studies of parallel-forms reliabil- dence intervals and standard errors of measure-
ity should have been conducted in a clini- ments should b e included in the test manual.
cal context consistent with the intended use Examples of how the reliability estimates are to
of the measurements. b e used as part of data interpretation should be
included. Reliability estimates should b e accom-
P14. Test manuals provided by primary purveyors should panied by reports of regression data (ie, slopes
include descriptions of all research studies into the and intercepts) when appropriate for the statis-
reliability of the measurements described in the tical analysis.
manual, and all relevant references to peer-
reviewed publications must be supplied. P15. Test manuals provided by primary purveyors must
include evidence for all relevant forms of validity
P15.2.2. The content validity of the measurements P16.1. Descriptions of who conducted the validity re-
must be discussed in the test manual. Ex- search must be provided in the test manual.
perimental evidence as well as logical argu-
ments for the content validity of the mea- P16.2. Descriptions of where the validity research was
surements should be provided in the test conducted must be provided in the test manual.
manual.
P16.3. Descriptions of the sample(s) studied in the
P15.2.3. Evidence for concurrent validity must be validity research must be provided in the test
provided in the test manual when the pri- manual.
mary purveyor contends that the measure-
ments can be used to make inferences P16.3.1. Descriptions of how the sample in the va-
about the current status of an attribute at lidity research was selected must be pro-
the time the measurements are obtained or vided in the test manual.
shortly thereafter. This evidence must in-
clude logical and experimental data to sup- P16.3.2. The number of subjects studied in the va-
port the use of other measurements as cri- lidity research must be specified in the test
teria to justify a concurrent inference. The manual.
primary purveyor should not make claims
in the test manual for concurrent validity P16.3.3. Descriptions of relevant clinical characteris-
by comparing the measurement of interest tics of the sample studied in the validity
with another measurement (the criterion) research must be provided in the test man-
unless the criterion measurement has been ual. A discussion should be provided in the
shown to be valid (ie, it has been justified test manual of how the sample is repre-
for use as a criterion). sentative of the population for whom the
test is intended.
P15.2.4. Evidence for predictive validity must be
provided in the test manual when the pri- P16.4. Descriptions of persons who obtained the
mary purveyor contends that the measure- measurements in the validity research (ie,
ments can be used at the time they are ob- those who were in the role of test users)
tained to make inferences about the future must be provided in the test manual. De-
status of an attribute. This evidence must scriptions of their qualifications, competen-
include logical and experimental data to cies, and experiences with the test should be
support the use of other measurements as included. Any special information or training
criteria to justify a predictive inference. The given to test users prior to their obtaining
primary purveyor should not make claims the measurements in the study should be
in the test manual for predictive validity by described in the test manual.
comparing the measurement of interest
mate when methodologically appropriate. tics of the sample used to obtain the nor-
mative data must be provided in the test
P17. Primary purveyors who claim that measurements manual. These descriptions should include
can be used to classrfy persons into diagnostic reports of the central tendencies, variabili- @
groups based on the presence o r absence of a find- ties, and distributions of the data on rele-
ing (eg, cut scores o r tests that result in determina- vant clinical, demographic, and anthropo-
tions of negative or positive findings) must include metric (physical) characteristics.
in their test manuals the essential elements that al-
low for interpretation of these findings. In reporting P18.4. Descriptions of persons who took the measure-
these elements, the same standards as described for ments used to obtain the normative data (ie,
reports of validity must b e followed. those who were in the role of test users) must \
be provided in the test manual. The test manual
P17.1. Percentages of false positives and false negatives should include descriptions of test users' quali-
must be reported in the test manual. fications, competencies, and experiences with
the test. Any special information o r training
P17.2. Sensitivity of the test must be reported in the given to test users prior to their taking the mea-
test manual. surements in the study should be described in
the test manual.
P17.3. Specificity of the test must be reported in the
test manual. P18.5. Descriptions of the methods and research
design used to obtain the normative data
P17.4. Predictive values of positive and negative find- must be provided in the test manual. Norma-
ings (measurements) obtained with the test tive data should b e obtained using the same
must be reported in the test manual. measurement procedures that are described
in the manual. If other versions of the test
P18. Test manuals provided by primary purveyors must were used to obtain the normative data, o r if
include normative data when measurements are to other scales were used, there must be a dis-
be interpreted in terms of how an individual mea- cussion of how the normative data relate to
surement compares with measurements obtained the data that can b e obtained using the test
on other persons (ie, when the data are norm- described in the manual.
referenced).
P18.6. A complete discussion of limitations in the use
P18.1. Descriptions of who obtained the normative of the supplied normative data must be p r o
data must be provided in the test manual. vided in the test manual. The discussion may
include, but should not be limited to, consider-
P18.2. Descriptions of where the normative data were ations of whether the normative data relate to a
obtained must be provided in the test manual. particular local area, facility, ethnic group, age
group, o r gender.
P18.3. Descriptions of the sample studied to obtain
the normative data must be provided in the test P18.7. Details on any data transformations (eg, any
manual. standardization o r normalization procedures)
used in obtaining o r preparing the normative
data must be provided in the test manual.
be described by the secondary purveyor. errors of the estimate should be included. Ex- Ir
amples of how the validity estimates are to be
S13.4. Research reports on reliability written by sec- used as part of data interpretation should be
ondary purveyors must include descriptions of included Validity estimates should be accompa-
the methods and research designs used in their nied by reports of regression data (ie, slopes
S14.5. Secondary purveyors who state in research re- S15.3. Specificity of the test must be described in re-
ports or scholarly papers that measurements ports by secondary purveyors.
can be used to make inferences about the cur-
rent status of an attribute at the time the mea- S15.4. Predictive values of positive and negative
surements are obtained or shortly thereafter findings (measurements) obtained with the
must include logical and experimental data to test must b e described in reports by second-
support the use of other measurements as crite- ary purveyors.
ria to justify these concurrent inferences. Sec-
ondary purveyors should not make claims for S16. Secondary purveyors who include normative data in
concurrent validity by comparing the measure- their reports must include, to the extent allowed in
ment of interest with another measurement (the the publication in which the report will or may ap-
criterion) unless the criterion measurement has pear, essential elements required for the interpreta-
been shown to be valid (ie, it has been justified tion of these normative data.
for use as a criterion).
S16.1. Secondary purveyors must describe who (ie, the
S14.6. Secondary purveyors who state in research re- researcher) obtained the normative data they
ports or scholarly papers that measurements report.
can be used at the time they are obtained to
make inferences about the future status of an S16.2. Secondary purveyors must describe in their re-
attribute must include logical and experimental ports where the normative data were obtained.
data to support the use of other measurements
as criteria to justlfy these predictive inferences. S16.3. Secondary purveyors must describe in their re-
Secondary purveyors should not make claims ports the sample studied to obtain the norma-
for predictive validity by comparing the mea- tive data.
surement of interest with another measurement
(the criterion) unless the criterion measure- S16.3.1. Secondary purveyors must describe in their
ment has been shown to be valid (ie, it has reports how the sample used to obtain the
been justified for use as a criterion). normative data was selected.
S14.7. Secondary purveyors who state in research re- S16.3.2. Secondary purveyors must specify in their
ports or scholarly papers that measurements reports the number of subjects studied to
can be used to determine the choice of treat- obtain the normative data.
ment (ie, prescriptive validity) must base these
statements on research indicating that treatment S16.3.3. Secondary purveyors must explain in their
chosen on the basis of the measurement is reports how the sample used to obtain the
effective. Documentation of the effectiveness of normative data is characteristic of the popu-
treatment must be based on the use of valid lation for whom the measurement is in-
measurements and should be included in re- tended to be used.
ports by the secondary purveyors.
S16.3.4. Secondary purveyors must describe in their
S15. Secondary purveyors who claim in research reports reports relevant clinical characteristics of
or scholarly papers that measurements can be used the sample used to obtain the normative
to classify persons into diagnostic groups based on data. These descriptions should include
the presence or absence of a finding (eg, cut scores reports of the central tendencies, variabili-
or tests that result in determinations of negative or ties, and distributions of the data on rele-
positive findings) must report the essential elements vant clinical, demographic, and anthropo-
that allow for interpretation of these findings. In metric (physical) characteristics.
reporting these elements, the same standards as de-
scribed for reports of validity must be followed. This 516.4. Secondary purveyors must describe in their re-
information should be supplied to the extent al- ports the persons who took the measurements
lowed in the publication in which the report will o r used to obtain the normative data (ie, those
may appear. who were in the role of test users). Descrip-
tions of test users' qualifications, competencies,
S15.1. Percentages of false positives and false nega- and experiences with the test should be in-
tives must be described in reports by second- cluded. Any special information o r training
ary purveyors. given to test users prior to their obtaining the
S16.5. Secondary purveyors must describe in their re- S16.8. Secondary purveyors who report normative data
ports the methods and research designs used to for measurements that are based on interval or
obtain the normative data. Normative data ratio scales should present as part of the norma-
should be obtained using the same measure- tive data standard scores o r percentiles with
ment procedures that are described in the re- accompanying measures of central tendency and
port. If other versions of the test were used to variability. Data for clinically meaningful sub-
obtain the normative data, o r if other scales groups should be similarly reported.
were used, there must be a discussion of how
the normative data relate to the data that can be S16.9. Secondary purveyors who report normative
obtained using the test described in the report. data for measurements that are based on or-
dinal o r nominal scales should present the
S16.6. Secondary purveyors must supply in their re- normative data in the form of the proportion
ports a complete discussion of limitations in the of persons in the population who can be ex-
use of the normative data they report. The dis- pected to belong to each group and sub- \
cussion may include, but should not be limited group. Data for clinically meaningful sub-
to, considerations of whether the normative groups should be similarly reported.
data relate to a particular local area, facility, eth-
nic group, age group, o r gender. S17. Advocacy by secondary purveyors for the use of a
measurement must not exceed a level that can be
S16.7. Secondary purveyors must supply in their re- supported by the research of the secondary pur-
ports details on any data transformations (eg, veyor o r by other published data.
any standardization o r normalization proce-
T2. Ten:iary purveyors must have a basic knowledge of T2.13. Tertiary purveyors must understand the meth-
the theory and principles of tests and measurements. ods and effects of normalizing o r standardizing
measurements.
T2.1. Tertiary purveyors must understand what con-
stitutes a measurement, what constitutes a T2.14. Tertiary purveyors must understand the mean-
test, and the role of instruments in obtaining ing and implications of reactivity to tests.
measurements.
T3. Tertiary purveyors should promulgate the Standards
T2.2. Tertiary purveyors must understand the differ- for Test Users and, in their teaching, should provide
ences between clinical opinions (impressions) potential test users with the necessary tools and in-
that are not based on valid measurements and formation so that these potential test users can ad-
inferences that are based on the use of valid here to user standards.
measurements.
T4. Tertiary purveyors, when discussing a test, must pro-
T2.3. Tertiary purveyors must understand what consti- vide descriptions of the theoretical bases for the test
tutes an operational definition and the impor- and must discuss evidence relating to construct and
tance of using operational definitions. content validity.
T5.1. Tertiary purveyors must provide operational defi- T8.3. Tertiary purveyors must discuss limitations in the
nitions for attributes that the test measures. normative data. This discussion may include, but
should not be limited to, considerations of
T5.2. Tertiary purveyors must provide operational defi- whether the data relate to one local area, facility,
nitions for terms used to describe the population ethnic group, age group, o r gender.
for whom the test is intended.
T8.4. Tertiary purveyors must discuss details on any
T5.3. Tertiary purveyors must provide operational defi- data transformations used and whether any stan-
nitions for terms used to describe potential test dardization o r normalization procedures were
users. used in generating the normative data.
T5.4. Tertiary purveyors must provide operational defi- T8.5. Tertiary purveyors who discuss measurements
nitions for terms used to describe components that are interval o r ratio scaled must provide
of the test o r test instrument. standard scores o r percentiles with measure- t
ments of central tendency and variability, if these
T5.5. Tertiary purveyors must provide operational defi- data are available. Data for meaningful subgroups
nitions for any terms created by purveyors. should be similarly reported. If these data are
lacking, the tertiary purveyor should discuss the
T5.6. Tertiary purveyors must provide operational defi- limitations in the use of the normative data.
nitions for any terms they use in a noncustomary I
T8. Tertiary purveyors must provide all information that T11. Tertiary purveyors, in discussing a specific test, must ",
js available when they convey or discuss normative provide descriptions of the test and instruments
data. associated with the test.
T8.1.Tertiary purveyors must describe the methods T1l.l Tertiary purveyors must discuss available docu-
used to obtain the sample that was used to ob- mentation of relevant technical information re-
tain the normative data. The generalizability of garding performance characteristics of any ma-
the normative data must be characterized relative chines, recording devices, transducers, computer
to the sampling method. interfaces, and similar instruments. The tertiary
%
purveyor should identify the source of this doc-
T8.2. Tertiary purveyors must describe the sample umentation. If documentation is not available,
studied (eg, the number of subjects and the the tertiary purveyor must discuss the implica- I
distributions of relevant clinical, demographic, tions and limitations of using such instruments.
and anthropometric [physical] characteristics).
How this group is characteristic of the popula- T11.2.Tertiary purveyors must describe how instru-
ments used in the test manipulate or process
T12.4. Tertiary purveyors must describe how the test T15. Tertiary purveyors must include in their discussions
user must manipulate o r process information in of a test descriptions of all special groups for whom
order to obtain the desired measurements. the test is contraindicated o r known to lead to mea-
surements of questionable validity.
T12.5. Tertiary purveyors must provide instructions to
potential test users for use of any instruments T16. Tertiary purveyors, when discussing measurements
required to obtain the desired measurements. used to classify persons into groups based on the
These instructions, where appropriate, must presence o r absence of a diagnostic finding (eg, use
include machine settings and any other user- of cut scores o r tests to determine a positive o r neg-
selected options. Descriptions of the effects of ative finding), must discuss the limitations of these
all options on the measurements and the conse- measurements.
quences of selecting the incorrect options
should be included. T16.1. Tertiary purveyors must report the percentages
of false positives and false negatives for the
T12.6. Tertiary purveyors who discuss tests that in- measurements they discuss. If this information
volve the use of instruments must describe how is not available, tertiary purveyors must dlscuss
the instruments are calibrated. A means of test- the limitations of using these measurements.
ing calibration must be described. If calibration
is needed, the tertiary purveyor must provide T16.2. Tertiary purveyors must report the sensitivity of
instructions regarding a course of action to be the tests they discuss. If this information is not
taken. available, tertiary purveyors must discuss the
limitations of using these tests.
T12.7. Tertiary purveyors must describe variations in
the test procedures that are available to the test T16.3. Tertiary purveyors must report the specificity of
user. Descriptions of variations that are known the tests they discuss. If this information is not
not to impair the quality of the measurements available, tertiary purveyors must discuss the
and descriptions of variations that are known to limitations of using these tests.
Organization of the Standards for Test Users: Four U3.9. Test users must understand what constitutes
basic types of Standards are found in the Standards for meaningful normative data and how such data
Test Users. The Standards listed first detail the general can be used.
knowledge that a test user must have. The majority of the
Standards in this section deal with specific requirements U3.10. Test users must understand the differences be-
that a user should consider when performing specific tween objective measurements and subjective
tests. These Standards include issues relating to the measurements and the implications of using
choice of tests, the performance of testing, observing the each type of measurement.
rights of test takers, and the use of obtained measure-
ments. The last two Standards, U44 and U45, describe the U3.11. Test users must understand the meaning and
requirements test users should observe in interpreting use of the terms "false negatives," "false posi-
and reporting test results. tives," "true negatives," "true positives," "pre-
dictive value of a measurement," "specificity of
U1. Persons should not become test users unless they a test," and "sensitivity of a test."
are prepared to adhere to the Standards and under-
stand the requirements for test purveyors. U3.12. Test users must understand the importance
of knowing the technical specifications of
U2. Test users must have a basic understanding of local, instruments.
state, and federal laws governing the use of tests in
their practice settings. U3.13. Test users must understand the importance of
calibrating instruments.
U3. Test users must have a basic knowledge of the the-
ory and principles of tests and measurements. U3.14. Test users must have a basic understanding of
the methods and effects of normalizing or stan-
U3.1. Test users must understand what constitutes a dardizing measurements.
measurement, what constitutes a test, and the
role of instruments in obtaining measurements. U3.15. Test users must understand the meaning and
implications of reactivity to tests.
U3.2. Test users must understand the differences be-
tween clinical opinions (impressions) that are U4. Test users must have background knowledge in ba-
not based on valid measurements and inferences sic, applied, and clinical sciences related to the selec-
that are based on the use of valid measurements. tion, administration, and interpretation of each test
they use.
U3.3. Test users must understand what constitutes an
operational definition and the importance of U5. Test users must understand the theoretical bases
using operational definitions. (construct and content validity) for the tests they use,
and they must have knowledge about the attribute
U3.4. Test users must understand the different levels (characteristic) being measured.
of measurement (ie, nominal, ordinal, interval,
and ratio) and the mathematical operations that U6. Test users must be familiar with the development of
are appropriate for each level. tests that they use and the test settings in which
those tests have been developed and used.
U3.5. Test users must understand types of validity and
how these types of validity relate to the use of U7. Test users must understand how a test they are using
measurements. relates to similar tests o r previous versions of the
same test.
U10. Test users must understand all operational defini- U13.2. Test users who deviate from accepted direc-
tions related to tests they use. tions for obtaining a measurement should not
use published data o r documentation relative
U1O.l. Test users must understand the operational to reliability and validity to justify their use of
definitions for attributes that the test measures. the measurement.
U10.2. Test users must understand the operational U14. Test users must know what information and instruc-
definitions for terms used to describe the pop- tions are to be given to the person being tested.
ulation for whom the test is intended. Test users should be able to answer questions
about the test and related subjects.
U10.3. Test users must understand the operational
definitions for terms used to describe potential U14.1. Test users who do not give the purveyor's spec-
test users. ified instructions to persons being tested, o r
test users who are unable to give these instruc-
U10.4. Test users must understand the operational tions, should not use published data o r docu- c
definitions for terms used to describe compo- mentation relative to reliability and validity to
nents of the test o r test instruments. justrfy their use of the measurements.
U10.5. Test users must understand the operational U15. Test users must know the physical settings in which
definitions for any terms created by purveyors the test should be given and the possible effects of
of the test. conducting the test in other settings.
U10.6. Test users must be able to identify and under- U16. Test users must be able to identify any conditions
stand the operational definitions for any terms o r behaviors in the person being tested that may
used in a noncustomary manner. compromise the reliability o r validity of their mea-
surements (eg, if a modified position must be used
U11. Test users must be able to describe the population in manual muscle testing because of a deformity).
for whom the test was designed. Test users must be Test users who observe such conditions or behav-
able to relate this description to the persons they iors should note these observations in their reports
are testing. of any resultant measurements. Test users who be-
U17.1. Test users must know relevant technical infor- U23. Test users who are required to derive o r trans-
mation regarding performance characteristics of form measurements must have sufficient training
any machines, recording devices, transducers, and knowledge to derive o r transform those
computer interfaces, and similar instruments measurements. Test users must have the back-
they use. Test users should be able to identify ground information and skills needed to derive
the source of this information. If this informa- measurements o r make categorizations necessary
tion is not available, the test user must be able for interpretation of their measurements (eg,
to discuss the implications and limitations of how to normalize o r standardize a score o r how
using such instruments. to classify a measurement).
U17.2. Test users must b e able to describe how in- U24. Test users must be aware of any normative data for
struments they use manipulate or process the measurements they are obtaining (see Standard
information in order to obtain measure- U44.3 for guidelines on using normative data to
ments. Test users should identify the source interpret measurements; see Standard U45.10 for
of this information. If this information is not guidelines on reporting measurements related to
available, the test user must be able to dis- normative data). Test users should be able to evalu-
cuss the implications and limitations of using ate critically normative data and use the data for
such instruments. clinical decision making.
U18. Test users must know how to use any instruments U25. Test users must make every effort to control the
required to obtain the desired measurements. This environment (test setting) in which they test in or-
Standard includes, where appropriate, the test user der to maintain consistent conditions between tests.
knowing how to choose machine settings and other These efforts are needed to ensure that the validity
user-selected options. Test users must be able to and reliability of a measurement are not compro-
discuss the effects of all options on their measure- mised.
ments and the consequences of selecting the incor-
rect options. U26. Test users must make every effort when personal
information is being obtained to control the envi-
U19. Test users must be able to describe how instru- ronment (test setting) in which they administer tests
ments they use for a test are calibrated, including in order to preserve the privacy of the person tak-
the means of testing calibration. Test users must ing the test.
know the course of action to be taken when cali-
bration is needed. U27. Test users must be able to discuss common errors
in the interpretation of the measurements they use.
U20. Test users, for all the tests they use, should be able
to describe variations in the test procedures that are U28. Test users must make every effort to minimize the
available. Test users must be able to describe varia- effects of reactivity associated with the tests they
tions that are known not to impair the quality of the use.
measurements and those variations that are known
t o lead to measurements of questionable validity. U29. Test users should report to the purveyor of the
test any problems regarding a test o r any associ-
U21. Test users who deviate from accepted directions for ated instruments.
obtaining a measurement should not use published
data o r documentation relative to reliability and U30. Test users should communicate with other test us-
validity to justify their use of the measurement. ers and purveyors regarding their experiences with
tests.
U21.1. Test users who administer tests in settings
other than those recommended by the pur- U31. Test users must avoid giving persons prior knowl-
veyor should not use published data o r docu- edge about the natqre of a test when such knowl-
mentation relative to reliability and validity to edge is known to compromise the validity of the
justify their use of the measurement. measurements.
U33. Test users should not share results of tests with per- U35. Test users must maintain records in such a man-
sons (or organizations) who are likely to misuse ner that information about tests and measure-
that information. ments is accurate and is not likely to be distorted
o r lost. Abbreviations used in communications
U34. Test users must respect the rights of persons whom should be limited to those that appear in estab-
they test. lished references.
U34.1. Test users must respect the right of persons to U36. Test users have a responsibility to report inappro-
refuse to be tested. Test users must allow per- priate test use to proper authorities.
sons to discontinue participation in any test at
any time without recrimination or prejudice U36.1. Test users who know that a person's rights are
against that person. not being observed during testing must make
every effort to change that situation.
U34.2. Test users must inform persons whom they test
of potential risks and benefits that persons may U37. Test users should select tests based on what is
experience as a result of taking the test. best for the person being tested. Test selection
based on considerations of personal benefit to
U34.3. Test users must respect the right of persons the test user, test purveyor, or the referring prac-
being tested to know the results of tests, the titioner is inappropriate.
interpretations of those test results, and with
whom the test results will be shared. The right U38. Test users, in clinical practice, should avoid the use
of the person to know the results of tests does of tests that were designed solely for research pur-
not imply that all test users must personally poses. Such tests, when theyare used in the clinical
supply this information. In some cases, test re- setting, should be identified in all reports as re-
sults may be supplied by the professional who search tests that have not necessarily been shown to
originated a referral or who is coordinating be reliable or valid in clinical use.
treatment.
U39. Test users should not assign persons to conduct
U34.4. Test users who fail to adhere to the Standards tests unless they know that such persons are quali-
and who use tests inappropriately, especially in fied to conduct the tests.
terms of drawing unwarranted conclusions
from results, violate the rights of persons being U40. Test users should not make promotional claims for
tested. their testing procedures that are not supported by
research literature.
U34.4.1. Test users who misrepresent their clinical
opinions as being based on test results U40.1. Test users are responsible for the critical evalu-
when evidence for such opinions is not ation of all claims of test purveyors and should
found in the research literature violate the not merely repeat the claims of purveyors with-
rights of persons taking tests. (For exam- out critical evaluation of these claims.
ple: A test user may use a battery of tests to
determine the ability of a patient with low U41. Test users should assist in the development and
back pain to function in an industrial envi- refinement of testing procedures by sharing their
ronment. In this hypothetical example, the knowledge of tests and assisting in the collection of
test battery yields a measurement that is data where appropriate.
supposed to predict the type of work that
the patient may d o safely. There is, in this U42. Test users have a responsibility to periodically re-
example, evidence for the validity of this view the test procedures they and their colleagues
inference. However, based on the test us- use in their institutions (practice settings) to ensure
er's observations, the test user concludes that appropriate use of measurements is being
that the patient is malingering. This is the
U44.2. Test users must consider the error associated U44.4.2. Test users must consider the sensitivity
with their measurements when they interpret of the diagnostic test they are using
their test results. Reliability and validity esti- when they interpret their measurements.
mates should be considered when the test user If this information is not available, test
makes interpretations of measurements. (For users should understand the limitations
example: Reliability studies have indicated that of making interpretations based on their
a measurement varies as much as 10% between measurements.
repeated tests. Therefore, a change of less than
10% on that measurement may be due, at least U44.4.3. Test users must consider the specificity
in pan, to measurement error. Test users who of the diagnostic test they are using
note changes the second time they take mea- when they interpret their measurements.
surements should consider, before they make If this information is not available, test
interpretations, that the change may not reflect users should understand the limitations
real change, but may be due solely to measure- of making interpretations based on their
ment error.) measurements.
U44.3. Test users must consider whether normative U44.4.4. Test users must consider the predictive
data are available for the measurements they values of positive and negative findings
interpret. Test users must consider the when they interpret their measurements
sources of the normative data and how appli- obtained with a diagnostic test. If this infor-
cable these data are to the measurements mation is not available, test users should
they are interpreting. understand the limitations of making inter-
pretations based on their measurements.
U44.6. Test users must consider whether changes (eg, U45.5. Test users must consider estimates of reliability
attributable to development o r learning) in the and validity when reporting test results. Test
person being tested may alter performance on users should report estimates of the errors as-
subsequent tests. Test users, when appropriate, sociated with a measurement when they report
should discuss in their reports of test results test results. (For example: Reliability studies
the possibility of change in the future. Test us- have indicated that a measurement varies as
ers should not imply that a test result repre- much as 10% between repeated measurements.
sents an immutable state when there is reason Therefore, a change of less than 10% may be
to believe that the test result may differ if the due, at least in part, to measurement error. Test
test is repeated at some future time. users who note changes the second time they
take measurements, in reporting such measure-
U44.7. Test users must consider the conditions under ments, should also report that the change in
which they conduct tests and the extent to the measurement may not reflect real change.
which results are generalizable to other test The change may be solely due to measurement
situations (eg, testing in other places o r at error. A report of the reliability estimate o r
other times). standard error, in this case, would be useful in
the test user's report.)
U44.8. Test users must identify whether their interpre-
tations are based on the results of multiple U45.6. Test users should include warnings about com-
measurements obtained with the same test o r mon misinterpretations of their measurements
on the results of a single measurement. in reports of their measurements.
U44.9. Test users must identify whether any of their U45.7. Test users should report any significant effects
interpretations are not supported by research of reactivity when they report the results of
evidence of validity. Such interpretations must their tests.
be clearly identified as being based on the test
user's personal opinion. U45.8. Test users who use a variation of a test must
indicate, when they report test results, that a
U45. Test users reporting the results of tests must supply variation was used. The test users must note
adequate information so that these results can be whether they believe that the variation may
understood. (The following Standards provide have affected the quality of their measurements.
guidelines for reporting about measurements. Test users who believe the variation had a sig-
These Standards are not meant to supersede or in nificant effect on the measurements should dis-
any way modify the requirements specified else- cuss this belief in all reports of test results.
where in the Standards for Test Users.)
U45.9. Test users should report any aspect of the test
U45.1.Test users should specify, when more than one that may cast doubt on test results (eg, ways in
form of a test exists, the specific form of the which the person tested differed from the pop-
test used when they report their results. ulation for which the test was designed o r any
observation the test user made during testing).
U45.2. Test users should report measurements in the
form specified by the purveyor's instructions. U45.10. Test users, in reports of test results, should
Test users should justify any deviations from relate their measurements to normative data,
standard methods of reporting. if available. Test users should report the
source of the normative data they use and, if
U45.3. Test users should use only the terms that are necessary, discuss how applicable the data are
defined in test manuals or in other support- to the measurements they are reporting (see
ing literature when they discuss tests o r Standard U44.3 for guidelines on using nor-
measurements. Descriptive terms that are not mative data to interpret measurements).
defined should be avoided, because such
terms may encourage inappropriate interpre- U45.10.1. Test users using normative data, when
tation of results. they report test results, should report
all information necessary to understand
R13.2. Physical therapists should honor the requests of R19. Physical therapists who are participating in measure-
professional colleagues for access to their mea- ment studies must dissociate themselves from any
surement research data. activities that are unethical or unlawful. Physical
therapists must make every effort to take corrective
R14. Physical therapists must have a thorough knowledge action when they encounter unethical, unlawful, or
of measurement theory and pertinent professional incompetent acts in the conduct of measurement
and scientific literature before they conduct mea- research. In addition, they are obligated to report
surement studies. Measurement studies must be any unethical, unlawful, or incompetent acts of any
carried out with due consideration given to this person to the appropriate authorities.
bcdy of knowledge.
R20. Physical therapists must assist the profession in doc-
R15. Physical therapists who conduct measurement re- umenting the integrity of measurement research.
search are obligated to know their personal limita- Physical therapists called upon to assist in inquiries
tions and to know when to seek consultation and about any measurement research should cooperate.
peer review of their research plans before they be- There should be no recriminations against any per-
gin data collection. son called upon to serve as an investigator. A per-
son should not investigate any other person's work
R16. Physical therapists, in written reports of measure- unless that person possesses the necessary knowl-
ment studies, must ensure that all references are edge and skills to evaluate objectively all available
correct and that citations are used appropriately. data.
References must directly support information in the
sentence in which the references are cited. If the
references supply indirect support for the state-
ment, this support should be indicated.