Professional Documents
Culture Documents
CONTINUED
• Content-Related Validity
Norm-Referenced Reliability & Validity
• Criterion-Related Validity
• Statistical or correlational
• concurrent
• predictive
• Construct-Related Validity
• Also known as face validity and logical validity • Also known as statistical validity and correlational validity
• Concurrent validity (criterion is measured at about the
• Content experts, expert judges, colleagues,
same time as the alternative measure)
and textbook writers can serve as sources to
• Predictive validity (criterion is measured in the future)
validate instrument content
Chapter 6 1
EXAMPLE OF CONCURRENT VALIDITY EXAMPLES OF PREDICTIVE VALIDITY
VO2max (criterion: oxygen consumption) • Heart disease (criterion: heart disease developed in later
life)
• Distance runs (e.g., 1.0 mi [1.6 km], 1.5 mi [2.4
km]; 2km [1.2 km] ; 9 min, 12 min; 20-m [21.9 yd] • Present diet, exercise behaviors, blood pressure,
shuttle) family history of heart disease or related health issues
• Submaximal (e.g., cycle, treadmill, swimming) • Success in graduate school (criterion: grade point
• Nonexercise models (e.g., self-reported physical average or graduation status)
activity) • Graduate Record Examination scores
• Undergraduate grade point average
Chapter 6 2
CONSTRUCT-RELATED VALIDITY FACTORS INFLUENCING VALIDITY
(CONTINUED) • A test is valid only for a particular group at a particular time for a
particular purpose. There is no such thing as a generally valid test.
For example, a test that is valid for college students may not be for
An example of known group difference method of elementary students. A test for middle-class white girls may be
construct validation: invalid for other groups. Thus, the culture or unique characteristics
of a group often affect a test's validity for that group.
• A student who is poor at golf should score poorly on a golf • The I.Q. test turned out to be somewhat invalid for persons of
test; experienced golfers should obtain higher scores, and color growing up in low-income households. When tests created
specifically for them were administered to middle-class to upper-
members of the golf team should score the best
class whites, the results were reversed.
• Testing difference of means with ANOVA could provide • Other factors are unclear directions, strange patterns of
construct-related evidence for the golf test correct answers on test. For motor tests, novelty may influence the
validity, so a practice round should be available.
• Also, reliability and objectivity. Remember that for a test to be
valid, it must be reliable, but a test can be reliable without having
validity.
Standard error
SE of Measurement
Standard error of prediction
SEM Sy * 1 rxx '
SEE Sy * 1 r 2 xy SE of Estimate
SEE Sy * 1 r 2 xy
Chapter 6 3
FIGURE 6.1
DIAGRAM OF VALIDITY AND Interpreting the “r” you obtain
RELIABILITY TERMS
Error
Chapter 6 4
CONCURRENT VALIDITY CONCURRENT VALIDITY
D – it has the MOST error
and requires 4 tests to
be administered
A B C D A B C D
Consider the Concurrent validity of Which test battery would you be LEAST
the above 4 possible skills test batteries likely to use? Why?
A B C D A B C D
Which test battery would you Which test battery would you
be MOST likely to use? Why? use if you are limited in time?
Chapter 6 5
INTERPRET THESE INTERPRET THESE
CORRELATIONS CORRELATIONS
Actual Putting Putting Driving Driving Observer Observer Actual Putting Putting Driving Driving Observer Observer
golf score
Criterion Trial 1 Trial 2 Trial 1 Trial 2 1 2 golf score Trial 1 Trial 2 Trial 1 Trial 2 1 2
Observer 2 .39 .30 .41 .47 .35 .50 1.00 Observer 2 .39 .30 .41 .47 .35 .50 1.00
INTERPRET THESE
CORRELATIONS SCATTERPLOT
Actual Putting Putting Driving Driving Observer Observer
golf score Trial 1 Trial 2 Trial 1 Trial 2 1 2
Two trials of Leg Press
Actual golf 1.00 Line of identity
score
Putting T1 .78 1.00 Prediction line
What is this?
Putting T2 .74 .83 1.00
Chapter 6 6
ESTIMATING AGREEMENT BETWEEN
CORRELATION MEASURES: BLAND–ALTMAN METHOD
SUMMARY
Chapter 6 7