
Lecture on Validity

Validity

“Validity refers to the degree to which evidence and theory support the
interpretations of test scores entailed by proposed uses of tests. Validity is,
therefore, the most fundamental consideration in developing and evaluating tests”
(AERA et al., 1999, p. 9).

Validity?

• Does the test measure what it is intended or designed to measure?

• Refers to the degree to which evidence and theory support the interpretations
of test scores entailed by proposed uses of tests (Standards: AERA, 1999).

Threats to Validity

• Construct Underrepresentation: present when a test does not measure important aspects of the specified construct.

• Construct-Irrelevant Variance: present when the test measures characteristics, content, or skills that are unrelated to the test construct.

Nature of Validity
• Validity refers to the appropriateness of the interpretation of the results of an
assessment procedure for a given group of individuals, not to the procedure
itself.
• Validity is a matter of degree; it does not exist on an all-or-none basis.
• Validity is always specific to some particular use or interpretation.
• Validity is a unitary concept.
• Validity involves an overall evaluative judgement.

Relationship Between Reliability & Validity

• Reliability is necessary for validity, but is not sufficient.

• A test can be reliable without being valid, but a test cannot be valid without
being reliable.

• Validity analysis is often based on correlational analyses between tests and validation measures.

• The reliability of the test (and of the validation measures) sets an upper limit on the validity coefficient.
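This ceiling follows from the classical correction for attenuation: the observed validity coefficient cannot exceed the square root of the product of the two reliabilities. A minimal sketch (the reliability values are illustrative, not from any real test):

```python
import math

def validity_ceiling(rel_test: float, rel_criterion: float) -> float:
    """Maximum possible validity coefficient implied by the
    reliabilities of the test and the criterion measure:
    r_xy <= sqrt(r_xx * r_yy)."""
    return math.sqrt(rel_test * rel_criterion)

# A test with reliability .81 and a criterion with reliability .64
# cannot correlate above sqrt(.81 * .64) = .72, however well the
# test captures the construct.
print(validity_ceiling(0.81, 0.64))  # 0.72
```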

Traditional “Types” of Validity

• Content Validity: does the test provide an accurate estimate of the test taker’s mastery of the specified content domain?

• Criterion-Related Validity: can the test predict performance on a specified criterion?

• Construct Validity: does the test accurately reflect the dimensions underlying
the construct measured by the test. Based on an integration of evidence
collected using a variety of research strategies.

Sources of Validity Evidence versus Types of Validity

• The common practice has been to refer to types of validity, e.g., content
validity.

• However, validity is a unitary concept reflecting the degree to which all the
accumulated evidence supports the intended interpretation of test scores for
proposed purposes.

• As a result, the current emphasis in the “Standards” and advanced texts is to refer to types of validity evidence rather than distinct types of validity.

• Instead of “Content Validity,” this approach refers to “Evidence Based on Test Content.”

• Instead of “Criterion-Related Validity,” this approach refers to “Evidence Based on Relations to Other Variables.”

Validity Evidence Based on Test Content (previously Content Validity)

• Focuses on how well the test items sample the behaviors or subject matter
the test is designed to measure.
• Does the test provide an accurate estimate of the test taker’s mastery of the
specified content domain?
• In other words, does the test adequately measure the content, behaviors, or skills it is thought to measure?
• Most relevant, appropriate, and important for tests used to make inferences
about the knowledge and/or skills assessed by a sample of items such as
achievement tests and employment tests.
• Traditionally, validity evidence based on test content has involved a qualitative
process in which the test is compared to a detailed description of the test
domain.
• Uses respected experts who specify the domain that should be covered and
help determine how representative the test’s items are.

Face Validity

• Not technically a facet of validity, but refers to a test “appearing” to measure what it is designed to measure.
• Whereas content-based evidence of validity is acquired through a systematic and technical analysis of the test content, face validity involves only the superficial appearance of a test.
• Face Validity is often desirable as it makes the test more acceptable to those
taking the test and the general public.

• Face Validity can be undesirable if it makes the test “transparent” to the point
that it makes it easy for test takers to feign symptoms (e.g., forensic &
employment settings).

Validity Evidence Based on Relations to Other Variables

• Test-Criterion Evidence: Can the test predict performance on a specified criterion? How accurately do test scores predict criterion performance? (traditionally considered Criterion-Related Validity)
• Convergent & Discriminant Evidence: Examines relationship with existing
tests that measure similar or dissimilar constructs. (traditionally incorporated
under Construct Validity)
• Contrasted Group Evidence: Do different groups perform differently on the
test? (traditionally incorporated under Construct Validity)

Test-Criterion Evidence

• The criterion variable is a measure of some attribute or outcome that is of primary interest, as determined by the test users.

• For example, the SAT is used to predict academic performance in college, so the criterion variable is college GPA.

Approach #1: Predictive Studies

• Predictive Studies: indicates how accurately test data can predict criterion
scores that are obtained at a later time. When prediction is the intended
application, predictive studies retain the temporal differences and other
characteristics of the practical situation.
• Potential problems - predictive studies are time-consuming and expensive.
• Since predictive validity studies require an unselected validation group, people may be admitted to programs or given jobs without regard to their performance on the predictor, even when their scores suggest a low probability of success.

Approach #2: Concurrent Studies

• Concurrent Studies: obtains predictor and criterion data at the same time.
• Concurrent evidence is most useful when prediction over time is not
important. For example, examining the relationship between the results of a
brief paper-and-pencil questionnaire with that of a clinical interview.

Convergent & Discriminant Evidence


• Convergent evidence is obtained when you correlate a test with existing tests
that measure similar constructs.

• Discriminant evidence is obtained when you correlate a test with existing tests
that measure dissimilar constructs.

Known-Groups Technique

According to Shaw and Wright (1967), this technique is based on the premise that two disparate groups should hold different attitudes toward a given object or situation. A valid instrument measuring the construct or trait in question should therefore yield different scores for the two groups.
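Known-groups evidence can be checked with a simple mean comparison, for instance Welch's t statistic between two groups that theory says should differ. A sketch with hypothetical attitude-scale scores (groups and values are invented for illustration):

```python
from math import sqrt
from statistics import mean, stdev

# Hypothetical attitude-scale scores for two groups expected, on
# theoretical grounds, to differ on the construct being measured.
group_a = [72, 68, 75, 70, 74, 69, 71, 73]  # expected high scorers
group_b = [48, 55, 51, 46, 53, 50, 49, 52]  # expected low scorers

# Welch's t statistic: the mean difference divided by the standard
# error of that difference. A large |t| supports known-groups
# validity, since the instrument separates groups it should separate.
se = sqrt(stdev(group_a) ** 2 / len(group_a)
          + stdev(group_b) ** 2 / len(group_b))
t = (mean(group_a) - mean(group_b)) / se
```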

Factors Influencing Validity

1. Unclear directions.
2. Reading vocabulary and sentence structure too difficult.
3. Ambiguity.
4. Inadequate time limits.
5. Inappropriate level of difficulty of the test items.
6. Poorly constructed test items.
7. Test items inappropriate for the outcomes being measured.
8. Test too short.
9. Improper arrangement of items.
10. Identifiable pattern of answers.
