Professional Documents
Culture Documents
Validity
“Validity refers to the degree to which evidence and theory support the
interpretations of test scores entailed by proposed uses of tests. Validity is,
therefore, the most fundamental consideration in developing and evaluating tests”
(AERA et al. 1999, p. 9).
Validity?
• Refers to the degree to which evidence and theory support the interpretations
of test scores entailed by proposed uses of tests (Standards: AERA, 1999).
Threats to Validity
Nature of Validity
• Validity refers to the appropriateness of the interpretation of the results of an
assessment procedure for a given group of individuals, not to the procedure
itself.
• Validity is a matter of degree; it does not exist on all-or-none basis.
• Validity is always specific to some particular use or interpretation.
• Validity is a unitary concept.
• Validity involves an overall evaluative judgement.
• A test can be reliable without being valid, but a test cannot be valid without
being reliable.
• The reliability of the test (and validation measures) set an upper limit on the
validity coefficient.
• Content validity: does the test provide an accurate estimate of the test taker’s
mastery of the specified content domain.
• Construct Validity: does the test accurately reflect the dimensions underlying
the construct measured by the test. Based on an integration of evidence
collected using a variety of research strategies.
• The common practice has been to refer to types of validity, e.g., content
validity.
• However, validity is a unitary concept reflecting the degree to which all the
accumulated evidence supports the intended interpretation of test scores for
proposed purposes.
• Focuses on how well the test items sample the behaviors or subject matter
the test is designed to measure.
• Does the test provide an accurate estimate of the test taker’s mastery of the
specified content domain?
• In other words, does the test adequately measure the content, behaviors, or
skills it is thought to measure.
• Most relevant, appropriate, and important for tests used to make inferences
about the knowledge and/or skills assessed by a sample of items such as
achievement tests and employment tests.
• Traditionally, validity evidence based on test content has involved a qualitative
process in which the test is compared to a detailed description of the test
domain.
• Uses respected experts who specify the domain that should be covered and
help determine how representative the test’s items are.
Face Validity
• Face Validity can be undesirable if it makes the test “transparent” to the point
that it makes it easy for test takers to feign symptoms (e.g., forensic &
employment settings).
Test-Criterion Evidence
• Predictive Studies: indicates how accurately test data can predict criterion
scores that are obtained at a later time. When prediction is the intended
application, predictive studies retain the temporal differences and other
characteristics of the practical situation.
• Potential problems - predictive studies are time-consuming and expensive.
• Since predictive validity studies involve the use of an experimental group,
people may be admitted to programs or given jobs without regard to their
performance on the predictor, even if their scores suggest a low probability of
success.
• Concurrent Studies: obtains predictor and criterion data at the same time.
• Concurrent evidence is most useful when prediction over time is not
important. For example, examining the relationship between the results of a
brief paper-and-pencil questionnaire with that of a clinical interview.
• Discriminant evidence is obtained when you correlate a test with existing tests
that measure dissimilar constructs.
Lecture on Validity
1. Unclear directions.
2. Reading vocabulary and sentence structure too difficult.
3. Ambiguity
4. Inadequate time limits
5. Inappropriate level of difficulty of the test items.
6. Poorly constructed test items
7. Test items inappropriate for the outcomes being measured
8. Test too short
9. Improper arrangement of items
10. Identifiable pattern of answers.