3 VALIDITY - RELAIBILITY 18032024 101010am
CHARACTERISTICS OF TESTS
– RELIABILITY AND VALIDITY
VALIDITY
,the form of the test, the purpose of the test and the
population for whom it is intended. Therefore, we cannot ask
the general question “Is this a valid test?”.
The question to ask is “how valid is this test for the decision that I
1. Face validity
2. Construct validity
3. Content validity
4. Criterion validity
a) Predictive validity
b) Concurrent validity
Face Validity
has validity.
We often say a test has face validity if the items seem to
items such as
And
“My heart starts pounding fast whenever I think about all of
stop to help stranded motorists are also the people who donate to
charities, while those who do not stop also do not donate.
Comparing Face Validity with
Construct Validity:
Face validity: The consensus that a measure represents a
particular concept – the face value of the measure
validity.
Convergent vs Divergent validity
Convergent Validity
When a measure correlates well with other tests
does not represent a construct other than the one for which it
was devised.
Content Validity
A test has content validity if it sufficiently covers the
(a) A measure may look valid when it does not, in fact, measure the
underlying construct, and
For example, a test might be used to predict which engaged couples will
have successful marriages and which ones will get divorced. Marital
success is the criterion, but it cannot be known at the time the couples take
the premarital test.
Criterion Validity
A powerful indicator of the validity of a measure is its ability to
accurately predict performance on other, independent
outcome measures (referred to as criterion measures).
The extent to which your SAT score predicts your college GPA is
an indication of the SAT’s criterion validity.
admissions tests if it accurately forecasts how well high school students will do
in their college studies.
The SAT is the predictor variable (test), and the college GPA is the criterion.
criterion – that is, achieving a high GPA in college. A valid test for this purpose
would greatly help college admissions committees because they would have
some idea about which students would most likely succeed.
Concurrent Validity
time because the test is designed to explain why the person is now
having difficulty in school.
Concurrent validity applies when the test and the criterion can be
Predictive validity
However, its predictive validity would be high if your SAT
score accurately predicted your college GPA, which is obtained long after taking
the SAT.
RELIABILITY
TYPES
Reliability: The consistency of a
measurement procedure
1. Test-retest reliability
4. Inter-rater reliability
Test-retest reliability
Measure the scores twice with the same instrument. Reliable
measures should produce very similar scores.
The most obvious method for finding the reliability of test scores
is by repeating the identical test on a second occasion.
The reliability coefficient (r) in this case is simply the correlation
between the scores obtained by the same persons on the two
administrations of the test.
the same test on two well-specified occasions and then find the
correlation between the scores from the two administrations.
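The slides only say "correlation"; assuming the usual Pearson product-moment correlation is meant, the test-retest reliability coefficient can be written as below. The symbols are introduced here for illustration: x_i and y_i are person i's scores on the first and second administration, and n is the number of persons tested on both occasions.

```latex
r = \frac{\sum_{i=1}^{n} (x_i - \bar{x})(y_i - \bar{y})}
         {\sqrt{\sum_{i=1}^{n} (x_i - \bar{x})^2}\,\sqrt{\sum_{i=1}^{n} (y_i - \bar{y})^2}},
\qquad
\bar{x} = \frac{1}{n}\sum_{i=1}^{n} x_i,
\quad
\bar{y} = \frac{1}{n}\sum_{i=1}^{n} y_i
```

If every person kept the same relative standing on both occasions, r would equal 1; the more the rank order shifts between administrations, the lower r becomes.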
Alternate Forms Reliability
Alternate Forms Reliability / Parallel Forms Reliability
Carryover effect
Alternate Forms Reliability
attribute. The two forms use different items; however, the rules
used to select items of a particular difficulty level are the same.
When two forms of the test are available, one can compare
practice effects.
Alternate Forms Reliability
is long, the best method is to divide the items randomly into two halves.
the second half of the test are more difficult than items on the first half.
If the items get progressively more difficult, then you might be better
advised to use an odd-and-even system, whereby one sub-score is obtained
for the odd-numbered items in the test and another for the even-numbered
items.
Split Half Reliability
To estimate the reliability of the test, you could find the