Professional Documents
Culture Documents
1. PREDICTIVE VALIDITY
--- involves the use of criterion and THEORETICAL CONSTRUCT
refers to the ADEQUACY and a predictor. Example correlating
REPRESENTATIVENESS of the results of college entrance test -- describe by determining the
learning outcomes to be measured and student GWA at some future components of such psychological
time task
(predictor= CET; criterion= GWA)
2. CONCURRENT VALIDITY
CRITICAL CONSTRUCT
-- criterion are already available in
which CET is correlated with some - predictors, conclusions,
can be assure with the use of T.O.S assumptions, inference,
available criterion
interpretations and relevance of
(predictor= GWA; criterion= 4th year evidence
high school grade)
CHARACTERISTICS OF A GOOD TEST
RELIABILITY
--refers to the ―CONSISTENCY‖ of the test score.
--ERRORS of measurements are factors or conditions that can contribute to the
lowering of the test reliability. If the test has low reliability we can be assure
that errors of measurement have affected the test scores to the point that
the test is UNRELIABLE
•What happening within the •Test contain (poorly •Lightning of room, room
temp (too hot or too cold) •MISKEY/ providing wrong
individual?(fatigue, hunger, constructed items, items with answer, mistake in
headache, motional upset, clues, very easy, very noise, seating arrangement,
instruction, time allotment, correcting a wrong answer,
anxiety, growth and learning difficult, very high vocabulary mistake in the use of
acquired before the test)— reading level )—tends to attitude of test
examinee(MAKE THE TEST required pencil and
tends to reduce the guessing particularly when it subjective scoring
consistency of the SCORE is long UNRELIABLE LOWER THE
OVER TIME TEST SCORE)
Test Itself
Test
Test Takers (INTRA TEST Test Scoring
Administrations
ERROR)
TO DETERMINE THE CONSTRUCT VALIDITY OF CRITICAL
THINKING
1. Each subtest is correlated with the whole test.
2. The correlation of each subtest which measures a particular components
contribute to the measurement of a psychological trait which is critical thinking.
Define by:
X Y (proportion
(subtest) (correlation with the total score) of common
variance)
TEST-RETEST METHOD
--determines how scores are consistent over a given period of time. The same test is administered twice to the
same group with an interval between 2 to 15 days(sufficient time interval)(2-3 days student can recall
answer)(longer time interval lower the reliability)/true score= true score+error of measures/
PARALLEL/ALTERNATE FORMS METHOD
--used two different versions of the same test, administered to the same group close together
in time. It used form A or B and can be given on the same day or the next day. The difference of the
two is how they worded or written, it should measure the same skills and errors are significantly
controlled
TEST-RETEST WITH ALTERNATE FORMS METHOD
--administering the two version of the same test on two different occasions. Time interval may
be short(2 weeks)(longer for 6 months). Takes into account all possible sources of errors. It is the
most useful indicates variation of a test score over a period of time.
INTERNAL CONSISTENCY METHOD
-- employ only one test administration of the same test given to the same group on individual.
DIFERENT METHODS
1. SPILT-HALF /ODD-EVEN METHOD—scoring odd items, scoring even items
2. KUDER RICHARDSON FORMULA 20—two sets of score (odd and even) are correlated using
PRODUCT MOMENT CORRELATION COEFFICIENT FORMULA
3. TO TEST THE RELIABILITY OF THE WHOLE TEST (USE SPEARMAN-BROWN PROPHECY
FORMULA )
4. PEARSON r USED TO COMPUTE INTERMNAL CONSISTENCY OF A CERTAIN TEST USED IN
SPLIT-HALF METHOD
Reliability coefficient is high then it is said to be homogenous.
Consistency of the test scores determined over different parts of the
entire test..
Item Analysis
Index of Index of
difficulty discrimination
METHOD TO EMPLOY IN ITEM ANALYSIS
-USING THE UPPER AND LOWER INDEX METHOD
27/100
1. After scoring the test, arrange from lowest to highest.
2. Segregate the top and bottom 27/100 of the paper.
3. Tally the correct answers to each item by each student in the upper
27/100 group.
4. Repeat step three, considering the lower 27/100.
5. Get the percentage of the upper group that obtained the correct
answer use U.
6. repeat step 5, considering lower group. Used L.
7. Get the average percent of U and L.
8. Get the difference between U and L.
Range Description
FAIR OR REVISED
-UNACCEPTED DIFFICULTY OR DISCRIMINATION INDEX
POOR OR DISCARDED
-BOTH DIFFICULTY AND DISCRIMINATION INDEX ARE
UNACCEPTABLE.
THEN THE ITEM NEED TO BE DISCARDED RIGHT AWAY
TABLE OF ACTION TO BE TAKEN
DIFFICULTY LEVEL DISCRIMINATING ACTION
LEVEL
QUESTIONABLE ITEM
VERY DIFFICULT DISCARD
VERY DISCRIMINATING
NOT DISCRIMINATING DISCARD
MODERATELY
DIFFICULT DISCRIMINATING
REVISE
DISCRIMINATING RETAIN
NOT DISCRIMINATING REVISE
MODERATELY
MODERATE DIFFICULT DISCRIMINATING
MAY NEED REVISION
DISCRIMINATING ACCEPT
NOT DISCRIMINATING DISCARD
MODERATELY
EASY DISCRIMINATING
N.R.
DISCRIMINATING N.R
QUESTIONABLE SEE EXAMPLE
VERY EASY DISCARD
TRADITIONAL ASSESSMENT
6
Series
4 1
1. Histogram 2
2. Polygon 0 Series
Cate…
Cate…
Cate…
Cate…
3. bar 2
MEASURES OF CENTRAL TENDENCY
MEAN, MEDIAN, MODE
The MEAN– denoted by
-Simply the average of the group and most widely accepted measures of
central tendency
For Grouped data
For ungrouped data
--
Where - using
-- mean deviation
1. NORMAL DISTRIBUTION
2. POSITIVELY SKEWED DISTRIBUTION
3. NEGATIVELY SKEWED DISTRIBUTION
. Normal distribution
Positively skewed distribution