Professional Documents
Culture Documents
Page30
by criterion- referenced interpretation
with that of norm-referenced
interpretation of test results
LEARNING ACTIVITIES
(UNCOVER:
After a test has been administered, each student's paper is scored for either correct
or incorrect responses. The total number of items responded to correctly or points
awarded is usually recorded at the top of each paper and is referred to as the
student's raw score. When papers are returned to students, they know how many
times they responded correctly, the total number of points the teacher judged their constructed
responses to be worth, or a total made up of a combination of these values. However, a student
is unable to determine the meaning of a score, unless additional information is provided for
interpretation. The teacher may tell the student how a score compares to a criterion point set as
pass / fail or to a set of norms established for interpretation.
Direction: Examine the pictures above. How do you relate them to your experience. What do
grades mean to you? Please send your answer in the comment box.
Page30
terms of the percentage of responses to items in a clearly defined learning task.
Examples:
1. Solves problems involving multiplication of similar fraction with 80% success.
2. Identifies 70 percent of terms for chemicals.
Another way of describing the individual's performance is in terms of the speed by which
the task is performed. (Oriondo, 1964)
Examples:
1. Spells 40 words correctly out of 50.
2. Answers 50 addition facts in 10 minutes without error.
In the examples given above, the interpretative frames of reference are specified content
domain rather than a specified population of persons. The focus is on what the student knows
and what he can do, not on how he compares with others.
Norm-Referenced Interpretation of Test Results
Norms represent the typical performance of a specified group of individuals. For a
classroom test that is norm - referenced, each score will compare to how that score stands in
relation to scores of others who have taken the test. Thus norm-referenced test interpretation
reveals how an individual compares with other persons who have taken the same test. The
rationale for the use of norm-referenced interpretation for some tests lies in the need to get an
overall estimate of how we study a general subject field or in some educational undertaking.
This kind of test is constructed so that students have an opportunity to reveal their highest levels
of attainment.
Mathematical techniques are employed to explain what a set of scores means and to
communicate meaning of individual scores to teachers, students and parents. The mathematics
used with test scores deals with the collection, presentation, analysis and interpretation of test
results. The accurate description of test results depends on a logical sequence of treatment of
the test scores to aid in reporting and interpreting the results, and ultimately in utilizing the
results in effective decision making.
Comparing Norm-Referenced and Criterion-Referenced Tests (Adapted from Kubiszyn
and Borich, 2007)
Dimension NRT CRT
Average number of 50% 80%
students who get an item
right
Compares a student's The performance of other Standards indicative of mastery
performance to students
Breadth of content Broad, covers many Narrow, covers a few objectives
sampled objectives
Comprehensiveness of Shallow, usually one or two Comprehensive, usually three or
content sampled items per objective more items per objective
Variability Since the meaningfulness of a The meaning of the scores does
Page30
norm-referenced score not depend on the comparison
basically depends on the with other scores. It flows
relative position of the score in directly from the connection
comparison with other scores, between the items and the
the more variability or spread criterion. Variability may be
of the scores, the better. minimal.
Item construction Items are chosen to promote Items are chosen to reflect the
variance or spread items that criterion behavior. Emphasis is
are “too easy” or “too hard” are placed on identifying the domain
avoided. One aim is to of relevant responses.
produced good “distractor
option.”
Reporting and Percentile rank and standard Number succeeding or failing or
interpreting scores used (relative ranking). range of acceptable
considerations performance uses (e.g. 90%
proficiency achieved, or 80% of
class reached 90% proficiency)
Page30
NUTSHELL
Recall the grades that you received in your high school subjects? How consistent
are these grades? How do you feel about receiving your grades last semester?
How important are grades for you?
References:
(Enter references here in APA format)
LESSON 12 - ORGANIZATION AND PRESENTATION OF ASSESSMENT DATA
Topics:
1. Frequency Distribution
2. Histogram, Frequency Polygon
3. Ranking of Scores
TIME ALLOTMENT WEEK 14 -
Student Learning Outcomes: Expected Output
1. organize a set of scores into a
frequency distribution; Do the exercise NUTSHELL.
2. construct a graphic representation by
histogram or frequency polygon for a
frequency distribution;
3. interpret the scores by way of ranking
LEARNING ACTIVITIES
X CN R X CN R X CN R
56 1 1 40 13 13.5 30 25 24.5
52 2 2 40 14 13.5 30 26 24.5
50 3 3 38 15 15 27 27 27.5
49 4 4.5 36 16 17 27 28 27.5
49 5 4.5 36 17 17 25 29 29
48 6 6.5 36 18 17 24 30 30.5
48 7 6.5 34 19 19 24 31 30.5
47 8 8 33 20 20 21 32 32.5
46 9 9 32 21 21.5 21 33 32.5
44 10 10 32 22 21.5 20 34 34
41 11 11.5 30 23 24.5 19 35 35
41 12 11.5 30 24 24.5 18 36 36
Ranking of the data above:
Ranking is used to indicate the relative position of a pupil or student in a group to which
he/she belongs. By ranking test scores, it is possible to compare the achievement of a pupil with
those of the others in the same group. A report of a student's rank is a very good indication of
individual performance compared to general group performance. Ranking however does not
consider the extent of the difference between successive test scores. From the ranks, the
percentage of pupils that surpasses a pupil or that are surpassed by him can be determined.
Ranks are generally well understood by students and parents.
In the above data:
1. The ranks of scores 56, 52, 50, 47, and 46 are their numbers namely: 1, 2, 3, 8 and
9. These scores appear only once, their consecutive numbers are their ranks.
2. The rank of score 49 is 4.5 , that is the average of 4 and 5; 4 added to 5 divided by
Page30
2.
3. The rank of 30 is 24.5 , that is the average of numbers 23, 24, 25 and 26.
4. Score 33 has a rank of 20. Nineteen students or 52.86 percent of the class
surpassed the student who got this score. This student surpassed 16 or 44.44
percent of his classmates.
Frequency Distribution
A frequency distribution provides the classroom teacher with a systematic arrangement
of raw scores by tallying the frequency of occurrence of each score in the interval or in some
instances score values that have been grouped.
Steps in Setting Up a Frequency Distribution for Ungrouped Data
1. Arrange the scores from highest to lowest in a column headed X. The X represents the
raw score.
2. Head the second column Tallies and record a slash or tally mark for each score. If a
score value appears twice, this column will have two slashes, three values gives three
slashes, and so on.
1. Count the slash marks and place the number corresponding to the total number
of tallies for each raw sore value in the third column. The f column represents the
occurrence of each score.
2. Sum the f column and record the number of scores (N) as a total.
Given the following set of scores:
32 39 40 25 29 35 39 28 41 29
37 30 27 32 29 29
X Tallies Frequency
(f)
41 / 1
40 / 1
39 // 2
37 / 1
35 / 1
32 // 2
30 / 1
29 //// 4
28 / 1
27 / 1
25 / 1
N = 16
1. The highest score is 58 and the lowest score is 24. The range is 34.
2. To find the class interval, divide the range, 34 by 10 (the desired number of step
or class-intervals). The answer is 3.4, so 3 the step-interval.
3. The lowest score is 24. Since 24 is exactly divisible by 3, then it is the lowest
limit.
4. The resulting frequency distribution would be:
10
6
Series1
4
0
8
6
-2
-3
-3
-4
-5
-5
24
30
36
42
48
54
Page30
representation of the frequency of scores associated with designated points on the
baseline. A frequency polygon is constructed by locating the midpoint of each interval
and recording a dot to represent the number of scores falling in that interval. The points
are then plotted at the midpoints of the interval and then the points are connected by
lines.
Frequency Polygon of A Grouped Scores
10
9
8
7
6
5 Series1
4
3
2
1
0
24 27 30 33 36 39 42 45 48 51 54 57
- - - - - - - - - - - -
28 29 32 35 38 41 44 47 50 53 56 59
Page30
References:
(Enter references here in APA format)
Page30
uses of measures of central tendency
LEARNING ACTIVITIES
UNCOVER
Questions:
1. What do you see in the picture? Have you received like this before?
2. What do you do with your report cards?
3. Why do you need to get the average of your grades?
securebrowser&hspart=avast&p=measures+of+central+tendency#id=1&vid=a2735015ef0acc6259e5491
e4a7f0376&action=click
https://www.statology.org/measures-central-tendency/
The middlemost scores are 32 and 28. The average of these two numbers is 30.
So the median is 30
Mdn = LL + xi
Where: Page30
LL = the real lower limit of the median class
N/2 = half sum
Fl = partial sum
F = frequency of the class interval where the median lies
N = the number of cases
i = the interval
Mdn = 69.5 + x5
= 69.5 + x5
= 69.5 + (.4545) x 5
= 69.5 + 2.2725
Mdn = 71.77
4. Check the answer by using the formula:
Mdn = Ul + xi
in which :
UI = real upper limit of the interval where the median lies
N/2 = half sum
Fu = partial sum
f = frequency of the class interval where the median lies
N = the number of cases
i = the interval
= 74.5 + x5
= 74.5 + (.5454) x 5
= 74.5 + 2.727
Mdn = 71.77
The Mean
Computation of the Mean from Ungrouped Data (When the number of cases is less than
30)
1. Use the formula: M= EX/N (The sum of X divided by N)
2. Write the sores in a column. They can be in any order.
3. Count the number of scores to get N.
4. Add the scores to get the sum.
5. Divide the sum by the number of cases.
1. Assume a mean. Get the midpoint of the interval where the assumed mean lies.
AM = 72
2. Fill in Column d (deviation). The deviation is the spread of the score from a point
of origin.
3. Fill in Column fd. The sum of the positive values is +33 and that of the negative
values is -42. The sum of fd is -9.
4. Substituting the formula:
M = 72 + (-9/50) 5
M = 72 + (-0.18) 5
M = 72 + (-0.9)
M = 72 - 0.9
M = 71.10
5. Check your answer by assuming another mean.
X f d fd
90 – 94 1 5 5
85 – 89 2 4 8
80 – 84 7 3 21
75 – 79 9 2 18
70 – 74 11 1 11 +63
65 – 69 8 0 0
60 – 64 5 -1 -5
55 – 59 5 -2 -10
Page30
50 – 54 1 -3 -3
45 – 49 1 -4 -4 -22
N = 50 Σfd = +41
Given:
AM = 67
Σfd = +41
1=5
N = 50
M= 67 + (+41/50) 5
M = 67 + (0.82) 5
M = 67 + 4.1
M = 71.10
Another method of computing the mean is through the midpoint method. The
formula is:
M=
X f M fM
90 – 94 1 92 92
85 – 89 2 87 174
80 – 84 7 82 574
75 – 79 9 77 693
70 – 74 11 72 792
65 – 69 8 67 536
60 – 64 5 62 310
55 – 59 5 57 285
50 – 54 1 52 52
45 – 49 1 47 47
N = 50 Σfd = 3555
Procedure:
1. Prepare a frequency distribution.
2. Place Column M which represents the midpoints of each class interval.
3. Fill in Column fM by multiplying each frequency by each corresponding midpoint.
4. Find the sum of the data in Column fM.
5. Divide this by N.
M = 3555/50 = 71.10
The Mode
The mode is the most frequently occurring score in the distribution. It is the score
with the highest frequency.
Determining the Mode from Ungrouped Scores (Crude or Rough Mode) Page30
Procedure:
1. Arrange the scores from highest to lowest.
2. The score that occurs most often is the crude mode.
Data:
25 30 37 41 52 52 30 37 42 37
Х
52
52
42
41
37 Mode = 37
37
37
37
30
30
30
25
When a group of scores has two different scores with the same highest
frequency, the group is said to be bi-modal. If there are three different scores with the
same highest frequency, the group is tri-modal, four, quadri-modal, etc.
In the Frequency Distribution given above where the median is 71.77 and the mean is
71.10, the mode is:
3 (71.77) - 2 (71.10)
= 215.31 - 142.2
= 73.11
The mode is merely the most typical value or the most frequent measure. It is computed
when a quick method of computing the most typical and approximate measure of central
tendency is all that is needed.
Quartiles
The first quartile (Q1) is located at one-fourth of the number of cases, such that
25% of all the cases lie at or below it and 75% at or above it.
The value of the third quartile corresponds to the value of the seventy-fifth
percentile. Seventy-five percent of all the cases lie at or above it and 25% lie at or below
it.
The value of the second quartile is equal to the value of the median, such that
50% of all the cases lie at or below it and 50% lie at or above it.
Formula:
Q1 = LL + xi
Finding Q1
X F CM
Page30
90 – 94 1 50
85 – 89 2 49
80 – 84 7 47
75 – 79 9 40
70 – 74 11 31
65 – 69 8 20
60 – 64 5 12
55 – 59 5 7
50 – 54 1 2
45 – 49 1 1
N = 50
1. Add Column CM in the Frequency Distribution. It stands for the cumulative
frequencies, this is done by adding the scores from the lower score end upward.
2. Find N/4. 50/4 = 12.5. The twenty-fifth score lies in the interval 65 - 69.
3. Determine the partial sum (F). That is the sum of the frequencies upward which
totals 25 (Q/4) but not exceeding it. In the given distribution, the partial sum (F) is
12
4. The value of f is 8 since it is the frequency of the interval where Q1 lies.
5. The value of LL or lower limit is 64.5
Substituting the formula:
Q1 = 64.5 + x5
Q1 = 64.5 + x5
Q1 = 64.5+ (0.06) x 5
Q1 = 64.5 + .30
Q1 = 64.80
Third Quartile
Formula:
Q3 = LL + xl
3N/4 = = = 37.5
LL = 74.5
F = 31
f=9
Page30
l=5
Q3 = 74.5 + x5
Q3 = 74.5 + (6.5/9) x 5
Q3 = 74.5 + (.72) x 5
Q3 = 74.5 + 3.6
Q3 = 78.1
Percentiles
Percentiles are points dividing the distribution into 100 equal parts.
Formula:
Px = LL + xl
where:
Px = the number of percentile desired
NPx = Percentile Num (N x the percentage desired)
F = partial sum (the number of scores falling below the desired percentile)
f = the frequency of the interval where the desired percentile lies
LL the exact lower limit of the interval where the desired percentile lies
I = interval.
Finding the Percentiles
X F CM
90 – 94 1 50
85 – 89 2 49
80 – 84 7 47
75 – 79 9 40
70 – 74 11 31
65 – 69 8 20
60 – 64 5 12
55 – 59 5 7
50 – 54 1 2
45 – 49 1 1
N = 50
Procedure:
1. Determine the desired percentile. E.g. P20
2. Find the percentile sum by multiplying the number of cases (N) by the
percentage desired. 20% of 50 = 50 * .20 = 10
3. Find the partial sum by adding the frequencies of the scores from the lower score
end upward until reaching the percentile sum but not exceeding it. (1 + 1 + 5= 7).
Percentile 20 or the 10th score lies at interval 60-64.
Page30
4. Determine f = the frequency of 60 - 64 is 5.
5. Determine LL. The exact or real lower limit of 60 - 64 is 59.5.
6. The interval is 5.
7. Substitute the formula.
INSTILL :
Questions for discussion:
1. Differentiate the mean from the mode and median.
2. When do you use the mean? The median? The mode?
3. Solve the following problem:
3.1 Find the mean, median and mode of the following set of scores:
89 77 63 99 92 93 94 65
62 82 86 76
3.2 Find the mean, median and mode of the following set of scores in Philippine
History:
82 43 72 74 69 68 67 87 86 73
85 75 65 60 35 57 52 59 40 42
61 57 70 50 45 68 62 49 69 58
61 65 60 81 63 48 54 46 54 44
67 66 49 58 67 60 60 68 58 62 68
Page30
administered. Compute the measures of central tendency and the
measures of location.
References:
(Enter references here in APA format)
LEARNING ACTIVITIES
UNCOVER: :
The measures of central tendency represented by the mean, median and
mode are valuable statistical measures, but they describe only the typical
score representing the whole distribution. They describe only the tendency of
the scores to pile up at or near the middle of the distribution. The measures
of variability or dispersion are important. They show the tendency of the scores to
spread or scatter above or below the center point of the distribution. They show how
close or how far the scores are from each other. These measures also show the
homogeneity or heterogeneity of different sets of scores. The higher the measure of
variability the more homogenous is the group: the lower the measure of variability, the
more heterogeneous is the group.
The most common measures or variability are the range, the standard deviation,
the mean deviation and quartile deviation. The most important and most often used in
measurement and research and in advanced statistics is the standard deviation.
Look at the figure below. Compare. In what aspect are they similar? Different?
Page30
Range
The range is the difference between the highest and lowest scores. It provides a
quick approximation of the spread of the scores, but it is not a dependable measure of
variability because it is calculated from only two values.
Example: Highest Score = 78; lowest score is 25. The range is 53.
Standard Deviation
The standard is the square root of the mean of the squared deviations of all
scores from the mean. It is basically a measure of how far each score is from the mean.
Since the standard deviation is based on deviations from the mean, these two statistics
are used together to give meaning to test scores.
SD =
Procedure:
1. List the scores under X column.
2. Find the mean of the scores.
3. Place Column X - (deviations); get the values by subtracting the mean from
each of the scores. When the scores are less than the mean, the negative sign
precedes the difference between the raw score and the mean.
4. Place column (X- ); square each of the values.
5. Find the sum of the squared deviation and divide it by the number of cases.
Example: Given this set of scores: 43, 41, 40, 38, 37, 33, 31, 29, 26, 24, 22
X (X- ) (X-X)
43 7 49
41 5 25
40 4 16
38 2 4
Page30
37 1 1
33 -3 9
30 -6 36
29 -7 49
24 -12 144
24 -12 144
21 -15 225
ΣX = 360 702
N = 10
X = 36
SD = = = 4.415
Example:
X (X- )
43 7
41 5
40 4
38 2
37 1
33 -3
30 -6
29 -7
24 -12
24 -12
21 -15
ΣX = 360 = 74
N = 10
X = 36
AM = = 7.4
Page30
Standard Deviation from Grouped Scores
The formula for standard deviation using the short method is:
SD =
Where:
SD = standard deviation
I = class interval
Σfd = the sum of the products of the frequencies by the deviations of the score
from the mean, squared.
Σfd = the sum of the products of the frequencies by the deviations of the score
from the mean.
N = the number of cases.
X f d fd fd
90 – 94 1 4 4 16
85 – 89 2 3 6 18
80 – 84 7 2 14 28
75 – 79 9 1 9 +33 9
70 – 74 11 0 0 0
65 – 69 8 -1 -8 8
60 – 64 5 -2 -10 20
55 – 59 5 -3 -15 45
50 – 54 1 -4 -4 16
45 – 49 1 -5 -5 -42 25
N = 50 Σfd = -9 Σfd = 185
SD =
SD =
SD =
SD =
SD = 5 x 1.9150
SD = 9.575
Page30
Quartile Deviation (Q)
When using the statistics of percentiles, deciles, quartiles, or the median which
are based on the order of the scores, the standard deviation cannot be used as a
measure of variability, since the deviations used in calculation of the standard deviation
are based on the mean. The variability of a distribution of scores can be used by using
the two points, Q3 and Q1. A measure of the variability of the middle 50 percent of the
scores is considered to be a good estimate, because extreme scores or erratic spacing
between scores in the upper 25 percent and lower 25 percent are excluded in the
computation. This is the quartile deviation. This is the value that is equal to half the
distance from Q1 to Q3.
Q=
Where:
Q = quartile deviation
Q3 = 75th percentile
Q1 = 25th percentile
Q1 = 64.5 + x5
Q1 = 64.5 + x5
Q1 = 64.5 + (0.06) x 5
Q1 = 64.5 +.30
Q1 = 64.80
Third Quartile
Page30
Formula:
Q3 = LL + xl
3N/4 = = = 37.5
LL = 74.5
F = 31
f=9
l=5
Q3 = 74.5 + x5
Q3 = 74.5 + x5
Q3 = 74.5 + (.72) x 5
Q3 = 74.5 + 3.6
Q3 = 78.1
Q= = 13.3
Characteristics/Properties of Distributions
Sk =
The characteristic of kurtosis is very closely related to the characteristic of
variability. It can give an indication of the degree of homogeneity of the group being
tested in regard to the characteristic being measured. If students tend to be much alike,
the scores will generate a leptokurtic frequency polygon; if students are very different,
a platykurtic distribution is generated. A mesokurtic distribution is neither platykurtic
nor leptokurtic.
The kurtosis for the normal distribution is approximately 0.263. Hence if the Ku is
greater than 0.263 , the distribution is most likely platykurtic; while if the Ku is less than
0.263, the distribution is most likely leptokurtic (Garett, 1973).
K= Q
(P90 – P10)
Page30
STANDARD SCORES
A standard score is one of many derived scores used in testing today. Derived
scores are valuable to the classroom teacher. Since scores differ from different tests,
the teacher can make them comparable by expressing them in the same scale. For
norm-referenced tests, it is meaningful to interpret classroom test scores by locating a
student's score with reference to the average for the class and to describe the distance
between the score and the average in terms of the spread of the scores in the
distribution.
Tristan's raw score on an English achievement test was 50. In the same class of
students Tristan scored 70 on the Mathematics achievement test. To compare the raw
score on one test with a raw score on another test to obtain a total or average score is
meaningless. The units are not comparable because the tests may have different
possible total scores, means, and standard deviations. By converting raw scores on
both tests to standard scores, the units become comparable, and can be interpreted
properly.
Using the deviation of a score from the mean (X- ) and the standard deviation
(SD), a teacher can build what is called a z- score.
z=
Z = a standard score
X = any raw score
M = the mean
SD = the standard deviation
For example, the means and standard deviation for Tristan's two test scores are as
follows:
Tristan's Raw Score Mean Standard Deviation
English test 50 45 5.6
Mathematics test 70 75 7
Comparison can be made between the two scores because the scores were earned in
the same group of students. Substituting the formula:
For English:
z= = = .89
Page30
For Mathematics:
z= = = -0.71
The two scores of Tristan can now be compared. Even if he got a higher score in
Mathematics than in English, he still did well in English as shown by the higher value of
the standard score in that subject.
INSTILL :
Questions for discussion:
1. Why is there a need to determine the variation of scores in a test?
2. How will this help the teacher in decision-making?
3. Discuss the steps in computing the different measures of variability?
4. Demonstrate how each is computed.
APPLY:
1. Find the standard deviation and average deviation of the
following set of scores:
32 39 40 25 29 35 39 28
41 29 37 30 27 32 29 26
2. Find the standard deviation, quartile deviation, skewness and kurtosis. Illustrate
your answer.
ci f
54 - 56 3
51 - 53 3
48 - 50 1
45 – 47 5
42 - 44 6
39 - 41 9
36 - 38 5
33 - 35 7
30 - 32 4
27 - 29 4
24 - 26 2
3. Vin's score in the midterm test in Statistics was 48 and 56 in the final test. The
mean of the first test is 42 and the standard deviation is 5. In the second test the
mean is 60 and the standard deviation is 6. In which test did Vin do better?
Page30
References:
(Enter references here in APA format)
Page30