Professional Documents
Culture Documents
Module 5
Development of Varied Assessment Tools
LEARNING OUTCOME
• Construct paper-and pencil test in accordance with the guidelines in
test construction
Introduction
We are concerned with developing objective tests for assessing the attainment of
educational objectives based on Bloom’s taxonomy in this Chapter. For this purpose, we restrict our
attention to the following types of objective tests: (a) true-false items, (b) multiple-choice type items,
(c) matching items, (d) enumeration and filling of blanks and (e) essays. The first four types of objective
tests are used to test the first four to five levels of the hierarchy of educational objectives while the
last (essay) is used for testing higher-order thinking skills.
The development of objective tests requires careful planning and expertise in terms of
actual test construction. The more seasoned teachers can produce true-false items that can test even
higher-order thinking skills and not just rote memory learning. Essays are easier to construct than the
other types of objective tests but the difficulty with which objective grades are derived from essay
examinations often discourage teachers from using this particular form of examination in actual
practice.
1
Assessment of Student Learning 1
Example. We want to construct a test on the topic: “Subject-Verb Agreement in English” for a Grade V
class. The following are typical objectives:
Knowledge. The students must be able to identify the subject and the verb in a given sentence.
Comprehension. The students must be able to determine the appropriate form of a verb to be used
given the subject of a sentence.
Application. The students must be able to write sentences observing rules on subject-verb agreement.
Analysis. The students must be able to break down a given sentence into its subject and predicate.
Synthesis/Evaluation. The students must be able to formulate rules to be followed regarding the
subject-verb agreement.
In the table of specifications we see that there are five items that deal with knowledge and
these items are items 1,3,5,7,9. Similarly, from the same table we see that five items represent analysis,
namely: 12, 18, 20, 21, 23. The first four levels of Bloom’s taxonomy are equally represented in the test
while application (tested through essay) is weighted equivalent to ten (10) points or double the weight
given to any of the first four levels. The table of specifications guides the teacher in formulating the
test. As we can see, the TOS also ensures that each of the objectives in the hierarchy of educational
objectives is well represented in the test. As such, the resulting test that will be constructed by the
teacher will be more or less comprehensive. Without the table of specifications, the tendency for the
test maker is to focus too much on facts and concepts at the knowledge level.
Most often, however, the try-out is not done due to lack of time.
3
Assessment of Student Learning 1
Example: The Philippines gained its independence in 1898 and therefore celebrated its centennial year
in 2000. ______
Obviously, the answer is FALSE because 100 years from 1898 is not 2000 but 1998.
Rule 2: Avoid using the words “always”, “never” “often” and other adverbs that tend to be either
always true or always false.
Statements that use the word “always” are almost always false. A test-wise student can easily guess
his way through a test like these and get high scores even if he does not know anything about the test.
Rule 3: Avoid long sentences as these tend to be “true”. Keep sentences short.
Example: Tests need to be valid, reliable and useful, although, it would require a great amount of time
and effort to ensure that tests possess these test characteristics. _______
Notice that the statement is true. However, we are also not sure which part of the sentence is deemed
true by the student. It is just fortunate that in this case, all parts of the sentence are true and hence,
the entire sentence is true. The following example illustrates what can go wrong in long sentences:
Example: Tests need to be valid, reliable and useful since it takes very little amount of time, money and
effort to construct tests with these characteristics.
The first part of the sentence is true but the second part is debatable and may, in fact, be false. Thus,
a “true” response is correct and also, a “false” response is correct.
Rule 4. Avoid trick statements with some minor misleading word or spelling anomaly, misplaced
phrases, etc. A wise student who does not know the subject matter may detect this strategy and thus
get the answer correctly.
Example: True or False. The Principle of our school is Mr. Albert P. Panadero.
The Principal’s name may actually be correct but since the word is misspelled and the entire sentence
takes a different meaning, the answer would be false! This is an example of a tricky but utterly useless
item.
4
Assessment of Student Learning 1
Rule 5: Avoid quoting verbatim from reference materials or textbooks. This practice sends the wrong
signal to the students that it is necessary to memorize the textbook word for word and thus,
acquisition of higher level thinking skills is not given due importance.
Rule 6. Avoid specific determiners or give-away qualifiers. Students quickly learn that strongly
worded statements are more likely to be false than true, for example, statements with “never” “no”
“all” or “always.” Moderately worded statements are more likely to be true than false. Statements
with “many” “often” “sometimes” “generally” ‘frequently” or “some” should be avoided.
Rule 7. With true or false questions, avoid a grossly disproportionate number of either true or false
statements or even patterns in the occurrence of true and false statements.
1. T 6. F 1. T 6. F
2. F 7. F 2. F 7. T
3. F 8. F or 3. T 8. F
4. F 9. F 4. F 9. T
5. F 10. F 5. T 10. F
For ease of correction, teachers sometime create a pattern of True or False answers. Students will
sense it and may arrive at a correct answer not because he/she really knows the answer but because
he/she senses the pattern.
Rule 2: Do not use modifiers that are vague and whose meanings can differ from one person to the
next such as: much, often, usually, etc.
5
Assessment of Student Learning 1
The qualifier “much” is vague and could have been replaced by more specific qualifiers like:” 90% of
the photosynthetic process” or some similar phrase that would be more precise.
Rule 3: Avoid complex or awkward word arrangements. Also, avoid use of negatives in the stem
as this may add unnecessary comprehension difficulties.
Example:
(Poor) As President of the Republic of the Philippines, Corazon Cojuangco Aquino would stand next to
which President of the Philippine Republic subsequent to the 1986 EDSA Revolution?
(Better) Who was the President of the Philippines after Corazon C. Aquino?
Rule 5: Do not use negatives or double negatives as such statements tend to be confusing. It is best
to use simpler sentences rather than sentences that would require expertise in grammatical
construction.
Example:
(Poor) Which of the following will not cause inflation in the Philippine economy?
(Better) Which of the following will cause inflation in the Philippine economy?
(Poor) What does the statement “Development patterns acquired during the formative years are NOT
Unchangeable” imply?
A.
B.
C.
D.
(Better) What does the statement “Development patterns acquired during the formative years are
changeable” imply?
A.
B.
C.
D.
6
Assessment of Student Learning 1
Rule 5: Each item stem should be as short as possible; otherwise you risk testing more for reading
and comprehension skills.
Example: The short story: May Day’s Eve, was written by which Filipino author?
a. Jose Garcia Villa
b. Nick Joaquin
c. Genoveva Edrosa Matute
d. Robert Frost
e. Edgar Allan Poe
If distracters had all been Filipino authors, the value of the item would be greatly increased. In this
particular instance, only the first three carry the burden of the entire item since the last two can be
essentially disregarded by the students.
Rule 7: All multiple choice options should be grammatically consistent with the stem.
Rule 8: The length, explicitness, or degree of technicality of alternatives should not be the
determinants of the correctness of the answer. The following is an example of this rule:
Example: If the three angles of two triangles are congruent, then the triangles are:
a. congruent whenever one of the sides of the triangles are congruent
b. similar
c. equiangular and therefore. must also be congruent
d. equilateral if they are equiangular
The correct choice, “b,” may be obvious from its length and explicitness alone. The other choices are
long and tend to explain why they must be the correct choices forcing the students to think that they
are, in fact, not the correct answers!
Rule 10: Avoid alternatives that are synonymous with others or those that, include or overlap others.
Example: What causes ice to transform from solid state to liquid state’?
a. Change in temperature
b. Changes in pressure
c. Change in the chemical composition
d. Change in heat levels
7
Assessment of Student Learning 1
The options a and d are essentially the same. Thus, a student who spots these identical choices would
right away narrow down the field of choices to a, b, and c. The last distracter would play no significant
role in increasing the value of the item.
Rule 11: Avoid presenting sequenced items in the same order as in the text.
Rule 12: Avoid use of assumed qualifiers that many examinees may not be aware of.
Rule 13: Avoid use of unnecessary words or phrases, which are not relevant to the problem at hand
(unless such discriminating ability is the primary intent of the evaluation). The items value is
particularly damaged if the unnecessary material is designed to distract or mislead. Such items test
the student’s reading comprehension rather than knowledge of the subject matter.
Example: The side opposite the thirty degree angle in a right triangle is equal to half the length of the
hypotenuse. If the sine of a 30-degree is 0.5 and its hypotenuse is 5, what is the length of the side
opposite the 30-degree angle?
a. 2.5
b. 3.5
c. 5.5
d. 1.5
The sine of a 30-degree angle is really quite unnecessary since the first sentence already gives the
method for finding the length of the side opposite the thirty-degree angle. This is a case of a teacher
who wants to make sure that no student in his class gets the wrong answer!
Rule 14: Avoid use of non-relevant sources of difficulty such as requiring a complex calculation when
only knowledge of a principle is being tested.
Note in the previous example, knowledge of the sine of the 30-degree angle would have led some
students to use the sine formula for calculation even if a simpler approach would have sufficed.
Rule 16: Include as much of the item as possible in the stem. This allows for less repetition and
shorter choice options.
Rule 17: Use the “None of the above” option only when the keyed answer is totally correct. When
choice of the “best” response is intended, “none of the above” is not appropriate, since the
implication has already been made that the correct response may be partially inaccurate.
8
Assessment of Student Learning 1
Rule 18: Note that the use of “all of the above” may allow credit for partial knowledge. In a multiple
option item, (allowing only one option choice) if a student only knew that two (2) options were
correct, he could then deduce the correctness of “all of the above”. This assumes you are allowed
only one correct choice.
Rule 19: Having compound response choices may purposefully increase difficulty of an item.
Rule 20: The difficulty of a multiple choice item may be controlled by varying the homogeneity or
degree of similarity of responses. The more homogeneous, the more difficult the item.
Example:
A B
9
Assessment of Student Learning 1
Normally, column B will contain more items than column A to prevent guessing on the part of
the students. Matching type items, unfortunately, often test lower order thinking skills (knowledge
level) and are unable to test higher order thinking skills such as application and judgement skills
As with multiple-choice items, there has only been a small amount of empirical research on the
characteristics of matching items and how they affect validity or reliability. In addition to research
findings, there is also a common set of recommendations found in classroom assessment textbooks. A
few of the critical guidelines from both these types of data (Frey, Petersen, Edwards, Pedrotti, &
Peyton, 2003; Haladyna & Downing, 1989a, 1989b; Haladyna, Downing & Rodriguez, 2002) are
presented below.
Guideline 1.
Guideline 2.
Guideline 3.
Guideline 4.
Guideline 5.
Guideline 6.
Matching stems should be on the left and answer options on the right.
Students are used to reading from left to right, and the process of matching two concepts together is
similar to the construction and comprehension processes which occur when reading sentences.
How can the use of quality matching items benefit your students, including those with special
needs? Like multiple-choice items, a section of matching items on a test can cover a large amount of
material in a relatively brief period of time. In fact, matching items are even more efficient than
multiple-choice items because each stem acts as a separate multiple-choice item with all the answer
options as possible answers. Functionally, a matching section containing ten stems operates as ten
multiple-choice items. When well-written, all the wrong answer options act as distractors. Guessing is
difficult, perhaps the most difficult of any objective test format. Because matching items allow for many
items in a short space and make guessing difficult, the validity and reliability of classroom tests are
improved, and that helps all students to be assessed fairly and accurately.
A variant of the matching type items is the data sufficiency and comparison type of test illustrated
below:
Example: Write G if the item on the left is greater than the item on the right; L if the item on the left is
less than the item on the right; E if the item on the left equals the item on the right and D if the
relationship cannot be determined.
A B
Another useful device for testing lower-order thinking skills is the supply type of tests. Like the
multiple-choice test, the items in this kind of test consist of a stem and a blank where the students
would write the correct answer.
11
Assessment of Student Learning 1
Supply type tests depend heavily on the way that the stems are constructed. These tests allow for one
and only one answer and, hence, often test only the students’ knowledge.
It is , however, possible to construct supply type of tests that will test higher order thinking as
the following example shows:
Example: Write an appropriate synonym for each of the following. Each blank corresponds to a letter:
Metamorphose: _ _ _ _ _ _
Flourish: _ _ _ _
The appropriate synonym for the first is CHANGE with six(6) letters while the appropriate
synonym for the second is GROW with four (4) letters. Notice that these questions require not only
mere recall of words but also understanding of these words.
Essay questions can be used to measure the attainment of a variety of objectives. Stecklein (1955) has
listed 14 types of abilities that can be measured by essay items:
13
Assessment of Student Learning 1
7. Knowledge of relationships
8. Illustrations of rules, principles, procedures, and applications
9. Applications of rules, laws, and principles to new situations
10. Criticisms of the adequacy, relevance, or correctness of a concept, idea, or information
11. Formulation of new questions and problems
12. Reorganization of facts
13. Discriminations between objects, concepts, or events
14. Inferential thinking
Types of Essay
Restricted Essay
It is also referred to as short focused response. Examples are asking students to
“write an example,” “list three reasons,” or compare and contrast two techniques.”
How are the scrub and the mockingbird different? Support your answer
with details and information from the article.
Non-restricted/Extended Essay
Extended responses can be much longer and complex than short responses, but
students are encouraged to remain focused and organized.
Note that all these involve the higher-level skills mentioned in Bloom’s Taxonomy.
The following are rules of thumb which facilitate the scoring of essays:
14
Assessment of Student Learning 1
Rule 1: Phrase the direction in such a way that students are guided on the key concepts to be
included.
Rule 2: Inform the students on the criteria to be used for grading their essays. This rule allows
the students to focus on relevant and substantive materials rather than on peripheral and
unnecessary facts and bits of information.
Example: Write an essay on the topic: “Plant Photosynthesis” using the keywords indicated. You
will be graded according to the following criteria: (a) coherence, (b) accuracy of statements, (c)
use of keywords, (d) clarity and (e) extra points for innovative presentation of ideas.
Rule 5: Evaluate all of the students’ answers to one question before proceeding to the next
question.
Scoring or grading essay tests question by question, rather than student by student, makes it
possible to maintain a more uniform standard for judging the answers to each question. This
procedure also helps offset the halo effect in grading. When all of the answers on one paper
are read together, the grader’s impression of the paper as a whole is apt to influence the grades
he assigns to the individual answers. Grading question by question, of course. prevents the
formation of this overall impression of a student’s paper. Each answer is more apt to be judged
on its own merits when it is read and compared with other answers to the same question. than
when it is read and compared with other answers by the same student.
Rule 6: Evaluate answers to essay questions without knowing the identity of the writer. This
is another attempt is control personal bias during scoring. Answers to essay questions should
be evaluated in terms of what is written, not it terms of what is known about the writers from
other contacts with them. The best way to prevent our prior knowledge from influencing our
judgment is to evaluate each answer without knowing the identity of the writer. This can be
done by having the students write their names on the back of the paper or by using code
numbers in place of names.
Rule 7: Whenever possible, have two or more persons grade each answer. The best way to
check on the reliability of the scoring of essay answers is to obtain two or more independent
judgments. Although this may not be a feasible practice for routine classroom testing, it might
be done periodically with a fellow teacher (one who is equally competent in the area). Obtaining
two or more independent ratings becomes especially vital where the results are to be used for
important and irreversible decisions, such as in the selection of students for further training or
for special awards. Here the pooled ratings of several competent persons may be needed to
attain level of reliability that is commensurate with the significance of the decision being made.
Some teachers use the cumulative criteria i.e. adding the weights given to each criterion,
as basis for grading while others use the reverse. In the latter method, each student begins with
a score of 100. Points are then deducted every time a teacher encounters a mistake or when a
criterion is missed by the student in his essay.
15
Assessment of Student Learning 1
Rule 9. Provide information about the value/weight of the question and how it will be scored.
Learning Activities
Learning Activity 1.
A. Give non-examples of each of the following rules in construction of true-false test. Improve on
the non-examples for them to become good examples of tests.
1. Avoid using the words “always”, “never” “often” and other adverbs that tend to be either
always true or always false.
2. Do not give a hint (inadvertently) in the body of the question.
3. Avoid long sentences as these tend to be “true”. Keep sentences short.
4. Avoid trick statements with some minor misleading word or spelling anomaly, misplaced
phrases, etc.
5. Avoid specific determiners or give-away qualifiers.
B. Give non-examples of each of the following rules in construction of multiple-choice test.
Improve on the non-examples for them to become good examples of tests.
1. Avoid stems that reveal the answer to another item.
2. Distracters should be equally plausible and attractive.
3. Do not use unfamiliar words, terms and phrases.
4. All multiple choice options should be grammatically consistent with the stem.
5. Having compound response choices may purposefully increase difficulty of an item.
Learning Activity 2
A. Choose learning competencies from the K to 12 Curriculum Guide. Construct a 10-item test in
line with your field/major, using the following type of test.
1. Matching Type
2. Completion
3. Supply Type
B. In a 100-item test, what types of objective tests will you include? Justify your answer.
C. In the sample essay “Plant Photosynthesis” given in this section, why would you give zero (0)
score to the student writing this essay? Justify your answer.
Example: Write an essay on the topic: “Plant Photosynthesis” using the ff. key words
and phrases: chlorophyll, sunlight, water, carbon dioxide, oxygen, by- product, stomata.
16
Assessment of Student Learning 1
Plant Photosynthesis
Nature has its own way of ensuring the balance between food producers and
consumers. Plants are considered producers of food for animals. Plants produce food
for animals through a process called “photosynthesis”. It is a complex process that
combines various natural elements on earth into the final product which animals can
consume in order to survive. Naturally, we all need to protect plants so that we will
continue to have food on our table. We should discourage burning of grasses, cutting
of trees and illegal logging. I f the leaves of plants are destroyed, they cannot perform
photosynthesis and animals will also perish.
D. Make a sample Table of Specification for 1st Quarterly Assessment. (You can choose any subjects
in Junior High School). Refer to the Curriculum Guide (CG) or the Most Essential Learning
Competencies (MELC) for the different topics.
NOTE: Write all your answers in a short bondpaper for compilation. (ALL ANSWERS MUST BE
HANDWRITTEN)
Web Links
https://educationstandards.nsw.edu.au/wps/portal/nesa/k-10/understanding-the-
curriculum/assessment/approaches
https://journals.sagepub.com/doi/pdf/10.2304/plat.2003.3.2.97
References
Navarro, R.L., Santos, R.G., & Corpuz, B.B. (2017). Assessment of Learning 1
Calmorin, L.P. (2011). Assessment of Student Learning 1
17