Professional Documents
Culture Documents
Constructing
Objective Test
4 Items
LEARNING OUTCOMES
By the end of this topic, you should be able to:
1. Define objective tests;
2. Differentiate between different types of objective tests;
3. Discuss the advantages of using different types of objective tests;
4. Identify the weaknesses of using the different types of objective tests;
and
5. Explain the techniques for constructing the different types of
objective test items.
INTRODUCTION
In this topic we will focus on using objective tests to assess various kinds of
behaviour in the classroom. Firstly, the discussion will be limited to the simple
forms of objective test items, namely short-answer item, true-false item and
matching item. Three types of objective tests are examined and guidelines for the
construction of each type of the tests are discussed. The advantages and limitations
of each of these types of objective tests are explained. Secondly, we will discuss the
multiple-choice item, a more complex form of objective test items. The discussion
will focus on the characteristics and uses of multiple-choice items, their advantages
and limitations and some suggestions for the construction of such items.
You can refer to Linn and Gronlund (1995) for more examples.
Now, let us look at the advantages and the limitations of this type of question.
Many short-answer questions can be set for a specific period of time. A test paper
consisting of short-answer questions is thus able to cover a fairly wide content of
the course to be assessed. A wide content coverage enhances the content validity
of the test.
Scoring of answers to the short-answer question can also pose a problem. Unless
the question is carefully phrased, learners can provide answers of varying degree
of correctness. For example, answer to a question such as „When was Malaysia
formed?‰ could either be „In 1963‰ or „On 16 September 1963‰. The teacher has to
decide whether learners who gave the partial answer have the same level of
knowledge as those who provided the complete answer. Besides, learnersÊ
answers can also be contaminated by spelling errors. If spelling is taken into
consideration, the test scores of learners will reflect their level of knowledge of the
content assessed as well as their spelling ability. If spelling is not considered in the
scoring, the teacher has to decide whether the misspelled word actually represents
the correct answer.
(a) Word the question so that the intended answer is brief and specific
As far as possible, the question should be phrased in such a way that only
one answer is correct.
For example:
Better item: An animal that eats the flesh of other animals is classified as
___________.
For example:
Possible answers for 1st item: a boat, the 15th century, a search for India
(c) If the problem requires a numerical answer, indicate the units in which the
answer is to be expressed
For example:
Possible answers for 1st item: a boat, the 15th century, a search for India
(d) Do not include too many blanks for the completion item
Blanks for answers should be equal in length. For the completion item, place
the blank near the end of the sentence.
For example:
Possible answers for 1st item: a boat, the 15th century, a search for India
SELF-CHECK 4.1
ACTIVITY 4.1
For example:
True False
True-false questions can be quickly written and can cover a lot of content. True-
false questions are well suited for testing learner recall or comprehension. Learners
can generally respond to many questions, covering a lot of content in a fairly short
amount of time. From the teacherÊs perspective, these questions can be written
quickly and are easy to score. Because they can be objectively scored, the scores
are more reliable than for items that are at least partially dependent on the
teacherÊs judgment. Generally, they are easier to construct compared to multiple-
choice questions because there is no need to develop distractors. Hence, they are
less time consuming compared to constructing multiple-choice questions.
(a) Guessing ă A learner has a one in two chance of guessing the correct answer
of a question. Scores on true-false items tend to be high because of the ease
of guessing the correct answers when the answer is not known. With only
two choices (true or false) the learner could expect to guess correctly on
half of the items for which correct answers are not known. Thus, if a learner
knows the correct answers to 10 questions out of 20 and guesses on the other
10, the learner could expect a score of 15. The teacher can anticipate scores
ranging from approximately 50 per cent for a learner who did nothing but
guess on all items to 100 per cent for a learner who knows the material.
(b) Because these items are in the form of statements, there is sometimes a
tendency to take quotations from the text, expecting the learner to recognise
a correct quotation or note a change (sometimes minor) in wording. There
may also be a tendency to include trivial or inconsequential material from
the text. Both of these practices are discouraged.
(d) True-false items provide little diagnostic information. Teachers can often get
useful information about learner errors and misconceptions by examining
learnersÊ incorrect answers but true-false items do not provide such
diagnostic information.
(e) True-false items may produce a negative suggestion effect. Some testing
experts feel that exposing false statements might promote learning false
information.
(f) False statements do not provide evidence that learners know the correct
answer.
For example:
(b) Use negative statements sparingly but avoid double negatives. Double
negatives tend to contribute to the ambiguity of the statement. Statements
with words like none, no and not should be avoided as far as possible.
For example:
(c) Avoid broad, general statements. Most broad generalisations are false unless
qualified.
For example:
(d) Avoid long complex sentences. Such sentences also test reading
comprehension besides the achievements to be measured.
For example:
(e) Try using in combination with other materials such as graphs, maps and
written material. This combination allows for the testing of more advanced
learning.
(f) Avoid lifting statements directly from assigned reading, notes or other
course materials so that recall alone will not permit a correct answer.
(g) In general, avoid the use of words which would signal the correct response
to the test-wise learner. Absolutes such as „none‰, „never‰, „always‰, „all‰
and „impossible‰ tend to be false while qualifiers such as „usually‰,
„generally‰, „sometimes‰ and „often‰ are likely to be true.
(h) A similar situation occurs with the use of "can" in a true-false statement. If
the learner knows of a single case in which something „can‰ be done, it
would be true.
(i) Ambiguous or vague statements and terms such as "large", "long time",
"regularly", "some" and "usually" are best avoided in the interest of clarity.
(j) Some terms have more than one meaning and may be interpreted differently
by individuals.
(k) True statements should be about the same length as false statements (there is
a tendency to add details in true statements to make them more precise).
(l) Word the statement so precisely that it can be judged unequivocally true or
false.
(n) Avoid verbal clues (specific determiners) that indicate the answer.
SELF-CHECK 4.2
ACTIVITY 4.2
For example:
Directions: Column A contains statement describing selected Asian cities.
For each description find the appropriate city in Column B. Each city in Column
B can be used only once.
Column A Column B
1. The ancient capital of Thailand A. Ayuthia
2. The largest city in Sumatera B. Ho Chi Minh City
3. The capital of Myanmar C. Karachi
4. Formerly known as Saigon D. Medan
The learner reads a premise (Column A) and finds the correct response from
among those in Column B. The learner then prints the letter of the correct response
in the blank beside the premise in Column A. An alternative is to have the learner
draw a line from the correct response to the premise but this is more time
consuming to score. One of the ways to reduce the possibility of guessing the
correct answers is to list a larger number of responses (Column B) than premises
(Column A) as is done in the example. Another way to decrease the possibility of
guessing is to allow responses to be used more than once. Instructions to the
learners should be very clear about the use of responses.
(b) They can also assess a learnerÊs ability to apply knowledge by requiring a
test-taker to match the following:
(d) Matching questions are generally easy to write and score when the content
tested and objectives are suitable for matching question.
(a) Matching questions are limited to the materials that can be listed into two
columns and there may not be much material that lends itself to such a
format;
(b) If there are four items in a matching question and the learners know the
answers for three of them, the fourth item is a give-away through
elimination;
(a) Provide clear instructions. They should explain how many times responses
can be used;
(c) Include more responses than premises or allow the responses to be used
more than once;
(e) Correct answers should not be obvious to those who do not know the content
being taught;
(f) There should not be keywords appearing in both a premise and response,
providing a clue to the correct answer; and
(g) All of the responses and premises for a matching item should appear on the
same page.
SELF-CHECK 4.3
ACTIVITY 4.3
1. Suggest other advantages and weaknesses of using matching
questions.
2. Select five true-false questions in your subject area and analyse each
item using the guidelines discussed.
3. Select five matching questions in your subject area and analyse each
item using the guidelines discussed.
4. Suggest how you would improve the weak items for each type of
questions that you have identified.
MCQs are the most difficult to prepare. These questions have two parts, namely a
stem which contains the question and four or five options which contains the
correct answer. The correct answer is called the keyed response and the incorrect
options are called distractors. The stem may be presented as a question, direction
or a statement while the options could be a word, phrase, numbers, symbols and
so forth. Cruel as it may seem, the role of the distractor is to attract the attention of
respondents who are not sure of the correct answer.
Now, let us look at what an MCQ consists of. It has the stem and also the options.
(a) Stem
The stem should:
(iv) Generally ask for one answer only (the correct or the best answer); and
(i) Have either four or five alternatives, all of which should be mutually
exclusive and not too long;
(iv) Contain the intended answer or the keyed response and it should
appear to be clearly correct to the informed but it should be definitely
incorrect but plausible to the distractors.
SELF-CHECK 4.4
1. Why is MCQ a popular form of objective test?
As stated earlier, MCQs are the most difficult to prepare. We need to focus on
writing the stem as well as providing the options or alternatives. All the options in
multiple-choice items need to be plausible but they also need to separate learners
of different ability levels. Table 4.1 shows some considerations that need to be
taken into account when constructing MCQs, particularly the stems.
(c) Use clear, straight As the level of fertility A major decline in fertility
forward language approaches its nadir, what is the in a developing nation is
in the stem of the most likely ramification for the likely to produce
item. Questions citizenry of a developing nation? A. a decrease in the labour
that are A. A decrease in the labour force participation rate
constructed using force participation rate of of women.
complex wordings women. B. a downward trend in
may become a test B. A downward trend in the the youth dependency
of reading youth dependency ratio. ratio.
comprehension
C. A broader base in the C. a broader base in the
rather than an
population pyramid. population pyramid.
assessment of
whether the D. An increased infant D. an increased infant
learner knows the mortality rate. mortality rate.
subject matter. Note: In the improved item, the word „nadir‰ is replaced with
„decline‰ and „ramification‰ is replaced with „produce‰ which
are more straight forward words.
(d) Use negatives Which of the following is not a Which of the following is a
sparingly. If symptom of osteoporosis? symptom of osteoporosis?
negatives must be A. Decreased bone density. A. Hair loss.
used, capitalise, B. Frequent bone fractures. B. Painful joints.
underscore or
C. Raised body temperature. C. Raised body
bold.
D. Lower back pain. temperature.
D. Decreased bone density.
Note: The better item is stated in the positive so as to avoid use
of the negative „not‰.
(e) Put as much of the Theorists of pluralism have Theorists of pluralism have
question in the asserted which of the following? asserted that the
stem as possible, A. The maintenance of maintenance of democracy
rather than democracy requires a large requires
duplicating middle class. A. a large middle class.
material in each of B. The maintenance of B. the separation of
the options. democracy requires governmental powers.
autonomous centres of C. autonomous centres of
countervailing power. countervailing power.
C. The maintenance of D. the existence of a
democracy requires the multiplicity of religious
existence of a multiplicity of groups.
religious groups.
D. The maintenance of
democracy requires the
separation of governmental
powers.
(h) Avoid using ALWAYS and NEVER in the stem as test-wise learners are likely to rule
such universal statements out of consideration.
ACTIVITY 4.4
1. Select ten MCQs in your subject area and analyse the stem of each
item using the guidelines discussed.
Now, let us look at Table 4.2 which shows some considerations when constructing
the distractors for MCQs.
(d) Distractors based on common learner errors or misconceptions are very effective.
One technique for compiling distractors is to ask learners to respond to open-ended
short-answer questions, perhaps as formative assessments. Identify which incorrect
responses appear most frequently and use them as distractors for a multiple-choice
version of the question.
(e) Do not create distractors that are so close to the correct answer that they may confuse
learners who really know the answer to the question. „Distractors should differ from
the key in a substantial way, not just in some minor nuances of phrasing or
emphasis.‰ (Isaacs, 1994)
(f) Provide a sufficient number of distractors.
You will probably choose to use three, four or five alternatives in a MCQ. Until
recently, it was thought that three or four distractors were necessary for the item
to be suitably difficult. However a study by Owen and Freeman suggested that
three choices are sufficient (Brown, 1987). Clearly the higher the number of
distractors, the less likely it is for the correct answer to be chosen through guessing
(provided all alternatives are of equal difficulty).
ACTIVITY 4.5
1. Do you agree that teachers should not use negatives in the stems of
MCQs? Justify your answer.
3. Select ten MCQs in your subject area and analyse the distractors of
each item using the guidelines discussed. Suggest how you would
improve the weak items.
Share your answers with your coursemates in the myINSPIRE online
forum.
(f) Scores are more reliable than subjectively scored items (such as essays);
(h) Item analysis can reveal how difficult each item was and how well it
discriminates between strong and weaker learners in the class;
(i) Performance can be compared from class to class and year to year;
(j) Can cover a lot of material very efficiently (about one item per minute of
testing time); and
(k) Items can be written so that learners must discriminate among options that
vary in degree of correctness.
While there are many advantages of using MCQs, there are also many limitations
in using such items, namely:
(c) MCQs are not effective for measuring problem-solving skills as well as the
ability to organise and express ideas;
(f) Learners can sometimes read more into the question than was intended;
(g) It often focuses on testing factual information and fails to test higher levels
of cognitive thinking;
(i) It places a high degree of independence on learnersÊ reading ability and the
constructorÊs writing ability;
Now, let us look at Table 4.3 which summarises the procedural rules for the
construction of MCQs.
Test for important or significant Avoid giving clues through the use of
information. faulty grammatical construction.
Avoid trick items. Avoid the use of humour when
Keep the vocabulary consistent developing options.
with the learnersÊ level of Present practical or real-world
understanding. situations to learners.
Avoid overly specific knowledge Use pictorial materials that require
when constructing items. learners to apply principles and
Avoid items based on opinions. concepts.
Be sensitive to cultural, religious Avoid textbook, verbatim phrasing
and gender issues. when developing items.
Keep options or alternatives Use charts, tables or figure that
independent and not overlapping. require interpretation.
Avoid distractors that can provide
clue to test-wiseness.
For example:
Read the following comment that a teacher made about testing and then
answer the question.
guessing, bluffing and the subjective opinions of the teacher who is scoring.
The teaching and learning process would be beneficial if we do away with
tests and depend on learner self-evaluation.‰
Which of the following types of test is this teacher primarily talking about?
A. Aptitude test
B. Diagnostic test
C. Formative test
D. Summative test*
Adapted from:
https://www.learningsolutionsmag.com/articles/804/writing-
multiple-choice-questions-for-higher-level-thinking
For example:
Two 60-year-old male patients (P#1 and P#2) have Type 2 diabetes. Each has
a BMI of 27. The primary treatment for each patient is a diet to reduce blood
glucose levels.
What is the most likely reason why P#2 did not show a decline in glucose
level after three months?
Adapted from:
http://digitalcommons.hsc.unt.edu/cgi/viewcontent.cgi?article=100
9&context=test_items
For example:
Both Su and Ramya want to lose weight. Su goes on a low carbohydrate diet
while Ramya goes on a vegan diet. After six months, Su lost 30 pounds and
Ramya lost 15 pounds.
Adapted from:
http://digitalcommons.hsc.unt.edu/cgi/viewcontent.cgi?article=100
9&context=test_items
SELF-CHECK 4.5
ACTIVITY 4.6
Suggest other advantages and weaknesses of using MCQs.
Objective tests vary depending on how the questions are presented. The four
common types of questions used in most objective tests are short-answer
questions, matching questions, true-false questions and multiple-choice
questions (MCQs).
The two forms of short-answer questions are direct questions and completion
questions.
True-false questions are those in which a statement is presented and the learner
indicates in some manner whether the statement is true or false.
True-false questions can be written quickly and are easy to score. Because they
can be objectively scored, the scores are more reliable than for items that are at
least partially dependent on the teacherÊs judgment.
Avoid lifting statements directly from assigned reading, notes or other course
materials so that recall alone will not permit a correct answer.
MCQs have two parts: a stem that contains the question and four or five
options that contains the correct answer called the keyed response and
incorrect options called distractors.
MCQs are widely used because they can be used to measure learning
outcomes, from simple to complex. They are highly structured with clear tasks
provided and able to test a broad sample of achievement.
MCQs are difficult to construct, tend to measure low level learning outcomes,
lend themselves to guessing and do not measure writing ability.