Professional Documents
Culture Documents
2. Distinguish between formative and summative feedback and give examples of each.
3. Provide two guidelines for giving and two guidelines for receiving feedback.
Measurement
Measurement refers to decisions about how the achievement of objectives will be
operationalized or quantified. In writing complete instructional objectives, a process which
has been discussed in a previous chapter, it is the part of the statement that specifies the
evidence that will be used to determine whether or not the goal has been accomplished.
Testing is one kind of measurement, and usually refers to students' opportunity to respond
to an identical set of questions under controlled conditions. Effective assessment measures
should be both valid and reliable.
A valid measure is one that reflects what it claims to reflect. For a test to be a valid
measure of students' mastery of a set of objectives, it should include representative
questions for all the objectives, not be concentrated on one or two of them. A valid measure
of whether or not students have been successful at learning how to play the piano would by
necessity include their demonstration of performance skills, since even correctly answering
100 percent of a set of questions about how to play the piano will not be a valid indicator
that a student can actually do it. Determining the validity of measures of affective outcomes
is sometimes less clear-cut than measuring objectives in the other learning domains;
however, if one of a teacher's goals is to increase students' joy of reading, it is important to
think about whether their checking out more books from the library is a valid reflection of
their enjoying reading books or if it in fact reflects their getting points toward their grade for
each book read.
A reliable measure is one which is accurate and consistent. Three typical ways of
assessing the reliability of paper-and-pencil tests are the test-retest method, the equivalent
forms method, and the split-half method. In the first instance, if giving the same test to the
same group of students within a short period of time results in similar scores, the test is
judged to be reliable. In the second instance, if two equivalent forms of a test are
developed, covering the same material, reliability can be determined by comparing the
scores on the two forms. In the third instance, the scores for even-numbered and
odd-numbered items on longer tests can be compared to one another to indicate whether
they provide a consistent profile of student mastery. Assessing the rating reliability on essay
tests, project reports, and performances is more challenging. Teachers might occasionally
want to put aside a set of graded papers and reread them at a later time (without referring
to the previously recorded grade) to see whether their judgments are consistent. They
Evaluation
Evaluation is a judgment of merit or worth, often communicated via grading.
Assessment is not necessarily evaluative, nor does it necessarily have to lead to an ultimate
grade. Even when a test or assignment is evaluated as to its relative worth (that is,
students are given a report of how well they did), the evaluative information should be
accompanied by descriptive information which tells students what they did, or are doing,
well and not so well, and how they can do better. In addition, it is often appropriate to
provide descriptive feedback without tacking on an evaluative assessment. The next two
sections of this chapter will deal with these two kinds of information provided by
assessment-based feedback.
Evaluative Feedback
1. The first in their respective classes, orderly and attentive and have made the
most flattering improvement.
2. Orderly, correct and attentive and their improvement has been respectable.
3. They have made very little improvement and as we apprehend from want of
diligence.
4. They have learnt little or nothing and we believe on account of escapade and
idleness. (Milton, Pollio & Eison, 1986, p. 4)
By the 1830s numerical scales became popular. Some schools used a 4-point scale,
some a 9-point scale, some a 20-point scale, and some a 100-point scale. In 1850 the
University of Michigan adopted a pass/fail system; however, by 1860 a "conditioned" level
had been added and in 1864 a 100-point scale was incorporated, with a minimum of 50
required for a pass. Meanwhile, other schools which were using three-level evaluations
(Passed, Passed With Distinction, and Failed) added plus and minus signs so that students
who "Passed With Distinction" could be distinguished from those who merely "Passed With
Distinction --.” There appeared to be an ongoing inclination toward making finer and finer
distinctions among students' relative degrees of success.
At the end of the nineteenth century, the 100-point scale had become quite popular,
with the numerical scores translated into letter symbols to separate students into five
achievement groups. Shortly after the turn of the century, the curve came into being at the
University of Missouri, as a response to an uproar over a professor who had failed an entire
class. The top 3 % of students in a class were thenceforth to be labeled excellent (A), the
next 22 % judged superior (B), the middle 50 % to be assessed as medium (C), the next 22
% rated inferior (D), and the bottom 3 % to fail (F). By the end of World War I the curve
had caught on, coupled with an era of "objectivity" in testing -- true/false and
multiple-choice tests were the hot trends in a new climate of "scientific" evaluation.
Norm-referenced curves, using a 5-point A - F scale, remained the predominant
grading philosophy until the 1960s, when a wave of educational humanism led to adoptions
Descriptive Feedback
Feedback To Students. As we have seen, evaluative feedback -- which is
communicated in the form of some sort of grading system -- is likely to require a descriptive
explanation so that its intended meaning can be interpreted. Without descriptive feedback,
a student will not know why a paper earned a "C" rather than an "A" and will be left
guessing as to how to improve on the next paper. Without descriptive feedback, a parent
will not know whether her child is being evaluated on a norm-referenced or
criterion-referenced basis, or what kind of hybridization entered into the final grade. In
Making Judgments
In making the first decision, deciding how students will be judged, teachers should
be compulsively explicit about what will constitute varying degrees of success and what will
constitute failure. Clearly specified instructional objectives are a means of doing so,
particularly in a mastery learning system where evaluation is limited to an assessment of
whether or not an objective has been mastered or needs further work. In schools in which
graduated grading scales are used, the kind of schools in which most of us work, the basis
on which various grades will be assigned should be clear to everyone involved -- students,
parents, administrators, and the teachers themselves. The measures used in assessment
should themselves be assessed in terms of both their validity (in which case clearly defined
instructional objectives again come into play) and their reliability. Having done these things,
teachers can direct all sorts of passionate, creative energy into devising ways to help
students excel in meeting the goals of the course of study -- but when it comes to
evaluating how students have done, the process should be a dispassionate one of matching
performance to performance criteria.
While some students may be unhappy about an outcome, they will be more resentful
of inconsistency. If they are told what they will have to be able to do and are assessed in
terms of something else their affect for the teacher -- and probably the subject -- will be
diminished. If some students are assessed by different criteria than other students, affect
among students will be compromised. Teachers may be unhappy that some students did not
do as well as they would have wished, and continue to consider ways to modify the
instructional process, but they will be absolved from "giving" grades.
Communicating Judgments
The second decision teachers must make is how to communicate their judgments to
students. We must provide more information to students about their performance than just
their grades. Descriptive feedback can be reinforcing and encouraging. Even work that is
honestly and fairly evaluated as below standard can be returned with positive as well as
corrective comments. Regular formative feedback will help many students do better than
they would have done without it, and will give them an indication of how they are doing and
how they can do better before they are formally evaluated. Judgments about the student as
Learning Orientation
High Low
High LO Low LO
High
High GO High GO
Grade
Orientation High LO Low LO
Low
Low GO Log Go
Students who are high in both learning orientation and grade orientation would seem
to be a teacher's ideal, to want to make learning personally relevant but also to perform
well. They are, in reality, often the students with the highest test anxiety and a strong need
to validate their intrinsic interest in learning with extrinsic indicators that tell them they are
doing OK. High LO/High GO students are likely to be very responsive to all the feedback