Professional Documents
Culture Documents
based
assessment
Tim McNamara
The University of
Melbourne
Standards-based assessment
and criterion referencing
Standards-based assessment is a form of
criterion-referenced assessment (cf norm-
referenced assessment).
Information derived from a
Criterion-Referenced Test
The degree to which the
student has attained
criterion performance, for
example whether he can
QuickTi me and a
TIFF ( Uncompressed) decompr essor
satisfactorily prepare an
are needed to see thi s picture.
experimental report.
Glaser 1994 [1963], p.6
Information derived from a
Norm-Referenced Test
The relative ordering of
individuals with respect
to their test
performance, for
QuickTime and a
TIFF (Uncompressed) decompressor
are needed to see this picture.
example, whether
Student A can solve his
problems more quickly
than Student B.
Glaser 1994 [1963], p.6
Definition of a criterion-
referenced test
A criterion-referenced test is
one that is deliberately
constructed to yield
measurements that are directly
interpretable in terms of
QuickTime and a
TIFF (Uncompressed) decompressor specified performance
are needed to see this picture.
standards. Performance
standards are generally
specified by defining a class or
domain of tasks that should be
performed by the individual.
Glaser and Nitko, 1971, p. 653
Definition of a criterion-
referenced test (2)
A students score on a criterion-referenced measure provides
explicit information as to what the student can and cant do.
Criterion-referenced measures indicate the content of the
behavioural repertory, and the correspondence between what
an individual does and the underlying continuum of
achievement. Measures which assess student achievement
in terms of a certain criterion standard thus provide
information as to the degree of competence attained by a
particular student which is independent of reference to the
performance of others.
Glaser, 1963, p. 519
Norm-referenced test
Any test that is primarily designed to
disperse the performances of students in
a normal distribution based on their
general abilities, or proficiencies, for
purposes of categorizing the students into
levels or comparing students
performances to the performances of
others who formed the normative group.
Brown and Hudson (2002, p. 2)
Is CRT behaviourist?
Criterion-referenced
testing has its origins in
behaviourism, but need
QuickTime and a
not be atomistic, purely
TIFF (Uncompressed) decompressor
are needed to see this picture. dichotomous, or
reductive.
Criterion-referencing and levels
on a continuum
ELICOS placement test
---------------------------------------------------------------------------
Underlying the concept of achievement
Item Estimates (Thresholds) all on all (N = 86 L = 57)
---------------------------------------------------------------------------
3.0 |
|
|
measurement is the notion of a
|
|
|
|
|
continuum of knowledge acquisition
2.0 X
|
|
|
|
24
18
50
ranging from no proficiency at all to
perfect performance. An individuals
|
XX | 37
|
XXX | 53
XX |
XX | 39 54
1.0 XX
X
XXXXXXX
XXXX
|
|
|
|
3
32
12
13
34
25
43
35
45
36 47
achievement level falls at some point
on this continuum as indicated by the
XX | 19
|
XXXX | 27 49 51 57
XXXXXX | 38 42
XXXXX | 29 31 52
XXXXX
XXX
XXXXX
XXXX
|
|
|
|
|
4
33
15
41
16 20 40
The degree to which his achievement
-1.0 X
X
XXXXX
|
|
|
|
|
22
17
30
8
10
44
1
level is assessed by criterion-
|
|
|
|
|
6
referenced measures of achievement
-3.0
|
|
|
|
2 11
or proficiency.
---------------------------------------------------------------------
------
Scales and CRT
The standard against which a students performance is
compared when measured in this manner is the
behavior which defines each point along the
achievement continuum. The term criterion, when
used in this way, does not necessarily refer to final end-
of-course behavior. Criterion levels can be established
at any point in instruction where it is necessary to obtain
information as to the adequacy of an individuals
performance.
Glaser, 1963, pp. 519-520
Interface with policy - scales
and frameworks
Dominant movement in language
education internationally
Driven by need for accountability
and emphasis on demonstrable
outcomes
Has adopted functionalist view of
language education (i.e. not cultural,
intellectual, values dimension)
Response to demands of
globalization, efficiency
Curriculum and assessment
addressed in single framework
Emphasis on reporting
Format of standards
Standards are typically formulated as an
ordered series of statements about levels
of achievement or stages of development.
(There may be multiple sets of ordered
statements for different aspects of
language development)
CEFR Levels A2 , B1 (speaking)
A2: Can understand sentences and frequently used expressions related to
areas of most immediate relevance (e.g. very basic personal and family
information, shopping, local geography, employment). Can
communicate in simple and routine tasks requiring a simple and direct
exchange of information on familiar and routine matters. Can describe
in simple terms aspects of his/her background, immediate environment
and matters in areas of immediate need.
B1: Can understand the main points of clear standard input on familiar
matters regularly encountered in work, school, leisure, etc. Can deal
with most situations likely to arise whilst travelling in an area where the
language is spoken. Can produce simple connected text on topics
which are familiar or of personal interest. Can describe experiences
and events, dreams, hopes and ambitions and briefly give reasons and
explanations for opinions and plans.
Mislevy: claims and evidence
An assessment is a ASSESSMENT
machine for reasoning ARGUMENT
LEVEL
SUMMARIES
STRAND DESCRIPTIONS