Professional Documents
Culture Documents
Meeting 3
1. Make a clear statement of the testing ‘problem’.
2. Write test specifications
3. Write and moderate items.
4. Trial the items on native speakers
Stages of test 5. Trial the test on a group of non-native speakers
development 6. Analyse the results of the trial
(Hughes, 2003)
7. Calibrate scales.
8. Validate (for high stakes)
9. Write handbooks for test takers, test users and staff.
10. Train any necessary staff
What kind of test is it to be? Achievement (final or
progress), proficiency, diagnostic, or placement?
What is its precise purpose?
What abilities are to be tested?
Stating the How detailed must the results be?
problem How accurate must the results be?
How important is backwash?
What constraints are set by unavailability of expertise,
facilities, time (for construction, administration and
scoring)?
According to Brown
(2004) the composition
of test specifications
includes,
the outline of the test,
skills to be included,
item types and tasks.
Sample
( Brown, 2004)
Content (operation, types of text, addresses, length,
topics, structural range, vocabulary, dialect and style,
speed of processing)
Specifications Test structure ( e.g. 3 sections, expeditious reading, or
for the test no separate items, etc.)
Number of items and passages
Timing (for each section and for entire test) e.g. 30
minutes for all multiple choice questions
medium/channel, techniques (paper and pencil, tape,
computer, face-to-face, etc. and how to measure skills and
subskills)
Techniques, e.g. half of the items will be gap filling and the
Specifications other half will be MC.
criterial levels of performance (accuracy, appropriacy, range,
for the test flexibility, size) e.g. completed performance is fulfilled by
cont… 75% accuracy and correct answers.
and Scoring procedures e.g. Students answer questions in a
separated answer sheet and a set of key answer will be
provided for scoring
Sampling (considering content
validity and beneficial backwash)
Writing and Writing items (planned, precise, and
moderating clear)
items Moderating Items (proofread by at
least two colleagues and informal
trial items on native speakers)
Sample of
moderation
grammar
items
Activity: Moderating grammar items
At the whole-test level:
Descriptive statistics (mean, spread),
Reliability (internal consistency, or inter-rater)