Professional Documents
Culture Documents
syllabus
objectives, course design, materials, methodology as well as teacher performance.
Sometimes referred to as continuous assessment -> done over a period of time like a term or an
academic year.
FORMAL ASSESSMENT
● Examinations or any type of language test administered in class by a teacher
SELF ASSESSMENT AND PEER-ASSESSMENT
The involvement of learners in learning.
● May enhance autonomy
Aptitude tests:
Designed to predict who will be a successful language learner, they are based on
the factors which are thought to determine an individual's ability to acquire a second or foreign
language (rather than an individual's ability to use a language at the time of testing.)
● are usually large scale tests taking a long time to administer and with a number of
components, each testing a different facet of language.
● are forward-looking tests, concerned with future language learning, not with previous
learning.
● try to predict learning ability over a long period.
—-
Many proficiency tests where students solve a number of different papers, there is a mixture
of direct and indirect as well as discrete-point and integrative testing.
★ The degree of agreement between two examiners about a mark for the same language
sample is known as inter-rater reliability.
★ The degree of agreement between one single examiner marking of the same language
sample on two separate occasions is known as intra-rater reliability.
BOTH are low in first generation tests.
Second generation testing is very reliable, third generation testing is more valid.
Competence -> the ideal knowledge of language all mature speakers hold in their minds
Performance -> the imperfect realization of the language that comes out when it’s used
The type of output one might expect from a learner whose instruction has consisted of
grammatical rules, and who will therefore be asked to produce sentences to illustrate
these rules would be examples of usage. These examples can be used as an indication of
the learner's current state of competence, but will not necessarily indicate anything about the
learner's possible performance.
—
Performance teaching, and therefore performance testing, require examples of language use,
not usage.
Direct -> if it asks candidates to perform the communicative skill that is being tested.
● Tries to be as much real-life language use as possible.
● Uses examples of performance as an indicator of communicative competence.
● Tests use testing tasks of the same type as language tasks in the real-world.
Techniques:
- Speaking
- Interviewer questioning candidates about themselves
- Using pictures for candidates to compare and contrast
- Role-playing
- Writing
- Writing stories or compositions
- Information leaflets about their school or a place in their town
- Transactional letters where candidates reply to a job advertisement
- Reading
- Multiple choice questions to test comprehension of a test
- Choosing the best summary of a paragraph or a whole text
- Transferring written info to charts, graphs, maps. etc
- Listening
- Completing charts with facts and figures from a listening text.
- Identifying which speaker says what.
- Following directions on a map and identifying the correct hous eo
place.
Indirect -> tries to measure a student’s knowledge and ability by getting at what lies beneath
their receptive and productive skills.
● Try to find out about a student's language knowledge through more controlled items,
such as multiple choice questions or grammar transformation items.
● Assesses competence without eliciting performance (e.g. multiple choice, because no
production of language use from the language learner is required)
● Usually carried out through a battery of many items, each one of which only tests one
small part of the language. Each item is known as a discrete-point item.
Techniques:
- Multiple choice questions
- Cloze procedures
- Transformation or paraphrase (rewrite a sentence)
- Sentence reordering (scrambled sentences)
- Sentence fill-ins (complete the sentence logically)
- Choosing the correct tense of verbs in sentences and passages
- Finding errors in sentences
Utility: A test with high utility will give a lot of feedback to assist in the planning of the
rest of a course or future courses.
● Informal or classroom tests should have high utility, telling both the learner and the
teacher where problems exist.
● Formal examinations traditionally have low utility, with little specific feedback being given
to the learner.
● The more detailed this feedback is, the more useful it is as a tool for the future.
Objective -> It refers to test items that can be marked clearly as right or wrong (multiple choice
item)
Subjective -> requires that an assessor makes a judgment according to some criteria and
experience.
● The issue is to achieve some agreement over marks, both between different markers
(inter-rater reliability) and with the same marker at different times (intra-rater reliability).
● Productive skills (speaking and writing) lend themselves to to subjective marking
Most integrative test elements (writing composition or making an oral presentation) require
subjective assessment.
Backward-looking -> competence tests (to see to what degree it has been assimilated by the
learner)
Forward-looking assessment-> Third generation tests are better linked to the future use of
language, and their assessments of real language use also show mastery of a performance
based syllabus.
Contextualized language:
● Integrative items need a full context in order to function. The closer the items
in an integrative test are to simulating real world language trask, the fuller the
context must be.
- information about communicative purpose, role relationships, channel of
communication etc.
Disembodied language:
● Has little or no context (multiple choice test, based on language usage)
● Items bear little relevance to each other, with no purpose other than as part of a test
BAND SCALES (CRITERIA)
Holistic -> give overall descriptions of ability. Students’ performance is matched to one of the
bands (a number).
LEFT: HOLISTIC
RIGHT: ANALYTIC
Analytic -> separate out aspects of language performance into individual scales, giving a profile
of performance.
● It has more bands
● The more bands that are used the more difficult it is to make reliable descriptors of each
band.