Professional Documents
Culture Documents
Evaluation 28.11.19
Evaluation 28.11.19
Evaluation is the collection, analysis and interpretation of information about any aspect of an
educational program as part of a recognized process of judging its effectiveness, its efficiency and
any other outcomes it may have. It is concerned with assessing the effectiveness of teaching,
teaching strategies, methods and techniques. It provides feedback to the teachers about their
teaching and the learners about their learning.
What is Test?
In the classroom situation, the word test refers to formal and systematic (usually paper & pencil)
procedure to judge the knowledge, understanding and intelligence of the student.
What is Measurement?
It is a procedure of assigning numbers to a specified attribute in such a way that the numbers
indicate the degree to which the student possesses that attribute.
What is feedback?
It is the act providing information for the ongoing observations and assessment to the learners.
What is Evaluation?
It is a broader term than Test and Measurement which includes all types and examinations; to
check the knowledge as well as all aspects of the learner.
Evaluation is a process of judging / determining the value or something by certain appraisal (tool) to
assess the extent to which educational objectives are being realized.
Based on person
Internal evaluation – carried out by the teachers and IQAC
External evaluation – carried out by the external agency other than the college (MUHS)
Process of Continuous Internal Evaluation (CIE) System / Formative Evaluation
1. Strategic Plan of Course Curriculum (theory & practical subject)
2. Allotment and responsibility of Subject Experts for CIE
3. Communication of CIE of respective theory / practical subject
4. Mechanism of CIE (attendance, class test, midterm & pre-final exam, IA marks)
5. Grievance Red-ressal
6. Grading of Internal Assessment Marks
Test construction
Different evaluation tools are constructed and evaluated by the teachers to measure the TLS of any
given educational institute. Test construction is a systematic and well organized process where the
set of activities involved in planning, developing and evaluating the test items concerned to
assessment of cognitive (knowledge), conative (skill) or affective (attitude) domain of learner.
There are various strategies in test construction where the test items are created and decided upon
to measure the learning behavior. Generally, for all educational test construction, the following five
steps are used -
Planning the test
Preparing the preliminary draft of the test
Trying out the preliminary draft of the test
Evaluating the test (validity and reliability)
Construction of the final draft of the test
1. To assess the learning behavior of students (Cognitive, Conative and Affective domain)
2. Test paper helps in clarification of educational objectives are really being achieved
3. The test results helps in identifying the problems in teaching and learning process.
4. The written test / exam helps in self appraisal of students as well as teachers
5. The test results motivate the students as well as teachers
6. The written exam promotes innovative instructional strategies
7. Regular written tests helps the students in progressive development at academics
8. Allows development of writing skill – selection of words, composing sentence, organize
paragraph, grammar, spelling etc.
9. The essay tests allows the students to express their own ideas
10. Helps in identification of advanced learner, average learner and slow learner
The objective type test are called selection type as they are answered by just writing one or two
words, or numerals, fill in the blanks, choosing one cut of multiple responses given, etc.
This type of questions administered on paper or on a computer, where the student is required to
choose a right option from two or more alternative in the item. The choices may be True/False,
Yes/No, High/Low, Agree/Disagree and so on.
2. Plausible answers: all responses in first column should be plausible answers to the premises
in second column otherwise; the test loses some of its reliability because some answers will
be “give-aways.”
3. Clueless: ensure the premises don’t include hints through grammar (like implying the
answer must be plural) or hints from word choice (like using the term itself in a definition).
4. Unequal responses: it is an ideal that we should present more responses than premises, so
the remaining responses don’t work as hints to the correct answer.
5. Limited premises: due to the capacity limitations of working memory, avoid a long list of
premises in the first column; usually 5-6 items, even less might be better.
6. One correct answer: every premise should have only one correct response. Obvious, but
triple-check to make sure each response can only work for one premise.
7. Long and complex statement: should be kept as premises so that each is read once. Shorter
and simpler sentences are kept as responses where they can be read quickly.
Rearrangement items
In this type of items the learner is required to put in order a set of randomly presented material in
terms of size / event / time etc.
Example: following are the steps of nursing research. Arrange them in order of hierarchy –
1. Validity and reliability
2. Conceptual framework
3. Analysis and interpretation
4. Data collection
5. Hypothesis
Strengths of checklist
1. Inexpensive and widely used tool
2. Allows to measure wide range of conative (skill) and affective (attitude) domain
3. Can test application of knowledge in to practice
4. Scoring is easy, quick and consistent
5. Useful for large group
6. Can be used for self appraisal
Limitations of checklist
1. Measures specific characteristic whether present or absent
2. Need long time for preparation list
3. Need skill for tool construction
4. Sometimes, scores might be affected by the observer / interviewer
Numeric rating scale: is a tool contains quantitative symbolization of an attribute where the
number measure or rate the qualitative phenomena or performance of something. For example:
5 4 3 2 1
(5 outstanding, 4 above average, 3 averages, 2 below average, 1 unsatisfactory)
0 1 2 3 4 5 6 7 8 9
(extremely satisfied) (extremely dissatisfied)
Graphic rating scale: is a tool contains bipolar descriptive continuum which has specific qualitative
attribute from lowest to highest. For example:
Descriptive graphic rating scale: is a tool contains bipolar descriptive continuum which has
descriptive qualitative attribute. It does not use numbers; but divide the assessment into series of
verbal phrases to indicate the level of performance. For example:
Ability of IM injection