Professional Documents
Culture Documents
Assignment No.2
ASSIGNMENT No. 2
ANS.
Reliability is the extent to which test scores are consistent, with respect to one or more
sources of inconsistency—the selection of specific questions, the selection of raters,
the day and time of testing.
How do we account for an individual who does not get exactly the same test score
every time he or she takes the test? Some possible reasons are the following:
Test form. Many tests have more than one version or form. Items differ on each form,
but each form is supposed to measure the same thing. Different forms of a test are
known as parallel forms or alternate forms. These forms are designed to have similar
measurement characteristics, but they contain different items. Because the forms are
not exactly the same, a test taker might do better on one form than on another.
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
Test manuals and independent review of tests provide information on test reliability.
The following discussion will help you interpret the reliability information about any
test.
There are several types of reliability estimates, each influenced by different sources of
measurement error. Test developers have the responsibility of reporting the reliability
estimates that are relevant for a particular test. Before deciding to use a test, read the
test manual and any independent reviews to determine if its reliability is acceptable.
The acceptable level of reliability will differ depending on the type of test and the
reliability estimate used.
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
Validity.
Validity is the most important issue in selecting a test. Validity refers to what
characteristic the test measures and how well the test measures that characteristic.
Validity tells you if the characteristic being measured by a test is related to job
qualifications and requirements.
Validity gives meaning to the test scores. Validity evidence indicates that there is
linkage between test performance and job performance. It can tell you what you may
conclude or predict about someone from his or her score on the test. If a test has been
demonstrated to be a valid predictor of performance on a specific job, you can
conclude that persons scoring high on the test are more likely to perform well on the
job than persons who score low on the test, all else being equal.
Validity also describes the degree to which you can make specific conclusions or
predictions about people based on their test scores. In other words, it indicates the
usefulness of the test.
A test's validity is established in reference to a specific purpose; the test may not be
valid for different purposes. For example, the test you use to make valid predictions
about someone's technical proficiency on the job may not be valid for predicting his or
her leadership skills or absenteeism rate. This leads to the next principle of
assessment.
Test developers have the responsibility of describing the reference groups used to
develop the test. The manual should describe the groups for whom the test is valid,
and the interpretation of scores for individuals belonging to each of these groups. You
must determine if the test can be used appropriately with the particular type of people
you want to test. This group of people is called your target population or target group.
Conducting your own validation study is expensive, and, in many cases, you may not
have enough employees in a relevant job category to make it feasible to conduct a
study. Therefore, you may find it advantageous to use professionally developed
assessment tools and procedures for which documentation on validity already exists.
However, care must be taken to make sure that validity evidence obtained for an
"outside" test study can be suitably "transported" to your particular situationlevel,
cultural differences, and language barriers.
Q. No. 2 What are the general considerations in constructing essay type test
items.
ANS.
Essay test is a test containing questions that requires the examinee to write several
paragraphs in their own words. Generally, essay tests are designed to measure the
different abilities of examinees such as factual knowledge, language proficiency with
legible handwriting, organizing answer and time management.
The essay tests are still commonly used tools of evaluation, despite the increasingly
wider applicability of the short answer and objective type questions.
An essay test may give full freedom to the students to write any number of pages.
The required response may vary in length. An essay type question requires the pupil to
plan his own answer and to explain it in his own words. The pupil exercises
considerable freedom to select, organise and present his ideas. Essay type tests
provide a better indication of pupil’s real achievement in learning. The answers
provide a clue to nature and quality of the pupil’s thought process.
That is, we can assess how the pupil presents his ideas (whether his manner of
presentation is coherent, logical and systematic) and how he concludes. In other
words, the answer of the pupil reveals the structure, dynamics and functioning of
pupil’s mental life.
The essay questions are generally thought to be the traditional type of questions which
demand lengthy answers. They are not amenable to objective scoring as they give
scope for halo-effect, inter-examiner variability and intra-examiner variability in
scoring.
Some of these are given below with examples from different subjects:
1. Selective Recall.
2. Evaluative Recall.
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
e.g. Compare the contributions made by Dalton and Bohr to Atomic theory.
e.g. Compare Early Vedic Age with the Later Vedic Age.
5. Decision—for or against.
e.g. Which type of examination do you think is more reliable? Oral or Written. Why?
6. Causes or effects.
e.g., Joint Stock Company is an artificial person. Explain ‘artificial person’ bringing
out the concepts of Joint Stock Company.
9. Analysis
e.g. What was the role played by Mahatma Gandhi in India’s freedom struggle?
12. Classification.
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
e.g. Classify the following into Physical change and Chemical change with
explanation. Water changes to vapour; Sulphuric Acid and Sodium Hydroxide react to
produce Sodium Sulphate and Water; Rusting of Iron; Melting of Ice.
e.g. If you sat halfway between the middle and one end of a sea-saw, would a person
sitting on the other end have to be heavier or lighter than you in order to make the sea-
saw balance in the middle. Why?
14. Discussion.
e.g. Partnership is a relationship between persons who have agreed to share the profits
of a business carried on by all or any of them acting for all. Discuss the essentials of
partnership on the basis of this partnership.
16. Outline.
e.g. Outline the steps required in computing the compound interest if the principal
amount, rate of interest and time period are given as P, R and T respectively.
e.g. The student is asked to interview some persons and find out their opinion on the
role of UN in world peace. In the light of data thus collected he/she can reorganise
what is given in the text book.
e.g. After reading a lesson the pupils are asked to raise related problems- questions.
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
e.g. Can you solve this mathematical problem by using another method?
2. It is the only means that can assess an examinee’s ability to organise and present his
ideas in a logical and coherent fashion.
4. Some of the objectives such as ability to organise idea effectively, ability to criticise
or justify a statement, ability to interpret, etc., can be best measured by this type of
test.
5. Logical thinking and critical reasoning, systematic presentation, etc. can be best
developed by this type
6. It helps to induce good study habits such as making outlines and summaries,
organising the arguments for and against, etc.
7. The students can show their initiative, the originality of their thought and the
fertility of their imagination as they are permitted freedom of response.
8. The responses of the students need not be completely right or wrong. All degrees of
comprehensiveness and accuracy are possible.
10. They are valuable in testing the functional knowledge and power of expression of
the pupil.
1. One of the serious limitations of the essay tests is that these tests do not give scope
for larger sampling of the content. You cannot sample the course content so well with
six lengthy essay questions as you can with 60 multiple-choice test items.
4. The long-answer type questions are less valid and less reliable, and as such they
have little predictive value.
7. Improper and ambiguous wording handicaps both the students and valuers.
10. The scores may be affected by his personal bias or partiality for a particular point
of view, his way of understanding the question, his weightage to different aspect of
the answer, favouritism and nepotism, etc.
The teacher can sometimes, through essay tests, gain improved insight into a student’s
abilities, difficulties and ways of thinking and thus have a basis for guiding his/her
learning.
1. Give adequate time and thought to the preparation of essay questions, so that they
can be re-examined, revised and edited before they are used. This would increase the
validity of the test.
2. The item should be so written that it will elicit the type of behaviour the teacher
wants to measure. If one is interested in measuring understanding, he should not ask a
question that will elicit an opinion; e.g.,
3. Use words which themselves give directions e.g. define, illustrate, outline, select,
classify, summarise, etc., instead of discuss, comment, explain, etc.
5. Indicate clearly the value of the question and the time suggested for answering it.
(ii) Students do not have the ability to select those questions which they will answer
best;
(iii) A good student may be penalised because he is challenged by the more difficult
and complex questions.
7. Prepare and use a relatively large number of questions requiring short answers
rather than just a few questions involving long answers.
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
8. Do not start essay questions with such words as list, who, what, whether. If we
begin the questions with such words, they are likely to be short-answer question and
not essay questions, as we have defined the term.
9. Adapt the length of the response and complexity of the question and answer to the
maturity level of the students.
11. It should be a power test rather than a speed test. Allow a liberal time limit so that
the essay test does not become a test of speed in writing.
12. Supply the necessary training to the students in writing essay tests.
13. Questions should be graded from simple to complex so that all the testees can
answer atleast a few questions.
14. Essay questions should provide value points and marking schemes.
1. Prepare a marking scheme, suggesting the best possible answer and the weightage
given to the various points of this model answer. Decide in advance which factors will
be considered in evaluating an essay response.
b. Pay attention only to the significant and relevant aspects of the answer;
3. The examinee’s identity should be concealed from the scorer. By this we can avoid
the “halo effect” or “biasness” which may affect the scoring.
5. Once the assessment has begun, the standard should not be changed, nor should it
vary from paper to paper or reader to reader. Be consistent in your assessment.
6. Grade only one question at a time for all papers. This will help you in minimising
the halo effect in becoming thoroughly familiar with just one set of scoring criteria
and in concentrating completely on them.
8. If possible, have two independent readings of the test and use the average as the
final score.
ANS’
Frequency distribution tables are used to organize and summarize large amounts of
data into more manageable chunks. They help to identify patterns, trends, and insights
that may not be immediately apparent when looking at the raw data. Constructing a
frequency distribution table involves several steps, which we will outline below:
1. Determine the range of values: The first step is to determine the range of values
that your data set covers. This can be done by finding the minimum and
maximum values in your data set. For example, if your data set consists of the
ages of 50 individuals, you might find that the youngest person is 18 years old
and the oldest person is 60 years old, giving you a range of 42 years.
2. Determine the class intervals: Once you have the range of values, the next step
is to determine the class intervals. Class intervals are ranges of values that will
be used to group the data into categories or bins. The size of the class intervals
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
will depend on the range of values and the number of categories you want to
create. A common method for determining the class intervals is to use the
square root of the total number of data points. For example, if you have 50 data
points, you would take the square root of 50 (which is approximately 7) and
use this as the number of categories. If your data range was 18 to 60, you could
etc.).
3. Count the frequencies: Once you have established your class intervals, the next
step is to count the frequencies. The frequency is the number of data points that
fall within each class interval. To do this, you would go through your data set
and count how many data points fall into each category. For example, if you
had 50 data points and your class intervals were 5 years, you might find that
there are 6 people between the ages of 18-22, 12 people between the ages of
4. Create the table: The final step is to create the frequency distribution table. This
table will show the class intervals, the frequency (number of data points) that
falls within each interval, and the percentage of the total data set that falls
within each interval. You may also want to include cumulative frequencies,
which show the total number of data points that fall within each interval and all
18-22 6 12% 6
23-27 12 24% 18
28-32 10 20% 28
33-37 8 16% 36
38-42 7 14% 43
43-47 4 8% 47
48-52 2 4% 49
53-57 1 2% 50
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
58-60 0 0% 50
In this example, we can see that the most common age range is 23-27, which accounts
for 24% of the total data set. We can also see that 70% of the data falls within the age
ANS.
to provide accurate and objective information that can help parents understand their
child's strengths and weaknesses. Here are some steps that can be taken to effectively
opportunity to discuss any concerns or questions they may have. The meeting
Course: Educational Assessment and Evaluation (8602)
Level: M.Ed/ M.A. Semester: Autumn, 2022
should be scheduled at a mutually convenient time for both the parents and the
teacher.
2. Prepare for the meeting: Before the meeting, the teacher should prepare by
3. Start with positive feedback: It is important to begin the meeting with positive
positive relationship with the parents and can increase their receptiveness to
4. Share specific areas of concern: After providing positive feedback, the teacher
claims. It is also helpful to discuss any strategies or resources that can be used
perspective and any concerns they may have. The teacher should acknowledge
their concerns and work with them to develop a plan to address any issues.
6. Develop an action plan: Together with the parents, the teacher should develop
an action plan to help the child improve their performance. This plan should
include specific goals, strategies, and timelines. The teacher should also
7. Follow up: It is important to follow up with the parents after the meeting to
ensure that they understand the information that was shared and to answer any
additional questions they may have. The teacher should also continue to
provide regular updates on the child's progress and any changes to the action
plan.
involves scheduling a meeting, preparing for the meeting, starting with positive
teachers can help parents understand their child's performance and work together to
testing.
referenced testing can be useful in certain situations, it also has its advantages and
disadvantages.
that they are administered and scored consistently. This can help ensure that the
selection.
as multiple-choice questions or scoring rubrics. This can help reduce bias and
sample. This can be useful in some contexts, but may not provide a complete
others, there can be pressure to perform well. This can create anxiety and may
4. Limited scope: Norm-referenced tests are often focused on specific areas, such
as language or math skills. This can be limiting for individuals who have