Professional Documents
Culture Documents
Advance Educational Psychology (Test and Measurement)
Advance Educational Psychology (Test and Measurement)
EDUCATIONAL MEASUREMENT
-refers to the use of educational assessments and
the analysis of data such as scores obtained from
educational assessments to infer the abilities and
proficiencies of students
Chinese were the first people to utilize the
examination to measure ability. It was informally
begun in 225 B.C. and became a definite civil
service examination system in 29B.C. ( Ross,32)
Rev. George Fisher
- an English schoolmaster who started the
earliest scientific process of measuring
achievements of pupils.
- in 1864, he devised an instrument called
the “Scale Book”
- inventor of educational measurement.
J.M. Rice
- a school administrator who started the development of
objective tests like spelling and similar tests in arithmetic
and language in 1894 from which evolved the more
modern objective tests in different subjects.
E. L. Thorndike
- father of educational measurement.
- wrote books
1. “Mental and Social Measurements”- published in
1904 in which were found statistical procedures and tests
upon which statistical techniques and tests of today were
based.
2. “Thorndike Handwriting Scale”
- published in 1904 which assigned quantitative values
to different qualities of handwriting.
C.W. Stone
- in 1908, he constructed two tests in arithmetic which
deals on the ff:
1. fundamental operations
2. arithmetic reasoning - used in school surveys and
was considered his most notable contribution to
educational testing and measurement( Flores,2)
S.A. Curtis (1909)
- originated the concept of norms and standards
M. Hillegas
- constructed the “Hillegas Composition Scale “ in 1912
and became the basis of composition scales.
PSYCHOLOGICAL MEASUREMENT
-is the development of procedures to measure people‘s
characteristics like intelligence or personality.
Gustav Theodore Fechner
-published a book entitled “Elemente der
Psychophysic”which contained the beginnings of
quantitative psychological measurement.
QUANTITATIVE PSYCHOLOGY
-the American Psychological Association
defines Quantitative Psychology as "the study of
methods and techniques for the measurement
of human attributes, the statistical and
mathematical modeling of psychological
processes, the design of research studies, and
the analysis of psychological data".
H.L. F. Helmholtz
- his greatest contributions were his experiments in the
sensory fields of vision and hearing and in the
measurement of the speed impulse and reaction-time.
( Flores,3)
Wilhelm Wundt
-established the first psychological laboratory in Leipzig,
Germany.
- His experiments on visual, auditory, cutaneous, olfactory
and others resulted in the study of individual differences in
physical and mental traits.
Francis Galton& J. Mc Keen Cattell
- conducted earliest experiments in learning about the mind.
Galton- devised a whistle and tests that measured discrimination
of heat, weight, pain, and sound. He also measured mental
imagery.
- his most important contribution was his statistical analysis
in which he suggested a graphical method of correlation.
Cattell - made tests which he used in measuring keenness of
sight and hearing, rate and accuracy of movement, ability to
perceive weight, pitch and time , sensitivity to pain, imagery
and memory. He suggested the term mental tests in 1890. (
Ross, 41)
Alfred Binet
- a French psychologist who was the first to devise an
instrument to measure the more complex mental functions.He
introduced the term “mental age”.
Stern - he introduced the concept of representing
intelligence as the ratio of mental age to chronological age and
suggested the term “mental quotient” in 1912 which became
I.Q.( Ross,26)
Group Intelligence Test
- started during the first world war . It was tried with the
recruits in the United States Army. It was first constructed by
Arthur Otis with the help of Robert Yerkes and Lewis
Terman.
Group Intelligence Tests Developed in 1917
1. Army Alpha- verbal test
2. Army Beta- performance test
Pintner and Paterson
- are the first men to construct a performance test to
measure the intelligence of deaf children called Pintner-
Paterson Performance Scale
Carl E. Seashore
- a pioneer in the measurement of aptitude. He constructed
the first test to measure the musical ability called the
“Seashore Test of Musical Ability which came out in 1915.
Other Aptitude Tests
1. Stenquist Test of General Mechanical Ability (1918)
2. Mathematical ability test by Rogers (1918)
3. McAdory Art Test- one of the earliest art test developed by
Margaret McAdory Siceloff published in 1933.
Personality Tests
- is very useful in industry, business and in the government. It is
used to measure traits such as attitude, temperament, social
ability, temperament, values, emotion, vocational interests,
extroversion, introversion, levels of dominance and
submissiveness.
- Galton- most inf luential in the development of tests to
measure traits.
Famous Scales to Measure Traits
1. Scott Man- to –Man Scale- used extensively with soldiers
during the first World War.
2. Woodworth Personal Data Sheet- used to measure the
ability of soldiers to adjust to army life also during the war
years.
3. X-O Test – published in 1919 by Pressey. It was used to
measure emotionality.
4. Test to measure Introversion-Extraversion- made by
Marston
5. Test to measure ascendance-submissiveness- made by
Allport
6. Test to measure social attitudes and interests- by Hart in 1925
Measurement and Evaluation
Measurement-is the process of determining the
quantity of achievement of learners by means of
appropriate measuring instruments.
Types of Validity
a. Content validity- refers to the relevance of the test to the
subject matter or situation from which they are taken.
b. Concurrent Validity- refers to the correspondence
of the scores of a group in a test with the scores of the
same group in a similar test of already known validity used
as criterion.
CATEGORY WEIGHT
Very adequate 5
Adequate 4
Fairly adequate 3
Inadequate 2
Very inadequate 1
Legend:
Very adequate- means that all in class can be supplied with
the needed facilities at the same time.
Adequate- means that the important facilities are just enough
for the users with some minor facilities missing.
Fairly Adequate- means that a few important and some
minor facilities are missing
Inadequate- means that about half of the needed important
and minor facilities are missing
Very inadequate- means that more than half of the needed
important and minor facilities are missing or not available.
The Rating Scale
FACILITY 5 4 3 2 1
Textbooks
Library references
Laboratory equipment (may be
itemized)
Audio-visual aids (may be itemized
How to compute the Weighted Mean
Use the formula WM= TWF
N
in which
WM= the weighted mean or average
TWF=the total of the products of the weights multiplied
by their corresponding frequencies (raters)
N=the number of raters or their total frequency
NON-TEACHING AID FACILITIES
-refer to classrooms,their sizes, lighting,
ventilation, temperature, freedom from
distractions, and seats, chalkboards and
necessary classroom fixtures. These are
also concerns of evaluation.
Classrooms, seats chalkboards and fixtures may
be rated as very adequate, adequate, fairly
adequate, inadequate and very adequate.
lighting, ventilation, temperature, freedom from
distractions and conditions of classrooms and
seats and fixtures may be rated as very good,
good, fair, poor ,very poor.
Testing Program Composition
1.Standard psychological tests
2.Standard educational ( achievement) tests;
3.Teacher –made achievement tests;
4. Diagnostic tests, either standard or teacher –
made;
5. Health tests ( examinations);
Standard Psychological Test
- It measures:
a. General mental ability (intelligence)
b. Aptitude (ease with which one learns a
certain activity.
c. Personality traits
d. Interests , either personal, vocational, or
professional
Teacher -Made Examinations
-results are usually for diagnostic, marking, and
promotion, as well as evaluation purposes.
Ex. Suppose a class performs poorly in a test.
What are the implications?
The low scores may have been caused by the following:
1.Poor teaching
2.Mentally weak students
3.Inadequate study materials such as textbooks and
references and audio-visual aids.
4.Poor study habits of the students
5. Poor construction of the test
7. Unusual distractions during the test period.
8. Poor conditions of the room : poorly lighted,
poorly ventilated, too hot or too cold.
The teacher-made test must also be judged as to
whether they are properly designed or
constructed to be able to measure effectively the
expected learning outcomes.
5-PointRating Scale
Tests- may be rated as very adequate, adequate, fairly
adequate, inadequate, very inadequate
ex.Individual tests (Psychological tests)
Validity, reliability, usability of tests
- may be rated as very high, high, fairly high,, low, or
very low
The effectiveness of the testing program in measuring the
learning outcomes may be rated as very effective,
effective, fairly effective, ineffective,
very ineffective.
The Learner
Various methods of appraising the learners:
1. The administration of the standard
psychological tests, standard achievement test,
and teacher-made tests.
2. Comparing performance in I.Q tests and
performance in teacher-made tests.
3. Comparing present with previous performance.
4. Observation
5. Questionnaire and/or interview
6. Health Examination
7. Autobiography and philosophy of life
8. Parental interview
9. Visitation
Marking and Grading
The Evaluation Committee has to approve
policies to serve as criteria for evaluating
achievement scores.
Among the policies to be formulated are:
1. The quality symbols to be used in grading,
whether rank, percent, or letter symbols with
their respective descriptive equivalents.
2. The different factors to be included in the
evaluation such as the results in quizzes, tests,
projects etc. with their respective weights.
3. An evaluation standard indicating the level of
performance to be awarded a certain mark
symbol
4. The lowest passing score or grad and the
highest grade.
5. The number of level of grades whether 3,6,7,9
etc.
6. The multiples of grades, in multiple of 1,5 or 25 etc.
7. The method/s of transmuting raw scores into
grades.
8.The methods of computing the final grade:
averaging or cumulative
9.The lengths of tests and frequency of giving them.
10. What are to be measured by tests: factual
knowledge, understanding, creative and critical
thinking, reasoning power, judgment, skills,
values, ability to organize ideas, etc.
THE TEACHER
Among the methods used by some schools to
evaluate the teacher are the following:
1. Observation of actual teaching work.
2. Examination of lesson plans, syllabi, tests and test
papers of students.
3. Peer observation and evaluation.
4. Student evaluation
5. Self-evaluation
6. Meetings and conferences
7. Test performance of students
8. Observation of campus activities
9. Questionnaire and/ or interview
10. Administration of psychological tests
11. Autobiography and philosophy of life
12. Educational qualifications
13. Medical and dental examinations
14. Visitation
Characteristics of the Teacher that are Rated
1.Mastery of the subject 7. Health
matter 8. Attitudes
2. Punctuality and attendance
9. Emotional and social ability
3. Strategies
4. Classroom management 10. Approachability
5. Ability to gather and 11. Productive scholarship
organize instructional 12. Leadership qualities
materials 13. Cooperation
6. Professional growth
Rating the Teacher
The teacher may be rated using the 5-point
rating scale as follows:
Very good- 5
Good - 4
Fair- 3
Poor- 2
Very poor- 1
Characteristics Rating
5 4 3 2 1
1.Mastery of the subject matter _ _ _ _ _
2. Punctuality and attendance _ _ _ _ _
3. Strategies _ _ _ _ _
4. Classroom management _ _ _ _ _
5. etc.
HOW TO COMPUTE THE AVERAGE
IF THE RATERS ARE MANY