Professional Documents
Culture Documents
Quality assessment takes center stage on the learning process. In fact, it is a vital
component of the instructional process. The evaluation and judgment of a teacher on student
performance are based on information obtained in using assessment instruments whose quality is
of paramount importance. Every teacher should have the necessary skill to develop quality test
items. It is a teacher’s mandate to procure for the learners the optimum evaluation strategy.
Teachers who create effective tests, develop remedial instruction and allow students
several attempts to elicit success can improve their teaching method and facilitate student
learning. When instructional process incorporates effective classroom assessments so as to make
them the central feature in student learning, both students and teachers derive unlimited benefits.
2. Reliability - The reliability of an assessment method refers to its consistency. It is also a term
synonymous with dependability or stability. It is the extent to which an assessment tool produces a stable
and consistent result.
Types of Reliability What it is? How do you establish it?
3. Practicality and Efficiency - Practical test is a test that is developed and administered within the
available time and with available resources. In other Moreover, a test should be easy to design, to
administer, to mark and to interpret as to results. Efficiency, in this context, refers to the development,
administration and grading of assessment with the least effort and resources.
4. Fairness. The fairness of a test refers to freedom from any biases. Your students must know exactly what
the learning targets are and what method of assessment will be used. They have to be informed how their
progress will be evaluated in order to make strategies and perform optimally.
Other aspects of fairness include:
1. Opportunity to learn further;
2. Pre-requisite knowledge and skills;
3. Avoidance of student stereotyping
4. Avoidance of bias in assessment procedures; and
5. Accommodating special needs and requirements
Knowledge and
Simple 5 4 3 4 3 3
Understanding
Deep Understanding
and Reasoning
2 5 4 4 2 3
Skills 1 3 5 2 5 3
Products 1 1 5 2 4 4
Affects 1 2 4 4 4 5
Formats of TOS
1. One-way TOS. A one-way TOS maps out the content or topic test objectives, number of hours spent,
format number and placement of items. A one-way TOS cannot ensure that all levels of cognitive behaviors
that should have been developed by the course are covered in the test.
Sample 1. One-Way Table of Specifications
Time Spent on Percent of Class Number
Topics Test Placement
Topic (in hours) Time on Topic of Items
2. Two-Way TOS. A two-way TOS reflects not only the content, time spent, and the number of items but
also the levels of cognitive behavior targeted per test content. One advantage of this format is that it allows
one to see the levels of cognitive skills and dimensions of knowledge that are emphasized by the test.
Sample 2. Two-Way Table of Specifications
Item Specification
The matching test item format requires learners to match a word, sentence or phrase in one column
to a corresponding word, sentence or phrase in second. Column. It is most appropriate when you need to
measure the learner’s ability to identify the relationship or association between similar items. However, it
is not suited for gauging the learners’ higher understanding (analysis and synthesis levels). It can only be
used to assess homogeneous knowledge.
The following are some guidelines in writing good and effective matching type tests:
1. Include homogenous premises and responses in a single matching exercise
2. Clearly indicate in the directions the basis for matching where answers should be written, and if
responses or answer choices can be used more than once.
3. Keep the list relatively short. The ideal number of items is 5 to 10, and a maximum of 15.
4. Arrange premises and responses with maximum clarity. It is desirable to use longer statements as
premises and numbered at the left of the page. The shorter responses are placed at the right and each
identified with letters.
5. Have more responses or answer choices than premises. This will reduce guessing and using the process
of elimination in choosing the correct answer.
6. Place all the premises and responses on a single page
Faulty:
Directions: Match the following.
Food A. Primary reinforcer
Psychoanalysis B. Sigmund Freud
B.F. Skinner C. Operant conditioning
Standard deviation D. Measure of variability
Schizophrenia E. Hallucinations
Good:
Directions: Match the theories in Column I with their advocates in Column II. Write the letter of the correct
answer.
Column I Column II
___ 1. Psychodynamic Theory A. Albert Bandura
___ 2. Trait Theory B. B.F. Skinner
___ 3. Behaviorism C. Carl Rogers
___ 4. Humanism D. Gordon Allport
___ 5. Social Learning Theory . Karn Horney
F. Sigmund Freud
Essay Test
Essay test is the preferred method of evaluation when teachers want to measure learners’ higher
order thinking skills particularly their ability to reason, interpret, analyze, synthesize, and evaluate.
Santos, et al (2007) and Balagtas et al 92019) present the following rules of thumb in constructing good
essay questions:
1. Clearly define the intended learning outcomes to be assessed by the essay test.
2. Refrain from using essay test for intended learning outcomes that are better assessed by other kind
of assessment.
3. Phrase the direction in such a way that students are guided on the key concepts to be included.
Example: Write an essay on the topic: “Plant Photosynthesis” using the following key words and
phrases: chlorophyll, sunlight, water, carbon dioxide, oxygen, by-product, stomata.
4. Note that the learners are properly guided in terms of the keywords that the teacher is looking for
in this essay test.
5. Inform the students on the rubrics to be used for grading their essays. This rule allows the learners
to focus on relevant and substantive materials rather than on peripheral and unnecessary facts and
bits of information.
6. Present tasks that are fair, reasonable and realistic to students
7. Be specific in the prompts about the time allotment.
Item Analysis
After drafting objective test items and administering it, how do you determine if the test items are
properly constructed as to degree of difficulty? How do you set apart students who excel well on the
overall test, and those who do not? An item analysis, as a valuable procedure, can easily provide the
teachers with answers to both questions.
Here are the basic concepts of item analysis:
Item analysis is a technique which evaluates the effectiveness of items in tests. It helps to improve
the test by revising or discarding ineffective items.
An item analysis provides three kinds of important information about the quality of test items.
Item difficulty: A measure of whether an item was too easy or too hard.
Item discrimination: A measure of whether an item discriminated between students who knew the material
well and students who did not.
Computation
Item 1 Item 2 Item 3 Item 4 Item 5
= 0.67 + 0 = 0.67+0.33 = 2.0 + 0.67 = 1.00 + 0.33 = 1.00 + 0.33
2 2 2 2 2
Index of 0.33 0.50 0.83 0.50 0.67
difficulty
Item Difficult Average Easy Average Average
difficulty
4. The index of discrimination is obtained using the formula:
Item discrimination = pH - pL
The value is interpreted using the table:
Index discrimination Remark
0.40 and above Very good item
0.30 - 0.39 Good item
0.20 - 0.29 Reasonably good item
0.10 - 0.19 Marginal item
Below 0.10 Poor item
Statistics plays a vital role in the complexities of life. It aids in decision making, summarizes or
describes data, helps to forecast or predict future outcomes, aids in making inferences, and helps in
comparisons or establishing relationships. In education, statistics give information about the school's
population change (statistics in enrolment and dropout rate), assist in processing specific evaluations, and
surveys weregiven to improve the school system and evaluate the achievements, grades, and in
preparations of the test (proficiency level).
There are two divisions in statistics, which are descriptive statistics and inferential statistics.
Descriptive Statistics is a statistical procedure concerned with describing the characteristics and
properties of a group of persons, places or things that based on confirmable facts. It organizes the
description, presentation and interpretation of data gathered.
Inferential Statistics is a statistical procedure used to draw inferences from the population by obtaining
information from the sample by using techniques of descriptive statistics.
Classification of Variables
Nominal: data are categorical and the numbers are used as identifiers or a representation. The
numbers on the back of a jersey (COED Blazer 1 = Juan dela Cruz) and the social security number are some
examples of a nominal data. If you conduct a survey and you will include gender as a variable, code the
Female as 1 and Male as 2 or vice versa when you enter your data into the computer. Thus, using numbers
1 and 2 can be used to represent the categories of data.
Ordinal: it denotes an ordered series of associations or rank order. In a contest, an individuals are
competing to achieve first, second, or third place. The first, second, and third place represents ordinal data.
If Rose takes first and Willy takes second, we do not know if the competition was close; we only know that
Rose outperformed Willy. Likert-type scales also represent ordinal data. Basically, these scales do not
represent a measurable quantity. An individual may respond 8 to a question and he actually feel less than
someone who responded 5. Another person may not be in half as much pain if he responded 4 than if he
responded 8. This data may only indicate that an individual responded 6 is in less pain than a person
responded 8 and in more pain than a person responded 4. Therefore, Likert-type scales represent a ranking.
Interval:it represents a quantity and has equal units in which zero indicates an additional point of
measurement is an interval scale. For Example 10 degree Fahrenheit or -10 degrees Fahrenheit are an
interval data. Each of these scales are a direct measures of a quantity with equality of units. Thus, zero does
not represent the absolute lowest value. Rather, it is the point on a scale with numbers both above and
below it.
Ratio: it is a scale of measurement which is similar to the interval scale that represents quantity
and has equality of units. However, ratio has an absolute zero (no numbers exist below zero). It is
commonly used in physical measures like height and weight. If one is measuring a height of a person in
centimeters, there is quantity, equal units, and that measure cannot go below zero centimeters. A negative
height is not possible.
The table below shows a summary of fundamental differences between the four scales of
measurement
DATA COLLECTION
Data collection is gathering information from some person or some other ways to get data. Data collection
is done to keep on record for further use, to make essential decisions about different problems, and to
disseminate information on to others.
Primary Data - the collection of data from the first-hand source. This type of data is mostly pure and
original.
Secondary Data –the collection of data from the second-hand source. Information could be from another
researcher or agency.
DATA-GATHERING TECHNIQUES
Method Characteristics Advantages Disadvantages
Direct or
Researcher has direct contact to Clarification can do Costly and time-
interview
the respondents easily consuming
method
Researcher gives or distributes
Indirect or questionnaire to the respondents Saves time and money;
questionnaire either by personal delivery or by A large number of Problem of retrieval
method mail samples can reach
TEXTUAL PRESENTATION
Data presented in a paragraph or in sentences are said to be in textual form. This includes an
enumeration of essential characteristics, emphasizing the most significant features, and highlighting the
most striking attributes of the set of data.
Example:
According to a rapid survey conducted by the government, 77 percent of micro and small firms and
62 percent of medium-sized firms had to close due to the enhanced community quarantines. Those that
remained open suffered a 66.5 percent drop in sales.
The growth forecast for 2020 assumes that the containment measures will gradually ease in the
second half of the year, and economic activities return in some sectors of the economy. Given income losses
and heightened uncertainty, household consumption and private investment are expected to remain weak.
However, economic growth prospects and poverty figures are expected to improve in succeeding
years driven by a rebound in consumption, a stronger push in public investment, supportive fiscal and
monetary policies, and the recovery of global growth. Economic growth is projected to return to above 6
percent in 2021 and 7 percent in 2022. Increased economic activity surrounding national elections will also
boost growth in 2022.
(Philippines: Social Assistance to Poor Households, Support for Small Enterprises Key to Broad-Based
Recovery; http://worldbank.org; June 9, 2020)
TABULAR PRESENTATION
The tabular method makes use of rows and columns. The data are presented in a systematic and
orderly manner, which catches one's attention and may facilitate the comprehension and analysis of the
data presented.
1. Qualitative or Categorical FDT – A frequency distribution table where the data are grouped
according to some qualitative characteristics; data are grouped into non-numerical categories.
Table 2
Frequency Distribution of Gender of the Respondents
Gender Number of Respondents
Male 77
Female 45
Total 122
2. Quantitative FDT – a frequency distribution table where the data are grouped according to some
numerical or quantitative characteristics.
Table 3
Ungrouped Frequency Distribution for the
Weights of 50 Students in Prof Ed 6 Class
WEIGHT (in kg) FREQUENCY Ta
49 2 ble
50 3 4
51 5 Gro
52 7 upe
53 7 d
54 0 Fre
55 0 que
56 0 ncy
57 0 Dis
58 12 tri
59 0 but
60 7 ion
61 0 for
62 4 the
63 2 We
64 1 igh
Total 50 ts
of
50 Students in Prof Ed 6 Class
WEIGHT FREQUENCY
(in kg)
48 – 49 2
50 – 51 8
52 – 53 17
54 – 55 0
56 – 57 0
58 – 59 12
60 – 61 7
62 – 63 6
64 – 65 1
Total 50
Steps in Constructing Grouped FDT
1. Determine the range
R = 57 – 22 = 35
4. List the limits of each class interval. Preferably, lower limit of the lowest class interval
is a multiple of the class size of the class interval
Example: 20 – 24
20 is the lower limit, and 24 is the upper limit
Table 5
Frequency Distribution Table of the scores
in the First Quiz in Prof Ed 6
Class Intervals Frequency
20 – 24 2
25 – 29 6
30 – 34 8
35 – 39 11
40 – 44 10
45 – 49 9
50 – 54 3
55 - 59 1
N 50
A complete grouped frequency distribution table has a class mark or midpoint (x), class
boundaries (c.b), relative frequency (rf), cumulative frequencies, (cf) and relative
cumulative frequency.
Class mark – the midpoint of the class interval getting the average of the upper and lower
limits
�� + ��
�=
2
Table 7
The Contingency Table for the opinion of viewers on the New TV Program
Samples
Choices Total
Men Women Children
Like the Program 59 67 32 158
Indifferent 21 32 12 65
Do not like the Program 46 12 78 136
Total 126 111 122 359
Table 7 is a 3 x 3 table since it has 3 columns and 3 rows. The samples enumerated
in columns are men, women, and children while the choices or alternatives enumerated in
rows are: like the program, indifferent and do not like the program. Column and row totals
are not included in the count.
2. Line Chart – a graphical presentation of data especially useful for showing trends over
a period of time.
3. Pie Chart – it is a circular graph that is useful in showing how a total quantity is
distributed among a grouped of categories. Each pieces of pie represent an amount on
the total portion of the category.
4. Column and Bar Graph – like pie charts, column charts and bar charts are applicable
only to grouped data. It is used for DISCRETE grouped data of ordinal or nominal scale.
1. Frequency Histogram – a bar graph that presents the classes on horizontal axis
and thefrequencies of the classes where on the vertical axis. The vertical lines of the
bars are on the class boundaries, and the height of the bar corresponds to the class
frequency.
2. Frequency Polygon – a line graph that is constructed by plotting the frequencies at
the class marks connecting the plotted points by means of straight lines and
encloses the polygon by adding an additional class at each end, for which the ends of
the line are connected to the midpoints of the additional classes at the horizontal
axis.
3. Relative Frequency Histogram – it displays graph in which horizontal axis
represetns the classes and the vertical axis represents the relative frequencies.
4. Ogives – forms a graph of the cumulative frequency (cf) distribution
a. <ogive – the less than cf is plotted against the Upper true class boundary
b. >ogive – the greater than cf is plotted against the Lower true class boundary
Numerical descriptive measures which indicate or locate the center of a distribution of a set
of data.
SUMMATION NOTATION
�� = �1 + �2 + … + ��
�=1
The number 1 and n are called the lower and upper limits of summation, respectively.
Example: Write out the following in full, that is, without summation signs:
5
1. ) �� = �1 + �2 + �3 + �4 + �5
�= 1
4
2. ) �� �� = �1 �1 + �2 �2 + �3 �3 + �4 �4
�= 1
Rules on Summation
1) The summation notation is distributive over addition.
� � �
�� + �� = �� + ��
�= 1 �= 1 �= 1
2) If c is a constant, then
� �
��� = � ��
�= 1 �= 1
�
3) If c is a constant, then �= 1
� = ��
4
= 2�� + 4 3 ���� 3
�= 1
4
= 2 �� + 4 3 ���� 2
�= 1
= 2 �1 + �2 + �3 + �4 + 12
5
2. �3 − 2 + �4 − 2 + �5 − 2) = (�� − 2)
�= 3
Any single value that describe the "center" of the given data. It is often known as the
average.
Numerical descriptive measures which indicate or locate the center of a distribution of a set
of data.
MEASURES OF LOCATION
The percentile is a measure which divides the distribution into one hundred equal
parts. The quartile measure divides the distribution into four equal parts while the decile
divides the distribution into ten equal parts.
Standard deviation is simply a measure of how far from the mean the data is spread.
It can be visualized like a dartboard, where the center is the MEAN and the darts are the
data in the set.
The greater the standard the deviation is, the greater the spread. The greater the spread, the
more inconsistent it gets.
The remaining Descriptive Statistics in the Excel computation – Kurtosis and Skewness,
refers to the overall shape of the distribution relative to the normal distribution (the bell
curve). Kurtosis is the degree of peakness relative to a normal distribution while Skewness
is the degree of asymmetry (departure from symmetry).
STANDARD SCORE
The standard score is the measures of standard deviations in relation to the mean. It
is computed as
�− �
Z= �
For data with a normal distribution, the standard deviation has the following
characteristics.
1. About 68% of the data are within one standard deviation of the mean.
2. About 95% of the data are within two standard deviations of the mean.
3. About 99.7% of the data are within three standard deviations of the mean.