You are on page 1of 9

Lesson Objectives

 Define, construct, and interpret visual descriptions of data: frequency


distribution, histogram, and frequency polygon.

 Define, calculate, and interpret descriptive statistics concepts: mean, median,


mode, range, and standard deviation.

Introduction

By analyzing students’ performance, you will obtain information to improve both


tests and instruction, to determine grades or the passing score on the test, to
determine whether the test was too easy or too hard, and to evaluate the
effectiveness of instruction. If many scores are low, such scores may reflect
students’ effort and learning abilities. However, they may also indicate that the
instruction was ineffective or that the test was too difficult.

Now that you have access to multiple forms of data from standard and teacher-
made tests, you organize and summarize them so that they provide useful
information. Data can be organized through tables such as in a frequency
distribution, and data can be presented in a visual format through the use of graphs
and charts such as a histogram, frequency polygon or a scatter-plot.

Statistical methods are used to summarize and describe data. One way to
summarize a set of scores is to look at the measures of central tendency (mean,
median, and mode). Another method to describe how much scores are different
from one another is by examining measures of variability (range and standard
deviation).

Frequency Distribution

In statistics, the term “frequency” refers to the number of times each score or event
occurs. We are usually interested in frequency as it relates to the number of
students obtaining each score on a test. One way to record frequencies is in a
frequency distribution table, where each score is listed in a column on the left side
and the frequency with which it occurred is listed on the right. While a frequency
table helps to organize data, it does not provide a great deal of descriptive
information about the scores. Frequency distributions provide the initial organization
and information that is the starting point for many other statistical methods.

The teacher wants to examine the general performance trends among his eighth
grade language arts students in order to evaluate student learning and his own
instruction. She gives a mid-term language arts exam and obtains the following
scores for his students:

95 91 100 96 92 91 87 84 70 65 96 65 56 86 43 65 22 40 93

To evaluate those scores, she creates a frequency distribution table to organize the
test data. The obtained scores are listed in order from highest to lowest.

Frequency Distribution

Scores Tallies Frequenc


y

100 | 1

96 || 2
95 | 1

93 | 1

92 | 1

91 || 2

87 | 1

86 | 1

84 | 1

70 | 1

65 ||| 3

56 | 1

43 | 1

40 | 1

22 | 1

Graphic Displays of Distributions

The data from a frequency table can be displayed graphically. A graph can provide a
visual display of the distributions, which gives us another view of the summarized
data. The graphic representation of the relationship between two different test
scores through the use of scatter plots. We learned that we could describe in
general terms the direction and strength of the relationship between scores by
visually examining the scores as they were arranged in a graph. Some other types
of graphs include histograms and frequency polygons.

A histogram is a bar graph of scores from a frequency table. The horizontal x-axis
represents the scores on the test, and the vertical y-axis represents the frequencies.
The frequencies are plotted as bars.

Histogram of Mid-Term Language Arts Exam


A frequency polygon is a line graph representation of a set of scores from a
frequency table. The horizontal x-axis is represented by the scores on the scale and
the vertical y-axis is represented by the frequencies.

Frequency Polygon of Mid-Term Language Arts Exam


A frequency polygon could also be used to compare two or more sets of data by
representing each set of scores as a line graph with a different color or pattern. For
example, you might be interested in looking at your students’ scores by gender, or
comparing students’ performance on two tests (see Figure 9.4).

Frequency Polygon of Midterm by Gender

Measures of Central Tendency

One way to summarize your data is to look at the measures of central tendency:
mean, median, and mode.

Mean (or arithmetic average)


The mean is the average performance level of a group of students. It is obtained by
taking the sum of a set of scores and dividing by the total number of scores. The
mean can be distorted if there are some scores that are extremely different
(outliers) from the mean of the majority of scores for the group. Consequently, the
median is the most descriptive measure of central tendency.

Median
100
The median is the point in the distribution that splits the scores in two
equal groups, which is also known as the midpoint of a distribution, or the
50th percentile. To calculate the median, organize the raw scores in rank
96
order. The median is the middle value on the scale that divides the number
of scores into equal halves, if the number of scores is odd. When the
number of scores is even, the median is calculated as the average of the
96 two middle scores.

Mode
95
The mode is the most frequently occurring score in a distribution. There
are no mathematical calculations needed for the mode. Once the data are
93 organized in a frequency distribution format, the mode can be identified. In
some cases there may be more than one mode in a distribution if two or
more scores share the highest frequency. A set of scores with two modes is
called bimodal; those with more than two are called multimodal. The
92 following is a step-by-step demonstration of calculating the measures of
central tendency. Mr. Walker creates a table to display the range of scores
obtained by his students:
91
Calculating Mean, Median, and Mode

91

87 Calculating the Mean: the teacher adds up all of the


scores (total 1,437 points) and divides the result by
the total number of scores (19): 1,437 ÷19 = 75.6
86
Finding the Median: Note that with an odd number
(19) of scores, the median is the number that divides
the scores into two equal halves, nine scores above
84
and nine scores below. In this distribution of
scores, 86 is the median.

70 Finding the Mode: In this distribution of


scores, 65occurs most frequently.

65

65

65

56

43
Representation of the Mean, Median, and Mode on a Curve

When representing a standardized test score, they will normally be


40
distributed and have a symmetrical distribution. Notice that in this
distribution the mean, median, and mode are equal. The mean is also the

22
number that divides the scores into two equal groups and is the score that occurs
most frequently.

The shape of a distribution of your test scores can provide useful clues about your
test and your students’ performance. When representing students’ scores on a
graph, the scores often will be positively or negatively skewed. When the
distribution is positively skewed, that implies that the most frequent scores (the
mode) and the median are below the mean. If your test is very difficult, there may
be many low scores and few high ones. The distribution of scores would have a
shape similar to the one depicted below that is positively skewed.

When the tail points to the left, the distribution is negatively skewed. In this
distribution there are high scores and relatively few low scores. Notice that the
mean is influenced by the skewing.

The mean can be distorted if there are some scores that are extremely different
(outliers) from the mean of the majority of scores for the group. Consequently, the
median is the most descriptive measure of central tendency.

Indicators of Variability

Variability is the dispersion of the scores within a distribution. Given a test, a group
of students with a similar level of performance on a specific skill tend to have scores
close to the mean. Another group with varying levels of performance will have
scores widely spread and further from the mean. In other words, how varied are the
scores? Two common measures of variability are the range and standard deviation.

Range

The range, R, is the difference between the lowest and the highest scores in a
distribution. The range is easy to compute and interpret, but it only indicates the
difference between the two extreme scores in a set.

If we use the scores from Mr. Walker’s class (above), we would calculate the range
as: Range (R) = the highest score – the lowest score in the distribution.

95 91 100 96 92 91 87 84 70 65 96 65 56 86 43 65 22 40 93

R = 100 - 22 = 78, so the range is 78.

Standard Deviation

A more useful statistic than simply knowing the range of scores would be to see how
widely dispersed different scores is from the mean. The most common measure of
variability is the standard deviation (SD). The standard deviation is defined as the
numeric index that describes how far away from the mean the scores in the
distribution are located. The formula for the standard deviation is:

Where X = the test score, M = mean, and N = number of scores.

The higher the standard deviation, the wider the distribution of the scores is around
the mean. This indicates a more heterogeneous or dissimilar spread of raw scores
on a scale. A lower value of the standard deviation indicates a narrower distribution
(more similar or homogeneous) of the raw scores around the mean.

Table 5.1.6

Scores Step 1 Step 2


Deviatio
n Scores
Square of
Deviation
Scores

100 24.4 595.36

96 20.4 416.16

96 20.4 416.16

95 19.4 376.36

93 17.4 302.76

92 16.4 268.96

91 15.4 237.16

91 15.4 237.16

87 11.4 129.96

86 10.4 108.16

84 8.4 70.56

70 -5.6 31.36

65 -10.6 112.36

65 -10.6 112.36

65 -10.6 112.36

56 -19.6 384.16

43 -32.6 1062.76

40 -35.6 1267.36
22 -53.6 2872.96

Step 3: Sum of Dev. Scores =


9130.28

Calculating the Standard Deviation:

X = squared raw scores

M = mean = 75.6

N = number of scores = 19

To calculate the Standard Deviation of this


distribution of scores:

1. Subtract the mean (X-75.6) from each score in the


distribution. This will give you roughly half positive and
half negative scores (called deviation scores).

2. Square each of the deviation scores (X - 75.6)², which


will make them all positive.

3. Add the squared deviation scores together (9,130.28).

4. Divide the resulting sum by the number of scores in


the distribution (9,130.28÷19 = 480.54).

5. Take the square root of the results of your division from


Step 4. The square root that you arrive at is the
standard deviation.

√480.54 = 21.92 = SD for this distribution of scores.

You might also like