Professional Documents
Culture Documents
Statistics Refresher
Student Name
Section
Schedule
Program
College
Professor JOAN B. MARASIGAN
Module Duration
SESSION NO. / WEEK NO. 4
TOPICS:
1. Scales of measurement
2. Descriptive Statistics
3. The normal curve
4. Standard scores
Overview
This module is designed to guide you in recollecting and reorganizing your previous
learnings from subjects that encompass Statistical concepts. Knowledge in at least some basic
statistical concepts is necessary to facilitate your understanding of the science of measuring
psychological constructs.
Here, the scales of measurements (nominal, ordinal, interval, ratio) will be reintroduced
with emphasis on measuring psychology-related data/variables. You will also be reoriented about
describing psychology-related data as well as measures of central tendency. The concepts of
normal curve and standard scores will be introduced to equip you with the skills in giving meaning
to test scores and in interpreting them appropriately.
This module dwells more on the Descriptive Statistical concepts in psychological
measurement. It is expected that the knowledge from this module will help you further understand
how we quantify human behavior.
Study Guide
• You can complete this module at your own phase. It means you can take every part to
study depending on your capability to do so.
• This module starts with a brief overview about what the topic is all about. It will also tell
you the learning outcomes we would want you to accomplish after completing this module.
The presentation of topic has complete resources you can check for further readings and
clarifications.
• Requirements involve learning activities and assessment which are available on the last
part of the module. Take time to study each lesson carefully to apply these new leanings
appropriately.
Learning Outcomes
Fig 2: It is assigned that the distance between each Likert response anchor is
one unit of measurement. So, we can arbitrarily assign “Strongly Agree” with
a value of 4, “Agree” with 3, “Disagree” with 2, and “Strongly Disagree” with 1.
Variables. Before we can begin to describe data, we need to decide what sort of data we have.
This seems like a very obvious thing to say, but it is easy to make mistakes. Different sorts of data
need to be summarized in different ways. When we measure something, we are assigning
numbers to individuals (where an individual is usually, but not always, a person). A measurement
is usually called a variable. A variable is anything that can vary (or change) between individuals
(Miles & Banyard, 2007).
Scale (Filipino “Panukat”) is a set of numbers (or other symbols) whose properties model
empirical properties of the objects to which the numbers are assigned. A Scale can be classified
as Nominal, Ordinal, Interval, and Ratio.
TYPES OF
TYPES OF DATA/SCALE EXAMPLE
VARIABLE
Sex: Male, Female
NOMINAL- where there are
Test Result: Passed, Failed
Categorical three or more possible
Jungian Personality Type:
measures- categories, but there is no
ESTJ, ISTJ, ENTJ, and INTJ
Categorical natural order to the categories.
Student: Regular student,
measures are Words are used instead of
working student, student
qualitative or numbers.
athlete, irregular student.
classification
ORDINAL- when the categories Birth Order,
variables.
have an order. However, the Place in a race,
distance between each ranks Class ranking
and orders are not always equal. Grade Level
Also numbers are used to Percentile Scores
designate an orderly series. Likert Scale (though this is
controversial).
Interval Scale- have the same
interval between each score.
Also known as equal-unit scales.
In these scales, the difference
between any two consecutive Fahrenheit and Celsius
numbers reflects an equal temperature scales; calendar
empirical or demonstrable
Continuous
difference between the
Variables- give you
objects or events that the
a score for each
numbers represent
individual person
Ratio scales- numbers achieve
and measures may
the property of additivity, which
(theoretically) take
means they can be added—as
any value.
well as subtracted, multiplied,
and divided—and the result Measures of length; periods of
expressed as a ratio, all with time
meaningful results. Ratio scales
have a true or absolute zero point
that stands for “none” of
whatever is being measured.
These scales also have properties such as magnitude, equal interval, and absolute
zero. Each of these four scale has distinct properties:
TYPES OF STATISTICS
Descriptive statistics
Analysis of data that helps describe, show or summarize data in a meaningful way such
that patterns might emerge from the data. Descriptive statistics do not, however, allow us to make
conclusions beyond the data we have analyzed or reach conclusions regarding any hypotheses
we might have made. They are simply a way to describe our data
(https://statistics.laerd.com/statistical-guides/descriptive-inferential-statistics.php). Typically, the
data can be described based on:
a. Frequency Distribution
b. Measures of central tendency: mean, median, and mode
c. Measures of spread: includes range, quartiles, absolute deviation, variance and
standard deviation.
Inferential statistics
Methods used to make inferences from observations of a small group of people, called a
sample. These inferences are then used to estimate the characteristics of a larger group of
individuals, known as a population (Kaplan & Saccuzzo, 2017). The methods involved are:
a. the estimation of parameter(s)
b. testing of statistical hypotheses.
Table 5
Comparison between Descriptive and Inferential Statistics
Descriptive Inferential
Present data Assess relationships among variables
Draw conclusions and generalized findings
Organize data
about the population based on the sample.
Summarize data Hypothesis testing
Small data set Large data set
Simple Complex
Results obtained represent a portion of the
Represents the entire data
population, but can be used to deduce
set
information about the entire population
Less Error involved is usually more
Frequency Distributions
A single test score means more if one relates it to other test scores. A distribution of
scores summarizes the scores for a group of individuals. In testing, there are many ways to record
a distribution of scores. The frequency distribution displays scores on a variable or a measure
to reflect how frequently each value was obtained (Kaplan & Saccuzzo, 2017).
Above is a sample frequency distribution in tabular (figure 1.1) and graphical formats (figure
1.2).
Percentile Rank
Percentile ranks replace simple ranks when we want to adjust for the number of scores in
a group. A percentile rank answers the question, “What percent of the scores fall below a
particular score (Xi)?”
To calculate a percentile rank (1) determine how # COUNTRY CASES
many cases fall below the score of interest, (2) determine 1 United States 7,382,944
how many cases are in the group (3) divide the number of 2 India 6,549,373
cases below the score of interest (Step 1) by the total 3 Brazil 4,906,833
number of cases in the group (Step 2), and (4) multiply the 4 Russia 1,215,001
result of Step 3 by 100. The formula is: 5 Colombia 848,147
6 Peru 824,985
𝐵 7 Argentina 790,805
𝑃𝑟 = 𝑥 100 = 𝑝𝑒𝑟𝑐𝑒𝑛𝑡𝑖𝑙𝑒 𝑟𝑎𝑛𝑘 𝑜𝑓 𝑋𝑖 8 Spain 789,932
𝑁
9 Mexico 757,953
10 South Africa 679,716
Where: 11 France 606,625
Pr= Percentile Rank 12 United Kingdom 480,017
Xi= the score of interest 13 Chile 470,179
B= the number of scores below Xi 14 Iran 468,119
N= the total number of score 15 Iraq 375,931
16 Bangladesh 367,565
The table above shows the recent data of COVID 17 Saudi Arabia 335,997
18 Italy 325,329
cases worldwide (as of October 5, 2020). Here N=220
19 Turkey 324,443
countries were ranked based on the number of cases and 20 Philippines 322,497
recovery. If you are interested in the relative percentile *N= 220 countries, countries no. 21-
ranking of the Philippines in terms of cases in this global 220 are not shown here
index, you may apply the formula above:
Where:
𝐵 Pr= Percentile Rank
𝑃𝑟 = 𝑥 100 = 𝑝𝑒𝑟𝑐𝑒𝑛𝑡𝑖𝑙𝑒 𝑟𝑎𝑛𝑘 𝑜𝑓 𝑃ℎ.
𝑁 Xi= Philippines
200 B= 200
𝑃𝑟 = 𝑥 100 = 90.91% N= 220
220
Based on the computation above, when we rank the Covid-19 cases worldwide,
Philippines has a percentile rank of 90.91%. It means that we have more COVID cases than
90.91% of the countries in the entire world! Now it’s your turn, give the percentile ranks of the
following countries based on the data above:
Ranks Below
Country N Pr
them (B)
1. USA
2. France
3. Russia
Frequency distribution can also be described by looking at its appearance and symmetry. Below
is an example of frequency distribution of test scores:
Figure 3.3
Symmetrical distribution approximate a normal curve. Majority of the scores are
gathered in the center and spread out evenly from left to right
Skewness
Distributions can be characterized by their skewness (Filipino: baling, pagkakiwal), or
the nature and extent to which symmetry is absent. Skewness is an indication of how the
measurements in a distribution are distributed (Cohen & Swerdlik, 2018).
The asymmetry of a frequency distribution can give us some hints about the characteristic
of the variable measured. When a data is positively skewed (see figure 2.2), the scores are
gathered in the left side of the X-axis; while in negatively skewed distribution (see figure 3.2), the
scores accumulate in the right side of the X-axis. To avoid being confused, look at the thinner tail
of the distribution. In the context of testing, we can say that a test is difficult if the distribution is
positively skewed, and easy if negatively skewed. The relative placement of the mean, median,
and mode also vary depending on the shape of the distribution.
Figure 5
Kurtosis
Although your distribution is approximately normal, you may find that there are a small
number of data points that lie outside the distribution. For example, you have this one classmate
who scores almost perfect in a test when almost all of you barely got a score of 60/100. Or a
student who scored very low in a test when majority of you almost got a perfect mark. They are
called “outliers”.
Where:
Σx= add up all of the values in x.
N= total number of cases
The median is the middle score in a set of scores. The median is used when the mean is
not valid, which might be because the data are not symmetrically or normally distributed, or
because the data are measured at an ordinal level.
The mode is rarely reported in research. It is the most frequent score in the distribution
or the most common observation among a group of scores. The mode is the best measure of
central tendency for categorical data (although it’s not even very useful for that).
Standard Scores
Is a raw score that has been converted from one scale to another scale, where the latter scale has some
arbitrarily set mean and standard deviation? Why convert raw scores to standard scores? Raw scores may be
converted to standard scores because standard scores are more easily interpretable than raw scores. With a
standard score, the position of a testtaker’s performance relative to other testtakers is readily apparent.
z Scores
A z score results from the conversion of a raw score into a number indicating how many
standard deviation units the raw score is below or above the mean of the distribution. Let’s use
an example from the normally distributed “National Spelling Test” demonstrate how a raw score
is converted to a z score. We’ll convert a raw score of 65 to a z-score by using the formula:
But, please take note of the following constants! These will be consistent for any
group regardless of the raw score mean and SD:
As you can see from the following formulae, each of the new standard score
systems is based on the Z-score:
If we take some simple Z-scores (such as -2, -1, 0, +1, and +2) we can use the
formulae to calculate other standard score equivalents as follows:
Although some decimals are shown in the tables above, T-scores, stens and
stanines are all usually rounded to the nearest whole number. A sten calculated to be
7.78 would therefore be rounded up to 8.
For test developers’ intent on creating tests that yield normally distributed measurements, it is generally
preferable to fine-tune the test according to difficulty or other relevant variables so that the resulting distribution
will approximate the normal curve. That usually is a better bet than attempting to normalize skewed distributions.
This is so because there are technical cautions to be observed before attempting normalization. For example,
transformations should be made only when there is good reason to believe that the test sample was large enough
and representative enough and that the failure to obtain normally distributed scores was due to the measuring
instrument.
References
Chapman, K. (2009). FACTSHEET 21: T-Scores, Stens and Stanines. Knight Chapman
Psychological Ltd.
Cohen, R. & Swerdlik, M. (2018). Psychological Testing and Assessment An Introduction to
Tests and Measurement, 9th ed. McGraw-Hill Education.
Kaplan, R. & Sacuzzo, D. (2018). Psychological Testing: Principles, Applications, and
Issues, Ninth Edition. Cengage Learning
Miles, J. & Banyard, P. (2007). Understanding and Using Statistics in Psychology. Sage
Publications Ltd.
Urbina, S. (2014). Essesntials of Psychological Testing. Wiley.
Website:
(https://statistics.laerd.com/statistical-guides/descriptive-inferential-statistics.php).