You are on page 1of 15

PSYCHOLOGICAL STATISTICS (STATPSY) Data Set - a collection of measurements or

observations.
LESSON 1: FUNDAMENTALS OF RESEARCH AND
STATISTICS Population - a complete set of individuals, objects,
or measurements having some common
Statistics - the process of collecting data in a observable characteristic.
systematic manner, examining those data, and
making inferences from them. Sample - a subset of a population that shares the
same characteristics as the population.
COMMONLY USED TERMS IN STATISTICS AND
RESEARCH Statistic - a number resulting from the
Variable - any measurable characteristic of a manipulation of sample data according to
person, environment, or experimental treatment certain specified procedures; statistics for samples
that varies from person to person, environment to are variable rather than constant. They are
environment, or experimental situation to reported by Roman letters (e.g., x, s, r).
experimental situation. - are numbers based on direct observation
and measurement.
Constant - a number that represents a construct
that does not change. Parameter - a value summarizing a measurable
characteristic of a total population that is
Dependent Variable (DV) - an outcome of interest estimated based on the value of a statistic.
(e.g., some aspect of behavior) that is observed Population parameters are constants. Parameters
and measured by a researcher in order to assess are represented by Greek letters.
the effects of the independent variable. - are often inferred values based on
statistics.
Independent Variable (IV)
- the variable that an experimenter uses to
describe or explain the differences in the SAMPLING
dependent variable or to cause changes in the
dependent variable. Simple Random Sample - a subset of a population
- Types of Independent Variables: selected in such a way that each member of the
 Subject Variable - based on a measurable population has an equal an independent
characteristic of the subject that the opportunity to be selected.
experimenter does not directly change. A
condition of the subject that exists before Random Assignment - assigning subjects to
the research begins. different treatment conditions in such a way that
 Manipulated or Experimental Variable - the each subject has an equal and independent
type that the experimenter systematically opportunity to be placed in each treatment
controls or manipulates and to which the condition. With random assignment we ensure
subjects are assigned. that the groups are equivalent to each other
before the research begins.
Data - numbers or measurements that are
collected as a result of observations.

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
TWO BRANCHES OF STATISTICS FUNDAMENTALS OF RESEARCH

Descriptive Statistics - a set of statistical 3 Common Features of All Scientific Research


procedures used to organize, summarize, and 1. An attempt to answer empirical questions
present the data collected in a research project.  Research Design - a systematic procedure
for collecting data in order to provide
Inferential Statistics - a collection of statistical answers to specific questions.
procedures that allow one to make
generalizations about population parameters 2. The use of publicly verifiable information
based on sample statistics, to determine if there is  Operational Definition - the definition that
a systematic relation between the IV and the DV, a researcher uses to describe the
and to determine if there is a cause-and-effect processes by which an object, event, or a
relation between the IV and the DV. construct is measured.
 Direct Replication - repeating an
experimental manipulation under the
SAMPLING ERROR same conditions as the previous
experiment.
Sampling Error - is the naturally occurring  Systematic Replication - repeating a
discrepancy, or error, that exists between a previous experimental preparation but
sample statistic and the corresponding with one or more changes to the
population parameter. independent variable
- “margin of error”
3. The use of systematic empiricism
**Basic concept: Sample statistics vary from one - suggests that we attempt to make our
sample to another and typically are different from observations in a controlled manner. Specifically,
the corresponding population parameters. we attempt to account for alternative hypotheses
and determine if the behavior we are exploring is
due to the specific conditions we are examining
THE INFERENTIAL FUNCTION OF STATISTICS or due to other factors.
- the end products of most research efforts are
2 Types of Inferences masses of information. If this information takes
1. Parameter Estimation or Generalization - numerical form, we refer to it as data.
involves using samples to make estimates about - our task is to organize, summarize and interpret
parameters. Most population parameters can these data.
never be directly known except in cases in which
the population is small or there are sufficient funds
to study the entire population. STATISTICS IN THE CONTEXT OF RESEARCH

2. Induction - a conclusion based on observation Statistical Analysis - is concerned specifically with


and experience. An induction is said to be making sense out of data and permitting valid
probably true. conclusions or inferences to be drawn from these
data.

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
** The nature of the research design and the - Instead, they either receive no
quality of the data we collect impose restrictions treatment or they receive a
on the types of conclusions that can validly be neutral, placebo treatment.
drawn.
2. True Experiment
** No statistic is meaningful by itself. A single piece - a type of research where subjects are randomly
of data tells us nothing about relative frequencies assigned to a treatment condition, the researcher
and absolutely nothing about causation. manipulates the independent variable, and
measures the dependent variable.
- Essential Elements of a True Experiment
METHODS OF GATHERING INFORMATION 1. The IV must be under the control of the
experimenter.
1. Correlational Study 2. The subjects must be randomly assigned to
- a type of research that examines the relationship the treatments.
between variables. The research does not support 3. There must be controls for alternative
cause-and-effect conclusions hypotheses.
- can be presented by using:
 Case I: Correlation and Linear Regression
 Correlation – shows only the relationship ESTABLISHING CAUSE AND EFFECT RELATIONS
between the IV and DV
 Linear Regression – IV predicts the DV Method of Agreement - searching for the
 Case II: Intact Group Design presence of one element whenever another
 Intact Group Design - use of subject element is present.
variable as a grouping variable ––
significant difference among groups no Method of Difference - bbserving two situations
cause and effect. that are alike in all ways except one. If different
 DV affects the IV effects are found, they are ascribed to the
 Case III: Experimental Design variable that was not common to two situations.
 Experimental Design - use of random
assignment to groups to identify Double Blind - a control procedure wherein the
significant subject and the researcher collecting the data
- differences among groups are unaware of which experimental condition the
and to make cause and effect inferences subject is in.
- IV causes the DV
 Control Conditions in an Experiment Joint Method of Agreement and Difference - a
 Experimental condition - individuals method of investigation that combines methods
receive the experimental treatment of agreement and difference.
 Control condition
- to provide a baseline for
comparison
- Individuals in a do not receive the
experimental treatment.

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
MEASUREMENT SCALES Ordinal Scale
- consists of a set of categories that are organized
Constructs - internal attributes or characteristics in an ordered sequence.
that cannot be directly observed but are useful for - measurements on an ordinal scale rank
describing and explaining behavior. observations in terms of size or magnitude.

Operational Definition Interval Scale


- identifies a measurement procedure (a set of - consists of ordered categories that are all
operations) for measuring an external behavior intervals of exactly the same size.
and uses the resulting measurements as a - equal differences between numbers on scale
definition and a measurement of a hypothetical reflect equal differences in magnitude.
construct. - zero point on an interval scale is arbitrary
- Two Components of an OD:
•It describes a set of operations for measuring Ratio Scale - is an interval scale with the additional
a construct. feature of an absolute zero point. With a ratio
•It defines the construct in terms of the resulting scale, ratios of numbers do reflect ratios of
measurements. magnitude.

Discrete Variable - consists of separate, divisible


categories. No values can exist between two
neighboring categories.

Continuous Variable - there are an infinite number


of possible values that fall between any two
observed values.
- is divisible into an infinite
number of fractional parts.

Real Limits
- boundaries of intervals for scores that are
represented on a continuous number line.
- located exactly halfway between the scores. LESSON 2: MEASURES OF CENTRAL TENDENCY
- Each score has two real limits:
•upper real limit Central Tendency - is a statistical measure to
•lower real limit determine a single score that defines the center
of a distribution.
Nominal Scale - the goal of central tendency
- consists of a set of categories that have different is to find the single score that is most typical or
names most representative of the entire group.
- measurements on a nominal scale label and
categorize observations, but do not make any
quantitative distinctions between observations.

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
FINDING THE CENTER OF A DISTRIBUTION nag-add ka ng 1 point kada score, pag ni-
repeat mo yung step, yung mean na nakuha
a. Mean mo kanina is madadagdagan lang ng 1 point.
- also known as the arithmetic average
- is computed by adding all the scores in the 4. If every score in a distribution is multiplied by
distribution and dividing by the number of scores. (or divided by) a constant value, the mean will
- The mean for a population is identified by the change in the same way.
Greek letter mu, μ (pronounced “mew”), and the • Multiplying (or dividing) each score by a
mean for a sample is identified by M or (read “x- constant value is a common method for
bar”). changing the unit of measurement. (Para
- Formula for population mean lumiit yung value) To change a set of
measurements from minutes to seconds, for
example, you multiply by 60 (kase 1 minute is
equal to 60 seconds); to change from inches
- Formula for sample mean to feet, you divide by 12 (kase 1 foot is equal
to 12 inches)

b. Median
- Characteristics of the Mean - the median is the midpoint of the list if the scores
1. Changing the value of any score will change in a distribution are listed in order from smallest to
the mean. largest
2. Adding a new score to a distribution, or - the median is the point on the measurement
removing an existing score, will usually change scale below which 50% of the scores in the
the mean. The exception is when the new score distribution are located.
(or the removed score) is exactly equal to the - the goal of the median is to locate the midpoint
mean. of the distribution.
3. If a constant value is added or subtracted to -there are no specific symbols or notation to
every score in a distribution, the same constant identify the median. APA uses Mdn.
will be added or subtracted to the mean. - Pag odd number yung total number of scores,
Ex.: automatically yung nasa gitna yung median. Pag
even number yung scores, pag add mo yung
dalawang nasa gitna then divide mo sa 2.

c. Mode
- is the score or category that has the greatest
frequency in a distribution.
- it can be used to determine the typical or most
frequent value for any scale of measurement,
including a nominal scale
- there are no symbols or special notation used to
Meron kang 6 scores which is 4, 2, 3, 3, 2, 3, pag identify the mode or to differentiate between a
pinag-add mo sila 17 total, tas 17 divide mo sa sample mode and a population mode.
6 (kase 6 scores) yung mean mo is 2.83. Pag

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
Ex.: LESSON 3: FREQUENCIES, PERCENTILES AND DATA
EXPLORATION

Frequency Distribution
- is an organized tabulation of the number of
individuals located in each category on the scale
of measurement.
- Two elements:
1. The set of categories that make up the
Yung mode diyan Is Luigi, hindi 42. Yung 42 yung original measurement scale.
highest sa table or ilang beses (frequent) umulit. 2. A record of the frequency, or number of
Kung sino yung nakakuha ng highest na score, individuals in each category.
yun yung mode.
Shape of a Frequency Distribution
- it is possible to have more than one mode. 1. Symmetrical Distribution - in a symmetrical
Specifically, it is possible to have two or more distribution, it is possible to draw a vertical line
scores that have the same highest frequency. through the middle so that one side of the
 Bimodal - a distribution with two modes. distribution is a mirror image of the other.
 Multimodal - a distribution with more than
two modes.
 No mode - Adistribution with several equally
high points.

**WHEN TO USE THE MEDIAN AND MODE IS NOT 2. Skewed Distribution - the scores tend to pile up
INCLUDED HERE** toward one end of the scale and taper off
gradually at the other end.

PRESENTING MEANS AND MEDIANS IN GRAPHS 3. Tail - the section where the scores taper off
toward one end of a distribution.
Graphs
- can be used to report and compare measures 4. Positively Skewed - a skewed distribution with
of central tendency the tail on the right-hand side; the tail points
- it allows several means (or medians) to be shown toward the positive (above-zero) end of the X-
simultaneously so it is possible to make quick axis. (From left na mataas pababang right)
comparisons between groups or treatment
conditions. 5. Negatively Skewed - the tail points to the left of
- Kinds of Graphs: zero. (From left na mababa pataas ng right)
1. Line Graph
2. Histogram - uses numerical variables, walang
space yung bars, mode
3. Bar Graph - uses categories, may space yung
bars, mean and median

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
Percentiles 12 is equal to 32%. Yun non yung percentile rank
- the rank or percentile rank of a particular score is ng 7.0 na hinahanap mo kanina pa.
defined as the percentage of individuals in the
distribution with scores at or below the particular Eh pano pag score yung nawawala tapos
value. percentage yung given???
- When a score is identified by its percentile rank,
the score is called a percentile.

Interpolation - the process of estimating


intermediate values in a frequency distribution.
Step 1: Width of interval lang ulit nung sa taas and
Interpolation Process - a single interval is sa baba, both score and percentage pa din wag
measured on two separate scales (for example, makulit.
time and dollars). The endpoints of the interval are
known for each scale. Step 2: Kung mapapansin mo, mababa ng 20
- the interpolation process requires four steps. points yung 40% na score na hinahanap mo dahil
Ex.: Hinahanap mo yung percentage ng isang 60 (yung percent sa taas) minus 40 is 20. Edi yung
score interval mo non is 20/50 or 2/5 (lowest term bes)
(20 na difference ng 60 and 40 tapos over 50 dahil
60 minus 10)

Step 3: 2/5 multiply mo sa 5 edi 2/5 equivalent sa


0.4, i-multiply mo lang sa 5 (yung interval ng 9.5
and 4.5) edi yung sagot 2 points bes.
Step 1: Find the width of the interval. Makikita yon
Step 4: 9.5 yung score sa taas diba, edi minus mo
sa Scores (X) gamit yung score na nasa taas at
lang yung 2 na nakuha mo sa step 3. Edi yung
baba nung hinahanap na score bale 6.5 and 7.5.
sagot 7.5. X = 7.5 non (yun yung sagot fren)
Tapos sa percentage naman, same din taas and
baba so 20% and 44%.

STEM AND LEAF DISPLAYS


Step 2: Yung score na pinapahanap is 0.5 yung
interval. Bakit? Kase 7.0 yung score tapos sa taas
Stem and Leaf Display - this technique requires
niya 7.5 then sa baba 6.5. Difference is 0.5
that each score be separated into two parts: The
first digit (or digits) is called the stem, and the last
Step 3: Yung sa percentage naman, 44% minus
digit is called the leaf.
20% edi 24%, kunin mo lang ½ non or divide mo
lang sa 2 or multiply mo sa 0.5 ikaw bahala bes
For example, X = 85 would be separated into a
HAHAHA edi 12 points/percentage.
stem of 8 and a leaf of 5.
Step 4: 44% yung score sa taas diba, ngayon
subtract mo lang yung 12 points/percentage na
nakuha mo kanina sa step 3 sa 44%. Bale 44 minus

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
Yung lower observation mo is 15 then highest
naman 29.
Yung lower quartile is 19 then upper quartile
naman 25.
Yung median 21.5 haha.

LESSON 4: VARIABILITY

Variability – provides a quantitative measure of


Tignan mo yung 5, ibig sabihin non may score na the differences between scores in a distribution
56, 52, 57, 59 para lang madali tignan yun yon and describes the degree to which the scores
hahaha. are spread out or clustered together.

Two Purposes of Variability


BOX AND WHISKER PLOT 1. Variability describes the distribution
- Specifically, it tells whether the scores are
Box and Whisker Plot - is a way of summarizing a clustered close together or are spread out over a
set of data measured on an interval scale. It is large distance. Usually, variability is defined in
often used in explanatory data analysis. terms of distance.

This type of graph is used to show the shape of the 2. Variability measures how well an individual
distribution, its central value, and its variability. score (or group of scores) represents the entire
distribution.
In a box and whisker plot: - This aspect of variability is very important for
1. The ends of the box are the upper and lower inferential statistics, in which relatively small
quartiles, so the box spans the interquartile range samples are used to answer questions about
2. The median is marked by a vertical line inside populations
the box
3. The whiskers are the two lines outside the box **In short, mas maiksi yung interval ng bawat
that extend to the highest and lowest score mas maganda or greatest variability siya
observations. non.

RANGE

Range – the distance covered by the scores in a


distribution, from the smallest score to the largest
score.
Ex.:
**Basta pag range, dalawang score lang
ginagamit – yung highest and lowest score.

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
STANDARD DEVIATION MEASURING VARIANCE AND STANDARD
DEVIATION FOR A POPULATION
Standard Deviation –uses the mean of the
distribution as a reference point and measures Sum of Squares or SS – is the sum of the squared
variability by considering the distance between deviation scores.
each score and the mean.
- The most commonly used and the most Definitional Formula:
important measure of variability.

Steps in Computing SD:


**Mas madali gamitin yung computational
formula**

Ex.: Find the sum of squares:

Step 1. Determine the deviation. Yung score


minus mean (given yung mean dapat)

Step 2. Calculate the mean of the deviation.


Add mo lang mga score tas divide mo kung ilan
sila. Usually sagot ditto zero. So proceed to step Yung X yung score yung X2 yung na-square na X.
3. Pag-add mo lang sila tas substitute mo lang sa
formula then yun na yung sagot which is 22.
Step 3. I-square mo lang yung mga deviation
para positive mga sagot (tawag dito Squared Mean Square or MS – is often used to refer to
Deviation). Tas add mo ulit silang lahat. And then variance, which is the mean squared deviation.
divide mo siya sa kung ilan yung scores na meron
ka. Dun sa picture sa taas 40 yung sum tas 5 yung Population Variance – is represented by the
number of scores, divide mo lang then 8 sagot. symbol σ2 and equals the mean squared
Edi 8 yung variance mo distance from the mean. Population variance is
obtained by dividing the sum of squares by N.
Step 4: Standard deviation is equal to the square
root of variance. Edi square root mo lang yung 8. Formula:
So yung sagot bale is 2.83 charaannnnn.
Population Standard Deviation – is represented by
the symbol σ and equals the square root of the
population.

Formula: or

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
Ex.: dividing the sum of squares by n -1. (Para mas
malapit siya sa average non kaya minus 1)

Formula:

Sample Standard Deviation – is represented by


the symbol s and equal the square root of the
sample variance.

Formula:

or
Variance =

Ex.:
Standard Deviation =

MEASURING STANDARD DEVIATION AND


VARIANCE FOR A SAMPLE

**May issue yung sample variability dito. Bakit?


Kasi diba pag sinabing sample yun yung
representation ng population. Ibig sabihin non
pag gumamit ka ng sample instead of
population, mataas yung tendency na hindi
accurate yung makuha mong statistic kase nga
certain portion lang yung nasama hindi yung
buong population.

**Pag nangyari yon, may possibility na maging


bias yung sample statistic mo. Either overestimate
siya (mas mataas sa average na dapat makuha
mo) or underestimate (mas mababa sa average
na dapat makuha mo).

**So para maiwasan yung bias na yon,


corrections are made. Remember na capital N is
for population while small n is for sample.

Sample Variance – is represented by the symbol


s2 and equals the mean squared distance from
the mean. Sample variance is obtained by

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
SAMPLE VARIABILITY AND DEGREES OF FREEDOM Ex. X1= 41, X2= 43, μ= 40, σ= 10
Bale yung X1, X2 dagdagan mo ng 2, edi 43 and
For a sample of n scores, the degrees of freedom, 45 na sila non, hindi magbabago yung standard
or df, for the sample variance are defined as df= deviation (pagitan) nila which is 2 pa din
n – 1. The degrees of freedom determine the
number of scores in the sample that are 2. Multiplying each score by a constant causes
independent and free to vary. the standard deviation to be multiplied by the
same constant.
In general, as a result, the sample with a sample Ex. Pag na-multiply ng 2 yung 41 and 43,
of n scores, the first n -1 scores are free to vary, magbabago yung standard deviation nila
but the final score is restricted said to have n -1 wherein kung 2 yung dating SD magiging 4 siya
degrees of freedom. kase na-multiply sa 2.

BIASED AND UNBIASED STATISTICS Error variance – is used to indicate that the
A sample statistic is unbiased if the average sample variance represents unexplained and
value of the statistic is equal to the population uncontrolled differences between scores.
parameter. (The average value of the statistic is
obtained from all the possible samples for a *As the error variance increases, it becomes
specific sample size, n.) more difficult to see any systematic differences or
patterns that might exist in the data.
A sample statistic is biased if the average value
of the statistic either underestimates or
overestimates the corresponding population LESSON 5: z-SCORES: LOCATION OF SCORES AND
parameter. STANDARDIZED DISTRIBUTIONS

A raw score by itself does not necessarily provide


PRESENTING THE MEAN AND STANDARD much information about its position within a
DEVIATION IN A FREQUENCY DISTRIBUTION GRAPH distribution.

*Dito naman, pag nag-p’present ng graph, The process of transforming X values into z-scores
dapat maliit lang yung standard deviation and serves two useful purposes:
kitang-kita yung difference ng bawat frequency. 1. Each z-score tells the exact location of the
Yung tipong isang tingin lang madali lang original X value within the distribution.
malaman kung ano yung pinapakita nung 2. The z-scores form a standardized distribution
graph. that can be directly compared to other
distributions that also have been transformed
into z-scores.
TRANSFORMATIONS OF SCALE
**Need i-transform yung x values into z-scores
2. Adding a constant to each score does not para mas malinis tignan yung graph kase liliit
change the standard deviation. yung standard deviation non and di kalat-kalat
yung values at the same time medaling i-

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
compare sa ibang graph kase standard form na STANDARDIZED SCORES
siya.
Standardize a distribution by transforming the
A z-score specifies the precise location of each X scores into a new distribution with a
value within a distribution. predetermined mean and standard deviation
that are whole round numbers.
The sign of the z-score (+ or −) signifies whether
the score is above the mean (positive) or below The goal is to create a new (standardized)
the mean (negative). distribution that has “simple” values for the mean
and standard deviation but does not change
The numerical value of the z-score specifies the any individual’s location within the distribution.
distance from the mean by counting the number
of standard deviations between X and μ.
USING z-SCORES TO STANDARDIZE A DISTRIBUTION
Z-score formula:
*If every X value is transformed into a z-score,
then the distribution of z-scores will have the
Ex.: A distribution of scores has a mean of μ = 86 following properties:
and a standard deviation of σ = 7. What z-score
corresponds to a score of X = 95 in this 1. Shape - the distribution of z-scores will have
distribution? exactly the same shape as the original
distribution of scores. If the original distribution is
negatively skewed, for example, then the z-score
distribution will also be negatively skewed.
Transforming raw scores into z-scores does not
change anyone’s position in the distribution.
DETERMINING A RAW SCORE FROM A z-SCORE

**Pinadali mo lang tignan yung score di mo


Ex.:For a distribution with a mean of μ= 60 and σ
naman iniba yung value so dapat mag-remain
= 8, what X value corresponds to a z-score of z =
yung shape ng distribution.
−1.50?

2. The Mean - the z-score distribution will always


Formula:
have a mean of zero.
Answer: X = 48
3. The Standard Deviation - the distribution of z-
Why? Substitute the values.
scores will always have a standard deviation of 1.
Edi nakuha na yung z multiplied by σ, which is -12
* The advantage of having a standard deviation
60 plus -12 non siya edi 48. Yun bale yung raw
of 1 is that the numerical value of a z-score is
score na hinahanap mo.
exactly the same as the number of standard
deviations from the mean. For example, a z-score
of z = 1.50 is exactly 1.50 standard deviations
from the mean.

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
TRANSFORMING Z-SCORES TO A DISTRIBUTION
WITH A PREDETERMINED μ AND σ

*The procedure for standardizing a distribution to


create new values for μ and σ is a two-step Pano nagging 55 and 40? Ginamit yung formula
process: sa X value which is:
1. The original raw scores are transformed into z-
scores. (Tamang transform lang ng values para
malinis tignan eto formula) Try naten yung kay Maria. 0.50 yung z-score niya.
Sabi nung prof bet niya na μ = 50 and σ = 10
yung bagong mean and SD niya.

2. The z-scores are then transformed into new X Edi 0.50 multiply sa 10 so 5. Tapos add mo lang
values so that the specific μ and σ are attained. yung 50 so 55 yung sagot. Yun na yung X-value
(Gamitin mo yung formula na to) wag makulit. Ganon din kay Joe tamang gaya
lang ng process haha.

Although z-scores are most commonly used in


Ex. An instructor gives an exam to a psychology the context of a population, the same principles
class. For this exam, the distribution of raw scores can be used to identify individual locations within
has a mean of μ = 57 with σ = 14. The instructor a sample. The definition of a z-score is the same
would like to simplify the distribution by for a sample as for a population, provided that
transforming all scores into a new, standardized you use the sample mean and the sample
distribution with μ = 50 and σ = 10. To standard deviation to specify each z-score
demonstrate this process, we will consider what location. Thus, for a sample, each X value is
happens to two specific students: Maria, who has transformed into a z-score so that:
a raw score of X = 64 in the original distribution;
and Joe, whose original raw score is X = 43. 1. The sign of the z-score indicates whether the X
value is above (+) or below (−) the sample mean,
Transform each of the original raw scores into z- and
scores. For Maria, X = 64, so her z-score is 2. The numerical value of the z-score identifies
the distance from the sample mean by
measuring the number of sample standard
deviations between the score (X) and the
sample mean (M).
For Joe, X = 43, and his z-score is:

LESSON 6: CONVERTING z-SCORES TO PERCENTILE,


T-SCORES

Edi ayan tapos mo na Step 1. Basis mo diyan is Characteristics of the Normal Distribution Curve
raw scores and original mean and standard 1. The scale along the bottom of the distribution is
deviation which is 57 and 14. Step 2 ka na. in z-score units, that is, μ = 0, σ = 1

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
2. It has a bell-shaped distribution that is b. Percentile rank = 50 + 45.25 = 95.25%
symmetrical (skewness, s3 = 0.0) and mesokurtic
(kurtosis, s4 = 3.0). San galling yung 50? Kase diba yung z-score niya
1.67, so mataas sa mean, edi positive so nasa
3. The proportion of the curve that is under the right side siya. Diba yung gitna automatic 50 na?
different parts of the distribution have Edi add mo nalang 50 sa 45.25 bale yung sagot is
corresponding values such that 95.25%. Yun yung percentile rank niya.
•If we convert a raw score into a z-score, we
can determine its percentile rank. Case 2: Area in any tail
•We can use this to determine proportions of - Find the area given for the z-value in the z-score
the distribution that is between specific parts. table. Subtract the area found from .5000 to find
the probability

COMPARISON OF SCORES Or, see proportion in the tail of distribution (C).

**Basta lagi mong tatandaan na sa distribution Ex. 1: Corrine scores 93 on the IQ test. What is her
graph, lagging 50% yung sa gitna kase yun nga percentile rank?
yung mean so yun yung gitna okay besss. Solution:

Case 1: Area between 0 and any z-score


- The area reading for the z-value in the z-score
table is the required probability The usual, convert mo score niya into z-score tas
hanapin mo yung sagot which is 0.47 sa table
Ex.: column C edi .32 (round off to the nearest tenths)
1. Given the μ = 100 and σ = 15 on a standard IQ tas gawin mong percent edi 32%. Yun yung
test. percentile rank niya.
Frank obtained a score of 125 on the IQ test.
a. What percentage of cases fall between his Ex. 2: Tilda scores 108 on the test. What percent
score and the mean? of test takers obtained a score above hers.
b. What is his percentile rank in the general Solution:
population?

Solution:
a.
Convert ulit ng z-score edi 0.53. Tas hanapin mo
0.53 sa table column C edi 0.20 siya. Di yun yung
sagot kasi yung hinahanap is percentage of test
Transform mo lang yung score ni Frank into z-
takers na mas mataas score sa kanya so need
score. Edi 1.67 siya, tignan mo non yung statistical
mag-subtract ng 50 (dahil nasa right side siya)
table yung 1.67 column D ata to edi .4525 siya.
edi 0.30 or 30% yung final answer.
Gawin mong percentage non either multiply mo
sa 100 or move two decimal places to the right
so magiging 45.25% siya. Yun yung percentage
of cases simula mean hanggang sa score niya.

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102
Case 3: Area between two z-scores on the same Step 2: Find the z corresponding to x = 88
side of the mean
- Find the areas given for both z values in the z-
score table. Subtract the smaller area from the
larger area to get the probability.
Step 3: Add the two areas to obtain area
Ex. Gamitin natin yung sa kanina. between 88 and 120
What is the percentage of area between a score 40.82 + 28.81 = 69.63%
of 123 and a score of 135?
Step 1: Find the z corresponding to x = 123 **Bat pinag-add, kase opposite sides sila, yung
isa nasa positive side yung isa nasa negative
side. Eh area nila yung hinahanap from 88 to 120
so dun nakuha yung 69.63% na percentage of
Convert mo lang z-score tas hanapin mo yung cases.
value niya sa table column D which is 1.53 nga
edi 0.4370 siya percent ulit fren 43.70% na haha
FINDING THE NUMBER OF CASES THAT FALL WITHIN
Step 2: Find the z corresponding to x = 135 AN AREA

For the same standard IQ test with a μ = 100 and


σ = 15 involving 5,000 cases determine the
number of cases that fall within the areas
Step 3: Subtract the two areas to obtain area previously obtained.
between 123 and 135
49 – 43.70 = 5.30% 1. The number of test takers who scored higher
**Pinag-minus kase nga area between 123 and than Tilda’s score of 110.
135 lang yung hinahanap, yung nakita mo is Answer: 5,000 x .30 = 1,500 cases
percentile rank ng 123 and 135 each hindi yung **San galling yung .30??? Sa Case 2 Example 2
from 123 to 135 kaya pinag-subtract. 5.30% yung 5,000 daw yung involve na cases multiply mo
area bale hahaha. lang sa 0.30. Yung 30% kanina yun yung
equivalent percentage ng 1,500 sa 5,000.
Case 4: Area between 2 z-values on opposite Gets???
sides of the mean
- Find the areas given for both z values in the z- 2. The number of cases that fall between a score
score table. Add the two areas obtained from of 123 and a score of 135?
the table to get the probability. Answer: 5,000 x .053 = 265 cases

What percentage of cases fall between a score 3. What percentage of cases fall between a
of 120 and a score of 88? score of 120 and a score of 88?
Step 1: Find the z corresponding to x = 120 Answer: 5,000 x .6963 = 3,482 cases

**Always take note na pag percentage or


proportion tinatanong, percent form yung sagot.
Pag number of cases, whole number.

Reference: David, E. (2018), Psychological Statistics Handouts


Bryle Zyver | @zyverxpnd | P - 102

You might also like