Professional Documents
Culture Documents
Statand Prob Q4 M8
Statand Prob Q4 M8
Department of Education
Regional Office IX, Zamboanga Peninsula
1
eal of artnership
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this module are owned by their respective copyright holders.
Every effort has been exerted to locate and seek permission to use these materials from
their respective copyright owners. The publisher and authors do not represent nor claim
ownership over them.
1
Introductory Message
This Self – learning Module (SLM) is prepared so that you, our dear learners, can continue
your studies and learn while at home. Activities, questions, directions, exercises, and
discussions are carefully stated for you to understand each lesson.
Each SLM is composed of different parts. Each part shall guide you step-by-step as you
discover and understand the lesson prepared for you.
Pre-tests are provided to measure your prior knowledge on lessons in each SLM. This will
tell you if you can proceed on completing this module or if you need to ask your facilitator or
your teacher’s assistance for better understanding of the lesson. At the end of each module,
you need to answer the post-test to self-check your learning. Answer keys are provided for
each activity and test. We trust that you will be honest in using these.
In addition to the material in the main text, notes to the Teacher are also provided to our
facilitators and parents for strategies and reminders on how they can best help you on your
home-based learning.
Please use this module with care. Do not put unnecessary marks on any part of this SLM.
Use a separate sheet of paper in answering the exercises and tests. Read the instructions
carefully before performing each task.
If you have any questions in using this SLM or any difficulty in answering the tasks in this
module, do not hesitate to consult your teacher or facilitator.
Thank you.
LEARNING COMPETENCY:
• Calculate the Pearson’s sample correlation coefficient. M11/12SP-IVh-2
• Solve problems involving correlation analysis. M11/12SP-IVh-3
LEARNING OBJECTIVES:
At the end of the lesson, the students are expected to:
a. calculate the Pearson Product-Moment Correlation coefficient.
b. interpret the computed correlation coefficient in terms of strength and direction; and
c. apply and solve real-life problems involving correlation analysis.
2
What I Know
Directions: Read the statement carefully and choose the best answer.
What’s In
Directions: Identify the direction and the strength of the following correlation given. Choose
your answer from the box.
a. Strong positive correlation b. Moderate positive correlation
c. No correlation d. Moderate negative correlation
e. Strong negative correlation f. Perfect correlation
3
1. 2. 3. 4. 5.
What’s New
In the previous lesson, we have learned about bivariate data. We also learned how to
draw the scatterplot of the pair of variables and interpret it quantitatively in terms of its
direction and strength of association using the trend of points. Sometimes, a scatterplot does
not evidently show that a correlation exists between the two variables. This is in the case of
very weak correlation where it would be very difficult to identify the trend line.
Thus, we need to come up with more accurate interpretation of the scatterplot using
quantitative methods. Here, we will be computing some values that will indicate that a
correlation between the two variables exists and where we can describe its strength using
arbitrary scale which we will make. So, brace yourself for the next lessons you will learn.
TASK: Research on the life of Karl Pearson and his important contributions in the
field of statistics. Do not forget to copy and study the formula he proposed for
computing the coefficient of correlation( r).
What is It
LESSON 1
Correlation coefficient, computed from the sample data measures the strength and
direction of a linear relationship between two variables. The strength of correlation is
indicated by the coefficient of correlation. There are several coefficients of correlation. One
that is most commonly used in linear correlation is Pearson Product-Moment coefficient of
correlation, symbolized by r, named in honor of the statistician who did a lot of research on
this area, Karl Pearson.
The symbol for the sample Correlation Coefficient is “r”. To compute r, we use the
formula,
𝒏∑𝑿𝒀 − ∑𝑿 • ∑𝒀
𝒓=
√[𝒏∑𝑿𝟐 − (∑𝑿)𝟐 ] [𝒏∑𝒀𝟐 − (∑𝒀)𝟐 ]
4
where, r is called the Pearson correlation coefficient. This indicates the degree of
relationship between the two values,
X is the values in the first set of data,
Y is the values in the second set of data, and
n is the total number of values/data pairs.
The Pearson correlation coefficient, r, can take a range of values from +1 to -1.
▪ A value greater than 0 indicates a positive correlation; that is, as the value of one
variable increases, so does the value of the other variable.
▪ A value less than 0 indicates a negative association; that is, as the value of one
variable increases, the value of the other variable decreases.
▪ A value of 0 indicates that there is no correlation between the two variables.
The direction of the points scattered tells the direction of correlation that exists between
the variables.
The stronger the association of the two variables, the closer the Pearson correlation
coefficient, r, will be to either +1 or -1 depending on whether the relationship is positive or
negative, respectively. See table below (Table of range of values).
5
Different relationships and their correlation coefficients are shown in the diagram
below:
Achieving a value of +1 or -1 means that all your data points are included on the line
of best fit – there are no data points that show any variation away from this line. Values
for r between +1 and -1 (for example, r = 0.7 or -0.3) indicate that there is variation around
the line of best fit. The closer the value of r to 0 the greater the variation around the line of
best fit.
It indicates the closeness of the point to the trend line. The closer the points are to
the trend line, the stronger the relationship is.
The following data show the scores of five students in Statistics and Physics.
Determine if there is a relationship between the scores in Physics and Statistics. Interpret
the results.
STUDENT SCORE IN STATISTICS X SCORE IN PHYSICS Y
Alfonso 3 5
Frances 9 8
Rafael 10 10
James 12 9
Loida 7 8
STEPS SOLUTION
6
2. Complete the table.
Square all entries in the X column. Student X Y X2 Y2 XY
Put them under X2 column. Alfonso 3 5 9 25 15
Frances 9 8 81 64 72
Square all entries in the Y column. Rafael 10 10 100 100 100
Put them under Y2 column. James 12 9 144 81 108
Loida 7 8 49 64 56
Multiply entries in the X and Y columns. Put
them under the XY column.
𝑟 = 0.90
LESSON 2
7
Meaning
✓ A correlation coefficient of 1 means that for every positive increase in one variable, there
is a positive increase of a fixed proportion in the other. For example, shoe sizes go up in
(almost) perfect correlation with foot length.
✓ A correlation coefficient of -1 means that for every positive increase in one variable,
there is a negative decrease of a fixed proportion in the other. For example, the amount
of gas in a tank decreases in (almost) perfect correlation with speed.
✓ Zero means that for every increase, there isn’t a positive or negative increase. The two
just aren’t related.
The absolute value of the correlation coefficient gives us the strength of the
relationship. The larger the number, the stronger the relationship. For example, |-.75| = .75,
which has a stronger relationship than .65
Let’s find the value of the correlation coefficient from the table below.
STEP 1: Make a chart. Use the given data, and add three more columns: xy, x2, and y2.
STEP 2: Multiply x and y together to fill the xy column. For example, row 1 would be
43 × 99 = 4,257.
8
STEP 3: Take the square of the numbers in the x column, and put the result in the
x2 column.
STEP 4: Take the square of the numbers in the y column, and put the results in the
y2 column.
Subject Age x Glucose level y xy x2 y2
1 43 99 4257 1849 9801
2 21 65 1365 441 4225
3 25 79 1975 625 6241
4 42 75 3150 1764 5625
5 57 87 4959 3249 7569
6 59 81 4779 3481 6561
STEP 5: Add up all of the numbers in the columns and put the result at the bottom of the
column. The Greek letter sigma (Σ) is a short way of saying “sum of.”
𝒏∑𝑿𝒀 − ∑𝑿 • ∑𝒀
𝒓=
√[𝒏∑𝑿𝟐 − (∑𝑿)𝟐 ] [𝒏∑𝒀𝟐 − (∑𝒀)𝟐 ]
9
The range of the correlation coefficient is from -1 to 1. Our result is 0.5298, which
means the relationship between variables is moderate positive correlation.
✓ Assumptions
For the Pearson r correlation, both variables should be normally distributed (normally
distributed variables have a bell-shaped curve). Other assumptions include linearity and
homoscedasticity. Linearity assumes a straight line relationship between each of the two
variables and homoscedasticity assumes that data are equally distributed about the
regression line.
Solve the value of the correlation coefficient for the data obtained in the study of
age and blood pressure given.
SOLUTION:
STEP 1. Make a table.
Step 2. Find the values of xy, x2, y2 and place these values in the corresponding column of
the table.
𝒏∑𝑿𝒀 − ∑𝑿 • ∑𝒀
𝒓=
√[𝒏∑𝑿𝟐 − (∑𝑿)𝟐 ] [𝒏∑𝒀𝟐 − (∑𝒀)𝟐 ]
The correlation coefficient suggests a strong positive relationship between age and blood
pressure.
𝟔(𝟒𝟕,𝟔𝟑𝟒) – (𝟑𝟒𝟓 × 𝟖𝟏𝟗)
r=
√[𝟔(𝟐𝟎,𝟑𝟗𝟗) – (𝟑𝟒𝟓)𝟐 ] [𝟔(𝟏𝟏𝟐,𝟒𝟒𝟑) – (𝟖𝟏𝟗)𝟐 ]
= 𝟎. 𝟖𝟗𝟕
10
What’s More
I. Directions: Calculate r and make a generalization regarding the information that you get
from the computed correlation coefficient for each of the following:
a. ∑X = 225 b. ∑X = 32 c. ∑X = 180
∑Y = 22 ∑Y = 1105 ∑Y = 147
∑X = 9653
2
∑X = 220
2
∑X2 = 6914
∑Y = 143
2
∑Y = 364525
2
∑Y2 = 5273
∑XY = 651 ∑XY = 3402 ∑XY = 4013
n=6 n=6 n=7
QUESTION: Do the data support the hypothesis that height is hereditary? Explain.
Accompany your explanation with statistical computations.
11
What I Can Do
Directions: Briefly answer the Self – Assessment Questions (SAQ) below.
1. Why do we study Pearson’s Correlation Coefficient?
2. How can we determine the strength of association based on the Pearson correlation
coefficient?
3. How do we make our interpretation of the strength of relationship more objective?
4. Cite a real-life application where we used Pearson’s Correlation Coefficient.
Assessment
I. Directions: Read the statement carefully and choose the best answer.
12
II. Directions. Solve the Problem.
Month Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar
X (no. of the theft cases)
Y (no. of vandalism cases)
Additional Activities
Directions: Write TRUE if the statement is correct and FALSE if it is wrong.
1. Relationship between two variables can also be described in terms of its strength.
2. Direction of the line tells the direction of correlation that exist between the variables.
3. Perfect correlation happens when other variables are controlled like we do in our
experiments.
4. Direction of the correlation indicates the closeness of the points to the trend line.
5. The farther the points are to the trend line, the stronger the relationship is.
13
14
https://www.statisticshowto.com/probability-and-statistics/correlation-coefficient-formula/#top
Statistics and Probability by Rene R. Belencia
Statistics and Probability tg
References:
Additional Activities:
1. True 2. True 3. True 4. False 5. False
Assessment:
I. 1. c 2. a 3. c 4. b 5. D
II. 𝑟 = 0.574 ≈ 0.57 Therefore, there is a moderately strong positive correlation
between the number of the theft cases and the number of vandalism cases incurred
in the school.
What’s More:
I. a. 𝑟 = - 0.632 ≈ - 0.63 It indicates moderately negative correlation.
b. 𝑟 = - 0.884 ≈ - 0.88 It indicates strong negative correlation.
c. 𝑟 = 0.104 ≈ 0.10 It indicates very low or no correlation.
II. 𝑟 = 0.904 ≈ 0.90 Yes, the correlation coefficient suggests a strong positive
relationship between the heights of a father and his eldest son.
What’s New:
Karl Pearson was an English mathematician and biostatistician. He was being credited with
establishing the discipline of mathematical statistics. Pearson’s thinking underpins many of
the ‘classical’ statistical methods which are in common use today. One of his contribution is
Correlation coefficient also known as the Pearson Product-Moment Coefficient.
𝑛∑𝑋𝑌 − ∑𝑋 • ∑𝑌
𝑟=
√[𝑛∑𝑋 2 − (∑𝑋)2 ] [𝑛∑𝑌 2 − (∑𝑌)2 ]
What’s In:
1. c 2. b 3. e 4. d 5. f
What I Know:
1. a 2. c 3. b 4. b 5. b
Answer Key
I AM A FILIPINO
by Carlos P. Romulo
I am a Filipino – inheritor of a glorious past, hostage to the It is the mark of my manhood, the symbol of my dignity as
uncertain future. As such, I must prove equal to a two-fold a human being. Like the seeds that were once buried in the
task – the task of meeting my responsibility to the past, and tomb of Tutankhamen many thousands of years ago, it
the task of performing my obligation to the future. shall grow and flower and bear fruit again. It is the insigne
I am sprung from a hardy race – child many generations of my race, and my generation is but a stage in the
removed of ancient Malayan pioneers. Across the centuries, unending search of my people for freedom and happiness.
the memory comes rushing back to me: of brown-skinned I am a Filipino, child of the marriage of the East and the
men putting out to sea in ships that were as frail as their West. The East, with its languor and mysticism, its
hearts were stout. Over the sea I see them come, borne upon passivity and endurance, was my mother, and my sire was
the billowing wave and the whistling wind, carried upon the the West that came thundering across the seas with the
mighty swell of hope – hope in the free abundance of the Cross and Sword and the Machine. I am of the East, an
new land that was to be their home and their children’s eager participant in its struggles for liberation from the
forever. imperialist yoke. But I know also that the East must awake
This is the land they sought and found. Every inch of shore from its centuried sleep, shake off the lethargy that has
that their eyes first set upon, every hill and mountain that bound its limbs, and start moving where destiny awaits.
beckoned to them with a green and purple invitation, every For I, too, am of the West, and the vigorous peoples of the
mile of rolling plain that their view encompassed, every West have destroyed forever the peace and quiet that once
river and lake that promised a plentiful living and the were ours. I can no longer live, a being apart from those
fruitfulness of commerce, is a hollowed spot to me. whose world now trembles to the roar of bomb and cannon
By the strength of their hearts and hands, by every right of shot. For no man and no nation is an island, but a part of
law, human and divine, this land and all the appurtenances the main, and there is no longer any East and West – only
thereof – the black and fertile soil, the seas and lakes and individuals and nations making those momentous choices
rivers teeming with fish, the forests with their inexhaustible that are the hinges upon which history revolves. At the
wealth in wild and timber, the mountains with their bowels vanguard of progress in this part of the world I stand – a
swollen with minerals – the whole of this rich and happy forlorn figure in the eyes of some, but not one defeated
land has been for centuries without number, the land of my and lost. For through the thick, interlacing branches of
fathers. This land I received in trust from them, and in trust habit and custom above me I have seen the light of the
will pass it to my children, and so on until the world is no sun, and I know that it is good. I have seen the light of
more. justice and equality and freedom, my heart has been lifted
I am a Filipino. In my blood runs the immortal seed of by the vision of democracy, and I shall not rest until my
heroes – seed that flowered down the centuries in deeds of land and my people shall have been blessed by these,
courage and defiance. In my veins yet pulses the same hot beyond the power of any man or nation to subvert or
blood that sent Lapulapu to battle against the alien foe, that destroy.
drove Diego Silang and Dagohoy into rebellion against the I am a Filipino, and this is my inheritance. What pledge
foreign oppressor. shall I give that I may prove worthy of my inheritance? I
That seed is immortal. It is the self-same seed that flowered shall give the pledge that has come ringing down the
in the heart of Jose Rizal that morning in Bagumbayan corridors of the centuries, and it shall be compounded of
when a volley of shots put an end to all that was mortal of the joyous cries of my Malayan forebears when first they
him and made his spirit deathless forever; the same that saw the contours of this land loom before their eyes, of the
flowered in the hearts of Bonifacio in Balintawak, of battle cries that have resounded in every field of combat
Gregorio del Pilar at Tirad Pass, of Antonio Luna at from Mactan to Tirad Pass, of the voices of my people
Calumpit, that bloomed in flowers of frustration in the sad when they sing:
heart of Emilio Aguinaldo at Palanan, and yet burst forth “I am a Filipino born to freedom, and I shall not rest until
royally again in the proud heart of Manuel L. Quezon when freedom shall have been added unto my inheritance—for
he stood at last on the threshold of ancient Malacanang myself and my children and my children’s children—
Palace, in the symbolic act of possession and racial forever.”
vindication. The seed I bear within me is an immortal seed.
15