Professional Documents
Culture Documents
Practical Research 2
Quarter 2 – Module 10:
Statistical Techniques for
Bivariate Analysis
Practical Research 2– Grade 12
Alternative Delivery Mode
Quarter 2 – Module 10: Statistical Techniques for Bivariate Analysis
First Edition, 2020
Republic Act 8293, section 176 states that: No copyright shall subsist in any work of
the Government of the Philippines. However, prior approval of the government agency or office
wherein the work is created shall be necessary for exploitation of such work for profit. Such
agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this book are owned by their respective copyright holders. Every
effort has been exerted to locate and seek permission to use these materials from their
respective copyright owners. The publisher and authors do not represent nor claim ownership
over them.
Practical Research 2
Quarter 2 – Module 10:
Statistical Techniques for
Bivariate Analysis
Introductory Message
This learning resource hopes to engage the learners into guided and
independent learning activities at their own pace and time. Furthermore, this also
aims to help learners acquire the needed 21st century skills while taking into
consideration their needs and circumstances.
In addition to the material in the main text, you will also see this box in the
body of the module:
As a facilitator you are expected to orient the learners on how to use this
module. You also need to keep track of the learners' progress while allowing them to
manage their own learning. Furthermore, you are expected to encourage and assist
the learners as they do the tasks included in the module.
ii
For the learner:
The hand is one of the most symbolized part of the human body. It is often
used to depict skill, action and purpose. Through our hands we may learn, create
and accomplish. Hence, the hand in this learning resource signifies that you as a
learner is capable and empowered to successfully achieve the relevant competencies
and skills at your own pace and time. Your academic success lies in your own hands!
This module was designed to provide you with fun and meaningful
opportunities for guided and independent learning at your own pace and time. You
will be enabled to process the contents of the learning resource while being an active
learner.
What I Need to Know This will give you an idea of the skills or
competencies you are expected to learn in the
module.
iii
Assessment This is a task which aims to evaluate your
level of mastery in achieving the learning
competency.
1. Use the module with care. Do not put unnecessary mark/s on any part of the
module. Use a separate sheet of paper in answering the exercises.
2. Don’t forget to answer What I Know before moving on to the other activities
included in the module.
3. Read the instruction carefully before doing each task.
4. Observe honesty and integrity in doing the tasks and checking your answers.
5. Finish the task at hand before proceeding to the next.
6. Return this module to your teacher/facilitator once you are through with it.
If you encounter any difficulty in answering the tasks in this module, do not
hesitate to consult your teacher or facilitator. Always bear in mind that you are not
alone.
We hope that through this material, you will experience meaningful learning
and gain deep understanding of the relevant competencies. You can do it!
iv
What I Need to Know
This module was designed and written with you in mind. It is here to help you
master Practical Research 2. The scope of this module permits it to be used in many
different learning situations. The language used recognizes the diverse vocabulary
level of students. The lessons are arranged to follow the standard sequence of the
course. But the order in which you read them can be changed to correspond with
the textbook you are now using.
After going through this module, you are expected to use statistical techniques
to analyze data – study of differences and relationships limited for bivariate analysis
(CS_RS12-IId-g-3).
1
What I Know
Read each item carefully and write the letter of your answers on a separate
answer sheet.
2
8. It is the most appropriate statistical tool to be used when we want to compare
Grade 11 and Grade 12 on their time spent on social media per day.
a. Chi-square c. Analysis of variance
b. T-test d. Multiple regression
9. This statistical tool can be used when you want to compare males and females
in terms of their highest educational attainment (bachelor’s degree, master’s
degree, doctorate degree).
a. Chi-square c. Analysis of variance
b. T-test d. Multiple regression
10. This statistical tool can be used when you want to determine whether a
student’s IQ and hours spent in studying can predict their quarterly exam
score in General Physics 1.
a. Chi-square c. Analysis of variance
b. T-test d. Multiple regression
3
Lesson
Statistical Techniques for
10 Bivariate Analysis
What’s In
4
5. What is the effect of nitrogen
fertilizer on the nitrate
content of lettuce?
_______ 2. Weight
_______ 5. Gender
_______ 7. Age
_______ 9. Academic distinction (with highest honors, with high honors, with honors)
What’s New
A group of Grade 12 Arts & Design students wants to conduct a study on the
effect of classical music on the quarterly exam scores of Grade 11 STEM students in
Pre-Calculus. They divided their sample into two groups: one group listened to
classical music while answering the quarterly exam, while the other group answered
the quarterly exam without listening to any music. Upon collecting data, they
compared the quarterly exam scores of the said groups. What is the most appropriate
statistical tool to be used for the type of data they collected? How should they analyze
and interpret their data?
5
What is It
Note: A normal distribution implies that half of the data you collected is greater than
the mean, while the other half is less than the mean. It can be represented by a graph
that is in the shape of a bell curve, as the data is equally distributed or appears to
be symmetrical on both sides of the mean. If your data does not satisfy this, then it
is a nonnormal distribution.
There are various statistical programs that we can use for data analysis such
as the Statistical Packages for Social Sciences (SPSS), SYSTAT and Minitab 16
(Creswell, 2012). However, it requires payment before we can utilize all its functions.
What most of us are not aware of is Excel, a software program that can be found in
our computers, is also capable of performing data analysis.
6
If you were not able to find it, here is a step-by-step procedure on how you
can install the Excel Analysis Toolpak add-in:
Let us try to make use of Excel in the following statistical techniques used for
bivariate analysis:
✓ Pearson product moment correlation
It measures the strength and direction of the linear relationship of two
quantitative variables. This statistical tool can be used when you want to
determine whether time spent studying is related to quarterly exam scores in
General Mathematics.
The value of the correlation coefficient (also called Pearson’s R) ranges from −1
to 1:
▪ A value of 1 indicates that there is a strong positive linear relationship
between two variables. Both variables go in the same direction. When one
variable increases, the other variable also increases; when one variable
decreases, the other variable also increases.
▪ A value of −1 indicates that there is a strong negative linear relationship
between two variables. Two variables go in the opposite direction. When one
variable increases, the other variable decreases.
▪ A value of 0 indicates that there is no significant relationship between two
variables.
7
Also, the strength of the linear relationship between two variables can be
determined by the absolute value of the correlation coefficient. The closer the
value to 1, the stronger the linear relationship. For example, we want to compare
which of the two values of correlation coefficient has a stronger linear
relationship: −0.88 and 0.26. Getting its absolute value, |−0.88| = 0.88 and
|−0.26| = 0.26. Thus, 0.88 has a stronger linear relationship than 0.26.
3. Another popup will appear. For Input Range, select all the cells containing
your data. Select Columns for Grouped By. Tick the box for Labels in First
Row and select New Worksheet Ply under the Output Options.
8
4. A new worksheet will appear containing the value of the correlation
coefficient.
✓ Multiple regression
It measures the strength of the relationship between one dependent variable
(criterion) and multiple independent variables (predictor). It also gives us the
following information (Cheusheva, 2020; Stat Trek, n.d.):
a. regression equation that models the relationship between the dependent
variable and independent variables
b. how the actual data fits the regression equation formed (coefficient of
multiple determination)
c. when an independent variable acts as a statistically significant predictor of
the dependent variable (p-value)
This statistical tool can be used when you want to determine whether a
student’s IQ and hours spent in studying predict their quarterly exam score in
General Physics 1.
9
Here is how we can make use of this function in Excel:
1. Upon encoding your data in Excel, click Data Analysis in the Data tab.
2. Select Regression in the popup that appeared and click OK.
3. Another popup will appear. For Input Y Range under Input, select all the
cells containing the values of your dependent variable.
4. Same with Input X Range under Input, select all the cells containing the
values of your independent variables.
10
5. Tick the box for Labels and select New Worksheet Ply under the Output
Options.
The regression coefficients will allow us to form the regression equation that
models the relationship between the dependent variable and independent
variables. Since we have two independent variables, we can use the formula:
𝑦 = 𝑏0 + 𝑏1 𝑥1 + 𝑏2 𝑥2
Substituting the values we obtained, we can already form the regression
equation:
𝑦 = 23.1561 + (0.5094)𝑥1 + (0.4671)𝑥2
We can use this equation in predicting a student’s quarterly exam score in
General Physics I, given their IQ and hours spent in studying.
11
The coefficient of multiple determination is represented by R Square =
0.0953. Converting into percentage, it implies that 90.53% of the quarterly
exam scores variation in General Physics I can be explained by student’s IQ
and hours spent in studying.
c. p-value
✓ T-test
It measures the significant difference between the means of two independent
groups – each group consists of a unique set of samples (Frost, n.d.). This
statistical tool can be used when you want to compare Grade 11 and Grade 12
on their time spent on social media per day. Before performing the test, we
formulate two hypotheses:
▪ Null Hypothesis (Ho) - The two population means are equal.
▪ Alternative Hypothesis (Ha) - The two population means are not equal.
We will look at the obtained p-value for the interpretation of the results. If the
p-value is less than the significance level (ex. 0.05), we reject the null
hypothesis. It implies that there is a significant difference between the two
population means.
In Excel, there are two kinds of the two-sample t-test: equal and unequal
variances. If the number of samples of both groups in your study are nearly
equal, then it is safe to assume equal variances; otherwise, other tests for equal
variances should be conducted first.
12
Here is how we can make use of this function in Excel:
1. Upon encoding your data in Excel, click Data Analysis in the Data tab.
2. Select t-Test: Two-Sample Assuming Equal Variances in the popup that
appeared and click OK.
3. Another popup will appear. For Variable 1 Range under Input, select all the
cells containing your data for one group.
4. Same with Variable 2 Range under Input, select all the cells containing your
data for the other group.
13
5. Set the Hypothesized Mean Difference to zero. It is the null hypothesis value,
in which we assumed that there is no difference between the means of two
groups. Tick the box for Labels and set the Alpha (significance level) to 0.05.
Select New Worksheet Ply under the Output Options.
Interpretation: Since p-value = 0.0013 < 0.05, we reject the null hypothesis.
It indicates that the two population means are not equal. Thus, there is
enough evidence to say that there is a significant difference between the
means of two groups. Grade 12 students spend more time in social media
(8.33) than Grade 11 students (7.07).
14
✓ Analysis of variance (ANOVA)
It is used in comparing the means of three or more independent groups. This
statistical tool can be used when you want to compare four grade levels in
Junior High School (Grade 7, Grade 8, Grade 9, and Grade 10) on their time
spent in the school library per week. Before performing the test, we formulate
two hypotheses:
▪ Null Hypothesis (Ho) - The four population means are equal.
▪ Alternative Hypothesis (Ha) - The four population means are not equal.
We will look at the obtained p-value for the interpretation of the results. If the
p-value is less than the significance level (ex. 0.05), we reject the null
hypothesis. It implies that there is a significant difference between the four
population means.
3. Another popup will appear. For Input Range, select all the cells containing
your data. Select Columns for Grouped By. Tick the box for Labels in First
Row and set the Alpha (significance level) to 0.05. Select New Worksheet Ply
under the Output Options.
15
4. A new worksheet will appear containing the p-value.
Interpretation: Since p-value = 0.0528 > 0.05, we accept the null hypothesis.
It indicates that the four population means are equal. Thus, there is enough
evidence to say that there is no significant difference between the means of
four groups.
✓ Chi-square
It tests whether a significant relationship exists between two categorical
variables. This statistical tool can be used when you want to compare males
and females in terms of their highest educational attainment (bachelor’s degree,
master’s degree, doctorate degree). Before performing the test, we formulate two
hypotheses:
▪ Null Hypothesis (Ho) - The highest educational attainment is not dependent
on gender.
▪ Alternative Hypothesis (Ha) - The highest educational attainment is
dependent on gender.
2. Calculate for the expected frequencies (EF) of your data. 𝑁 is the total
number of sample.
(𝑟𝑜𝑤 𝑡𝑜𝑡𝑎𝑙)(𝑐𝑜𝑙𝑢𝑚𝑛 𝑡𝑜𝑡𝑎𝑙)
𝐸𝐹 =
𝑁
16
Example for cell B4 (Junior High School teachers who have bachelor’s degree
as their highest educational attainment):
(60)(51)
𝐸𝐹 = = 30.6
100
3. Calculate for the p-value. Select a blank cell then type the formula:
=CHITEST(actual_range,expected_range). For the actual_range, select the
cells of observed frequencies; while for the expected_range, select the cells
of expected frequencies.
17
4. Press Enter to get the p-value.
Interpretation: Since p-value = 0.9673 > 0.05, we accept the null hypothesis.
There is enough evidence to say that the highest educational attainment is
not dependent on gender.
18
What’s More
19
5. Assume equal variances, get the p-value. ________________________________
6. Should they accept or reject the null hypothesis? ________________________
7. How should they interpret the p-value?
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
On a separate answer sheet, please check your level of understanding for each
statistical tool discussed.
Pearson product
moment of
correlation
Multiple
regression
T-test
Analysis of
variance (ANOVA)
Chi-square
20
What I Can Do
Identify which among the five statistical techniques discussed can be used for
your research. Explain your answer in more than 3 sentences.
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
__________________________________________________________________________________
Assessment
Analyze the situations given and use Excel in performing statistical analysis.
Write your answers on a separate answer sheet.
21
1. What statistical tool best applies to the data?
a. T-test
b. Pearson product of moment correlation
c. Chi-square
d. Multiple regression
22
4. What is the expected frequency for those who are at home and have
regular exercise?
a. 68.98 c. 13.89
b. 54.32 d. 73.87
Additional Activities
23
24
Assessment What’s More
I. 1. B 1. Continuous
2. B 2. T-test is the most appropriate statistical tool to use as it
3. C measures the significant difference between the means
of two independent groups.
II. 1. C 3. The two population means are equal.
2. B 4. The two population means are not equal.
3. A 5. 0.3975
4. C 6. Accept the null hypothesis.
5. C 7. It indicates that the two population means are equal.
6. D Thus, there is enough evidence to say that there is no
7. C significant difference between the means of two groups.
What’s In What I Know
A. 1. IV: temperature 1. A
DV: diffusion 2. D
2. IV: foreign direct investment 3. A
DV: levels of economic growth 4. B
3. IV: daily Facebook use 5. C
DV: attention span 6. C
4. IV: management style 7. D
DV: employee performance 8. B
5. IV: amount of nitrogen fertilizer 9. A
DV: nitrogen content 10. D
B. 1. CA 4. CO 7. CO 10. CO
2. CO 5. CA 8. CO
3. CA 6. CA 9. CA
Answer Key
References
Baraceros, E. L. (2016). Practical Research 2. Sampaloc, Manila: Rex Bookstore Inc.
Cheusheva, S. (2020, October 9). Linear regression analysis in Excel. Retrieved from
https://www.ablebits.com/office-addins-blog/2018/08/01/linear-
regression-analysis-excel/
Creswell, J. W. (2012). Educational Research: Planning, conducting and evaluating
quantitative and qualitative research (4th ed.). Boston, Massachusetts:
Pearson Education Inc.
Frost, J. (n.d.). How to do One-Way ANOVA in Excel. Retrieved from
https://statisticsbyjim.com/anova/one-way-anova-excel/
Frost, J. (n.d.). How to do t-Tests in Excel. Retrieved from
https://statisticsbyjim.com/hypothesis-testing/t-tests-excel/
Glen, S. (n.d.). Correlation Coefficient: Simple Definition, Formula, Easy Steps.
Retrieved from https://www.statisticshowto.com/probability-and-
statistics/correlation-coefficient-formula/
Schmuller, J. (n.d.). Statistical Analysis with Excel For Dummies Cheat Sheet.
Retrieved from https://www.dummies.com/software/microsoft-
office/excel/statistical-analysis-with-excel-for-dummies-cheat-sheet/
Stat Trek. (n.d.). Regression Analysis With Excel. Retrieved from
https://stattrek.com/multiple-regression/excel.aspx
25
For inquiries or feedback, please write or call: