Professional Documents
Culture Documents
Research I
Quarter 3 – Module 4:
Basic Statistics in Experimental
Research
Republic Act 8293, section 176 states that: No copyright shall subsist in any work
of the Government of the Philippines. However, prior approval of the government agency or
office wherein the work is created shall be necessary to exploit such work for a profit. Such
agency or office may, among other things, impose as a condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand names,
trademarks, etc.) included in this module are owned by their respective copyright holders.
Every effort has been exerted to locate and seek permission to use these materials from
their respective copyright owners. The publisher and authors do not represent nor claim
ownership over them.
Research I
Quarter 3 – Module 4:
Basic Statistics in Experimental
Research
Introductory Message
This Self-Learning Module (SLM) is prepared so that you, our dear learners,
can continue your studies and learn while at home. Activities, questions,
directions, exercises, and discussions are carefully stated for you to understand
each lesson.
Each SLM is composed of different parts. Each part shall guide you step-by-
step as you discover and understand the lesson prepared for you.
In addition to the material in the main text, notes to the Teacher are also
provided to our facilitators and parents for strategies and reminders on how they
can best help you with your home-based learning.
Please use this module with care. Do not put unnecessary marks on any
part of this SLM. Use a separate sheet of paper in answering the exercises and
tests. And read the instructions carefully before performing each task.
If you have any questions using this SLM or any difficulty in answering the tasks in
this module, do not hesitate to consult your teacher or facilitator.
Thank you.
What I Need to Know
This module was designed and written with you in mind. It is here to help
you master to determine the appropriate statistical tool for organizing and
describing numerical data in your experimental research. It will aid you in giving
meaning and interpretation of the data you have collected. The language used
recognizes the diverse vocabulary level of students. The lessons are arranged to
follow the standard sequence of the course. However, the order in which you read
them can be changed to correspond with the textbook you are now using.
After going through this module, you are expected to determine the appropriate
statistical tool for organizing and describing numerical data. Specifically, you will
be able to:
a. Define statistics and their types;
b. Determine the appropriate tools used in Descriptive Statistics and Inferential
Statistics; and
c. Decide whether to accept or reject the hypothesis.
1
What I Know
Directions: Read each question carefully. Choose the letter of the correct answer.
3. Which statistical tests usually have stricter requirements and can make
stronger inferences from the data?
a. Parametric
b. Non-Parametric
c. both a and b
d. none of the above
4. What is the formal technique used to test the acceptability of the null
hypothesis?
a. Parametric Test
b. Non-Parametric Test
c. Hypothesis Test
d. Statistical Test
5. What is the appropriate statistical test used in the non-parametric test if the
predictor variable and the outcome variable are quantitative or numeric?
a. ANOVA
b. Chi square Test
c. Spearman’s r
d. T-test
2
6. What Statistical test used in the parametric test where the predictor variable
is categorical and the outcome variable is quantitative or numeric and has
two groups compared?
a. ANOVA
b. Chi square Test
c. Spearman’s r
d. T-test
Lesson
Basic Statistics in
1 Experimental Research
3
What’s In
In your past lesson, you were able to formulate a hypothesis, explain the
relationship between and among variables, and differentiate the types of data.
Recall these by answering the activity below.
Directions:
A. Match column A with column B. Choose the letter of the correct answer.
Column A Column B
B. Analyze the sample research problem and answer the following questions.
Research Problem: Is there a significant effect between flower species and petal
length, petal width, and stem length?
Questions:
1. What is/are the independent variable/s? Are they Quantitative or
Qualitative?
4
What's New
Student B
Research Problem: Is there a significant effect between the types of soil used
(loam, sandy and clay) on the height of tomato plant?
Independent variables: types of soil: loam, sandy and clay
Dependent Variables: height of tomato plant
Hypothesis: There is no significant difference between the types of soil used (loam,
sandy and clay) on the height of tomato plant.
Statistical Tool to use: ANOVA
Do you think their outputs are correct? What was their basis in determining
the appropriate statistical tool to use?
What is It
5
In quantitative research a decision must be made – whether to reject or
accept the hypotheses. Prior to doing so, pertinent information must be gathered,
and a plan should be conceived on how to deal with the information gathered.
Thus, to give meaning to this information and interpret it, statistical methods
must be employed.
O O B AB B
B B O A O
A A O O A
AB O O B AB
From the given data, here is how to organize them using frequency distribution.
6
Measures of Central Tendency or Position or Average
When scores and other measures have been tabulated into a frequency
distribution, the next task is to calculate a measure of central tendency or central
position. This measure of central tendency is synonymous with the word “average”.
An average is a typical value that tends to describe the set of data.
The mean, median, and mode are the three main measures of central
tendency. Mean, or simply the average is the most frequently used and can be
described as the arithmetic average of all scores or groups of scores in a
distribution. The process can be done by adding all the scores or data then divided
by the total number of cases. Median, or the middle-most value in a list of items
arranged in increasing or decreasing order. If the case is in an odd number or
items, there will be exactly one item in the middle. In case the number or items is
an even number, the midpoint will be determined by getting the average of the two-
middle item. Finally, the mode is the score or group of scores that occur most
frequently. Some distributions don’t have mode at all. Others may have more than
one mode. In cases that the distribution has two modes, the term used is bimodal.
Below is an example of how to get the measure of the central tendency of a
distribution.
10 12 10 14 14 13
11 12 14 14 10 12
10 11 13 14 11 12
14 12 12 11 10 10
12 13 12 12 14 14
In dealing with this, arrange the given data from highest to lowest or vice versa
10 11 12 13 14
10 11 12 13 14
10 11 12 13 14
10 11 12 14
10 12 14
10 12 14
12 14
12 14
12
7
Use the formula, where: x – values of data
N-total number of observations
Median = since there are 30 cases, get the 15th and 16th data, that is 12
and
12, add them then divide by 2 = 12
Mode = 12 since this is the most frequent score
Group 1 Group 2
14 5
13 19
18 18
14 14
11 14
Group 1 Group 2
Mean 14 14
Median 14 14
Mode 14 14
As shown in the second table, the two sets of averages have no difference.
But both groups show an obvious difference. Group 2 has more widely scattered
data compared to Group 1. This characteristic called variability or dispersion is
not reflected by averages. The three basic measures of dispersion are range,
variance, and standard deviation.
8
Variance measures how far a data set is spread out. It is mathematically
defined as the average of the squared differences from the mean.
Standard Deviation is the most commonly used measure of dispersion. It
indicates how closely the values of the given data set are clustered around the
mean. It is computed by getting the positive square root of variance. The lower
value of standard deviation means that the values of the given set of data are
spread over a smaller range around the mean. On the other hand, greater value
means that the values of the given set of data are spread over a larger range around
the mean.
Statistical tests are used in hypothesis testing. They can be used to:
determine whether a predictor variable has a statistically significant relationship
with an outcome variable and estimate the difference between two or more groups.
Before deciding what statistical tool will be used in one’s study, a knowledge
of the types of variables is essential because it will help you determine what type of
statistical tool is appropriate.
Choose the test that fits the types of predictor or independent variables and
outcome/dependent variables you have collected.
Statistical tests are used to derive a generalization about the population from
the sample. A statistical test is a formal technique that relies on the probability
distribution for concluding the reasonableness of the hypothesis. These
hypothetical testing related to differences are classified as parametric and non-
parametric tests. The parametric test is one that has information about the
population parameter. On the other hand, the non-parametric test is where the
researcher has no idea regarding the population parameter.
Parametric Tests
The most common types of the parametric test include regression tests,
comparison tests, and correlation tests. Below is a flowchart that will help us
determine the appropriate statistical tool for parametric tests.
9
Example, The Effect of the Amount of Chlorine in the Color of Algae. Identify
first your independent and dependent variables, how many are they, and their type,
whether qualitative/ categorical or quantitative/numeric. After identifying such,
look at the diagram above to know the parametric test's right statistical tool. In the
given problem, the amount of chlorine is the independent variable, it’s numeric or
qualitative, and 2 or more amounts of chlorine may be used in the experiment. The
10
dependent variable is the color of algae; its categorical and color may vary. So,
looking at the above diagram, logistic regression is the appropriate tool.
Non-Parametric Test
Non-parametric tests don’t make as many assumptions about the data and
are useful when one or more common statistical assumptions are violated.
However, the inferences they make aren’t as strong as with parametric tests. The
table below shows how to determine the appropriate non-parametric tool to be
used.
11
What's More
Activity 1
Direction: Read the situation below. Then, construct a Frequency Distribution.
Assessment 1
Direction: Using the constructed Frequency Distribution above, determine the
Mean, Median, Mode and Range.
Activity 2
Directions: Read the following statement. Then, write True if the statement is
correct and False, if not.
1. In a non-parametric test, Spearman’s r is the appropriate statistical tool to
use if the predictor and outcome variables are both quantitative.
2. In a parametric test, Chi-square is used if both predictor and outcome
variable is categorical.
3. Wilcoxon Rank-Sum test is used if the predictor variable is categorical with
three or more groups and has two or more outcome variables.
4. Multiple Regression is the most appropriate statistical tool provided that the
predictor is numerical, and there is more than one quantitative outcome
variable.
5. The most appropriate statistical tool to be used in a parametric test if the
independent variable is categorical and its dependent variable requires
comparing the mean test of 2 groups is the T-test.
Assessment 2
Directions: Read the situation below and answer the questions that follow.
Situation:
John is working on his investigatory project. He wants to investigate the
growth of eggplants in the school garden. He observes that these plants differ in
height and leaf color even though they receive the same amount of sunlight, water,
and fertilizer. He also observes that these eggplants are planted in different sizes of
pots.
12
Questions:
1. What is the research problem of John?
2. What is/are the independent variable/s? How many are there? Is it qualitative
or quantitative?
3. What is/are the dependent variable/s? How many are there? Is it qualitative
or quantitative?
4. What is the most appropriate statistical tool to be used in a parametric test?
Activity 3
Directions: Identify the correct statistical tool for the following sample research
problems. Please refer to the flowchart and table on the What Is It part
of page 7.
1. What is the difference in average pain levels among post-surgical patients
given three different painkillers?
2. What is the effect of drug dosage on the survival of a test subject?
3. What is the effect of flower species on petal length, petal width, and stem
length?
4. What is the effect of two different test prep programs on the average exam
scores for students from the same class?
5. What is the difference in average exam scores for students from two different
schools?
Assessment 3
Directions: Choose one of the sample research problems on Activity 3. Then,
provide the needed information.
13
What I Have Learned
Directions: Complete the Concept Map below by filling in the box with correct
word/s.
Statistics
Types
Descriptive 1.
Statistical Tools
Measure of
2. 5. Nonparametric
Dispersion
14
What I Can Do
Directions: Write one sample Research Problem. Then, supply the needed
information. Assume that the computed p-value is equal to 0.015.
Sample Research Problem:
Independent Variable(s):
Is it Qualitative or Quantitative?
Dependent Variable(s):
Is it Qualitative or Quantitative?
Assessment
Part A
Direction: Identify the following by choosing the correct answer from the box.
15
Part B
Direction: Choose the letter of the correct answer.
1. John wants to find out if the face mask made from banana leaf fiber is more
acceptable than a mask made from cloth. He conducts a survey among the
students and teachers from Magalang National High School. And he found out
that majority of the students and teachers preferred face masks made from
banana leaf fiber to the ones made from cloth. Based on the given scenario,
what type of statistical tool is applied?
a. Measure of Dispersion c. Measure of Variability
b. Frequency Distribution d. Measure of Central Tendency
2. Suppose Student A wants to know if there is a significant difference between the
three types of soil used in their garden and the growth of the tomato plant. What
should be the computed p-value to determine if his/her hypothesis is rejected?
a. 0.04 c. 0.06
b. 0.05 d. 0.07
For items 3, 4, and 5, consider the set of data below:
Grade per subject of Student A during the first grading period
90 98 96 92 90 94 93 95
3. What is the mean score?
a. 92.50 c. 94.50
b. 93.50 d. 95.50
4. What is the median score?
a. 92.50 c. 94.50
b. 93.50 d. 95.50
5. What is the range of the given set of data?
a. 5 c. 7
b. 6 d. 8
Additional Activities
Directions: Given the following p-value, identify whether to accept or reject the null
hypothesis.
1. 0.06
2. 0.02
3. 0.07
4. 0.01
5. 0.04
16
17
What I Know What’s More
1. a Activity 1
2. c Content of the 10 bottles (in ml) of advertised
cooking oil manufacturer.
3. a
CategoryTally Frequency
4. d
990 II2
5. c 986 I1
6. d 985 I1
7. d 980 III3
8. a 978 II3
9. a 970 I1
10. c
Assessment 1
Mean: 981.7
Median: 980.0
Mode: 980.0
What’s In
A. Range: 20
c
Activity 2
a True
e True
d False
b True
B. True
1. flower species, qualitative
2. petal length, petal width, stem length, Assessment 2
quantitative What is the effect of post size to the height and
3. possible answers: types of soil, sunlight leaf color of eggplants?
exposure, amount of water given, pot size and Pot size, one, quantitative
type Height and leaf color,2, qualitative
4. There is no significant effect between the
Logistic regression
flower species and the petal length, petal width
and stem length
5. There is a significant effect between the Activity 3
flower species and the petal length, petal width ANOVA
and stem length Logistic Regression
MANOVA
Paired T-Test
Independent T-Test
Assessment 3
Answer may vary
Answer Key
18
What I Have Learned Assessment
Part A.
1. Inferential 1. Mean
2. Measures of Central Tendency 2. Standard Deviation
3. Median 3. Statistics
4. Frequency Distribution
4. Frequency Dsistribution
5. Chi-square Test
5. Parametric
6. Chi-square test Part B
7. ANOVA 1. b
8. Multiple Regression 2. a
9. Spearman’s r 3. b
10. Chi-square test of Independence 4. b
5. d
Additional Activities
1.Accept
2. Reject
3. Accept
4. Reject
5. Reject
Answer Key
References
Alferes, Merle & Duro, Ma. Cecilia. 2010. Statistics and Probability. Cainta,
Rizal:MSA Publishing House
Bevans, Rebecca. 2020. “Choosing the Right Statistical Test: Types and
Examples.” Scribbr. https://www.scribbr.com/statistics/statistical-
tests/.
Calaguas, Glenn. 2015. Conducting Research in Education and The Social
Sciences. Plaridel, Bulacan: St.Andrew Publishing House
Crossman, Ashley. 2019. “How to Measure Central Tendency Using Mean,
Median, or Mode.” ThoughtCo. https://bit.ly/3i0k82v
Grobman, Kevin. “Re: Good Morning! I would like to ask what statistical tool
have you been using in science investigatory projects for high school
students” Retrieved from: https://bit.ly/3sdp09e
19
For inquiries or feedback, please write or call: