You are on page 1of 11

NEMESIO I.

YABUT SENIOR HIGH SCHOOL

STATISTICS AND PROBABILITY

L# 16 Testing Hypothesis

 Hypothesis testing is a decision making process for evaluating claims about a population parameter.
Null Hypothesis ( HO), states that there is no difference between a parameter and a specific value
Alternative Hypothesis (Ha), states a specific difference between a parameter and a specific value
Possible sets of statistical hypothesis:
1. Two tailed test 2. One Tailed Test (left) 3. One Tailed Test (right)
Ho : parameter = specific value Ho : parameter = specific value Ho : parameter = specific value
Ha : parameter ≠ specific value Ha : parameter < specific value Ha : parameter >specific value

 Level of significance is the maximum probability of committing type I error (α)

 Critical value is the value that separates the rejection area to acceptance area
 Steps in Hypothesis Testing
1. State the hypotheses and identify the claim
Make sure to use the proper symbols
Include the proper units when given
2. Determine the level of significance
3. Find the critical value(s).
Include the diagram that displays all the pertinent information
4. Decision Rule: Reject or not to reject the null hypothesis
5. Compute the test value
Include the proper formula
Round off the final answer properly
Locate and place the test value on the diagram
6. Decision making. Summarize the result

L#17 Testing Hypothesis: Population Mean µ vs. Sample Mean x̄

 Z – Test is appropriate when sample size n ≥ 30 and population variance σ2 is known. If population variance σ2 is

x̄−μ x̄−μ
z c= z c=
σ s
unknown, the sample variance s2 can be substituted to σ2. √n √n
x̄−μ
t c=
s
 T – test is used when the sample size n < 30 and the population variance σ2 is unknown. √n
Problem 1.
A study claims that all SH students spend an average of 3.6 hrs. on Facebook weekly. A researcher wanted to
check if this claim is true. A random sample of 50 students taken by this researcher showed that these students spend

3.8 hrs on Facebook with a standard deviation of 0.8 hr. Using the 1% significance level, can you conclude that the claim
that all SH students spend an average of 3.6 hrs on Facebook?

Step 1 Ho: µ = 3.6 hrs Step 4 Decision Rule: Reject HO if zc > 2.58 or zc < - 2.58
Ha: µ ≠ 3.6 hrs Step 5 Compute the test statistics
1
x̄−μ 3 .8−3 . 6
z c= = =1. 768
σ .8
Step 2 Level of Significance: α= 0.01 √n √50
Step 3 Critical value: ± 2.58 Step 6 Decision Making: Accept Ho. There is enough evidence to
support the claim that all SH students spend an average of 3.6
hrs on Facebook.

Problem 2.
In order to increase customer service, a vulcanizing shop claims its mechanics can change a flat tire in 12 mins. A
time management specialist selected six repair jobs and found their mean time to be 11.6 minutes. The standard
deviation of the sample was 2.1 mins. At α = 0.025, is there enough evidence to conclude that the mean time in changing
a tire is less than 12 mins.

Step 1 Ho: µ = 12 Step 4 Decision Rule: Reject HO if tc < - 2.571


Ha: µ < 12 Step 5 Compute the test statistics
x̄−μ 11. 6−12
t c= = =−0 . 47
σ 2 .1
Step 2 Level of Significance: α= 0.025 √n √6
Step 3 Critical value: tc = -2.571 (d.f = 5) Step 6 Decision Making: Accept Ho. There is enough evidence to
support the claim that mechanics can change a flat tire in 12
mins.

Population Mean µ vs. Sample Mean

1. A survey found that women over the age of 18 consume an average of 2000 calories a day. In order to
see if the number of calories consumed by women over age 18 living in NIYSHS is the same, the
researcher sampled 45 women over the age of 18 and found the mean number of calories consumed
was 1950. The sample standard deviation of the sample was 30 calories. At α = 0.10, can it be
concluded that there is no difference between the number of calories consumed by the women over the
age of 18?
2. A study claims that high school students spend 10 hours in a week in studying. To prove this claim, a
researcher took a sample of 50 students and found out that the sample spend an average of 9.5
hours/week in studying with a standard deviation of 1.2 hours. Can it be concluded that there is no
significant difference between the number of hours spend in studying using 5% significance level?
3. A certain school claims that the average grade of their students in Mathematics is 88.5. A researcher
took a sample of 50 students and found out that their average grade in Mathematics is 84.8 with a
standard deviation of 0.45. Using 2.5 % significance level, would the researcher conclude that the
average grade of students in Mathematics is less than 88.5?
4. A certain School Division claim that the average age of students who graduated from high school is
15.5. You would like to test this claim and took a sample of 100 students. You found out that the
average age of graduating students is 16.8 with a standard deviation of 1.3 years. Can you conclude
that the average age of the students is not significantly higher than 15.5 years at α=0.10.
5. The DOH reported that the mean total cholesterol level in 2002 for all adults was 203. Total cholesterol
levels in participants who attended the seventh examination of the Study are summarized as follows:
n=310, xx = 200.3 and s = 36.8. Is there statistical evidence that mean cholesterol levels in adult is lower
than 203 at α=0.05?
6. A machine in Starbucks is design to fill jars with 16 ounces of coffee. A consumer suspects that the
machine is not filling the jars completely. A sample of 8 jars has a mean of 15.6 ounces and a standard
deviation of 0.3 ounces. Is there enough evidence to support the consumer’s claim at α = 0.10?
7. A recent survey stated that households received an average of 37 telephone calls per month. To test
the claim, a researcher surveyed 25 households and found out that the average number of calls was
34.9. The standard deviation of the sample was 6. At α = 0.05, can the claim be substantiated?
8. In the population of NIYSHS who drink coffee, the average daily consumption is 3 cups per day. A
teacher wants to know if the students tend to drink more coffee than the school average. They ask 20
students how many cups of coffee they drink each day and found out an average of 2.6 cups with a
standard deviation of 0.4. Is there enough evidence that the students drink less than the school
average at α =0.01?

9. In the population, IQ scores are normally distributed with a mean of 100. A certain school wants to
know if their students have an IQ that is higher than the population mean. They take a random sample
of 15 students and find that they have a mean IQ of 109 with a standard deviation of 23. Is there
enough evidence to show that the average IQ is higher than 100 at α =0.05?
10. According to a study, the average salary for a new college graduate in 2013 was P15,327. A small
academic program wants to know if the average salary of their graduates is different from P15,327. In a
2
sample of 10 recent graduates, the average starting salary was P16,210 with a standard deviation
of P700. Assume the annual starting income for graduates of this program is approximately normally
distributed. Is there enough evidence to claim that the average salary is greater than P15,327 at α
=0.05?

L#18 Testing Hypothesis: T – test for Paired Comparisons

 Paired T – Test is used when:


a. A group is experimented on to see the effectiveness of an experiment or a treatment
b. To see a difference occurs in some observable characteristics between equally matched subjects

 This test is also known as:



t= ⋅√n
Dependent t Test Sd
Paired t Test
Repeated Measures t Test where : d̄=
∑d
 The variable used in this test is known as: n


Dependent variable, or test variable (continuous), 2
n ∑ d2 −( ∑ d )
measured at two different times or for two related conditions or units S d=
n ( n−1 )
Problem 1.
A group of Makati Doctors and Nurses conducted HIV Awareness Program in NIYSHS as mandated by DepED. To
test the efficiency of their program, a 30 item test was given to 10 selected students before and after the implementation
of the program. Was the program effective in increasing the level of awareness of the students on HIV using α = 0.01?
Student Pre Post d d2 Step 1
test Test HO: The mean score of the students in the posttest is not significantly
1 10 12 2 4 different from their mean score in the pretest.
2 12 11 -1 1 Ha: The mean score of the students in the posttest is significantly higher
3 13 13 0 0 from their mean score in the pretest.
4 10 15 5 25 Step 2 Level of Significance: α= 0.01
5 9 14 5 25 Step 3 Critical value: 2.821
6 13 16 3 9 Step 4Decision Rule: Reject HO if tc > 2.821
7 14 15 1 1 Step 5 Compute the test statistics
8 15 18 3 9
d̄=
∑ d 21
= =2 .1
9 16 15 -1 1 n 10
10 11 15 4 16


2
∑ d =21 ∑ d 2 =91
S d=


n d2 − ( d )
n ( n−1 )
t= ⋅√ n=
2 .1
∑ =

10 ( 91 )−( 21 )2
10 ( 9 )
⋅√ 10=2 . 91
=2 . 28

Step 6 Decision Making: Reject Ho. The mean score in the posttest is
Sd 2 .28
significantly higher from their mean score in the pretest.

Problem 2.
An algebra teacher wants to compare two methods of teaching College Algebra. One is the lecture method and
the other is the personalized system instruction (PSI) Students are paired by matching those with similar mathematics
background and performance. A random sample of 11 pairs is selected. From each pair, one student is randomly chosen
to take the lecture course; the other takes the PSI course. Both courses are taught by the Algebra Teacher. The final
grades for the 11 pairs of students are found to be the following: Use α = 0.1
Lecture 76 93 76 78 85 92 88 85 87 93 88
PSI 78 94 78 84 84 91 89 88 87 89 89
d
d2

1. Researchers want to test a new anti-hunger weight loss pill. They have 10 people rate their hunger both before
and after taking the pill. Does the pill do anything? Use alpha = 0.05

Before 10 8 7 6 7 8 9 10 8 7

after 8 7 6 6 6 6 7 6 6 5
3
2. A psychologist is interested in his new learning program. 15 Subjects learn a list of 50 words. Learning
performance is measured using a recall test. After the first test all subjects are instructed how to use the learning
program and then learn a second list of 50 words. Learning performance is again measured with the recall test. In
the following table the number of correct remembered words are listed for both tests. Use alpha = 0.10

Student 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15

Score 1 38 46 42 36 38 39 45 41 42 32 29 28 34 27 28

Score 2 42 48 46 41 43 40 46 42 43 34 30 29 36 29 31

L#19 Testing Hypothesis: Comparing Two Sample Means

x̄ 1− x̄ 2 x̄1 − x̄ 2
Z= t=

√ √
s s ( n1 −1 ) s1 2 + ( n 2−1 ) s2 2

n1+n2 – 2
n1
12
+
n2
22

, when n ≥ 30 or
n1 +n 2−2 √

1 1
+
n 1 n2 , d.f =

Problem 1
A study was made to compare the monthly rental cost of 2 bedroom townhouse unit in Makati against a 2 – bedroom
townhouse unit in Taguig. A random sample of 36 townhouse units in Makati showed a mean rental cost of P!8,000 with a standard
deviation of P700 while a random sample of 40 townhouse units in Taguig showed a mean rental cost P17,800 with a standard
deviation of P600. AT the 0.05 level of significance, is there a significant difference in the mean monthly rental of 2 bedroom
townhouse unit between two cities.
Step 1 HO: There is no significant difference between the mean monthly rental of 2 –bedroom townhouse unit in Makati
against mean monthly rental of 2 –bedroom townhouse unit in Taguig. µ1 = µ2
Ha: There is a significant difference between the mean monthly rental of 2 –bedroom townhouse unit in Makati
against mean monthly rental of 2 –bedroom townhouse unit in Taguig. µ1 ≠ µ2
Step 2 α = 0.05
Step 3 Z = ±1.96
Step 4 Reject Ho If ZC > 1.96 or ZC < - 1.96
x̄ 1− x̄ 2 18000−17800
Z= = =1. 33

Step 5 √
s
n1
12
+
s
n2
22
√ ( 700 )2 ( 600 )2
36
+
40
Step 6 Do not reject Ho. There is no significant difference between the mean monthly rental of 2 –bedroom townhouse unit
in Makati against mean monthly rental of 2 –bedroom townhouse unit in Taguig.

Problem 2

Example: Comparing Packing Machines


In a packing plant, a machine packs cartons with jars. It is supposed that a new machine will pack faster on the average than the
machine currently used. To test that hypothesis, the times it takes each machine to pack ten cartons are recorded. Use α=0.10

New machine Old machine


42.1 41.3 42.4 43.2 41.8 42.7 43.8 42.5 43.1 44.0
41.0 41.8 42.8 42.3 42.7 43.6 43.3 43.5 41.7 44.1

x̄ 1 = 42.14, s1 = 0.683
x̄ 2 = 43.23, s2 = 0.750

1. Within a school district, students were randomly assigned to one of two Math teachers – Mr. Cabada and Mr.
Aranas. After the assignment, Mr. Cabada had 40 students, and Mr. Aranas had 35 students. At the end of the

4
year, each class took the same standardized test. Mr. Cabada’s students had an average test score of 84, with a
standard deviation of 10; and Mr. Aranas’ students had an average test score of 85, with a standard deviation of
15. Test the hypothesis that Mr. Cabada and Mr. Aranas are equally effective teachers. Use a 0.10 level of
significance. (Assume that student performance is approximately normal.)

2. A psychologist studying the human factors of computer keyboards setups up an experiment to compare two
different keyboard designs. He measures the number of words per minute typed by one group on Keyboard A
and then he measures the number of words typed per minute by another group of people on Keyboard B. Use
the data below to determine if the typing speeds on the two different keyboards are significantly different. α = 0.10

Keyboard A (words per minute) Keyboard B (words per minute)

54, 62, 75, 59, 78, 64, 69, 72, 50, 73 47, 51, 54, 62, 44, 51, 48, 65, 42, 44, 71, 68

3. An investigator thinks that people under the age of forty have vocabularies that are different than those of people
over sixty years of age. The investigator administers a vocabulary test to a group of 31 younger subjects and to a
group of 31 older subjects. Higher scores reflect better performance. The mean score for younger subjects was
14.0 and the standard deviation of younger subject's scores was 5.0. The mean score for older subjects was 20.0
and the standard deviation of older subject's scores was 6.0. Does this experiment provide evidence for the
investigator's theory at α = 0.05?
4. An investigator predicts that dog owners in the country spend more time walking their dogs than do dog owners in
the city. The investigator gets a sample of 21 country owners and 23 city owners. The mean number of hours per
week that city owners spend walking their dogs is 10.0. The standard deviation of hours spent walking the dog by
city owners is 3.0. The mean number of hours country owners spent walking theirs dogs per week was 15.0. The
standard deviation of the number of hours spent walking the dog by owners in the country was 4.0. Do dog
owners in the country spend more time walking their dogs than do dog owners in the city? Please test the
investigator's theory using an alpha level of .10.
5. An investigator theorizes that people who participate in a regular program of exercise will have levels of systolic
blood pressure that are significantly different from that of people who do not participate in a regular program of
exercise. To test this idea the investigator randomly assigns 21 subjects to an exercise program for 10 weeks and
21 subjects to a non-exercise comparison group. After ten weeks the mean systolic blood pressure of subjects in
the exercise group is 137 and the standard deviation of blood pressure values in the exercise group is 10. After
ten weeks, the mean systolic blood pressure of subjects in the non-exercise group is 127 and the standard
deviation on subjects in the non-exercise group is 9.0. Please test the investigator's theory using an alpha level of
0.05.

L#20 Testing Hypothesis: Z - test for Comparing Population Proportion and sample proportion

When to use?
 If data is on the nominal scale ^p −p
 Outcomes can be dichotomized into two categories namely “ success”
Z=
symbolized by p and “failure” symbolized by q or 1 – p
 If q and p are mutually exclusive

Problem#1
√ pq
n

A respectable survey organization hypothesized that in order for a particular re – electionist governor to be
successful in his re – election bid, he must garner at least 60% of the votes cast. What are the chanced of the re
electionist governor is 1360 out of 2000 registered voters in a survey indicated their desire to vote for the said governor?
Use α = 0.01.

Step 1 HO: The proportion of registered voters who are in favor of the re – electionist governor is not significantly higher
from 60%
Ha: The proportion of registered voters who are in favor of the re-electionist governor is significantly higher from
60%

Step 2 α = 0.01
Step 3 C.V. Z = 2.33
Step 4 Decision Rule: Reject Ho if Zc > 2.33
Step 5
x 1360 Z=
^p −p
=
0 . 68−. 60
=7 .3
^p= = =0 . 68
n 2000 √ √
pq
n
( 0 .60 )( 0 . 40 )
2000

Step 6 Reject Ho. The proportion of registered voters who are in favor of the re-electionist governor is significantly higher
from 60%

5
Problem # 2
The canteen manager claims that 80 percent of his 1,000,000 customers are very satisfied with the service they
receive. To test this claim, the students surveyed 100 customers, using simple random sampling. Among the sampled
customers, 73 percent say they are very satisfied. Based on these findings, can we reject the canteen manager's
hypothesis that 80% of the customers are very satisfied? Use a 0.05 level of significance.

Step 1 HO: ______________________________________________________________________________________

Ha:_______________________________________________________________________________________Step 2
α= _____
Step 3 C.V. = _______________ ____ - tailed
Step 4 Decision Rule: Reject Ho if ____________________________
Step 5 ^p = ______, p = ________, q = __________
^p −p
Z= =

√ pq
n
Step 6 __________________________________________________________________________________
__________________________________________________________________________________

1. We wish to know if we may conclude that fewer than 5% of the drug users in the sampled population are HIV
positive. A survey showed 18 out of 400 drug users are HIV positive. Use α = 0.10
2. A food technologist is testing a new process for preparing a particular kind of fruit for canning. Typically, 10% of
the cans of the fruit prepared using the current process show visible discoloration after 90 days. She selects a
random sample of 225 cans prepared with the new process and finds that after 90 days, only 17 cans show signs
of discoloration. Does this data allow us to say that the new process makes a difference in the proportion of cans
that tend to discolor within 90 days? Use α = 0.05
3. It is believed that 40% of the residents of Barangay Nuevo are in favor of the restoration of the death penalty. Out
of 500 residents surveyed, 220 are in favor of the issue. Test the hypothesis that the proportion of the residents
who favor the restoration of death penalty is not significantly different form 40% Use 0.01 level of significance.
4. An independent research group is interested to show that the percentage of babies delivered
through Ceasarian Section is decreasing. For the past years, 20% of the babies were delivered
through Ceasarian Section. The research group randomly inspects the medical records of 144 births
and finds that 25 of the births were by Ceasarian Section. Can the research group conclude that the
percent of births by Ceasarian Section has decreased at 5% level of significance?

L#21 Testing Hypothesis: Z - test for Comparing two Proportions

0.6−0.7 x 1+ x 2
Z= =−2.76 ^pn =
^p1 = proportion of success in 1st sample n1 + n2


of 1( st0.66 ( ) 1 1
)( 0.34 ) +
sample 300 400
^p2 = proportion of success in 2nd sample q^ n=1− ^pn
n1 = size

n2 = size of 2nd sample

Problem 1.
A survey was done about the issue of allowing late students coming to school to attend their first period class.
Out of 400 female students interview 280 said “yes” and out of 300 male students asked the same question, 180 said
“yes”. At a level of significance of 0.05, is there a significant difference in the proportion of male against female students
who agree of allowing late students coming to school to attend their first period class.

Step 1 Ho: There is no significant difference in the proportion of male against female students who agree of allowing
late students coming to school to attend their first period class.
Ha: There is a significant difference in the proportion of male against female students who agree of allowing late
students coming to school to attend their first period class.
Step 2 α = 0.05
Step 3 C.V. ± 1.96
Step 4 Reject if Zc > 1.96 or Zc < - 1.96
180 280 180+280
^p1 = =0 . 6 p2 = =0 .7 ^pn = =0 . 66 q^ n=1−0 .66=0 .34
Step 5 300 400 300+400
0 .6−0 . 7
Z= =−2 . 76

√ (
( 0 . 66 ) ( 0 .34 )
1
+
1
300 400 )
Step 6 Reject H0: There is a significant difference in the proportion of male against female students who agree of
allowing late students coming to school to attend their first period class.
6
Problem # 2
A survey was conducted to determine the proportion of teachers and students respondents who are in favor of
giving of condoms in public schools. 120 out of 400 teachers and 130 out of 500 students are in favor of the issue. Is
there a significant difference in the proportion of teachers and students who favor the issue? Use a 0.05 level of
significance.
Step 1 Ho: ____________________________________________________________________________________
Ha: ____________________________________________________________________________________
Step 2 __________________
Step 3 __________________
Step 4 Reject if ______________________
Step 5
^p1 =___________ ^p2 =__________ ^pn =____________ q^ n=_____________

Z =_______________
Step 6 _____________________________________________________________________________________

1. Time magazine reported the result of a telephone poll of 800 adult Americans. The question posed of the
Americans who were surveyed was: "Should the federal tax on cigarettes be raised to pay for health care
reform?" The results of the survey were: Use α = 0.10

2. Suppose the Acme Drug Company develops a new drug, designed to prevent colds. The company states that the
drug is equally effective for men and women. To test this claim, they choose a a simple random sample of 100
women and 200 men from a population of 100,000 volunteers. At the end of the study, 38% of the women caught
a cold; and 51% of the men caught a cold.

a. Based on these findings, can we reject the company's claim that the drug is equally effective for men and
women? Use a 0.05 level of significance.

b. Based on these findings, can we conclude that the drug is more effective for women than for men? Use a 0.01
level of significance.

3. In a study of patients on sodium-restricted diets, 55 patients with hypertension were studied. Among
these, 24 were on sodium-restricted diets. Of 149 patients without hypertension, 36 were on sodium-
restricted diets. We would like to know if we can conclude that, in the sampled population, the
proportion of patients on sodium-restricted diets is higher among patients with hypertension than among
patients without hypertension.

L#22 Correlation and Regression

 Correlation analysis is a method used to measure the strength of relationship between two or more
variables.
 The correlation calculation only works well for relationships that follow a straight line. Correlation Is Not
Good at Curves
 "Correlation Is Not Causation" ... which says that a correlation does not mean that one thing causes
the other (there could be other reasons the data has a good correlation).

A. Scatter Diagram: Consider the grades of five students in English and Mathematics

Student Eng (x) Math (y)


100
A 55 69
E C
B 64 85 75 B
M
C 96 99 A
A 50
D 44 52 T D
E 83 89 H 25

25 50 75 100
ENGLISH
Types of Correlation
A. Direction (sign)

7
1. POSITIVE. High scores in one variable are associated with high scores in the second variable. (v.versa)
2. NEGATIVE. High scores in one variable are associated with low scores in the second variable. (v.versa)
3. ZERO. Scores in one variable tend to score neither systematically high nor low in the other variable.

B. Coefficient

Value of correlation r Interpretation 1 is a perfect positive correlation


±0.80 < r < ±0.99 High 0 is no correlation
±0.60 < r < ±0.79 Moderately high -1 is a perfect negative correlation
±0.40 < r < ±0.59 Moderate
±0.20 < r < ±0.39 Low
C. Scatter Diagram
±0.01< r < ±0.19 negligible

A. Pearson Product-Moment Correlation


There are five assumptions that are made with respect to Pearson's correlation:
1. The variables must be either interval or ratio measurements
2. The variables must be approximately normally distributed (
3. There is a linear relationship between the two variables
4. Outliers are either kept to a minimum or are removed entirely.
5. There is homoscedasticity of the data.

Problem 1
Below are the data for six participants giving their number of years
in college (X) and their subsequent yearly income (Y). Income here
is in thousands of Pesos. Test whether there is a relationship
with Alpha = .05.

Step 1 Ho: There is no significant linear correlation between years of education and income
Ha: There is a significant linear correlation between years of education and income
Step 2 α = 0.05
Step 3 rcritical = + 0.811
Step 4 Reject Ho if r < - 0.811 or r > 0.811
Step 5

Step 6 There is a significant relationship between years spent in college and income. The more years of school,
the more the subsequent income. (The relation is significant; the result is not due to chance alone).
Problem 2 Problem 3
A teacher wants to find out if students’ scores in English are A teacher wants to know if the number of hours X
correlated with the scores in Filipino. Use α = 0.01 spent in studying X is correlated with the score
obtained in an examination Y
Studen Scores in Scores in XY X2 Y2 Student X Y XY X2 Y2
t English X Filipino Y A 3.0 20
1 28 27 B 2.7 34
2 22 2 C 3.8 19
3 18 14 D 2.6 10
4 16 25 E 3.3 24
5 30 15 F 3.4 31
6 25 23
7 19 17
8 12 10
9 13 22
10 24 14 8
B. Spearman Rank Correlation Coefficient

 It is also known as the "spearman rho" or "spearman r correlation".


 Correlation coefficient between the ranks (ordinal level). The correlation coefficient is sometimes
denoted by rs.
 Spearman's correlation determines the strength and direction of the monotonic relationship between
your two variables rather than the strength and direction of the linear relationship between your two
variables, which is what Pearson's correlation determines.

Pic 1 is monotonic and linear


Pic 2 is monotonic but not linear
Pic 3 is neither monotonic nor linear

Problem 1
Ten teaching styles were ranked by ABM students and Stem students. The data are tabulated in the table below
where the highest rank(most preferred) is 1 and the lowest rank(least preferred) is 5. Is there a significant relation
between the ranking of ABM and Stem at α= 0.10
Teaching
Style
ABM STEM d d2 Step 1 Ho: There is no significant difference between the preferences of ABM
A 6 5 against STEM.
B 3 2 Ha: There is a significant difference between the preferences of ABM
against STEM.
C 1 3
Step 2 α = 0.10
D 8 6
Step 3 C.V. = ±0.600
E 4 7
Step 4 Reject H0 if rs < - 0.6 or H0 if rs > 0.6
F 2 1
Step 5
G 10 9
H 9 8
I 5 4
J 7 10 Step 6 ___________________________________________________________

Problem 2 Problem 3
The following table shows the rating of a group of five The table below shows the evaluations of the services
Students who have been evaluated independently for of the canteen. Evaluations were done by the faculty
leadership on a scale of 1 to 10 by the principal and a members and the students. Determine if there is a
faculty member. Calculate rs if there is a correlation difference between the two evaluations.
between the two evaluations.
Service Faculty Students
Students Principal area Faculty
A 4 A 7 8 6
B 6 B 8 9 8
C 7 C 6 7 8
D 7 D 9 6 7
E 5 E 7 6 5
F 5 9
G 8 4

C. REGRESSION ANALYSIS
If two variables are correlated, then it is possible to predict or estimate a variable based on the changes
or movements of the other variable. Let X and Y be the correlated variables, the regression equation can be
represented by Y = a + bX, where b ≠ 0.

( ∑ Y ) ( ∑ X 2 )−(∑ X )( ∑ XY ) n ( ∑ XY ) −( ∑ X )( ∑ Y )
a= 2
b= 2
n ∑ X 2−( ∑ X ) n ∑ X 2 −( ∑ X )

9
Problem1.
The following table shows the number of weeks six persons have work at an automobile inspection station and
the number of cars each one inspected on a given day.
a. Determine the regression equation
b. Predict the number of cars inspected by someone who has been working for 10 week.
Employe No. of No. of cars XY X2
e weeks (X) inspected (Y)
A 2 13
B 7 21
C 9 23
D 1 14
E 5 15
F 12 21
∑X = ∑Y = ∑ XY = ∑ X2 =

Problem 2 Problem 3
The table below shows the number of hours 9 The table below shows the final grades of 7
employees have spent working and the number randomly selected SHS students in English(X)
of defective products they made. Determine the and Mathematics (Y). Write the regression
regression equation. Estimate the number of equation.
defective products made by an employee who
worked for 8 hours.

Employe No. of No. of XY X2 Students English Math XY X2


e hrs defective(Y X Y
(X) ) 1 78 84
A 1.0 13 2 89 75
3 85 77
B 1.5 14 4 87 72
5 84 76
C 2.5 16
6 81 78
D 2.1 14 7 95 70

E 3.5 15

F 4.5 20

G 4.0 18

H 5.5 18

i 6.0 20

10
Critical value: SPEARMAN RANK ORDER CORRELATION COEFFICIEN
(DEGREE OF FREEDOM: n - 2

11

You might also like