Professional Documents
Culture Documents
TESTING
1. Define and explain statistical terms
2. Differentiate the different divisions of statistics
3. Identify the scale of measurement of variables
C 4.
5.
Differentiate data sets
Present data in three different ways
H
OBJECTIVES:
A
1. Understand the definitions used in hypothesis testing
2. Construct and differentiate null from alternative
P hypothesis
3. Differentiate type I error from type II error
T 4. State the steps on hypothesis testing
5. Test the means using t-test or z-test
E 6. Find the critical values for t-test and z-test
7. Learn how to solve dependent and independent
R sample tests using MS- Excel
Kinds of Hypothesis
1. Null Hypothesis (Ho) - expresses the idea of non- significance. The statement
opposes the perceived conclusion in a research.
Example:
State the null hypothesis and alternative hypothesis for the following statements.
Ho: There is no significant relationship between the attitude score and academic
performance.
Ha: There is significant relationship between the attitude score and academic
performance.
Type II Error - The second type of error committed is when the alternative hypothesis
is rejected wherein it is the true solution to the problem.
Level of Significance
The significance level is the criterion used for rejecting the null hypothesis.
Choice of levels is largely subjective and most common choices for the level of
significance are α = 0.05 and α = 0.01.
Setting the level of significance at .05 indicates that the researcher is 95%
confident that the alternative hypothesis is true or accepted and the researcher has a 5%
chance of rejecting the null hypothesis. Reducing the significance level at 0.01 indicates
that the researcher is 99% confident that he is making the right decision and 1% chance
of committing error.
The rejection region is the region in which the null hypothesis is rejected if the
test statistics falls within it. By setting the level of significance, the rejection or critical
value can be determined. It serves as the basis whether the hypothesis is rejected or not
rejected.
The location of the rejection region is determined by the nature of the alternative
hypothesis, whether the hypothesis is one-tailed or a two-tailed test.
A test of any statistical hypothesis, where the alternative is one– sided. In a one-
sided test, it is a directional statement of the alternative hypothesis wherein only one
meaning can be interpreted from it.
1. Ho: The average income of a skilled worker in the Philippines is not greater than
300Php.
Ha: The average income of a skilled worker in the Philippines is greater than
300Php.
For the above example the test to be used is one tailed test because the
alternative hypothesis is stated directional. Greater than 300Php has only one
meaning.
Ho: There is no significant difference in the leadership performance of sectarian and non-
sectarian school administrator.
Ha: There is a significant difference in the leadership performance of sectarian and non-
sectarian school administrator.
The Z-test and T- test is used to determine whether there are any significant
differences between the means of two groups.
ZC
x n
where:
x = sample mean
= population mean
n = total number of samples
= population standard deviation
where:
x = sample mean
= population mean
n = total number of samples
= population standard deviation
𝑥 − 𝑛𝑝0
𝑍𝑐 =
√𝑛𝑝0 𝑞0
(𝑃̂1 − 𝑃̂2 )
𝑍𝑐 =
1 1
√𝑝̂ 𝑞̂ (𝑛 + 𝑛 )
1 2
In determining the tabular value for z-test use the table for critical values of z
based on the area under the normal curve. (Refer to Appendix B)
Example 1
A record from local college shows that the mean score in the qualifying
examination of sophomore students was 45 with a standard deviation of 2.5. Samples of
80 students were taken and found to have a mean score of 48. Are the 80 students
better than the rest of the sophomore students? Use 0.01 level of significance
Solution:
1. Ho: The 80 students are not better than the rest of the sophomore students.
Ha: The 80 students are better than the rest of the sophomore students.
2. : 0.01
3. z-test (one tailed)
ZC
x n
x = 48
= 45
n = 80
= 2.5
ZC
48 45 80
2.5
ZC 10.73
7. Formulate conclusion: The 80 students are better than the rest of the sophomore
students.
Example 2
Solution:
(𝑥̅1 − 𝑥̅ 2 ) − 𝑑0
𝑍𝑐 = 2 2
√(𝜎1 ⁄𝑛 ) + (𝜎2 ⁄𝑛 )
1 2
Z C 12.36
6. Decision: Since the computed z value of 12.36 is greater than the tabular value
of 1.96 therefore reject Ho.
Example 3
In a market survey, 31 out of 126 males and 68 out of 190 females indicate a
preference for a particular brand of mobile phones. Do the data warrants the conclusion
that there is a sex difference exists in brand preference for mobile phones? Use 0.01
level of significance
Solution:
1. Ho: Sex difference does not exist in brand preference for mobile phones.
Ha: Sex difference exists in brand preference for mobile phones.
2. : 0.01
3. z-test (two tailed)
(𝑃̂1 − 𝑃̂2 )
𝑍𝑐 =
1 1
√𝑝̂ 𝑞̂ (𝑛 + 𝑛 )
1 2
where:
𝑥1 + 𝑥2
𝑝̂ =
𝑛1 + 𝑛2
z c 2.10 2.10
5. Decision: Since the computed z value of 2.10 is less than tabular value of 2.58
therefore we cannot reject Ho.
6. Formulate conclusion: Sex difference does not exist in brand preference for
mobile phones.
x1 x2
tC
n1 1s1 2 n2 1s2 2
1
1
n1 n2 2 n1 n2
where:
x1 = mean of the first group of sample
x 2 = mean of the second group of sample
n1 = total number of the first group of sample
n2 = total number of the second group of sample
s1 = standard deviation of the first group of sample
s2 = standard deviation of the second group of sample
3. T-test of correlated or paired samples: Compare the means between two related
groups obtained from the same sample.
General Formula:
𝑑̅ − 𝑑0
𝑡=
𝑆𝑑
⁄
√𝑛
where:
𝑑̅ = mean difference between the matched pairs of measures
𝑑0 = hypothesized difference between the matched pairs of measures
𝑆𝑑 = standard deviation of the difference between the matched pairs of
measures
n = total number of samples
t
x 2 x1
n x2 x1 ( x2 x1 )
2 2
n 1
Where:
Example 1
Last year an average life span of 69 years with standard deviation of 8.9 years
was shown by a random sample of 100 recorded deaths in the Philippines. Using 0.05
level of significance does it seem to show that the average life span today is greater than
72 years?
Solution
1. Ho: The average life span today is not greater than 72 years.
Ha: The average life span today is greater than 72 years.
2. : 0.05
3. t-test (one- tailed)
4. df n 1
df 100 1 99
tt 1.66 Tabular value
5. Computation using the formula for sample mean compared with population mean
tC
x n
s
Substitute the given values in the formula
x = 69
= 72
n = 100
tC
69 72 100
8.9
t C 3.37
6. Decision: Since t computed value of -3.37 is less than tabular value of 1.66.
Therefore, we reject our null hypothesis (Ho).
7. Formulate conclusion: The average life span today is greater than 72 years.
Example 2
The claim has been made that personal attention of an instructor will improve a
trainee’s score. Test the claim at the 0.05 level of significance.
Solution
80 61
t
(10 1)(5.40) 2 (10 1)(3.37) 2 1 1
10 10 2 10 10
t c 9.44
6. Decision: Since computed t value of 9.44 is greater than tabular value of 1.73.
Therefore, we reject our null hypothesis (Ho).
Example 3
Before 12 11 14 9 8 6 8 5 4 7
After 13 12 10 9 8 8 7 6 5 5
Solution
1. Ho: There is no enough evidence to support the claim that there was a change in
attitude about exercise.
Ha: There is enough evidence to support the claim that there was a change in
attitude about exercise.
2. : .05
3. t-test (two- tailed)
t
x 2 x1
N x2 x1 ( x2 x1 )
2 2
N 1
Before X1 12 11 14 9 8 6 8 5 4 7
Total
After X 2 13 12 10 9 8 8 7 6 5 5
X 2 X1 1 1 -4 0 0 2 -1 1 1 -2 -1
X 2 X 1 2 1 1 16 0 0 4 1 1 1 4 29
X 2 X1 -1
X 2 X 1 2 29
1
t t 0.18 0.18
1029 1
2
10 1
Decision: Since t computed value of 0.18 is less than tabular value of 2.26.
Therefore, we cannot reject the null hypothesis (Ho).
6. Formulate conclusion: There is not enough evidence to support the claim that
there was a change in attitude about exercise.
Solve each of the following problems completely following the steps in hypothesis
testing.
1. After selecting a random sample of 28 people from a very large population, they
want to determine if the average age of the sample is representative of the
average age of the population. The mean age of the population is 33. From
sample, the mean age is 26 and the standard deviation is 3.4. Is the mean age
of the sample significantly different from the mean age in the population?
2. A math test uses a sample of 16 students with a mean score of 78 and another
sample of 21 students with a mean score of 82. Is there any evidence that one
class is better than the other at .01 level of significance if the sample variance of
the former is 5 and the latter 8?
3. Mr. Piolo and Gerard administered their researcher made test to measure the
mathematical ability of male Grade VI pupils on some solid figures. They
randomly took 10 male pupils of Kalalake Elementary School. Their scores in
each solid figure are shown below.
Students 1 2 3 4 5 6 7 8 9 10
Cube 9 7 6 6 4 3 5 4 5 8
Cylinder 5 6 8 6 5 6 5 7 3 5
Females- 18, 19, 22, 24, 25, 25, 26, 27, 30, 32, 36, 36
Males-13, 16, 17, 19, 20, 20, 21, 21, 24, 24, 26, 27, 28, 30, 32, 32, 33
61 40 67 34 23 65 78 90
Pre-test
45 89 83 52 92 87 67 56
56 80 60 60 70 55 94 60
Post-test
56 77 70 89 94 89 80 80
6. Compare the mean age of incoming students to the known mean age for all previous
incoming students. A random sample of 40 incoming college freshmen was taken and
was found to have a mean age 19.5 years with standard deviation of 1 year. The college
database shows the mean age for previous incoming students was 18. Compute the
appropriate test and make a decision.
7. A company wants to know if the mean of its new product differs from the mean of
the standard which is 0.742 pounds and a standard deviation of 0.0403 pounds. If a
sample of 35 yields a mean of 0.720. Compute the appropriate test and make a decision.
8. The employment status of working students in Olongapo City is below. Is there any
difference in the proportion of employed in males and females? At .05 level of
significance
Employment Status
Gender
Employed Unemployed
Males 256 23
Females 204 77
Total 460 100
Anxious People 3 3 4 5 5
Calm People 4 6 7 9 9
Set up the appropriate statistical test and make a decision.
11. Some investigators have proposed that students have elevated blood pressure during
finals week. To test this hypothesis 8 students volunteered to have their blood pressure
taken at the beginning of the semester and then again during finals week. The blood
pressure data (diastolic) is listed below. Conduct a test to determine if there is a
difference between the first week and finals week.
12. At the beginning of school year the average reading comprehension of freshmen
college students was recorded at 79.37. After three months of studying the same
instruments for reading comprehension test was used for 56 students and scored an
average of 86.48 with a standard deviation of 4.65. Does staying in the college improve
the reading comprehension of the freshmen students?
13. The CEO of a small electric utility claims that 70 percent of his 1,000 customers are
very satisfied with the service they receive. To test this claim, the local newspaper
14. Suppose the previous example is stated a little bit differently. Suppose the CEO
claims that at least 70 percent of the company's 1,000 customers are very satisfied.
Again, 100 customers are surveyed using simple random sampling. The result: 63
percent are very satisfied. Based on these results, should we accept or reject the CEO's
hypothesis? Assume a significance level of 0.05.
15. The average length of time for students to register in the first semester at a certain
school has been 50 minutes. A new registration procedure is being tested. If random
samples of 20 students have an average of 45 minutes with a standard deviation of 9
minutes under the system, can you conclude that the new system is faster than the old
one? Assume that the average length of time is normally distributed. Use 0.01 level of
significance
16. Researchers want to examine the effect of perceived control on health complaints of
geriatric patients in a long-term care facility. Thirty patients are randomly selected to
participate in the study. Half are given a plant to care for and half are given a plant but
the care is conducted by the staff. Numbers of health complaints are recorded for each
patient over the following seven days. Compute the appropriate test for the data
provided below.
23 12 6 15 18 5 21 18
Control over Plant
34 10 23 14 19 23 8
35 21 26 24 17 23 37 22
No Control over Plant
16 38 23 41 27 24 32
17. A researcher wishes to study the effectiveness of Psychoanalysis and Gestalt therapy.
Ten twins were the participants who were randomly assigned to each group (i.e., one of
the twins is assigned randomly to one of the therapies and other twin to the other
therapy). At the conclusion of the study, changes in self concept are measured for each
patient. The purpose of the study is to determine which method is more effective. The
data on changes in self concept are as follows:
Psychoanalysis (twin 36 14 15 20 23 16 25 30 20 29
1)
Gestalt Therapy 20 25 40 15 17 18 34 23 24 10
18. A researcher wishes to study the effectiveness of Reality and Behavior therapy. Ten
patients were randomly assigned to each group. At the conclusion of the study, changes
in self-concept are measured for each patient. The purpose of the study is to determine
which is more effective. The data on changes in self concept are as follow:
Reality Therapy 46 24 25 30 33 26 35 40 30 39
Behavior Therapy 30 35 50 25 27 38 44 33 34 20
19. Score of twelve students before and after a special coaching benefit students in that,
are the students’ scores higher after coaching. Use 0.01 level of significance
Before
92 78 47 37 64 54 43 33 58 62 87 75
coaching
After
82 80 62 49 55 73 59 58 63 65 91 84
coaching
Keyboard A Keyboard B
(words per minute) (words per minute)
47
54 51
62 54
75 62
59 44
78 51
64 48
69 65
72 42
50 44
73 71
68
The columns are probabilities, and the rows degrees of freedoms. Each cell contains the
critical t value for that particular degrees of freedom and probability of Type I error
(alpha α). α is the probability of Type I error.
The t values are for the two tail test. The same t value is used for half the alpha value in
a one tail test.For example, at 20 degrees of freedom, the t value of 2.086 is critical for
α=0.05(2 tail) and α=0.025 (1 tail). With the same degrees of freedom, 1.725 is critical
for α=0.1(2 tail) and α=0.05(1 tail).
✓ Be patient
✓ Be respectful