Professional Documents
Culture Documents
Hypothesis Testing
When interpreting research findings, researchers need to assess whether these findings
may have occurred by chance. Hypothesis testing is a systematic procedure for deciding whether the
results of a research study support a particular theory which applies to a population.
B. Basic Concepts
1. Variable are any characteristics that can take on different values such as height, age,
temperature or test scores.
2. The independent variable is the cause. Its value is independent of other variables. The
independent variable is controlled in the problem.
3. The dependent variable is the effect. Its value depends on changes in the independent
variable. The variable that changes because of this control in the independent variable.
Example : In a study to determine whether how long a student sleeps affects test scores.
The independent variable is the length of time spent in sleeping. The dependent variable is
the test scores.
To carry out statistical hypothesis testing research / alternative and null hypothesis are
employed ;
Research/ Alternative Hypothesis – this is the hypothesis that you propose, also known as the
alternative hypothesis H1. Research / Alternative hypothesis, H1 is a statistical proposition stating
that there is a significant difference between a hypothesized value of a population parameter and
its estimated value. For example :
H1 : First year university students obtain higher grades after an intensive Statistics
course.
The null hypothesis, Ho is the opposite of the research / Alternative hypothesis and expresses that
there is no relationship between variables, or no difference between groups. For example :
Ho : First year university students do not obtain higher grades after an alternative
Statistics course.
to (_ > )
to ( _< )
The purpose of hypothesis testing is to test whether the null hypothesis (there is no
difference, no effect) can be rejected or approved. If the null hypothesis is rejected,
then the research/ alternative hypothesis is accepted, then the research/
alternative hypothesis is rejected.
A one – tailed test may be either left – tailed or right – tailed. A left – tailed test is used
when the H1 states that the true value of the parameter specified in the Ho is less than the
Ho claims. A right – tailed test is used when the H1 states that the true value of the
parameter specified in the Ho is greater than the Ho claims.
2. A two – tailed test results from an H1 which does not specify a direction. I,e. when the H1
states the Ho is wrong.
The main difference between one – tailed and two – tailed tests is that one – tailed tests will
only have one critical region, whereas two –tailed tests will have two critical regions.
Critical region also known as the rejection region, is a set of values for the test
statistic for which the null hypothesis is rejected. i.e. if the observed test statistic is
in the critical region, then we reject the null hypothesis, and accept the alternative
hypothesis.
A critical value is the score the sample would need to decide against the null
hypothesis.
A probability value is used to assess the significance of the statistical test. If the null
hypothesis is rejected, then the alternative to the null hypothesis is accepted.
Examples :
a. A light bulb manufacturer claims that its energy saving light bulbs last an average
of 60 days. Set up a hypothesis test to check this claim.
Because of the “is not” in the H1, we have to consider both the possibility that the
lifetime of the energy saving bulb is greater than 60, and that it is less than 60.This
means, we have to use a two – tailed test.
b. The manufacturer now decides that it is only interested whether the mean
lifetime of an energy saving light bulb is less than 60 days. What changes would
you make from example a ?
H1 : The mean lifetime of an energy saving light bulb is less than 60 days.
Now we have a “less than” in the H1. This means that instead of performing a two
– tailed test, we will perform a left – sided one – tailed test.
D. Level of Significance
Is defined as the criteria or threshold value based on which one can reject the null
Hypothesis or fail to reject the null hypothesis. The level of significance determines whether the
outcome of hypothesis testing is statistically significant or otherwise. The significance level is
also called as alpha level. The level of significance can take values such as 0.10, 0.05, 0.01. The
most common value of the level of significance is 0.05.
If the mean more accurately represents the center of the distribution of your data, and your
sample size is large enough , use a parametric test even if you have a large sample size.
1. t – Tests are employed to compare the mean score on some continuous variable for two
groups. The null hypothesis to be tested is there is no difference between the two groups
(e.g. anxiety scores for males and females are not different).
T – Test can be employed to compare the mean scores of two different groups (independent
– sample t-test) or to compare the same group of people on two different occasions (paired
– sample t-test).
In addition to assessing whether the difference between the two groups is statistically
significant.
2. ANOVA
While t – tests compare the mean score on one variable for two groups, analysis of variance
is used to test more than two groups. Analysis of variance would be employed to test
whether there are differences in anxiety scores for students from different disciplines
referring to the previous example.
Analysis of variance compares the variance (variability in scores) between the different
groups (believed to be due to the variable) with the variability within each group believed to
be due to chance). An F ratio is calculated; a large F ratio indicates that there is more
variability between the groups (caused by the independent variable) than there is within
each group. A significant F test indicates that we can reject the null hypothesis; i.e. that
there is no difference between the groups.
3. Correlation Test
Is a statistical method used to measure the strength of the linear relationship between two
variables and compute their association. Simply put – correlation analysis calculates the
level of change in one variable due to the change in the other.
When data are not distributed normally or when they are on an ordinal level of
measurement, we have to use non – parametric tests for analysis.
It is used to test whether two categorical variables are related to each other. Also, to
determine if there is a significant relationship between two nominal (categorical) variables.
The frequency of each category for one nominal variable is comparted across the categories
of the second nominal variable.