Hypothesis Testing All Tests

PARAMETRIC
AND NON
PARAMETRIC
TEST
INTRODUCTION
 To make the generalisation about the population from the sample, statistical tests are used.
 A statistical test is a formal technique that relies on the probability distribution, for reaching
the conclusion concerning the reasonableness of the hypothesis.
 These hypothetical testing related to differences are classified as parametric and nonparametric
tests.
 The parametric test is one which has information about the population parameter.
 On the other hand, the nonparametric test is one where the researcher has no idea regarding
the population parameter.
DEFINITION OF
PARAMETRIC TEST
 The parametric test is the hypothesis test which provides generalisations for making
statements about the mean of the parent population.
 A t-test based on Student’s t-statistic, which is often used in this regard.
 The t-statistic rests on the underlying assumption that there is the normal distribution of
variable and the mean in known or assumed to be known.
 The population variance is calculated for the sample. It is assumed that the variables of
interest, in the population are measured on an interval scale.
DEFINITION OF
NONPARAMETRIC TEST
 The nonparametric test is defined as the hypothesis test which is not based on underlying
assumptions, i.e. it does not require population’s distribution to be denoted by specific
parameters.
 The test is mainly based on differences in medians.
 Hence, it is alternately known as the distribution-free test.
 The test assumes that the variables are measured on a nominal or ordinal level.
 It is used when the independent variables are non-metric.
KEY DIFFERENCES BETWEEN
PARAMETRIC AND
NONPARAMETRIC TESTS
1. A statistical test, in which specific assumptions are made about the
population parameter is known as the parametric test. A statistical test used
in the case of non-metric independent variables is called nonparametric test.
2. In the parametric test, the test statistic is based on distribution. On the other
hand, the test statistic is arbitrary in the case of the nonparametric test.
3. In the parametric test, it is assumed that the measurement of variables of
interest is done on interval or ratio level. As opposed to the nonparametric
test, wherein the variable of interest are measured on nominal or ordinal
scale.
KEY DIFFERENCES BETWEEN
PARAMETRIC AND
NONPARAMETRIC TESTS
4. In general, the measure of central tendency in the parametric test is mean,
while in the case of the nonparametric test is median.
5. In the parametric test, there is complete information about the population.
Conversely, in the nonparametric test, there is no information about the
population.
6. The applicability of parametric test is for variables only, whereas
nonparametric test applies to both variables and attributes.
7. For measuring the degree of association between two quantitative variables,
Pearson’s coefficient of correlation is used in the parametric test, while
spearman’s rank correlation is used in the nonparametric test.
EQUIVALENT TEST
CONCLUSION
 To make a choice between parametric and the nonparametric test is
not easy for a researcher conducting statistical analysis.
 For performing hypothesis, if the information about the population
is completely known, by way of parameters, then the test is said to
be parametric test whereas, if there is no knowledge about
population and it is needed to test the hypothesis on population,
then the test conducted is considered as the nonparametric test.
Flow Diagram State H0 and H1
Specify the level of significance i.e. ά
Decide the correct sampling distribution
Sample a random sample and workout an appropriate value from sample data
Calculate the prob. that sample result would diverge as widely as it has from
expectations, if H0 were true.
Is the probability equal to or smaller that ά value in case of one-tailed test and ά /2
in case of two tailed test.
Yes No
Reject H0 Accept H0
Thereby run the risk of Thereby run some risk of
committing type I error. committing type II error.
T-TEST
 Test of significance of mean
 Test for the significance of different between two means
 Paired t-test
 Assumptions:
 The parent population from which the sample is drawn is normal.
 The sample observations are independent.
 The population s.d. is unknown.
TEST FOR SINGLE MEAN

With d.f. = n-1

PROCESS
Statement:
Ten cartons are taken at random from an automatic filling
machine. The mean net weight of the 10 cartons is 11.8 oz
and SD is 0.15 oz.
Does the sample mean differ significantly from the intended
weight of 12 oz?
PROCESS
 H0 : = 12 oz
H1 : 12 oz
= = -4.0
See table for tabulated value.
Reject H0 at 5 % level of significance

T-TEST FOR DIFFERENCE OF
MEAN
 The average number of articles produced by two machines
per day are 200 and 250 with SD 20 and 25 respectively on
the basis of records of 25 day’s production. Can you regard
both the machine equally efficient at 1% level of
significance.
PAIRED T TEST
d = x-y
 An IQ test was administered at 5 persons before and after they were trained. The results are
given below:
Before Training IQ (x) After Training IQ (y)

110 120
120 118
123 125
132 136
125 121
 Test whether there is any change in IQ after the training programme.

TEST
 Parametric and non-parametric test
 Goodness of fit
 Independence of attributes
CONDUCTING CHI-SQUARE
ANALYSIS
1) Make a hypothesis based on your basic biological question
2) Determine the expected frequencies
3) Create a table with observed frequencies, expected frequencies, and chi-square values
using the formula:
4) Find the degrees of freedom: n-1

5) Find the chi-square statistic in the Chi-Square Distribution table
6) If chi-square statistic > your calculated chi-square value, you do not reject your null
hypothesis and vice versa.
The following figures shows the distribution of digits in numbers chosen at random from a
telephone directory:
0 1 2 3 4 5 6 7 8 9 Total
1026 1107 997 966 1075 933 1107 972 964 853 10000
Test whether the digits may be taken to occur equally frequently in the
directory. (The table value of Chi square for 9 d.f. at 5% level of significance is
16.92)
ANOVA
ANAlysis Of
VAriance
ANOVA (ONE WAY)
 Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of
more than two population (or treatment) means by examining the variances of samples that are
taken.
 ANOVA allows one to determine whether the differences between the samples are simply due
to random error (sampling errors) or whether there are systematic treatment effects that causes
the mean in one group to differ from the mean in another.
 Most of the time ANOVA is used to compare the equality of three or more means, however
when the means from two samples are compared using ANOVA it is equivalent to using a t-
test to compare the means of independent samples.
 ANOVA is based on comparing the variance (or variation) between the data samples to
variation within each particular sample. If the between variation is much larger than the within
variation, the means of different samples will not be equal. If the between and within
variations are approximately the same size, then there will be no significant difference
between sample means.
 Assumptions of ANOVA:
 (i) All populations involved follow a normal distribution.
 (ii) All populations have the same variance (or standard deviation).
 (iii) The samples are randomly selected and independent of one another.
EXAMPLE
 A trucking company wishes to test the average life of each of the four brands of tyres. The
company uses all brands on randomly selected trucks. The records showing the lives of tyres .
 Test the hypothesis that the average life for each brand of tyres is the same.
Brand 1 Brand 2 Brand 3 Brand 4

20 19 21 15
23 15 19 17
18 17 20 16
17 20 17 18
16 16
Those who say it can not be
done, should not interrupt those
doing it.
-- Chinese Proverb
PRACTICE
The number of scooter accidents per month in a certain town were as follows:
12, 8, 20, 2, 14, 10, 15, 6, 9, 4
Are these frequencies in agreement with the belief that accident conditions
were the same during this 10 month period?
[Chi sq. table value 16.919 at 5% level of significance with 9 d.f.]

INDEPENDENCE OF
ATTRIBUTES
Own Cell Male Female Total
Telephone
Yes 60 80 140
No 140 120 260
Total 200 200 400

EXAMPLE
 In a certain sample of 2000 families 1400 families are consumers of tea. Out of 1800 Hindu
families, 1236 consume tea.
 Use chi-square test and state whether there is any significance difference between
consumption of tea among Hindu and Non-Hindu families.
 [Table chi-sq 3.841 with 1 d.f. and @ 5% los]

Hypothesis Testing All Tests

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Hypothesis Testing All Tests

Uploaded by

Copyright:

Available Formats

PARAMETRIC

Specify the level of significance i.e. ά

Decide the correct sampling distribution

With d.f. = n-1

Reject H0 at 5 % level of significance

Before Training IQ (x) After Training IQ (y)

 Test whether there is any change in IQ after the training programme.

4) Find the degrees of freedom: n-1

Brand 1 Brand 2 Brand 3 Brand 4

12, 8, 20, 2, 14, 10, 15, 6, 9, 4

[Chi sq. table value 16.919 at 5% level of significance with 9 d.f.]

No 140 120 260

Total 200 200 400

 [Table chi-sq 3.841 with 1 d.f. and @ 5% los]

You might also like