Professional Documents
Culture Documents
AND NON
PARAMETRIC
TEST
INTRODUCTION
To make the generalisation about the population from the sample, statistical tests are used.
A statistical test is a formal technique that relies on the probability distribution, for reaching
the conclusion concerning the reasonableness of the hypothesis.
These hypothetical testing related to differences are classified as parametric and nonparametric
tests.
The parametric test is one which has information about the population parameter.
On the other hand, the nonparametric test is one where the researcher has no idea regarding
the population parameter.
DEFINITION OF
PARAMETRIC TEST
The parametric test is the hypothesis test which provides generalisations for making
statements about the mean of the parent population.
A t-test based on Student’s t-statistic, which is often used in this regard.
The t-statistic rests on the underlying assumption that there is the normal distribution of
variable and the mean in known or assumed to be known.
The population variance is calculated for the sample. It is assumed that the variables of
interest, in the population are measured on an interval scale.
DEFINITION OF
NONPARAMETRIC TEST
The nonparametric test is defined as the hypothesis test which is not based on underlying
assumptions, i.e. it does not require population’s distribution to be denoted by specific
parameters.
The test is mainly based on differences in medians.
Hence, it is alternately known as the distribution-free test.
The test assumes that the variables are measured on a nominal or ordinal level.
It is used when the independent variables are non-metric.
KEY DIFFERENCES BETWEEN
PARAMETRIC AND
NONPARAMETRIC TESTS
1. A statistical test, in which specific assumptions are made about the
population parameter is known as the parametric test. A statistical test used
in the case of non-metric independent variables is called nonparametric test.
2. In the parametric test, the test statistic is based on distribution. On the other
hand, the test statistic is arbitrary in the case of the nonparametric test.
3. In the parametric test, it is assumed that the measurement of variables of
interest is done on interval or ratio level. As opposed to the nonparametric
test, wherein the variable of interest are measured on nominal or ordinal
scale.
KEY DIFFERENCES BETWEEN
PARAMETRIC AND
NONPARAMETRIC TESTS
4. In general, the measure of central tendency in the parametric test is mean,
while in the case of the nonparametric test is median.
5. In the parametric test, there is complete information about the population.
Conversely, in the nonparametric test, there is no information about the
population.
6. The applicability of parametric test is for variables only, whereas
nonparametric test applies to both variables and attributes.
7. For measuring the degree of association between two quantitative variables,
Pearson’s coefficient of correlation is used in the parametric test, while
spearman’s rank correlation is used in the nonparametric test.
EQUIVALENT TEST
CONCLUSION
To make a choice between parametric and the nonparametric test is
not easy for a researcher conducting statistical analysis.
For performing hypothesis, if the information about the population
is completely known, by way of parameters, then the test is said to
be parametric test whereas, if there is no knowledge about
population and it is needed to test the hypothesis on population,
then the test conducted is considered as the nonparametric test.
Flow Diagram State H0 and H1
Sample a random sample and workout an appropriate value from sample data
Calculate the prob. that sample result would diverge as widely as it has from
expectations, if H0 were true.
Is the probability equal to or smaller that ά value in case of one-tailed test and ά /2
in case of two tailed test.
Yes No
Reject H0 Accept H0
Thereby run the risk of Thereby run some risk of
committing type I error. committing type II error.
T-TEST
Test of significance of mean
Test for the significance of different between two means
Paired t-test
Assumptions:
The parent population from which the sample is drawn is normal.
The sample observations are independent.
The population s.d. is unknown.
TEST FOR SINGLE MEAN
= = -4.0
See table for tabulated value.
d = x-y
An IQ test was administered at 5 persons before and after they were trained. The results are
given below:
Goodness of fit
Independence of attributes
CONDUCTING CHI-SQUARE
ANALYSIS
1) Make a hypothesis based on your basic biological question
2) Determine the expected frequencies
3) Create a table with observed frequencies, expected frequencies, and chi-square values
using the formula:
0 1 2 3 4 5 6 7 8 9 Total
1026 1107 997 966 1075 933 1107 972 964 853 10000
Test whether the digits may be taken to occur equally frequently in the
directory. (The table value of Chi square for 9 d.f. at 5% level of significance is
16.92)
ANOVA
ANAlysis Of
VAriance
ANOVA (ONE WAY)
Analysis of Variance (ANOVA) is a hypothesis-testing technique used to test the equality of
more than two population (or treatment) means by examining the variances of samples that are
taken.
ANOVA allows one to determine whether the differences between the samples are simply due
to random error (sampling errors) or whether there are systematic treatment effects that causes
the mean in one group to differ from the mean in another.
Most of the time ANOVA is used to compare the equality of three or more means, however
when the means from two samples are compared using ANOVA it is equivalent to using a t-
test to compare the means of independent samples.
ANOVA is based on comparing the variance (or variation) between the data samples to
variation within each particular sample. If the between variation is much larger than the within
variation, the means of different samples will not be equal. If the between and within
variations are approximately the same size, then there will be no significant difference
between sample means.
Assumptions of ANOVA:
(i) All populations involved follow a normal distribution.
(ii) All populations have the same variance (or standard deviation).
(iii) The samples are randomly selected and independent of one another.
EXAMPLE
A trucking company wishes to test the average life of each of the four brands of tyres. The
company uses all brands on randomly selected trucks. The records showing the lives of tyres .
Test the hypothesis that the average life for each brand of tyres is the same.
Are these frequencies in agreement with the belief that accident conditions
were the same during this 10 month period?