Professional Documents
Culture Documents
1499156887Module16Q1Non Parametrictests
1499156887Module16Q1Non Parametrictests
Co-Principal Investigator
QUADRANT –I
2. Learning Outcome
3. Introduction
There are various situations where the populations under study are not normally distributed. The
data collected from these populations is extremely skewed. In such a situation, the option is to use
a Non-parametric test. Non-parametric statistical tests are hemmed in by fewer and less stringent
assumptions than parametric tests. They are particularly free of assumptions about the
characteristics about the characteristics of the distribution of population of research samples.
They are also called distribution free tests. A nonparametric statistical test does not specify terms
about the parameters of the population from which the sample was drawn.
5. Non-parametric tests
The non-parametric tests such as chi-square, run tests, sign test, the Mann-Whitney U test, and
the Wicoxon matched pair rank test and the Kruskal-Wallis test
• Chi-square
• Run test for randomness
• One sample sign test
Where Xi is the ith sample value, µ is he mean/median and Yi is the ith modified sample
observation (+ or -).
This mean that if a sample value is more than µ, then it is replaced by + sign; if a sample
value is less than µ, then it is replaced by – sign.
Kolmogorov-Smirnov test (K-S) test is similar to the chi-square test to perform goodness of
fit of a specific set of data to an assumed distribution. This test is more power for small
samples whereas the chi-square test is suited for large samples.
In K-S test, the values of the observed cumulative distribution of the random variable in a
given situation are compared with the corresponding values of the theoretical cumulative
distribution of the random variable and their absolute differences are calculated. Then the
maximum of these absolute difference is treated as the calculated statistic of K-S test,
namely Dcal = max | OFi-EFi |
Where OFi is the observed cumulative probability for the ith value of the random variable,
X, EFi is the expected cumulative probability for the ith value of the random variable X,
from the theoretical distribution.
For a given significance level (α) and the sample size (n).
The Kolmogorov-Smirnov test is a one-tailed test. Hence, if the calculated value of D is
more than the theoretical value of D for a given significance level, the reject the null
hypothesis; otherwise, accept the hypothesis
Mean or r, µ =
Variance of, r σ2 =
The formula for the standard normal statistic to test the significance of r is as shown below:
Z=
It is a two tailed test. Hence, the values of table –Zα/2 and Zα/2 define the left critical value
and right critical value of r. If the calculated Z value is in between –Zα/2 and Zα/2, then
accept the null hypothesis, otherwise reject the null hypothesis.
If the calculated value of P is more than the given significance level of α, accept the
null hypothesis; otherwise, reject the null hypothesis.
Compute chi-square statistic using the following formula
If the calculated chi-square statistic is less than the table chi-square value at 1 degree
of freedom and the given significance level of α , accept the null hypothesis; other,
reject the null hypothesis.
Where µU is the mean, σ2 U is the variance. The formulas for mean (µU) and variance (σ2
U) of U are:
µU=
σU2 =
ZU =
H=
Z=
For a given level of significance α, the absolute sample Z should be greater than the absolute
Z α/2 to reject the null hypothesis. For a one sided upper tail test, the null hypothesis is
rejected if sample Z is less than -Zα and for a one-sided lower tail test, the null hypothesis is
rejected if sample Z is less than -Zα.
Summary
There are various situations where the populations under study are not normally distributed.
The data collected from these populations is extremely skewed. In such a situation, the
option is to use a Non-parametric test. Non-parametric statistical tests are hemmed in by
fewer and less stringent assumptions than parametric tests. There are several advantages of a
non-parametric test. However, the methods are not without their drawbacks: The non-
parametric tests such as chi-square, run tests, sign test, the Mann-Whitney U test, and the
Wicoxon matched pair rank test and the Kruskal-Wallis test
One sample sign test is applicable where a sample is drawn from a population having a
continuous symmetrical distribution and known to be non-normal.
Kolmogorov-Smirnov test (K-S) test is similar to the chi-square test to perform goodness of
fit of a specific set of data to an assumed distribution.
A run is defined as a sequence which consists of repeated occurrence of a particular symbol
and the immediate predicting and succeeding symbols are different from the symbol in it or
no other symbol on either side.
The different sign tests for two samples are applied if the two samples have the same size.
Mann-Whitney U test is an alternate to the two sample t-test. This test is based on the ranks
of the observations f the two samples put together. This test is more powerful, when
compared to the Sign tests. The alternate name for this test is Rank-Sum test.
The Kruskal-Wallis test is, in fact, a non-parametric counterpart the one way ANOVAA.
The test is an extension of the Mann Whitney U test. Both methods require that the scale of
the measurement of a simple value should be at least ordinal.