Professional Documents
Culture Documents
- Abdi-Khalil Edriss -
NON-PARAMETRIC STATISTICS
Nowadays, there are many literatures on non-parametric procedures that are different
from parametric procedures, but serving a great deal in research. Non-Parametric
statistical procedures do not depend on certain statistical distributions or assumptions
such as normality, constant variance, etc. Non-parametric procedures are often
known as distribution-free procedures. Since populations do not always meet the
assumptions underlying parametric tests, and therefore the focus of this chapter is on
most frequently used non-parametric statistical procedures.
1
According to Daniel Wayne, the first use of what we would now call a non-parametric statistical
procedure seems to have been reported in 1710 by John Arbuthnot. Uses of such procedures were
conspicuously sparse until the 1940s. The word non-parametric appeared for the first time in 1942 in a
paper by Wolfowitz.
~ 193 ~
8.2. What are some of the advantages of non-parametric
statistics?
When the data have been measured on a scale weaker than that required
for parametric procedures that would otherwise be employed. For
example, the data may consist of count data or rank data, thereby
precluding the use of some otherwise important parametric procedures.
This is perhaps the most widely used non-parametric technique for testing
the null hypothesis that several samples have been drawn from the same or
identical populations. Thus, Kruskal-Wallis-One-Way-Analysis of
Variance Test is performed as follows.
Assumptions
The data for analysis consist of k random samples of sizes n1, n2, n3, … nk.
The observations are independent both within and among samples.
The variable of interest is continuous.
The measurement scale is at least ordinal.
The populations are identical except for a possible difference in location
for at least one population.
Hypothesis
Null hypothesis, Ho: the k population distribution functions have identical
medians.
Alternative hypothesis, Ha: the k populations do not all have the same
median.
Decision Rule
When we are considering three samples, and each sample has 5 or fewer
observations, we compare the computed value of Kruskal-Wallis (KW) Test
Statistic, denoted by H, for significance with the tabulated Kruskal-Wallis
critical values.
~ 195 ~
However, when the number of samples is larger than 3 and the number of
observations per sample are larger than 5, we cannot use KW table. We rather
compare the computed values of H for significance with Chi-square, k2 1 ,
where k is the number of samples in the study. Meaning, KW test shows that
for large sample size, nj, and k, H is distributed approximately at Chi-square
with k-1 degree of freedom.
For the number of samples and sample sizes that can be accommodated by
KW table, we reject Ho if the computed value of H exceeds the critical value
listed in KW table for the pre-selected value of significant level of α.
Test Statistic
Replace each original observation by its rank relative to all the observations in
the k samples.
In case of ties we assign the tied observations the average of the ranks that
would be assigned if there were no ties.
If the null hypothesis is true, we expect the k sums of ranks (that is, the sums
of the ranks in each sample) to be about equal when adjusted for unequal
sample sizes.
12 Ri2
H 3( N 1)
N ( N 1) ni
where Ri is the sum of the ranks assigned to the observations in the ith
treatment under Ho.
NUMERICAL EXAMPLE
In a Rural Credit program, the amount of loan paid back or re-payment in Kwacha to
the lending institutions in the three regions of the country for last year are recorded as
follows in Table 8.1.
~ 196 ~
Table 8.1: Repayments and ranks of borrowers in the three regions
Hypothesis:
Ho: in the three regions of the country the median repayment amounts are the
same
Ha: the three regions do not have the same median repayment amount
Test Statistic
The ranks replacing the original observations will be with number of
observations N=10+6+6=22 and sample size of k=3
Decision Rule
The sample sizes all exceed 5, and therefore we must use chi-Square statistic
to determine whether the sample medians (roughly speaking the average
repayment amount) are significantly different from one region to another.
~ 197 ~
From the KW table, the critical value for 2 k-1, = 2 3-1,0.01 = 2 2, 0.01 = 9.210,
and since H = 9.232 exceeds 2 2, 0.01 = 9.210, we reject the null hypothesis
(Ho) at the given significance level of 0.01. We therefore conclude that the
repayments (medians) of the three regions represented are not equal. Since the
loan repayments (average amount) for each region is different, the lending
institution may institute some kind of repayment policy based on regional
income or other regional differences.
Central 6 90.00
Northern 10 69.00
Southern 6 94.00
Since KW p-value = 0.0099 < 0.05, we reject the null hypothesis of equality of
medians repayment amount attributable to the three regions, and conclude that
the median repayment is significantly different by region.
Additional example using grain yield dry weight (t/h) attributable to four treatments,
the treatments are: 1, 2, 3B and 4. (Data – Courtesy of Clinton Foundation and AGRA)
. kwallis graYdryw, by ( treatment)
1 75 12030.50
2 76 12275.50
3B 77 12300.00
4 76 9754.00
~ 198 ~
Furthermore, results attributable to maize variety are shown as follows.
. kwallis graYdryw, by (cropvariety)
The result (ρ=0.0026 < 0.05) implies that the medians grain yield dry weight
attributable to maize variety (DK9089 and SC719) are significantly different from
each other. Thus, the varieties make difference in maize yield.
~ 199 ~
8.9. What are the steps to use non-parametric MC test?
We first obtain the mean of the ranks of the ith sample, and Ŕj be the mean
of the ranks of the jth sample. We next select an experimental-wise error
rate of , which we think it is an overall effect of significance. Our choice
of is determined in part by k, the number of samples involved, and is
larger for larger k. Note that if we make multiple comparisons with an
experimental-wise error rate, we usually select a value of larger than
those customarily encountered in single comparison inferences. For
example, we may choose 0.15, 0.20 or perhaps 0.25 depending on the size
of k.
The next step is to find the value of Z in Z-score table that has /k(k-1)
area to its right, and finally we form the inequality
N ( N 1) 1 1
Ri Rj Z (1 ( / k ( k 1)))
12 ni nj
POINTS TO PONDER
This procedure allows us to forget about the direction of the differences between
mean ranks when performing the calculations. The direction of the differences should,
of course, be taken into account in the interpretation of the results.
NUMERICAL EXAMPLE
Refer to the previous data, for which we have computed H = 9.23 and suggested to
reject the null hypothesis (the median of repayment is the same for all the three
regions of the country) at 0.01 level. As a result, we concluded that the median
repayments of rural credit are not the same for all the three regions of the country.
~ 200 ~
Now, we would like to make all possible comparisons in order to locate just where the
difference occurred. Let us choose an error rate of 0.15, where k=3 samples involved,
and there will be 3(2)/2 = 3 comparisons to make. Hence, /k(k-1) = 0.15/3(3-1) =
0.025. And referring to a table of Z-score, we find Z0.025 = 1.96.
We then calculate the means of the ranks for the three samples,
R1 69 / 10 6.9, R2 90 / 6 15 and R3 94 / 6 15.67 . We can now make
comparisons between the groups or regions.
EXAMPLE 1
N ( N 1) 1 1
R1 R2 Z (1 ( / k ( k 1)))
12 n1 n2
22(22 1) 1 1
gives the value 1.96 6.57
12 10 6
EXAMPLE 2
N ( N 1) 1 1
R1 R3 Z (1 ( / k ( k 1)))
12 n1 n3
22(22 1) 1 1
gives the value 1.96 6.57 .
12 10 6
~ 201 ~
Decision Rule: Since R1 R3 6.9 15.67 8.77 6.57 , this comparison is
again significant. We therefore conclude that borrowers who received rural credit
in Northern region tend to have lower repayment rate than their counterparts in the
Southern region of the country.
EXAMPLE 3
N ( N 1) 1 1
R2 R3 Z (1 ( / k ( k 1)))
12 n2 n3
22(22 1) 1 1
gives the value 1.96 7.35
12 6 6
~ 202 ~
. oneway repayment region, scheffe tabulate
Summary of Repayment
Region Mean Std. Dev. Freq.
Analysis of Variance
Source SS df MS F Prob > F
Northern -1343.33
0.350
Conclusions
[1] Northern region tend to have lower repayment rate than their counterparts in the
Central region but with no statistically difference (ρ=0.35>0.05) between the two
regions.
[2] Southern region tend to have higher repayment rate than their counterparts in the
northern region with statistically significant difference (ρ=0.004 < 0.05); and
[3] We also conclude that borrowers of rural credit from southern region tend to have
higher repayment rate than their counterparts in the central region of the country, but
with no statistically significant difference ((ρ=0.13 > 0.05).
Lastly, note that the overall result indicates that there is statistically significant
difference (ρ=0.004 < 0.05) between the mean repayments from the regions.
~ 203 ~
interested in comparing each treatment with the control condition without
regard to whether the overall test for a treatment effect is significant.
NUMERICAL EXAMPLE
A fertilizer manufacturer conducted an experiment to compare the effect of four types
of fertilizers on the yield of a certain grain. Homogeneous equal size experimental
plots of soil were made available for the experiment. There were randomly assigned
to receive one of the four fertilizers and plots receiving no fertilizer served as
controls. The yields (‘000 kg) for each plot are given in Table 10.3
Now, we wish to know which fertilizers are superior to no fertilizer. Note that plot 1
is the control plot, where no fertilizer had been applied.
Solution
Step 1
To find out the answers, we convert the above data into ranks, rank total and mean
ranks and tabulate them as follows (Table 10.4).
O 1 58 29 37 40 44 37 49 49 38
10.5 1 2.5 5 6 2.5 8.5 8.5 4 48.5 5.39
A 2 68 67 69 58 62 48 62 76 66
16 15 17 10.5 12.5 12.5 12.5 19 14 123.5 13.72
~ 204 ~
B 3 96 90 90 92 99 86 79 96 75
28.5 23 23 26 30 21 20 28.5 18 218 24.22
Step 2
Looking at the ranks in Table 10.4, the data contain several ties, and thus we have to
adjust for the ties. Since the samples are all the same size, we use the expression
k N ( N 2 1) ( t3 t)
Ri Rj Z (1 ( / k ( k 1)))
6 N ( N 1)
Step 3
Since there are five treatments (four fertilizers and no fertilizer), we have four
comparisons to make. To find the appropriate Z-value for = 0.2, we compute
0.2/2(4) = 0.25, and from Z-score table we have obtained, Z = 1.96. From the absolute
value of the mean ranks, we obtain Ro 5.39 , the mean rank of the yields of the plots
to which no fertilizer was applied. The R j ' s , the mean ranks of the plots receiving
fertilizers, j = A, B, C, and D are R A 13.72 , R B 24.22 , RC 31.78 and
RD 39.89 .
And, now the right-hand side of the above mathematical expression yields -
5 45(452 1) (127 1)
1.96 12.13
6(45)(45 1)
~ 205 ~
Hence, the comparison of yields of plots receiving fertilizer to yields of plots
receiving no fertilizer is shown next.
From Table 8.6, we can see that since 8.33 < 12.127 (comparing the left-hand to the
right-hand of the mathematical expression), we cannot conclude that fertilizer A is
better than no fertilizer. However, since 18.83, 26.39 and 34.50 are greater than 12.13,
it is possible to conclude that fertilizers B, C and D all resulted in higher yields than if
no fertilizer at all were used.
~ 206 ~
. oneway yield fertilizertype
Analysis of Variance
Source SS df MS F Prob > F
A 64 7.8898669 9
B 89.222222 8.0121436 9
C 100.33333 8.0777472 9
D 111.88889 6.5085414 9
O 41.5 8.8317609 8
Analysis of Variance
Source SS df MS F Prob > F
B 25.2222
0.000
C 36.3333 11.1111
0.000 0.082
Conclusions:
1. Since ρ=0.0000 < 0.05 (F=111.14), we can reject the null hypothesis of equal
means (mean yields by fertilizer type). Or, there is different means yield by
fertilizer type.
2. Bartlett’s test, ρ=0.952 > 0.05 suggests acceptance of the null hypothesis of
equal variances or no significant variation between the yield. It is good news
for the validity of ANOVA.
3. Scheffe’s test (Multiple Comparisons test) yield by fertilizer type indicates the
differences between each pair of yield means. Thus, there is significant
differences between the control group (O), and fertilizer types A, B, C, and D.
Except yield differences between C and B (ρ=0.082 > 0.05), D and C
(ρ=0.064 > 0.05), any other pairs indicate significant difference in the mean
yields by fertilizer type (ρ=0.000 < 0.05).
~ 207 ~
IV - METHOD 4: Spearman Rank Correlation
If ties occur between the X’s or among the Y’s, each tied value is assigned
the mean of the rank positions for which it is tied.
~ 208 ~
Hypothesis
A: Two sided
o Ho: Observations on X and Y are independent.
o Ha: Observations on X and Y are either directly or inversely related.
B: One-sided
o Ho: Observations on X and Y are independent.
o Ha: There is a direct relationship between X and Y.
A: One-sided
o Ho: Observations X and Y are independent.
o Ha: There is an inverse relationship between X and Y.
Test Statistic
n
6 d i2 n
i 1 2
The test statistic is rs 1 2
, where d i2 R( x i ) R( y i ) , where
n(n 1) i 1
NUMERICAL EXAMPLE
Suppose we have the following eight (n=8) pairs of observations of X and Y given as:
(xi, yi): (0, 10), (9, 3), (1, 9), (5, 6), (7, 11), 6, 12), (2, 4) and (3, 5) and ranks are –
6(90)
di2 = (-5)2 + 72 + (-3)2 + 12 + 02 + (-2)2 + 12 + 12 = 90 and rs 1 0.07 .
8(64 1)
Decision Rule
Now based on the three types of the hypotheses, we make the following decisions.
For the same data set, STATA output for spearman correlation is as follows.
. spearman x y, stats(rho obs p) star(0.05)
Number of obs = 8
Spearman's rho = -0.0714
Since ρ=0.8655 > 0.05, we fail to reject the null hypothesis that x and y are independent.
Thus, it can be concluded that x and y are independent, and there is negative weak
correlation, ρ = -0.07 14or -7.14%, between x and y.
POINTS TO PONDER
Watch out the decision rule. It depends on whether we use one-sided or two-sided hypothesis
testing with the pre-determined level of significance.
V - METHOD 5: Phi-Coefficient
~ 210 ~
a c
Let the 2 by 2 table is given as , then.
b d
ad bc
(a c)(b d )(c d )(a b)
This works if the observations are grouped in two groups or categories as mentioned
before. Otherwise, the Chi-Square test, 2, can be applied if the observations are
grouped in a contingency table with r rows and c columns, that is, r by c contingency
table.
POINTS TO PONDER
The Phi-coefficient is related to Pearson’s chi-square statistic, 2. The relationship is
expressed by 2 = 2/n. To determine whether a computed value of is significant,
convert this value to 2 = n 2 and compare the resulting 2 with the tabulated chi-
square values with 1 degree of freedom. Note that the Phi-coefficient lies in the range
of –1 to 1 inclusive.
NUMERICAL EXAMPLE
Suppose that a sample of 125 employees is classified by sex and job satisfaction as
follows:
Computing
(15)(25) (35)(50)
0.3595
(50)(75)(60)(65)
We now have a measure of strength of the association between gender and job
satisfaction of a sample of 125 workers.
~ 211 ~
Decision rule
To test for significance, computed 2 = 125 (-0.3595)2 = 6.16 > 21, 0.05 = 3.841 (from
chi-square table); Since the computed value is greater than the tabulated value, we
reject the null hypothesis of ‘no association between sex and job satisfaction.’
Conclusion: Since the computed p-value = 0.0001 < 0.05, we reject the null
hypotheses that sex and job satisfaction are independent. We, therefore, conclude that
there is significant correlation or association between sex and job satisfaction.
Points to Ponder
Tetrachoric correlations assume a latent bivariate normal distribution (X 1, X2) for each pair
of variables (v1, v2), with a threshold model for the manifest variables (v i = 1 if and only if Xi
> 0). The means and variances of the latent variables are not identified, but the correlation,
r, of X1 and X2 can be estimated from the joint distribution of v1 and v2 and is called the
tetrachoric correlation coefficient.
2
2 (Oi Ei ) 2
C and r 1 ,
n(t 1) Ei
~ 212 ~
where 2 is Chi-square statistic, n is the total sample size, t is either the
number of rows or the number of columns in the contingency table, which
ever is smaller, Oi is observed value and Ei is expected value of the ith row
observation.
Note that the expected value can easily be computed using Eij = [Oi.O.j]/n, for
ith row and jth column observations or entries.
Decision Rule
2
If Cramer statistic if C > (r-1)(c-1), then reject the null hypothesis.
NUMERICAL EXAMPLE
The following data have been collected on question ‘How satisfied are you with the
college you attended?’
Note that the total sample size, n = 230, and the number in the parentheses are the
expected values calculated. For example,
Now, using chi-square equation 22 53.178 and since number of rows is less
than the number of columns (that is, r=3 < c=4), we have t-1 = 3-1 = 2. Thus, Cramer
2
53.178 2 2 2
Coefficient C 0.34 , and (r-1)(c-1) = (3-1)(4-1) = (2)(3) =
n(t 1) 230(2)
2
= 14.45 . Hence, since 0.34 < 14.45, we do not reject the null hypothesis. We
6, 0.025
therefore conclude that there is no association between which college the students
attended and the level of satisfaction.
~ 213 ~
Also, STATA results (F=0.27 and p-value=0.7703 > 0.05) show that there no
significant difference by level of satisfaction among the students.
. oneway noofstudents levelofsatisfaction, scheffe
Analysis of Variance
Source SS df MS F Prob > F
Total 200 8 25
Unsatisf -1.66667
0.935
Comparisons between the levels of satisfaction themselves are not significant (Scheffe
results), as well.
~ 214 ~
8.18. How do we calculate Point bi-serial correlation coefficient
measurement?
We use the symbol rpb to designate the point bi-serial correlation coefficient
computed from the sample data. The simplest formula for this is given as
n1 no a1 a o
r pb
n (a i a)2
POINTS TO PONDER
Like any other correlation coefficients, the value of rbp ranges between –1 and 1
inclusive.
NUMERICAL EXAMPLE
In a study of the association between income and education, the data in Table 8.10
were obtained on a sample of 25-year old individuals who either completed or did not
complete college.
1 24000 1 16 41000 1
2 15000 1 17 11500 0
3 35000 0 18 13500 0
4 31000 1 19 32000 0
5 20000 0 20 12000 1
6 50000 1 21 25000 1
~ 215 ~
7 13000 0 22 33000 1
8 10100 1 23 21000 0
9 27000 1 24 32000 0
10 90000 1 25 29000 1
11 13000 0 26 19000 0
12 10000 1 27 21000 0
13 21400 0 28 14000 1
14 16000 0 29 16000 1
15 10000 0 30 30000 1
For this data set, we obtain number of Yes’s, n1 = 16 and number of No’s, n0 = 14
and average of the Yes-income, ā1 = 28568.75 and average of the No-income, ā0 =
19885.75and computing
Decision Rule
Hence, we can conclude that there is strong association between income and
completing college for this particular study since the correlations, rpb= 75.53%.
~ 216 ~
========================================================
MENTAL GYMNASTICS
CHAPTER EIGHT
=======================================================
4. Why does non-parametric statistical procedure allow sample size less than 30?
5. What is the difference between Pearson Correlation coefficient and Spearman rank
correlation coefficient?
8. Calculate the correlation for the following data, and draw conclusion accordingly.
Sex (male=0,
female = 10 income('000)Mk
0 55
0 43
1 38
0 50
1 49
0 42
1 60
1 70
1 65
0 55
~ 217 ~
9. Use the data given in the table. Which fertilizers are superior to no fertilizer?
Fertilizer type, plot number and yield of grains
10. Why do you think populations do not always meet the assumptions underlying
parametric tests? Do some research and build your case.
~ 218 ~