Professional Documents
Culture Documents
HARRY O. POSTEN
Table 1
Minimum value of c for which the t-test is totally robust sizes are somewhat unequal, as long as the smaller sample
at level c (nl = n2 = n) is taken from the population having the smaller variance.
nominal value u = (1(1) nominal value It = u(l) The original paper also contains results for ~O) = .01 with
similar results.
u= 0.05 u= (J.Ol u=(J.05 u=O.OI
n E f 11 E f
:~. Robustness Under Nonnormality
2 .0954 .0539 15 .0098 .0052
3 .0589 .0341 20 .0072 .0038 To precisely determine the degree of robustness of thE'
4 .0419 .0241 25 .0057 .003U two-sample t-test over a wide range of practical nOIl-
5 .0324 .0184 3U .0048 .0025 normal distributions is a difficult problem. An exact
6 .0263 .0148 50 .0028 .O(J15
7 .(J222
theoretical approach is impractical because of its mathe-
.0124 lUO .0014 .OU07
8 .0191 .0106 5UO .0003 .0001 matical intractability, an approximate approach would
!J .0168 .0093 1000 .0001 .0001 lack accuracy assurances, and a simulation approach re-
10 .0150 .0082 JO .0000 .0000 quires an exhorbitant amount of computer time to achieve
Table 2
Maximal regions of robustness of level f for the two tailed t-test
(nominal significance level uO) = 0'(l5)
f = 0.03
E = 0.02
*= sample size change nearest to 10 % change from equality but not greater
reduce dramatically as sample sizes vary significantly respectable preCISIOn over all extensive practical range
from equality. Thus, the t-test tends to lose its strong of distributiuns. A simulation approach, however, can be
degree of robustness rapidl.v as the sample sizes become macle practical by using a computer artifice to speed up
unequal. When each sample size varies by IO "" from a sample generation and by using low priority computer
condition of equal sample sizes, the t-test still has a time to reduce computer costs.
respectable amount of robustness with respect tu the Such a simulation study was provided by Posten (1978).
Type 1 error probability. However, when the sample The intent of thai stud~' was to accurall'ly quantify the
sizes reach a 20 "." difference from equality. Olll' might (kgree of robustness of the two-sample I-test for a range
wish to be more cautious with the usc of the t-test. To of sample sizes over a wiele range of practical distribu-
an important degree, the loss of robustness when sam pic tions. The Pearson family of distributions was chosen
sizes are unequal is in the range where l >
1, that is, because it appeared to have best withstood the test of
when the larger variance is associated with the smaller time, in terms of representing practical data. The range
~ample size. The level of robustness for the unequal of coverage was for both negative and positive skewness
sample size test can, therefore, be significantly improved over O:S:; PI :0;: 2.0 and 1.4::;; P2:O;: 7.8, where p, = p/la fl
if one knows beforehand which population has this and P2 = ft"la". This seems to be a wide range of coverage
smaller variance. In this case, the smaller sample size may for practical distributions if one judges by the range of
be assigned to the population with the smaller variance. reported values of p, and P~ in, for example, ScheHe (1959)
The range of A is the restricted to (0, 1] and table 2 can bo and Pearson and Please (1975). The decision on the fine-
used with the righthand entries all replaced by 1. The re- ness of the grid covering this region was conservatively
sult is that the t-test becomes somewhat robust with re- made and the final coverage was for Pl= 0 (0.4) 2.0 and
93