U09d1 Nonparametric Analyses

u09d1 Nonparametric Analyses
Choose one of the following questions:

1. What factors should be considered when deciding whether to use a parametric or
nonparametric statistical analysis? Identify a research question from your professional
life, research interests, or everyday experience that could be addressed by the chosen
analysis.
2. What are odds? Why do we choose odds over probability in a logistic regression
analysis? How are log odds or logit scores (Li) related to probabilities (pi)? Identify a
research question from your professional life, research interests, or everyday experience
that could be addressed by the chosen analysis.
What factors should be considered when deciding whether to use a parametric or nonparametric
statistical analysis? Identify a research question from your professional life, research interests, or
everyday experience that could be addressed by the chosen analysis.
Unlike parametric statistical tests, non-parametric tests do not rely on precise distributional
assumptions, which is why they are often referred to as distribution free tests. Therefore, one
of the main considerations in choosing between the two classes of tests is whether the sample
distribution violates the assumption of normality (Warner, 2008). For sample sizes less than
1000, the Shapiro-Wilk test for normality (> 1000, use Kolmogorov-Smirnov test), and a visual
inspection of the histograms can be used to determine whether the distribution is normally
distributed. If it is clearly not normally distributed then a corresponding non-parametric tests is
appropriate. Its important to note that probabilities using non-parametric tests are exact
probabilities regardless of the distribution shape.
Moreover, it may be extremely difficult with small sample sizes of N < 10 to determine if the
distribution violates the assumption of normality. In that case, it may also be advantageous to
use a non-parametric test. One advantage in using non-parametric tests is the flexibility it
affords the researcher to measure samples from several different populations. However, even if a
frequency distribution appears highly skewed, most researchers seem to agree that in cases of
large samples sizes choosing a parametric test procedure over a non-parametric test is still
appropriate because of their robust nature and the reliability of the central limit theorem (Warner,
2008).
Another consideration is if a research study is particularly interested in the median score and not
the sample mean, it may be appropriate to perform a non-parametric statistical procedure. For
example, the median score is commonly used in describing diverse household incomes for
specific real estate markets. So if the median income in an area is $33,000, then half the
households have incomes less than $33,000 and half have incomes greater than $33,000.
Another consideration for non-parametric statistical testing is in the use of ordinal or ranking
data for the outcome variable. Typically interval/ratio scale data is required for parametric test
procedures. Ordinal data, as in the case of high school class ranking, can be used in nonparametric tests even with the presence of extreme outliers since the tests of central tendency are
relatively unaffected by outliers. Conversely, parametric tests become less powerful with the
presence of extreme outliers by potentially inflating the variance, and skewing mean scores.
However, by comparison parametric tests typically exhibit more power to avoid a Type I error
than non-parametric tests which tend to be less straight forward and precise in their
determinations to reject the null hypothesis.
A simple example of the use of non-parametric testing involves one of the most commonly used
distribution free tests called the Mann-Whitney test for two independent samples. This test is
similar to the parametric t test only with a broader null hypothesis of the two samples being
randomly selected from the same populations, but not populations with the same mean scores as
in parametric testing. Suppose there are two treatment groups, one with a Body Mass Index
(BMI) of < 30 (N1 = 5), and the other a BMI > 30 (N2 = 7).
The research question is: How does each group handle stressful life events? Because of the
multiplicity of stress factors that could be experienced, it is assumed that the outcome variable Y
of stress scores is not normally distributed. Ranking data is assigned to stress scores per group
and the sum of the ranks would determine which group handled stress better. The determination
to reject the null hypothesis would take into consideration the critical values tables for the MannWhitney test for two independent samples.
Anthony Rhodes
General Psychology PhD
References
Warner, R.M. (2008). Applied statistics: From Bivariate Through Multivariate Techniques.
Thousand Oaks, CA: Sage Publications. ISBN: 9780761927723.

U09d1 Nonparametric Analyses

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

U09d1 Nonparametric Analyses

Uploaded by

Copyright:

Available Formats

u09d1 Nonparametric Analyses

Choose one of the following questions:

You might also like