0% found this document useful (0 votes)

177 views4 pages

Normality and Variance Testing in MINITAB

The document discusses testing assumptions for parametric hypothesis tests, specifically normality and equal variances. It explains that data needs to be tested for normal distribution before comparing means, using tests like Anderson-Darling or Shapiro-Wilk. An example shows archaeology data that is non-normal based on a p-value of 0.000 from an Anderson-Darling test. It also discusses using an F-test to check if two samples have equal variances before choosing a t-test that assumes equal or unequal variances. MINITAB can perform normality tests but not F-tests, while EXCEL can do both.

Uploaded by

Iginla Abibat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

177 views4 pages

Normality and Variance Testing in MINITAB

Uploaded by

Iginla Abibat

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

Hypothesis Testing: Checking Assumptions 1

Testing Assumptions: Normality and Equal Variances

So far we have been dealing with parametric hypothesis tests, mainly the different versions of the
t-test. As such, our statistics have been based on comparing means in order to calculate some
measure of significance based on a stated null hypothesis and confidence level. But is it always
correct to compare means? No, of course not; parametric statistics, by definition, assume that the
data we want to test are normally distributed, hence the use of the mean as the measure of central
tendency. Sample data sets are often skewed to the right for various reasons, and if we cannot
normalize the data we should not compare means (more on normalizing data sets later). In other
words, in order to be consistent we need to formally test our assumptions of normality. Luckily
this is very easy in MINITAB.

For example, say we have a sample of fifty (n = 50) excavated units and we are interested in the
artifact density per unit. Before we think about comparing the data set to another sample (for
example) we need to see if the data is normal. To do this we run our descriptive statistics as
usual and produce some graphics:

Descriptive Statistics

Variable N Mean Median Tr Mean StDev SE Mean

C1 50 6.702 5.679 6.099 5.825 0.824

Variable Min Max Q1 Q3

C1 0.039 25.681 2.374 9.886

Histogram of C1, with Normal Curve

10
Frequency

0 10 20 30

In this case we see that the data set is skewed to the right, and looks more like an exponential
distribution than a normal distribution. To test formally for normality we use either an
Anderson-Darling or a Shapiro-Wilk test. The way these tests work is by generating a normal
Hypothesis Testing: Checking Assumptions 2

probability plot (sometimes called a rankit plot) based on what a normally distributed data set of
a given sample size should look like. They then test the correlation between the predicted
normal data with the actual data. This correlation coefficient has some critical value based on
the degrees of freedom (or sample size) of the data set so that we can compare our coefficient to
the critical value as in all the other tests. However, MINITAB gives us a p value with both tests,
and so we can automatically compare this value to our stated alpha level without having to
bother looking up values in a table.

Here is the Anderson-Darling output for our data set:

Normal Probability Plot

.999
.99
.95
Probability

.80
.50
.20
.05
.01
.001

0 10 20
C1
Average: 6.70196 Anderson-Darling Normality Test
StDev: 5.82523 A-Squared: 1.676
N: 50 P-Value: 0.000

We are primarily concerned with the p value in the bottom right corner of the graph, which in
our case is p = 0.000. The null hypothesis (as usual) states that there is no difference between
our data and the generated normal data, so that we would reject the null hypothesis as the p value
is less than any stated alpha level we might want to choose; the data is highly non-normal and we
should not use parametric statistics on the raw data of excavated units. The straight line on the
graph is the null hypothesis of normality, so that we want our data to be as close to that line as
possible in order to assume normality. The p value tells us whether our data are significantly
different from this line or not. The Shapiro-Wilk test produces the same graph using a slightly
different test statistic, but is equally as valid.

In MINITAB there are two ways of conducting a normality test. The normal probability plot is
generated by the following procedure:

>STATS
Hypothesis Testing: Checking Assumptions 3

>BASIC STATISTICS
>NORMALITY TEST
>Put your data column in the VARIABLE BOX (leave the reference
box empty)
>Choose ANDERSON-DARLING or RYAN-JOINER (same as
Shapiro-Wilk)
>OK

The other way is to choose the GRAPHICAL SUMMARY output option under the GRAPHICS
for the DESCRIPTIVE STATISTICS: This output includes an Anderson-Darling test for
normality at the top on the left.

Descriptive Statistics
Variable: C1

Anderson-Darling Normality Test

A-Squared: 1.676
P-Value: 0.000

Mean 6.70196
StDev 5.82523
Variance 33.9333
Skewness 1.27891
Kurtosis 1.40201
0 4 8 12 16 20 24 N 50

Minimum 0.0386
1st Quartile 2.3741
Median 5.6790
3rd Quartile 9.8855
Maximum 25.6808
95% Confidence Interval for Mu
95% Confidence Interval for Mu
5.0464 8.3575
3 4 5 6 7 8 95% Confidence Interval for Sigma
4.8660 7.2590
95% Confidence Interval for Median
95% Confidence Interval for Median
3.4860 6.8595
Hypothesis Testing: Checking Assumptions 4

Equal Variances: The F-test

The different options of the t-test revolve around the assumption of equal variances or unequal
variances. We have learned that we can usually eye-ball the data and make our assumption, but
there is a formal way of going about testing for equal variances; the F-test. The F-test is not only
used for t-tests, but for any occasion when you are interested comparing the variation in two data
sets. As usual, the test calculates an FSTAT that is compared to a FCRIT in a statistical table, which
can then be turned into a p value. The F-test is very easy.

FSTAT = larger sample variance

smaller sample variance

Of course, what is going on here is that if the sample variances are equal, the ratio of their
differences should be around 1. The test calculates whether the sample variances are close
enough to 1, given their respective degrees of freedom.

For example, say we had two samples: n1 = 25, s1 = 13.2, and n2 = 36, s2 = 15.3. Remember the
ratio is the variance not the standard deviation, so

s 22 15.3 2 234.09
FSTAT = = = = 1.34
s12 13.2 2 174.24

The degrees of freedom are v2 = 36 – 1 = 35 and v1 = 25 – 1 = 24 for the larger and smaller
variances respectively. In an F table we would look for the column v for the larger sample
variance (v2 = 35) along the top of the table, and the row relating to the smaller variance (v1 =
24). In our case, we are not given all the exact degrees of freedom so we assume our critical
value is less than the next highest value give, which would be FCRIT = 1.79. As our FSTAT < FCRIT
we can assume the sample variances are equal. Notice that we cannot calculate a p value from
the table.

MINITAB does not do F-tests, but EXCEL does.

The formula is =FTEST(array1, array2), so =FTEST(Xi:Xj, Yi:Yj), and EXCEL will return a p
value, which you can then compare to an alpha level of your choosing.

Assessing Normality in SPSS Data Analysis
No ratings yet
Assessing Normality in SPSS Data Analysis
5 pages
Stat 136 Chapter 10 Nonnormality and Heteroskedasticity
No ratings yet
Stat 136 Chapter 10 Nonnormality and Heteroskedasticity
49 pages
Excel Normality Tests Kolmogorov-Smirnov, Anderson-Darling, and Shapiro Wilk Tests For Two-Sample Pooled T-Test
No ratings yet
Excel Normality Tests Kolmogorov-Smirnov, Anderson-Darling, and Shapiro Wilk Tests For Two-Sample Pooled T-Test
13 pages
Real Statistics Examples Goodness of Fit
No ratings yet
Real Statistics Examples Goodness of Fit
102 pages
Real Statistics Examples Goodness of Fit
No ratings yet
Real Statistics Examples Goodness of Fit
102 pages
Test of Normality
No ratings yet
Test of Normality
7 pages
Checking The Normality of A Dataset
No ratings yet
Checking The Normality of A Dataset
6 pages
3 Assmuption-Testing PDF
No ratings yet
3 Assmuption-Testing PDF
17 pages
Community Project: Checking Normality For Parametric Tests in R
No ratings yet
Community Project: Checking Normality For Parametric Tests in R
4 pages
Normality Checking 11 Ps
No ratings yet
Normality Checking 11 Ps
4 pages
Assessing Normality in Data Analysis
No ratings yet
Assessing Normality in Data Analysis
32 pages
04 Assumptions
No ratings yet
04 Assumptions
53 pages
SPSS Data Analysis: Frequency & Charts
No ratings yet
SPSS Data Analysis: Frequency & Charts
58 pages
Normality Test Explained: Chi-Square Method
No ratings yet
Normality Test Explained: Chi-Square Method
8 pages
2 Normality PG OK
No ratings yet
2 Normality PG OK
24 pages
Normality Testing in Financial Models
No ratings yet
Normality Testing in Financial Models
20 pages
Inferential Statistics 1
No ratings yet
Inferential Statistics 1
17 pages
Spss Experiments
No ratings yet
Spss Experiments
45 pages
Normal Distribution Overview and Applications
100% (1)
Normal Distribution Overview and Applications
14 pages
Module 2
No ratings yet
Module 2
28 pages
Community Project: Checking Normality For Parametric Tests in SPSS
No ratings yet
Community Project: Checking Normality For Parametric Tests in SPSS
4 pages
Statistical Tests Overview Guide
No ratings yet
Statistical Tests Overview Guide
9 pages
Test For Normality SPSS
No ratings yet
Test For Normality SPSS
5 pages
Appendix I
No ratings yet
Appendix I
52 pages
Statistical Data Types and Tests Guide
No ratings yet
Statistical Data Types and Tests Guide
10 pages
Normality Assessment and Data Transformations
No ratings yet
Normality Assessment and Data Transformations
42 pages
Normality Test
No ratings yet
Normality Test
6 pages
Chapter 13 - Tests For The Assumption That A Variable Is Normally Distributed Final - Edited
No ratings yet
Chapter 13 - Tests For The Assumption That A Variable Is Normally Distributed Final - Edited
4 pages
Data Analysis Challenges in ANOVA
No ratings yet
Data Analysis Challenges in ANOVA
57 pages
Tests For Error Normality - STAT 501
No ratings yet
Tests For Error Normality - STAT 501
6 pages
Mathematics 06 00088 PDF
No ratings yet
Mathematics 06 00088 PDF
16 pages
Comparing Two Means and Variances
No ratings yet
Comparing Two Means and Variances
5 pages
Biostatistics Series Module 3 Comparing Groups .2
No ratings yet
Biostatistics Series Module 3 Comparing Groups .2
10 pages
Comparing Normality Tests Explained
No ratings yet
Comparing Normality Tests Explained
12 pages
Normality Testing and Data Transformation
No ratings yet
Normality Testing and Data Transformation
56 pages
t-Test for Hypothesis Testing Guide
No ratings yet
t-Test for Hypothesis Testing Guide
64 pages
Empirical Data Analysis in Finance
No ratings yet
Empirical Data Analysis in Finance
48 pages
Anderson-Darling Test for Distribution Fit
No ratings yet
Anderson-Darling Test for Distribution Fit
3 pages
Introduction To Data Science Exploratory Data Analysis
No ratings yet
Introduction To Data Science Exploratory Data Analysis
55 pages
A Test For Normality
No ratings yet
A Test For Normality
5 pages
Normality Tests: Kolmogorov-Smirnov & Shapiro-Wilk
No ratings yet
Normality Tests: Kolmogorov-Smirnov & Shapiro-Wilk
16 pages
Nonparametric Tests for Ordinal Data
No ratings yet
Nonparametric Tests for Ordinal Data
106 pages
Research Objectives and Data Assumptions
No ratings yet
Research Objectives and Data Assumptions
29 pages
Normality Testing in SPSS Guide
100% (1)
Normality Testing in SPSS Guide
12 pages
Statistical Normality & Homogeneity
No ratings yet
Statistical Normality & Homogeneity
4 pages
Understanding Statistical Measures and Tests
No ratings yet
Understanding Statistical Measures and Tests
29 pages
Anderson-Darling Normality Test Calculator
No ratings yet
Anderson-Darling Normality Test Calculator
6 pages
Non Parametric Methods
No ratings yet
Non Parametric Methods
19 pages
DWDM
No ratings yet
DWDM
18 pages
Understanding Hypothesis Testing Methods
No ratings yet
Understanding Hypothesis Testing Methods
10 pages
Statistical Assumptions and Tests Guide
No ratings yet
Statistical Assumptions and Tests Guide
22 pages
Hypothesis Test Roadmap
No ratings yet
Hypothesis Test Roadmap
1 page
Normal Distribution and Statistics Overview
No ratings yet
Normal Distribution and Statistics Overview
12 pages
Statistics in Research
No ratings yet
Statistics in Research
48 pages
T Test Assumptions
No ratings yet
T Test Assumptions
11 pages
Psychology Advanced RM Workbook New 2019
No ratings yet
Psychology Advanced RM Workbook New 2019
23 pages

Normality and Variance Testing in MINITAB

Uploaded by

Normality and Variance Testing in MINITAB

Uploaded by

Hypothesis Testing: Checking Assumptions 1

Testing Assumptions: Normality and Equal Variances

Variable N Mean Median Tr Mean StDev SE Mean

Variable Min Max Q1 Q3

Histogram of C1, with Normal Curve

Here is the Anderson-Darling output for our data set:

Normal Probability Plot

Anderson-Darling Normality Test

Equal Variances: The F-test

FSTAT = larger sample variance

MINITAB does not do F-tests, but EXCEL does.

You might also like