Professional Documents
Culture Documents
Hypothesis Testing
7-2
Using Statistics
• A hypothesis is a statement or assertion about the state of nature (about
the true value of an unknown population parameter):
• H0 and H1 are:
✓ Mutually exclusive
– Only one can be true.
✓ Exhaustive
– Together they cover all possibilities, so one or the other must be
true.
Example
Suppose flour packaged by a manufacturer is sold by weight; and a
particular size of package is supposed to average 40 ounces. Suppose
the manufacturer wants to test to determine whether their packaging
process is out of control as determined by the weight of the flour
packages.
7-6
The tails of a statistical test are determined by the need for an action. If action
is to be taken if a parameter is greater than some value a, then the alternative
hypothesis is that the parameter is greater than a, and the test is a right-tailed
test. H0: 50
H1: 50
✓H0: p 40%
✓H1: p < 40%
✓H0: s2 50
✓H1: s2 50
Example
A vendor claims that his company fills any accepted order, on the
average, in at most six working days. You suspect that the average is
greater than six working days and want to test the claim. How will you
set up the null and alternative hypotheses?
Example
A vendor claims that his company fills any accepted order, on the
average, in at most six working days. You suspect that the average is
greater than six working days and want to test the claim. How will you
set up the null and alternative hypotheses?
Example
A manufacturer of golf balls claims that the variance of the weights of
the company’s golf balls is controlled to within 0.0028 oz2. If you wish
to test this claim, how will you set up the null and alternative
hypotheses?
Example
A manufacturer of golf balls claims that the variance of the weights of
the company’s golf balls is controlled to within 0.0028 oz2. If you wish
to test this claim, how will you set up the null and alternative
hypotheses?
Example
At least 20% of the visitors to a particular commercial Web site where
an electronic product is sold are said to end up ordering the product. If
you wish to test this claim, how will you set up the null and alternative
hypotheses?
Example
Suppose a company has held an 18% share of the market. However,
because of an increased marketing effort, company officials believe the
company’s market share is now greater than 18%, and the officials
would like to prove it.
Because the company officials are only interested in “proving” that the
market share has increased and the inclusion of the “less than” sign in the
null hypothesis is confusing. Also, If the equal part of the null hypothesis is
rejected because the market share is seemingly greater, then certainly the
“less than” portion of the null hypothesis is also rejected because it is
further away from “greater than” than is “equal.” Using this logic, the null
hypothesis for the market share problem can be written as
Hypothesis Testing Process
When is the sample mean so far away from the population mean that
the null hypothesis is rejected?
• Confidence Interval
7-19
Rejection Region
Nonrejection Region
• The nonrejection region is the range of values (also
determined by the critical points) that will lead us not to reject
the null hypothesis if the test statistic should fall within this
region. The nonrejection region is designed so that, before the
sampling takes place, our test statistic will have a probability 1-
of falling within the nonrejection region if the null hypothesis
is true
Decision Making
Decision Making
• A decision may be correct in two ways:
✓Fail to reject a true H0
✓Reject a false H0
• A decision may be incorrect in two ways:
✓Type I Error: Reject a true H0
• The Probability of a Type I error is denoted by .
✓Type II Error: Fail to reject a false H0
• The Probability of a Type II error is denoted by .
7-24
The “state of nature” is how things actually are and the “action” is
the decision that the business researcher actually makes
Type - I Error
Suppose the flour-packaging process actually is “in control” and is
averaging 40 ounces of flour per package. Suppose also that a business
researcher randomly selects 100 packages, weighs the contents of
each, and computes a sample mean. It is possible, by chance, to
randomly select 100 of the more extreme packages (mostly heavy
weighted or mostly light weighted) resulting in a mean that falls in the
rejection region. The decision is to reject the null hypothesis even
though the population mean is actually 40 ounces. In this case, the
business researcher has committed a Type I error.
✓ if a manager fires an employee because some evidence indicates
that she is stealing from the company and if she really is not stealing
from the company,
✓ Suppose a worker on the assembly line of a large manufacturer
hears an unusual sound and decides to shut the line down. If the
sound turns out not to be related to the assembly line and no
problems are occurring with the assembly line
Type - II Error
Suppose in the case of the flour problem that the packaging process is
actually producing a population mean of 41 ounces even though the
null hypothesis is 40 ounces. A sample of 100 packages yields a sample
mean of 40.2 ounces, which falls in the nonrejection region. The
business decision maker decides not to reject the null hypothesis. The
packaging procedure is out of control and the hypothesis testing
process does not identify it.
✓ Suppose in the business world an employee is stealing from the
company. A manager sees some evidence that the stealing is
occurring but lacks enough evidence to conclude that the employee
is stealing from the company. The manager decides not to fire the
employee based on theft.
✓ Suppose the worker decides not enough noise is heard to shut the
line down, but in actuality, one of the cords on the line is unraveling,
creating a dangerous situation.
How are α and β related ?
1. alpha can only be committed when the null hypothesis is rejected and beta
can only be committed when the null hypothesis is not rejected, a business
researcher cannot commit both a Type I error and a Type II error at the
same time on the same hypothesis test
2. alpha and beta are inversely related. If alpha is reduced, then beta is
increased, and vice versa.
One way to reduce both errors is to increase the sample size. If a larger
sample is taken, it is more likely that the sample is representative of the
population, which translates into a better chance that a business
researcher will make the correct choice
7-29
Statistical Significance
While the null hypothesis is maintained to be true throughout a
hypothesis test, until sample data lead to a rejection, the aim of a
hypothesis test is often to disprove the null hypothesis in favor of the
alternative hypothesis. This is because we can determine and
regulate , the probability of a Type I error, making it as small as we
desire, such as 0.01 or 0.05. Thus, when we reject a null hypothesis,
we have a high level of confidence in our decision, since we know
there is a small probability that we have made an error.
Example
A survey of CPAs across the United States found that the average net
income for sole proprietor CPAs is $74,914.Because this survey is now
more than ten years old, an accounting researcher wants to test this
figure by taking a random sample of 112 sole proprietor accountants in
the United States which showed a sample mean of $78,695. Assume
the population standard deviation of net incomes for sole proprietor
CPAs is $14,530.
Solution
A survey of CPAs across the United States found that the average net
income for sole proprietor CPAs is $74,914.Because this survey is now
more than ten years old, an accounting researcher wants to test this
figure by taking a random sample of 112 sole proprietor accountants in
the United States which showed a sample mean of $78,695. Assume
the population standard deviation of net incomes for sole proprietor
CPAs is $14,530.
Step 1
Step 2
Solution
Step 3
Type I error rate, or alpha, which is .05 in this problem
Step 4
Because the test is two tailed and alpha is .05, there is 2 or .025 area in each
of the tails of the distribution. Thus, the rejection region is in the two ends
of the distribution with 2.5% of the area in each.
Solution
Step 5
Step 6
Because this test statistic, z = 2.75, is greater than the critical value of z in the
upper tail of the distribution, z = +1.96,
Step 7
Reject the null hypothesis
Step 8
Statistically, the researcher has enough evidence to reject the figure of
$74,914 as the true national average net income for sole proprietor CPAs.
7-34
95% confidence
Population interval around
mean under H0 observed sample mean
Note that the population mean may be 28 (the null hypothesis might be true), but
then the observed sample mean, 31.5, would be a very unlikely occurrence. There
is still the small chance ( = 0.05) that we might reject the true null hypothesis.
represents the level of significance of the test.
7-36
Nonrejection Region
If the observed sample mean falls within the nonrejection region, then you fail to
reject the null hypothesis as true. Construct a 95% nonrejection region around
the hypothesized population mean, and compare it with the 95% confidence
interval around the observed sample mean:
s 5 s 5
0 z.025 = 28 1.96 95% non- 95% Confidence x z .025 = 315
. 1.96
n 100 rejection region Interval n 100
around the around the
= 28.98 = 27,02 ,28.98 population Mean Sample Mean . .98 = 30.52 ,32.48
= 315
The nonrejection region and the confidence interval are the same width, but
centered on different points. In this instance, the nonrejection region does not
include the observed sample mean, and the confidence interval does not include
the hypothesized population mean.
7-37
Solution
A company that delivers packages within a large metropolitan
area claims that it takes an average of 28 minutes for a package to
be delivered from your door to the destination. Suppose you want
to carry out a hypothesis test of this claim at 95% confidence by
taking a sample of 100 packages, with an average delivery time of
31.5 minutes & standard deviation of 5 minutes.
Set the null and alternative hypotheses: s 5
x z = 315
. 196
.
H0: = 28
. 025
n 100
H1: 28
. .98 = 30.52, 32.48
= 315
Collect sample data:
n = 100 We can be 95% sure that the average
x = 31.5 time for all packages is between 30.52
s=5 and 32.48 minutes.
Since the asserted value, 28 minutes,
Construct a 95% confidence interval for is not in this 95% confidence interval,
the average delivery times of all we may reasonably reject the null
packages: hypothesis.
Example
A survey of CPAs across the United States found that the average net
income for sole proprietor CPAs is $74,914.Because this survey is now
more than ten years old, an accounting researcher wants to test this
figure by taking a random sample of 112 sole proprietor accountants in
the United States which showed a sample mean of $78,695. Assume
the population standard deviation of net incomes for sole proprietor
CPAs is $14,530.
Solution
A survey of CPAs across the United States found that the average net
income for sole proprietor CPAs is $74,914.Because this survey is now
more than ten years old, an accounting researcher wants to test this
figure by taking a random sample of 112 sole proprietor accountants in
the United States which showed a sample mean of $78,695. Assume
the population standard deviation of net incomes for sole proprietor
CPAs is $14,530.
7-40
The p-Value
The p-Value
7-42
Example
An automatic bottling machine fills cola into two liter (2000 cc) bottles. A
consumer advocate wants to test the null hypothesis that the average amount
filled by the machine into a bottle is at least 2000 cc. A random sample of 40
bottles coming out of the machine was selected and the exact content of the
selected bottles are recorded. The sample mean was 1999.6 cc. The population
standard deviation is known from past experience to be 1.30 cc. Test this
hypotheses at 95% confidence with the help of p-value.
7-43
Example
An automatic bottling machine fills cola into two liter (2000 cc) bottles. A
consumer advocate wants to test the null hypothesis that the average amount
filled by the machine into a bottle is at least 2000 cc. A random sample of 40
bottles coming out of the machine was selected and the exact content of the
selected bottles are recorded. The sample mean was 1999.6 cc. The population
standard deviation is known from past experience to be 1.30 cc. Test this
hypotheses at 95% confidence with the help of p-value.
n = 40
H0: 2000 x = 1999.6
H1: 2000
s = 1.3
n = 40
For = 0.05, the critical value
of z is -1.645 x−
z= 0 = 1999.6 - 2000
s 1.3
x − 0
The test statistic is: z = s n 40
n
Do not reject H0 if: [z -1.645] = − 1.95 Reject H
Reject H0 if: z −5] 0
7-44
Example
An automatic bottling machine fills cola into two liter (2000 cc) bottles. A
consumer advocate wants to test the null hypothesis that the average amount
filled by the machine into a bottle is at least 2000 cc. A random sample of 40
bottles coming out of the machine was selected and the exact content of the
selected bottles are recorded. The sample mean was 1999.6 cc. The population
standard deviation is known from past experience to be 1.30 cc. Test this
hypotheses at 95% confidence with the help of p-value.
z = x −s 0 = 1999.6 - 2000
H0: 2000 1.3
H1: 2000 n 40
n = 40, 0 = 2000, x-bar = 1999.6,
s = 1.3 = −1.95
p - value = P(Z -1.95)
x − 0 = 0.5000 - 0.4744
The test statistic is: z =
s = 0.0256
n
Example
In an attempt to determine why customer service is important to
managers in the United Kingdom, researchers surveyed managing
directors of manufacturing plants in Scotland. One of the reasons
proposed was that customer service is a means of retaining customers.
On a scale from 1 to 5, with 1 being low and 5 being high, the survey
respondents rated this reason more highly than any of the others, with
a mean response of 4.30. Suppose U.S. researchers believe American
manufacturing managers would not rate this reason as highly and
conduct a hypothesis test to prove their theory. Alpha is set at .05. Data
are gathered and the following results are obtained. Use these data to
determine whether U.S. managers rate this reason significantly lower
than the 4.30 mean ascertained in the United Kingdom. Assume from
previous studies that the population standard deviation is 0.574.
Solution
Step 1
Step 2
Step 3
Type I error rate, or alpha, which is .05 in this problem
Solution
Step 4
Because the test is one tailed test and alpha is .05, there is .05 area in the
left tail of the distribution.
Step 5,6
I)
II)
Solution
Step 7
1. Observed value method
Because the observed test statistic is not less than the critical value and is
not in the rejection region, the statistical conclusion is that the null
hypothesis cannot be rejected
2. Critical Value method
Because the mean obtained from the sample data is 4.156, the researchers
fail to reject the null hypothesis
3. p-Value method
The observed test statistic is z = -1.42. The probability of getting a z value at
least this extreme when the null hypothesis is true is .5000 - .4222 = .0778.
Step 8
The test does not result in enough evidence to conclude that U.S. managers
think it is less important to use customer service as a means of retaining
customers than do UK managers. Customer service is an important tool for
retaining customers in both countries according to managers.
7-49
The “state of nature” is how things actually are and the “action” is
the decision that the business researcher actually makes
Solving For Type II Errors
In business, failure to reject the null hypothesis may mean staying with
the status quo, not implementing a new process, or not making
adjustments. If a new process, product, theory, or adjustment is not
significantly better than what is currently accepted practice,
the decision maker makes a correct decision. However, if the new
process, product, theory, or adjustment would significantly improve
sales, the business climate, costs, or morale, the decision maker makes
an error in judgment (Type II)
• In business, Type II errors can translate to lost opportunities, poor
product quality (as a result of failure to discern a problem in the
process), or failure to react to the marketplace.
• The Type II error plays an important role in business statistical
decision making
• Determining the probability of committing a Type II error is more
complex than finding the probability of committing a Type I error.
Computing
If the null hypothesis is false, the researcher will fail to reject the null
hypotheses whenever is in the nonrejection region, > 11.979
ounces.
Computing
Solution
Example
Suppose a researcher is conducting a statistical test on the following
hypotheses
Power = (1 - )
7-62
Example
A coin is to tested for fairness. It is tossed 25 times and only 8 Heads are
observed. Test if the coin is fair at an of 5% (significance level).
7-66
Example
A coin is to tested for fairness. It is tossed 25 times and only 8 Heads are
observed. Test if the coin is fair at an of 5% (significance level).
=
2
s 2
0
Note: Since the chi-square table only provides the critical values, it cannot
be used to calculate exact p-values. As in the case of the t-tables, only a
range of possible values can be inferred.
7-68
Example
A manufacturer of golf balls claims that they control the weights of the golf balls
accurately so that the variance of the weights is not more than 1 mg2. A random sample
of 31 golf balls yields a sample variance of 1.62 mg2. Is that sufficient evidence to
reject the claim at an of 5%?
7-69
Solution
A manufacturer of golf balls claims that they control the weights of the golf balls
accurately so that the variance of the weights is not more than 1 mg2. A random sample
of 31 golf balls yields a sample variance of 1.62 mg2. Is that sufficient evidence to
reject the claim at an of 5%?
Example
Example
H1: 12
0.8
0.7 .95
0.6
x − 0
0.3
=
.025 .025
0.2
n 0
z
x − 0 14.6-12
.025 .025
0.2
z= = 0.1
s 7.8 0.0
z
-1.96 0 1.96
n 144
Lower Rejection Nonrejection Upper Rejection
2.6 Region
= =4 Region Region
0.65
Since the test statistic falls in the upper rejection region, H0 is rejected, and we may
conclude that the average amount of carry-on baggage is more than 12 pounds.
7-73
Examples
An insurance company believes that, over the last few years, the average liability
insurance per board seat in companies defined as “small companies” has been $2000.
Using = 0.01, test this hypothesis using Growth Resources, Inc. survey data.
7-74
Examples
An insurance company believes that, over the last few years, the average liability
insurance per board seat in companies defined as “small companies” has been $2000.
Using = 0.01, test this hypothesis using Growth Resources, Inc. survey data.
n = 100
H0: = 2000 x = 2700
H1: 2000 s = 947
Example
z
-2.576 0 2.576
Example
The average time it takes a computer to perform a certain task is believed to be 3.24
seconds. It was decided to test the statistical hypothesis that the average performance
time of the task using the new algorithm is the same, against the alternative that the
average performance time is no longer the same, at the 0.05 level of significance.
7-77
Examples
The average time it takes a computer to perform a certain task is believed to be 3.24
seconds. It was decided to test the statistical hypothesis that the average performance
time of the task using the new algorithm is the same, against the alternative that the
average performance time is no longer the same, at the 0.05 level of significance.
Example
-1.96 0 1.96 z
changed from 3.24 seconds.
2
Example
According to the Japanese National Land Agency, average land prices in central Tokyo
soared 49% in the first six months of 1995. An international real estate investment
company wants to test this claim against the alternative that the average price did not rise
by 49%, at a 0.01 level of significance.
7-80
Example
According to the Japanese National Land Agency, average land prices in central Tokyo
soared 49% in the first six months of 1995. An international real estate investment
company wants to test this claim against the alternative that the average price did not rise
by 49%, at a 0.01 level of significance.
H0: = 49 n = 18
H1: 49 x = 38
s = 14
n = 18
For = 0.01 and (18-1) = 17 df ,
x − 38 - 49
critical values of t are ±2.898 t = 0 =
s 14
x − 0
t= n 18
The test statistic is: s
n
- 11
= −3.33 Reject H
Do not reject H0 if: [-2.898 t 2.898]
=
3.3 0
Example
t
statistic is in the lower
-2.898 0 2.898
Example
Canon, Inc,. has introduced a copying machine that features two-color copying capability
in a compact system copier. The average speed of the standard compact system copier is
27 copies per minute. Suppose the company wants to test whether the new copier has the
same average speed as its standard compact copier. Conduct a test at an = 0.05 level of
significance.
7-83
Example
Canon, Inc,. has introduced a copying machine that features two-color copying capability
in a compact system copier. The average speed of the standard compact system copier is
27 copies per minute. Suppose the company wants to test whether the new copier has the
same average speed as its standard compact copier. Conduct a test at an = 0.05 level of
significance.
n = 24
H0: = 27 x = 24.6
H1: 27 s = 7.4
n = 24
For = 0.05 and (24-1) = 23 df , x − 0 24.6 - 27
t = =
critical values of t are ±2.069 s 7.4
x − 0 24
t= n
The test statistic is: s
n -2.4
= = −1.59 Do not reject H
Do not reject H0 if: [-2.069 t 2.069] 1.51 0
Example
The t Distribution
0.8
Since the test statistic falls in
0.7
0.6
.95 the nonrejection region, H0 is
0.5 not rejected, and we may not
0.4
0.3
.025 .025
conclude that the average
0.2
0.1 speed is different from 27
0.0
-2.069 0 2.069 t
copies per minute.
−5
Example
An investment analyst for Goldman Sachs and Company wanted to test the hypothesis
made by British securities experts that 70% of all foreign investors in the British market
were American. The analyst gathered a random sample of 210 accounts of foreign
investors in London and found that 130 were owned by U.S. citizens. At the = 0.05
level of significance, is there evidence to reject the claim of the British securities experts?
7-86
Example
An investment analyst for Goldman Sachs and Company wanted to test the hypothesis
made by British securities experts that 70% of all foreign investors in the British market
were American. The analyst gathered a random sample of 210 accounts of foreign
investors in London and found that 130 were owned by U.S. citizens. At the = 0.05
level of significance, is there evidence to reject the claim of the British securities experts?
n = 210
H0: p = 0.70 130
H1: p 0.70 p =
210
= 0.619
n = 210
For = 0.05 critical values of z are ±1.96 p - p
0 0.619 - 0.70
The test statistic is: z = p − p0 z=
p q
=
(0.70)(0.30)
p0 q 0 0 0
n 210
n
Do not reject H0 if: [-1.96 z 1.96] -0.081
= −2.5614 Reject H
Reject H0 if: [z < -1.96] or z 1.96]
=
0.0316 0
7-87
Example
The EPA sets limits on the concentrations of pollutants emitted by various industries. Suppose that the
upper allowable limit on the emission of vinyl chloride is set at an average of 55 ppm within a range of two
miles around the plant emitting this chemical. To check compliance with this rule, the EPA collects a
random sample of 100 readings at different times and dates within the two-mile range around the plant. The
findings are that the sample average concentration is 60 ppm and the sample standard deviation is 20 ppm.
Is there evidence to conclude that the plant in question is violating the law?
7-88
Example
The EPA sets limits on the concentrations of pollutants emitted by various industries. Suppose that the
upper allowable limit on the emission of vinyl chloride is set at an average of 55 ppm within a range of two
miles around the plant emitting this chemical. To check compliance with this rule, the EPA collects a
random sample of 100 readings at different times and dates within the two-mile range around the plant. The
findings are that the sample average concentration is 60 ppm and the sample standard deviation is 20 ppm.
Is there evidence to conclude that the plant in question is violating the law?
H0: 55 n = 100
x = 60
H1: 55 s = 20
n = 100
For = 0.01, the critical value x − 0 60 - 55
z= =
of z is 2.326 s 20
x − 0 n 100
z=
The test statistic is: s
n 5
= = 2.5 Reject H
Do not reject H0 if: [z 2.326] 2 0
Reject H0 if: z 2.326]
7-89
Example
0 .2
that the average concentration
0 .1
00
of vinyl chloride is more than
0 .0
-5 0 5 55 ppm.
z 2.326
2.5
Nonrejection Rejection
Region Region
7-90
Example
A certain kind of packaged food bears the following statement on the package: “Average net weight 12 oz.”
Suppose that a consumer group has been receiving complaints from users of the product who believe that they are
getting smaller quantities than the manufacturer states on the package. The consumer group wants, therefore, to
test the hypothesis that the average net weight of the product in question is 12 oz. versus the alternative that the
packages are, on average, underfilled. A random sample of 144 packages of the food product is collected, and it is
found that the average net weight in the sample is 11.8 oz. and the sample standard deviation is 6 oz. Given these
findings, is there evidence the manufacturer is underfilling the packages?
7-91
Example
A certain kind of packaged food bears the following statement on the package: “Average net weight 12 oz.”
Suppose that a consumer group has been receiving complaints from users of the product who believe that they are
getting smaller quantities than the manufacturer states on the package. The consumer group wants, therefore, to
test the hypothesis that the average net weight of the product in question is 12 oz. versus the alternative that the
packages are, on average, underfilled. A random sample of 144 packages of the food product is collected, and it is
found that the average net weight in the sample is 11.8 oz. and the sample standard deviation is 6 oz. Given these
findings, is there evidence the manufacturer is underfilling the packages?
n = 144
H0: 12
H1: 12 x = 11.8
s = 6
n = 144
For = 0.05, the critical value
of z is -1.645 x−
z= 0 = 11.8 -12
x − 0 s 6
z=
The test statistic is: s n 144
n
Do not reject H0 if: [z -1.645] =
-.2
= −0.4 Do not reject H
Reject H0 if: z −5] .5 0
7-92
Example
0.2
005
conclude that the manufacturer
0.1
is underfilling packages on
0.0
-5 0 5
z
average.
-1.645
-0.4
Rejection Nonrejection
Region Region
7-93
A floodlight is said to last an average of 65 hours. A competitor believes that the average life of the
floodlight is less than that stated by the manufacturer and sets out to prove that the manufacturer’s
claim is false. A random sample of 21 floodlight elements is chosen and shows that the sample
average is 62.5 hours and the sample standard deviation is 3. Using =0.01, determine whether
there is evidence to conclude that the manufacturer’s claim is false.
7-94
A floodlight is said to last an average of 65 hours. A competitor believes that the average life of the
floodlight is less than that stated by the manufacturer and sets out to prove that the manufacturer’s
claim is false. A random sample of 21 floodlight elements is chosen and shows that the sample
average is 62.5 hours and the sample standard deviation is 3. Using =0.01, determine whether
there is evidence to conclude that the manufacturer’s claim is false.
H0: 65
H1: 65
n = 21
For = 0.01 an (21-1) = 20 df, the
critical value -2.528
005
0 .1
is false, that the average
0 .0
-5
-2.528
0 5
t
floodlight life is less than 65
-3.82 hours.
Rejection Nonrejection
Region Region
7-96
H0: p 0.0096
H1: p 0.0096
n = 600
0 .2
0.4 0.4
p-value=area to
p-value=area to
0.3 right of the test statistic 0.3
right of the test statistic
=0.3018
=0.0062
f(z)
f(z)
0.2 0.2
0.1 0.1
0.0 0.0
-5 0 0.519 5 -5 0 5
z 2.5 z
The p-value is the probability of obtaining a value of the test statistic as extreme as,
or more extreme than, the actual value obtained, when the null hypothesis is true.
The p-value is the smallest level of significance, , at which the null hypothesis
may be rejected using the obtained value of the test statistic.
7-100
When the p-value is greater than 0.10, the result is considered not
significant.
7-101
f(z) 0.3
0.2
0.1
0.0
-5 0 5
-0.4 0.4
z
The further away in the tail of the distribution the test statistic falls, the smaller
is the p-value and, hence, the more convinced we are that the null hypothesis is
false and should be rejected.
In a right-tailed test, the p-value is the area to the right of the test statistic if the
test statistic is positive.
In a left-tailed test, the p-value is the area to the left of the test statistic if the
test statistic is negative.
In a two-tailed test, the p-value is twice the area to the right of a positive test
statistic or to the left of a negative test statistic.