Professional Documents
Culture Documents
Lesson 1 and 2: Sampling Distribution of Sample Means and Finding the Mean and Variance of the Sampling
Distribution of Means
Objectives:
Definition
A statistic and a parameter are very similar. They are both descriptions of groups, like “50% of dog owners prefer Brand X
dog food”. The difference between a statistic and parameter is that statistics describe a sample while a parameter
describes an entire population.
Step 1: Ask yourself, is this a fact about the whole population? Sometimes that’s easy to figure out. For example, with
small populations, you usually have a parameter because the groups are small enough to measure:
Examples of parameters:
10% of US senators voted for a particular measure. There are only 100 US Senators, you can count what every
single one of them voted.
40% of 1,211 students at a particular elementary school got below a 3 on a standardized test. You know this
because you have each and every students’ test score.
33% of 120 workers at a particular bike factory were paid less than $20,000 per year. You have the payroll data
for all of the workers.
Step 2: Ask yourself, is this obviously a fact about a very large population? If it is, you have a statistic.
Examples of statistics:
60% of US residents agree with the latest health care proposal. It’s not possible to actually ask hundreds of
millions of people whether they agree. Researchers have to just take samples and calculate the rest.
45% of Jacksonville, Florida residents report that they have been to at least one Jaguars game. It’s very doubtful
that anyone polled in excess of a million people for this data. They took a sample, so they have a statistic.
30% of dog owners poop scoop after their dog. It’s impossible to survey all dog owners—no one keeps an
accurate track of exactly how many people own dogs. This data had to be from a sample, so it’s a statistic.
1. Determine the number of possible samples that can be drawn from the population using the formula:
N!
N Cn = ( N −n ) !
Formula
Population Mean ( μ)
μ=
∑X where X are the samples and N are the number of samples
N
Population Variance (σ 2)
∑ ( X−μ)2
N
Consider a population of 1, 2, 3, 4, and 5. Suppose samples of size 2 are drawn from this population. Describe
the sampling distribution of the sample means.
What is the mean and variance of the sampling distribution of the sample means?
Compare these values to the mean and variance of the population
Draw the histogram of the sampling distribution of the population mean.
1
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
y=e¿ ¿
σ √2π
Where:
“A standard normal curve is a normal probability distribution that has a mean μ=0 and a standard deviation σ =1”
2
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
The Table of Areas under the Normal Curve is also known as the z-Table. The z-score is a measure of
relative standing. It is calculated by subtracting X́ (or μ ¿ from the measurement X and then dividing the result by s (or σ ¿
. The final result, the z score, represents the distance between a given measurement X and the mean, expressed in
standard deviations. Either the z score locates X within a sample or within a population.
Four-Step Process in Finding the Areas Under the Normal Curve Given a z-Value
Examples:
Finding the area that corresponds to is the same as finding the area between z = 0 and z = 1.
Steps Solution
1. Express the given into a three-digit form. Z = 1.00
2. In the table, find the Row z = 1.00
3. In the table, find the Column with the heading .00
4. Read the area(or probability) at the intersection of This area is 0.3413. This is the required area.
Column 1.0 and the Row .00
Steps Solution
1. Express the given z into a three-digit form. Z = 1.36 (as is)
2. In the table, find the row z = 1.3
3. In the table, find the column with the heading .06
4. Read the area(or probability) at the intersection of the This area is 0.4131. This is the required area.
Column 1.3 and the Row .06
3
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
The z-score
The areas under the normal curve are given in terms of z-values or scores. Either the z-score locates X within a
sample or within a population.
x−μ
z= σ (z-score for population data)
X− X́
z= s
(z-score for sample data)
μ= population mean
X́ = sample mean
Example 1
Reading Scores
Given the mean μ= 50 and the standard deviation, σ = 4 of a population of Reading scores. Find the z-value that
corresponds to a score X= 58.
Steps Solution
1. Use the computing formula for finding the z-scores of x−μ
population data. z=
σ
2. Check the given values. Since these are population μ= 50, σ = 4, and X= 58.
values, the z-score locates X within a population.
3. Substitute the given values in the computing formula. 58−50
Z=
4
4. Compute the z-value. 8
Z= =2
4
38 42 46 50 54 58 62 4
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
Example 2
Score in PE
Locate the z value that corresponds to a PE score of 39 given that μ=45 and σ =6.
Steps Solution
1. Use the computing formula for finding the z-scores of x−μ
population data. z=
σ
2. Check the given values. Since these are population μ= 45, σ = 6, and X= 39.
values, the z-score locates X within a population.
3. Substitute the given values in the computing formula. 39−45
Z=
6
4. Compute the z-value. −6
Z= = -1
6
Example 3
Steps Solution
1. Use the computing formula for finding the z-scores of x− X́
population data. z=
s
2. Check the given values. The z-score in question locates X́ = 26, s= 4, and X= 20.
X in a sample.
3. Substitute the given values in the computing formula. 20−26
Z=
4
4. Compute the z-value. −6 3
Z= = - = - 1.5.
4 2
Exercises
A. Solve for the z-values and state whether each raw score lies below or above the mean.
1. X = 50, s = 5 , X́ = 40
2. X = 40, σ =8 , μ = 52
3. X = 36, s = 6, X́ = 28
4. X = 74, s = 10, X́ = 60
5. X = 82, σ = 15, μ = 75
B. Given X́ = 62 and s = 8. Find the z-score value that corresponds to each of the following scores up to two decimal
places. Draw the normal curve.
1. X = 70
2. X = 78
3. X = 82
4. X = 50
5. X = 42
C. Suppose you have the population values 50 and 80 and that their corresponding z-scores are -1 and 2, respectively. Is
it possible to determine the population’s mean and standard deviation? If so, what are these values? If not, explain why it
is impossible.
D. There are three grades in a report card that you want to interpret in terms of performance:
Mathematics – 75
English – 85
5
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
Science – 90
The means are 72, 83, and 88, respectively. The standard deviations are 3, 10, and 15, respectively. Is the information
sufficient for you to compare the grades? If so, discuss your processes. If not, explain why it is impossible.
Lesson 3 and 4: Identifying Regions of Areas Under the Normal Curve and Determining Probabilities
Objectives:
The following notations for a random variable are used in our various solutions concerning the normal curve.
Mathematical notations are convenient forms of lengthy expressions.
P(a < z < b) denotes the probability that the z-score is between a and b.
P(z > a) denotes the probability that the z-score is greater than a.
P(z < a) denotes the probability that the z-score is less than a.
Where a and b are z-score values.
For example, to denote the area between z = 1 and z =2, we use the notation: P(1 < z < 2) = 0.1359.
6
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
Steps Solution
1. Draw a normal curve.
2. Locate the z-value.
3. Draw a line through the z-value.
4. Shade the required region.
5. Consult the z-Table and find the area that corresponds z =1 corresponds to an area of 0.3413 or 34.13%
to z = 1.
6. Examine the graph and use probability notation to form The graph suggests subtraction.
an equation showing the appropriate operation to get the The required area is equal to 50%-34.13%=15.87%.
required area. That is,
P(z>1) =15.87%.
7. Make a statement indicating the required area. The proportion of the area greater than z=1 is 15.87%.
Steps Solution
1. Draw a normal curve.
2. Locate the z-value.
3. Draw a line through the z-value.
4. Shade the required region.
5. Consult the z-Table and find the area that corresponds z =1 corresponds to an area of 0.4332 or 43.32%
to z = -1.5.
6. Examine the graph and use probability notation to form The graph suggests subtraction.
an equation showing the appropriate operation to get the The required area is equal to 50%-43.32%=6.68%.
required area. That is,
P(z<-1.5) =6.68%.
7. Make a statement indicating the required area. The proportion of the area to the left of z=-1.5 is 6.68%.
CASE 2.a Between two values with the same sign (positive)
Example. Find the area between z = 0.98 and z = 2.58.
Steps Solution
1. Draw a normal curve.
2. Locate the z-value.
3. Draw a line through the z-value.
4. Shade the required region.
5. Consult the z-Table and find the area that corresponds Z = 0.98 corresponds to 33.65%
to z = 0.98 and z = 2.58 Z = 2.58 corresponds to 49.51%
6. Examine the graph and use probability notation to form The graph suggests subtraction.
an equation showing the appropriate operation to get the The required area is equal to 49.51% - 33.65% = 15.86%
required area. That is,
P(0.98 < z < 2.58) = 15.86%
7. Make a statement indicating the required area. The required area between z=0.98 and z=2.58 is 15.86%.
CASE 2.b. Between two values with the same sign (negative)
Example. Find the area between z = -2 and z = -1.5.
Steps Solution
1. Draw a normal curve.
2. Locate the z-value.
3. Draw a line through the z-value.
4. Shade the required region.
7
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
5. Consult the z-Table and find the area that corresponds Z = -2 corresponds to 47.72%
to z = -2 and z = -1.5 Z = -1.5 corresponds to 43.32%
6. Examine the graph and use probability notation to form The graph suggests subtraction.
an equation showing the appropriate operation to get the The required area is equal to 47.72% - 43.32% = 4.40%
required area. That is,
P(-2 < z < -1.5) = 4.40%
7. Make a statement indicating the required area. The required area between z=-2 and z=-1.5 is 4.40%.
Steps Solution
1. Draw a normal curve.
2. Locate the z-value.
3. Draw a line through the z-value.
4. Shade the required region.
5. Consult the z-Table and find the area that corresponds Z = -1.32 corresponds to 33.65%
to z = -1.32 and z = 2.37 Z = 2.37 corresponds to 49.51%
6. Examine the graph and use probability notation to form The graph suggests addition.
an equation showing the appropriate operation to get the The required area is equal to 49.11% + 40.66% = 89.77%
required area. That is,
P(-1.32 < z < 2.37) = 89.77%
7. Make a statement indicating the required area. The required area between z=-1.32 and z=2.37 is 89.77%.
Steps Solution
1. Draw a normal curve.
2. Locate the z-value.
3. Draw a line through the z-value.
4. Shade the required region.
5. Consult the z-Table and find the area that corresponds z = -1 corresponds to an area of 0.3413 or 34.13%
to z = -1.
6. Examine the graph and use probability notation to form The graph suggests addition.
an equation showing the appropriate operation to get the The required area is equal to 50%+34.13%=84.13%.
required area. That is,
P(z>-1) =84.13%.
7. Make a statement indicating the required area. The proportion of the area above is 84.13%.
Steps Solution
1. Draw a normal curve.
2. Locate the z-value.
3. Draw a line through the z-value.
4. Shade the required region.
5. Consult the z-Table and find the area that corresponds z =1.5 corresponds to an area of 0.4332 or 43.32%
to z = 1.5.
6. Examine the graph and use probability notation to form The graph suggests addition
an equation showing the appropriate operation to get the The required area is equal to 50%-43.32%=93.32%.
required area. That is,
P(z<1.5) =93.32%.
7. Make a statement indicating the required area. The proportion of the area greater than 93.32%.
8
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
PERCENITLE – is a measure of relative standing. It is a descriptive measure of the relationship of a measurement to the
rest of the data.
For example in a test in Algebra, you got a score of 82 and you want to know how you fared in comparison with
your classmates. If your teacher tells you that you scored at the 90 th percentile, it means that 90% of the grades were
lower than yours and 10% were higher. If the relative frequency histogram were used to show the scores, the 90 th
percentile will be located at a point such that 90% of the total area under the relative frequency histogram lies below the
90th percentile and 10%lies above.
Example 1
Steps Solution
1. Draw the appropriate normal curve.
Example 2
Steps Solution
1. Draw the appropriate normal curve.
Example 3
9
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
The results of a nationwide aptitude test in mathematics are normally distributed with m = 80 and s = 15. What is
the percentile rank of a score of 84?
Analysis. Since we are interested in the percentile rank of 84, this means finding the percentage of scores below
84. To find the required area from the Table of Areas, we transform 84 to z-score form and then perform the appropriate
mathematical operation.
Steps Solution
1. Convert the raw score of 84 to z-score form. X−μ 84−80
z= = = 0.27
σ 15
2. Find the area that corresponds to z = 0.27 z = 0.27 corresponds to the area 0.1064
3. Draw the normal curve showing the z-score.
4. Draw a line through z = 0.27 in the sketch of the normal Do this under the sketch of the curve in Step 3.
curve in Step 3.
5. Shade the required region in the sketch of the normal Do this under the sketch of the curve in Step 3.
curve in Step 3. The required region is below z = 0.27 expressed as
percentile rank.
6. Examine the shaded regions and decide what operation First, get the total area below z = 0.27.
to apply. That is, 0.5000 + 0.1064 = 0.6064
7. Compute the percentile rank of the score 84. Percentile rank of 84 is 60.64%
8. Make a concluding statement. The percentile rank of the score 84 in a test is 60.64.
Example 1
The results of a nationwide aptitude test in mathematics are normally distributed with m = 80 and s = 15. Find the
raw score such that 70% of the cases are below it.
Analysis Questions:
1. What are the given?
2. What are you going to find?
3. What is suggested by the given to help you solve the problem?
4. Will the application of the normal curve concepts be useful?
Steps Solution
1. Sketch the diagram of a normal curve.
2. Selecting 30%, express this as area. This is the area to 30% is the same as 0.3000
the right of the z-value we want to locate.
3. Find the area between z = 0 and 0.2000. 0.5000-0.3000 = 0.2000
4. Find the area 0.2000 or the value nearest to it in the The nearest area is 0.1985
table of areas under the normal curve.
5. Find the z-value that corresponds to 0.1985 Z = 0.52 <-> the area 0.1985
6. Find a computing formula for determining raw scores. X−μ
z=
σ
7. Modify the formula to solve for X. X = μ+ zσ
8. Substitute the given values in the modified formula to X = μ+ zσ
solve for X. X = 80 + (0.52)(15)
X = 87.8 ≈ 88
9. Make a concluding statement. In the distribution, 70% of the scores are below the score
88.
Example 2
In the same problem, what is the score that divides the distribution such that 99% of the cases is below it?
10
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
Steps Solution
1. Sketch the diagram of a normal curve.
2. Selecting 1%, express this as area. This is the area to 1% is the same as 0.0100
the right of the z-value we want to locate.
3. Find the area between z = 0 and 0.4900. 0.5000-0.4900 = 0.0100
4. Find the area 0.4900 or the value nearest to it in the The nearest area is 0.4901
table of areas under the normal curve.
5. Find the z-value that corresponds to 0.4901 Z = 2.33 <-> the area 0.4901
6. Find a computing formula for determining raw scores. X−μ
z=
σ
7. Modify the formula to solve for X. X = μ+ zσ
8. Substitute the given values in the modified formula to X = μ+ zσ
solve for X. X = 80 + (2.33)(15)
X = 114.95 ≈ 115
9. Make a concluding statement. In the distribution, 99% of the scores are below the score
115.
Example 3
What are the scores that bound the middle 95% of the distribution?
Analysis Questions.
1. What percent is left at the tails of the distribution?
2. What areas do you need to solve this problem?
3. What normal curve model applies here?
There is an area of 2.5% or 0.0250 above and below the middle 95% of the distribution. To determine the scores
which bound the middle 95%, we must first find the z-score values associated with them. Then compute the raw scores.
Steps Solution
1. Sketch the diagram of a normal curve.
2. Five percent (5%) is left at the two tails of the 5% is the same as 0.05
distribution. Express it as area. This is the area to the right
of the z-value that we want to locate.
3. We are interested in the two areas at the tails of the 0.05 ÷ 2 = 0.0250
distribution, SO, we divide 0.05 by 2.
4. Find the area between 0.4750. 0.5000 – 0.0250 = 0.4750
5. Find the area 0.4750 or the value nearest to it in the The area 0.4750 is found in the Table.
table of Areas Under the normal curve.
6. Find the z-value that corresponds to 0.4750 z = 1.96 <-> the area 0.4750
7. Use the z-values at both sides of the distribution. The other values z = -1.96
8. Find a computing formula for determining the raw X−μ
scores. z=
σ
9. Modify the formula to solve for X. X = μ+ zσ and X = μ – zσ
10. Substitute the given values in the modified formula to For X = μ + zσ
solve for X X = _______
X = _______
For X = μ – zσ
X = _______
X = _______
11. Make a concluding statement. In the distribution, the scores that bound the middle 95%
are ____ and _____.
Example 4
The weights of 1000 children average 50kg and the standard deviation is 5kg. How many children weight between
40kg and 55kg?
11
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
Steps Solution
1. Draw the normal curve with six z-scores.
2. Use the formula μ ∓ zσ to determine the six raw scores For the raw score that corresponds to z = 1
that correspond to the six z-scores. X = 50 + (1)(5) = 55
For the raw score that corresponds to z =2.
3. Place the raw scores under the corresponding raw Show this in the sketch in Step 1.
scores.
4. What z –score corresponds to 40?
5. What z-score correspond to 55?
6. Find the total area between the z-values that
correspond to 40 and 50.
7. Multiply the total area by 1000 (the total number of
children). Round up to the nearest whole number.
8. Make a concluding statement.
12
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
A. Use the z-table to find the area that corresponds to each of the following. Draw the normal curve.
B. Determine each of the following areas and show these graphically. Use probability notation in your final answer.
1. above z = 1.46
2. below z = -0.58
8. at most z = -1.5
9. at least z = 1
10. at least z = -2
1. P99
2. 90th percentile
3. 68th percentile
13
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
4. P40
5. 32nd percentile
a. z = 1.25
b. z = -1.67
c. z = 2.3
d. z = -0.30
e. z = 1.96
Solution:
2. In number 1, find the raw score such that 60% is below it.
3. In a job fair 3000 applicants applied for a job. Their mean age was found to be 28 with a standard deviation of 4years.
a. Draw a normal curve distribution showing the z-scores and the raw scores.
14
Statistics and Probability – Chapter 3: Sampling and Sampling Distribution
4. Five hundred children participated in a field demonstration. Their heights averaged 110cm with a standard deviation of
6cm.
a. What is the probability that a child, picked at random, has a height greater than 116cm?
b. What is the probability that the height of a child, picked at random, is less than 104cm?
15