Professional Documents
Culture Documents
St. Andrew’s College received 900 applications from prospective students. The application form
contains a variety of information including the individual’s Scholastic Aptitude Test (SAT) score and
whether or not the individual desires on-campus housing.
At a meeting in a few hours, the Director of Admissions would like to announce the average SAT score
for the population of 900 applicants.
However, the necessary data on the applicants have not yet been entered in the college’s computerized
database. So, the Director decides to estimate the values of the population parameters of interest based
on sample statistics. The sample of 30 applicants is selected using computer-generated random
numbers.
s as Point Estimator of σ
𝑠= √ ∑ ¿ ¿ ¿ ¿
Note: Different random numbers would have identified a different sample which would have resulted
in different point estimates.
Once all the data for the 900 applicants were entered in the college’s database, the values of the
population parameters of interest were calculated.
Population Mean SAT Score
𝜇=
∑ 𝑥𝑖
=1697
900
Population Standard Deviation for SAT Score
𝜎=√ ∑ ¿¿¿¿¿
• We will use the following notation to define the standard deviation of the Sampling distribution of .
When the population from which we are selecting a random sample does not have a normal
distribution, the central limit theorem is helpful in identifying the shape of the sampling
distribution of 𝑥 ̅.
CENTRAL LIMIT THEOREM
In selecting random samples of size n from a population, the sampling distribution of the
sample mean 𝑥 ̅ can be approximated by a normal distribution as the sample size becomes large.
The larger the sample size, the more closely the sampling distribution of 𝑿 ̅ will resemble a
normal distribution.
If the population is non-normal, then is approximately normal only for larger values of
n.
2.
The foreman of a bottling plant has observed that the amount of soda in each “32-
ounce” bottle is actually a normally distributed random variable, with a mean of 32.2
ounces and a standard deviation of .3 ounce.
If a customer buys one bottle, what is the probability that the bottle will contain more
than 32 ounces?
We want to find P(X > 32), where X is normally distributed and µ = 32.2 and σ =.3
The foreman of a bottling plant has observed that the amount of soda in each “32-
ounce” bottle is actually a normally distributed random variable, with a mean of 32.2
ounces and a standard deviation of .3 ounce.
If a customer buys a carton of four bottles, what is the probability that the mean
amount of the four bottles will be greater than 32 ounces?
Things we know:
1) X is normally distributed, therefore so will
2)
3) 0.15
If a customer buys a carton of four bottles, what is the probability that the mean
amount of the four bottles will be greater than 32 ounces?
what is the probability that one bottle will what is the probability that the mean of
contain more than 32 ounces? four bottles will exceed 32 oz?
Whenever a simple random sample is selected and the value of the sample mean is used to estimate the
value of the population mean m, we cannot expect the sample mean to exactly equal the population mean.
The practical reason we are interested in the sampling distribution of is that it can be used to provide
probability information about the difference between the sample mean () and the population mean
().
Example: A personnel director claims that the mean salary of managers is $51800 with a standard
deviation of $4000. To test this claim he drew a sample of 30 managers and their annual salary was
recorded. The personnel director believes that the sample mean will be an acceptable representation of the
population mean only if the sample mean is within $500 of the population mean.
(It is not possible to guarantee that the sample mean will be within $500 of the population mean.
Therefore, we talk in terms of probability. i.e. What is the probability that the sample mean computed
using a simple random sample of 30 managers will be within $500 of the population mean?)
Therefore, there is an approximately 50 % chance that a sample of 30 managers will provide a sample
mean that lies within $500 of the population mean
In this formula α (Greek letter alpha) is the probability that does not fall into the interval.
To apply this formula all we need to do is substitute the values for µ, σ, n, and α.
α/2 α/2
(1-α)
-z +z
K J Somaiya Institute of Management, India 9.28
From Here to Inference
Simply put, knowledge of the population and its parameter(s) allows us to use the
probability distribution to make probability statements about individual members of the
population.
Statistic
Statistical inference works by reversing the direction of the flow of knowledge in the previous figure.
Statistic Parameter