Professional Documents
Culture Documents
Lecture5 Samplingdist PDF
Lecture5 Samplingdist PDF
STATISTICS
SAMPLING DISTRIBUTIONS
INTRODUCTION TO INFERENTIAL STATISTICS
• Statistical inference is the process of making generalizations about a
population from a sample.
Since most of the characteristics of a population can be described by
parameters, inferential statistics primarily deals
2
• The key concept behind these statistical procedures is the probability
distribution, called sampling distribution, of a sample statistic.
Ex 1:
Suppose we are interested in the following population: X = {1,2,3,4,5).
a) Since this is a very small population (Nx = 5), it is easy to observe the whole
population, to illustrate it with a relative frequency histogram and to find the
parameters, like the population mean and the population variance.
Population of X
0.25 x 3 and x2 2
0.20
0.15
0.10
0.05
x 2
0.00
1 2 3 4 5
3
b) Now consider the hypothetical exercise where, we draw all possible
samples of size two (n = 2) with replacement.
There are 25 possible samples. They are shown in the first row and first
column of the table below.
Compute the sample mean from each of these samples.
(E.g.: If x1 = 1 and x2 = 4, x-bar is 2.5.)
x 1 2 3 4 5
These sample mean values
1 1.0 1.5 2.0 2.5 3.0 form a second population of
X-bar values.
2nd draw (x2)
4
d) Compare the X, and X-bar populations to each other.
X X-bar
Size (N) 5 25
Mean (μ) 3 3
Variance (σ 2) 2 1
Std.Deviation σ 2 1
Population of X-bar1
0.25
0.20
0.15
0.10
0.05
0.00
1.0 1.5 2.0 2.5 3.0 3.5 4.0 4.5 5.0
6
• In practice, the target population is usually large.
7
Even a small random sample is likely to give us fairly
accurate information about the population.
(E.g. the sample mean is expected to be close to the
population mean.)
If the sample is not randomly selected, it is likely to produce misleading,
biased results, even if the sample is relatively large.
Random samples can be drawn in several different ways. The most basic
procedure is called simple random sampling.
9
The difference between the parameter (such as μ) and the statistic (x-
bar) is called error.
10
SAMPLING DISTRIBUTIONS
11
3) If the population (X) is normally distributed, X-bar is also
normally distributed, regardless of the sample size.
12
Ex 2: An automatic machine in a manufacturing process is operating properly
if the lengths of an import sub-component are normally distributed, with mean μ
= 117 cm and standard deviation σ = 2.1 cm.
X
a) Find the probability that one randomly selected unit has a length of greater
than 120 cm.
P( X 120) P( Z 1.43) 0.5 P(0 Z 1.43) 0.5 0.4236 0.0764
120 117
z
2.1
b) Find the probability that if three units are randomly selected, their mean
length exceeds 120 cm. n
120 117
z
1.212
Ex 3:
Imagine we have a large population of, say, invoices with μ = $255.60 and σ =
$45.20.
If we draw a random sample of size 400 and calculate the average value of
these invoices, what is the probability that
a) it exceeds $260.00?
In this example we are not told whether the population of invoices is normally
distributed. However, since n = 400 we can rely on CLT.
14
45.20
x 255.60 and x 2.26
n 400
P( X 260) P( Z 1.95)
260 255.60
z
2.26
0.5 P(0 Z 1.95) 0.5 0.4744 0.0256 or 2.56%
15
SAMPLING DISTRIBUTION of a proportion
• Let us suppose that a population has only two types of elements
(such as yes – no, female – male, defective – non-defective etc.),
N
1
p
N
x
i 1
i
16
• Since the population proportion is the mean of a binary population,
most of what you have learnt about the estimation of a population mean
also applies to the population proportions.
17
Draw all possible samples of the same size, calculate the sample
proportion from each of these samples, and prepare the relative
frequency distribution of p-hat.
n, pˆ 3
we can rely on the fact that p-hat is a special sample mean, so its
expected value is μx and its variance is σx2/n.
18
The sample proportion follows a binomial distribution with n and p
parameters.
19
Ex 4: p=0.1 q=0.9
From past experience 10% of electronic components received from a particular
manufacturer are defective. If, from a particular shipment, a random sample of
400 components is selected, what is the probability that the proportion of
defective parts in the sample is
n
20
b) less than 9%?
0.09 0.1
P( pˆ 0.09) P( Z 0.67) z
0.015
P( Z 0.67) 0.5 P(0 Z 0.67) 0.5 0.2486 0.2514
pq 0.1 0.9
pˆ 0.03 (instead of 0.015)
n 100
and P( pˆ 0.13) P( Z 1.00) 0.5 0.3413 0.1587 (i) is more
likely.
0.13 0.1
z
0.03
21