Professional Documents
Culture Documents
Examples:-
• Mean of population,
• Population Variance,
• Population range.
Basic Definitions … cont
Sample static: refers to a measurement or
characteristic taken from the sample.
• Examples:-
• Sample mean (X)
• Sample Variance (S2)
• Sample standard Deviation ( S)
• Sample proportion
Basic Definitions … cont
Sampling unit: may be a geographical one such as state,
district, village, etc., or a construction unit like house,
etc., or it may be a social unit such as family, club,
school, etc., or it may be an individual.
The needs for sampling
• Goal orientation,
• Measurability,
• Practicality,
• Economy,
• Independence,
• Homogeneity, and
• Adequacy.
Sample Frame
C. Stratified sampling
If a population from which a sample is to be drawn does
not constitute a homogeneous group, stratified
sampling technique is generally applied in order to
obtain a representative sample.
Under stratified sampling, the population is divided into
several sub-populations that are individually more
homogeneous than the total population (the different sub-
populations are called ‘strata’) and then we select items
from each stratum to constitute a sample.
Stratified sampling results in more reliable and detailed
information.
Types of probability sampling… cont…
C. Stratified sampling… cont…
• How to form strata?: The strata be formed on the basis
of common characteristic(s) of the items to be put in
each stratum.
• How should items be selected from each stratum?: the
usual method, for selection of items for the sample from
each stratum, resorted to is that of simple random
sampling. Systematic sampling can be used if it is
considered more appropriate in certain situations.
• How to allocate the sample size of each stratum?: we
usually follow the method of proportional allocation
under which the sizes of the samples from the different
strata are kept proportional to the sizes of the strata.
Types of probability sampling… cont…
C. Stratified sampling… cont…
• Factors to be considered in Stratified Sampling
– Base of stratification
– Number of strata
– Sample size with in strata
Types of probability sampling… cont…
D. Cluster sampling
If the total area of interest happens to be a big one, a
convenient way in which a sample can be taken is to divide the
area into a number of smaller non-overlapping areas and
then to randomly select a number of these smaller areas
(usually called clusters).
In this method, the universe (population) is divided in to some
recognizable subgroups, called clusters. After this, simple
random sample of this cluster is drawn and all elements
belonging to the selected cluster longitude the sample.
Cluster sampling, no doubt, reduces cost by concentrating
surveys in selected clusters. But, certainly it is less precise than
random sampling. It reduces the cost of sampling for a
population scattered over large geographic areas.
Types of probability sampling… cont…
E. Multi-stage sampling
• is a further development of the principle of cluster
sampling.
When different sampling techniques are combined at the
various stages to select the representative from the Universe
it is known as multi-stage sampling.
Eg: Suppose we want to investigate the working efficiency of
nationalised banks in Ethiopia and we want to take a sample
of few banks for this purpose. The first stage is to select
large primary sampling unit such as states in a country.
Then we may select certain districts and interview all
banks in the chosen districts. This would represent a two-
stage sampling design with the ultimate sampling units
being clusters of districts.
Non-probability (Non-Random) Sampling
Basic Concepts
Estimation
• is the process of using statistics as estimates of parameters.
• is any procedure where sample information is used to estimate/
predict the numerical value of some population measure (called a
parameter).
Estimator: any sample statistic that is used to estimate a
population parameter (sample mean for population mean, sample
variance for sample standard deviation, sample proportion for
population proportion).
Estimate- is a specific numerical value of our estimator. Eg:
Important Properties of Good Estimators
• indicates standard that can be used to evaluate estimators:
A. Unbiasedness : The estimator is said to be unbiased when the mean
of sampling estimator is equals to population parameter.
The sample mean is an unbiased estimator of the population
mean.
The sample variance is an unbiased point estimator the
population variance because the mean of the sampling distribution
of the sample variance is equal to the population variance.
The sample proportion is an unbiased estimator of the population
proportion.
However, because standard deviation is a nonlinear function of
variance, the sample standard deviation is not an unbiased
estimator of population standard deviation.
B. Efficiency: refers to the size of standard error of statistics. The most
efficient estimator is the one with smallest variance.
Important Properties of Good Estimators…
C. Consistency: related to the relationship between the
larger the sample size and the variance. It is related to
their behavior as the sample gets large.
A statistic is a consistent estimator of population
parameter if sample size is large/increases, it becomes
almost more certain, the value of the statistic comes
very close to the value of the population parameter. An
unbiased estimator is a consistent estimator if the
variance approaches to 0/zero/ as n increases.
D. Sufficiency (or sufficient statistic): is an estimator that
utilizes all the information a sample contains about the
parameter to be estimated. For example, the sample mean
is a sufficient estimator of the population mean.
Concepts of Statistical Estimation…
Estimation procedure
1. Select a sample.
2. Collect the required information from the
members of the sample.
3. Calculate the value of sample statistics.
4. Assign the value(s) to the corresponding
population parameter.
An estimator may be a point estimator or interval
estimator.
Concepts of Statistical Estimation…
Statistical Inference:
• branch of statistics which is concerned with using
probability concepts to deal with uncertainty in
decision making.
• is based on estimation and hypothesis testing and
any inference drawn about the population (i.e
based on sample statistic or function of sample
information).
Type of Estimates
A point estimate:
• is a single number that is used to estimate an unknown population
parameter.
• It is a single value that is measured from a sample and used as an
estimate of the corresponding population parameter.
The most important point estimates (given that they are
single values) are:
Type of Estimates
An interval estimate
• is a range of values used to estimate a population
parameter.
• describes the range of values with in which a parameter
might lie.
• Stated differently, an interval estimate is a range of
values with in which the analyst can declare with some
confidence that the population parameter will fall.
Example
• Suppose we have the sample 10,20,30,40 and 50
selected randomly from a population whose mean μ is
unknown.
• The sample mean, , = ∑xi /n = 10 + 20 + 30 + 40 +
50 /5= 30 is a point estimate of μ/population mean.
• On the other hand, if we state that the mean, µ, is
between x ± 10 , the range of values from 20 (30-10) to
40 (30+10) is an interval estimate.
Interval Estimators of the Mean and the Proportion
Point estimators of population parameters, while useful, do not
convey as much information as interval estimators. It produces
a single value as an estimate of the unknown population
parameter. The estimate may or may not be close to the
parameter value; in other words, the estimate may be incorrect.
An interval estimate, on the other hand, is a range of values
that conveys the fact that estimation is an uncertain process.
The standard error of the point estimator is used in creating
a range of values; thus a measure of variability is incorporated
into interval estimation.
Further, a measure of confidence in the interval estimator is
provided; consequently, interval estimates are also called
confidence intervals. For these reasons, interval estimators
are considered more desirable than point estimators.
Interval estimation for population mean (µ)
As a result of the CLT, the following Z formula for
sample means can be used when sample sizes are large,
regardless of the shape of the population distribution or
for smaller sizes if the population is normally distributed.
Interval estimation for population mean (µ) …
The confidence interval for population mean is affected
by:
• The population distribution, i.e., whether the population
is normally distributed or not
• The standard deviation, i.e., whether σ is known or not.
• The sample size, i.e., whether the sample size, n, is large
or not.
Condition1: Confidence interval estimate of (µ) when Distribution is Normal and σ is known
Step 3 compute for value from standard normal table (t – table) reflecting
confidence level
Z = Z0.005 = 2.576
Step 4 compute for population mean (μ) or Construct the Confidence
interval of µ
μ = 80.76 to 90.24
Step 5: Interpretation: we state with 99% confidence that the average
distance traveled by rented cars lies between 80.76 and 90.24miles.
Condition 3: Confidence interval for μ, when σ unknown, n small and population is normal
Steps
1. Calculate for the degrees of freedom (v = n-1)
2. Calculate sample standard error of the mean i.e.
3. Compute for a / 2 (compute for incorrect proportion)
Condition 3: Confidence interval for μ, when σ unknown, n small
and population is normal…
Example: If a random sample of 27 items produces x =
128.4 and s = 20.6. What is the 98% confidence interval
for μ? Assume that x is normally distributed for the
population. What is the point estimate?
Given
• n = 27 s = 20.6
• x = 128.4 c = 98% = 0.98
Solution
The point estimate of the population mean is the sample mean, in
this case 128.4 is the point estimate.
1. Compute for degrees of freedom: V= n – 1= 27 – 1 = 26
2. Calculate sample standard error of the mean: = 20.6/√27
= 20.6/5.196 = 3.965
Condition 3: Confidence interval for μ, when σ unknown, n
small and population is normal…
3: a/2 = (1-C)/2 = (1-0.98)/2 = 0.02/2 = 0.01
4. Look up t , v value from t-distribution, i.e
t , ,v = t0.01,26 = 2.479
4: Determine for confidence interval
μ = 128.4 ±2.479 * 3.965
128.4 – 9.829 ≤ μ ≤ 128.4 + 9.829
118.571 to 138.229
5: Interpret Results: There is a 98% confidence that the
population mean lies between 118.57 and 138.23
Interval Estimation of the Population Proportion
• We know that a sample proportion, p, is an unbiased
estimator of a population proportion P and if the sample
size is large, the sampling distribution of p is normal with:
• Example
• Suppose that 60% of electrical contractors in a region use
a particular brand of wire what is the prob. of taking a
random sample of size 120 from those electrical contractor
and finding that 0.5 or less use that brand of wire?
Interval Estimation of the Population Proportion…