Professional Documents
Culture Documents
Means
Business Statistics/ Statistical
Inference
Case of Two Independent Populations
The Concept
Consider a random sample mean for some random
sample 1 belonging to a sampling distribution of
mean of population 1. The expected value of which
is the mean of the population 1, for which the
sampling distribution of mean is made. The
variance of is given by i.e. where is the standard
deviation of sample 1 from population 1 and is the
sample size of the sample 1 from this sampling
distribution of mean .
The Concept
Consider another random sample mean for some
random sample 2 belonging to a sample distribution
of mean of population 2. The expected value of
which is the mean of the population 2 for which the
sampling distribution of mean is made. The variance
of is given by i.e. where where is the standard
deviation of sample 2 from population 2 and is the
sample size of the sample 2 from this sampling
distribution of mean .
The Concept
• The two situations can be depicted as follows
Sampling distribution
of mean=
𝜇1 𝑋1
Sampling distribution
of mean =
𝜇2 𝑋2
Population Cloud 1
Sample 1
𝑋1
𝑥1 𝜇1
𝑠 22 𝑠2
𝑣𝑎𝑟𝑖𝑎𝑛𝑐𝑒 2= ⇒𝑆𝐸𝑀 2=
𝑁2 √𝑁2
Population Cloud 2
𝑥2 𝜇2 Sample 2
𝑋2
±
+ Population Cloud 3
+
Estimating Population Mean
• Remember that for one sample the estimation of is given as,
=
Here value is called critical value for a CI or
• If the two sampling distributions are added to obtain then the estimation is
given using two sample means from two separate population as follows,
For example:
Confidence
Interval (CI)
Or
interval
𝛼 𝛼
2 2
−𝑡 𝑐 𝑡=0 +𝑡 𝑐
−𝑡 𝛼 +𝑡 𝛼
2 2
Confidence
Interval (CI)
Or
interval=
𝛼 𝛼
2 2
−𝑡 𝑐 𝑡=0 +𝑡 𝑐
−𝑡 0.025 +𝑡 0.025
Two cases and DF
• In independent populations case, with reference
to SEM we can have two cases
– Equal variances
– Unequal variances
Case of Equal variances
• If sample variances are equal then equation
can be simplified from
• Further more, in this case we have to pool or group the variances as one
single value. This pooled standard deviation or (as it is called) is given by,
• The difference of the sample values will make a new difference dataset that
will estimate a new difference population as follows:
Here
is the mean of a new data set which contains differences of measured values
from two samples taken from same single population.
is the standard deviation of new differences dataset.
is the sample size of new differences dataset.
is the critical value at some CI or level of significance with
Find the point to point difference
of two samples and this will form
Population Cloud new difference dataset, . Mean
of all such similar difference
Sample 1 Sample 2 dataset will form Sampling
Distribution of
Difference of Means=
Sampling Distribution of
Difference of Means=
𝜇𝑑 𝑋𝑑
𝑥𝑑
Summary of Formulae for Estimation of
Difference of Means
S. No Formula to Use Condition
When we have two samples
from two separate
1 populations and the two
samples have equal
variances.
• The last case is different from the first two. Firstly note that
population is same and secondly the sample values are
subtracted and this gives a new differences dataset. After
obtaining this new differences dataset, its sample mean, variance
and standard deviation are used in the estimation formula.
– This is also called dependent means or paired groups case or repeated
measures
Examples
Estimation of Difference of Means
Example 1
• In a packing plant, a machine packs cartons with jars. It is supposed that a new machine will
pack faster on the average than the machine currently used. An experiment was conducted
to record the time taken to pack 10 cartons by new and present/ old machine. The results in
seconds, are shown in the following table. Give 99% CI for the difference between the mean
time it takes the new machine to pack 10 cartons and the mean time it takes the old/ present
machine to pack 10 cartons.
Sample 1(New Machine) Sample 2 (Old machine)
42.1 42.7
41.0 43.6
41.3 43.8
41.8 43.3
42.4 42.5
42.8 43.5
43.2 43.1
42.3 41.7
41.8 44.0
42.7 44.1
and and
Example 1
• Here two samples from two separate populations are taken
– Therefore it is independent means case
• Hence the difference between the mean time it takes the new
machine to pack 10 cartons and the mean time it takes the old/
present machine to pack 10 cartons can be estimated by using,
The 99% CI is
• The sample statistics for two samples from two populations sophomores and juniors written
with subscripts 1 and 2 respectively are as follows:
Which gives
( )
2 2 2
𝑠1 𝑠2
+
𝑁1 𝑁2
𝐷 𝐹=
( ) ( )
2 2 2 2
1 𝑠1 1 𝑠2
+
𝑁 1−1 𝑁1 𝑁 2− 1 𝑁2
• After obtaining DF we can find required value from the table as follows:
Example 2
Example 2
• Hence the value is 2.056.
• Therefor the calculation is proceeded as follows:
• The DF is