Professional Documents
Culture Documents
Calculate:
The lower limit of the first class interval i.e. 0-5 is ‘0’.
The class limits of the third class, i.e. 10-15 are 10 (lower
limit) and 15 (upper limit).
The class mark is defined as the average of the upper and
lower limits of a class. For 5-10, the class mark is (5+10)/2
= 7.5
The class size is the difference between the lower and
upper class-limits. Here, we have a uniform class size,
which is equal to 5 (5 – 0, 10 – 5, 15 – 10, 20 – 15 are all
equal to 5).
Life of AA batteries, in minutes Table summary. This table displays the
results of Life of AA batteries. The information is grouped by Battery life,
minutes (x) (appearing as row headers), Frequency (f), Relative frequency
and Percent frequency (appearing as column headers).
Where:
n 75
37.5
2 2
39 is the first cumulative frequency to be greater than or equal to 37.5
50 54 is the median class.
~
X L w ( n c)
L 49. 5, w 5 med f 2
med med
n 75, c 17, f 22 49. 5 5 (37. 5 17)
med 22
54. 16
D. Quantiles
• When a distribution is arranged in order of magnitude of items,
the median is the value of the middle term.
• Their measures that depend up on their positions in distribution
quartiles, deciles, and percentiles are collectively called
quantiles.
Quartiles:
Quartiles are measures that divide the frequency distribution in
to four equal parts.
The value of the variables corresponding to these divisions are
denoted Q1, Q2, and Q3 often called the first, the second and the
third quartile respectively.
Q1 is a value which has 25% items which are less than or equal
to it. Similarly Q2 has 50%items with value less than or equal to
it and Q3 has 75% items whose values are less than or equal to it
To find Qi (i=1, 2, 3) we count of the classes beginning from
the lowest class.
• For grouped data: we have the following formula
• Remark: The decile class (class containing Di )is the class with the
smallest cumulative frequency (less than type) greater than or
F. Percentiles:
• Percentiles are measures that divide the frequency distribution
in to hundred equal parts.
• The values of the variables corresponding to these divisions are
denoted P1, P2,.. P99 often called the first, the second,…, the
ninety-ninth percentile respectively.
• To find Pi (i=1, 2,..99) we count of the classes beginning from
the lowest class.
• For grouped data: we have the following formula
10
170 (123.25 88)
72
174.90
ii. Q2
• Determine the class containing the second quartile.
w 2* N
Q2 LQ ( c)
2
fQ
2
4
10
170 ( 246.5 244)
LQ 190 ,
2
w 10 72
N 493 , c 244 , f Q 107
2
190.23
iii. Q3
• Determine the class containing the third quartile.
w 3* N
Q3 LQ 3 ( c)
fQ
3
4
10
200 (369.75 351)
LQ 200 ,
3
w 10 49
N 493 , c 351 , f Q 49
3
203.83
b) D7
• Determine the class containing the 7th decile.
LD 190 ,
7
w 10
N 493 , c 244 , f D 107
7
c) P90
• Determine the class containing the 90th percentile.
Measures of Dispersion (Variation)
1. Range
• The simplest measure of dispersion is the range.
• It is the difference between the highest and the lowest values
in a data set.
Probability& Probability
Distributions
What Is Probability?
• No doubt you are familiar with terms such as probability,
chance, and likelihood.
• They are often used interchangeably.
• What is probability? In general, it is the chance that something
will happen.
• Probability is a numerical measure of the likelihood that an
event will occur.
• Thus, probabilities can be used as measures of the degree of
uncertainty associated with events.
• If probabilities are available, we can determine the likelihood of
each event occurring.
• Probability values are always assigned on a scale from 0 to 1.
• A probability near zero indicates an event is unlikely to occur;
a probability near 1 indicates an event is almost certain to
occur.
• In business probability analysis helps to determine reasonable
• Other probabilities between 0 and 1 represent degrees of
likelihood that an event will occur.
• For example, if we consider the event the price of good
increase next month, we understand that when the
exchange rate report indicates a near-zero probability of
increase in price, it means almost no chance of price change.
• However, if a 0.90 probability of change in price is reported,
we know that the price is likely to increase.
• A 0.50 probability indicates that the price is just as likely to
increase as not.
• Consider an event that probability that the rain will rain
tomorrow which is uncertain.
• If the probability is near to zero, it shows that the rain has
little/ no chance of raining; while when the probability near
to one, it implies higher chance of raining tomorrow.
• Managers and business accountant often base their
decisions on an analysis of uncertainties such as:-
What are the chances that sales will decrease if we
increase prices?
What is the likelihood a new assembly method will
increase productivity?
How likely is it that the project will be finished on
time?
What is the chance that a new investment will be
profitable?
What are the chances that women participation will
increase if they are educated?
What is the chance that crime will decrease if we
increase society mobilization?
Why is knowledge of probability necessary for the study of
statistical inference?
• In order to be able to say something about a population on the basis
of some sample evidence we must first examine how the sample data
are collected.
• In many cases, the sample is a random one, i.e. the observations
making up the sample are chosen at random from the population.
• If a second sample were selected it would almost certainly be
different from the first.
• Each member of the population has a particular probability of being
in the sample (in simple random sampling the probability is the same
for all members of the population).
• To understand sampling procedures, and the implications for
statistical inference, we must therefore first examine the theory of
probability.
• As an illustration of this, suppose we wish to know if a coin is fair,
i.e. equally likely to fall heads or tails. The coin is tossed 2 times and 2
heads are recorded. This constitutes a random sample of tosses of the
coin. What can we infer about the coin? If it is fair, the probability of
Fundamental concepts of probability
• To determine probability, you have to know the
definition and concepts of the following terms.
Experiment: Any process of observation or
measurement which generates well defined outcome.
• It may be probabilistic/random or non probabilistic
experiment.
Probability Experiment: It is an experiment that can be
repeated any number of times under similar conditions
and it is possible to enumerate the total number of
outcomes without predicting an individual outcome.
• It is also called random experiment.
• On any single repetition of an experiment, one and only
one of the possible experimental outcomes will occur.
• Several examples of experiments and their associated
outcomes follow.
Experiment Experimental Outcomes
Toss a coin Head, tail
Select a part for inspection Defective, non defective
Conduct a sales call Purchase, no purchase
Roll a die 1, 2, 3, 4, 5, 6
Play a football game Win, lose, tie
Sample Space: Set of all possible outcomes of a probability
experiment.
• It is the set of all experimental outcomes.
• Sample space can be Countable (finite or infinite) or
Uncountable.
• An experimental outcome is also called a sample point to
identify it as an element of the sample space.
Event: It is a subset of sample space.
• It is a statement about one or more outcomes of a random
experiment.
• Constitutes one or more possible outcomes of an
experiment.
• They are denoted by capital letters.
Outcome: The result of a single trial of a random
experiment.
• An experimental outcome is also called a sample point to
identify it as an element of the sample space.
• Example: suppose you roll a die and let A be the event of
odd numbers, B be the event of even numbers, and C be
the event of number 8.
• Solution: since the die has six sides you have, { 1,3,5},
{2,4,6 } and {}.
• Remark: If S (sample space) has n members then there are exactly
subsets or events.
Equally Likely Events: Events which have the same chance of
occurring.
Mutually Exclusive Events: Two events which cannot happen at
the same time/have no the same chance of occurrence.
Complement of an Event: The complement of an event A means
non-occurrence of A and is denoted by or and contains those
points of the sample space which don‘t belong to A.
Elementary Event: an event having only a single element or
sample point.
Compound events: Most practical problems require the calculation
of the probability of a set of outcomes rather than just a single
one, or the probability of a series of outcomes in separate trials.
Intersection: The intersection of two events say A and B denoted
by A ∩ B and read as the intersection of A and B or A and B. The
intersection of A and B are all elements which are common both in
A and B.
• Union: The Union of A and B denoted by A ∪ B or A
or B and read as A union B which is the set of
elements either in A or B or in both.
• Independent Events: Two events are independent if
the occurrence of one does not affect the probability
of the other occurring.
• Dependent Events: Two events are dependent if the
first event affects the outcome or occurrence of the
second event in a way the probability is changed.
Approaches of measuring probability
P(A) + P(A c ) = 1
Intersection of Two Events
• The intersection of events A and B is the set of all sample points
that are in both A and B.
• The intersection is denoted: A and B
• The joint probability of A and B is the probability of the
intersection of A and B, i.e. P(A and B)
• For example, let A = tosses where first toss is 1 {(1,1), (1,2), (1,3),
(1,4), (1,5), (1,6)} and B = tosses where the second toss is 5 {(1,5),
(2,5), (3,5),(4,5), (5,5), (6,5)}
• The intersection is {(1,5)}
• The joint probability of A and B is the probability of the
intersection of A and B, i.e. P(A and B) = 1/36
Union of Two Events
• The union of two events A and B, is the event containing all
sample points that are in A or B or both:
• Union of A and B is denoted: A or B
1 T T T 0
2 T T H 1
3 T H T 1
4 T H H 2
5 H T T 1
6 H T H 2
7 H H T 2
• Note
8 that the
H outcomeH “zero heads” H occurred
3 only once,
“one head” occurred three times, “two heads” occurred
three times, and the outcome “three heads” occurred only
once. That is, “zero heads” happened one out of eight times.
Thus, the probability of zero heads is one eight, the
probability of one head is three eights, and so on.
Number of Heads, X Probability of Outcome, P(X)
0 1/8 = 0.125
1 3/8 = 0.375
2 3/8 = 0.375
3 1/8 = 0.125
Total 8/8 = 1.000
Column
1 2315 7548 5901 8372 5993 6744
12
13
Statistical Inference
Inference
…but we
We want to can only
learn about calculate
population sample
parameters statistics
…
What is Statistical inference?
• Statistical inference/inferential statistics is the process
of drawing conclusions about population from samples
that are subject to random variation.
Estimation theory
• Estimation theory is a branch of statistics that deals with
estimating the values of parameters based on
measured/empirical data that has a random component.
• An estimate is a single value that is calculated based on
samples and used to estimate a population value
• An estimator is a function that maps the sample space to
a set of estimates.
• The entire purpose of estimation theory is to arrive at an
estimator, which takes the sample as input and
produces an estimate of the parameters with the
corresponding accuracy.
• There are three types of statistical inference:
Estimation, Hypothesis testing and Linear
regression analysis
Estimation:
• The objective of estimation is to determine the
approximate value of a population parameter on the
basis of a sample statistic.
• For example, the sample mean is employed to
estimate the population mean.
• We refer to the sample mean as the estimator
of the population mean .
• Once the sample mean has been computed, its
value is called the estimate.
• To be a good estimator it should be
Unbiasedness (mean): whether the mean of this
estimator is close to the actual parameter?
• Formally, an estimator ˆμ for parameter μ is said to
be unbiased if:
E(ˆμ) = μ .
Efficiency (variance): whether the standard
deviation of this estimator is close to the actual
parameter
• Suppose we are given two unbiased estimators for a
parameter.
• Then, we say that the estimator with a smaller
variance is more efficient.
Consistency (size): whether the probability distribution
of the estimator becomes concentrated on the parameter
as the sample sizes increases
• An unbiased estimator is said to be consistent if the
difference between the estimator and the target
population parameter becomes smaller as we increase
the sample size.
• Formally, an unbiased estimator ˆμ for parameter μ is
said to be consistent if V (ˆμ) approaches zero as n → ∞.
• Note that being unbiased is a precondition for an
estimator to be consistent.
• Example: The variance of the sample mean ̄ X is σ2/n,
which decreases to zero as we increase the sample size
n. Hence, the sample mean is a consistent estimator for
• There are two types of estimators
A point estimator is a statistic (that is, a function of
the data) that is used to infer the value of an
unknown parameter in a statistical model.
• A point estimator draws inferences about a
population by estimating the value of an unknown
parameter using a single value or point.
Statistics as Estimators for Parameters
Point estimator of the proportion
• Selecting an individual from this population is an
experiment that has ONLY two outcomes: either that
individual has the characteristic we are interested in
or does not.
• This kind of an experiment is called a Bernoulli
experiment that has two outcomes, a success or
a failure.
• Let a random sample of size n be taken from this
population.
• The sample proportion, denoted by p̂ (read p-hat),
given by
p̂
• is a point estimator of a population proportion p
whereas a specific value p̂ is a point estimate of the
population proportion.
• Here X is the number of individuals in the sample
with the specified characteristic.
• The sample proportion p̂ is a statistic that estimates
the population proportion, p , which is a parameter
• Example: A random sample of 50 households was
selected for a telephone survey. The key question
asked was, do you or any member of your
household own a cellular telephone with a built-
in camera? Of the 50 respondents, 15 said yes
and 35 said no. Find the point estimate of the
true proportion, p, of households with cellular
telephones with built-in cameras.
• Solution:
• The first group answering ―yes and the second
group answering ―No, such that the proporition of
saying yes is
p̂ = 0.3
Point estimator of the mean
• Let there be a population with unknown mean “μ”
and unknown standard deviation “σ”(lower case Greek
sigma).
• To estimate μ and, we draw a simple random sample of
size n: X1, X2……Xn
• Then compute the sample mean