Professional Documents
Culture Documents
METHODS FOR
MANAGEMENT
Session -10
RECAP
• Introduction – definition, types of statistics, levels of measurement
• Collection / compilation/ classification / tabulation
• Presentation – graphical and diagrammatic
• Measures of central tendencies
• Measures of dispersion
• Measures of skewness
• Exploratory data analysis
• Association between variables – covariance and correlation
• Probability – concepts, laws and Baye’s theorem
• Random variable- discrete and continuous
• Theoretical distributions – Binomial, Poisson and Normal distributions
• Sampling techniques , sampling distribution, central limit theorem and
estimation theory- types ( point and interval estimate)
Learning objectives
• Hypothesis testing
• Errors in TOH
• Level of significance
• Types – parametric and non parametric
• Steps involved in TOH
• Chi square
• Independence
• Goodness of fit (uniform)
TESTING
OF
HYPOTHESIS
Hypothesis
• An assumption to be tested
• If the sample statistic differs from the population
parameter, a decision must be made as to
whether or not this difference is significant
• If it is, the hypothesis is rejected. If not, it is
accepted
Set up a Hypothesis
• H0:Null Hypothesis
• No significant difference between the sample statistic
and the population parameter
• Any difference found is accidental, arising out of
sampling fluctuations
• H1: Alternate Hypothesis
• A hypothesis that is different from the null hypothesis
• If sample info leads us to reject H0, then accept H1
In hypothesis testing, we must stated the assumed or
hypothesized value of the population before we begin
sampling. This assumption is called the null hypothesis.
Two-tailed test
Is a significance test in which it will reject the null hypothesis
if the sample mean is significantly higher or lower then
hypothesized population mean.(i.e. there are two rejection
region)
Ho : # o
Terminologies
Significance level
Complementary concepts to confidence limits.
Probability of committing a TYPE 1 error, naming rejecting
the null hypothesis when in reality it is true.
There is no single standard or universal level of significance
for testing hypothesis.
The higher the significance level, the higher the probability
of rejecting a null hypothesis when it is true.
Set up a Significance level
• The confidence with which an experimenter
rejects or retains H0 depends on the level of
significance involved
• = 5%
• 5% chance that H0 is rejected when it should be
accepted
• 95% confident that we have made the right decision
• Willing to accept a 5% chance of being wrong to reject
H0
Suitable Test Statistic
D C O N D I T I O N
E H0 : True H0 : False
C Accept H0 Correct Decision TYPE 2 Error
I Confidence Level ()
S (1 - )
I Reject H0 TYPE 1 Error Correct Decision
() Power of Test
O
(1 - )
N
Type 1 error,
Is the error of rejecting a null hypothesis when it is true.
Type 11error,
Is the error of accepting a null hypothesis when it is actually
false.
Step 4 : calculation
Step 5 : inference
• Classical approach : if table value > calculated value – accept Ho
• P value approach : if p> accept Ho.
FLOW CHART FOR HYPOTHESIS TESTING
State H0 as well as H1
O E 2
2 =
E
where O = the observed frequency of any value E
= the expected frequency of any value
The obtained value from the formula is compared with
the value from 2 table for a given significance level and
the number of degrees of freedom.
Example:
Are technical support calls equal across all days of the week? (i.e., do calls follow a
uniform distribution?)
Sample data for 10 days per day of week:
= 1722
Logic of Goodness-of-Fit Test
If calls are uniformly distributed, the 1722 calls would be expected
to be equally divided across the 7 days:
1722
246 expected calls per day if uniform
7
Chi-Square Goodness-of-Fit Test: test to see if the sample results
are consistent with the expected results
Observed & Expected Frequencies
Observed Expected
oi ei
Monday 290 246
Tuesday 250 246
Wednesday 238 246
Thursday 257 246
Friday 265 246
Saturday 230 246
Sunday 192 246
( o e ) 2
2 i i
ei
• Reject H0 if
2 2
α
(with k – 1 degrees of
freedom) 0 2
Do not Reject H0
reject H0
2
Chi-Square Test Statistic
Contingency Tables
Situations involving multiple population
proportions
Used to classify sample observations according
to two or more characteristics
Also called a cross-tabulation table.
Example:
The following data concerning industrial accidents and absentees
classified according to the types of employee.
χ
2 O E 2
with d.f . (r 1)(c 1)
E
where:
O = observed frequency
E = expected frequency
r = number of rows
c = number of columns
Contingency Analysis
Example:
Left-Handed vs. Gender
Dominant Hand: Left vs. Right
Gender: Male vs. Female
for
Area
380 415 2
550 585 2
450 415 2
585 415 585 415 585 415 585 415
1225 0.002409 0.001709 0.002409 0.001709
i.e. 2cal =10.0891
Tabulated 2 for (2-1)(2-1)=1 d.f. at 5% level of significance is 3.841 i.e.
2tab=3.841.
Here we see that 2cal>2tab (10.0891>3.841)Ho is rejected i.e. it is highly
significant at 5% level of significance. Thus we conclude that nature of area is
related to voting preference in the election.
Alternative procedure: To calculate the value 2, we can use the following
formula:
Total
a b a+b
c d c+d
Total a+c b+d N = a+b+c+d
N = 620+380+550+450=2000
N ad bc 2000 620 450 380 550
2 2
2
10.09165
a b a c b d c d 1000 1170 830 1000
What is a Hypothesis
Of a test?
• A hypothesis is an I assume the mean AGE
of this class is 50!!!
assumption about the
population parameter. Am I correct? TEST IT!
• A parameter is a
characteristic of the
population, like its
mean or variance.
• The parameter must
be identified before
analysis.
• Steps:
• State the Null Hypothesis
• State its opposite, the Alternative Hypothesis
• Hypotheses are mutually exclusive &
exhaustive
• Sometimes it is easier to form the
alternative hypothesis first.
Hypothesis Testing Process
Assume the
population
mean age is 50.
(Null Hypothesis) Population
The Sample
Is X =20 @ m =50? Mean Is 20
No, not likely!
REJECT
Sample
Null Hypothesis
Reason for Rejecting H0
Sampling Distribution
Our sample
mean (20) we reject the
falls in the null hypothesis
tails!It’s not H0 that µ = 50.
likely!
Hypotyzed
population mean.
20 µ = 50 Sample Mean
Critical
Rejection
Regions Value
α “Area” of the
Rejection region
0
Level of Significance, α and
the Rejection Region
One tail (left) test
a Critical
H0: m 0
H1: m < 0 0 Value(s)
Rejection
Regions One tail (right) test
H0: m 0
a
H1: m > 0
0
H0: m = 0
Two tails test
H1: m ¹ 0 a/2
0
Errors in Making Decisions
• Type I Error
• Reject Null Hypothesis when it is True (“False
Positive”)
• Has Serious Consequences
• Probability of Type I Error Is α
Called Level of Significance
•
• Type II Error
• Do Not Reject Null Hypothesis when it is
False (“False Negative”)
• Probability of Type II Error Is β
( Power 1- β )
a &b Have an Inverse
Relationship
Reduce probability of
one error and the
other one goes up.
• Used to determine optimal strategies where a decision maker faced with several
decision alternatives and an uncertain, or risky pattern of future events.
• Decision – Definition
Choosing from
alternatives
Determination of
payoff
Identification of all
courses of action
(Strategies)
S3
(15) 0 150 300 300
S4
(20) -50 100 250 400
Solution – opportunity loss
States of nature
E1 (5) E2 (10) E3 (15) E4 (20)
S2 50 0 100 200
(10)
S3
(15) 100 50 0 100
S4
(20) 150 100 50 0
Decision Making Under Certainty
Manager knows which event will occur
pick the alternative with the best payoff
Possible Future Demand
Alternative Low High
Small facility 200 270
Large facility 160 800
Do nothing 0 0
Conservative
30 m 5m -10 m
Speculative
40 m 10 m -30 m
Counter cyclical
-10 m 0 15 m
• If the prior probabilities for improving economy, stable economy and worsening
economy are 0.1, 0.5 and 0.4, which investment would Warren consider?
Solution
E(conservative) = 0.1*30+0.5*5-0.4*10 =
E(Speculative) = 40*0.1+0.5*10 -30*0.4 =
E(counter cylic) = -10*0.1+ 0*0.5+ 10*0.4 =
Decision Trees