Hypothesis testing
Formulation of Hypothesis
In formulating the null hypothesis, the words “no”, “not”, “same”, or “independent” will usually be part of the stated hypothesis.
Importance of Hypothesis
Tests of Hypotheses
Parametric Tests – When the data are on a continuous scale (interval or ratio scale), parametric tests can be used.
Ex : of Parametric test
Non-Parametric Tests – When the data are on a categorical scale (nominal or ordinal), non-parametric tests can be used.
Ex: Chi-Square test, Kolmogorov-Smirnov D test, Wilcoxon Matched-Pairs test, etc.
Chi Square Analysis
Chi-Square is used to determine how the observed frequencies differ from expected
frequencies.
Chi-Square is a non-parametric test of hypothesis. It can be used for both nominal and ordinal data.
The chi-square goodness-of-fit test is used to analyze the distribution of frequencies across the categories of one variable. It is used with categorical data, such as age groups or number of bank arrivals, to determine whether the observed distribution of frequencies is the same as some hypothesized or expected distribution. However, the goodness-of-fit test cannot be used to analyze two variables simultaneously.
To test the hypothesis that training is significant in improving the skills of salesmen at the 5% significance level [table value at the 5% significance level is 9.488]
No. of respondents    15    32    9    12    2
Solution :
Null Hypothesis [H0]: Training is not significant in improving the skills of the salesmen.
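Formula for calculating Chi-Square:
$$\chi^2 = \sum \frac{(O - E)^2}{E}$$
where O is the observed frequency and E is the expected frequency. Under the null hypothesis the 70 responses are spread equally over the 5 categories, so E = 70 / 5 = 14 for each category.
O       E       O - E    (O - E)²    (O - E)² / E
15      14        1         1          0.071
32      14       18       324         23.14
9       14       -5        25          1.785
12      14       -2         4          0.28
2       14      -12       144         10.286
Total                                 35.56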
Table value at the 5% significance level for 4 degrees of freedom (5 categories - 1) from the chi-square table = 9.488
Conclusion
Since the calculated value (35.56) is greater than the table value, or critical value (9.488), we reject the null hypothesis and conclude that training is significant in improving the skills of salesmen.
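The calculation in this example can be checked with a short script. This is a minimal sketch, assuming (as the worked solution does) that the expected frequency is the same for every category. The variable names are illustrative, and the critical value is the one quoted in the problem.

```python
# Observed responses from the training example (five response categories).
observed = [15, 32, 9, 12, 2]

# Under H0 every category is equally likely, so each expected count is total / k.
total = sum(observed)          # 70
k = len(observed)              # 5
expected = total / k           # 14.0

# Chi-square statistic: sum of (O - E)^2 / E over all categories.
chi_square = sum((o - expected) ** 2 / expected for o in observed)

degrees_of_freedom = k - 1     # 5 - 1 = 4
critical_value = 9.488         # table value at 5% significance, 4 df (from the problem)

# Reject H0 when the calculated value exceeds the table value.
reject_null = chi_square > critical_value

print(f"chi-square = {chi_square:.2f}, df = {degrees_of_freedom}, reject H0: {reject_null}")
```

At full precision the statistic is about 35.57. The 35.56 in the worked solution comes from summing row values that were rounded first, and the conclusion is the same either way.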
Very Important           50
Somewhat Important       60
Somewhat Unimportant     40
Very Unimportant         30

Important                90
Somewhat Important       30
Neutral                  40
Somewhat Unimportant     30
Unimportant              50
Extremely Unimportant    60
3. The demand for a particular spare part in a factory was found to vary from day to day. In a sample study, the following information was obtained.
Test the hypothesis that the number of parts demanded does not depend on the day of the week. Test this at the 5% significance level [table value at 5% = 11.070].
Test whether the sales of milk are uniformly distributed over the 12 months [table value at the 5% significance level: 19.675].
January 1610
February 1585
March 1649
April 1590
May 1540
June 1397
July 1410
August 1350
September 1495
October 1564
November 1602
December 1655
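One way to set up this test is sketched below. It assumes that "uniformly distributed" means equal expected sales in every month, ignoring differences in month length. The dictionary name and variable names are illustrative only.

```python
# Monthly milk sales as listed in the problem.
milk_sales = {
    "January": 1610, "February": 1585, "March": 1649, "April": 1590,
    "May": 1540, "June": 1397, "July": 1410, "August": 1350,
    "September": 1495, "October": 1564, "November": 1602, "December": 1655,
}

observed = list(milk_sales.values())
total = sum(observed)
months = len(observed)

# H0: sales are uniform, so the expected sales are the same for each month.
expected = total / months

# Chi-square statistic: sum of (O - E)^2 / E over the 12 months.
chi_square = sum((o - expected) ** 2 / expected for o in observed)

degrees_of_freedom = months - 1   # 12 - 1 = 11
critical_value = 19.675           # table value at 5% significance (from the problem)

# Reject the uniformity hypothesis when the statistic exceeds the table value.
reject_uniform = chi_square > critical_value

print(f"expected per month = {expected:.2f}")
print(f"chi-square = {chi_square:.2f}, df = {degrees_of_freedom}, reject H0: {reject_uniform}")
```

The degrees of freedom are 11, which matches the table value of 19.675 quoted in the problem.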
Unit 9
Correlation Analysis
Correlation analysis is a statistical tool used to ascertain the association between two variables. It determines the nature (direction) and the strength (degree) of the relationship between the variables.
Ex: Income – Investment (as income increases, the level of investment also increases), IQ – Productivity
Methods of correlation
Bivariate correlation – Pearson product-moment formula and assumed mean (short-cut) method
Rank Correlation –
Karl Pearson's coefficient of correlation cannot be applied directly when the series cannot be measured numerically and can only be ranked according to size. In such cases it is convenient to rank the series and correlate the ranks. Rank correlation is used for qualitative phenomena.
Ex : Intelligence tests
Spearman rank correlation (for individual ranks) and the Edward Spearman formula (for tied or similar ranks)
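For reference, the usual textbook forms of these two formulas are sketched below. The symbols are notation assumed for this sketch rather than taken from the notes: d is the difference between the two ranks of a pair, n is the number of pairs, and m is the number of items sharing a tied rank.

```latex
% Individual (untied) ranks
r_s = 1 - \frac{6 \sum d^2}{n(n^2 - 1)}

% Tied or similar ranks: add (m^3 - m)/12 once for each group of m tied items
r_s = 1 - \frac{6 \left[ \sum d^2 + \sum \frac{m^3 - m}{12} \right]}{n(n^2 - 1)}
```

When no ranks are tied, the correction term is zero and the second formula reduces to the first.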
Find the value of r by using the Pearson product-moment formula for the following data:
X : 25 18 32 21 35 29
Y : 16 11 20 15 26 28
Solution
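The Pearson product-moment formula is also called the covariance method. It measures the joint variation between the variables.
$$r = \frac{n\sum xy - \sum x \sum y}{\sqrt{\left[n\sum x^2 - \left(\sum x\right)^2\right]\left[n\sum y^2 - \left(\sum y\right)^2\right]}}$$
X        Y        X²       Y²       XY
25       16       625      256      400
18       11       324      121      198
32       20       1024     400      640
21       15       441      225      315
35       26       1225     676      910
29       28       841      784      812
Total    160      116      4480     2462     3275   (∑x, ∑y, ∑x², ∑y², ∑xy)
With n = 6:
$$r = \frac{6(3275) - (160)(116)}{\sqrt{\left[6(4480) - 160^2\right]\left[6(2462) - 116^2\right]}} = \frac{1090}{\sqrt{(1280)(1316)}} \approx 0.84$$
r = 0.84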
Since the value of r lies between 0.75 and 1, there is a high positive correlation: X and Y are strongly correlated.
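The value of r for the X and Y series in this problem can be verified with a short script. This is a minimal sketch using the standard raw-score form of the Pearson product-moment formula. The variable names are illustrative.

```python
from math import sqrt

# Data from the problem.
x = [25, 18, 32, 21, 35, 29]
y = [16, 11, 20, 15, 26, 28]
n = len(x)

# Column totals needed by the formula.
sum_x = sum(x)
sum_y = sum(y)
sum_x2 = sum(v * v for v in x)
sum_y2 = sum(v * v for v in y)
sum_xy = sum(a * b for a, b in zip(x, y))

# r = (n*Sxy - Sx*Sy) / sqrt([n*Sx2 - Sx^2] * [n*Sy2 - Sy^2])
numerator = n * sum_xy - sum_x * sum_y
denominator = sqrt((n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
r = numerator / denominator

print(f"sum_x={sum_x}, sum_y={sum_y}, sum_x2={sum_x2}, sum_y2={sum_y2}, sum_xy={sum_xy}")
print(f"r = {r:.2f}")
```

The same script can be reused for the practice problems by replacing the two lists.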
Problem
Age (cars)            2    4    6    7    8    10   2
Maintenance (’00 )    16   15   18   19   17   21   20
Problem
Productivity 50 55 60 65 52 60 47 36 75
Experience(yrs) 49 72 74 44 58 66 50 30 35
Problem