Professional Documents
Culture Documents
RESEARCH METHODOLOGY
Sl No Cricketer A Cricketer B
1 31 51
2 48 5
3 13 12
4 51 83
5 38 37
6 43 112
7 50 42
8 36 18
9 47 79
10 82 20
A n s we r :
Cricketer A :
a) μ( Mean) = (31+48+13+51+38+43+50+36+47+82)/10 = 439/10 = 43.9
❑
√
b) Standard deviation σ = ( ∑ ❑ x ❑−x)❑2 /n
❑
= (43.9-31) + (43.9-48)2 +(43.9-13)2 +(43.9-51)2 +(43.9-38)2 +(43.9-43)2 +(43.9-50)2 +
2
c) Variance = σ 2 = 278.49
Cricketer B :
√
b) Standard deviation σ = ( ∑ ❑ x ❑−x)❑2 /n
❑
= (45.9-51) + (45.9-5) +(45.9-12)2 +(45.9-83)2 +(45.9-37)2 +(45.9-112)2 +(45.9-42)2 +
2 2
c) Variance = σ 2 = 1016.29
d) Coefficient of variation : σ /μ* 100 = 31.88/45.9 * 100 = 69.46 %
A n sw e r :
Let X be t he num ber of bombs hit t ing a t arget is a binomial variat e, wit h t he
param et ers 2 = 4 , p = 1 /2= 0 .5 , q = 1 -p = 1 -0 . 5 = 0 .5
Th e PM F i s
P( x ) = n cx p x q (n -x ) ; x = 0 , 1 , 2 , …. n
= 4 c x ( 0 . 5 ) x ( 0 . 5 ) 4-x ; x = 0 ,1 , 2 , 3 ,4
a) P ( t h e b r i d g e i s d e st r o y e d )
= 4 C3 ( 0 . 5 ) 3 ( 0 . 5 ) 4-3 + 4 C4 ( 0 . 5 ) ( 0 . 5 )4-4
b) P (n on e of t h e b om b h it ) = p (x = 0)
= 4 C 0 ( 0 . 5 ) 0 ( 0 . 5 ) 4-0 = 0 . 1 2 9 6
A n s we r :
Probabilit y of defect ive pens (p) = 1/10 (given)
Probabilit y of non-defect ive pens (q) = 1-p = 9/10
No. of Trials , n= 12
a) Exact ly 2 defect ive pens
A n sw e r :
A research hypot hesis is a specific, clear, and t est able proposit ion or
predict ive st at em ent about t he possible out com e of a scient ific
research st udy based on a part icular propert y of a populat ion, such as
presum ed differences bet ween groups on a part icular variable or
relat ionships bet ween variables. Specifying t he research hypot heses is
one of t he m ost im port ant st eps in planning a scient ific quant it at ive
research st udy. A quant it at ive researcher usually st at es a priori
expect at ion about t he result s of t he st udy in one or m ore research
hypot heses before conduct ing t he st udy, because t he design of t he
research st udy and t he planned research design oft en is det erm ined by
t he st at ed hypot heses.
N u l l H y p o t h e si s
The null hypothesis is a general statement t hat states t hat t here is no
relat ionship between t wo phenomenon’ s under considerat ion or t hat
t here is no associat ion bet ween t wo groups.
A hypothesis, in general, is an assumpt ion t hat is yet t o be proved wit h
sufficient pieces of evidence. A null hypot hesis t hus is t he hypot hesis
a researcher is t rying t o disprove.
A null hypot hesis is a hypot hesis capable of being object ively
verified, t est ed, and even reject ed.
If a st udy is t o compare method A wit h m ethod B about t heir
relationship, and if t he study is preceded on t he assumption t hat both
m ethods are equally good, t hen t his assumption is t ermed as t he null
hypothesis.
The null hypot hesis should alw ays be a specific hypot hesis, i.e., it
should not st at e about or approxim at ely a cert ain value.
Ex a m p l e s
The follow ing are som e exam ples of null hypot hesis:-
1. If t he hypothesis is t hat “ the consumption of a particular medicine reduces
t he chances of heart arrest ” , t he null hypot hesis will be “ the
consumption of t he m edicine doesn’ t reduce t he chances of heart arrest .”
A l t e r n a t i v e H y p o t h e si s
An alt ernat ive hypot hesis is a st at em ent t hat describes t hat t here is a
relat ionship bet ween t wo select ed variables in a st udy.
An alt ernat ive hypot hesis is usually used t o st at e t hat a new t heory
is preferable t o t he old one (null hypot hesis).
This hypot hesis can be sim ply t erm ed as an alt ernat ive t o t he null
hypot hesis.
The alternative hypothesis is t he hypothesis t hat is t o be proved t hat
indicates t hat t he results of a study are significant and t hat t he sample
observation is not results just from chance but from some non-random
cause
If a study is t o compare method A wit h m ethod B about t heir
relat ionship and we assume t hat t he method A is superior or t he method
B is inferior, t hen such a statement is t ermed as an alternative
hypot hesis.
Alt ernat ive hypot heses should be clearly st at ed, considering t he
nat ure of t he research problem .
Ex a m p l e s
The follow ing are som e exam ples of alt ernat ive hypot hesis:
1. If a researcher is assum ing t hat t he bearing capacit y of a bridge
is m ore t han 10 t ons, t hen t he hypot hesis under t his st udy will be:
Null hypot hesis H0: µ= 10 t ons
Alt ernat ive hypot hesis Ha: µ> 10 t ons
b . Z -t e st
c. F-t e st
Answer: Here,
H0 = Education and family life are ‘independent’ in our population.
H1 = Education and family life are ‘dependent’ in our population.
In order to test the hypothesis for this problem, we need to use the Chi – square statistic and
as we have a contingency table of the size 2 x 2, we can use the Yates formula to find the
statistic.
Here, our calculated χ2 value is (1.78). But, in the Chi – square distribution table, at df = 1 and
5% level of significance, the χ2 value is (3.84).
As our calculated value of χ2 is less than the table value, we accept our null hypothesis and
reject the alternate hypothesis.
∴ For the above sample, we can say that the “Education and family life are ‘independent’
in our population”.
7) A scientist wants to know if education level and marital status are related for all
people in some country. He collects data on a simple random sample of n = 300 people,
part of which are shown below. Test the hypothesis that the two categorical variables,
education level and marital status are dependent at 5% level of significance.
Middle
High PhD and
school or Bachelor's Master's Total
School higher
lower
Never
18 36 21 9 6 90
Married
Married 12 36 45 36 21 150
Divorced 6 9 9 3 3 30
Widowe
3 9 9 6 3 30
d
Total 39 90 84 54 33 300
Answer:
Here,
H0 = Marital status and Education level are ‘independent’ in our population.
H1 = Marital status and Education level are ‘dependent’ in our population
In order to test the hypothesis for this problem, we need to use the Chi – square statistic. For
that we need to find the Expected values for the above-mentioned Observed values.
These expected frequencies are calculated as,
Eij = (Oi * Oj) / n where,
- Eij is an expected frequency;
- Oi is a marginal column frequency;
- Oj is a marginal row frequency;
Oij Eij (Oij – Eij) (Oij – Eij)2 (Oij – Eij)2 / Eij
Middle school or lower 18 11.7 6.3 39.69 3.39
12 19.5 -7.5 56.25 2.88
6 3.9 2.1 4.41 1.13
3 3.9 -0.9 0.81 0.21
High School 36 27 9 81 3
36 45 -9 81 1.8
9 9 0 0 0
9 9 0 0 0
Bachelor's 21 25.2 -4.2 17.64 0.7
45 42 3 9 0.21
9 8.4 0.6 0.36 0.04
9 8.4 0.6 0.36 0.04
Master's 9 16.2 -7.2 51.84 3.2
36 27 9 81 3
3 5.4 -2.4 5.76 1.07
6 5.4 0.6 0.36 0.07
PhD and higher 6 9.9 -3.9 15.21 1.54
21 16.5 4.5 20.25 1.23
3 3.3 -0.3 0.09 0.03
3 3.3 -0.3 0.09 0.03
Total 23.57
8) a) It is required to test whether those who practice yoga have average blood sugar
less than 120. A sample consisting of 17 persons who practice yoga is observed. If their
mean blood sugar is 108 and S.D is 8. What would you conclude? Also find the
confidence interval and interpret the results
Answer:
Given: µ =120; n =17, x̄ =108; s =8 α =5% (n <30 small sample test)
H0 : The average blood sugar is 120 (µ =120)
H1 : Average blood sugar is less than 120 (µ <120)
Under H0, the t-test statistic is
t= (x́ - μ) /s/√n
t= 108-120/8/√17 tcal= -6.1846
α= 0.05 and df= (n-1) = 16 and the test is one tailed therefore ttable= 1.75
since tcal< ttable, we accept H0
Conclusion: average blood sugar level is equal to 120
Confidence interval
When α= 0.05 the table value is given by tα= 1.75
̅ x́ ± tα s/√ n
x́ + tα s/√ n = 120 + 1.75(8*√ 17 ¿= 165.72
x́ - tα s/√ n = 120 - 1.75(8*√ 17 ¿=50.28
Therefore, since the population and sample mean lie within the region of these
intervals, we accept H0 .
b) The mean yearly sales of a product in a mall is 55.3. After advertising the mean
yearly sales in 17 stores is 63.8 and its SD is 6.2. Test whether the advertisement was
useful or not.
Answer :
H0 : advertisement was not successful; µ = 55.3
H1 : advertisement was successful; µ > 55.3
n= 17, x́= 63.8, SD = 6.2
t= (x́ - µ)
s/√n
t= 63.8-55.3
6.2/√ 17
tcal = 5.6526
At α = 5%, df = (n-1) = (17-1) = 16 and one tailed test.
Ttable = 1.746
Confidence interval
x́ ± tα s/√ n
x́ + tα s/√ n = 63.8 + 1.746(6.2*√ 17 ¿ = 66.42
x́ - tα s/√ n = 120 - 1.746(6.2*√ 17 ¿ = 61.17
Since tcal > tα , we reject Ho
Conclusion: Advertisement was successful
9) Tw o random sam ples were draw n from t w o norm al populat ions and
t heir values are:
A : 16,17,25,26,32,34,38,40,42
B : 14,16,24,28,32,35,37,42,43,45,47
Test whether t he t wo populations have t he same variance at 5% level of
significance.
Answer :
Given dat a:-
A : 16,17,25,26,32,34,38,40,42 n= 9
B : 14,16,24,28,32,35,37,42,43,45,47 n= 11
let ’s t ake hypothesis H0
σ12 = σ22
Ha: σ12 ≥ σ22
X̅A = 270/9 = 30
X̅B = 363/11 = 33
F= S12 /S 22 , S22 ≥ S22 , S2 = Σ(x – x̄)2 / n-1
S22 = 734/9-1 = 734/8 = 91.75
S12 = 1298/11-1 = 1298/10 = 129.8
F= 129.8/91.75
F= 1.4147 (calculat ed f value)
V1 = 11-1 = 10
V2 = 9-1 = 8
F0.05 = 3.35 (t abulat ed value)
Q10) Use the given data to find the minimum sample size required to estimate a
population proportion or percentage. You are the operations manager for American
Airlines and you are considering a higher fare level for passengers in aisle seats. You
want to estimate the percentage of passengers who now prefer aisle seats. How many
randomly selected air passengers must you survey? Assume that you want to be 95%
confident that the sample percentage is within 2.5 percentage points of the true
population percentage. Assume that a prior survey suggests that about 38% of air
passengers prefer an aisle seat.
Answer:
Answer:
Hypothesis testing is
H0: µ= 25450 (average income of people in Chicago represent the average income of people
in US.)
H1: µ≠ 25450 (average income of people in Chicago doesn’t represent the average income of
people in US.)
Given the data
x́= 26500 , σ = 2800, n= 250
The test statistic is given by,
x́−µ 26500−25450
,
Zcal= σ , Zcal= 2800 Zcal= 5.929
√n √ 250
At 5% LOS the Ztable is said to be 1.96
Since Zcal >Ztable value we reject H0
Confidence interval,
CI= µ±Zασ x́
X1= 25450-347.09
X1= 25102
X2 = 25450+347.9
X2= 25797.09
Conclusion: average income of people in Chicago doesn’t represent the average income of
people in US. The population mean lies within the confidence interval limit but the sample
mean doesn’t lie within the region.
Q12) A company is engaged in the packaging of a superior quality tea in jars of 500 gm
each. The company is of the view that as long as jars contain 500 gm of tea, the process
is in control. The standard deviation is 50 gm. A sample of 225 jars is taken at random
and the sample average is found to be 510 gm. Has the process gone out of control?
Answer:
510−500
z=
50/ √ 225
z=3
At 5% level of significance, The Z value for two tailed tests is ±1.96
Q13) An inventor has developed a new, energy-efficient lawn mower engine. He claims
that the engine will run continuously for 5 hours (300 minutes) on a single gallon of
regular gasoline. From his stock of 2000 engines, the inventor selects a simple random
sample of 50 engines for testing. The engines run for an average of 295 minutes, with a
standard deviation of 20 minutes. Test the null hypothesis that the mean run time is 300
minutes against the alternative hypothesis that the mean run time is not 300 minutes.
Use a 0.05 level of significance. (Assume that run times for the population of engines are
normally distributed.)
Answer:
Note that these hypotheses constitute a two-tailed test. The null hypothesis will be rejected if
the sample mean is too big or if it is too small.
For this analysis, the significance level is 0.05. The test method is a one-sample t-test.
Analyse sample data. Using sample data, we compute the standard error (SE), degrees of
freedom (DF), and the t statistic test statistic (t).
SE = s / sqrt(n) = 20 / sqrt(50) = 20/7.07 = 2.83
DF = n - 1 = 50 - 1 = 49
t = (x - μ) / SE = (295 - 300)/2.83 = -1.77
Since we have a two-tailed test, the P-value is the probability that the t statistic having 49
degrees of freedom is less than -1.77 or greater than 1.77.
We use the t Distribution Calculator to find P(t < -1.77) = 0.04, and P(t > 1.77) = 0.04. Thus,
the P-value = 0.04 + 0.04 = 0.08.
Since the P-value (0.08) is greater than the significance level (0.05), we cannot reject the null
hypothesis.
Q14) A high-end computer manufacturer sets the retail cost of their computers based
in the manufacturing cost, which is $1800. However, the company thinks there are
hidden costs and that the average cost to manufacture the computers is actually much
more. The company randomly selects 40 computers from its facilities and finds that the
mean cost to produce a computer is $1950 with a standard deviation of $500. Run a
hypothesis test to see if this thought is true.
Answer:
H0: μ ≤ 1800
H1: μ > 1800
Find the test statistic using the z-score formula:
Q15) a) A mall manager believes that his customers are equally divided between men
and women. Out of 600 sample customers there were 390men. At 5% LOS test the belief
of the mall manager is correct or not.
Answer:
Hypothesis testing
H0: π=0.5(customers aredivided equally between men and women)
H1: π ≠ 0.5 (customers are not divided equally between men and women)
Given,
p= 390/600 = 0.65
0.5∗0.5
σ π = π (1−π ) , σ π = √
√ n
Test statistic is given by
600
, σ π =0.020
p−π 0.65−0.5
Zcal= ,= , Zcal=7.5
σπ 0.020
The Ztable value at 5% LOS is ±1.96.
Since Zcal>Ztable, we reject H0.
Conclusion: customers are not divided equally into men and women.
b) The CEO of a company believes that 72% of his employees follow company rules.
To test this claim 550 samples were taken and it was found that 120 of these followed
the rules. At 1% LOS what can be concluded about the CEO.
Answer:
Hypothesis testing
H0: π=0.72(72% of employees follow company rules)
H1: π <0.72 (less than 72% of employees follow company rules)
Given,
p= 120/550 = 0.218
σ π = π (1−π ) , σ π = 0.72∗0.28 , σ π =0.019
√ n √
Test statistic is given by
550
p−π 0.218−0.72
Zcal= , = , Zcal=-26.42
σπ 0.019
The Ztable value at 1% LOS is ±2.33.
Since Zcal>Ztable, we reject H0.
Conclusion: less than 72% employees follow company rules.