Professional Documents
Culture Documents
(Information Technology)
(Semester IV)
2018-19
By
Ms. Maitreyi Joglekar
a. Explain-
i) Mean Deviation ii) Variance
Ans i) Mean Deviation-
Consider a data with N observations X1, X 2…… X n occurring with the frequencies
f1 , f2 …fn , then Mean deviation is denoted by MD and is defined as-
∑ 𝑓 |𝑋−𝑋̅|
MD = ∑𝑓
𝑀𝐷
Coefficient of Mean Deviation= 𝑋̅
∑ 𝑓𝑋
Where 𝑥̅ = ∑𝑓
ii) Variance-
Variance is a square of standard deviation.
∑(𝑋 − ̅̅̅
𝑋)2 𝑋2
𝑆. 𝐷 = 𝜎 = √ = √∑ − 𝑋̅ 2
𝑁 𝑁
∑𝑋
Where 𝑋̅ = 𝑚𝑒𝑎𝑛 = 𝑁
𝜎
Coefficient of SD= 𝑋̅
Variance= 𝜎 2
b. Calculate mean and std. deviation.
∑ 𝑓 |𝑋−𝑋̅|
Mean Deviation= MD = ∑𝑓
= 0.00324
̅̅̅̅2
∑(𝑋−𝑋) 𝑋2
𝑆. 𝐷 = 𝜎 = √ 𝑁
= √∑ 𝑁
− 𝑋̅ 2=
=11.877
𝒄 𝒊𝑵
𝑸𝒊 = 𝑳𝟏 + { ( − 𝒄𝒇)} where i= 1, 2, 3
𝒇 𝟒
Q1= 195
Q3= 225. 31
Class f( X- X
Interval Mark F fx X- X bar bar)
-
0-10 5 8 40 -32.667 261.336
10 to -
20 15 12 180 -22.667 272.004
20 - 30 25 20 500 -12.667 -253.34
30-40 35 25 875 -2.667 -66.675
40-50 45 15 675 7.333 109.995
50-60 55 9 495 17.333 155.997
60-70 65 6 390 27.333 163.998
70-80 75 5 375 37.333 186.665
80-90 85 5 425 47.333 236.665
105 3955 65.997 -0.035
Mean- 37.667
∑ 𝑓 |𝑋−𝑋̅|
Mean Deviation= MD = ∑𝑓
Mean deviation= 0.00033
e. Compute median
Example:
data=c("East", "West", "south", "North")
factdata=factor(data)
print(factdata)
A data frame is a table or a two-dimensional array-like structure in which each
column contains values of one variable and each row contains one set of values
from each column.
Data frames are created in R as follows:
emp.data=data.frame(emp_id=c (1:3),emp_name=
c("Rick","Dan","Michelle"),salary = c(1000,1500,2000))
print(emp.data)
Question 2
b. Define-
i) Raw moments ii) Central Moments
Ans Central Moments- Arithmetic mean of various powers of deviations taken from mean 𝑋̅ .
∑(𝑋 − 𝑋̅)𝑟
𝜇𝑟 =
𝑁
∑ 𝑓(𝑋 − 𝑋̅)𝑟
𝜇𝑟 =
∑𝑓
Raw Moments- Arithmetic mean of various powers of deviations taken from the origin.
∑ 𝑋𝑟
𝜇𝑟′ =
𝑁
′
∑ 𝑓𝑋 𝑟
𝜇𝑟 =
∑𝑓
𝜇1 =0
𝜇2 = 𝜇2′ − ( 𝜇1′ )2
𝜇3 = 𝜇3′ − 3 𝜇1′ 𝜇2′ + 2( 𝜇1′ )3
𝜇4 = 𝜇4′ − 4 𝜇1′ 𝜇3′ + 6( 𝜇1′ )2 𝜇2′ − 3( 𝜇1′ )4
c. If a pair of dice is thrown and X denotes sum of numbers on them. Find PDF of X. Also
find expectation of X.
Ans Let X- denote sum of the numbers appearing on dice.
X can take values starting from 2-12.
Total sample space will have 36 combinations.
So PDF of X-
• Mean of X – E(X)
𝐸(𝑋) = ∑ 𝑥 𝑃(𝑥)
𝐸(𝑋) = 7
𝜇
Coefficient of Kurtosis= 𝜎44 - 3
∑ X 2 = 168
s.d= 𝜎 = 2.3734
Median= 4
3(𝑀𝑒𝑎𝑛 − 𝑀𝑒𝑑𝑖𝑎𝑛)
𝑆𝑘 =
𝜎
𝑆𝑘 = 0.3611
Question 3
a. A random sample of 100 balls gave 10 % defective balls. Find 99 % confidence limits
for the percentage of defective balls. In consignment.
Ans Given-
P= 10/100= 0.1
q = 1-p
q= 0.9
N= 100
To find 99% confidence limits, critical value of Z is- 𝒁𝒄 = 2.58
𝑝𝑞
𝜎𝑝 = √ 𝑁 = 0.03
99 % confidence limits:
𝑷 ± 𝒁𝒄 𝝈𝒑
= (0.1 -2.58* 0.03), (0.1 + 2.58* 0.03)
(0.0226, 0.1774)
c. For a 100 sample, 35 are working as professors. Construct 95% confidence interval for
probability intervals.
Ans Given-
P= 35/100= 0.35
q = 1-p
q= 0.65
𝑝𝑞
𝜎𝑝 = √ 𝑁 = 0.047
99 % confidence limits:
𝑷 ± 𝒁𝒄 𝝈𝒑
= (0.35 -1.96* 0.047), (0.35 +1.96* 0.047)
(0.304, 0.442)
Z score-
f. A car manufacturer claims that 40 % of all cars build will be still running after 10 years.
A random sample of 400 cars showed 150 cars will still run after 10 years. Test claim
for 1% LOS.
Ans Population Sample
p=0.4 P = 150/400= 0.375
q= 0.6 N = 400
Ho: p= 0.4
H1: p is not 0.4
Z score-
Z= - 1.02
Using two tailed test for LOS 1%
Critical value of Z is 2.58.
Z < 2.58
Hence, Accept H0.
Question 4
a. Test the significance of the following rank correlation coefficient at 5% level-
R = 0.139, n = 10
Ans At 5% level of significance-
So for two tailed test we should accept 𝐻0 since here 𝑅 = 0.139 which lies in between
−0.6485 𝑡𝑜 0.6485
Therefore,
We can say that there is no significant correlation at 5% level.
b. 20 % of apples in large consignment are found to be bad for probability that at least 25%
appples are bad in sample size of 400 drawn from it.
Ans Population Sample
p=20/100= 0.2 N= 400
q= 1-0.2= 0.8 P= 0.25
c. 20 samples of size 100 are selected. Find the expected no of samples that will have at
least 14 defective blades if total consignment have 10 % defective.
Ans N= 100
P= 0.1
Poison’s distribution-
e-m mx
P(X) =
X!
Where m= np
m= 100*0.1= 10
e. It is known that 30 % male adults are unmarried. A sample of 100 male adults is selected.
Find the chance that-
a. 25-32 %
b. At most 33 % unmarried adults
Ans N= 100
P= 0.3
m=nP= 30
Poison’s distribution-
e-m mx
P(X) =
X!
a) P(25-32%)= P(25) + P(26) + P(27) + P(28) + P(29) + P(30) + P(31) + P(32) = 0.45
b) P (At most 33)= 0.765
f.
Question 5
a. Explain-
i. coefficient of correlation ii. Std. error of estimate
Ans
Ans
Ans
d.
Ans
Ans
f.
Ans