Professional Documents
Culture Documents
600
500
400
300
200
100
0
0 5 10 15 20 25
We can see that the relationship is straight between the Advertising cost just as the sales which
has been accomplished.
The straight connection diagram is as per the following between the sales and the publicizing
cost is as per the following:
Both the sales just as the publicizing cost have been plotted as follows:
600
500
400
300
200
100
0
0 5 10 15 20 25
We have plotted the chart of the sales just as the commercial and we have seen that the
diagram is moving in similar ways. This shows that there is a straight relationship is there
among sales and notice. Our subsequent examination is about the relationship coefficient
between these two factors.
The connection coefficient will disclose to us the connection between these information. Is
there are positive relationship exists or is there a negative connection exists or is there is no
connection exists between them.
➢ Karl Pearson correlation coefficient is as follows :
The formula for the Karl person coefficient of the correlation is as follows:
𝑛 ∑ 𝑋𝑌−( ∑ 𝑋 ) ( ∑ 𝑌 )
r=
√( 𝑛 ∑ 𝑋 2 −(∑ 𝑋 ) 2 )𝑋 ( 𝑛 ∑ 𝑌 2 −(∑ 𝑌 ) 2 )
∑y = 208
∑XX = 3717395
∑XY = 90552.7
∑YY = 2472
𝑛 ∑ 𝑋𝑌−( ∑ 𝑋 ) ( ∑ 𝑌 )
r=
√( 𝑛 ∑ 𝑋 2 −(∑ 𝑋 ) 2 )𝑋 ( 𝑛 ∑ 𝑌 2 −(∑ 𝑌 ) 2 )
20 𝑋 90552.7−7969.90 𝑋 208
r=
√( 20 𝑋 3417395.23 −(7969.90) 2 )𝑋 ( 20 𝑋 2472 −(208 ) 2 )
1811054 −1657739.20
=
√( 68347904.6 –( 63519306.01)𝑋 ( 49440 –( 43264)
153314.80 153314.8
= =172688.809 = 0.88781
√( 4828598.59 𝑋 6176
We have calculated the correlation coefficient between these two variables and these
two variables are positive and it is 0.88781.
This implies we can say that there is a positive relationship and having an immediate effect.
This implies as we increment the commercial expense the sales increment and as we decline
the ad cost the sales diminishes. Our third investigation is to discover the relapse condition of
X on the Y.
This implies we need to have the relapse condition of the sales dependent on the ad.
➢ Regression Analysis of sales on Advertising
➢ Regression analysis of X on Y is as follows
➢ (X-X bar) = by (Y – Y bar)
𝑛 ∑ 𝑋𝑌−( ∑ 𝑋 ) ( ∑ 𝑌 )
➢ by = 2
𝑛 ∑ 𝑌 2 −(∑ 𝑌 ) )
20 𝑋 90552.7−7969.90 𝑋 208 153314.80
➢ bxy = =
20 𝑋 2472 −(208 ) 2 ) 49440 –( 43264)
600
500
400
300
200
100
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Answer 2.
The table given below is as follows for the single year age population that has been
taken from census 2011, the table which has been taken age-wise is as follows.
After summarization of the mean data, the mean has been calculated and the
mean is as follows:
∑ 𝑓𝑥
Mean value = ∑𝐹
= 2837747 / 112680 = 25.18
Variance
∑(𝑥 − 𝑀𝑒𝑎𝑛)2 𝑓
VAR(𝑋) =
∑𝑓
18165815.4
⇒ VAR(𝑋) = = 161.216
112680
The data has been summarized for the standard variation and the standard
deviation has been calculated as follows:
class Mid
interval Value Population
age group (X) (f) X*F X - Xbar (X-Xbar)2 (X-Xbar)f
1 to 10 5 18146 90730 -20.18 407.23 7389639.13
11 to 20 15.5 20895 323872.5 -9.68 93.70 1957911.65
21 to 30 25.5 34145 870697.5 0.32 0.10 3496.448
31 to 40 35.5 24453 868081.5 10.32 106.50 2604303.19
41 to 50 45.5 15041 684365.5 20.32 412.90 6210465
Sum 50 112680 2837747 1020.44 18165815.4
The data for the Less than Ogive has been summarized and the graph has been
prepared which is as follows:
Less than
Ogive graph
120000
100000
80000
60000
40000
20000
0
1 to 10 11 to 20 21 t0 30 31 to 40 41 to 50
100000
80000
60000
40000
20000
0
1 to 10 11 to 20 21 t0 30 31 to 40 41 to 50
Population (f)
40000
35000
30000
Population (f)
25000
20000
15000
10000
5000
0
1 to 10 11 to 20 21 t0 30 31 to 40 41 to 50
class interval age group
Conclusion:
We have calculated the mean and found the mean age for the details provided is
as follows,
And the standard deviation is 0.094. This shows that when the data for the 112680
number has been collected the mean age is 24.76 between the age of 1 year to 50
years.
The deviation which has been calculated and found to be very less and is 0.094.
Hence, we data are averagely divided between all the age groups and this could be
seen from the deviation also.
The graph of the histogram is also average compare to the age of 21 to 30 years
where the frequency of the data is maximum. Rest all the data are under the
average figure only.
Answer 3 a:
Calculation of the forecast when the alpha value is 0.20, 0.50, and 0.80 is as
follows:
Ft = α x A t-1) + ( 1- α ) X ( F( t- 1) )
The calculation of the MSE, as well as MAD, is as follows when the alpha = 0.20
∑ 𝑎𝑏𝑠𝑜𝑙𝑢𝑡𝑒 ( 𝐴𝑡−𝐹𝑡 )
The MAD = = 4738.47 / 21 = 225.64
𝑛
∑ ( 𝐴𝑡−𝐹𝑡 )( 𝐴𝑡−𝐹𝑡)
The MSE = = 1555814.03 / 21 = 74086.38
𝑛
The calculation of the MAD and the MSE when the alpha is 0.50 and 0.80 is as
follows :
The alpha value of 0.20 is the relatively near to the actual values.
Answer 3 b:
The Z curve when the car travels more than 70 Miles per gallon.
The Z value for the car that travels more than 70 miles per gallon is as follows:
70
Mean 65
65 mean
value
3:
The car travels between 55 and 70 miles per gallon is as follows:
More than
55 and less
Normal Z curve than 70
55 65 70