Professional Documents
Culture Documents
Assignment - 1
Submitted By
George Thomas
P22251
a. Identify the type of data
b. Number of variables
c. Number of Observations
d. Draw a scatter plot between the variables and check for any association between the variables.
g. Identify the variables which are having a strong association (use correlation matrix) h. Draw box plot
for the first two variables.
Answers
a. The data is TIME SERIES data
b. There are 9 variables
c. There are 9 observations ranging from 27 February to 7 March
NO - NO2
40
35
30
25
20
15
10
0
0 20 40 60 80 100 120
d.
PM2.5 - PM10
200
180
160
140
120
100
80
60
40
20
0
0 20 40 60 80 100 120 140
e.
g. If correlation value of any variable is greater than +0.7 and -0.7 there is association between
variables
Variables Correlation
PM10 - PM2.5 0.9882
NOx – NO 0.97695
NH3 - NO 0.96672
CO – PM2.5 0.7837
CO – PM10 0.76799
OZONE – NO2 0.9517
NH3 - NOx -0.9720
h.
ASSIGNMENT 2
Create two columns of numbers in excel sheet with first column having numbers from
1,2,3,4,5,6,7,8,9, 10 and second column having numbers from 2,4,6,8,10,12,14,16,18,20.
It is seen that the correlation between the variables in columns 1 and 2 & columns 3 and 4 is the
same. It does not change with a change in the variables, whereas the covariance value changes
with a change in the variables. So it can be concluded that correlation is insensitive to the scale
of the variable, and covariance is sensitive to the scale of the variable.