Professional Documents
Culture Documents
Relationships
Association and Causation
Association
Correlation is not causation!
Data set FITNESS for correlation example
CORR PRODECDURE
Univariate Statistics
p i j Cov(Yi , Y j ) p j V (Y j )
= = 1−
( p − 1) V (Y0 ) ( p − 1) V (Y0 )
If the covariance of items is small compared to variance of their sum, it
implies that total variation is less due to Ti and more due to error (Ei) as
(Cov(T, E) =0 by assumption.
Hint : For obtaining second equality for alpha, Expand V(Yo) which is variance of (Y1+Y2+Y3+..Yp) and substitute
out the Cov terms from equation V(Yo) = ∑V(Yj) - ∑ (i≠j) CoV(Yi, Yj)
Cronbach's alpha is known as a measure of internal consistency. Generally,
this measure will increase as the inter-correlations among test items increase
Theoretically, if all test items measured the same construct, the inter-
correlations among them would be maximized. In this sense, the
Note: If the variances of the items V(Yj) vary much, you can standardize the
Items to unit variance before computing Cronbach’s alpha
Requesting Cronbach’s
alpha
Another Example: Fish Data
❑ Outliers:
Observations with large residuals, i.e. observations whose
actual values are far away from their predicted values. May
indicate data peculiarity or a data entry problem
y =α + β x + β x +u
1 2
1 2
We assume:
• E(u) = 0
• E(X.u) = 0
• No auto correlations
• No multicollinearity
Running regression
That is, the contribution of Alaska observation to the coefficient of variable SINGLE is given
by 0.14 times the standard errors of Bsingle [0.14 * BSingle ]
Therefore, if we exclude Alaska, the coefficient of Bsinglle will decrease by the contribution of
Alaska observation , or by 2.17. In other words the coefficient BSingle will decrease from
132.408 to 130.23 (= 132.408 - 2.17)