Professional Documents
Culture Documents
ovariation
Variance: average amount of deviation of data from
mean
Covariance: simultaneous variation of data from
their means in two or more variables
Potential Relationship
Below the mean for one variable below the mean for other too
Above the mean for one variable above the mean for other too
Estimating Relationships
Cross-product deviations
◦ Same sign = positive relations
◦ Different sign = negative relation
Covariance
◦ Averaged sum of combined deviations
Problem:
◦ Covariance is Scale Dependent
◦ Can’t be used for comparisons
Standardization
A standard unit of measurement
◦ Meters, seconds, grams etc
◦ No standard units of measurements for
psychological variables i.e., attitudes, attributes etc
Solution
◦ Standard Deviation as standard unit
Deviation in standard units:
◦ observed deviation/standard deviation
Covariance in standard units
◦ Covariance/standard deviation
Problem Two variables = 2 sd
Correlation coefficient
Solution:
Standardization:
◦ Standard coefficient range (-1 to +1)
2. t-test
Confidence intervals for r
Spss doesn’t do the job
Manual computation
Analyze Correlate
Bivariate
Assumptions
Pearson’s r
◦ Interval level data
Probability testing
◦ Normal distribution
Exception
◦ One of the two variable can be a categorical but
with only two categories
Data is not normal or not at interval level
◦ Spearman’s rho (rs)
Small N with many categories
◦ Kendall’s tau (Ԏ )
Biserial and Point–biserial
One variable is dichotomous
Conceptual difference in dichotomy
Point–biserial correlation (r )
pb
◦ Discrete dichotomy (true dichotomy)
◦ rpb is simple bivariate correlation
(True Null rb = 0)
• b0 is the intercept.
• The intercept is the value of the
Y variable when all Xs = 0.
• This is the point at which the
regression plane crosses the
Y-axis (vertical).
Beta Values
bAdverts
b0
bairplay
Methods of Regression
Hierarchical:
◦ Experimenter decides the order in which
variables are entered into the model.
Forced Entry:
◦ All predictors are entered simultaneously.
Stepwise:
◦ Predictors are selected using their semi-
partial correlation with the outcome.
Hierarchical Regression
entered in a separate
step/block.
Experimenter makes the
decisions.
Hierarchical Regression
It is the best method:
◦ Based on theory testing.
◦ You can see the unique
predictive influence of a new
variable on the outcome because
known predictors are held
constant in the model.
Bad Point:
◦ Relies on the experimenter
knowing what they’re doing!
Forced Entry Regression
Allvariables are entered into
the model simultaneously.
The results obtained depend
on the variables entered into
the model.
◦ It is important, therefore, to have
good theoretical reasons for
including a particular variable.
Stepwise Regression I
Exam
Performance
Variance
Variance
explained
explained
(1.3%)
(1.7%)
Variance
accounted for Revision
by Revision Time
Time (33.1%)
Semi-Partial Correlation in Regression
Beta values:
◦ the change in the outcome
associated with a unit change in
the predictor.
Standardised beta values:
◦ tell us the same but expressed
as standard deviations.
Beta Values
b = 0.087.
1
◦ So, as advertising increases by
£1, record sales increase by
0.087 units.
b = 3589.
2
◦ So, each time (per week) a song
is played on radio 1 its sales
increase by 3589 units.
Click to edit Master text styles
Second level
Constructing a Model
Third level
Fourth level
Fifth level
y b0 b1 X 1 b2 X 2
Sales 41124 0.087Adverts 3589plays
◦ Standardized Residuals
Influential cases
◦ Cook’s distance
Standardized Residuals
In an average sample, 95% of
standardized residuals should lie
between 2.
99% of standardized residuals should
lie between 2.5.
Outliers
◦ Any case for which the absolute value of
the standardized residual is 3 or more, is
likely to be an outlier.
Cook’s Distance
must be met.
Violating these assumptions stops
This
assumption can be
checked with collinearity
diagnostics.
• Tolerance should be more than 0.2
(Menard, 1995)
Normalityof Errors:
◦ Normal probability plot.
Regression Plots
Homoscedasticity:
ZRESID vs. ZPRED
Good Bad
Normality of Errors: Histograms
Good Bad
Normality of Errors: Normal
Probability Plot
.75
.25
0.00
0.00 .25 .50 .75 1.00
Good Bad