Professional Documents
Culture Documents
11 - APM 1205 Linear Model
11 - APM 1205 Linear Model
Variable Transformation
Refer to the transformation It also help:
of independent/response
variables to improve or • Make coefficient more
address the following issues: interpretable
• Meet Model Assumptions
• Linearity of the model • Improve generalization
• Heteroscedasticity of the and predictive power
Model • Put predictors in the
• Normality of the error common scale
terms
Logarithmic Transformation
Use if there is a non linear relationship between dependent and independent variable
A log transformation is used to reduce the effect of outliers and to stabilize the
variance when the data is highly skewed
where
Centering is basically a technique where mean of independent
variables is subtracted from all the values. It means all independent
variables have zero mean.
Standardization or Normalization
• Standardization or Normalization is
the process of putting different
variables on the same scale
• Assumably Reduce multicollinearity
of the model where
• If variables are in the higher order,
its tends to have higher interaction
correlation with other variable
• The linear model coefficients can be
interpreted as the change in the
response (i.e. dependent variable)
for a 1 standard deviation increase in where
the predictor (i.e. independent
variable)
Comparison
Transformation Improve Meet Model Improve Model Produce
Coefficient Assumption generalization comparable
Interpretability predictors
Logarithmic X X
Square root X X
Standardization or X X
Normalization
Centering X