Professional Documents
Culture Documents
• Intercept = -28.85
• Slope Sqft = 0.171
• Slope LotSize = 6.78
• Slope Baths = 15.53
Estimation of Regression
SUMMARY OUTPUT
Regression Statistics
Multiple R 0.977722197
R Square 0.955940694
Adjusted R Square 0.950856928
Standard Error 20.29773478
Observations 30
ANOVA
Significance
df SS MS F
F
Regression 3 232413.739 77471.25 188.0379 9.66E-18
Residual 26 10711.94896 411.998
Total 29 243125.688
Lower Upper
Coefficients Standard Error t Stat P-value Lower 95% Upper 95%
95.0% 95.0%
Intercept -28.80558256 29.69997693 -0.96989 0.34104 -89.8548 32.24359 -89.8548 32.24359
Sqft 0.170903982 0.01544169 11.0677 2.46E-11 0.139163 0.202645 0.139163 0.202645
LotSize 6.777241714 1.420744663 4.770204 6.16E-05 3.856859 9.697624 3.856859 9.697624
Baths 15.52386756 9.204704708 1.686514 0.103661 -3.39667 34.44441 -3.39667 34.44441
Regression Modeling
Four Criteria for Regression Assessment
• Logic Is there an a priori reason to expect a causal
relationship between the predictors and the response
variable?
• Fit Does the overall regression show a significant
relationship between the predictors and the response
variable?
• Parsimony Does each predictor contribute significantly to the
explanation? Are some predictors not worth the trouble?
• Stability Are the predictors related to one another so
strongly that regression estimates become erratic?
ANOVA Table Format
Coefficient of Determination
Adjusted R2
ANOVA vs t-test
t-test ANOVA
checks whether two populations are checks whether three or more populations
statistically different are statistically different
T-value F- value
difference in means and the spread of the distributions (i.e., variance) across groups
lower p-value reflects a value that is more significantly different across populations
• ANOVA – ANalysis of Variance
• ANOVA is a hypothesis testing procedure that is used to evaluate
differences between 2 or more samples
Compare THREE sample
means to see if a difference Is one mean so far away
exists somewhere among from the other two that it is
them likely not from the same
populations?
Shows how far the mean it is away from the mean of the
larger sort of combined population
Oddball distribution, sort of
the one that doesn’t belong
in the same population as
the other two
Means are in very different
locations relative to the
overall mean
Variability
AMONG/BETWEEN
the sample means
Variability AROUND/
WITHIN the distributions
Example for ANOVA
ANOVA Table
• The total degrees of freedom (DF) are the amount of
information in your data.
• DF Num : degrees of freedom for the numerator to
calculate the probability of obtaining an F value that is at
least as extreme as the observed F value.
• DF DENOM : degrees of freedom for the denominator to
calculate the probability of obtaining an F value that is at
least as extreme as the observed F value.
Adj SS
Adj SS Term
The adjusted sum of squares for a term is the increase in the regression
sum of squares compared to a model with only the other terms. It quantifies
the amount of variation in the response data that is explained by each term
in the model.
Adj SS Error
The error sum of squares is the sum of the squared residuals. It quantifies
the variation in the data that the predictors do not explain.
Adj SS Total
The total sum of squares is the sum of the term sum of squares and the
error sum of squares. It quantifies the total variation in the data.
Adj MS
• The adjusted mean square of the error (also called MSE
or s2) is the variance around the fitted values.
F- Value
• P-value ≤ α: The differences between some of the means are statistically significant
If the p-value is less than or equal to the significance level, you reject the null
hypothesis and conclude that not all population means are equal. Use your
specialized knowledge to determine whether the differences are practically significant.
For more information, go to Statistical and practical significance.
• P-value > α: The differences between the means are not statistically significant
If the p-value is greater than the significance level, you do not have enough evidence
to reject the null hypothesis that the population means are all equal. Verify that your
test has enough power to detect a difference that is practically significant. For more
information, go to Increase the power of a hypothesis test.
T-value