Professional Documents
Culture Documents
TUTORIAL 4
SECTION A
a) In the matrix form of the simple linear regression mode, the least squares estimator for is
β X X X Y where the element of X are fixed constants in laboratory experiment.
Derive E β̂ and Var β̂ .
b) For the multiple regression model, with an intercept term, complete the following part,
ˆ e
YY ˆ Xβˆ
Y
ˆ ˆ
prove that Y YY ee .
SECTION B
In an effort to decide whether or not to apply for a liquor license, a restaurant owner sampled ten
comparable restaurants in his town and recorded the total profits (Y), the food sales (X 1), and liquor
sales (X2) for the past month. His sample yielded the following results. (All figures are in thousands
rands).
y i =β 0 + β 1 x i1 + β 2 x i2 + ε i
The linear model where is the error term was proposed.
; ;
[ ]
31 . 4
X ' Y = 86 . 58
101 .37 and ∑ y 2= 107 .14
TUTORIAL 4 2024
β1 β2 .
a) Find the multiple regression equation and interpret the estimates of the and
b) Predict the total profits in a restaurant with R1700 food sales and with R3000 liquor
sales.
c) Test for the overall.
d) Determine the coefficient of multiple determination and interpret its value.
g) Construct and interpret a 95% confidence interval of the average profits in a restaurant with
R1700 food sales and with R3000 liquor sales.
SECTION C
The data consist of the 67 houses with Y = Sales Price of the house (“salesprice” on the output in
Table 2), and the three predictor variables are:
2|Page
TUTORIAL 4 2024
Table 1
------------------------------------------------------------------------------
salesprice | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
sqft100 | 9893.524 2508.561 3.94 0.000 4880.564 14906.48
bedrooms | -36018.68 14665.17 -2.46 0.017 -65324.68 -6712.675
lotsize | 2.512511 1.124531 2.23 0.029 .2653148 4.759707
intercept | 290558.1 75398.79 3.85 0.000 139885.6 441230.5
------------------------------------------------------------------------------
NOTE: In case you aren’t familiar with this notation, 1.18e+11 = 1.18 x 1011, as an example.
a) The estimated regression equation for the full model with all 3 variables, filling in numbers for
the coefficients is given by
Y^ = 290558.1 + 9893.524X1 – 36018.68X2 + 2.513X3.
Interpret the values of the three regression coefficients.
b) Is the overall model useful? Carry out an appropriate test and draw a conclusion using
α =0 . 05 .
c) Test for the contribution of each independent variable using p-value approach.
d) The 95% confidence interval for the intercept, β0, is given by (139885.6; 441230.5).
Draw conclusions about the tests of the other three coefficients using their confidence
intervals.
SECTION D
A statistician tried to fit a model to house prices as a function of the following variables as given in
Table 2:
Table 2
Tax Property taxes in thousands of rands
Age Age of the house in years
Bed Number of Bedrooms
Bath Number of Bathrooms
Space Living space in thousands of square metre
3|Page
TUTORIAL 4 2024
The response variable was the sale price of the house in thousands of rands. Below is an
ANOVA table from fitting this model.
Analysis of Variance
Sum of Mean
Source DF Squares Square F Value
Model a 675.85947 d f
Error b c e
Corrected Total 23 831.50958
(a) Give the complete ANOVA table by filling in the missing values a, b, c, d, e and f.
(b) State the null and alternative hypothesis that the F value (f) is testing. Give the conclusion of the
test for this fitted model at the significance level of 0.05.
4|Page