Professional Documents
Culture Documents
We can use the gradient boosting approach with base models other than regression trees.
Explain under which circumstances gradient boosting linear regression (with another linear
regression predicting residuals) will work better or worse than just a single linear regression
on the data and explain why.
For the linear regression considered here, don’t use any kernels or base function extension
(so the model is just linear).
Explain what happens with a random forest classification model if, all other hyperparameters
kept fixed, the number of trees in the ensemble grows (in the limit to ∞).
Will the classification accuracy on a test set not used for training go up or down? What about
the variance?