Soubhagya Dash
PGP/25/116
AIB - A
Note: The preceding plots make it clear that overfitting begins around the fifth degree
and worsens at higher degrees. Consequently, the degree of our polynomial should be at most 5.
The best degree therefore lies between 2 and 5.
5. At this point, we experiment with different polynomial degrees (ranging from 1 to 20) and
measure the training and testing errors for each degree.
Additional Learning: The data is split into training and test sets in an 80:20 ratio.
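The degree sweep and the 80:20 split described above can be sketched roughly as follows. This is only an illustration, not the code used for this report: the data is a synthetic noisy cubic generated in the snippet (an assumption, since the report's dataset is not reproduced here), and the names rmse and rmse_by_degree are hypothetical helpers. NumPy's Polynomial.fit is used for the least-squares fit.

```python
import numpy as np

rng = np.random.default_rng(0)

# Synthetic stand-in data (assumption, NOT the report's dataset):
# a noisy cubic sampled on [-1, 1].
x = rng.uniform(-1.0, 1.0, size=200)
y = 1.0 + 2.0 * x - 3.0 * x**3 + rng.normal(0.0, 0.3, size=200)

# Shuffle the indices, then split 80:20 into training and test sets.
idx = rng.permutation(len(x))
n_train = int(0.8 * len(x))
train_idx, test_idx = idx[:n_train], idx[n_train:]
x_train, y_train = x[train_idx], y[train_idx]
x_test, y_test = x[test_idx], y[test_idx]


def rmse(y_true, y_pred):
    """Root mean squared error between observed and predicted values."""
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


def rmse_by_degree(max_degree=20):
    """Fit one polynomial per degree and record (degree, train RMSE, test RMSE)."""
    results = []
    for degree in range(1, max_degree + 1):
        # Least-squares polynomial fit on the training data only.
        model = np.polynomial.Polynomial.fit(x_train, y_train, degree)
        results.append(
            (degree, rmse(y_train, model(x_train)), rmse(y_test, model(x_test)))
        )
    return results


results = rmse_by_degree()
for degree, train_err, test_err in results:
    print(f"degree={degree:2d}  train RMSE={train_err:.3f}  test RMSE={test_err:.3f}")
```

Fitting on the training set only and scoring on both sets is what lets the two error curves be compared degree by degree.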
6. The snapshot of the data is attached below:
Additional Learning: Overfitting can be recognized as the point where the testing error
becomes noticeably greater than the training error, so we should pick a degree no higher than 5.
7. If we plot the RMSE of the training set and of the testing set against the polynomial degree, we can observe overfitting.
Additional Learning: Error on the test set first decreases, but then gradually climbs once a
certain threshold of complexity is reached (around degree 5). Error on the training set, by
contrast, keeps decreasing as the model complexity is increased. Therefore, the best
complexity for the model is degree five, which gives both a low bias and a low
variance.
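The selection rule implied above (take the degree where the test error is lowest) can be written as a short sketch. The function name best_degree is hypothetical, and the numbers in illustrative are made-up placeholders chosen only to show the shape of the curves; they are not the results of this report.

```python
def best_degree(results):
    """Return the degree with the lowest test RMSE.

    results: iterable of (degree, train_rmse, test_rmse) tuples.
    """
    return min(results, key=lambda row: row[2])[0]


# Made-up placeholder values (assumption, NOT measured results):
# train RMSE keeps falling, test RMSE falls and then rises again.
illustrative = [
    (1, 0.90, 0.95),
    (2, 0.60, 0.66),
    (3, 0.45, 0.50),
    (5, 0.40, 0.44),
    (8, 0.36, 0.58),
    (12, 0.30, 0.90),
]

print(best_degree(illustrative))
```

Choosing by test error rather than training error matters here, because the training error alone would always favour the highest degree.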
I would like to conclude my report here.