
10 Deadly Sins of ML Model Training


These mistakes are easy to overlook but costly to redeem

Sandeep Uttamchandani

Apr 22 · 3 min read

Photo by Frank Vessia on Unsplash

ML model training is the most time-consuming and resource-expensive part of the overall model-building journey. Training is by definition iterative, but somewhere during the iterations, mistakes seep into the mix. In this article, I share the ten deadly sins of ML model training: these are the most common mistakes as well as the easiest to overlook.

Ten Deadly Sins of ML Model Training


1. Blindly increasing the number of epochs when the model is not converging
During model training, there are scenarios when the loss-epoch graph keeps bouncing around and does not seem to converge irrespective of the number of epochs. There is no silver bullet, as there are multiple root causes to investigate: bad training examples, missing ground-truth labels, changing data distributions, or too high a learning rate. The most common one I have seen is bad training examples, typically a combination of anomalous data and incorrect labels.
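One practical way to find such examples is to rank the training data by per-example loss before adding more epochs. A minimal sketch in PyTorch, assuming a trained `model` and an unshuffled `loader` over the training set (both hypothetical names):

```python
import torch
import torch.nn.functional as F

def find_suspect_examples(model, loader, top_k=20):
    # Rank training examples by individual loss to surface anomalous data and suspect labels.
    model.eval()
    losses, indices = [], []
    with torch.no_grad():
        for batch_idx, (x, y) in enumerate(loader):
            logits = model(x)
            # reduction="none" keeps one loss value per example instead of the batch mean
            per_example = F.cross_entropy(logits, y, reduction="none")
            losses.append(per_example)
            indices.append(torch.arange(len(y)) + batch_idx * loader.batch_size)
    losses, indices = torch.cat(losses), torch.cat(indices)
    # The highest-loss examples are the first candidates for bad data or wrong labels
    top = torch.topk(losses, k=min(top_k, len(losses)))
    return indices[top.indices], top.values
```

Reviewing the top of this list is usually much cheaper than another round of training.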

2. Not shuffling the training dataset


Sometimes the model seems to be converging, but then the loss value suddenly increases significantly, i.e., the loss decreases for a while and then spikes as the epochs progress. There are multiple reasons for this kind of exploding loss. The most common one I have seen is outliers in the data that are not evenly distributed/shuffled. Shuffling is an important step in general, including for cases where the loss shows a repeating step-function pattern.
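A minimal sketch of what shuffling can look like in practice, assuming a PyTorch-style setup; the arrays `X`, `y` and the `train_dataset` object are placeholders:

```python
import numpy as np
from torch.utils.data import DataLoader

# One-off shuffle for an in-memory dataset held as numpy arrays
perm = np.random.permutation(len(X))
X, y = X[perm], y[perm]

# Or let the DataLoader re-shuffle every epoch so outliers are spread across batches
loader = DataLoader(train_dataset, batch_size=64, shuffle=True)
```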

3. In multiclass classification, not prioritizing specific per-class accuracy metrics

For multiclass prediction problems, instead of tracking just the overall classification
accuracy, it is often useful to prioritize the accuracy of specific classes and iteratively
work on improving the model class by class. For instance, in classifying different forms
of fraudulent transactions, focus on increasing the recall of specific classes (such as
foreign transactions) based on business needs.
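A small example with scikit-learn, assuming `y_true` and `y_pred` already exist; the class name `foreign_transaction` is hypothetical:

```python
from sklearn.metrics import classification_report, recall_score

# Per-class precision/recall/F1 instead of a single overall accuracy number
print(classification_report(y_true, y_pred))

# Prioritize the recall of one business-critical class
recall_foreign = recall_score(
    y_true, y_pred, labels=["foreign_transaction"], average="macro"
)
print(f"foreign_transaction recall: {recall_foreign:.3f}")
```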

4. Assuming specificity will lead to lower model accuracy


Instead of building a generic model, imagine building a model for a specific geographic
region or specific user persona. Specificity will make the data more sparse but can lead
to better accuracy for those specific problems. It is important to explore the specificity
and sparsity trade-off during tuning.
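As a rough sketch of exploring that trade-off, assuming a pandas DataFrame `df` with hypothetical `region`, `label`, and feature columns:

```python
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

# Restrict to one region: sparser data, but a more specific problem
region_df = df[df["region"] == "EMEA"]
X_tr, X_te, y_tr, y_te = train_test_split(
    region_df[feature_cols], region_df["label"], test_size=0.2, random_state=0
)
region_model = LogisticRegression(max_iter=1000).fit(X_tr, y_tr)
print("region-specific accuracy:", region_model.score(X_te, y_te))
```

Comparing this number against the generic model's accuracy on the same region shows whether the specificity is worth the sparsity.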

5. Ignoring prediction bias


Prediction bias is the difference between the average of predictions and the average of labels in the dataset. Prediction bias serves as an early indicator of model issues: a significantly nonzero prediction bias is indicative of a bug somewhere in the model. There's an interesting Facebook paper on this in the context of ad click-through rate (CTR) prediction. Typically, the bias is most useful to measure across prediction buckets.
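One way to measure it, assuming binary labels `y_true` and predicted probabilities `y_prob` as numpy arrays:

```python
import pandas as pd

# Overall prediction bias: average prediction minus average label
bias = y_prob.mean() - y_true.mean()
print(f"overall prediction bias: {bias:+.4f}")

# Bucket the predictions and compare mean prediction vs. mean label per bucket
buckets = pd.cut(y_prob, bins=10)
report = pd.DataFrame({"pred": y_prob, "label": y_true}).groupby(buckets, observed=True).mean()
report["bias"] = report["pred"] - report["label"]
print(report)
```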

6. Calling it a success just on model accuracy numbers


An accuracy of 95% means 95 of 100 predictions were correct. Accuracy is a flawed metric when there is class imbalance in the dataset. Instead, dig into metrics such as precision and recall and how they relate to the overall user-facing goal, e.g., in spam detection or tumor classification.
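A small self-contained scikit-learn example of why this matters: on an imbalanced dataset, a model that always predicts the majority class scores high accuracy while having zero recall on the class you actually care about.

```python
import numpy as np
from sklearn.metrics import accuracy_score, precision_score, recall_score

y_true = np.array([0] * 95 + [1] * 5)   # 5% positive class (e.g. spam, tumor)
y_pred = np.zeros(100, dtype=int)       # a "model" that always predicts negative

print("accuracy :", accuracy_score(y_true, y_pred))                    # 0.95
print("precision:", precision_score(y_true, y_pred, zero_division=0))  # 0.0
print("recall   :", recall_score(y_true, y_pred))                      # 0.0
```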

7. Not understanding the impact of regularization lambda


Lambda is a key parameter in striking the balance between simplicity and training-data fit. High lambda → simple model → possibly underfitting. Low lambda → complex model → potentially overfitting your data (it won't be able to generalize to new data). The ideal value of lambda is one that generalizes well to previously unseen data; it is data-dependent and requires analysis.
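One simple way to find that value with scikit-learn, assuming `X_train`, `y_train`, `X_val`, `y_val` already exist; note that scikit-learn's `C` parameter is the inverse of lambda:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

best_C, best_val = None, -np.inf
for C in [0.001, 0.01, 0.1, 1, 10, 100]:  # small C = large lambda = simpler model
    model = LogisticRegression(C=C, max_iter=1000).fit(X_train, y_train)
    val_acc = model.score(X_val, y_val)
    if val_acc > best_val:
        best_C, best_val = C, val_acc
print("best C (1/lambda):", best_C, "validation accuracy:", best_val)
```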

8. Using the same test set over and over


The more the same data is used for parameter and hyperparameter tuning, the less confidence you can have that the results will actually generalize. It is important to keep collecting new data and adding it to the test and validation sets.
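A quick way to keep the splits honest with scikit-learn, assuming `X` and `y` exist; the test set is touched only once at the very end:

```python
from sklearn.model_selection import train_test_split

# 60% train, 20% validation, 20% test
X_tmp, X_test, y_tmp, y_test = train_test_split(X, y, test_size=0.2, random_state=0)
X_train, X_val, y_train, y_val = train_test_split(X_tmp, y_tmp, test_size=0.25, random_state=0)
# Tune parameters and hyperparameters against (X_val, y_val) only;
# evaluate on (X_test, y_test) once, at the very end.
```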

9. Not paying attention to initialization values in neural networks

Given that optimization in neural networks is non-convex, initialization matters.
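For example, in PyTorch the initialization can be set explicitly rather than relying on the defaults (the layer sizes here are arbitrary):

```python
import torch.nn as nn

def init_weights(m):
    # Kaiming (He) initialization is a common choice for ReLU networks
    if isinstance(m, nn.Linear):
        nn.init.kaiming_normal_(m.weight, nonlinearity="relu")
        nn.init.zeros_(m.bias)

model = nn.Sequential(nn.Linear(64, 128), nn.ReLU(), nn.Linear(128, 10))
model.apply(init_weights)
```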

10. Assuming wrong labels always need to be fixed


When wrong labels are detected, it is tempting to jump in and get them fixed. It is important to first analyze the misclassified examples for the root cause. Oftentimes, errors due to incorrect labels are only a small percentage of the total. There may be a bigger opportunity in training better for the specific data slices that are the predominant root cause.
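One way to do that root-cause analysis with pandas, assuming a DataFrame `df` with hypothetical `prediction`, `label`, and `slice` columns (e.g. country or device type):

```python
# Misclassified examples, broken down by slice
errors = df[df["prediction"] != df["label"]]
print(errors.groupby("slice").size().sort_values(ascending=False))
# If one slice dominates the errors, better training data for that slice may matter
# more than correcting a handful of wrong labels.
```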

To summarize, avoiding these mistakes puts you significantly ahead of most other
teams. Incorporate these as a checklist in your process.
