Professional Documents
Culture Documents
Machine Learning
Machine Learning
Chapter 07
Automated Machine Learning
Copyright 2022 © McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill
Education.
What Is Automated Machine Learning (AutoML)?
© McGraw-Hill Education 2
What Questions Might Arise?
How was the data collected and What are the reasons behind why
prepared for analysis? the recommended model
How did the model arrive at a produced the most accurate
particular conclusion? decision?
What is the blueprint of the Are there data issues that could
model? be impacting the validity of the
model?
Why did the model arrive at a
particular conclusion? Is the model consistent in its
predictions?
What variables had the greatest
impact on the predicted outcome? Why is the model a good
predictor?
What patterns exist in the data?
How accurate is the model?
© McGraw-Hill Education 3
AutoML in Marketing
© McGraw-Hill Education 4
Which Companies Are Actively Using AutoML?
© McGraw-Hill Education 5
What Are Key Steps in the Automated Machine Learning
Process?
Building models.
Recommending models.
© McGraw-Hill Education 6
Data Preparation
© McGraw-Hill Education 7
Model Building
© McGraw-Hill Education 8
Creating Ensemble Models
© McGraw-Hill Education 9
Simple Approaches to Ensemble Modeling
For categorical
Another more advanced technique involves using target variables,
a weighted average. the most common
• The higher quality data would be assigned category of
greater importance and thus weighted higher. “majority” rule can
be used.
© McGraw-Hill Education 10
Advanced Ensemble Methods – Bagging
© McGraw-Hill Education 11
Exhibit 7-3: Bagging (Bootstrap Aggregating)
© McGraw-Hill Education 13
Exhibit 7-4: Boosting
© McGraw-Hill Education 14
Model Recommendation
Multiple predictive models are examined and the model with the most
accurate predictions is recommended.
• Accuracy is determined by how well a model identifies relationships
and patterns in a dataset and uses this knowledge to predict outcomes.
• Higher levels of accuracy are measured based on better predictions of
observations, not in the original datasets used to develop the model.
• The most accurate prediction model(s) is then used to make better
decisions.
© McGraw-Hill Education 15
Case Study – Loan Data: Understanding When and How to
Support Fiscal Responsibility in Customers
Lending Club is a P2P lending platform who wants to reduce their default
rate from 10 to 8 percent within a year.
• A supervised model is needed to identify borrowers with a high chance
of default.
• You will upload data into DataRobot for AutoML analysis.
• After identifying the target variable, you run the model and evaluate
the results before applying the model to predict new cases.
The results of the AutoML revealed 2 customers out of 11 who were more
likely to default on their loan.
• Lending Club can send targeted messages to these customers about
bill paying, penalties, and free access to financial advisors.
© McGraw-Hill Education 16
End of main content.
www.mheducation.com
Copyright 2022 © McGraw-Hill Education. All rights reserved. No reproduction or distribution without the prior written consent of McGraw-Hill
Education.