
ML Application Life Cycle

& Solving the right problems!


Srujana Merugu

Formal approach to engineering ML systems?

Software Engineer                        ML Engineer / Data Scientist

Software engineering:                    Engineering ML systems:
  SDLC, Agile, SCRUM, SRS                  ? ?

Software design:                         Offline modeling:
  OOP, design patterns, SOA, ...           preprocessing, feature engg., learning, evaluation

Programming languages:                   ML tools / packages:
  Java, Python, C++, Scala                 Tensorflow, spark.ml, sklearn, R

Data structures & algos:                 ML concepts & algos:
  sort, trees, lists                       linear models, neural models, bias-variance
ML Application Life Cycle

[Diagram: life-cycle loop: Problem formulation → Data definitions → Offline ML modeling → Production system design → Production system implementation → Pre-deployment testing → Deployment & maintenance → Online evaluation & evolution → back to Problem formulation]

Skills legend: Product + ML + Engg. | Engg. + ML | Engg. | ML
ML Application Life Cycle: Problem formulation

• Application requirements → ML & optimization problems
ML Application Life Cycle: Data definitions

• Precise sources & definitions of all data elements
• Checks:
  – different types of leakage
  – data quality issues
  – distributional violations
ML Application Life Cycle: Offline ML modeling

• Offline training & evaluation of ML models
• Multi-step iterative process
Offline Modeling

Data Collection & Integration
  ↓
Data Exploration
  ↓
Data Sampling / Splitting
  ↓
Data Preprocessing
  ↓
Feature Engineering
  ↓
Model Training, Evaluation & Fine-tuning
  ↓
Meet business goals? If not, iterate (a pipeline sketch follows).
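A minimal sketch of this loop in Python, assuming a synthetic dataset and sklearn; the AUC gate at the end stands in for the "meet business goals?" check and its 0.85 threshold is purely illustrative.

```python
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Data collection & integration (stubbed with synthetic data)
X, y = make_classification(n_samples=5000, n_features=20, random_state=0)

# Data sampling / splitting
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.2, random_state=0)

# Preprocessing + training, iterated over a small fine-tuning grid
best_auc, best_model = 0.0, None
for C in [0.01, 0.1, 1.0, 10.0]:
    model = Pipeline([
        ("scale", StandardScaler()),
        ("clf", LogisticRegression(C=C, max_iter=1000)),
    ])
    model.fit(X_train, y_train)
    auc = roc_auc_score(y_test, model.predict_proba(X_test)[:, 1])
    if auc > best_auc:
        best_auc, best_model = auc, model

# "Meet business goals?" gate (threshold is a placeholder)
print(f"best AUC = {best_auc:.3f}, ship = {best_auc >= 0.85}")
```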
ML Application Life Cycle: Production system design

Functional production system:
• Scalability
• Responsiveness
• Fault tolerance
• Security
• …
ML Application Life Cycle: Pre-deployment testing

Equivalence checks for offline modeling vs. production settings (a parity-check sketch follows):
• Data fetch process
• Entire model pipeline
• Data distributions
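One way to make the "entire model pipeline" check concrete, as a hedged sketch: score the same sample through the offline artifact and the production scoring path, and require near-identical outputs. `offline_model` and `production_score` are hypothetical stand-ins for your own artifacts.

```python
import numpy as np

def check_scoring_parity(offline_model, production_score, X_sample, tol=1e-6):
    """Fail loudly if offline and production scores diverge on the same rows."""
    offline = offline_model.predict_proba(X_sample)[:, 1]     # offline pipeline
    prod = np.array([production_score(x) for x in X_sample])  # production path
    max_diff = np.max(np.abs(offline - prod))
    assert max_diff <= tol, f"offline/production mismatch: {max_diff:.2e}"
    return max_diff
```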
ML Application Life Cycle: Deployment & maintenance

Automation of:
• Predictions for new instances
• Data quality monitoring
• Data logging & attribution
• Periodic re-training (see the job sketch below)
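A sketch of what the periodic re-training automation might look like; `load_labeled_window`, `train_model`, and `registry` are hypothetical hooks into your own data store, offline pipeline, and model registry.

```python
import datetime as dt

def retrain_job(load_labeled_window, train_model, registry, window_days=90):
    """Periodic job: retrain on a rolling window of recent labeled data."""
    end = dt.datetime.now(dt.timezone.utc)
    start = end - dt.timedelta(days=window_days)
    labeled = load_labeled_window(start, end)  # hypothetical data hook
    model = train_model(labeled)               # reuse the offline pipeline
    registry.register(model, trained_at=end)   # versioned model store
```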
ML Application Life Cycle: Online evaluation & evolution

• A/B testing on key prediction quality & business metrics (a read-out sketch follows)
• Assessment of system performance aspects
• Diagnosis to find areas of improvement
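A minimal sketch of reading out an A/B test on a binary business metric with a two-proportion z-test; the counts below are invented for illustration, and in practice they would come from randomized live-traffic buckets.

```python
from math import sqrt
from statistics import NormalDist

def two_proportion_ztest(success_a, n_a, success_b, n_b):
    """Two-sided z-test for a difference in conversion-style rates."""
    p_a, p_b = success_a / n_a, success_b / n_b
    p_pool = (success_a + success_b) / (n_a + n_b)
    se = sqrt(p_pool * (1 - p_pool) * (1 / n_a + 1 / n_b))
    z = (p_b - p_a) / se
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

# Illustrative counts: control bucket A vs. treatment bucket B
z, p = two_proportion_ztest(success_a=480, n_a=10_000, success_b=540, n_b=10_000)
print(f"z = {z:.2f}, p = {p:.3f}")
```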
System Objectives

• Effectiveness w.r.t. business metrics
• Ethical compliance
• Fidelity w.r.t. distributional assumptions
• Reproducibility
• Auditability
• Reusability
• Security
• Graceful failure
• …

We can achieve these only with a formal approach, with checklists, templates & tests for each stage!
ML Application Life Cycle: how application-dependent is each stage?

[Diagram: the same life-cycle loop, with each stage marked H, P, or L]

H = High (application-specific)
P = Partial
L = Low (application-agnostic)
ML & Data Science Learning Programs

[Pyramid: Problem Formulation, Data, Learning Algorithms, ML Pipelines, Modeling Process, Deployment Issues]

A lot of emphasis on algorithms, ML tools & modeling!
Factors for Success of ML Systems

[Pyramid: Problem Formulation, Data, Learning Algorithms, ML Pipelines, Modeling Process, Deployment Issues]

Problem formulation & data become more critical!
Problem Formulation

Business problem: optimize a decision process to improve business metrics

• Sub-optimal decisions are made due to missing information
• Solution strategy: predict the missing information from available data using ML

[Diagram: decision process → decisions → business metrics, with external response feeding back; ML models supply the missing information]

Ask "why?" to arrive at the right ML problem(s)!
Reseller Fraud Example

• Bulk buys during sale days on e-commerce websites
• Later resale at higher prices or returns
Reseller Fraud Example

Objective: Automation of reseller fraud detection

Option 1: Learn a binary classifier using historical orders & human auditor labels (a minimal sketch follows)

Limitations:
• Reverse-engineers human auditors' decisions along with their biases and shortcomings
• Can't adapt to changes in fraudster tactics or data drifts
• No connection to the "actual business metrics" that we want to optimize
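To make Option 1 concrete, a minimal sketch under stated assumptions: a standard classifier on historical order features with auditor labels. The feature columns and the tiny inline dataset are made up, and the limitations above apply regardless of the model choice.

```python
import pandas as pd
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import classification_report
from sklearn.model_selection import train_test_split

orders = pd.DataFrame({             # stand-in for historical order data
    "order_qty":     [1, 40, 2, 55, 3, 60, 1, 45],
    "sale_day":      [0, 1, 0, 1, 0, 1, 0, 1],
    "past_returns":  [0, 5, 1, 7, 0, 9, 0, 6],
    "auditor_label": [0, 1, 0, 1, 0, 1, 0, 1],  # 1 = flagged as reseller
})
X = orders.drop(columns="auditor_label")
y = orders["auditor_label"]
X_tr, X_te, y_tr, y_te = train_test_split(
    X, y, test_size=0.25, random_state=0, stratify=y)

clf = GradientBoostingClassifier(random_state=0).fit(X_tr, y_tr)
print(classification_report(y_te, clf.predict(X_te)))
```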
Reseller Fraud Example

Objective: Reduce return shipping expenses; increase #users served (esp. at sale time)

Decision process:
• Partner with the reseller in case of potential to expand the user base
• Block fraudulent orders or introduce friction (e.g., disable COD / free returns)

Missing information relevant to the decision:
• Likelihood of the buyer reselling the products
• Likely return shipping costs
• Unserved demand for the product (during sale and overall)
• Likelihood of the reseller serving an untapped customer base
Key Elements of an ML Prediction Problem

REPRESENTATION
• Instance definition
• Target variable to be predicted
• Input features

OBJECTIVES
• Modeling metrics
• Ethical & fairness constraints
• Deployment constraints

OBSERVATIONS
• Sources of data
Instance Definition

• Is it the right granularity for the decision-making process?
• Is it feasible from the data collection perspective?

Multiple options (reseller fraud example):
• a customer
• a purchase order spanning multiple products
• a single product order (i.e., a customer-product pair)
Target Variable to be Predicted

• Can we express the business metrics (approximately) in terms of the prediction quality of the target variable(s)?
• Will accurate predictions improve the business metrics substantially?
  – estimate business metrics for different cases (ideal, current baseline, likely)
• What is the data collection effort?
  – manual labeling costs, joins with external data
• Is it possible to get high-quality observations?
  – uncertainty in the definition, noise or bias in the labeling process
Input Features

• Is the feature predictive of the target?
• Are the features going to be available in the production setting? (a leakage-safe sketch follows)
  – define exact time windows for features based on aggregates
  – watch out for time lags in data availability
  – be wary of target leakage (esp. conditional expectations of the target)
• How costly is it to compute or acquire the feature?
  – monetary and computational costs
  – might differ between training and deployment settings
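A hedged sketch of a leakage-safe aggregate feature: only events strictly before the prediction time enter the window, mirroring what production would see. The table and column names are hypothetical.

```python
import pandas as pd

def past_return_rate(events: pd.DataFrame, customer_id, predict_at,
                     window=pd.Timedelta(days=90)):
    """Return rate for a customer over the window *before* predict_at."""
    mask = (
        (events["customer_id"] == customer_id)
        & (events["ts"] < predict_at)            # no peeking into the future
        & (events["ts"] >= predict_at - window)  # exact time window
    )
    past = events.loc[mask]
    return past["returned"].mean() if len(past) else 0.0
```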
Sources of Data

• Is the distribution of training data similar to production data? (a drift-check sketch follows)
  – at least the conditional distribution of the target given the input signals?
  – are there fairness issues that require sampling adjustments?
  – can we re-train with new data in case production data evolves over time?
• Are there systemic biases in the training data due to the collection process?
  – fixed training filters?
    • adjust the prediction scope to match the filter
  – collection limited by an existing model?
    • explore-exploit strategies & statistical bias correction approaches
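A sketch of a train-vs-production distribution check on a single numeric feature, using the two-sample Kolmogorov-Smirnov test; the significance level and the synthetic data are illustrative.

```python
import numpy as np
from scipy.stats import ks_2samp

def drift_report(train_values, prod_values, alpha=0.01):
    """Flag a numeric feature whose production distribution has shifted."""
    stat, p_value = ks_2samp(train_values, prod_values)
    return {"ks_stat": stat, "p_value": p_value, "drifted": p_value < alpha}

rng = np.random.default_rng(0)
print(drift_report(rng.normal(0.0, 1, 5000), rng.normal(0.3, 1, 5000)))
```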
Modeling Metrics - Classification

• Online metrics are meant to be computed on a live system
  – can be defined directly in terms of the key business metrics (e.g., net revenue)
  – typically measured via A/B tests & influenced by a lot of factors
• Offline metrics are meant to be computed on retrospective "labeled" data
  – more closely tied to prediction quality (e.g., area under the ROC curve)
  – typically measured during offline experimentation
• Primary metrics are the ones we are actively trying to optimize
  – e.g., losses due to fraud (computed in the sketch below)
• Secondary metrics are the ones that can serve as constraints or guardrails
  – e.g., customer base size
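A hedged sketch of offline primary vs. secondary metrics for the fraud example: the primary metric is money lost to uncaught fraud, the guardrail is the fraction of good orders incorrectly blocked. The labels, predictions, and order values below are made up.

```python
import numpy as np

def fraud_metrics(y_true, y_pred, order_value):
    y_true, y_pred = np.asarray(y_true), np.asarray(y_pred)
    missed = (y_true == 1) & (y_pred == 0)        # fraud we failed to catch
    blocked_good = (y_true == 0) & (y_pred == 1)  # good orders we blocked
    return {
        "fraud_loss": order_value[missed].sum(),                        # primary
        "good_blocked_rate": blocked_good.sum() / (y_true == 0).sum(),  # guardrail
    }

print(fraud_metrics(
    y_true=[1, 0, 1, 0, 0, 1],
    y_pred=[1, 0, 0, 1, 0, 1],
    order_value=np.array([500.0, 80.0, 1200.0, 60.0, 40.0, 300.0]),
))
```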
Modeling Metrics

• What are the key online metrics (primary/secondary)?
  – a deep question related to system goals!
• Are the offline modeling metrics aligned with the online metrics?
  – the relative goodness of models should reflect online metric performance
Ethical and Fairness Constraints

• What are the long-term secondary effects of the ML system?
• Is the system fair to different user segments?

These need to be incorporated into the modeling metrics!
Deployment Constraints

• What are the application constraints?
  – user-interface restrictions (interaction mode, form factor)
  – connectivity issues
• What are the hardware constraints?
  – client-side or server-side computation
  – memory, compute power
• What are the scalability requirements?
  – size of data, frequency of processing (training / batch prediction)
  – rate of arrival of prediction instances & latency bounds (online predictions)
Key Tenets for ML Applications
Data Definitions

• Precisely record all sources & definitions for all data elements
  – (ids, features, targets, metric factors) for (training, evaluation, production)
• Establish parity across training / evaluation / production (a parity-check sketch follows)
  – definitions, level sets, units, time windows, missing value handling, correct snapshots
• Review for common data leakages
  – peeking into the future, target leakage
• Pro-actively collect information on data quality issues & resolve them
  – causes of missing/invalid values, data corruptions
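A sketch of an automated parity check on data definitions, assuming both training and production samples are available as DataFrames; the missing-rate tolerance is an arbitrary placeholder.

```python
import pandas as pd

def check_data_parity(train: pd.DataFrame, prod: pd.DataFrame, null_tol=0.05):
    """Return a list of definition-parity issues (empty list == parity holds)."""
    issues = []
    if list(train.columns) != list(prod.columns):
        issues.append(f"column mismatch: {set(train.columns) ^ set(prod.columns)}")
    for col in train.columns.intersection(prod.columns):
        if train[col].dtype != prod[col].dtype:
            issues.append(f"{col}: dtype {train[col].dtype} vs {prod[col].dtype}")
        gap = abs(train[col].isna().mean() - prod[col].isna().mean())
        if gap > null_tol:
            issues.append(f"{col}: missing-rate gap {gap:.2f}")
    return issues
```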
Offline Modeling

• Ensure data is of high quality
  – fix missing values, outliers, systemic bias
• Narrow down modeling options based on data characteristics
  – learn about the relative effectiveness of various preprocessing, feature engineering, and learning algorithms for different types of data
• Be smart about the trade-off between feature engg. effort & model complexity
  – the sweet spot depends on problem complexity, available data, domain knowledge, and computational requirements
• Ensure offline evaluation is a good proxy for evaluation on real unseen data
  – generate data splits similar to how they would be during deployment (a temporal-split sketch follows)
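A minimal sketch of a deployment-like split: train strictly on the past and evaluate on the future, rather than shuffling randomly; the timestamp column name and cutoff date are hypothetical.

```python
import pandas as pd

def temporal_split(df: pd.DataFrame, ts_col="ts", cutoff="2019-10-01"):
    """Past rows for training, future rows for evaluation."""
    cutoff = pd.Timestamp(cutoff)
    train = df[df[ts_col] < cutoff]
    test = df[df[ts_col] >= cutoff]  # mimics scoring unseen future data
    return train, test
```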
Engineering

• Work backwards from the application use case
  – data / compute / ML framework choices based on deployment constraints
• Clearly decouple modeling and production system responsibilities
  – self-contained models (config, parameters, libs) from data scientists
  – application-agnostic pipelines for scoring, evaluation, re-training, data collection
• Maintain versioned repositories for data, models, and experiments
  – logs, feature factories
• Plan for ecosystems of connected ML models
  – easy composition of ML workflows
Deployment

• Establish offline modeling vs. production parity
  – checks on every possible component that could change
• Establish improvement in business metrics before scaling up
  – A/B testing over random buckets of instances
• Trust the models, but always audit
  – insert safeguards (automated monitoring, as sketched below) and manual audits
• View model building as a continuous process, not a one-time effort
  – retrain periodically to handle data drifts & design for this need
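A sketch of one such automated safeguard: bounds on the live score distribution and on input quality, with any violation surfaced for human attention. All thresholds are illustrative assumptions.

```python
import numpy as np

def guardrail(scores: np.ndarray, null_rate: float,
              score_mean_bounds=(0.02, 0.10), max_null_rate=0.05):
    """Return alerts; non-empty => page on-call / consider a safe fallback."""
    alerts = []
    if not score_mean_bounds[0] <= scores.mean() <= score_mean_bounds[1]:
        alerts.append(f"score mean {scores.mean():.3f} out of bounds")
    if null_rate > max_null_rate:
        alerts.append(f"input null rate {null_rate:.2%} too high")
    return alerts
```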
Main Takeaways

• Map out your org-specific ML application life cycle
• Introduce checklists, templates, and tests for each stage
• Invest effort in getting the problem formulation right (ask "why?")
• Be proactive about data issues
Thank You!

Happy Modeling!

Contact: srujana@gmail.com
