Professional Documents
Culture Documents
Learning for
Manufacturing:
Data Science for All
Jeff Kavanaugh
June 7, 2018
1 | Internet
1Infosys of Manufacturing Midwest 2018
Confidential
Today’s discussion
2 | Internet
2Infosys of Manufacturing Midwest 2018
Confidential
www.infosys.design/plantio
3 | Internet
3Infosys of Manufacturing Midwest 2018
Confidential
Industry 4.0: Beyond the POC… the time to scale is upon us
Industry 4.0 integrates the physical and virtual worlds through technology enablers, which brings the fungibility and
speed of software to manufacturing operations. The potential value created by Industry 4.0 vastly exceeds the low-
single-digit cost savings that many manufacturers pursue today (acatech, Infosys, BCG, McKinsey, et al).
Disruptive technology enablers for Industry 4.0 are at a tipping point Industry 4.0 is changing manufacturing
25B
connected things forecasted to
250M
connected vehicles are forecasted
$493B
in digital revenue gains per
ship by 2020.* to have some form of wireless annum
network connection by 2020.*
Efficiency 35%
places the 2022 goal of 46% at risk
concepts
Maintenance • Reason: data hurdles and piecemeal
Efficiency
• 46% want to implement POC approach – unclear path
Industry 4.0 solutions 32%
Approach to overcome barriers:
Information systematically for enhanced 32%
Efficiency asset efficiency by 2022 1. Evaluate your digital maturity
2. Proof of concepts to demonstrate
• Only 30% have implemented 18%
Energy 15% business value, then scaled action
Efficiency
data-driven or intelligent
4%
services 3. Set clear targets
2017 2022
Service 4. Prioritize measures that will bring
Systematically implemented Potential recognized the most value to business
Efficiency
Partly implemented No awareness
5. Demonstrate courage, persistence
Source: Infosys and Institute for Industrial Management (FIR) at RWTH Aachen study conducted in 2015 and updated in 2017.
Sample size: 433 executives across industrial manufacturing sectors from China, France, Germany, UK and USA
5 | Internet
5Infosys of Manufacturing Midwest 2018
Confidential
Humans still matter! Industry 4.0 maturity is about more than the
technology, and poor reasoning skills are constraining progress
Source: Industrie 4.0 Maturity Index, acatech study supported by Infosys, 2017 7 | Internet
7Infosys of Manufacturing Midwest 2018
Confidential
Machine Learning is an important component in Industry 4.0 analytics
Other AI Offerings
Computer Vision
Video Analysis
Time & AI Infrastructure
Image Insights /
Classification
Labeled Y Comparison Language
Data ? Chat Bots Translation
Y N Clustering /
Predict
Anomaly Detection
Categories?
N
Predicting Y Regression
Text Extraction
values ? N Dimension Knowledge High-Fidelity Speech
Reduction Management Synthesis
8 | Internet
8Infosys of Manufacturing Midwest 2018
Confidential
Machine Learning involves solving business problems using 25+
algorithms segmented into 4 groups
Solving a Machine Learning (ML) Different algorithms are better suited We have found the Python scikit-learn
problem depends on finding the for different types of data and flowchart useful for selecting ML
right algorithms for the business different problems algorithms specific to business
problem problem and available data
Yes/No;
Old Faithful
quality
(Describes
pass/fail
relationships)
Groups into
similar Addresses
characteristics too much
sensor data!
Infosys Confidential
http://scikit-learn.org/stable/tutorial/machine_learning_map/index.html 9 | Internet
9 Infosys of Manufacturing Midwest 2018
Confidential
Machine Learning platforms (tools) have a large library of algorithms,
designed to address different types of business problems
Classification Regression
Identify category to which an object belongs Predicts a continuous-valued attribute associated with an object
Applications: spam detection, image recognition, quality P/F Applications: forecasting, pricing determination
Support Vector k-Neighbors Random Forest / Stochastic Gradient Stochastic Gradient Lasso Ridge Elastic Net
Machine (SVM) Classification Decision Trees Descent (SGD) Descent (SGD) Regression
Classifier Regressor
K-Means Gaussian Mean Shift Spectral Randomized Principal Locally Linear Kernel Spectral
Clustering Mixture Models Clustering Component Analysis Embedding Approximation Embedding
(PCA)
Infosys Confidential 10 | Internet
10Infosys of Manufacturing Midwest 2018
Confidential
Machine learning techniques are organized by ability to learn
Simplification and
Production planning and Demand forecasting using
automation of manual
scheduling sales pipeline
services billing
• Chillers
• HVAC Central Command Center Demand Management
• Generators
• Elevators
• Sewage
Treatment
plants
• Solar power
plants Digital Twin and Optimum Predictive
Operating Conditions Maintenance
Large Campuses
80 million+ square feet Assets Managed Solution Approach
Business Value
Maturity Level/
Centralized command center for real-time visibility
Visibility Transparency Predictability Adaptability
Path of development
Path of development
16 16 | Internet
Infosys of Manufacturing Midwest 2018
Confidential
Predictability
Business Value
Maturity Level/
Implemented advanced analytics on chiller data for event detection and prognostics
Visibility Transparency Predictability Adaptability
Path of development
17 17 | Internet
Infosys of Manufacturing Midwest 2018
Confidential
Example: greenfield waste water treatment plant’s pumping station
FOCUS
The strict environmental permit must be • Plant: State-of-the-art waste water treatment plant in Europe
fulfilled which means that the effluent
from the pumping station to
environment is not acceptable
TARGET
To find optimal design solution to fulfil the
required availability and safety with minimum
lifecycle cost
• Three operating scenarios:
1) Average flow
EVALUATE
To create a RAMS simulation model of the 2) Average + Industry peak flow
different design alternatives with different
operation and maintenance scenarios 3) Peak flow (heavy rain, flood)
18 18 | Internet
Infosys of Manufacturing Midwest 2018
Confidential
Moving from a traditional RAMS* to AI-based RAMS design enabled
faster decision-making with more accuracy
RAMS Definition of Maintenance Development and Optimization of the Plant specific maintenance
Identification of
Design Categories for RCM- Planning of CBM, service program to achieve required availability
critical RCM positions
Process positions TBM and CM task and safety with minimum costs
Remaining Useful
Life Estimation
Plant performance
KPIs:
RAMS and Risk
Index
Downtime insights:
Troubled asset, reason
for failures, cost savings
20 20 | Internet
Infosys of Manufacturing Midwest 2018
Confidential
Condition-based monitoring used AI to recommend pump
maintenance and proactive interventions
Results of RAMS analyses of three design solution cases with current maintenance service program (without
CBM= Condition Based Maintenance) and with Drishti 4.0* operational excellence AI platform (=with CBM)
Maintenance service program Without CBM With CBM Without CBM With CBM Without CBM With CBM
• Purchased vehicle
Enrolled in trial Enrolled in trial but did Did not enroll in Enrolled in trial and Enrolled in trial but did Did not enroll in trial
and converted not convert to paid subscription trial converted to paid not convert to paid subscription
to paid
Sep Oct Nov Dec Jan Feb Mar Apt May Jun Jul Aug Sep
• Next, we collected all usage and sales data for these Usage
Analysis
vehicles for the month before the renewal (~35,000 total commands)
– What specific commands were used by each vehicle? e.g. Remote Lock, Remote Start, Vehicle Finder, etc.
– What year / model was the vehicle?
– What was usage on the weekday vs the weekend for each vehicle?
– What metropolitan area was the selling dealer in?
– Did the customer subscribe to paid services?
After getting the right data, we can build a model to answer the overarching question:
Which customers will subscribe?
25 25 | Internet
Infosys of Manufacturing Midwest 2018
Confidential
After a number of tuning iterations, the model enables churn
prediction on an individual basis
• The classification model was tuned over multiple iterations using Microsoft Azure, in order to find the
ideal level of accuracy and resiliency measured with the Receiver Operating Characteristic (ROC)
– Certain data was removed from the model to improve model quality
This curve would
indicate we could
repeat customers extraneous data fields (e.g. VIN) predict every single
customer perfectly
(impossible!)
VIN: SN0001
Model: SUV MODEL A
Weekday uses: 48 Renewal This straight line
Weekend uses: 18
Remote: 0 probability: would indicate we
Status: 0
73.9% are guessing
Finder: 31 randomly
Lock: 35
This is the
Over the course of 100+ iterations, the machine learning accuracy of our
algorithm uses the training set to build a decision tree model using limited
based on the input data. Sample decision branches: data – this can only
• Is weekend usage greater than weekday usage? improve as we add
• Was the car purchased in an area with extreme weather? more data points,
e.g. demographics
The tuned model enables us to determine the churn probability for each customer
Infosys Churn Model POC, 2018 26 26 | Internet
Infosys of Manufacturing Midwest 2018
Confidential
We can now choose customers to address, to maximize profitability
Probability of renewal
5$ 16 $ 38 $ 59 $ 81
• Customers < 20% likely to renew 10 $ 12 $ 49 $ 87 $ 125
15 $ (9) $ 44 $ 96 $ 149
• Profit in first year: $96k 20 $ (36) $ 30 $ 96 $ 161
Additional net profit per
• This is maximum cumulative profit for the scenario 25 $ (59) $ 13 $ 85 $ 158
year (000s)
30 $ (80) $ (3) $ 74 $ 150
35 $ (95) $ (15) $ 65 $ 145
We choose the retention (renewal) probability of 40 $ (109) $ (26) $ 57 $ 141
customers to address
* Using Azure Machine Learning Studio
Jeff Kavanaugh
Partner, Manufacturing
Infosys Consulting
jeff_kavanaugh@infosys.com
Adjunct Professor
University of Texas at Dallas
jeff_kavanaugh@utdallas.edu
28 28 | Internet
Infosys of Manufacturing Midwest 2018
Confidential