Professional Documents
Culture Documents
Best Practices For Human-In-The-Loop PDF
Best Practices For Human-In-The-Loop PDF
3
Machine learning
unsupervised ML:
4
Active learning
a special case of semi-supervised ML:
6
Let’s consider:
• AI encompasses much more than deep learning:
there’s also knowledge graph, planning systems,
other kinds of learning, mathematical optimization,
etc.
• Also think about teams of people + machines
• How can these different parts be integrated?
• Which methodologies need to be followed?
7
1/ Achieving human parity
Achieving Human Parity in Conversational
Speech Recognition
W. Xiong, et al. Microsoft
(2016-10-12)
Microsoft researchers reached
human parity in conversational
speech recognition, reducing
speech-to-text error rates below
the human threshold of ~5%
8
1/ Achieving human parity
Achieving Human Parity in Conversational
Speech Recognition
W. Xiong, et al. Microsoft
AI defined? achieving parity with human
(2016-10-12)
experts, in domains where experts solve
“hard problems” —
Microsoft researchers reached
perhaps more aptly
stated as learning in uncertain domains
human parity in conversational
speech recognition, reducing
speech-to-text error rates below
the human threshold of ~5%
9
1/ Achieving human parity
Achieving Human Parity in Conversational
Speech Recognition
W. Xiong, et al. Microsoft
(2016-10-12)
Microsoft researchers reached
human parity in conversational
speech recognition, reducing
speech-to-text error rates below
seen repeatedly as a
the human threshold of ~5%
metric for “expertise”
10
2/ Segments for AI adoption in industry
segment liabilities assets
▪ facing barriers: talent gap, competing
▪ HITL provides a vector to compete
MIT SMR
investment priorities, security concerns against top incumbents, with many 2017
adopters unexplored areas of opportunity
▪ verticals eroded by horizontal business
lines from top incumbents
12
3/ “Detach from named methods”
Developers Should Abandon Agile
Ron Jeffries (2018-05-10)
“No matter what framework or method your management
thinks they are applying, learn to work this way:
• Produce running, tested, working, integrated software
every two weeks, every week…
• Keep the design of that software clean. As it grows, the
design will tend to become complex and crufty…
• Use the current increment of software as the
foundation for all your conversations with your
product leadership and management…
13
3/ “Detach from named methods”
Developers Should Abandon Agile
~20 yrs ago: “Agile”
Ron Jeffries (2018-05-10) created value by iterating
on a code base, while the nature of the data
“No matterwaswhat framework
relatively or method
invariant, yourspecified
e.g., management
by a
thinks theyschema,
are applying, learn to work this way:
relegated to unit tests, database, etc.
• Produce running, tested, working, integrated software
today:
every two OSS
weeks, shifted
every week…ROI – now someone else
maintains the code; but without LOTS of
• Keep the design of that software clean. As it grows, the
carefully labeled data, that code
design will tend to become complex and crufty…may not
yield much return
• Use the current increment of software as the
foundation for all your conversations with your
product leadership and management…
14
4/ The reality of data rates
“If you only have 10 examples of something, it’s going
to be hard to make deep learning work. If you have
100,000 things you care about, records or whatever,
that’s the kind of scale where you should really start
thinking about these kinds of techniques.”
15
4/ The reality of data rates
Transfer learning aside, most DL use cases require large, carefully
labeled data sets, while RL requires much more data than that.
Active Learning can yield good results with substantially smaller
data rates, while leveraging an organization’s expertise to
bootstrap toward larger labeled data sets, e.g., as preparation for
deep learning, etc.
supervised reinforcement
learning learning
data rates
(log scale)
active deep
learning learning 16
4/ The reality of data rates
Transfer learning aside, most DL use cases require large, carefully
labeled data sets, while RL requires much more data than that.
Active Learning can yield good results with substantially smaller
data rates, while leveraging an organization’s expertise to
bootstrap toward larger labeled data sets, e.g., as preparation for
deep learning, etc.
many enterprise
use cases fit here
supervised reinforcement
learning learning
data rates
(log scale)
active deep
learning learning 17
4/ The reality of data rates
“Reducing the Data Demands of Smart Machines”
https://www.darpa.mil/news-events/2018-07-11
• reduce the data needed to build new models by 106
• reduce the data needed to adapt models from
millions to hundreds of labeled examples
“We are encouraging researchers to create novel
methods in the areas of meta-learning, transfer
learning, active learning, k-shot learning, and
supervised/unsupervised adaptation to solve
this challenge.”
18
5/ Human-AI Coexistence?
China: AI superpower
Kai-Fu Lee Sinovation Ventures
AI SF, 2018-09-06
quadrants plotted by
“optimization” vs “compassion"
19
5/ Human-AI Coexistence?
Second-order cybernetics provides decades of learnings
which now apply for AI adoption in enterprise
Second-Order Cybernetics
Ranulph Glanville
Systems Science and Cybernetics (2003)
20
Active Learning:
case studies and patterns
Who’s doing this?
22
Case Study: Primer
Using Artificial Intelligence to Fix Wikipedia's
Gender Problem
Tom Simonite
Wired (2018-08-03)
23
Case Study: Stitch Fix
Building a business that combines human
experts and data science
Eric Colson StitchFix
O’Reilly Data Show (2016-01-28)
“what machines can’t do are things around cognition,
things that have to do with ambient information, or
appreciation of aesthetics, or even the ability to
relate to another human”
24
Case Study: Figure Eight
Real-World Active Learning: Applications and
Strategies for Human-in-the-Loop ML
Ted Cuzzillo
O’Reilly Media (2015-02-05)
26
Case Study: Lola
Paul English on Lola's Debut for Business Travelers
Elizabeth West
Business Travel News (2017-10-04)
founded 2015 by Paul English and other Kayak execs:
on-demand, personal travel service; uses expert travel agents for HITL
initially criticized by travel industry as “competing against Siri”;
currently displacing OTAs in a reversal of “AI vs. jobs”
can book on Airbnb, Southwest, etc., which aren’t available via OTA,
because of the human delegation
“The first time you use Lola it’s going to be great because it’s a conversation.
We’re not making you think like a computer”
“Instead of showing you 300 choices or 1,000 choices, we think we can
show you three choices, kind of good, better, best”
27
Case Study: SAP Concur
When Privacy Scales
Amanda Casari (SAP Concur)
AI SF (2018-09-06)
28
Case Study: hCaptcha
Growing two-sided markets with blockchain tech
Eli-Shaoul Khedouri Intuition Machines
AI SF (2018-09-05)
Customers: provide datasets which need to be labelled
https://twitter.com/pacoid/status/1037413665283010560
https://hcaptcha.com/
https://www.hmt.ai/
29
Case Study: Crowdbotics
Anand Kulkarni Crowdbotics
parse specs from JIRA history, reuse what’s been done before;
generate PRs for popular web stacks: React, Flask, Ruby, etc.
31
Case Study: B12
Building human-assisted AI applications
Adam Marcus B12
O’Reilly Data Show (2016-08-25)
“Humans where they’re best, machines for the rest.”
example: http://www.coloradopicked.com/
32
Case Study: Clara Labs
Strategies for integrating people and machine
learning in online systems
Jason Laska Clara Labs
The AI Conf, NY (2017-06-29)
establishing a two-sided marketplace where
machines and people compete on a spectrum
of relative expertise and capabilities
33
Case Study: Clara Labs
Strategies for integrating people and machine
learning in online systems
Jason Laska Clara Labs
“the trick is
The AI Conf, NY (2017-06-29) to design systems from Day 1
which learn implicitly from the intelligence
which ismarketplace
establishing a two-sided where
already there”
machines and people compete on a spectrum
Michael Akilian Clara
of relative expertise and capabilities
Labs
34
Active Learning:
theory, tooling, practices
HITL theory: the business of business
Risk, Uncertainty, and Profit
Frank Knight
Riverside Press (1921)
37
HITL tooling: active learning
Agnostic Active Learning Without Constraints
Alina Beygelzimer, Daniel Hsu, John Langford,
Tong Zhang
NIPS (2010-06-14)
39
HITL practices: model interpretation
A Survey Of Methods For Explaining Black Box Models
Riccardo Guidotti, et al.
(2018-02-19)
Understanding Black-box Predictions via Influence Functions
Pang Wei Koh, Percy Liang
ICML (2017-07-10)
The Building Blocks of Interpretability
Chris Olah, et al. Google Brain
Distill (2018-03-06)
Challenges for Transparency
Adrian Weller
WHI (2017-07-29)
The Mythos of Model Interpretability
Zachary Lipton
WHI (2016-03-06)
40
HITL practices: model interpretation
explicability of ML models becomes essential,
must be intuitive for the human experts involved:
Skater, and also Anchors, SHAP, STREAK, LIME, etc.
41
HITL practices: no-collar workforce
No-collar workforce: Humans and machines in one loop
Anthony Abbatiello, Tim Boehm, Jeff Schwartz, Sharon Chand
Deloitte Insights (2017-12-05)
▪ near-future: human workers and machines complement the
other’s efforts in a single loop of productivity
▪ 2018-20: expect firms to embrace a “no-collar workforce”
trend by redesigning jobs
▪ yet only ~17% ready to manage a workforce in which people,
robots, and AI work side by side – largely due to cultural, tech
fluency, regulatory issues
▪ e.g., what about onboarding or retiring non-human workers?
these are no longer theoretical questions
▪ HR orgs must develop strategies and tools for recruiting,
managing, and training a hybrid workforce
42
Social Systems:
collaboration with machines
First-order cybernetics
Cybernetics: Or Control and Communication
in the Animal and the Machine
Norbert Wiener MIT
MIT Press (1948)
47
Second-order cybernetics
Autopoiesis and Cognition: The Realization of the Living
Humberto Maturana, Francisco Varela
Kluwer (1980 / original 1972)
“Action emerges from committed interactions of people
Understanding Computers and Cognition:
making requests and
A New Foundation for Design
promises in networks of commitments;
such networks aren’t brought
Terry Winograd, Fernando Flores
forth by plans.”
Intellect Books (1986)
– Fernando Flores
[could rewrite
Conversations quote
for Action and as “people
Collected and machines”]
Essays
Fernando Flores
Createspace (2013)
48
Summary:
how this matters
What is changing and why?
Second-order cybernetics began partly as a study of how
complex systems fail, and also about what social systems
and physical systems had in common
▪ “data-driven organizations”
Human
Experts
as conversations for action Experts gain insights
via Model explanations
Customer
Organizational Use Cases
53
Examples, “design systems which
Actions
Experts decide
learn implicitly from
about edge cases,
providing examples
the intelligence that’s
already there”
Models focus Experts
(e.g., weak supervision) Models explore
uncertainty when needed
Human
Experts
Experts gain insights
via Model explanations
Customer
Organizational Use Cases
Learning
Customers request
Sales, Marketing,
Service, Training
Customers 54
publicaions, interviews, conference summaries…
https://derwen.ai/paco
@pacoid