Professional Documents
Culture Documents
Write your questions down to ask during Demo & Lab time.
Quiz (Retain that knowledge)
Simple poll of the workshop to retain the knowledge.
These serve as a way for you to remember key insights about what
we’re talking about.
Discussion (Learn by Reflection)
The purpose of the demo is to introduce and guide you on the Lab
sections.
I will introduce the data and general purpose of the lab sections.
Try things, fail, and if you don’t finish it all in 1.5 hours that’s ok.
➔ Economics
➔ Political Science
➔ Theory of Rationality
Problems with deduction
➔ Predictable irrationality
➔ Logical fallacies
➔ Local minima
Inductive reasoning
➔ Evidence based
➔ Statistics
➔ Generalizations
➔ Statistical Syllogisms
➔ Proof by Induction
➔ Prediction
➔ Analogies
➔ Causal inference
Problems with induction
➔ Biases
Confirmation bias
Attribution bias
Favoritism
Inductive bias
Racism
Sexism
➔ Black Swans
➔ Not all variables are available
Weapons of Math Destruction
Author Cathy O'Neil
Sexism → Generalizations
Who uses machine learning?
Amazon recommendations Robot Process Automation at FortressIQ Waze
Nest Thermostat Detecting defects with images like food, Semantic Similarity using word2vec
Spotify discover weekly cracks in subfloors, etc Playing music
Facebook lookalike audiences Recommended connections Customer lifetime value calculation
Amazon photos organization FB News feed What ads, to serve to who, and when?
Siri, Google, Cortana, Alexa Learned database indexes Forecasting the weather
Chat bots and implied intent Personalization with Sailthru Better insurance. by monitoring your driving
AlphaGo A/B testing with optimizely patterns
MarI/O or tool assisted gaming Malware detection at Malware bytes Object detection with YOLO, and RCNNs
Yelp photo aggregation Facial detection at Border crossings with Learn where military bases are (Strava
thematic clusters Rekognition accidently gave this out)
Google autocomplete. Self driving cars with Tesla Anomaly detection in technical trading
Sense energy meter Deep fakes Detecting sexual orientation with computer
Hedge funds and renaissance Better workouts with data at Volt vision
technologies Summarizing Legalese with LegalRobot InnerEye
Fraud detection Preventing money laundering at Paypal 23andme relatives
Check deposit recognition Verifying identity at SheerID Predicting lost connections in Airlines
Policing with data How the Oakland A's won with Reducing emissions with deep learning at
Netflix recommendations Moneyball Siemens
Better Pricing with AirBNB Emotions and Sentiment with IBM Predicting delivery time at Postmates
Painting with neural nets Watson Why do summer songs all sound the same?
Predict maintenance problems Age detection using Rude Carnie Roombas
Social Media
➔ Facebook newsfeed
➔ Waze
➔ Strava
➔ 23andme relatives
➔ Robotic procedures
Retail
➔ Target targeting pregnant woman
➔ Recommendation engines
➔ Fraud detection
Finance
➔ Hedge Funds like Renaissance Tech
➔ Anomaly detection in technical trading
➔ Executing trades effectively at JP Morgan
➔ Feature extraction of SEC filings
➔ Fraud detection
➔ Check deposit detection
Marketing
➔ Look-alike audiences
➔ Advertisements
➔ Personalization in marketing
➔ Thematic clustering
Data security
➔ Malware detection
➔ Cloudflare
➔ DDOS mitigation
Finding a function
that maps data to
values based on
previous observations
f(x) = y
3 Classes of ML Algorithms
SUPERVISED UNSUPERVISED
LEARNING LEARNING
f(x) = y f(x) = x
3 Classes of ML Algorithms
SUPERVISED UNSUPERVISED REINFORCEMENT
LEARNING LEARNING LEARNING
➔ Linear Optimization
(Dantzig + von Neumann)
Computational Optimization
Using computers to find the best:
➔ Hill Climbing
➔ Genetic Algorithms
➔ Simulated Annealing
➔ LIPO
Optimization / Machine Learning
Optimization Machine Learning
Mathematical Engineering
techniques to find techniques to find
best option within a best option. More folk
given search space or theorems here, like
numerical set. deep learning
architectures.
Optimization Theory: Pure ML
Machine Learning: Applied ML
OK but how much math
do I really need to know?
What should you focus on?
Pure Math Applied Math
Algebra Differential equations
Calculus, real and complex analysis Physics
Geometry, topology Computer Science
Combinatorics Information Theory
Logic Probability Theory + Statistics
Number Theory Game Theory
Optimization
Linear Algebra
What should you focus on?
Pure Math Applied Math
Algebra (Group Theory) Differential equations
Calculus, real and complex analysis Physics
Geometry, topology Computer Science
Combinatorics Information Theory
Logic Probability Theory + Statistics
Number Theory Game Theory
Optimization
Linear Algebra
Machine learning is like a stew
The optimized combination of:
+ Computer Science
+ Information Theory
+ Statistics
+ Probability
+ Domain expertise
Machine learning is like a stew
The optimized combination of:
+ Computer Science
+ Learn a programming language
+ Information Theory
+ Entropy
+ Statistics & Probability
+ Mean, Median, Mode
+ Distributions
+ Domain expertise
Information Theory: Entropy
Weather in Death Valley
10 codes: 10-4
KJ7IUD 73
Information Theory: Entropy
Statistics: Mean, Median, Mode
➔ Mode
The highest probability
➔ Median
50% is below and 50% above
➔ Mean
weighted average
Distributions to know
Discrete Continuous
Poisson Normal
Phone calls received Height in a classroom
Binomial Uniform
Heads / Tails on a coin Random number
Exponential
Particle decay
Discrete Distributions
Poisson Binomial
Continuous Distributions
Normal Uniform Exponential
Lying with Statistics
➔ Biased samples
➔ Biased averages
➔ Discarded data
➔ Graph manipulation
➔ The dead cat phenomena
➔ Correlation vs Causation
High interest credit card debt
➔ Entanglement ➔ Glue Code
➔ Hidden Feedback Loops ➔ Pipeline Jungles
➔ Undeclared Consumers ➔ Experimental Code Paths
➔ Unstable Data dependencies ➔ Configuration Debt
➔ Underutilized data dependencies ➔ Fixed Thresholds
➔ Correction Cascade ➔ Correlation Changes
http://bit.ly/1zwONap
Discussion
Time to stretch!
Visualizations
● Relevance to you
● Best practices
● Bad practices
● Analyze Iris with Jupyter notebook and Matplotlib
● Analyze thermostat readings using IBM Watson Studio
● Q&A
What works better?
My own story about
visualizations
The Tibetans believe in 8 senses
The occipital lobe is quite large
Seeing is important to
influence thought
Types of Visualizations
➔ Numerical
➔ Categorical
➔ Combination
➔ Maps
➔ Network
➔ Time series
Great resource: data-to-viz.com
The 6+1 Edward Tufte Principles
1. Show comparisons
2. Show causality
3. Use multivariate data
4. Complete integrate modes
5. Establish credibility
6. Focus on content
7. Ruthless pruning
Comparisons
Causality
Multivariate
Integrated
Credibility
➔ Authenticity with data
Where’d it come from?
How did it get here?
➔ Honesty with data
Telling the actual truth not the expected or perceived truth
➔ Open with data
Missing data
Outliers
Errors: precision, recall, accuracy, F1 score etc.
Content
➔ Minimize architecture
https://bit.ly/2m2r0n3
Visualize Fast with IBM Watson
https://bit.ly/2kszxj0
Churn Prediction
● Relevance to you
● Best practices
● Bad practices
● Using Watson to determine churn
● Discussion & Q&A
Churn at ClickFunnels
Expert Blindspot
You think they
Missed opportunity
Market Traction
Want
19% of startups failures are from
The sweet spot!
competitors winning [1]
Customers
No market Need
Irrelevant work
Don’t want
42% of startup
Time wasters
failures are here [1]
[1] Top 20 reasons why startups fail - CB Insights
CRISP-DM &
IBM Data Science &
TACT
CRISP-DM
Data Science Methodology
TACT
Target
Transmit
Arrange
Compose
Compose
Pro
Cu du
sto ct
Transmit me Rea Arrange
r R lm
ea
lm
Target
tactmethod.com
IBM Data Science Methodology
➔ Target the right target
Understand the business
Build your analytic approach
➔ Arrange the data
Gather requirements
Collect data
Understand the data
Prepare the data
➔ Compose a model
Model: AutoML
➔ Transmit
Evaluate
Deploy
Gather feedback
Discussion
Andon Cups
Churn Prediction with IBM
Import sklearn.model_selection as
cross_validtion
https://ibm.co/2mjdNXl
Decision Optimization
● Relevance to you
● Best practices
● Bad practices
● Using Watson to make decisions
● Discussion & Q&A
Emails at Conversica
Principles - Ray Dalio
➔ Bridgewater Associates:
biggest hedge fund.
➔ Runs the company on:
Principles
Algorithms
Recipes
➔ “Coach” app internal to
Bridgewater Associates
Objective
➔ Minimize / Maximize
➔ Argmin / Argmax
➔ Convexity / Non-convex
➔ Continuous / Discrete
➔ Differentiable / Not
➔ Unconstrained
https://bit.ly/2kMwe6j
Decision Optimization Examples
https://bit.ly/2mbdMoe
Why IBM Watson Studio?
What is your motivation?
If you stay till the end
https://ibm.co/2kybYS5
Free stuff
Discussion
What Machine Learning is…
and is not
● Algorithms that take data to insight
● The many uses of machine learning
● The classes
● What is Deep Learning, AI, Optimization in relation?
● What math should I know?
● The High Interest Credit Card Debt
Visualizations
● AutoAI
● Deployment solved
● Visualization distilled
● Deep Learning dispelled
● Collaborative work environment
● OSS Friendly: Python, R, Scala, Spark. Doesn’t matter
What was your number?
Miguel.Maldonado@ibm.com
IBM.com/MachineLearning
Rate today ’s session