Professional Documents
Culture Documents
Practical
SQL Server
Machine Learning Services
Thank you Sponsors
Jan Mulkens
Microsoft BI Consultant
Competence Lead: Microsoft Advanced Analytics
@JanMulkens
www.janmulkens.be
www.globalpowerbi.com
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
My employer
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Activities
Blog:
- www.janmulkens.be
- www.globalpowerbi.com
Organiser:
- Conference (www.powerbidays.com)
- User groups
- Microsoft Advanced Analytics (bit.ly/msaaug)
- Power BI Belgium (bit.ly/FlemishPowerBI)
- Virtual Power BI events (www.globalpowerbi.com)
- Internal Ordina events (BI Bake Off, Exam Preps, Fast Tracks, ...)
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Power BI Days
By & for the community
September 22 | 2018 Mechelen, Belgium
September 23 | 2018 Virtual
www.powerbidays.com
Activities
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Activities
Speaker & Mentor:
- Conferences, User Groups, Webinars
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Side effects do NOT include
• Becoming an expert
• Full depth and breadth of SQL Server ML Services
• Gaining statistical knowledge
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Side effects DO include
• Get started with Machine Learning
• Good practices
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
The usual ML talk
Not so classic
Introduction to
Machine Learning
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
What is Machine Learning
Machine learning explores the study and construction of
algorithms that can learn from and make predictions on
data. Such algorithms operate by building a model from
example inputs in order to make data-driven predictions
or decisions, rather than following strictly static program
instructinos.
Machine learning is closely related to and often overlaps
with computational statistics; a discipline that also
specializes in prediction-making. It has strong ties to
mathematical optimization, which delivers methods,
theory and application domains to the field. @JanMulkens
https://en.wikipedia.org/wiki/Machine_learning
Un événement #SQLSatParis – SQLSaturday Paris 2017
What is Machine Learning
@JanMulkens
http://dilbert.com/strip/2013-02-02
Un événement #SQLSatParis – SQLSaturday Paris 2017
Machine Learning
Traditional Programming: Data and program is run on the
computer to produce the output.
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Types of algorithms
https://en.wikipedia.org/wiki/Outline_of_machine_learning
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Types of algorithms
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Types of algorithms
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Applications Machine Learning Types
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Applications
• Spam filters • Player matching
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Applications • Recommendations
• Chat bots
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Standard demo
SQL Server SQL Server
2016 2017
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Standard demo
• Install and enable R Services
• sp_execute_external_script: Hello world
• Basic output of data with R Services
• Basic output of data – with a twist (column names! Wow!)
• Basic statistics operation and output
• Create stored procedure and save data
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
What you should
actually know
Revolution Analytics
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
In-database
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Machine Learning Services
Data Analyst / Scientist T-SQL Developer
Preprocess
Pick ML
Split Data
Algorithm
Train Model
Score Model
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Import data
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Basic workflow Import Data
Preprocess
Pick ML
Split Data
Algorithm
Train Model
Score Model
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Preprocess data
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Basic workflow Import Data
Preprocess
Pick ML
Split Data
Algorithm
Train Model
Score Model
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Split data
Original dataset
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Basic workflow Import Data
Preprocess
Pick ML
Split Data
Algorithm
Train Model
Score Model
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Pick an algorithm
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Basic workflow Import Data
Preprocess
Pick ML
Split Data
Algorithm
Train Model
Score Model
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Basic workflow Import Data
Preprocess
Pick ML
Split Data
Algorithm
Train Model
Score Model
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Evaluation!
https://towardsdatascience.com/metrics-to-evaluate-
your-machine-learning-algorithm-f10ba6e38234
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Basic workflow
Predictive
Experiment
Training experiment
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Demo
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
How to
Get Started
Microsoft Team Data Science Process
• Data science lifecycle definition
• Standardized project structure
• Infrastructure and resources for data science
projects
• Tools and utilities for project execution
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Microsoft Team Data Science Process
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Business Understanding
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Business Understanding
Project Lead Project Manager
Project Charter
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Data Acquisition & Understanding
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Data Acquisition & Understanding
Solution Architect Project Lead Data Scientist
Solution
Data Summary
Architecture
Report
Diagram
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Modeling
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Modeling Data Scientist
Feature Model
Engineering Development
Model Report
Solution Architect
Develop Data
Pipeline
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Deployment
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Deployment Data Scientist
Dashboard
Solution Architect
Deploy Pipeline
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Acceptance
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
Acceptance
Project Lead Project Manager
Transition to
Checkpoint Project Finalize
Production Support
Documentation
Data Scientist
@JanMulkens
http://aka.ms/tdsp
Un événement #SQLSatParis – SQLSaturday Paris 2017
How to advance
What’s your ML test score?
A rubric for ML production systems
• https://ai.google/research/pubs/pub45742
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Start now
Let the Data Science Process be your guide
https://aka.ms/tdsp
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Start now
Get the project templates
https://github.com/Azure/Azure-TDSP-ProjectTemplate
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Start now
Start learning R
• https://www.datacamp.com/courses/free-
introduction-to-r
Or learn Python
• https://www.edx.org/course/introduction-python-
absolute-beginner-microsoft-dev236x
• https://www.edx.org/course/introduction-python-
fundamentals-microsoft-dev274x
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Reading Material
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Reading material
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Blogs to read
• Ginger Grant (@DesertIsleSQL)
• Stairway to Machine Learning Services
http://www.sqlservercentral.com/stairway/171757/
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Viewing Material (old to new)
Machine Learning with R and SQL Server 2016
• https://www.youtube.com/watch?v=LvQYiVj4TNQ
How to build machine learning applications using R and Python in SQL Server 2017
• https://www.youtube.com/watch?v=b9Xhvyfx6qk
How to use R, Python, and machine learning with Microsoft SQL Server 2017
• https://www.youtube.com/watch?v=6K9IY_4ghn0
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
@JanMulkens
Un événement #SQLSatParis – SQLSaturday Paris 2017
Contact Jan
Jan@JanMulkens .be
Un événement
Speaker help (FREE!) www.SpeakingMentors.com @JanMulkens
#SQLSatParis – SQLSaturday Paris 2017
Thank you Sponsors