Digital Technology
Lecture 1: Introduction
Week | Topic | Hands-on coding lab
1 | Introduction to Machine Learning and Digital Technology | (none)
2 | Python for Reproducible Machine Learning | Python & Jupyter basics
3 | Data Design and Predictive Modelling | Human mobility data
4 | Building a Machine Learning Model for Prediction | Predicting wine quality
5 | Supervised Learning 1: Classification | Vaccine hesitancy
6 | Supervised Learning 2: Ridge and Lasso Regression | Predicting Airbnb prices
7 | Reading week | (none)
8 | In-class test (multiple-choice questions); Unsupervised Learning: Clustering | Clustering movie ratings
9 | Networks | Community detection
10 | Text Analytics 1: Text Processing and Sentiment Analysis | Amazon food reviews
11 | Text Analytics 2: Topic Discovery and Word Embeddings | Movie reviews
12 | Ethics, Bias, and Fairness in Machine Learning | Fairness of criminal risk models
Essential textbooks
• Link to the ebook via the QMUL subscription to O’Reilly Platform
• Link to the ebook via QMUL library
• Link to related tutorials in the form of Python Jupyter notebooks
• Link to code and data
• Ebook is freely available here
• Link to the ebook via the QMUL subscription to O’Reilly Platform
Advanced textbooks
Link to the ebook via the QMUL subscription to O’Reilly Platform Link to the ebook via the QMUL subscription to O’Reilly Platform
O'Reilly Learning Platform via QMUL Databases A–Z
https://go.oreilly.com/queen-mary-university-of-london
An open learning resource—links to many other resources
https://valdanchev.github.io/reproducible-data-science-python
Responsible Machine Learning (ML) & Data Analytics
[Figure: diagram of overlapping fields. Recoverable labels: Ethics; Fairness; Machine Learning; Computing / Digital Technology; Maths and Statistics; Business Knowledge; Software Development; Traditional Research; Responsible ML/Data Analytics]
Learning Goals
Data Opportunities and Computing
• Volume of data
• Variety of data (tabular, text, networks)
• Big data (data sets too large for
traditional data processing systems)
• Powerful computers & cloud computing
• Better algorithms
From Data to Business Decision Making
What is Machine Learning?
More definitions
Example: Spam filtering [Spam or Ham]
Business use cases
Business example: Predicting customer churn
• A large telecommunications firm is having a
major problem with customer retention: 20%
of cell phone customers leave when their
contracts expire
• Telecommunications companies are now
engaged in battles to attract each other’s
customers while retaining their own
• Marketing has already designed a special
retention offer
• Your task is to devise a precise, step-by-step
plan for how the data science team should use
the company’s vast data resources to solve the
problem
Machine Learning Tasks in Business
Machine Learning Tasks in Business: Examples
Types of Machine Learning
Supervised Machine Learning
• Is there a specific, quantifiable target that we are interested in or trying to predict?
- Think about the decision to churn
• Do we have data on this target?
- Do we have enough data on this target?
- As a rough guide, aim for at least ~500 examples of each class
• Do we have relevant data available before the decision is made?
- Think about the timing of the decision and the action
• The result of supervised data mining is a model that predicts some quantity
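The data-sufficiency question above can be checked with a few lines of Python. This is only a minimal sketch: the function name `classes_below_minimum`, the toy churn labels, and the use of 500 as a configurable threshold are assumptions made for illustration, not part of the lecture material.

```python
from collections import Counter


def classes_below_minimum(labels, min_per_class=500):
    """Return {class: count} for every class with fewer than min_per_class examples."""
    counts = Counter(labels)
    return {cls: n for cls, n in counts.items() if n < min_per_class}


# Hypothetical churn labels: 4,000 customers stayed, 300 churned.
labels = ["stay"] * 4000 + ["churn"] * 300

shortfall = classes_below_minimum(labels)
if shortfall:
    print("Too few examples for:", shortfall)
else:
    print("Every class has enough examples.")
```

In this toy case the "churn" class falls short of the threshold, which would suggest collecting more churn examples before modelling.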
Subclasses of Supervised Machine Learning
• Classification
- Categorical target
• Often binary (yes/no) but could also be multiclass (three or more classes)
• Regression
- Numeric target
Subclasses of Supervised Machine Learning
• ‘Will this customer purchase service 𝑆1 if given incentive 𝐼1?’
- Classification problem
- Binary target (the customer either purchases or does not)
• ‘Which service package (𝑆1, 𝑆2, or none) will a customer likely
purchase if given incentive 𝐼1?’
- Classification problem
- Three-valued target
• ‘How much will this customer use the service?’
- Regression problem
- Numeric target
- Target variable: amount of usage per customer
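One way to see which kind of problem a target defines is to inspect its values. The sketch below applies scikit-learn's `type_of_target` helper to three made-up targets that mirror the three questions above. The variable names and values are assumptions chosen for illustration.

```python
from sklearn.utils.multiclass import type_of_target

# Hypothetical targets mirroring the three questions above.
purchased = [1, 0, 0, 1, 1, 0, 1, 0]                               # bought S1 or not
package = ["S1", "S2", "none", "S1", "none", "S2", "S1", "none"]   # package chosen
usage_hours = [12.5, 3.2, 40.1, 0.7, 18.9, 7.4, 22.3, 5.6]         # amount of usage

for name, target in [("purchased", purchased),
                     ("package", package),
                     ("usage_hours", usage_hours)]:
    kind = type_of_target(target)
    # Continuous targets call for regression; categorical ones for classification.
    task = "regression" if kind.startswith("continuous") else "classification"
    print(f"{name}: {kind} target -> {task}")
```

The helper only looks at the values, so it is a quick sanity check rather than a substitute for deciding what the business question actually requires.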
How does supervised machine learning work?
Using supervised machine learning to induce a prediction
model from a training dataset
Example dataset
The target feature is what we want to predict; in the case below, this is whether the loan will be
repaid or will default, which the model predicts as a probability. The learning is supervised because
there is a target variable that supervises what the model is optimizing for.
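To make the idea of a target feature concrete, here is a minimal sketch that fits a classifier to a tiny loan dataset and predicts the probability of default for new applicants. The two features (income and loan amount), the eight training rows, and the choice of logistic regression are all assumptions for illustration; the dataset shown in the lecture may differ.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler

# Hypothetical training data: one row per past loan.
# Columns: annual income (thousands), loan amount (thousands).
X_train = np.array([
    [25, 20], [30, 25], [28, 30], [35, 40],   # these loans defaulted
    [60, 10], [75, 15], [90, 20], [55, 5],    # these loans were repaid
])
# Target feature: 1 = default, 0 = repaid.
y_train = np.array([1, 1, 1, 1, 0, 0, 0, 0])

# Scale the features, then fit a logistic regression classifier.
model = make_pipeline(StandardScaler(), LogisticRegression())
model.fit(X_train, y_train)

# Predict the probability of default for two new (unseen) applicants.
X_new = np.array([[32, 35], [80, 12]])
p_default = model.predict_proba(X_new)[:, 1]
print(p_default.round(2))
```

The target column `y_train` is what makes this supervised: the model is fitted so that its predictions agree with the known outcomes of past loans.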
Simple supervised machine learning model for prediction
Unsupervised machine learning
• Goal: discover structure in data
• We have input observations, but no target
feature.
• Used to identify groups of similar observations
using clustering techniques. For example, the
model might segment transactions into different
groups based on the amount of money, currency,
payment device, and other variables.
• Other examples:
- Topic discovery in text
- Community detection in network data
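The transaction-segmentation example can be sketched with k-means clustering in scikit-learn. The data below is synthetic, and the two features (amount and hour of day) and the choice of two clusters are assumptions made for illustration only.

```python
import numpy as np
from sklearn.cluster import KMeans
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(0)

# Hypothetical transactions with two features: amount and hour of day.
small_daytime = np.column_stack([rng.normal(20, 5, 50), rng.normal(13, 1, 50)])
large_night = np.column_stack([rng.normal(500, 50, 50), rng.normal(2, 1, 50)])
transactions = np.vstack([small_daytime, large_night])

# Put both features on a comparable scale before measuring distances.
X = StandardScaler().fit_transform(transactions)

# No target feature is supplied: k-means only sees the inputs.
kmeans = KMeans(n_clusters=2, n_init=10, random_state=0)
segments = kmeans.fit_predict(X)

print(np.bincount(segments))  # number of transactions in each segment
```

Note that the cluster labels (0 and 1) carry no meaning by themselves; interpreting what each segment represents is left to the analyst.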
Machine Learning Process in Business
• Business Understanding
• Data Understanding
• Data Preparation
• Modelling
• Evaluation
• Deployment
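The six stages above can be mapped onto a short scikit-learn script. This is a rough sketch under stated assumptions: the churn task, the synthetic customer data, the two features, and the assumed relationship between them and churn are all invented for illustration.

```python
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler

# Business understanding: predict which customers will churn (hypothetical task).

# Data understanding: synthetic stand-in for customer records.
rng = np.random.default_rng(42)
n = 400
monthly_charge = rng.normal(50, 15, n)
support_calls = rng.poisson(2, n)
# Assumed relationship, for illustration only: more support calls and
# higher charges make churn more likely.
score = 0.04 * (monthly_charge - 50) + 0.8 * (support_calls - 2)
churned = (score + rng.normal(0, 1, n) > 0).astype(int)

# Data preparation: build the feature matrix, hold out a test set, scale.
X = np.column_stack([monthly_charge, support_calls])
X_train, X_test, y_train, y_test = train_test_split(
    X, churned, test_size=0.25, random_state=0, stratify=churned
)
scaler = StandardScaler().fit(X_train)

# Modelling: fit a classifier on the training data only.
model = LogisticRegression().fit(scaler.transform(X_train), y_train)

# Evaluation: measure accuracy on customers the model has not seen.
accuracy = accuracy_score(y_test, model.predict(scaler.transform(X_test)))
print(f"Held-out accuracy: {accuracy:.2f}")

# Deployment: score a new customer using the same preparation steps.
new_customer = np.array([[70.0, 5]])
print(model.predict_proba(scaler.transform(new_customer))[0, 1])
```

In practice the stages are iterative rather than strictly sequential; evaluation results often send the team back to data preparation or even to the business question.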
Hands-on learning using accessible and user-friendly
computational tools
Python & Jupyter for AI/ML Research
Python
• Python is a free, open-source programming language
• Python is one of the world’s most popular programming languages, with a growing
community
• Python programming skills are in high demand on the job market
• The Python ecosystem includes fast, powerful, and flexible open source tools for
doing data science and AI/ML, such as Pandas, Seaborn, and scikit-learn
Jupyter Notebook and Colab
• Jupyter Notebook is an open-source web application that allows you to create and
share documents that contain code, equations, visualisations and text
• Supports a wide range of workflows in data science and machine learning
• Colab is a free environment that runs Jupyter notebooks on the Google Cloud and
requires no installation or setup.
User-friendly
interactive
computational
tools
• Prior knowledge of
programming is not
required
• Coding for ML/AI will be
taught from first
principles
Next week
Week 2
Lecture: Python for Reproducible Machine Learning
Reading:
• [HIML] Chapter 2
• Chapter 3 in David Amos et al., Python Basics: A Practical Introduction to Python
• Watch https://www.youtube.com/watch?v=inN8seMm7UI
Acknowledgements
Thank you