Professional Documents
Culture Documents
Lecture 1 - Introduction
kdd.in.tum.de
15.10.2019
Register on Piazza
https://piazza.com/tum.de/fall2019/in2064
Data Mining
Introduction to Machine Learning 1 and Analytics
Self-driving cars and robotics
Data Mining
Introduction to Machine Learning 2 and Analytics
Game playing
Data Mining
Introduction to Machine Learning 3 and Analytics
Natural language processing
Data Mining
Introduction to Machine Learning 4 and Analytics
Web, ads and recommendations
Data Mining
Introduction to Machine Learning 5 and Analytics
Data science
Data Mining
Introduction to Machine Learning 6 and Analytics
Physics, biology and medicine
Data Mining
Introduction to Machine Learning 7 and Analytics
What unites all these technologies?
• Computer vision
• Natural language processing
• Recommender systems
• Computational advertising
• Robotics All are using Machine learning
• Artificial intelligence
• Data science
• Bioninformatics
• Many other fields
Data Mining
Introduction to Machine Learning 8 and Analytics
Hot topic
# of participants in IN2064 1000
500
0
2011 2013 2015 2017 2019
Year
Data Mining
Introduction to Machine Learning 9 and Analytics
What is Machine Learning?
Data Mining
Introduction to Machine Learning 11 and Analytics
Supervised learning
Model Predictions
f : X → Y ŷtest = f (Xtest )
Data Mining
Introduction to Machine Learning 12 and Analytics
Supervised learning: Classification
If the targets yi represent categories, the problem is called classification.
Examples
• Handwritten digit recognition
• Transaction classification
(fraud, valid)
• Object classification
(cat, dog, hotdog, ...)
• Cancer detection
Data Mining
Introduction to Machine Learning 13 and Analytics
Supervised learning: Regression
If the targets yi represent continuous numbers, the problem is called
regression.
Examples
• Stock market prediction
• Demand forecasting
• User involvement measurement
• Revenue analysis
Data Mining
Introduction to Machine Learning 14 and Analytics
Unsupervised learning
Unsupervised learning is concerned with finding structure in unlabeled
data.
Typical tasks
• Clustering
- Group similar objects together
• Dimensionality reduction
- Project down high-dimensional data
Data Mining
Introduction to Machine Learning 15 and Analytics
Unsupervised learning
Typical tasks - continued
• Generative modeling
- (Controllably) generate new
”realistic” data
• Topic models
- Discover hidden semantic
structures in text.
Data Mining
Introduction to Machine Learning 16 and Analytics
Other categories
• Reinforcement learning
- Learning by interacting with a dynamic environment. Goal is to
maximize rewards obtained by performing “desirable” actions.
• Semi-supervised learning
- Learning to combine lots of unlabeled data with a few labeled
examples for further prediction tasks.
• Active learning
- Learn while obtaining labels by querying an oracle.
• Learning to rank
- What are the relevant items for a given query?
(e.g. Netflix, web search, ad placement)
Source: https://beamandrew.github.io/deeplearning/2017/02/23/deep_learning_101_part1.html
Data Mining
Introduction to Machine Learning 18 and Analytics
General information
Staff
• Lecturer: Prof. Dr. Stephan Günnemann
• Teaching assistants:
Aleksandar Bojchevski, Johannes Klicpera, Anna Kopetzki, Aleksei
Kuvshinov, Oleksandr Shchur
Details
• 8 ECTS
• Language - English
Data Mining
Introduction to Machine Learning 19 and Analytics
Piazza
• Register at https://piazza.com/tum.de/fall2019/in2064
Use Piazza to ask questions - your emails will likely not be answered.
Data Mining
Introduction to Machine Learning 20 and Analytics
Schedule
Class hours
Mon 10:00–11:45 MW 2001 Lecture
Optional
We will hold a Q&A session where you can ask your questions in person.
Data Mining
Introduction to Machine Learning 21 and Analytics
Schedule & Logistics
Data Mining
Introduction to Machine Learning 22 and Analytics
Grading
Exam
• Written final exam in February
• 120 minutes
Data Mining
Introduction to Machine Learning 23 and Analytics
Tentative list of topics
Data Mining
Introduction to Machine Learning 24 and Analytics
Contents
Data Mining
Introduction to Machine Learning 25 and Analytics
What this course is not about
Data Mining
Introduction to Machine Learning 26 and Analytics
Recommended reading
Data Mining
Introduction to Machine Learning 27 and Analytics
Linear Algebra and Calculus
Read
• http://cs229.stanford.edu/section/cs229-linalg.pdf
• http://cs229.stanford.edu/summer2019/cs229-prob.pdf
• Bishop [ch. 1.2.0 - 1.2.3, 2.1 - 2.3.0]
Data Mining
Introduction to Machine Learning 28 and Analytics
Today & next week: k-NN and Decision Trees
Data Mining
Introduction to Machine Learning 29 and Analytics