Professional Documents
Culture Documents
Introduction
Agenda
•Knowledge Pyramid
•Data mining Background
•What is data mining?
•*Main data mining objectives
•Current state of data mining
•Challenges
Knowledge Pyramid
Data, Information and Knowledge
• Data (D)
• Isolated factual recording of
separate objects and events
• Enables the recording of the
seen events
• Information (I)
• Fact of meaningful context
K
represented by relationships
between isolated data items
• Information enables the
responding to the seen
I
events
• Knowledge (K)
• Verified known information
D
that is accommodated into
the business process
• Enable the anticipation of the
unseen events
DIKW with Examples
Data mining Background
Current IT Trend
https://www.kdnuggets.com/2018/12/predictions-data-science-analytics-2019.html
8
Data Mining Background
• Facts:
• Storing the data is an operational necessity
• Storing the data has become easy and affordable
• Data acquisition is fully or partially automatic and fast
• Consequences:
• The speed of data comprehension does not match the speed of data
acquisition
• Many commercial database management systems (DBMSs) are not
equipped with data comprehension and analysis tools.
• We may be data rich, but information poor.
Data Mining Background
Describe
what already
happened.
Data Mining & Other Disciplines
Machine Learning
Statistics
(Artificial Intelligence)
DATA MINING
Database
Management
What is data mining?
Data Mining
sophistication
sophistication
High end of
Low end of
17
Also Known as …
Business intelligence (core component), big data analytics, predictive analytics,
knowledge discovery in database …
The story of diapers and beers
Automatic Credit Card Approval
The data mining objectives
Data Mining Objectives
• Classification
• Using existing data to form a classification model and then using the
model to assign an appropriate class label for a data record (e.g. safe
vs. risky customers)
• Estimation
• Similar to classification but to assign a value to an output variable of a
data record (e.g. estimated house value, stock price)
• Prediction
• Similar to classification and estimation, but more concerned with
future outcome of the output (e.g. tomorrow’s weather, coming
election outcomes)
• Description
• General description of data characteristics (e.g. customer profile)
Data Analytics Objectives
• Classification
• Using existing data to form a model and then to assign an appropriate class
label for a data record (e.g. safe vs. risky customers)
23
Data Analytics Objectives
• Estimation
• Similar to classification but to assign a value to an output variable of a data
record (e.g. estimated house value)
Data Analytics Objectives
• Prediction
• Similar to classification and estimation, but more concerned with future
outcome of the output (e.g. tomorrow’s weather)
Data Analytics Objectives
• Description
• describes real-world events and the relationships between factors
Current state of data mining
Current States
• Some nuisances
• Mining cookies
• Spyware and miningware
• Intrusion to privacy
• Some serious problems
• “Big Brother is watching”
• Unfair advantages in trading practice e.g. high-frequency trading
(HFT)
• Abuse of personal data
• Ethical concerns
Business Insider
https://www.businessinsider.com/foreign-intelligence-agents-china-spying-on-
americans-zoom-2020-4
Data Mining: Promises
• Algorithms
• Combinatorial problems and fast algorithms
• Comprehensibility of patterns
• Meaningful evaluation of the patterns
• Discovery of changing and evolving patterns
• Integration of data mining techniques
Summary