Professional Documents
Culture Documents
Concept Description
Characterization and Comparison
By : Mohammed A. Abdulhamed
Concept Description
Topics covered
• What is concept description?
• Data generalization and summarization-
based characterization
• Attribute-Oriented Induction
• Efficient Implementation of Attribute-
Oriented Induction
Concept Description
• data mining can be classified into two
categories:
1-Descriptive data mining describes the data set in
concise and summarative manner and presents
interesting general properties of the data.
2-Predictive data mining analyzes the data in order
to construct one or a set of models, and attempts to predict
the behavior of new data sets.
Concept Description
• Concept Description
*The simplest kind of descriptive data mining.
*A concept usually refers to a collection of data
such as frequent_buyers, graduate_students,
and so on.
*generates descriptions for characterization and
comparison of the data.
*It is sometimes called class description.
Concept Description
• Characterization : provides a concise
and succinct summarization of the given
collection of data.
• Comparison : (also known
discrimination) provides descriptions
comparing two or more collections of
data.
Characterization and
comparison techniques
*data generalization allowing data sets to
be generalized at multiple levels of
abstraction facilitates users in examining
the general behavior of the data.
Example : instead of examining individual
customer transactions, sales managers may
prefer to view the data generalized to higher
levels, such as summarized by customer
groups according to geographic regions.
Concept Description vs. OLAP
Birth_Region
Canada Foreign Total
Gender
M 16 14 30
F 10 22 32
Total 26 36 62
Thank you