You are on page 1of 14

DATA MINING AND

TECHNIQUES
INTRODUCTION

• There is a huge amount of data available in the Information Industry.


This data is of no use until it is converted into useful information. It
is necessary to analyze this huge amount of data and extract useful
information from it.
CONTENT

 What is Data Mining?


 Why Data Mining?
 What is KDD Process?
 Data Mining Techniques
 Data Mining Query Language
 Applications of Data Mining
WHAT IS DATA MINING?

 Extraction of interesting Patterns or Knowledge


from huge amount of data

 (Knowledge Discovery from Data)

 One of the Step from KDD process


WHY DATA MINING?

The Explosive Growth of Data: from terabytes to


petabytes

 We are drowning in data, but starving for


knowledge!

Fraud detection and detection of unusual patterns


WHAT IS KDD PROCESS?

 Data cleaning
To remove noise and inconsistent data
 Data integration
Where multiple data sources may be combined
 Data selection
Related Data
 Data transformation
Unified format
 Data mining
Extract Patterns

 Pattern evaluation
To identify the truly interesting patterns
representing knowledge

 Knowledge presentation
Present the mined knowledge to the user
KDD PROCESS
DATA MINING TECHNIQUES

Classification

Clustering

Regression
CLASSIFICATION

 Classification is the process of predicting the class of a new


item.

 Therefore to classify the new item and identify to which


class it belongs
CLUSTERING
 Group Data into Clusters

 Similar data is grouped in the same cluster

 Dissimilar data is grouped in the same cluster


REGRESSION

 “Regression deals with the prediction of a value,


rather than a class.”
 Regression is a data mining function that predicts a
number
APPLICATIONS OF DATA MINING

 Education
 Customer Relationships Management (CRM)
 Future Health
 Lie Detection
 Financial Banking
 Research Analysis
 Criminal Investigation

You might also like