Data Mining and Data Visualization
SOM 485
Fall 2007
Getting Started
What is Data Mining?
Online Analytical Processing
Data Mining Techniques
Market Basket Analysis
Limitations and Challenges to Data Mining
Data Visualization
Siftware Technologies
Applications of DM
Customer Relationship Management (CRM) software is an application that can benefit from DM
Activities of CRM
One-to-One Marketing
Sales Force Automation
Sales Campaign Management
Marketing Encyclopedia
Call Center Automation
Verification of DM
Requires a lot of prior knowledge on the decision maker's part
Used mainly in casinos
e.g., can determine whether a new customer is a high roller, a souvenir buyer, a ticket purchaser, etc.
OLAP continued
Codd's 12 Rules for OLAP
1. Multidimensional View
2. Transparent to the User
3. Accessible
4. Consistent Reporting
5. Client-Server Architecture
6. Generic Dimensionality
7. Dynamic Sparse Matrix Handling
8. Multi-user Support
9. Cross-Dimensional Operations
10. Intuitive Data Manipulation
11. Flexible Reporting
12. Infinite Levels of Dimension and Aggregation
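The multidimensional view and the levels of aggregation named in the rules can be illustrated with a small roll-up. This is a minimal sketch, not an OLAP product: the fact records, the dimension names (region, product, quarter), and the roll_up helper are all assumptions made up for the example.

```python
from collections import defaultdict

# Hypothetical fact records: (dimension values, sales measure).
FACTS = [
    ({"region": "East", "product": "Cola", "quarter": "Q1"}, 100.0),
    ({"region": "East", "product": "Milk", "quarter": "Q1"}, 40.0),
    ({"region": "West", "product": "Cola", "quarter": "Q1"}, 70.0),
    ({"region": "West", "product": "Cola", "quarter": "Q2"}, 90.0),
]

def roll_up(facts, keep):
    """Sum the measure over every dimension that is not listed in `keep`."""
    totals = defaultdict(float)
    for dims, measure in facts:
        key = tuple(dims[name] for name in keep)
        totals[key] += measure
    return dict(totals)

# The same data viewed at two levels of aggregation.
by_region = roll_up(FACTS, ["region"])
by_region_product = roll_up(FACTS, ["region", "product"])
```

Passing a different list of dimensions to roll_up gives a different view of the same facts, which is the idea behind treating all dimensions generically.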
DATA MINING TECHNIQUES
FOUR MAJOR CATEGORIES
1. Classification
2. Association
3. Sequence
4. Cluster
CLASSIFICATION
- Mining processes intended to discover rules that define whether an item belongs to a particular class of data
- Two Sub-processes:
1) Building a Model
2) Predicting Classifications
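The two sub-processes can be sketched with a very small rule learner. This is only an illustration under stated assumptions: the training records, the avg_bet attribute, and the segment labels are hypothetical, and the "model" is simply the majority class seen for each attribute value.

```python
from collections import Counter, defaultdict

def build_model(records, attribute, label):
    """Sub-process 1: learn one rule per attribute value (its majority class)."""
    counts = defaultdict(Counter)
    for rec in records:
        counts[rec[attribute]][rec[label]] += 1
    return {value: c.most_common(1)[0][0] for value, c in counts.items()}

def predict(model, value, default=None):
    """Sub-process 2: apply the learned rules to a new item."""
    return model.get(value, default)

# Hypothetical labeled customers.
training = [
    {"avg_bet": "high", "segment": "high roller"},
    {"avg_bet": "high", "segment": "high roller"},
    {"avg_bet": "low", "segment": "souvenir buyer"},
    {"avg_bet": "low", "segment": "ticket purchaser"},
    {"avg_bet": "low", "segment": "souvenir buyer"},
]

model = build_model(training, "avg_bet", "segment")
new_customer_class = predict(model, "high")
```

Real classifiers use many attributes at once; the split into a training step and a prediction step stays the same.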
ASSOCIATION
Techniques that employ association search all details from operational systems for patterns with a high probability of repetition
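One common way to quantify "a high probability of repetition" is with support and confidence. The sketch below assumes those two measures and uses hypothetical transactions; the slides do not say which measures the course used.

```python
def support(transactions, items):
    """Fraction of transactions that contain every item in `items`."""
    items = set(items)
    return sum(1 for t in transactions if items <= set(t)) / len(transactions)

def confidence(transactions, antecedent, consequent):
    """Estimated probability of the consequent given the antecedent."""
    ante = support(transactions, antecedent)
    both = support(transactions, set(antecedent) | set(consequent))
    return both / ante if ante else 0.0

# Hypothetical transactions pulled from an operational system.
transactions = [
    {"cola", "potato chips"},
    {"cola", "potato chips", "pretzels"},
    {"milk", "frozen pizza"},
    {"cola", "milk"},
]

pair_support = support(transactions, {"cola", "potato chips"})
rule_confidence = confidence(transactions, {"cola"}, {"potato chips"})
```

Here the pair appears in 2 of 4 transactions, and 2 of the 3 cola transactions also contain potato chips.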
SEQUENCE
Time series analysis methods relate events in time based on a series of preceding events
Through analysis, various hidden trends, often highly predictive of future events, can be discovered.
Example: Mail Industry
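Relating an event to the one that preceded it can be sketched by counting which event most often follows each event. The event names below are hypothetical (a made-up mailing history), and this first-order count is a simplification of real time series methods.

```python
from collections import Counter, defaultdict

def next_event_counts(events):
    """Count, for each event, which events immediately follow it."""
    follows = defaultdict(Counter)
    for prev, nxt in zip(events, events[1:]):
        follows[prev][nxt] += 1
    return follows

def most_likely_next(follows, event):
    """Return the most frequent successor of `event`, or None if unseen."""
    if event not in follows:
        return None
    return follows[event].most_common(1)[0][0]

# Hypothetical event history for one customer.
history = [
    "catalog_sent", "order",
    "catalog_sent", "order",
    "catalog_sent", "no_response",
]

follows = next_event_counts(history)
prediction = most_likely_next(follows, "catalog_sent")
```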
CLUSTER
To create partitions so that all members of each set are similar according to some metric
Simply a set of objects grouped together by virtue of their similarity or proximity to each other
Example: Credit Card Transactions
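Partitioning by a similarity metric can be sketched with a one-dimensional k-means, using absolute difference in amount as the metric. The transaction amounts and starting centers are hypothetical, and k-means is just one clustering method chosen for the illustration.

```python
def kmeans_1d(values, centers, iterations=10):
    """Assign each value to its nearest center, then move centers to the mean."""
    centers = list(centers)
    clusters = [[] for _ in centers]
    for _ in range(iterations):
        clusters = [[] for _ in centers]
        for v in values:
            nearest = min(range(len(centers)), key=lambda i: abs(v - centers[i]))
            clusters[nearest].append(v)
        # Keep the old center if a cluster ends up empty.
        centers = [
            sum(c) / len(c) if c else centers[i]
            for i, c in enumerate(clusters)
        ]
    return centers, clusters

# Hypothetical credit card transaction amounts.
amounts = [12.0, 15.0, 14.0, 480.0, 510.0, 495.0]
centers, clusters = kmeans_1d(amounts, centers=[0.0, 100.0])
```

With these amounts the small purchases and the large purchases end up in separate partitions.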
DATA MINING TECHNOLOGIES
through discovery
Statistical Analysis: statistically evaluating products and making a decision based on logical reasoning
Neural Networks: attempt to mirror the way the human brain works in recognizing patterns by developing mathematical structures with the ability to learn
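A "mathematical structure with the ability to learn" can be sketched with a single perceptron, the simplest neural unit. The example is an assumption chosen for brevity: it learns the logical AND pattern from four hypothetical samples, using integer weights and a learning rate of 1.

```python
def train_perceptron(samples, epochs=10, lr=1):
    """Adjust weights and bias whenever the unit misclassifies a sample."""
    w = [0, 0]
    b = 0
    for _ in range(epochs):
        for (x1, x2), target in samples:
            output = 1 if w[0] * x1 + w[1] * x2 + b > 0 else 0
            error = target - output
            w[0] += lr * error * x1
            w[1] += lr * error * x2
            b += lr * error
    return w, b

def classify(w, b, x1, x2):
    """Fire (return 1) when the weighted sum exceeds zero."""
    return 1 if w[0] * x1 + w[1] * x2 + b > 0 else 0

# Hypothetical pattern to recognize: output 1 only when both inputs are 1.
AND_SAMPLES = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]

weights, bias = train_perceptron(AND_SAMPLES)
learned = [classify(weights, bias, x1, x2) for (x1, x2), _ in AND_SAMPLES]
```

Practical networks combine many such units in layers, but the learning step is the same idea: nudge the weights in the direction that reduces the error.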
DATA MINING TECHNOLOGIES (CONTINUED)
Method
[Product table. Columns: Frozen Pizza, Milk, Cola, Potato Chips, Pretzels. Rows: the same five products. Only one cell value survives: Frozen Pizza, 2.]
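A product-by-product table like this one is typically built by counting how often items appear together in the same basket. The sketch below assumes that reading of the table; the baskets are hypothetical and do not reproduce the values from the original slide.

```python
from collections import Counter
from itertools import combinations

def cooccurrence(baskets):
    """Count single-item and pairwise occurrences across baskets."""
    counts = Counter()
    for basket in baskets:
        items = sorted(set(basket))
        for item in items:
            counts[(item, item)] += 1  # diagonal: baskets containing the item
        for a, b in combinations(items, 2):
            counts[(a, b)] += 1  # off-diagonal: baskets containing both
    return counts

# Hypothetical baskets using the product names from the table.
baskets = [
    ["Frozen Pizza", "Cola"],
    ["Frozen Pizza", "Milk", "Cola"],
    ["Potato Chips", "Pretzels", "Cola"],
]

table = cooccurrence(baskets)
```

Pairs are stored in alphabetical order, so the count for Cola with Frozen Pizza lives under the key ("Cola", "Frozen Pizza").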
Data Visualization
Process by which numerical data are
Requires:
1. Data input
2. Data storage, retrieval, and query
3. Data transformation, analysis, and modeling
4. Data reporting
GIS continued
Spatial Data: elements stored in map form
Siftware Technologies
IBM
Informix
Red Brick
DB2
Oracle
Silicon Graphics
Sybase
Informix
Three-tier model
Tier 1: Client presentation layer
Tier 2: Hewlett-Packard hardware
Tier 3: Data layer, INFORMIX OnLine database
Three-dimensional Visualization
Visual models can save days and even months from the review process
Review
Data mining (DM)
Techniques used to mine data
Market Basket Analysis: The King of DM Algorithms
Review continued
Current Limitations and Challenges to Data Mining
Data Visualization
Siftware Technologies