Data Mining: Chris Nelson CS 157 A Fall 2007
Chris Nelson
CS 157 A
Fall 2007
Data Mining
Data Dredging
The process of scanning a data set for relations and then
coming up with a hypothesis for the existence of those relations.
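The "scan first, hypothesize later" idea can be sketched in a few lines. This is only an illustration, assuming Pearson correlation as the measure of a "relation"; the column names, the values, and the 0.8 threshold are all hypothetical and not from the slides.

```python
from itertools import combinations
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric lists."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column has no defined correlation
    return cov / sqrt(var_x * var_y)


def scan_for_relations(columns, threshold=0.8):
    """Test every pair of columns and keep the strongly correlated ones."""
    found = []
    for (name_a, a), (name_b, b) in combinations(columns.items(), 2):
        r = pearson(a, b)
        if abs(r) >= threshold:
            found.append((name_a, name_b, r))
    # Strongest relations first
    return sorted(found, key=lambda t: -abs(t[2]))


# Hypothetical data, invented for this sketch
columns = {
    "ice_cream_sales": [10, 20, 30, 40, 50],
    "sunburn_cases": [1, 2, 3, 4, 5],
    "umbrella_sales": [3, 5, 1, 4, 2],
}

relations = scan_for_relations(columns)
for name_a, name_b, r in relations:
    print(f"{name_a} ~ {name_b}: r = {r:.2f}")
```

The scan only reports that two columns move together. Any explanation for why they do is a hypothesis formed after the fact, and testing many pairs makes it likely that some strong correlations appear by chance.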
Metadata
Data that describes other data. Can describe an individual
element, or a collection of elements.
Wikipedia example: In a library, where the data is the
content of the titles stocked, metadata about a title would
typically include a description of the content, the author, the
publication date, and the physical location.
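The library example can be written as a small data structure that keeps the data (the content) apart from the metadata that describes it. This is a minimal sketch; the class names, field names, and sample values are hypothetical and chosen only to mirror the four items in the example.

```python
from dataclasses import dataclass
from datetime import date


@dataclass
class TitleMetadata:
    """Data that describes a title, not the title's content itself."""
    description: str
    author: str
    publication_date: date
    shelf_location: str


@dataclass
class Title:
    content: str             # the data
    metadata: TitleMetadata  # the data about the data


# Hypothetical record, invented for illustration
book = Title(
    content="(full text of the book)",
    metadata=TitleMetadata(
        description="Introductory textbook on databases",
        author="A. Author",
        publication_date=date(2005, 1, 1),
        shelf_location="Floor 2, Shelf 14",
    ),
)

print(book.metadata.author, book.metadata.shelf_location)
```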
Applications of data dredging in business include market
and risk analysis, as well as trading strategies.
Applications in science include disaster prediction.
AI/Machine Learning
Combinatorial/Game Data Mining
Good for analyzing winning strategies in games, and thus
developing intelligent AI opponents (e.g., chess).
Business Strategies
Market Basket Analysis
Identify customer demographics, preferences, and purchasing
patterns.
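One common way to find purchasing patterns is to count how often items appear together in the same basket. The sketch below computes support and confidence for item pairs. The function name `pair_rules`, the baskets, and the 0.6 support cutoff are assumptions made for illustration, not something the slides specify.

```python
from collections import Counter
from itertools import combinations


def pair_rules(baskets, min_support=0.5):
    """Return (antecedent, consequent, support, confidence) for frequent pairs.

    support    = share of baskets containing both items
    confidence = share of baskets with the antecedent that also
                 contain the consequent
    """
    n = len(baskets)
    item_counts = Counter()
    pair_counts = Counter()
    for basket in baskets:
        items = sorted(set(basket))  # ignore duplicates within one basket
        item_counts.update(items)
        pair_counts.update(combinations(items, 2))

    rules = []
    for (a, b), count in pair_counts.items():
        support = count / n
        if support < min_support:
            continue
        rules.append((a, b, support, count / item_counts[a]))
        rules.append((b, a, support, count / item_counts[b]))
    return rules


# Hypothetical purchase data, invented for this sketch
baskets = [
    ["bread", "milk"],
    ["bread", "diapers", "beer"],
    ["milk", "diapers", "beer"],
    ["bread", "milk", "diapers", "beer"],
]

rules = pair_rules(baskets, min_support=0.6)
for a, b, support, confidence in rules:
    print(f"{a} -> {b}: support={support:.2f}, confidence={confidence:.2f}")
```

Here only the beer and diapers pair clears the cutoff: it appears in 3 of 4 baskets (support 0.75), and every basket containing one also contains the other (confidence 1.0).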
Risk Analysis
Product Defect Analysis
Analyze product defect rates for given plants and predict
possible complications (read: lawsuits) down the line.
Privacy Concerns
Wiki quote:
"data mining gives information that would not be
available otherwise. It must be properly interpreted
to be useful. When the data collected involves
individual people, there are many questions
concerning privacy, legality, and ethics."
Controversies continued
The verdict is still out. This may violate an old (100+ years) New York law
prohibiting advertising that uses endorsements without the endorsee's
consent.
Facebook currently offers users no way to opt out of Beacon (once it has
been activated?). Users can close their accounts, but account data is never
deleted.
Bottom Line