Professional Documents
Culture Documents
Wirth, R., & Hipp, J. (2000). CRISP-DM: Towards a standard process model for data mining.
Paper presented at the Proceedings of the 4th international conference on the practical
applications of knowledge discovery and data mining.
A case study
• An online retailer has spent 90% of its marketing budget on attracting visitors to their
online store through many channels of advertising. Its CEO and other directors feel that
their online advertising campaigns are quite successful with “remarkable” growth in
traffic to their website and in “new” customers over the last few years. However, the
total revenue and profit do not meet their expectations, and this threatens their
business model.
• With their own business understanding and experience, they want to increase average
revenue per customer per year, which they believe that it is below its main competitors.
• Marketing Director proposes a strategy that incentivises customers to repurchase after
they have bought first purchase. Obviously, they have budget constraint in doing this.
The board of directors think of using business analytics.
A case study: Performance Lawn Equipment
• Using the data set “PerformanceLawnEquipmentData” and Performance Lawn Equipment (Case study)
document
• In groups of 4 or five, each of you play the following roles:
(1) Sales Director (looking after sales performance),
(2) Production and Delivery Director (looking after all production and deliveries of all orders),
(3) Support Service Director (looking at customers services, complaints, accounting, etc.), and
(4) CEO (Chief Executive Officer) who looks after overall business management and administration including
human resource management.
(5) Assistant to CEO whose main task is to manage some business analytics projects that the company plans
to do.
Tasks: (Be prepared to present a brief summary of your answers so please take notes while discussing.) Using
the CRISP-DM framework and discuss among your group?
Business
Understanding
• Determine business • Class discussion (10 minutes)
objectives • What are business problems or opportunities?
• The background • Set clear objectives (could be more than one
• The problems / objectives)
opportunities
• Agree on success criteria for BA proposal
• Project objectives
(initiatives)
• Success criteria
• What types of analytics involved?
• Identify and document what resources are
Business available?
Understanding • Data
• Staff & experience
• Budget
• Assess situations: go into
details of • What resources are required and then
• Resources available
identify constraints
• Requirements and • Other risks involved?
constraints (assumptions)
• Risks and contingencies • Evaluate costs and benefits
• Costs and benefits • Understand how action can be taken based
on the likely outcomes (how to deploy?)
• Not only in development stage but also in
deployment stage
• Data analysis goals:
Business
Understanding
• Project plan
• Project plan: • Tools and techniques
• Business problems • Classification of customers
• Business goals
• Resources & constraints
• Linear regression, logistic regression,
• Data analysis goals
regression tree
• Initial assessments of • Optimisation (what levels of incentives to
tools and techniques obtain higher total revenue or total profit)
Data
Understanding
• Collect initial data • High level:
• Describe data • Identify data sources and data fields
• Data volume and properties
• Review data strategy and documentation
• Accessibility and availability of
attributes • What data are relevant and in what formats
• Attribute types, range, correlation and
identifies (database, text files, excel etc.)
• Basis descriptive analysis
• Crucially, target data fields that maps to
• Explore data business/analytical objectives, e.g.
• Visualise and identify relationships