You are on page 1of 7

Cross Industry Standard Practice – Data Mining

Reference Model
“The sequence of the phases is not rigid.

Moving back and forth between


different phases is always required.

It depends on the outcome of each


phase which phase or which particular
task of a phase, has to be performed
next.

The arrows indicate the most important


and frequent dependencies between
phases.”
Phase 1: Convert Business Objectives into a Data Mining Problem Definition
Phase 2: Collect & Get Familiar with Data – ID Quality Issues & 1st Insights
Phase 3: Selection, Transformation, & Cleaning + Feature Creation
Phase 4: Modeling Techniques Selected & Applied w/Calibrated Parameters
Phase 5: Review Model Steps & Results – Decide on Usage of Model

Evaluation also relies heavily on the output of phase 1:


• To confirm the model properly achieves the business objectives.
• And to determine if some important business issue has not been sufficiently considered.
Phase 6: A Simple Report or a Repeatable Process across an Enterprise

Often it is the customer, not the data analyst,


who carries out the deployment steps.

“However, even if the analyst will not carry


out the deployment effort it is important for
the customer to understand up front what
actions need to be carried out in order to
actually make use of the created models.”

You might also like