2. Extracting Actions in Decision Tree
For CRM applications, using a set of examples adecision tree can be built, which is described by set of attributes such as name, sex, birthday, etc., andfinancial information such as yearly income and familyinformation such as lifestyles and number of children.Decision tree is used in vast area of data mining because one can easily convert methods into rules andalso obtain characteristics of customers those who belong to a certain class. The algorithm that is used inthis paper do not only relay on prediction but also itcan classify the customers who are loyal and such rulescan be easily derived from decision trees.The first step is to extract rules when there is norestrictions in the number of rules that can be produced. This is called as unlimited resource case [3].The overall process of the algorithm is described asfollows:
Algorithm 1:
Step 1: Import customer data with data collection,data cleaning, data preprocessing and so on.Step 2: A Decision tree can be build usingdecision tree learning algorithm[11] to predict, if acustomer is in desire status or not. One improvementfor the decision building is to use the area under thecurve of the ROC curve [7].Step 3: Search for optimal actions for eachcustomer using the key component proactive solution[3].Step 4: Produce reports, for domain experts toreview the actions that deploy the actions.
2.1 A search for a leaf node in the unlimitedresources
This algorithm search for optimal actions andtransforms each leaf node to another node in the moredesirable fashion. Once the customer profile is built,the customers who are there in the training examplesfalls into a particular leaf node in a more desirablestatus thus the probability gain can then be convertedinto expected gross profit.When a customer is moved from one leaf toanother node there are some attribute values of thecustomer that must be changed. When an attributevalue is transformed from V1 to V2, it corresponds toan action that incurs cost which is defined in a costmatrix.The leaf node search algorithm searches allleafs in the tree so that for every leaf node ,a bestdestination leaf node is found to move the customer tothe collection of moves are required to maximize thenet profit.The domain specific cost matrix for the net profit of an action can be defined as follows:P
N
et=P
E
*Pgain-
∑
i COSTij (1)
Where P
N
et denotes the net profit, P
E
denotes the total profit the customer in the desired status, Pgain denotesthe probability gain, and COST
ij
denotes the cost of each action involved.The leaf node search algorithm for searchingthe best actions can be described as follows:
Algorithm: leaf-node search
1. For each customer x, do2. Let S be the source leaf node in which x falls into;3. Let D be a destination leaf node for x the maximumnet profit P
Net
.4. Output (S, D, Pnet);
An example of customer profile:
-Low Med HIGHL H0.1F M0.9 0.2 0.8 0.5Consider the above decision tree, the tree has fivenodes. A, B, C, D, E each with the probability of customers being a loyal. The probability of attritorssimply “1” minus this probability.Consider a customer Alexander who’s record statesthat the service=Low (service level is low), sex=M(male), and Rate =L (mortgage rate is low). Thecustomer is classified by the decision tree. It can beseen that Alexander falls into the leaf node B, which predicts that Alexander will have only a 20 percentchance of being loyal. The algorithm will now searchthrough all other leafs (A, C, D & E) in the decisiontree to see if Alexander can be “replaced” into a bestleaf with the highest net profit.1. Consider the leaf node A. which do not have a high probability of being loyal(90%), because the cost of action would be very high if Alexander should beC RATESEX