Professional Documents
Culture Documents
1 Introduction
T. Washio et al. (Eds.): PAKDD 2008, LNAI 5012, pp. 884–889, 2008.
c Springer-Verlag Berlin Heidelberg 2008
Customer Churn Time Prediction in Mobile Telecommunication Industry 885
each of the M −1 classifiers will generate a model for each of the new datasets. For
each sample (customer) we estimate the probability that it belongs to a target
class (i.e., churn probability at that time) as follows: P r(t = t1 ) = 1 − P r(t =
t2 ∨ t = t3 . . . ∨ t = tM ) = P r(t > t1 ), P r(t = tk ) = P r(t > tk−1 ) − P r(t > tk )
k = 2, 3, . . . , M − 1, and P r(t = tM ) = P r(t > tM−1 ). To predict the churn
time of a customer with unknown churn time, the sample is processed by each
of the M − 1 classifiers and the class with maximum probability is assigned to
that customer.
In our experiments, we first start by grouping tenures into ranks such that
(ta , tb ] ≺ (tb , tc ] ≺ (tc , ∞), where ta < tb < tc < ∞. It is important to note
that this grouping into ranks could come from domain experts, like for example,
finding a set of customers who are likely to churn in 6-12 months period or finding
set of customers who are likely to stay for more than one year or two years.
In the next step, we repeat OR experiments on each rank, with a preference
level attached between each atomic time unit (that is, at month level). This
hierarchical way is taken to overcome the problem of large number of classes
present in the current problem.
100 100
90 90
80 80
70 70
Percent of churners
Percent of churners
60 60
50 50
40 40
30 month 7 30 month 7
month 8 month 8
month 9 month 9
20 month 10 20 month 10
month 11 month 11
Random Random
10 10
0 0
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
Decile Decile
100 100
90 90
80 80
70 70
Percent of churners
Percent of churners
60 60
50 50
40 40
30 month 12 30 month 12
month 13 month 13
month 14 month 14
20 month 15 20 month 15
Random Random
10 10
0 0
0 1 2 3 4 5 6 7 8 9 10 0 1 2 3 4 5 6 7 8 9 10
Decile Decile
References
1. Neslin, S.A., Gupta, S., Kamakura, W., Lu, J., Mason, C.H.: Defection detection:
Improving predictive accuracy of customer churn models. Working paper series,
Teradata center for customer relationship management at Duke university (2004)
2. Allison, P.D.: Survival analysis using the SAS system: A practical guide, SAS
Institute Inc, Cary, NC (1995)
3. Lu, J.: Predicting customer churn in telecommunications industry - An application
of survival analysis modeling using SAS. SAS User Group international online
proc., Paper No. 114-27 (2002)
4. SAS Institute Inc.: SAS/STATUsers
R Guide, Version 6. SAS Institute Inc., 1,2
(1989)
5. Bolton, R.N.: A dynamic model for the duration of the customer’s relationship
with a continuous service provider: The role of satisfaction. Marketing Science 17,
45–65 (1998)
6. Chu, W., Keerthi, S.S.: Support vector ordinal regression. Neural Computa-
tion 19(3), 145–152 (2007)
7. Frank, E., Hall, M.: A simple approach to ordinal classification. In: Proc. of the
European Conf. on Machine Learning, pp. 145–156 (2001)
8. Witten, I.H., Frank, E.: Data mining: Practical machine learning tools and tech-
niques with Java implementations. Morgan Kaufman, San Francisco (2005)
9. Cox, D.R.: Regression models and life tables. J. of the Royal Stat. Soc., Series
B 34, 187–220 (1972)
10. Gopal, R.K., Bhattacharyya, C., Meher, S.K.: Customer churn time prediction in
mobile telecommnunication industry using ordinal regression. ARG-TR-Y7-001,
Technical report, Satyam Computer Services Limited (2007)