1. Project Objective
The objective of this report is to understand the banking data provided and to build CART and
Random Forest models for the personal loan campaign.
Dataset
The data set used for the project is PL_XSELL.csv, which contains banking summary data for
20,000 banking customers.
All 20,000 customers were targeted with an offer of a personal loan at a 10% interest rate, of whom
2,512 responded positively. The data is used to build classification model(s) that predict the
response of a new set of customers in the future, based on the attributes available in the data.
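The figures above imply a response rate that is worth making explicit, because it sets the baseline any classifier has to beat. A minimal sketch of the arithmetic (the variable names are illustrative and are not taken from the original analysis):

```python
# Figures stated in the report.
total_customers = 20_000
responders = 2_512

# Share of targeted customers who accepted the offer.
response_rate = responders / total_customers

# Accuracy of a trivial model that predicts "no response" for every customer.
majority_baseline_accuracy = 1 - response_rate

print(f"Response rate: {response_rate:.2%}")
print(f"Majority-class baseline accuracy: {majority_baseline_accuracy:.2%}")
```

Because responders are a small minority, a model should be judged against this baseline rather than on raw accuracy alone.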
2. Random Forest
4. Exploratory Data Analysis
Introduction
It is observed that the amount of debit transactions, the number of credit transactions, total debits,
total credits and total cash withdrawals are right-skewed.
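One way to check right skew numerically, rather than only from a plot, is the skewness coefficient: a positive value indicates a long right tail. The sketch below uses only the Python standard library; the sample values are made up for illustration and are not taken from PL_XSELL.csv.

```python
from statistics import mean, pstdev

def skewness(values):
    """Population skewness: the mean of the cubed z-scores."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return 0.0  # a constant column has no skew
    return sum(((v - mu) / sigma) ** 3 for v in values) / len(values)

# Made-up values for illustration only (hypothetical "total debits" column).
sample_total_debits = [100, 120, 150, 180, 200, 250, 400, 900, 2500]
symmetric_sample = [1, 2, 3, 4, 5]

print("Right-skewed sample:", skewness(sample_total_debits))
print("Symmetric sample:", skewness(symmetric_sample))
```

A clearly positive coefficient for a transaction column would support the observation above; a log transform is a common remedy when a model is sensitive to skew.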
The correlation plot shows no negative correlations.
5. Clustering
The second part of the question deals with selecting the ideal clustering technique and building a
cluster model.
Centroid-based clustering is the most suitable clustering mechanism for the given dataset, hence
k-means clustering is chosen and the cluster model is built.
The optimum number of clusters is 3, using Euclidean distance and k-means clustering.
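To illustrate what k-means with Euclidean distance does, here is a minimal standard-library sketch of Lloyd's algorithm with 3 clusters. It is not the code used in the original analysis; the toy points and the choice of initial centroids are assumptions made purely for illustration.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two points of equal dimension."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def kmeans(points, initial_centroids, max_iter=100):
    """Lloyd's algorithm: alternate assignment and centroid update steps."""
    centroids = [tuple(c) for c in initial_centroids]
    labels = []
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        labels = [
            min(range(len(centroids)), key=lambda k: euclidean(p, centroids[k]))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        new_centroids = []
        for k in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == k]
            if members:
                new_centroids.append(
                    tuple(sum(dim) / len(members) for dim in zip(*members))
                )
            else:
                new_centroids.append(centroids[k])  # keep an empty cluster in place
        if new_centroids == centroids:
            break  # converged: no centroid moved
        centroids = new_centroids
    return labels, centroids

# Toy 2-D data with three visually separate groups (illustrative only).
points = [(1, 1), (1.5, 2), (1, 1.5),
          (8, 8), (8.5, 9), (9, 8),
          (1, 9), (1.5, 8.5), (2, 9)]
labels, centroids = kmeans(points, [points[0], points[3], points[6]])
print(labels)
print(centroids)
```

Because Euclidean distance is sensitive to scale, variables measured in very different units (for example age versus transaction amounts) are usually standardised before clustering.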
6. CART Model Building and Evaluation
Complexity parameter
Success Segment: Customers with fewer than 6.5 debit transactions, age above 25 and below 50, and
cheque transaction amount below 8000
Meaning: Maximum probability of taking a personal loan

Success Segment: Customers with more than 6.5 debit transactions and more than 3.5 credit
transactions
Meaning: Maximum probability of taking a personal loan
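The two success segments read off the CART tree can be expressed as a simple rule check. The sketch below is an illustration only: the dictionary keys (`no_of_debit_txns`, `no_of_credit_txns`, `age`, `cheque_txn_amount`) are hypothetical names, not the actual column names in PL_XSELL.csv.

```python
def in_success_segment(customer):
    """Return True if the customer falls in either success segment.

    The keys used here are hypothetical field names chosen for illustration.
    """
    # Segment 1: few debit transactions, age 25 to 50, small cheque amounts.
    segment_1 = (
        customer["no_of_debit_txns"] < 6.5
        and 25 < customer["age"] < 50
        and customer["cheque_txn_amount"] < 8000
    )
    # Segment 2: many debit transactions and several credit transactions.
    segment_2 = (
        customer["no_of_debit_txns"] > 6.5
        and customer["no_of_credit_txns"] > 3.5
    )
    return segment_1 or segment_2

# Made-up customers for illustration.
young_low_activity = {"no_of_debit_txns": 3, "no_of_credit_txns": 1,
                      "age": 35, "cheque_txn_amount": 2000}
high_activity = {"no_of_debit_txns": 10, "no_of_credit_txns": 5,
                 "age": 60, "cheque_txn_amount": 20000}
outside_segments = {"no_of_debit_txns": 3, "no_of_credit_txns": 1,
                    "age": 60, "cheque_txn_amount": 2000}

print(in_success_segment(young_low_activity))
print(in_success_segment(high_activity))
print(in_success_segment(outside_segments))
```

The fractional thresholds (6.5, 3.5) come from the tree splitting between integer counts, so "fewer than 6.5" means 6 or fewer transactions.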
7. Random Forest model and evaluation
Random Forest model evaluation
The Random Forest model performs well on both the train and test data, and is more stable than
the CART model.
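One simple way to quantify "stable" is the gap between train and test accuracy: the smaller the gap, the less the model has overfitted the training data. The sketch below shows that calculation with the standard library; the label vectors are made up for illustration and do not reproduce the report's results.

```python
def accuracy(y_true, y_pred):
    """Fraction of predictions that match the true labels."""
    if len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must have the same length")
    return sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)

def train_test_gap(train_true, train_pred, test_true, test_pred):
    """Train accuracy minus test accuracy; a small gap suggests a stable model."""
    return accuracy(train_true, train_pred) - accuracy(test_true, test_pred)

# Made-up labels (1 = responded, 0 = did not respond), for illustration only.
train_true = [1, 0, 0, 1, 0, 0, 0, 1]
train_pred = [1, 0, 0, 1, 0, 0, 0, 0]
test_true = [0, 1, 0, 0]
test_pred = [0, 1, 0, 1]

gap = train_test_gap(train_true, train_pred, test_true, test_pred)
print(f"Train accuracy: {accuracy(train_true, train_pred):.3f}")
print(f"Test accuracy:  {accuracy(test_true, test_pred):.3f}")
print(f"Gap:            {gap:.3f}")
```

Computing the same gap for both the CART and Random Forest predictions gives a like-for-like comparison of stability.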
8. Conclusion
Hence, the bank can use the Random Forest model to identify potential customers for a personal
loan more accurately than with the CART model.