Professional Documents
Culture Documents
CITATIONS READS
20 144
1 author:
Hussein A Abdou
University of Huddersfield
24 PUBLICATIONS 216 CITATIONS
SEE PROFILE
All in-text references underlined in blue are linked to publications on ResearchGate, Available from: Hussein A Abdou
letting you access and read them immediately. Retrieved on: 11 May 2016
The current issue and full text archive of this journal is available at
www.emeraldinsight.com/1526-5943.htm
JRF
10,1 An evaluation of alternative
scoring models in private banking
Hussein A. Abdou
38 Salford Business School, Salford University, Salford, UK
Abstract
Purpose – This paper aims to investigate the efficiency and effectiveness of alternative
credit-scoring models for consumer loans in the banking sector. In particular, the focus is upon the
financial risks associated with both the efficiency of alternative models in terms of correct
classification rates, and their effectiveness in terms of misclassification costs (MCs).
Design/methodology/approach – A data set of 630 loan applicants was provided by an Egyptian
private bank. A two-thirds training sample was selected for building the proposed models, leaving a
one-third testing sample to evaluate the predictive ability of the models. In this paper, an investigation
is conducted into both neural nets (NNs), such as probabilistic and multi-layer feed-forward neural
nets, and conventional techniques, such as the weight of evidence measure, discriminant analysis and
logistic regression.
Findings – The results revealed that a best net search, which selected a multi-layer feed-forward net
with five nodes, generated both the most efficient classification rate and the most effective MC. In
general, NNs gave better average correct classification rates and lower MCs than traditional techniques.
Practical implications – By reducing the financial risks associated with loan defaults, banks can
achieve a more effective management of such a crucial component of their operations, namely, the
provision of consumer loans.
Originality/value – The use of NNs and conventional techniques in evaluating consumer loans
within the Egyptian private banking sector utilizes rigorous techniques in an environment which
merits investigation.
Keywords Banking, Financial risk, Scoring procedures (tests)
Paper type Research paper
Introduction
Recently, the role of effective management of different financial and operational risks is
especially important for bankers who have come to realize that banking operations
affect and are affected by economic and social environmental risks that they face. It is
believed that the Egyptian banking sector has been “tough” since 1999, and is “expected
to remain so” for the performance of the banking sector in Egypt has shown an “ongoing
profitability weakness due to revenue pressure” and a high incidence of problem loans
(Oldham and Young, 2004; CBE, 2006/2007). Therefore, the role that scoring techniques
can play is critical in helping to reduce the current and/or the expected risk they face,
because of an inadequate risk-reduction through efficient diversification.
Especially, with the fast growth in the credit industry and the huge loan portfolio
management, credit scoring is regarded as a one the most important techniques in banks
and has become a very critical tool during recent decades. A number of credit-scoring
The Journal of Risk Finance models have been developed to evaluate the credit risk process of both new and existing
Vol. 10 No. 1, 2009
pp. 38-53 loan clients. Scoring techniques assess, and therefore help to decide, “who will get credit,
q Emerald Group Publishing Limited
1526-5943
how much credit they should get, and what operational strategies will maintain the
DOI 10.1108/15265940910924481 profitability of the borrowers to the lenders” (Thomas et al., 2002; Hand and Jacka, 1998).
Statistical techniques, such as discriminant analysis (DA), regression analysis, and Alternative
logistic regression (LR), used in building the scoring models have been examined scoring models
(Crook et al., 2007; Greene, 1998; Altman et al., 1994; Orgler, 1971). A few studies have
explored the scoring models using the weight of evidence (WOE) measure, or in terms
of poor as good or good and bad credit, also results were comparable with those from
other techniques (Banasik et al., 2003; Bailey, 2001; Siddiqi, 2006). The evaluation of
new client’s loans has attracted some attention in the last ten decades and it is 39
considered as one of the most critical applications of credit-scoring models (Sarlija et al.,
2004; Chen and Huang, 2003; Malhotra and Malhotra, 2003).
The neural network models have the highest average correct classification (ACC)
rate when compared with other traditional techniques, such as DA and LR, taking into
accounts that results were very close. A few credit-scoring models have investigated
the use probabilistic neural nets (PNNs) (Ganchev et al., 2007; Zekic-Susac et al., 2004;
Masters, 1995), whilst many scoring models applying multi-layer feed-forward nets
(MLFNs) have been used (Erbas and Stefanou, 2008; Dimla and Lister, 2000; West,
2000; Reed and Marks, 1999; Desai et al., 1996; Bishop, 1995).
Other statistical models, as well as neural networks and fuzzy algorithms have been
used in building scoring models (Tsai and Wu, 2008; Anderson, 2007; Crook et al., 2007;
Nur Ozkan-Gunay and Ozkan, 2007; Yu et al., 2007; Seow and Thomas, 2006; Hand
et al., 2005; Zhang and Bhattacharyya, 2004). Furthermore, comparisons between
conventional and advanced statistical scoring techniques have been investigated as
well (Lee and Chen, 2005; Ong et al., 2005; Lee et al., 2002; Desai et al., 1997).
Comparisons have also been extended to include other neural networks, such as
back-propagation and feed-forward nets (Min and Lee, 2008; Malhotra and Malhotra,
2003; Arminger et al., 1997). Neural network models are better representations of data
than LR and CART, as shown by the statistical association measures (Zekic-Susac et al.,
2004; Zhang et al., 1999), while other traditional techniques, such as DA, in general, as
noted by Liang (2003) has a better classification ability but worse prediction ability,
whereas LR has a moderately better prediction ability.
The chosen environment is the Egyptian banking sector. Indeed, from the review of
literature to date, the author was not aware of other studies in Egypt in covering
credit-scoring techniques. Therefore, the intention was to cover this gap, which was found
in the Egyptian banking sector. This paper is organized as follows: part two details the
research methodology and data collection. Part three explains the results. Finally, part
four concludes the results of the study and suggests areas for future research.
WOE can be calculated from the IO using the logarithmic function, which can be
considered as raw scores, as follows:
WOE ¼ lnðIOÞ:
To determine the Point Score, or WOE Score, the following equation might be used:
X P
Point Score ¼ £ Rw £ ½WOE þ c ;
lnð2Þ
where, P – is the score at which the odds are doubled; Rw – is the correlation coefficient
(from a multiple regression) between the respective variable and the WOE for the
variable; c – is a constant applied to each variable (Bailey, 2001; Siddiqi, 2006).
Discriminant analysis
However, as to the statistical assumptions implicit in implementation, DA requires the
data to be independent and normally distributed. Consequently, the general formula
of DA is as follows:
Z ¼ a þ b 1 X 1 þ b2 X 2 þ · · · þ bn X n ;
where Z represents the discriminant Z-score, a is the intercept term, and bi represents
the respective coefficient in the linear combination of explanatory variables, Xi, for
i ¼ 1, . . . , n (Lee et al., 2002).
Logistic regression
LR is a widely used statistical modeling technique in which the probability of a binary
outcome (zero or one) is related to a set of potential predictor variables in the form:
p
ln ¼ a þ b1 X 1 þ b 2 X 2 þ · · · þ b n X n ;
12p
where p is the probability of the outcome of interest, a is the intercept term, and bi Alternative
represents the respective coefficient in the linear combination of explanatory variables, scoring models
Xi, for i ¼ 1, . . . , n. The dependent variable is the logarithm of the odds
{ln½p=ð1 2 pÞ}, which is the logarithm of the ratio of two probabilities of the
outcome of interest (Lee et al., 2002).
Figure 1.
PNN and MLFN
Inputs Pattern Summation Output Inputs 1st
hidden 2ndhidden Output structures
Layer Layer Layer Layer
JRF Neural Tools software the sigmoid function is not utilized (Figure 1). The reason is to
10,1 avoid a restriction on outputs values, to create a superb model for training purposes.
Empirical results
In order to run the proposed models, SPSS 15.00, STATGRAPHICS Plus 5.1, and
Neural Tools software were used in this research. The detailed credit scoring results
WOE
G 95 196 291 32.65 48 94 142 33.80
B 0 98 98 100.00 0 50 50 100.00
T 389 49.61 192 51.04
WOET1
G 154 137 291 52.92 80 62 142 56.34
B 4 94 98 95.92 5 45 50 90.00
T 389 63.75 192 65.10
WOET2
G 120 80 200 60.00 62 32 94 65.96
B 4 94 98 95.92 5 45 50 90.00
T 298 71.81 144 74.31
WOET3
G 211 80 291 72.51 110 32 142 77.46 Table II.
B 4 94 98 95.92 5 45 50 90.00 Classification results
T 389 78.41 192 80.73 for the WOEs: predictions
(in columns) versus
Notes: Cut-off point 0.50. G ¼ good; B ¼ bad; T ¼ total observations (in rows)
JRF WOET2 samples, Table II summarizes classification results for this model. The ACC
10,1 rates for training and testing samples were 78.41 and 80.73 per cent, respectively.
Discriminant analysis
Taking into account that DA credit-scoring models were designed to develop
discriminating functions, which can help predict the dependent variable? All the 12
44 predicted variables were entered. The one discriminating function with a P-value of
0.0000 was statistically significant at the 95 per cent confidence level. Table III
summarizes classification results for DA model. It can be observed that the ACC rate
for the training sample was 87.40 per cent, depending on 0.5 prior probabilities for
groups; while an 85.42 per cent ACC rate was found using the holdout sample.
Logistic regression
Four variables, namely ADD INC, HOR, MAR STA, and GENDER were not significant
at the 90 per cent confidence level. They were kept in the final model because of their
potential importance and for the comparison purposes with other techniques. Because
the P-value for the LR model was less than 0.01, there was a statistically significant
relationship between the variables at the 99 per cent confidence level. In addition, the
P-value for the residuals was greater than or equal to 0.10, indicating that the model
was not significantly worse than the best possible model for this data at the 90 per cent
or higher confidence level. Table III summarizes the results of the LR credit-scoring
model, using the original 12 independent variables. It can be observed that the ACC
rates were 88.95 and 82.81 per cent for training and testing samples, respectively.
DA
G 254 37 291 87.29 125 17 142 88.03
B 12 86 98 87.76 11 39 50 78.00
T 389 87.40 192 85.42
Table III. LR
Classification results for G 272 19 291 93.47 135 7 142 95.07
the DA and LR; B 24 74 98 75.51 26 24 50 48.00
predictions (in columns) T 389 88.95 192 82.81
versus observations
(in rows) Note: Cut-off point 0.50
same predictor variables. A 97.69 and 89.06 per cent ACC rates were found Alternative
applying training and testing samples, respectively, as revealed in Table IV. scoring models
Powerful neural net models
The main difference between PNN and MLFN and powerful NN models was that for the
former, the data cases in both training and testing samples were selected judgmentally
by dividing the whole data-set into a 33 per cent testing sample and a 67 per cent training 45
sample (the same samples data-sets were used with other scoring techniques used in this
paper namely, WOE, DA, and LR). By contrast, for the later, the data cases in the testing
and the training samples were automatically selected by the Neural Tools software,
applying a different 33 per cent data-set as a testing sample and a different 67 per cent
data-set as a training sample for each trial.
The experiment was repeated several times, the reason for repeating the process was
to investigate whether different results, in terms of average correct classification rate,
were being achieved because of the random selection procedure as part of the software
design. Finally, there is evidence of significant differences between the powerful neural
net models, as described below. Following are the classification results for the best
powerful NN models under each of the 20 PNN, MLFN and BNS models, respectively.
From results revealed in Table V, PNN3 was (trial number three applying PNNs) the
best model (based on the highest overall testing ACC rate), between all the 20 powerful
PNN models, achieving a 94.09 per cent and an 89.58 per cent ACC rates applying
training and testing samples, respectively. While three different MLFNs were the best
(based on the highest predictive ability) between the 20 powerful NN models, MLFN5,
MLFN7 and MLFN10. ACC rates in the training samples were 95.12, 96.66 and 96.40 per
cent for MLFN5, MLFN7 and MLFN10, respectively; and an 86.98 per cent ACC rate for
the three nets in the testing samples, as shown in Table V.
As explained earlier PNN and MLFN from two to six nodes, this was an option
under BNS, as a part of the currently used software, were applied in this paper as well.
It can be observed from Table V that BNS4-MLFN-5N (BNS4-MLFN-5N means trial
number four under the BNS with MLFN selecting five nodes as a best net) was the best
model, between all the 20 powerful BNS models. A 94.60 and 91.15 per cent ACC rates
were found in the training and testing samples, respectively, (for more details
regarding all models including all trials, see the Appendix).
As it shown in Table VI, there is evidence of significant differences between NN
models; the ANOVA F-ratio was 13.04. It was significant at 99 per cent confidence level.
PNN
G 288 3 291 98.97 135 7 142 95.07
B 11 87 98 88.78 24 26 50 52.00 Table IV.
T 389 96.40 192 83.85 Classification results
MLFN for the PNN and MLFN;
G 287 4 291 98.63 133 9 142 93.66 predictions (in columns)
B 5 93 98 94.90 12 38 50 76.00 versus observations
T 389 97.69 192 89.06 (in rows)
JRF
Training sample Testing sample
10,1 Sample model G B T Overall percentage G B T Overall percentage
PNN3
G 276 8 284 97.18 141 8 149 94.63
B 15 90 105 85.71 12 31 43 72.09
46 T 389 94.09 192 89.58
MLFN5
G 278 9 287 96.86 135 11 146 92.47
B 10 92 102 90.20 14 32 46 69.57
T 389 95.12 192 86.98
MLFN7
G 280 10 290 96.55 126 17 143 88.11
B 3 96 99 96.97 8 41 49 83.67
T 389 96.66 192 86.98
MLFN10
G 279 8 287 97.21 134 12 146 91.78
Table V. B 6 96 102 94.12 13 33 46 71.74
Classification results for T 389 96.40 192 86.98
the best powerful NNs BNS4-MLFN-5N
(60 models comprising G 279 18 297 93.94 123 13 136 90.44
20 PNN, 20 MLFN and B 3 89 92 96.74 4 52 56 92.86
20 BNS) T 389 94.60 192 91.15
Powerful NN models
PNN MLFN BNS Overall
Count 20 20 20 60
Average (mean) 84.5055 83.6980 87.2140 85.1392
SD 2.18984 2.74707 1.80751 2.70667
ANOVA F-ratio – – – 13.04 * * *
Fisher’s least significant difference test
PNN-MLFN – – – 0.8075
PNN-BNS – – – 2 2.7085 * *
MLFN-BNS – – – 2 3.5160 * *
Cochran’s C-test – – – 0.483468
Bartlett’s test – – – 1.060090
Levene’s test – – – 1.604170
Kruskal-Wallis median test statistic
Average rank 25.10 21.60 44.80 –
Table VI. Test statistic – – – 20.6185 * * *
A comparative statistical
evaluation of NN models Notes: Statistically significant difference at: * *5 and * * *1 per cent levels, respectively
Besides, all the neural net models namely, PNNs, MLFNs, BNSs are significantly
different at 95 per cent confidence level as revealed by Fisher’s least significant
difference test (there was no statistically significant differences between PNN and MLFN
powerful models). Since the smallest of the P-value was greater than or equal to 0.05,
there was not a statistically significant differences amongst the standard deviations at
the 95 per cent confidence level according to the Cochran’s C/Bartlett’s/Levene’s tests.
Moreover, the Kruskal-Wallis median test statistic shows statistically significant Alternative
differences at 99 per cent confidence level for NNs models with test statistics 20.62, scoring models
which means that the average correct classification rates are significantly different in
each neural net.
Differences between different NN models can also be observed in the graphical
analysis in Figure 2. In the BOX-and-Whisker plot reveals that the inter-quartile ranges
for (colored shaded boxes) Categories 0 and 1 do overlap, but not Category 2; and 47
Category 2 has a narrower inter-quartile range; and the medians are different for the
three categories. The analysis of means plot with a 95 per cent decision limit reveals
that Category 0 is between the CL and the LDL; whilst Category 2 is higher than the
UDL and Category 1 is lower than the LDL.
86
CAT
1 LDL = 84.14
85
2 84
83
77 80 83 86 89 92 0 1 2
Testing ACC CAT
Notes: ACC = average correct classification; CAT = category: 0 = PNN, 1 = MLFN, and 2 = BNS.
Box-and-Whisker Plot: the vertical line within the colored shaded area represents the median; the right
end of the colored shaded area is equal to the upper quartile; the left end of the colored shaded area
is equal to lower quartile; the right end of the “whisker” represents the minimum of (i) the upper Figure 2.
quartile plus the inter-quartile range, and (ii) the maximum response; the left end of the “whisker” equals Box-and-whisker plot and
the maximum of (i) the lower quartile minus the inter-quartile range, and (ii) the minimum response; analysis of means plot
outliers are shown individually. with 95 per cent decision
Analysis of Means Plot with 95% Decision Limits: CL = Central Limit (overall mean), limits
UDL = Upper Decision Limit, LDL = Lower Decision Limit
JRF (bad credit is misclassified as good credit) error is much higher than the MC associated
10,1 with Type I (good credit is misclassified as bad credit) error, which is also true in other
case studies based on housing loans (Lee and Chen, 2005). Hofmann, who compiled his
German credit data, reported that the ratio of MCs associated with Types II and I is 5:1,
as noted by West (2000). In this paper, this relative cost ratio will be used to calculate
the EMC for the proposed models (MCs have been calculated for all models including
48 all trial-applications, see the Appendix). The prior probabilities of good and bad credit
are set as 74.5 and 25.5 per cent, respectively, using the ratio of good and bad credit in
the Egyptian data-set.
Table VII summarizes the ACC rates, errors, and EMCs results for conventional
techniques (classifications based on a 0.50 cut-off point) namely, WOE, DA, and LR,
and NN techniques namely, PNN, PNN3, MLFN, MLFN5, MLFN7, MLFN10 and
BNS4-MLFN-5N.
As concluded in Table VII, DA had the highest ACC rate at 85.42 per cent, amongst
all conventional techniques. Meanwhile, BNS4-MLFN-5N has the highest ACC rate at
91.15 per cent, amongst all techniques. All models predict good credit better than
bad credit, except two models namely, WOE and BNS4-MLFN-5N. In addition, the
highest correctly classified bad credit was 100.00 per cent for WOE, whilst the highest
correctly classified good credit was 95.07 per cent for both LR and PNN. It can be
concluded from Table VII, that the average performance of the NN models was better
than the average performance of the conventional models.
Furthermore, comparing conventional techniques the MC at 0.3697 was for DA.
That was the chosen model according to the ACC rate at 85.42 per cent. Comparing
NN models the lowest MC at 0.1623 was for BNS4-MLFN-5N amongst these
techniques. That was the chosen model, according to the ACC rate at a 91.15 per cent
ACC rate (Table VII). At last, comparing all conventional and NN techniques, the
highest ACC rate, which was 91.15 per cent, leads to selecting BNS4-MLFN-5N.
However, this does provide the lowest EMC, which was 0.1623 (for more details
regarding ACC rates and EMCs for all models including all trial-applications, see the
Appendix, Table AI).
References
Altman, E.I., Marco, G. and Varetto, F. (1994), “Corporate distress diagnosis: comparisons using
linear discriminant analysis and neural networks (the Italian experience)”, Journal of
Banking & Finance, Vol. 18 No. 3, pp. 505-29.
Anderson, R. (2007), The Credit Scoring Toolkit: Theory and Practice for Retail Credit Risk
Management and Decision Automation, Oxford University Press, New York, NY.
Arminger, G., Enache, D. and Bonne, T. (1997), “Analyzing credit risk data: a comparison of
logistic discriminant, classification tree analysis, and feedforward networks”,
Computational Statistics, Vol. 12 No. 2, pp. 293-310.
Bailey, M. (2001), Credit Scoring: The Principles and Practicalities, White Box Publishing, Bristol.
Banasik, J., Crook, J. and Thomas, L. (2003), “Sample selection bias in credit scoring models”,
Journal of the Operational Research Society, Vol. 54 No. 8, pp. 822-32.
Bishop, C.M. (1995), Neural Networks for Pattern Recognition, Oxford University Press,
New York, NY.
CBE (2006/2007), “Banking developments”, Economic Review, Vol. 47 No. 3, pp. 32-56.
CBE (2007/2008), “Structure of the Egyptian banking system”, Economic Review, Vol. 48 No. 3,
p. 93.
Chen, M. and Huang, S. (2003), “Credit scoring and rejected instances reassigning through
evolutionary computation techniques”, Expert Systems with Applications, Vol. 24 No. 4,
pp. 433-41.
Crook, J., Edelman, D. and Thomas, L. (2007), “Recent developments in consumer credit risk
assessment”, European Journal of Operational Research, Vol. 183 No. 3, pp. 1447-65.
JRF Desai, V.S., Crook, J.N. and Overstreet, G.A. (1996), “A comparison of neural networks and linear
scoring models in the credit union environment”, European Journal of Operational
10,1 Research, Vol. 95 No. 1, pp. 24-37.
Desai, V.S., Conway, D.G., Crook, J.N. and Overstreet, G.A. (1997), “Credit scoring models in the
credit union environment using neural networks and genetic algorithms”, IMA Journal of
Mathematics Applied in Business and Industry, Vol. 8 No. 4, pp. 323-3463.
50 Dimla, D.E. and Lister, P.M. (2000), “On-line metal cutting tool condition monitoring: II: tool-state
classification using multi-layer perceptron neural networks”, International Journal of
Machine Tools & Manufacture, Vol. 40 No. 5, pp. 769-81.
Erbas, B. and Stefanou, S. (2008), “An application of neural networks in microeconomics:
input-output mapping in a power generation subsector of the US electricity industry”,
Expert Systems with Applications, Vol. 36 No. 2, pp. 2317-26.
Fisher, R.A. (1936), “The use of multiple measurements in taxonomic problems”, Annals of
Eugenics, Vol. 7 No. 2, pp. 179-88.
Ganchev, T., Tasoulis, D., Varhatis, M. and Fakotakis, N. (2007), “Generalized locally recurrent
probabilistic neural network with application to text-independent speaker verification”,
Neurocomputing, Vol. 70 Nos 7/9, pp. 1424-38.
Greene, W. (1998), “Sample selection in credit-scoring models”, Japan and the World Economy,
Vol. 10 No. 3, pp. 299-316.
Hand, D.J. and Jacka, S.D. (1998), Statistics in Finance, Arnold Applications of Statistics, London.
Hand, D.J., Sohn, S.Y. and Kim, Y. (2005), “Optimal bipartite scorecards”, Expert Systems with
Applications, Vol. 29 No. 3, pp. 684-90.
Lee, T. and Chen, I. (2005), “A two-stage hybrid credit scoring model using artificial neural
networks and multivariate adaptive regression splines”, Expert Systems with Applications,
Vol. 28 No. 4, pp. 743-52.
Lee, T., Chiu, C., Lu, C. and Chen, I. (2002), “Credit scoring using the hybrid neural discriminant
technique”, Expert Systems with Applications, Vol. 23 No. 3, pp. 245-54.
Liang, Q. (2003), “Corporate financial distress diagnosis in china: empirical analysis using credit
scoring models”, Hitotsubashi Journal of Commerce and Management, Vol. 38 No. 1,
pp. 13-28.
Malhotra, R. and Malhotra, D.K. (2003), “Evaluating consumer loans using neural networks”,
Omega the International Journal of Management Science, Vol. 31 No. 2, pp. 83-96.
Masters, T. (1995), Advanced Algorithms for Neural Networks: ACþ þ Sourcebook, Wiley,
New York, NY.
Min, J.H. and Lee, Y-C. (2008), “A practical approach to credit scoring”, Expert Systems with
Applications, Vol. 35 No. 4, pp. 1762-70.
Nur Ozkan-Gunay, E. and Ozkan, M. (2007), “Prediction of bank failures in emerging financial
markets: an ANN approach”, Journal of Risk Finance, Vol. 8 No. 5, pp. 465-80.
Oldham, M. and Young, M. (2004), “Egypt: difficult times remains for the banking sector”,
Special Report, FitchRatings, New York, NY.
Ong, C., Huang, J. and Tzeng, G. (2005), “Building credit scoring models using genetic
programming”, Expert Systems with Applications, Vol. 29 No. 1, pp. 41-7.
Orgler, Y.E. (1971), “Evaluation of bank consumer loans with credit scoring models”, Journal of
Bank Research, Vol. 2 No. 1, pp. 31-7.
Palisade Corporation (2005), Neural Tools: Neural Networks Add-in for Microsoft Excel,
Version 1.0, Palisade Corporation, New York, NY.
Reed, R.D. and Marks, R.J. (1999), Neural Smithing: Supervised Learning in Feedforward Alternative
Artificial Neural Networks, The MIT Press, London.
Sarlija, N., Bensic, M. and Bohacek, Z. (2004), “Multinomial model in consumer credit scoring”,
scoring models
paper presented at the 10th International Conference on Operational Research, Trogir.
Seow, H. and Thomas, L.C. (2006), “Using adaptive learning in credit scoring to estimate take-up
probability distribution”, European Journal of Operational Research, Vol. 173 No. 3,
pp. 880-92. 51
Siddiqi, N. (2006), Credit Risk Scorecards: Developing and Implementing Intelligent Credit Scoring,
Wiley, Hoboken, NJ.
Thomas, L.C., Edelman, D.B. and Crook, L.N. (2002), Credit Scoring and Its Applications, Society
for Industrial and Applied Mathematics, Philadelphia, PA.
Tsai, C. and Wu, J. (2008), “Using neural network ensembles for bankruptcy prediction and credit
scoring”, Expert Systems with Applications, Vol. 34 No. 4, pp. 2639-49.
West, D. (2000), “Neural network credit scoring models”, Computers & Operations Research,
Vol. 27 Nos 11/12, pp. 1131-52.
Yu, L., Wang, S. and Lai, K. (2007), “An intelligent-agent-based fuzzy group decision making
model for financial multicriteria decision support: the case of credit scoring”, European
Journal of Operational Research, July.
Zekic-Susac, M., Sarlija, N. and Bensic, M. (2004), “Small business credit scoring: a comparison of
logistic regression, neural networks, and decision tree models”, paper presented at the
26th International Conference on Information Technology Interfaces, Dubrovnik.
Zhang, G., Hu, M.Y., Patuwo, B.E. and Indro, D.C. (1999), “Artificial neural networks in
bankruptcy prediction: general framework and cross-validation analysis”, European
Journal of Operational Research, Vol. 116 No. 1, pp. 16-32.
Zhang, Y. and Bhattacharyya, S. (2004), “Genetic programming in classifying large-scale data:
an ensemble method”, Information Sciences, Vol. 163 Nos 1/3, pp. 85-101.