Professional Documents
Culture Documents
net/publication/220636600
CITATIONS READS
16 1,632
2 authors:
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Paolo Giudici on 31 May 2014.
www.palgrave-journals.com/jors/
Keywords: predictive models; Bayesian merging; probability of default; parametric models; survival analysis;
model selection
implies that the hazards ratio is constant over time, for two expert judgment with empirical data in a two-step approach.
SMEs provided that the covariates do not change. In more Considering the problem at hand, expert judgment are repre-
detail, there are n SMEs and, associated with the i th SME (i = sented by qualitative data, while empirical data are derived
1, . . . , n) there is a survival time t and a fixed censoring time from quantitative information.
ci . The t times are assumed to be independent and identically More precisely, following the predictive models described
distributed with density f (t) and survival function S(t). The in Section 2, for each SME we observe i = 1, . . . , n, a score
exact survival time t of a SME will be observed only if t c j . i , derived from quantitative information and a score i∗ i =
Data in this framework can be represented by the n pairs of 1, . . . , n derived from qualitative information.
the random variables (yi , vi ), where: Our proposal is based on the following steps:
A similar likelihood function holds for a set of right 6. The final probability of default for each SME is a linear
censored qualitative data on n SMEs: combination of and ∗ weighted by
i computed as:
P D i =
i i + (1 −
i )i∗ , i = 1, . . . , n.
n
[h 0 (yi ) exp(i∗ )]i (S0 (yi )exp(i ) )
∗
L(∗ , h 0 (t)|D ∗ ) ∝ We remark that the data indicator
i must satisfy suitable
i=1 regularity conditions: 2 (i∗ ) + 2 (i ) = 0, 2 (i ) < ∞ and
n
n 2 (i∗ ) < ∞.
∗ i ∗
= [h 0 (yi ) exp(i )] exp − exp(i )Ho (yi ) , The resulting P D i , in step 6, can be interpreted as a Bayesian
i=1 i=1 before posterior probability for the models proposed (see eg
A very important remark is that Cox model generates Bernardo and Smith, 1994). For example, let yi indicate the
survival functions that are adjusted for covariate values. default variable, where yi = 0 means good SME and yi = 1
More precisely, it is possible to derive the probability that a means bad SME. We can assume that the yi are i.i.d distributed
company may default in a given year (T = ti ), conditional according to a Bernoulli probability distribution function:
upon it was solvent up to that point in time on the basis p(yi |) = yi (1 − )1−yi , ∈ (0, 1). (5)
of quantitative and qualitative variables. More precisely,
∗
Si (X 1i , . . . , X ki |T = t) and Si (X 1i , . . . , X ∗pi |T = t) can be We assume that the prior distribution of is:
∗
viewed as i and i , respectively.
1
h() = 0 −1 (1 − )0 −1 ,
B(0 , 0 )
3. Merging model: a proposal
In this section we describe how to combine the quantitative (0 )(0 )
B(0 , 0 ) = , (6)
and qualitative risk scores i and i∗ for the n SMEs at hand. (0 + 0 )
Our proposal is well justified in the Bayesian paradigm. In
general, the Bayesian framework provides a unified and intu- it follows that the posterior probability distribution of is:
itively appealing approach to the problem of drawing infer- yi +0 −1 yi +0 −1
h(|y) ∝ (1 − )n− , (7)
ences from observations. Bayesian statistics views statistical
inference as a problem in belief dynamics and it uses evidence where y is the n × 1 vector of the observed default (yi = 0
about a phenomenon to revise and update the knowledge about means good SME, yi = 1 means bad SME).
it. Following the Bayesian theory (see eg Bernardo and Smith, Such posterior corresponds
n to a Beta(,
) probability
1994), it is a scientifically justifiable to integrate informed distribution, with = i=1 Yi + 0 , = n − i=1
n
Yi + 0 ).
1070 Journal of the Operational Research Society Vol. 62, No. 6
n
Setting i=1 yi = k, where k is the observed frequency of confidence intervals can be calculated with the percentile
default, the posterior expectation of is: method (Hosmer and Lemeshow, 2000). Furthermore, statis-
tical tests based on the
2 statistics can be useful to compare
k + 0 n k different models on the basis of the AUC (see eg DeLong
E(|y) = = ·
n + 0 + 0 n + 0 + 0 n et al, 1988).
0 + 0 0
+ . (8)
n + 0 + 0 0 + 0 5. Application
If we set
i = n+n+ , i = nk and i∗ = ( +0 ), we obtain Our empirical analysis is based on annual 1999 to 2004 data
0 0 0 0
Equation (7) in the special case in which the scores i and from Creditreform, one of the major rating agencies for SMEs
i∗ are constant among SMEs. in Germany, covering 1003 firms belonging to 352 different
A further extension considers the multinomial probability business sectors.
distribution with a Dirichlet prior (see eg Bernardo and Smith, When handling bankruptcy data it is natural to label one
1994). of the categories as success (healthy) or failure (default) and
to assign them the values 0 and 1, respectively. Our data
4. Model selection and evaluation set consists of a binary response variable (solvency) Yit and
a set of explanatory variables: X 1it , . . . , X kit , quantitative
∗
In order to compare different models, the empirical literature financial ratios and X 1it , . . . , X ∗pit qualitative features. The
typically uses criteria based on statistical tests (see Burnham sample size available is composed of about 1000 SMEs.
and Anderson, 1998), criteria based on scoring functions (see Considering the quantitative variables, the balance sheet is
Akaike, 1973; Schwarz, 1978; Vapnik, 1998), computational divided into two parts which, based on the following equation,
criteria (see eg Hastie et al, 2001) and criteria based on loss must equal (or balance out) each other: Assets = liabilities +
functions (see eg Kohavi and John, 1997). equity. This means that assets, or the means used to operate the
Considering the problem at hand, a clear comparison company, are balanced by a company’s financial obligations
between the models described in Section 2 can be derived along with the equity investment brought into the company
using the confusion matrix (see eg Kohavi and John, 1997). and its retained earnings.
Table 1 reports a theoretical confusion matrix containing Given this understanding of our balance sheet data and
the number of elements that have been correctly or incorrectly how it is constructed, we can discuss some techniques used to
classified for each class. analyze the information therein contained. This is mainly done
In the context of our study, for a given cut off p (0 < p < 1), through financial ratio analysis. Financial ratio analysis uses
the entries in the confusion matrix have the following formulas to gain insight into a company and its operations.
meaning: a true positive for example, a bad SME is clas- Based on the balance sheet and using financial ratios (like the
sified as bad; c a false positive for example, a good SME debt-to-equity ratio), it can give a better idea of a company’s
is classified as bad; d a true negative for example, a good financial condition along with its operational efficiency. It is
SME is classified as good; b a false negative for example, important to note that some ratios will need information from
a bad SME is classified as good. In principle, each of these more than one financial statement, such as the balance sheet
outcomes would have some associated loss or reward. and the income statement.
The cut-off point can be selected taking into account the The main types of ratios that use information from the
false-negative classification and the a priori incidence of the balance sheet are financial strength ratios and activity ratios.
target variable (P opt) or the value at which sensitivity and Financial strength ratios, such as the debt-to-equity ratio,
specificity are equal (P fair). It is possible also to derive a provide information on how well the company can meet
cut off, maximising the statistic Kappa (see eg Cohen, 1960). its obligations and how they are leveraged. This can give
A related instrument to validate the performance of a investors an idea of how financially stable a company is and
predictive model for probabilities is the Receiver Oper- how the company finances itself. Activity ratios focus mainly
ating Characterisitcs (ROC) curve (Hastie et al, 2001; on current accounts to show how well a company manages
Figini and Giudici, 2009). A recommended index of accu- its operating cycle. These ratios can provide insight into the
racy associated with a ROC curve is the Area Under company’s operational efficiency.
the Curve (AUC), as a threshold-independent measure of There is a wide range of individual financial ratios that
predictive performance. For such measure, bootstrapped Creditreform uses to learn more about a company. Given our
available dataset, we computed a set of 11 financial ratios
Table 1 Theoretical confusion matrix suggested by Creditreform based on its experience:
Observed/Predicted ŷi = 1 ŷi = 0
• Supplier target days: it is a temporal measure of financial
yi = 1 a b
yi = 0 c d
sustainability expressed in days that considers all short and
medium term debts as well as other payables.
S Figini and P Giudici—Statistical merging of rating models 1071
• Outside capital structure: this ratio evaluates a firm’s capa- Creditreform business experts of:
bility to receive forms of financing other than banks’ loans.
• Cash ratio this ratio indicates the cash a company can • KdtUrt: this information is relevant in order to define if the
generate in relation to its size. business relationship between an SME and Creditreform is
• Capital tied up: this ratio evaluates the turnover of short acceptable or is not recommended. This feature is based on
term debts with respect to sales. past credit decisions and it shows a value 0 if the relation-
• Equity ratio: it measures a company’s financial leverage ship is acceptable and 1 otherwise.
calculated by dividing a particular measure of equity by • ZwsUrt: this variable summarises the payment history for
the firm’s total assets. each SME. The levels are 0 if the payment is within time
• Cash flow to effective debt: this ratio indicates the cash a and 1 if irregular payments are present. Furthermore textual
company can generate in relation to its size and debts. descriptions highlight reminders to encourage payment for
• Cost income ratio: the cost income ratio is an efficiency each SME.
measure similar to the operating margin one which is useful • Entw: this variable describes information on the company
to measure how costs are changing compared to income. is development level. A level equal to 2 or 1 or 0 means
• Trade payable ratio: this ratio reveals how often the firm respectively a positive company development, a stagnating
payables turn over during the year; a high ratio means a rela- company development and a declining company develop-
tively short time between purchase of goods and services ment.
and their payment; a low ratio may be a sign that the • Auft: it is a categorical variable that reports the order situa-
company has chronic cash shortages. tion for each SME. If the order situation is good, the vari-
• Liabilities ratio: it is a measure of a company’s financial able is equal to 2; if the order situation is declining or bad
leverage calcu lated by dividing a gross measure of long- it is equal to 0 or 1.
term debt by the firm’s assets; also it highlights what debt • AnzMta: this feature is a grouped variable derived from a
proportion the company is using to finance its assets. quantitative information. We have computed three groups
• Result ratio: this is an index of how profitable a company is composed of different number of employees in relation to
relative to its total assets; it gives an idea as to how efficient special company structures.
management is at using its assets to generate earnings. • We remark that all of the previous variables are true expert
• Liquidity ratio: This ratio measures the extent to which opinions, although most of them are based on available
a firm can quickly liquidate assets and cover short-term objective information recorded in a subjective way.
liabilities. It is therefore of interest to short-term creditors.
5.1. Exploratory analysis
Furthermore, we considered the following additional annual
account positions, which were standardised in order to avoid In this section we report univariate statistical measures based
computational problems with the previous ratios: on variability and tendency for the quantitative financial
ratios and exploratory heterogeneity indexes for the qualita-
• Total assets: it is the sum of current and long-term assets tive features.
owned by a firm. The results derived on the qualitative features are
• Total equity: It refers to total assets minus total liabilities, summarised in Table 2. Table 2 reports classical heterogeneity
and it is also referred to as equity or net worth or book indexes (such as Gini and the entropy) for the qualitative
value. variables considered for good SMEs (Solvency = 0) and bad
• Total liabilities: it includes all the current liabilities, SMEs (Solvency = 1), respectively.
long term debt, and any other miscellaneous liabilities a In order to derive information about the relationship
company may have. between each variable and the solvency, bivariate explo-
• Net income: this is equal to the income that a firm has after rative analyses on the data available are reported in Tables
subtracting costs and expenses from the total revenue. 3 and 4. Table 3 shows the pairwise correlations computed
between each quantitative variable and a quantitative variable
As pointed out in Section 2, we have derived the following provided by Creditreform to measure the solvency (index
categorical variables on the basis of opinions expressed by for creditworthiness). Based on the p-value, the significant
Table 3 Quantitative risk factors: discriminant power Table 7 Quantitative data results: chosen risk factors (SDM)
Variable Coefficient of p-value Quantitative variables Estimate Std. Error p-value
Correlation
Supplier target days −0.017 0.6073 Result ratio 1.088 0.389 < 0.0001
Outside capital structure 0.169 < 0.0001 Supplier target days 0.156 0.051 < 0.0001
Cash ratio −0.121 0.0002 Cost income ratio −0.360 0.181 < 0.0001
Capital tied up 0.032 0.3299 Liabilities ratio −1.167 0.211 < 0.0001
Equity ratio −0.146 < 0.0001
Cash flow to effective debt −0.096 0.0036
Cost income ratio 0.021 0.5306
Trade payable ratio 0.096 0.0034 Table 8 Qualitative data results: chosen risk factors (SDM)
Liabilities ratio 0.287 < 0.0001 Qualitative variables Estimate Std. Error p-value
Result ratio −0.221 < 0.0001
Liquidity ratio −0.116 0.0004 ZwsUrt 0.515 0.1749 0.0032
We have compared the results achieved using single models DeLong E, DeLong D and Clarke-Pearson D (1988). Comparing the
and merged models in terms of model performance and areas under two or more correlated receiver operating characteristic
predictive capability. curves: A nonparametric approach. Biometrics 44: 837–845.
Dobson AJ (2002). Introduction to Generalised Linear Model.
Empirical evidences collected on a real data source show Chapman Hall: London.
that the merged longitudinal predictive model performs better Duffie D and Singleton KJ (1997). An econometric model of the term
than survival duration models. On the basis of the data at structure of interest-rate swap yields. J Financ 52: 1287–1322.
hand the LPM derived using our proposal for data merging Figini S and Giudici P (2009). Applied Data Mining for Business and
shows a clear superiority in terms of predictive performance. Industry. Wiley: London.
Fuertes AM and Kalotychou E (2006). Early warning systems for
In contrast, in our opinion Cox proportional hazard model sovereign debt crises: The role of heterogeneity. Comput Stat Data
employed in SDM models shows some weaknesses for default An 51: 1420–1441.
modelling because assumes that the event times are indepen- Granger C and Pesaran M (2000). Economic and statistical measures
dent and the risk is proportional across the SMEs. We believe of forecast accuracy. J Forecasting 19: 537–560.
that our proposed model can be a good starting point for more Hastie T, Tibshirani R and Friedman JH (2001). The Elements
of Statistical Learning: Data Mining, Inference, and Prediction.
accurate risk estimation. Springer Verlag: New York.
Hosmer DW and Lemeshow S (2000). Applied Logistic Regression.
Wiley: New York.
Acknowledgements — This work has been supported by MUSING 2006
contract number 027097, 2006 to 2010. The paper is the result of a Kohavi R and John G (1997). Wrappers for feature subset selection.
collaboration between the authors, however it has been written by Silvia Artif Int 97: 273–324.
Figini. Ku LW, Liang Y-T, and Chen H-H (2006). Opinion extraction,
summarization and tracking in news and blog Corpora. In:
Proceedings of AAAI-2006 Spring Symposium on Computational
References Approaches to Analyzing Weblogs.
Li Y, Bontcheva K and Cunningham H (2004). An SVM based learning
Abrahams CR and Zhang M (2009). Credit Risk Assessment: The algorithm for information extraction. Machine Learning Workshop.
New Lending System for Borrowers, Lenders and Investors. Wiley: Sheffield.
Chichester, NY. Lin W, Wilson T, Wiebe J and Hauptmann A (2006). Which side
Abrahams CR and Zhang M (2008). Fair Lending Compliance: are you on? Identifying perspectives at the document and sentence
Intelligence and Implications for Credit Risk Management. Wiley: Levels. In: Proceedings of the Tenth Conference on Computational
Chichester, NY. Natural Language Learning (CoNLL-X), June, New York. AAAI
Akaike H (1973). Information theory and an extension of the Press, Menlo Park, CA, pp 109–116.
maximum likelihood principle. Second International Symposium Merton RC (1974). On the pricing of corporate debt: The risk structure
on Information Theory, pp. 267–281. of interest rates. J Finance 29: 449–470.
Altman E and Sabato G (2006). Modelling credit risk for SMEs: Riloff E, Patwardhan S and Wiebe J (2006). Feature subsumption
Evidence from the US market. ABACUS 19(6): 716–723. for opinion analysis. In: Proceedings of the Conference on
Bernardo J and Smith A (1994). Bayesian Theory. Wiley: London. Empirical Methods in Natural Language Processing (EMNLP-06).
Burnham KP and Anderson DR (1998). Model Selection and Available at ACL Anthology, a digital archive of research pages
Inference: A Practical Information—Theoretic Approach. Springer- in computational linguistics.
Verlag: New York. Schwarz G (1978). Estimating the dimension of a model. Ann Stat 6:
Choi Y, Cardie C, Riloff E and Patwardhan S (2005). Identifying 461–464.
sources of opinions with conditional random fields and extraction Vapnik V (1998). Statistical Learning Theory. Wiley: New York
patterns. In: Proceedings of the 2005 Human Language Technology (Chichester, UK).
Conference/Conference on Empirical Methods in Natural Language Webb A (2003). Statistical Pattern Recognition, 2nd edn. Wiley:
Processing. Available at ACL Anthology, a digital archive of Chichester, NY.
research pages in computational linguistics. Wiebe J, Wilson T and Cardie C (2005). Annotating expressions of
Cohen J (1960). A coefficient of agreement for nominal scales. Educ opinions and emotions in language. Lang Resour Eval 39(2–3):
Psychol Meas 20: 3746. 165210.
Crook J, Edelman D and Thomas L (2006). Recent developments
in consumer credit risk assessment. Eur J Opl Res 183: Received April 2009;
1569–1581. accepted February 2010 after two revisions