Professional Documents
Culture Documents
Support Vector Machines For Credit Scoring
Support Vector Machines For Credit Scoring
MACHINES FOR
CREDIT SCORING
Michal Haltuf
Diploma thesis
Binary classification
Explanatory variables
+ Kernel trick
Euclidean distance
( ⋅ + )
=
| |
= minimization of
= minimization of
subject to ⋅ + ≥ 1, ∀
Lagrange function
Primal optimization problem:
∗
= min max *( , +)
$;&' ()
s.t.
1
0+ =0
2
+ ≥ 0, ∀
Support vectors
∗
= ∑12 + ∗
78 = 9:;(∑12 + ∗ ⋅ + ∗)
⇨ SUPPORT VECTORS
Credits: Yifan Peng, 2013
Kernel function
Some data are not linearly separable
SOLUTION:
Choose a suitable mapping function =( )
Map vectors from the original input space to the
higher-dimensional feature space
Find linear separating hyperplane in the feature
space
Cover’s theorem
Credits: Eric Kim, 2013
, →[ , , + ]
Credits: Eric Kim, 2013
Credits: Eric Kim, 2013
Kernel trick
Our solution depends only on the dot products
WHY?
C(;D ) → C(;) problem
I I I
HE H H E ⋅E
exp − = exp exp
JI JI JI
L M
Apply Taylor expansion KL = ∑O
42) 4!
I I
In ℝ1 ... = exp H
JI
H E
1⋅1+
JI !
⋅
JI !
E+
JQ !
⋅
JQ !
E+⋯
W
− 1 1 1
= = exp 1; ;
T
;
U
;…
2S S 1! S 1! S 1!
SOLUTION:
allow some data to appear on the „wrong“ side of
hyperplane – Slack variable.
Constraint:
y ⋅ + ≥1−Y, Y ≥0
Credits: Stephen Cronin, 2010
Soft margin
Adjust objective function by Cost parameter
1
1
min + Z0Y
2
2
Different approaches
Binning
Platt’s scaling
1
[ =17 =
1 + exp {\7 + ]}
Binning - Obtaining Calibrated Probability Estimates from Support Vector Machines, J. Drish, 2001.
Platt’s scaling - Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods, J. C. Platt, 1999.
Implementation
Quadratic programming – matrix of size n2
2932 observations
656 defaults
37 explanatory variables
Find optimal
parameters (C,γ)
Performs better
than LR – potential
non-linearities
in data
Model comparison
0,8
0,75
AUC
0,7 LR
SVM-L
SVM-R
0,65
0,6
0,55
0 5 10 15 20
Number of variables
Quantifying the edge
Backtesting using 3 different strategies
Random diversification
Naive scoring
Quantitative model (LR, SVM-RBF)