Professional Documents
Culture Documents
1
Performance evaluation
2
Introduction
Context
3
Introduction
Context
4
True and working pure premiums
Regression function
5
True and working pure premiums
Technical assumptions
• All predictors π(X ) under consideration and µ(X ) are continuous random
variables.
• A predictor π(X ) is supposed to be correct on average, that is,
6
True and working pure premiums
Notation
7
True and working pure premiums
Convex order
• The more π(X ) is dispersed, the more information it contains about the true
premium.
- The constant predictor π(X ) = E[Y ], the least dispersed one, does not bring
any information about the relative riskiness of the different policies.
• Definition :
Consider two non-negative random variables Z1 and Z2 . Then, Z1 is said to
be smaller than Z2 in the convex order, henceforth denoted as Z1 cx Z2 , if
E[g(Z1 )] ≤ E[g(Z2 )]
for all the convex functions g for which the expectations exist.
8
True and working pure premiums
Convex order
• We have
Z1 cx Z2 ⇒ V[Z1 ] ≤ V[Z2 ].
⇒ cx is a variability order : it only applies to random variables with the
same expected value and compares the dispersion of these variables.
• We can interpret Z1 cx Z2 as “Z2 is more variable than Z1 ”.
- The variability in question extends beyond the simple comparison of standard
deviation.
9
Performance curves
Concentration curve
• Definition :
The concentration curve of the true premium µ(X ) with respect to the
working premium π(X ) is defined as
• Interpretation
:
CC µ(X ), π(X ); α represents the proportion of the total true premium
income corresponding to the sub-portfolio π(X ) ≤ Fπ−1 (α), i.e. to the
100α% of contracts with the smallest premium π.
10
Performance curves
Concentration curve
• Idea :
Policies with low-risk profiles are at risk of leaving the portfolio, being
attracted by a competitor.
- It is therefore important not to over-charge this group of policyholders.
- Hence the importance of the concentration curve to assess the appropriateness
of the premium π.
11
Performance curves
• Notice that
π(X ) ≤ Fπ−1 (α) ⇔ Fπ π(X ) ≤ α.
12
Performance curves
Lorenz curve
• Definition :
The Lorenz curve LC associated with the predictor π(X ) is defined as
• Interpretation :
A Lorenz curve is thus strictly related to dispersion (or variability) by
definition.
- It is known that increasing the predictor π(X ) in the convex order moves its
Lorenz curve lower.
13
Performance curves
14
Performance curves
15
Performance curves
• We can equivalently replace the pure premium µ(X ) with the response Y in
the concentration curve.
• Assuming the samples (Yi , X i ), i = 1, . . . , n, to be iid, the concentration
curve can be estimated as follows :
CC
c µ(X ), π(X ); α = CC[Y c , π(X ); α]
1 X
= Yi
nY bπ−1 (α)
i|b
π (X i )≤F
P
i|b
π (X i )≤F b −1 (α) Yi
= Pn π .
i=1 Yi
• CC
c expresses the total sub-portfolio loss in relative terms, as a percentage of
the aggregate loss at the entire portfolio level.
16
Performance curves
• LC
c expresses the percentage of the total premium income corresponding to
the 100α% smaller premiums when the latter are computed using a predictor
π.
17
Performance curves
Properties
18
Performance curves
Properties
• Line of independence/equality :
- If Y and π(X ) are independent then
- If π(X ) brings a lot of information about the true premium µ(X ), then the
concentration curve should be far from the line of independence.
19
Performance curves
Properties
• Line of independence/equality :
- Proposition :
If µ(X ) is positively expectation dependent on π(X ), that is, if the inequality
E[µ(X )] ≥ E µ(X )π(X ) ≤ t
Proof :
It suffices to write
E µ(X )I[π(X ) ≤ t] P[π(X ) ≤ t]E µ(X )π(X ) ≤ t
=
E[Y ] E[Y ]
≤ P[π(X ) ≤ t].
20
Performance curves
Properties
• Convexity :
- Proposition :
The concentration curve α 7→ CC[µ(X ), π(X ); α] is convex if, and only if,
µ(X ) is positively regression dependent on π(X ), that is, if the function
t 7→ E µ(X )π(X ) = t
is non-decreasing.
- The increments of the function
E Y I[Fπ−1 (α) < π(X ) ≤ Fπ−1 (α + ∆)]
CC[Y , π(X ); α + ∆] − CC[Y , π(X ); α] =
E[Y ]
21
Performance curves
Measuring goodness-of-lift
• Performance of a predictor :
- The performances of a predictor π(X ) is assessed by means of the respective
positions of the two curves
- As the total expected income of π and µ match the total expected loss, the
two ratios are directly comparable.
- As actuaries, we would like that the graph of CC is as close as possible to the
graph of LC.
⇒ The smaller the area between the two curves the better.
22
Comparison of the performances of two predictors
CC[µ(X ), π1 (X 1 ); α] ≤ CC[µ(X ), π2 (X 2 ); α]
23
Comparison of the performances of two predictors
• Proposition :
If
π2 (X 2 ) cx π1 (X 1 ) and (Y , Π2 ) conc (Y , Π1 )
then predictor π1 (X 1 ) is more discriminatory than predictor π2 (X 2 ) for
response Y .
• π1 (X 1 ) is more discriminatory than π2 (X 2 ) if π1 (X 1 ) is simultaneously
more variable in the sense of the convex order) and more correlated (in the
sense of the concordance order) with the response Y than π2 (X 2 ).
24
Comparison of the performances of two predictors
where
α α2
Z
E (α − Π)+ = (α − ξ)dξ = .
0 2
25
Comparison of the performances of two predictors
• Again, as
E Y (α − Π)+ = E E[Y (α − Π)+ |X ]
= E µ(X )(α − Π)+
26
Comparison of the performances of two predictors
• The area between the two curves CC and LC turns out to be a better
performance indicator.
• This area between the curves, ABC in short, is given by
Z 1
ABC[π(X )] = CC[Y , π(X ); α] − LC[π(X ); α] dα
0
1
Z 1
= E Y I[Π ≤ α] − E π(X )I[Π ≤ α] dα
E[π(X )] 0
Z 1Z ∞
1
= P π(X ) ≤ y, Π ≤ α] − P Y ≤ y, Π ≤ α] dydα
E[π(X )] 0 0
1
= Cov π(X ), Π − Cov Y , Π .
E[π(X )]
27
Numerical examples
Assumptions
- Frank’s copula :
1 (exp(−θu) − 1)(exp(−θv ) − 1)
Cθ (u, v ) = − ln 1 + , θ 6= 0.
θ exp(−θ) − 1
28
Numerical examples
Variability
29
Numerical examples
Dependence
30
Numerical examples
Crossing copulas
• Consider a Clayton copula C1 and a Frank copula C2 .
• There exists a function f such that C1 (u, v ) − C2 (u, v ) ≤ 0 if v ≤ f (u) and
C1 (u, v ) − C2 (u, v ) ≥ 0 if v ≥ f (u).
⇒ Not ordered according to the concordance order.
Line type π(X ) µ(X ) C ABC
short dash Gam(1, 1) Gam(1, 1) Frank(τ = 0.5) 7.79%
dotted Gam(1, 1) Gam(1, 1) Clayton(τ = 0.5) 9.66%
31
Numerical examples
32
Case study
Data set
33
Case study
Models
34
Case study
35
Case study
Goodness-of-lift metrics
36
Case study
Goodness-of-lift metrics
37
References
References
Denuit, M., Sznajder, D., Trufin, J. (2019).
Model selection based on Lorenz and concentration curves, Gini indices and convex order.
Insurance : Mathematics and Economics 89, 128-139.
Frees, E.W., Meyers, G., Cummings, A.D. (2011).
Summarizing insurance scores using a Gini index.
Journal of the American Statistical Association 106, 1085-1098.
Frees, E.W., Meyers, G., Cummings, A.D. (2013).
Insurance ratemaking and a Gini index.
Journal of Risk and Insurance 81, 335-366.
Gourieroux, C. (1992).
Courbes de performance, de sélection et de discrimination.
Annales d’Économie et de Statistique 28, 107-123.
Gourieroux, C., Jasiak, J. (2011).
The Econometrics of Individual Risk : Credit, Insurance, and Marketing.
Princeton University Press.
38