Mack 1994
Mack 1994
by Thomas h4ack
MEASURING THE VARIABILITY
OF CHAIN LADDER RESERVE ESTIMATES
Abstract:
The variability of chain ladder reserve estimates is quantified
without assuming any specific claims amount distribution
function. This is done by establishing a formula for the so-
called standard error which is an estimate for the standard
deviation of the outstanding claims reserve. The information
necessary for this purpose is extracted only from the usual
chain ladder formulae. With the standard error as decisive tool
it is shown how a confidence interval for the outstanding claims
reserve and for the ultimate claims amount can be constructed.
Moreover, the analysis of the information extracted and of its
implications shows when it is appropriate to apply the chain
ladder method and when not.
102
1. Introduction and Overview
The chain ladder method is probably the most popular method for
estimating outstanding claims reserves. The main reason for this
is its simplicity and the fact that it is distribution-free,
i.e. that it seems to be based on almost no assumptions. In this
103
assumption underlying the chain ladder method is derived from
the formula used to estimate the ultimate claims amount. In
Chapter 3, the comparison of the age-to-age factor formula used
by the chain ladder method with other possibilities leads to a
second underlying assumption regarding the variance of the
claims amounts. Using both of these derived assumptions and a
third assumption on the independence of the accident years, it
is possible to calculate the so-called standard error of the
estimated ultimate claims amount. This is done in Chapter 4
where it is also shown that this standard error is the
appropriate measure of variability for the construction of a
confidence interval. Chapter 5 illustrates how any given run-off
triangle can be checked using some plots to ascertain whether
the assumptions mentioned can be considered to be met. If these
plots show that the assumptions do not seem to be met, the chain
ladder method should not be applied. In Chapter 6 all formulae
and instruments established including two statistical tests set
out in Appendices G and H are applied to a numerical example.
For the sake of comparison, the reserves and standard errors
according to a well-known claims reserving software package are
also quoted. Complete and detailed proofs of all results and
formulae are given in the Appendices A - F.
The proofs are not very short and take up about one fifth of the
paper. But the resulting formula (7) for the standard error is
very simple and can be applied directly after reading the basic
notations (1) and (2) in the first two paragraphs of the next
104
chapter. In the numerical example, too, we could have applied
formula (7) for the standard error immediately after the
completion of the run-off triangle. But we prefer to first carry
through the analysis of whether the chain ladder assumptions are
met in this particular case as this analysis generally should be
made first. Because this analysis comprises many tables and
plots, the example takes up another two fifths of~the paper
(including the tests in Appendices G and Ii).
Ri = ci1 - =i,I+l-i
is the outstanding claims reserve of accident year i as Ci I+l-i
,
has already been paid or incurred up to now.
105
I-k
(2) fk = c 'j,k+l 1 :f: 'jk r 1 2 k 6 I-l,
j=l
are the so-called age-to-age factors.
106
variables Cik whose values are assumed to be known. If nothing
iS indicated all Cik are assumed to be unknown.
What we said above regarding the increase from Cik to Ci k+l can
I
now be formulated in stochastic terms as follows: The chain
ladder method assumes the existence of accident-year-independent
factors fl, . . . . fIel such that, given the development Gil' ....
Cikt the realization of Ci k+l is 'close' to Cikfk, the latter
I
being the expected value of Ci k+l in its mathematical meaning,
,
i.e.
107
uses the same age-to-age factor fk for different accident years
i = 1+1-k, . . . , I. Therefore equations (3) also postulate age-
to-age parameters fk which are the same for all accident years.
The other is the fact that (1) uses only the most recent
observed value Ci,I+l-i as basis for the projection to ultimate
ignoring on the one hand all amounts Gil, . . . . Ci,I-i observed
earlier and on the other hand the fact that Ci,I+1-i could
substantially deviate from its expected value. Note that it
would easily be possible to also project to ultimate the amounts
tilt **.I ci,I-i of the earlier development years with the help
of the age-to-age factors fl, . . . . fImZ and to combine all these
projected amounts together with Ci,I+l-ify+l-i'..-'fI-1 into a
common estimator for CiI. Moreover, it would also easily be
possible to use the values Cj,I+1-i of the earlier accident
years j < i as additional estimators for E(Ci,I+I-i) by
translating them into accident year i with the help of a measure
of volume for each accident year. These possibilities are all
ignored by the chain ladder method which uses Ci I+1-i as the
,
only basis for the projection to ultimate. This means that the
chain ladder method implicitly must use an assumption which
states that the information contained in Ci I+1-i cannot be
I
augmented by additionally using Gil, . . . . Ci,I-i or CI I+l-i,
I
. . . , Ci-l,I+l-i. This is very well reflected by the equations
(3).
108
assumption which has important consequences 'and which cannot be
taken as met for every run-off triangle. Thus the widespread
impression the chain ladder method would work with almost no
E(Ci,k+l/CikICil,".,Cik) = fk
because Cik is a scalar under the condition that we know CiI,
. . . . Cik. This form of (3) shows that the expected value of the
individual development factor Ci k+l/Cik equals fk irrespective
I
of the prior development Gil, . . . . Cik and especially of the
foregoing development factor Cik/Ci,k-l. As is shown in Appendix
G, this implies that subsequent development factors Cik/Ci,k-l
and Ci,k+l/Cik are uncorrelated. This means that after a rather
high value of Cik/Ci,k-l the expected size of the next
development factor Ci,k+l/Cik is the same as after a rather low
value Of Cik/Ci,k-1. We therefore should not apply the chain
ladder method to a business where we usually observe a rather
Small increase Ci,k+l/Cik if Cik/Ci,k-1 is higher than in most
109
3. Analysis of the Ase-to-Aoe Factor Formula: the Key to
Measurina the Variability
Because the chain ladder method neither in (1) nor in (2) takes
into account any dependency between the accident years we can
conclude that the independence of the accident years is also an
implicit assumption of the chain ladder method. We will
110
in the same way and can thus distort the independence. How such
a situation can be recognized is shown in Appendix H.
because
= E(E(Cj,k+llCjl,.-*ICjk)/Cjk) (b)
= E(Cjkfk/Cjk) (cl
= E(fk)
= fk . (d)
Here equality (a) holds due to the iterative rule E(X) =
E(E(XIY)) for expectations, (b) holds because, given Cjl to cjk,
Cjk is,a scalar, (c) holds due to assumption (3) and (d) holds
because fk is a scalar. (When applying expectations iteratively,
e.g. E(E(XIY)), one first takes the conditional expectation
E(X(Y) assuming Y being known and then averages over all
possible realizations of Y.)
ill
Therefore the question arises as to why the chain ladder method
uses just fk as estimator for fk and not the simple average
1 I-k ._
~ ' 'j,k+l/'jk
I-k j=l
of the observed development factors which also would be an
unbiased estimator as is the case with any weighted average
I-k I-k
gk = jz, wjk 'j,k+l/'jk with C Wjk=l
j=l
of the observed development factors. (Here, wjk must be a scalar
if Cjl, . . . . Cjk are known.)
The fact that the chain ladder estimator fk uses weights which
are proportional to Cjk therefore means that Cjk is assumed to
be inversely proportional to Var(Cj,k+I/CjklCjI,...,Cjk), or
stated the other way around, that
var(Cj,k+l/CjklCjl,.~.,cjk) = ak2/Cjk
with a proportionality constant ak2 which may depend on k but
112
not on j and which must be non-negative because variances are
always non-negative. Since here cjk is a scalar and because
generally Var(X/c) = Var(X)/c2 for any scalar c, we can state
the above proportionality condition also in the form
As it was the case with assumptions (3) and (Q), assumption (5)
also has to be considered a basic condition implicitly
underlying the chain ladder method. Again, condition (5) cannot
a priori be assumed to be met for every run-off triangle. In
Chapter 5 we will show how to check a given triangle to see
whether (5) can be considered met or not. But before we turn to
the most important consequence of (5): Together with (3) and (4)
it namely enables us to quantify the uncertainty in the
estimation of CiI by CiI.
The aim of the chain ladder method and of every claims reserving
method is the estimation of the ultimate claims amount CiI for
the accident years i = 2, . . . . I. The chain ladder method does
this by formula (l), i.e. by
Cif = ci I I+l-i'fI+l-i'""fI-l f
This formula yields only a point estimate for CiI which will
normally turn out to be more or less wrong, i.e. there is only a
113
very small probability for CiI being equal to CiI. This
probability is even zero if CiI is considered to be a continuous
variable. We therefore want to know in addition if the estimator
114
The mean squared error is exactly the same concept which also
underlyies the notion of the variance
Var(X) = E(X - E(X))2
of any random variable X. Var(X) measures the average distance
of X from its mean value E(X).
115
CiI is at the same time the standard error s.e.(Ri) of the
reserve estimate
% = ciI - ci,I+l-i
of the outstanding claims reserve
Ri = CiI - Ci I 1+1-i
because
where
1 I-k 'j k+l
(8) ok2 = - c cjk ( I_--fk)2, 1 I k 5 I-2.
I-k-l j=l 'jk
is an unbiased estimator of ok2 (the unbiasedness being shown in
Appendix E) and
116
triangle is completed step by step according to the chain ladder
method. In (7), for notational convenience we have also set
%,1+1-i = 'i,I+l-i.
2 2
(9) ax-1 = min ( a~-2/a~-3, min(aIm3, aim21 1 .
(31 = ci,I+l-i + Ri
the ultimate claims amount CiI consists of a known part Ci I+l-i
I
and an unknown part Ri. This means that the probability
distribution function of CiI !given the observations D which
include Ci I I+l-i) is completely determined by that of Ri. We
therefore need to establish a confidence interval for Ri only
and can then simply shift it to a confidence interval for CiI.
117
For this purpose we need to know the distribution function of
Ri. Up to now we only have estimates Ri and s.e.(Ri) for the
mean and the standard deviation of this distribution. If the
volume of the outstanding claims is large enough we can, due to
the central limit theorem, assume that this distribution
function is a Normal distribution with an expected value equal
to the point estimate given by Ri and a standard deviation equal
to the standard error s.e.(Ri). A symmetric 95%-confidence
interval for Ri is then g+ven by
( Ri - a*s.e.(Ri) , Ri + 2.s.e.(Ri) ).
exP(fii + Ui2/2) = Ri ,
118
This leads to
Ui2 = ln(1 + (S.e.(Ri))2/Ri2) ,
(10)
Hi = ln(Ri) - Oi2/2 .
Now, if we want to estimate the 90th percentile of Ri, for
example, we proceed as follows. First we take the 90th
percentile of the Standard Normal distribution which is 1.28.
Then eXp(piil.28Ui) with cci and Ui2 according to (10) is the
90th percentile of the Lognormal distribution and therefore also
approximately of the distribution of Ri. For instance, if
s.e.(Ri)/Ri = 1, then oi 2 = ln(2) and the 90th percentile is
I I
= c (s-e. Will2 + Cir( Ii1 2ak2'fk2
i-2 I k=I+l-i I-k
' 'nk
n=l
120
triangle whether the chain ladder assumptions (3) and (5) are
met or not.
As has been pointed out before, the three basic implicit chain
ladder assumptions
are not met in every case. In this chapter we will indicate how
these assumptions can be checked for a given run-off triangle.
We have already mentioned in Chapter 3 that Appendix H develops
a test for calendar year influences which may violate (4). We
therefore can concentrate in the following on assumptions (3)
and (5).
121
1, . . . . I-k. Therefore, we can estimate the regression
coefficient b = fk by the usual least squares method
I-k
' (Ci,k+l - Cikfk)2 ~ minimum .
i=l
If the derivative of the left hand side with respect to fk is
set to 0 we obtain for the minimizing parameter fk the solution
I-k I-k
(12) fkO = ' 'ikCi,k+l / C Cik 2 .
i=l i=l
This is not the same estimator for fk as according to the chain
ladder formula (2). We therefore have used an additional index
'0' at this new estimator for fk. We can rewrite fko as
1-k Cik2 'i,k+l
fkO = c -.
i=l I-k
2 'ik
' 'ik
i=l
which shows that fko is the Cik2-weighted average of the
individual development factors Ci I k+l/Cik, whereas the chain
ladder estimator fk is the Cik-weighted average. In Chapter 3 we
saw that these weights are inversely proportional to the
underlying variances Var(Ci,k+l/CiklCil,...RCik).
Correspondingly, the estimator fkO assumes
122
Here we remember that indeed the least squares method implicit lY
assumes equal variances Var(Yi) = Var(ei) = u2 for all i. If
this assumption is not met, i.e. if the variances Var(Yi) =
Var(ti) depend on i, one should use a weighted least squares
approach which consists of minimizing the weighted sum of
squares
C Wi(Yi - C - Xib)’
i=l
where the weights Wi are in inverse proportion to Var(Yi).
‘k-
123
I-k I-k 'i k+l
' (Ci,k+l - Cikfk)2 / Cik2 = C ( A- fk)2 .
i=l i=l 'ik
Here the minimizing procedure yields
1 1-k Ci k+l
(13) fk2 = - c A ,
I-k i=l 'ik
which is the ordinary unweighted average of the development
factors. The variance assumption corresponding to the weights
used is
Var(Ci,k+llCil,...,Cik ) being proportional to Cik'
or equivalently
Var(Ci,k+l/CiklCilr...,Cik) being proportional to 1.
124
purely random. It is recommended to compare all three residual
plots (for i = 1, . . . . I-k)
Plot 0: ci,k+l - CikfkO against 'ik '
Plot 1: (ci,k+l - Cikfkl)r/Cik against 'ik 1
Plot 2: (ci,k+l - Cikfk2)/Cik against Cik ,
and to find out which one shows the most random behaviour. All
this should be done for every development year k for which we
have sufficient data points, say at least 6, i.e. for k S I-6.
6.
The data for the following example are taken from the
'Historical Loss Development Study', 1991 Edition, published by
the Reinsurance Association of America (RAA). There, we find on
page 96 the following run-off triangle of Automatic Facultative
12s
business in General Liability (excluding Asbestos &
Environmental):
i=l 5012 a269 10907 11805 13539 16181 18009 la&u 18442 18834
i=2 1 106 6285 5396 lo646 13782 15599 15196 16169 16704
i=3 1 3410 a992 13873 16141 18735 22214 22863 23466
id 1 5655 11555 15764 21266 23425 26083 27067
i-5 1 1092 9565 1563.6 22169 25955 26180
i=lO 1 2063
fkO 1 2.217 1.569 1.261 1.162 1.100 1.041 1.032 1.016 1.009
fkl 1 2.999 1.624 1.271 1.172 1.113 1.042 1.033 1.017 1.009
fk2 ; 8.206 1.696 1.315 1.185 1.127 1.043 1.034 1.011 1.009
126
If one has the run-off triangle on a personal computer it is'
very easy to produce the plots recommended in Chapter 5 because
most spreadsheet programs have the facility of plotting X-Y
graphs. For every k = 1, . . . . 8 we make a plot of the amounts
Ci,k+l (y-axis) of development year k+l against the amOUntS Cik
(x-axis) of development year k for i = 1, . . . . 10-k, and draw a
straight line through the origin with slope fkl. The plots for k
= 1 to 8 are shown in the upper graphs of Figures 1 to 8,
respectively. (All figures are to be found at the end of the
paper after the appendices.) The number above each point mark
indicates the corresponding accident year. (Note that the point
mark at the upper or right hand border line of each graph does
not belong to the plotted points (Cik, Ci,k+l), it has only been
127
2063.2.999 = 6187 is within the bulk of the data points plotted.
In any case, Figure 1 shows that any forecast of Cl0 2 is
,
associated with a high uncertainty of about k3000 or almost
*50% of an average-sized Ci,2 which subsequently is even
enlarged when extrapolating to ultimate. If in a future accident
year we have a value Gil outside the interval (2000, 4000) it is
reasonable to introduce an additional parameter by fitting a
regression line with positive intercept to the data and using it
for the projection to Ci2. Such a procedure of employing an
additional parameter is acceptable between the first two
development years in which we have the highest number of data
points of all years.
128
We need not to look at the regression lines with slopes fko or
fk2 as these slopes are very close to fk (except for k=l). But
we should look at the corresponding plots of weighted residuals
in order to see whether they appear more satisfactory than the
previous ones. (Note that due to the different weights the
residuals will be different even if the slopes are equal.) The
residual plots for fkO and k = 1 to 4 are shown in Figures 9 and
10. Those for fk2 and k = 1 to 4 are shown in Figures 11 and 12.
In the residual plot for fl,O (Figure 9, upper graph) the point
furthest to the left is not an outlier as it is in the plots for
fl,l = ft (Figur 1, lower graph) and f1,2 (Figure 11, upper
graph) . But with all three residual plots for k=l the main
problem is the missing intercept of the regression line which
leads to a decreasing trend in the residuals. Therefore the
improvement of the outlier is of secondary importance. For k = 2
the three residuals plots do not show any major differences
between each other. The same holds for k = 3 and 4. The residual
plots for k = 5 to 8 are not important because of the small
number of data points. Altogether, we decide to keep the usual
chain ladder method, i.e. the age-to-age factors fk = fk
1,
,
because the alternatives fk,O or fk,2 do not lead to a clear
improvement.
129
neither test leads to a rejection of the underlying assumption
as is shown in the appendices mentioned.
k 1 2 3 4 5 6 7 8 9
Ok2 27883 1109 691 61.2 119 40.8 1.34 7.88
130
(The numbers in the 'Overall'-row are R, s-e.(R) and s.e.(R)/R.)
For i = 2, 3 and 10 the percentage standard error (last column)
is more than 100% of the estimated reserve Ri. For i = 2 and 3
this is due to the small amount of the corresponding reserve and
is not important because the absolute amounts of the standard
errors are rather small. But the standard error of 150 % for the
most recent accident year i = 10 might lead to some concern in
practice. The main reason for this high standard error is the
high uncertainty of forecasting next year's value C10,2 as was
seen when examining the plot of Ci2 against Gil. Thus, one year
later we will very likely be able to give a much more precise
forecast of C1o,lo.
132
of t = -.8211 which corresponds to the 21st percentile. This
means that a 87% - 21% = 66% confidence interval for each
accident year leads to a 90% - 10% = 80% confidence interval for
the overall reserve amount. In the following table, the
confidence intervals thus obtained for Ri are already shifted
(by adding Ci,I+l-i) to confidence intervals for the ultimate
claims amounts CiI (for instance, the upper limit 16994 for i=2
has been obtained by adding C2,g = 16704 and 290 from the
preceding table):
confidence intervals
=i, 10 for 80% prob. overall empirical limits
133
can be found in Appendices G and Ii. They are
fk,nin 1 1.650 1.259 1.082 1.102 1.009 .W3 1.026 1.003 1.009
fk,max i 40.425 2.723 1.977 1.292 1.195 1.113 1.043 1.033 1.009
134
This package is a modelling framework in which the user can
specify his own model within a large class of models. But it
also contains some predefined models, inter alia also a 'chain
ladder model'. But this is not the usual chain ladder method,
instead, it is a loglinearized approximation of it. Therefore,
the estimates of the oustanding claims amounts differ from those
obtained here with the usual chain ladder method. Moreover, it
works with the logarithms of the incremental amounts
Ci k+l-Cik
I
and one must therefore eliminate the negative increment C2 7-
,
'2,6* In addition, C2 I was identified as an outlier and was
I
eliminated. Then the ICRFS results were quite similar to the
chain ladder results as can be seen in the following table:
Even though the reserves Ri for i=9 and i=lO as well as the
overall reserve R differ considerably they are all within one
standard error and therefore not significantly different. But it
should be remarked that this manner of using ICRFS is not
135
intended by Zehnwirth because any initial model should be
further adjusted according to the indications and plots given by
the program. In this particular case there were strong
indications for developing the model further but then one would
have to give up the 'chain ladder model'.
7. Final Remark
136
must be judged by an actuary and/or underwriter who knows the
business under consideration. Even then, unexpected future
changes can make all estimations obsolete. But for the many
normal cases it is good to have a sound and simple method.
Simple methods have the disadvantage of not capturing all
aspects of reality but have the advantage that the user is in a
position to know exactly how the method works and where its
weaknesses are. Moreover, a simple method can be explained to
non-actuaries in more detail. These are invaluable advantages of
simple models over more sophisticated ones.
137
Apoendix A: Unbiasedness of Acre-to-Acre Factors
138
I-k I-k
(A4) E(fklBk) = c cjkfk / c cjk = fk .
j=l j=l
139
Avvendix B: Minimizina the Variance of Indevendent Estimators
(Bl) iWi= 1
i-l
(which guarantees E(T) = t) is minimal iff the coefficients Wi
are inversely proportional to Var(Ti), i.e. iff
wi = c/Var(Ti) , lSi.51.
140
wi = X / (2*Var(Ti)) .
These weights Wi indeed lead to a minimum as can be seen by
calculating the extremal value of Var(T) and applying Schwarz's
inequality.
Var(TilAk) = Var(Ci,k+I/CikjCiIt...,Cik)
and arrive at the result that
the minimizing weights should be
inversely proportional t0 Var(Ci,k+I/CiklCiI,...,Cik).
141
Avvendix C: Unbiasedness of the Estimated Ultimate Claims Amount
142
This result can easily be extended to arbitrary products of
different fk’S, i.e. we have
E(CII) = E(E(CiIlCil,...rCi,I+l-i)) (4
= E(E(Ci,I+l-ifI+l-i..-..fI-1ICilt...~Ci,I+l-i)) (b)
= E(Ci,I+1-iElfI+l-i'...'fI-lICil~..-,Ci,I+l-i)) (C)
= E(Ci,I+l-iE(fI+l-i'...'fI-l)) Cd)
= E(Ci,I+r-i)*E(fI+l,i..-..fI-l) (e)
= E(Ci,I+1-i)'fI+1-i'...*fI-l - (f)
Here (a) holds because of the iterative rule for expectations,
(b) holds because of the definition (1) of CiT, (c) holds
because Ci,I+l-i is a scalar under the stated condition, (d)
holds because conditions which are independent from the
conditioned variable fI+l-i*e..*fI-l can be omitted (observe
assumption (4) and the fact that fI+l-i, . . . . fT-1 only depend
on variables of accident years < i), (e) holds because E( fI+l-
i’... *fI-1) is a scalar and (f) holds because of (Cl).
E(Cir) = E(E(CiIlCil,...,Ci,I-1))
p E(Ci, I-1fI-l)
= E(Ci,I-1) Q-1
143
= E(CilI-2fI-2)fI-1
= E(Ci,I-2) fI-2fI-1
= etc.
= E(Ci,I+l-i)fI+1-i'....fI-l
= E(CiI) *
. culation of the Standard Error of Ci1
. *
ProDosltloll : Under the assumptions
145
We first consider Vari(CiI). Because of the general rule Var(X)
= E(X2) - (E(X))2 we have
146
Now, we apply (D6) and (D5) successively to get
+ Ei(Ci,I-2)QI-22fI-12 +
+ Ei(Ci,I-22)fI-22fI-12
= Ci,I+l-lfr+l-l***fI-201-l 2+
+ Ci,I+l-lfI+l-l"'fI-301~22fI-12 +
+ Ei(Ci,r-3f~I-32fr-z2fI-12 +
+ Ei(Ci,I-32)fI-32fI-22fI-12
= etc.
I-l
= Ci,I+l-i D fI+l-i"'fk-lek2fk+12"'fI-l2
k=I+l-i
+ Ci,I+l-i 2f I+l-i2'*..*fI-12
where in the last step we have used Ei(Ci,I+l-if = Ci,I+l-i and
I-l
(D9) Vari(Cir) = Ci,I+l-i C fI+l-i "~fk,l~k2fk+12~~*fI-12
k=I+l-i
We estimate this first summand of mse(CiI) by replacing the
unknown parameters fk, Ok2 with their unbiased estimators fk and
Ok', i.e. by
I-l
(DlOl Ci,I+l-i D f1+1-i"' fk-l'=,f~f,f+l-*fI2,1 =
k=I+l-i
147
2 I-l ak2/fk2
z Ci,*+l-ifl+l-l"'f~-l C
k=I+l-i Ci,I+l-ifI+l-i"'fk-1
I-l ak2/fk2
= c"i, c
k=I+l-i Cik
where we have used the notation Cik introduced in the
proposition for the estimated amounts of the future Cik, k >
1+1-i, including Ci,I+l-i = Ci,I+l-is
Ei (CiI) = Ci,I+l-ifI+I-i'.*.'fI-I
and therefore
Sk = fI+l-i'...'fk-ffkfk+l.....fI-1 -
- fI+l-i'...'fk-lfkfk+I*...*fI-I
= f*+1-is... 'fk-l(fk-fk)fk+l'...'fI-1 .
This yields
148
F2 = (SI+l-i + ea. + ~S1-1)~
I-l
x c Sk2 + 2 c sjsk s
k=I+l-i -f<k
where in the last summation j and k run from 1+1-i to I-l. Now
we replace Sk2 with E(Sk2/Bk) and sjsk, j < k, with S(SjSklBk).
This means that we approximate Sk2 and sjsk by varying and
averaging as little data as possible so that as many ValUSS Cik
= ek2 / fs:cjk
we obtain
I-k
E(sk2 1%) = fi+1-i ...f:,l":f:+l".f;-l / c cjk e
j=l
149
fkr ak2. Altogether, we estimate F2 = (fI+l-i'...*fI-1 -
fI+l-~-...*f~-~)2 by
I-l 2 2 2
c ( f1+1-i". &‘ak’fk+l ’ " fi-1 / >=Ibjk ) =
k=I+l-i
2 2 2 I-l ak2/fk2
(D13) Ci,I+l-ifI+l-i'.**'fI-1 c =
k=I+l-i I-k
c cjk
j=l
I-l ak2/ fk2
= c:, c
k=I+l-i I-k '
c cjk
j=l
From (D2), (DlO) and (D13) we finally obtain the estimator
150
5 ak2
Proof: In this proof all summations are over the index j from
j=l to j=I-k. The definition of Ok2 can be rewritten as
151
omitted in E(Cj,k+l'IBk), i.e.
= cjkak2 + (cjkfk12
where the rule E(X2) = Var(X) + (E(X))2 and the assumptions (5)
and (3) have also been used.
152
Awwendix F: The Standard Error of the Overall Reserve Estimate
153
I
(=I Var( i CiIlD) = z Var(CiIICil, ---, Ci,I+l-i) ,
i=2 i=2
whose summands have been calculated in Appendix D, see (D9).
Furthermore
I
(F3) ( E( g CiIID) - cciI)2= ( i ( E(CiIlD) -Ci1) )2=
i=2 i=2 i=2
= c (E(CiIID) - CiI)‘(E(CjI(D) - Cj1)
Zli,jSI
= c Ci,I+I-iCj,I+I-jFiFj
Zli,jSI
I
= C (Ci,I+1-iFi12 + 2 C Ci,I+l-i.Cj,I+l-jFiFj
i=2 icj
with (like in (Dll))
Fi = fI+I-i"'fI-1 - fI+l-i"'fI-1
which is identical to F of Appendix D but here we have to carry
the index i, too. In Appendix D we have shown (cf. (D2) and
(Dll)) that
mse(Ri) = Var(CiIlCiI,...,Ci,I+I-i) + (Ci,I+I-iFi)' .
Comparing this with (Fl), (F2) and (F3) we see that
I I
(F4) mse( C Ri) = D mse(Ri) + C 2'Ci,I+l-iCj,I+I-jFiFj.
i-2 i=2 2Si<j51
We therefore need only develop an estimator for FiFj. A
procedure completely analogous to that for F2 in the proof of
Appendix D yields for FiFj, i-zj, the estimator
I-l 2 I-k
c f1+1-j ...f~-la:f~+l...fr-l, E C* ,
k=I+l-i n=l
which immediately leads to the result stated in the proposition.
154
ADDendiX G: Testins for Correlations between Subsequent
DeveloDment Factors
155
From (Gl) we obtain through the specialization j = k
156
smallest one on rank one and so on. Let rik, 1 I i < I-k, denote
the rank of ci,k+l/cik obtained in this way, 1 I rik 5 I-k. Then
we do the same with the preceding development factors
Tk is defined to be
E(Q) = 0 ,
Var(Tk) = 1/(1-k-l) .
A value Of Tk close to 0 indicates that the development factors
between development years k-l and k and those between years k
and k+l are not correlated. Any other value of Tk indicates that
the factors are (positively or negatively) correlated.
157
values T2, T3, . . . . TI-2 obtained in the same way like Tk.
(There is no Tl because there are no development factors before
development year k=l and similarly there is also no TI; even
TX-1 is not included because there is only one rank and
therefore no randomness.) According to Appendix B we should not
form an unweighted average of T2, .,., TI-2 but rather use
weights which are inversely proportional to Var(Tk) = 1/(1-k-l).
This leads to weights which are just equal to one less than the
number of pairs (rik, sik) taken into account by Tk which seems
very reasonable.
We thus calculate
I-2
(G5) T = 'X2 (I-k-l)Tk / C (I-k-l)
k=2 k=2
I-2 I-k-l
= c Tk I
k=2 (I-2)(1-3)/2
I-2
E(T) = C E(Tk) = o ,
k=2
I-2
c-1 Var(T) = '.X2 (I-k-l)2 Var(Tk) / ( C (I-k-l) )2
k=2 k=2
I-2
= c (I-k-l) / ( 'X2 (I-k-l) )2
k=2 k=2
A.
=
(I-2)(1-3)/2
where for the calculation of Var(T) we used the fact that under
the null-hypothesis subsequent development factors and therefore
also different Tk's are uncorrelated.
158
Because the distribution of a single Tk with I-k 2 10 is Normal
in good approximation and because T is the aggregation of
several uncorrelated Tk's (which all are symmetrically
distributed around their mean 0) we can assume that T has
approximately a Normal distribution and use this to design a
significance test. Usually, when applying a significance test
one rejects the null-hypothesis if it is very unlikely to hold,
e.g. if the value of the test statistic is outside its 95%
confidence interval. But in our case we propose to use only a
50% confidence interval because the test is only of an
approximate nature and because we want to detect correlations
already in a substantial part of the run-off triangle.
Therefore, as the probability for a Standard Normal variate
lying in the interval (-.67, . 67) is 50% we do not reject the
null-hypothesis of having uncorrelated development factors if
.67 .67
S T I +
d((~-2) (I-3)/2) d((I-2)(1-3)/2) '
If T is outside this interval we should be reluctant with the
application of the chain ladder method and analyze the
correlations in more detail.
c anter 6:
We start with the table of all development factors:
159
FI F2 F3 F4 F5 F6 F7 F8 F9
i=l 1.6 1.32 1.08 1.15 1.20 1.11 1.033 1.00 1.01
i=2 40.4 1.26 1.98 1.29 1.13 0.99 1.043 1.03
i=3 2.6 1.54 1.16 1.16 1.19 1.03 1.026
i=4 2.0 1.36 1.35 1.10 1.11 1.04
i-5 8.8 1.66 1.40 1.17 1.01
i=6 4.3 1.82 1.11 1.23
i=7 7.2 2.72 1.12
i=8 5.1 1.89
i=9 1.7
ril si2 ri2 si3 ri3 si4 ri4 si.5 ri5 si6 ri6 si7 ri7 si8 ri8
112 2 112 2 5 4 4 3 2 11
9 8 117 6 6 5 3 2 113 2 2
4 3 4 4 4 3 3 3 4 3 2 2 1
3 2 3 3 5 4 112 13
8 7 5 5 6 5 4 41
5 4 6 6 2 2 5
7 6 8 7 3
6 5 7
2
160
randomness.) From these figures we obtain Spearman's rank
correlation coefficients Tk according to formula (G4):
k 2 3 4 5 6 7 8
161
ADDendiX H: Testina for Calendar Year Effects
162
Therefore, in order to check for such calendar year influences
we only have to subdivide all development factors into 'smaller'
and *larger* ones and then to examine whether there are
diagonals where the small development factors or the large ones
clearly prevail. For this purpose, we order for every k, 1 5 k 5
I-l, the elements of the set
Fk = f ci,k+l/cik 1 1 S i 5 1-k ) I
i.e. of the column of all development factors observed between
development years k and k+l, according to their size and
subdivide them into one part LFk of larger factors being greater
than the median of Fk and into a second part SFk of smaller
factors below the median of Fk. (The median of a set of real
Having done this procedure for each set Fk, 1 I k I I-l, every
163
Now we count for every diagonal Aj, 1 I j 5 I-l, of development
factors the number Lj of large factors, i.e. elements of L, and
the number Sj of small factors, i.e. elements of S. Intuitively,
if there is no specific change from calendar year j to calendar
year j+l, Aj should have about the same number of small factors
as of large factors, i.e. Lj and Sj should be of approximately
the same size apart from pure random fluctuations. But if Lj is
significantly larger or smaller than Sj or, equivalently, if
zj = min(Lj, Sj) ,
165
E(Z) = E(Z2) + . . . + E(ZT-1) ,
Var(Z) = Var(Z2) + . . . + Var(ZT-1)
and we can assume that Z approximately has a Normal
distribution. This means that we reject (with an error
probability of 5 %) the hypothesis of having no significant
calendar year effects only if not
E(Z) - 2*+ar(Z) 5 Z 5 E(Z) + 2&ar(Z) .
F1 F2 F3 F4 F5 F6 F7 F8 F9
i=l 1.6 1.32 1.08 1.15 1.20 1.11 1.033 1.00 1.01
i=2 40.4 1.26 1.98 1.29 1.13 0.99 1.043 1.03
i=3 2.6 1.54 1.16 1.16 1.19 1.03 1.026
i=4 2.0 1.36 1.35 1.10 1.11 1.04
i=5 8.8 1.66 1.40 1.17 1.01
i=6 4.3 1.82 1.11 1.23
i=7 7.2 2.72 1.12
i=8 5.1 1.89
i=9 1.7
166
‘S’, a large rank with 'L' and a mean rank with *** we obtain
the following picture:
j=l s s s s L L * s *
j=2 L s L L * s L L
j=3 s s l s L s s
j ~‘4 s s L s s L
j==5 L L L L s
j=6 * L s L
j=7 L L s
j=8 L L
j=9 S
1 1 2 0 .5 .25
0 0 3 1 .75 .1875
1 1 4 1 1.25 .4375
3 1 4 1 1.25 .4375
3 1 4 1 1.25 .4375
4 2 6 2 2.0625 .6211
4 4 8 3 2.90625 .8037
4 4 8 3 2.90625 .8037
167
therefore the null-hypothesis of not having significant calendar
year influences is not rejected so that we can continue to apply
the chain ladder method.
168
Figure 1 : Regression and Residuals
CL2 against Gil
12000
a
9000 81
0
6000
3000
0 f
C 2000 4000 6000
Gil
480
cI
240
0
a
0
---_-----___________-------------------------------
0
a 0
0
-480 1-
0 2000 4ocm 6000
Cil
Figure 2: Regression and Residuals
Ci3 against Ci2
2oow
15000
10000
5000
Ci2
Ci2
170
Figure 3: Regression and Residuals
Ci4 against Ci3
24000
180oc
12000
6000
0
0 6000 12000 180 100
Ci3
60
q
30
5
-5
'i?
Q,
L 0
B
2
.-p1
g -30
-60 -I
0 6Oba 12000 113Ctoo
Ci3
Figure 3: Regression and Residuals
Ci4 against Ci3
24000
85
12000
6000
0 4 I
0 6000 12000 16000
Ci3
60
30
.-s
? 0
B
q
z
or 000
‘G
?= -30
-80 -I- I I
0 6000 12000 1E 00
Ci3
172
Figure 4: Regression and Residuals
Ci5 against Ci4
30000
Ci4
16
Ci4
173
Figure 5: Regression and Residuals
Ci6 against Ci5
32000
24000
16000
27000
Ci5
0 0
0
---------------------------------------------~------
-20 I /
0 9000 18000 27000
Ci5
174
Figure 6: Regression and Residuals
Ci7 against Ci6
Ci6
10
0 ____________________--------------------------------
0
0
-5
0
-10 r 1 ,
0 10000 20000 30000
Ci6
175
Figure 7: Regression and Residuals
Ci8 against Ci7
32000
24000
16000
8000
.--ii
-33
_________________- -------------~--------------------.
i? 0
TJ
3
c
.-or
g -, q
-2
a 10000 20000 3QOOO
Ci7
176
Figure 8: Regression and Residuals
Ci9 against Ci8
28uw
21000
14000
7000
_-_---____-____---__---------------------------------
Cl
r i I
138
177
Figure 9: Residual Plots for fk0
8000 T-
X
+ooo-> X
X
X
____-_____-______-______________________------
X
x
-8000 r
a 2000 4000 6000
Cil
5000
X
2500
X
0 x
s X
'Z
2 0 ------------------_-------------------------------
X
a
3
-c X
,-m
X
x
g -2500
-5000
0 4000 8000 120 100
Ci2
178
Figure 7 0: Residual Plots for fkO
5000
2500
X
0 x
.-G
% 0 -____--__--_--_------------------------------ _---
-u
3
s x x
a-cn
x x
g -2500
-5000
6000 72000
Ci3
1500
X
X
750
0
-0
3
‘63
p1 01
X
------------------------------------------
I
X
-1500 f r / 1
cl 8000 16ODO 2 +c 100
Ci4
179
Figure 1 1 : Residual Plots for fk2
-6---~-----------------------------------------‘
"V
v v V V
1.2
V
.6
V
V
0 _____________-------____________________-----------~
V
180
Figure 12: Residual Plots for fk2
-.8
Ci3
.li
V
.06
V
5
2
'Z
L
@ 0 ____________________-------------------------------
-0 v
a, V
z v
al
'5
F -.06
-.12 , - 1 I
181
Figure 13: Plot of In(wk2) against k
I Ln co
0
182