Statistical estimation techniques in life and non-life insurance An overview Boualem Djehiche KTH Royal Institute of Technology, Stockholm April 2013 (Preliminary and incomplete) Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Outline of the course Statistical inference of basic models in I. II. III. Collective risk theory Life insurance Non-life insurance Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview I. Collective risk theory The basic model of collective risk theory deals with a model of the risk business of an insurance company, and study the probability of ruin, i.e., the probability that the risk business ever will be below some specific negative value. Our goal is to do statistical inference of the ruin probability for ”discretely” observed risk business. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A. The classical risk process Let (Ω, F, P) be a complete probability space carrying the following independent objects, among others: (i) a Poisson process N = {Nt , t ≥ 0} (N0 = 0) with intensity λ ≥ 0 i.e. E[Nt ] = λt, Nt is the number of claims on the company during (0, t]; (ii) a sequence {Uj }j=1,2,... of i.i.d. r.v. with common distribution function F , with F (0) = 0, supported by (0, +∞), with mean µ and finite variance. Uj is the size of the claim j. The risk process X is defined by Xt = ct − Nt X Uj . (1) j=1 where c > 0 is a constant called gross risk premium rate. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview 105 100 95 90 85 80 75 70 0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 Figure: Classical risk process u + Xt : u = 100, c = 50, λ = 15, F (dy ) ∼ exp(10). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview In insurance terminology, the condition F (0) = 0, c > 0 is called positive risk sums. This includes most non-life branches and ordinary life products. The case F (0) = 1, c < 0 is called negative risk sum. A typical example of such a situation is the Life annuity or pension insurance, where −c is the life annuity rate paid from the company to the policyholder, and the claim (death of the policyholder) places an amount corresponding to the expected pension to be paid. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The process St := rate λ. PNt j=1 Uj is called a compound Poisson process with jump The Laplace transform of St reads E[e rSt ] = e λtg (z) , where Z (2) +∞ g (z) = (e zy − 1)dF (y ) (3) 0 is the associated cumulative function. Let Dg be the maximal open interval of elements a such that g (a) < ∞. Put, g (a) = +∞ if a 6∈ D̄g . Under Assumption 0 ∈ Dg , g is strictly convex, C ∞ , g (0) = 0 and g 0 (0) = µ. In particular, the Laplace exponent of −Xt is ϑ(z) := t −1 log E[e −zXt ] = −cz + λg (z). Boualem Djehiche KTH Royal Institute of Technology, Stockholm (4) Statistical estimation techniques in life and non-life insurance An overview The expected profit at time t is E[Xt ] = ct − E[St ] = ct − E[Nt ]E[U1 ] = (c − λµ)t. (5) The relative safety loading is the ratio of the expected profit to the expected total claims: c − λµ c ρ := = − 1. (6) λµ λµ The risk process X is said to have a positive safety loading if ρ > 0 (the business is solvent) or c > λµ (called the net profit condition). Theorem Under the net profit condition c > λµ, the equation ϑ(z) = 0 admits a unique nontrivial solution, R ∈ Dg called the Lundberg exponent (or local adjustment coefficient) : c g (R) = R. (7) λ Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Ruin probability The (infinite time horizon) ruin probability of a company facing the risk process (1) and having initial capital u is ψ(u) = P{u + Xt < 0 for some t > 0} = P{τ (u) < ∞}, (8) where, the stopping time τ (u) = inf{t ≥ 0, u + Xt < 0} (9) is the ruin time of the risk process X . We note that by the strong law of large numbers, limt→∞ Xt t = c − λµ a.s. Thus, in case of positive safety loading (c > λµ), ψ(∞) = 0. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview It can be shown that (see Cramér (1955) for a direct approach and Feller (1971) for an approach using a renewal argument) the probability of non-ruin, φ(u) := 1 − ψ(u) solves the equation Z λ u φ(u) = φ(0) + φ(u − y )(1 − F (y ))dy . (10) c 0 By monotone convergence, as u → +∞, we get φ(∞) = φ(0) + λµ φ(∞). c (11) Noting that φ(∞) = 1 − ψ(∞) = 1, we get that 1 − φ(∞) = ψ(0) = Boualem Djehiche KTH Royal Institute of Technology, Stockholm λµ 1 = , c 1+ρ if ρ > 0. Statistical estimation techniques in life and non-life insurance An overview In terms of the Lundberg exponent we have the following upper bound of the ruin probability Lundberg bound. For any initial capital u ≥ 0, ψ(u) ≤ e −Ru . (12) Furthermore, this upper bound is optimal: For any > 0, lim e (1+)Ru ψ(u) = +∞. u→+∞ More precisely, we have the Cramér-Lundberg approximation of the ruin probability ρµ c − λµ lim e Ru ψ(u) = 0 = . (13) u→+∞ g (R) − c/λ λg 0 (R) − c Relations (12) and (13) imply that statistical inference on the ruin probability boils down to statistical estimation of the Lundberg exponent. This is what we will do in the rest of this chapter. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Example: Exponentially distributed claims Assume F is an exponential distribution function with parameter µ: F̄ (x) = 1 − F (x) = e µx , x > 0. Then, the Lundberg exponent is R= ρ µ(1 + ρ) and the ruin probability is ψ(u) = 1 − ρ u − ρ u e µ(1+ρ) ≤ e µ(1+ρ) . 1+ρ Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview B. A jump-diffusion (Gerber) risk model The new risk process introduced by Gerber (1970) is defined by Xt = ct + σWt − Nt X Uj . (14) j=1 where, W is a standard Brownian motion independent of the compound Poisson process. It takes account of possible fluctuations of the gross premium rate. The main feature of this process is that ”the small” negative jumps in an interval are due to either a diffusion shock or to a jump of the compound Poisson process. In practice, it is very difficult to distinguish between these two types of jumps from a given sample of observations of the risk process X . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview 170 160 150 140 130 120 110 100 90 80 70 0 1 2 3 4 5 6 7 8 9 10 Figure: Jump-diffusion risk process u + Xt : u = 100, c = 50, λ = 10, F (dy ) ∼ |N(0, 15)|, Wt ∼ N(0, 9t). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview For this process, the Laplace exponent of −Xt is ϑ(z) := t −1 log E[e −zXt ] = −cz + σ2 2 z + λg (z). 2 (15) Moreover, under the safety loading condition c > λµ, the Lundberg exponent is the unique nontrivial solution R ∈ Dg of the equation ϑ(R) = 0. The Lundberg upper bound (12 of the ruin probability remains valid and the Cramér-Lundberg approximation becomes lim e Ru ψ(u) = u→+∞ Boualem Djehiche KTH Royal Institute of Technology, Stockholm c − λµ . λg 0 (R) − c + Rσ 2 (16) Statistical estimation techniques in life and non-life insurance An overview Estimation of the Lundberg exponent Recall that the Lundberg exponent solves −cR + σ2 2 R + λg (R) = 0. 2 (17) We should estimate σ, λ and g from observations of the path X . I. Inference from complete observations (the classical approach) (based on work essentially by Grandell) If we observe the complete path of X over [0, t], we assume c known and set σ = 0. Then I I λ is estimated with Nt /t, since by the SLLN limt→∞ Ntt = λ a.s. R∞ g is estimated as follows. An estimator of G (z) = 0 e zy dF (y ) is Ĝ (z) := Nt 1 X zUj e . Nt j=1 An estimator R̂t of R is solution of −c R̂t + Nt (Ĝ (R̂t ) − 1) = 0. t Boualem Djehiche KTH Royal Institute of Technology, Stockholm (18) Statistical estimation techniques in life and non-life insurance An overview Theorem Assume g (2R) < ∞. Then √ t(R̂t − R) →d Y , as t → ∞, (19) where Y is a normally distributed r.v. with E[Y ] = 0 and variance σY2 = Var (Y ) = g (2R) − 2cR/λ . λ(g 0 (R) − c/λ)2 In general σY is unknown and we have to replace it with its natural estimator q t Ĝ (2R̂t ) σ̂y = √ Ĝ 0 (R̂t ) Nt 1.6σ̂ A one-sided approximate 95%-confidence interval for R is (R̂t − √t y , +∞). An empirical Lundberg inequality which holds for all u in approximately 95% of all investigations is ψ(u) ≤ e 1.6σ̂ −(R̂t − √ y )u t . (20) Finer confidence intervals based on the Cramér-Lundberg approximation can also be obtained (see e.g. Grandell (1991)). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview II. Inference from discrete observations (based on the work by Shimizu and Yoshida) We assume that the risk process X Xt = ct + σWt − Nt X Uj j=1 is observed at (n + 1)-time points tin = ihn , i = 0, 1, . . . , n, and hn > 0 . The sampled observation is denoted by X n = {Xtin }i=0,...,n . When we consider asymptotics of the estimators we will assume that hn → 0 and nhn → ∞ as n → ∞. Under discrete sampling we do not know the exact Poisson process Nt , claims sizes Uj nor σ. Moreover, we cannot really tell whether a ”small” negative shock in an interval is due to a diffusion shock or to the occurrence of a claim. We only have to relay on detecting the occurrence and the likely size of a jumps of the risk process in a given interval. We briefly describe such a filter. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview n Denote the size of a jump that occurs in (ti−1 , tin ] by n ∆ni X := Xtin − Xti−1 . n and let Jin denote the number of jumps in (ti−1 , tin ]. Using Doob and Burkholder-Davis-Gundy type inequalities , it holds that for any p ≥ 0 p sup E sup Wt = O(hnp/2 ). (note that hn → 0, as n → ∞.) 1≤i≤n n ≤t≤t n ti−1 i This in turn implies that for any L > 0, q ∈ (0, 1/2) and any p ≥ 0, P{ sup n ≤t≤t n ti−1 i n |Xt − Xti−1 | > Lhnq , Jin = 0} ≤ Cp hnp . This inequality states that a jump occurs if |∆ni X | > Lhnq . Furthermore, |∆ni X | can be a good approximation of the jump size as hn → 0. In practice, the estimation of the thresholds L and q from the observations is critical. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Since our risk process has only negative jumps, we have the following refinement of the the above estimates: Lemma Assume that there is a constant C > 0 such that F () ≤ C for any > 0 small enough. Then, for any L > 0, any q ∈ (0, 1/2) and any p ≥ 0, P{∆ni X < −Lhnq , Jin = 0} = O(hnp ), P{∆ni X ≥ −Lhnq , Jin = 0} = e −λhn − O(hnp ), P{∆ni X < −Lhnq , Jin = 1} = O(hn ), P{∆ni X ≥ −Lhnq , Jin = 1} = O(hn1+q ), P{∆ni X < −Lhnq , Jin ≥ 2} = O(hn2 ), P{∆ni X ≥ −Lhnq , Jin ≥ 2} = O(hn2 ), as n → ∞. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Based on the previous lemma, we may introduce the following filters. For L > 0 and p ∈ [0, 1/2) Cin := {ω ∈ Ω, ∆ni X ≥ −Lhnp }, Din := {ω ∈ Ω, ∆ni X < −Lhnp }, (21) where, Cin is a filter to find an interval with no jumps, and Din is a filter to detect a jump. Estimation of the diffusion coefficient σ Theorem For p ∈ (0, 1/2), let Pn σn2 := n 2 1Cin i=1 (∆i X − chn ) 1 Pn hn i=1 11Cin . (22) Suppose that hn → 0 and nhn → ∞ as n → ∞. Then σ̂n2 →P σ 2 . If, in addition nhn2 → 0 as n → ∞, √ √ n(σ̂n2 − σ 2 ) →d 2σ 2 N(0, 1). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Estimation of the compound Poisson parameters We consider estimation of functionals of F of the form Z ∞ F [Mz ] := Mz (y )dF (y ), z ∈ (a, b) 0 An example includes Mz (y ) = e zy − 1 and its derivatives w.r.t. y . Here is a general result that includes all the cases we need in our setting Theorem For p ∈ (0, 1/2), consider the estimator n 1 X fˆn [Mz ] := Mz (|∆ni X |)11Din . nhn i=1 (23) Suppose that there exists a constant C > 0 such that |∂zj Mz (y )| ≤ C (1 + |y |)e by for j = 0, 1. Then as n → ∞ such that hn → 0 and nhn → ∞, fˆn [Mz ] →P λF [Mz ]. Moreover, if ∂y ∂z Mz (y ) exists and is such that |∂y ∂z Mz (y )| ≤ C (1 + |y |)e by . Then sup |fˆn [Mz ] − λF [Mz ]| →P 0, n → ∞. z∈(a,b) Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Setting Mz (y ) = 1 and Mz (y ) = y respectively, and choosing the compact interval [a, b] ⊂ Dg large enough to contain R, we obtain estimators of λ and λµ. n 1 X λ̂n := 11Din →P λ, (24) nhn i=1 n 1 X n |∆i X |11Din →P λµ. nhn i=1 (25) as n → ∞ such that hn → 0 and nhn → ∞. Hence, an empirical safety loading condition is: c> n 1 X n |∆i X |11Din . nhn i=1 (26) We also have asymptotic normality of these estimators: Theorem Suppose that as n → ∞, nhn → ∞ and nhn1+δ → 0 for some δ ∈ (0, 1). Then, for p ∈ (δ/2, 1/2), we have √ √ nhn (λ̂n − λ) →d λN(0, 1), n → ∞. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Estimation of the Lundberg exponent An estimator R̂n > 0 of R is solution of ϑ̂n (z) := −cz + n σ̂n2 2 1 X z|∆ni X | z + (e − 1)11Din = 0. 2 nhn i=1 (27) We have ϑ̂n (0) = 0, ϑ̂0n (z) = −c + σ̂n2 z + n 1 X z|∆ni X | n (e ∆i |11Din nhn i=1 increasingly tends to ∞ as z → ∞. ϑ̂00n (z) = σ̂n2 + n 1 X z|∆ni X | n 2 (e ∆i | 11Din > 0, nhn i=1 z > 0. Then, the equation ϑ̂n (R̂n ) = 0, R̂n > 0, admits a unique solution if ϑ̂00n (0) < 0, which is equivalent to the empirical safety loading condition (26): c> n 1 X n |∆i X |11Din . nhn i=1 Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Theorem Suppose that the safety loading condition c > λµ holds. Then R̂n →P R as n → ∞ such that hn → 0 and nhn → ∞. Suppose further that Z ∞ g (2R) := (e 2Ry − 1)dF (y ) < ∞. 0 Then, √ nhn (R̂n − R) →d ΣN(0, 1) as n → ∞ such that hn → 0 and nhn → ∞, where, Σ2 = λ(g (2R) − 2g (R) − 2) . (σ 2 R − c + λg 0 (R))2 We note that when σ = 0 we obtain the classical result obtained above. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Possible extensions Carry out a similar program for finite-time ruin probabilities ψ(T , u) := P(τ (u) ≤ T ) of the risk process with variable premium: t Z Xt = u + t Z σ(s, Xs )dWs − b(s, Xs )ds + 0 0 Nt X Uj , 0 ≤ t ≤ T. j=1 Tools: Large deviations techniques and Siegmund duality (see Djehiche (1993) or Asmussen (2000) for the case σ = 0). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References- Incomplete list Aalen, Odd O.; Hoem, Jan M (1978): Random time changes for multivariate counting processes. Scand. Actuar. J. no. 2, 81101. Andersen, P.K:, Borgan, Ø., Gill, R.D., Keiding, N. (1993): Statistical models based on counting processes, Springer-Verlag, New York, Berlin, Heidelberg. Asmussen, S. (2000): Ruin Probabilities. World Scientific, Singapore. Cramér, H. (1930): On the Mathematical Theory of Risk. Skandia Jubilee Volume, Stockholm. Cramér, H. (1955): Collective Risk Theory. Skandia Jubilee Volume, Stockholm. Djehiche, B. (1993): A large deviations estimate of ruin probabilities. Scand. Actuarial J. (1), pp. 42-59. Feller, W. (1971): An Introduction to Probability Theory and Its Applications. Wiley, New York. Grandell, J. (1991): Aspects of risk theory. Springer-Verlag, New York. Karr, Alan F. (1986): Point processes and their statistical inference. Marcel Dekker, Inc. New York and Basel. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References II Lundberg, F. (1903): I. Approximerad Framställning av Sannolikhetsfunktioner. II. Aterförsäkring av Kollektivrisker. Almqvist och Wiksell, Uppsala. Lundberg, F. (1926): Försäkringsteknisk Riskutjämning. F. Englunds boktryckeri, AB, Stockholm. Mancini, C. (2004): Estimation of the characteristics of the jump of a general Poisson-diffusion model. Scand. Actuar. J. 2004, no. 1, 42-52. Shimizu, Y. and Yoshida, N. (2006): Estimation of parameters for diffusion processes with jumps from discrete observations. Stat. Inference Stoch. Process. 9 (2006), no. 3, 227-277. Shimizu, Y. (2009): A new aspect of a risk process and its statistical inference. Insurance Math. Econom. 44(1), 70-77. Shimizu, Y. (2009): Functional estimation for Lvy measures of semimartingales with Poissonian jumps. J. Multivariate Anal. 100(6), 1073-1092. Shimizu, Y. (2008): A practical inference for discretely observed jump-diffusions from finite samples. J. Japan Statist. Soc. 38 (2008), no. 3, 391-413. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview II. Life Insurance By life insurance policy or contract we mean any form of person insurance contract over a (long) period of time such as life or pension and disability or sickness coverage. In such products, premiums and benefits are typically contingent to upon transitions of the policyholder between a number of states stated in the contract. Thereof the use of the powerful (semi)-Markov chain theory to carry out the valuation and estimation of the contracts. We first give a short introduction to the basic constituents of a life insurance contract and related reserving. Then we single out the main parameters that control the evolution of the life insurance contract and focus on their statistical estimation. These parameters are the mortality rate and disability inception and recovery rates. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A Markov chain model of a life insurance contract Let E = {0, 1, 2, . . . , m} be the (finite) set of possible states of the policy. Starting at 0, the policy is assumed to be in one and only one state at each time. Let X (t) denote the state of the policy at time t ∈ [0, n]. We assume that the process X is right-continuous with a finite number of jumps, with transition probability pij (s, t) = P[X (t) = i|X (s) = j], i, j ∈ E , 0≤s≤t≤n (28) and transition intensity µij := lim h↓0 pij (t, t + h) , h i 6= j The total transition intensity from state i at time t is X µi· (t) = µik (t) (29) (30) k:k6=i so that pii (t, t + dt) = 1 − µi· (t)dt + o(t). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Basic Kolmogorov equations A. The Kolmogorov backward equation: for s ≤ t, ∂pij P ∂s (s, t) = µi· (s)pij (s, t) − k:k6=i µik (s)pkj (s, t), B. The Kolmogorov forward equation: for s ≤ t, ∂pij P ∂t (s, t) = −pij (s, t)µj· (t) + k:k6=i pik (s, t)µkj (t), (31) pij (t, t) = δij . (32) pij (s, s) = δij . C. The Chapman-Kolmogorov equation X pik (s, u) = pij (s, t)pjk (t, u), s ≤ t ≤ u. (33) j∈E The key parameter in this Markov chain framework is the transition intensity which is the object of our statistical inference study. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Examples 1. Single life with one cause of death (one absorbing state) Here E = {0, 1}, where state 0 = alive, state 1= dead (absorbing state). If T denotes the life length of a person with survival probability F̄ (t) = P(T > t), the Markov chain counts the number of deaths: X (t) = 11{T ≤t} , t ∈ [0, n], with transition probability p00 (s, t) = Rt F̄ (t) = e − s µ(u)du . F̄ (s) µ is called mortality intensity (rate, force). Its estimation from data is of central importance in Life Insurance. '$ '$ a &% Boualem Djehiche KTH Royal Institute of Technology, Stockholm µ d &% Statistical estimation techniques in life and non-life insurance An overview 2. Single life with m causes of death (m absorbing states) Here E = {0, 1, . . . , m}, where state 0 = alive, state j= dead with cause j (absorbing state). These absorbing states model different causes of death such as death by ”car accident”, ”normal death”, ”death caused by a disease” etc.. The total mortality intensity is µ0· (t) := µ(t) = r X µj (t), (34) j=1 where, µj (t) := µ0j (t) denotes the mortality rate for death with cause j. This is nothing but the transition intensity from state 0 (alive) to the absorbing state j. The probability that an s years old person will die from cause j before age t is then Z t Ru p0j (s, t) = e − s µ(τ )dτ µj (u)du. (35) s Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview '$ a &% @ @ @ @ @ µj @ µ1 µn @ @ @ @ @ '$ d1 &% ? '$ ... ... dj &% Boualem Djehiche KTH Royal Institute of Technology, Stockholm @ R @ '$ dn &% Statistical estimation techniques in life and non-life insurance An overview 3. Disability, recovery and death This model is widely used to analyze insurance contracts with payments depending on the state of the health of the insured. For example I Sickness insurance that provides an annuity benefit during disability periods; I Life insurance with premium waiver during disability; I Pension with additional benefits to other members of the family. Here the possible states are a = alive/active, i=invalid/unemployed, and d=dead/recovered or any other suitable labeling. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview '$ α(t) '$ β(t) i &% a A &% A A A A A A A µ(t) A A A ν(t) A U A '$ d &% Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Payment streams and reserving techniques Let X be the Markov chain with intensities µij associated with an insurance contract. Let Ij (t) = 11{X (t)=j} , t ∈ [0, T ] denote the indicator process of whether the policy is in state j or not, and Nij (t) = #{s : X (s − ) = i, X (s) = j, s ∈ (0, t]} denote the number of transitions from state i to state j during the time interval (0, t]. We have dIj (t) = dN·j (t) − dNj· (t), (36) where, N·j (t) := X Nkj (t), Nj· (t) := k;k6=j X Njk (t). k;k6=j We have, for t ≤ u E [Ij (u)|X (t) = i] = pij (t, u), (37) E [dNjk (u)|X (t) = i] = pij (t, u)µjk (u)du. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A standard payment stream A (benefits less premiums) has usually the following form: X X Ij (t)dAj (t) + ajk (t)dNjk (t) , dA(t) := j (38) k;k6=j where, dAj (t) := aj (t)dt + (Aj (t) − Aj (t − )) = aj (t)dt + ∆Aj (t). (39) specifies the so-called general life annuity payment i.e. payments due during sojourn in state j. The payment aj (t) is the rate of a state-wise annuity payable continuously at time t, while the lump sum payment ∆Aj (t) is an endowment at time t. The annuity function Aj is usually assumed to have a finite number of discontinuity points {t1 , t2 , . . . , tq }. The payments ajk (t) specify the so-called general life assurance i.e. amounts that are payable immediately upon transition from state j to state k. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Expected present values and prospective reserves The liability at time t for which the insurer should provide a reserve (prospective reserve) is the present value of the payment streams (future benefits less premiums) A over the lifespan [t, n] of the insurance contract: Z n Rs (40) V (t) = e − t r (u)du dA(s). t When the policy is in state i at time t, then, in view of (37), the state-wise prospective reserve is Rs Rn Vi (t) := E [V (t)|X (t) = i] = t e − t r (u)du E [dA(s)|X (t) = i] (41) Rs Rn P P = t e − t r (u)du j pij (t, s) dAj (t) + k;k6=j ajk (s)µjk (s)ds , when r , aj , aik are all deterministic function. Written in differential form, Vj satisfies the following Feynman-Kac type formula: Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The Backward Thiele’s differential equation (Actuarial Black and Scholes formula!) dVi dt (t) = (r (t) + µi· (t))Vi (t) − t ∈ (tp−1 , tp ), ∆Vj (tp ) = −∆Aj (tp ), Vj (n) = 0. P j;j6=i µij (t)Vj (t) − ai (t) − P j;j6=i aij (t)µij (t), p = 1, . . . , q, p = 1, 2, . . . , q, i ∈ E , (42) This equation admits an explicit solution only for a few uninteresting/trivial insurance contracts. In most cases it is solved using a numerical integration recipe. A fourth order ”Runge-Kutta” procedure seems to work efficiently in almost all practical situations. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Thiele’s equation can be recast in the following form ”preferred by actuaries” X − ai (t)dt = dVi (t) − r (t)Vi (t)dt + Rij (t)µij (t)dt (43) j;j6=i where, Rij (t) = aij (t) + Vj (t) − Vi (t), (44) is the so-called ”Sum-at-Risk” associated with a possible transition from state i to state j. P The term j;j6=i Rij (t)µij (t)dt is called the ”risk premium” in (t, t + dt). The term dVi (t) − r (t)Vi (t)dt is called the ”savings premium” in (t, t + dt). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The equivalence principle (aka Fairness constraint) This principle means that V0 (0) = −A0 (0). (45) This condition imposes a constraint on the contractual payments aj , Aj and aij to design a premium level for given benefits. Noting that A0 (0− ) = 0, (45) is equivalent to Z V0 (0− ) := E n e− Rs t r (u)du dA(s) = 0. (46) 0− I The state-wise prospective reserve V (t) can be seen as the value function of a singular control problem where subject to the fairness constraint, where the control parameter is the process A(t). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview First and second order reserving bases The jump intensities µij (purely actuarial parameters or liability driving parameter) and the discounting rate r which constitute the ”expected return” of the investment portfolio (the main driver of the asset side) constitute the so-called reserving basis: I First order technical basis (Prudent or conservative): This is a set of assumptions about the portfolio return (or just an interest rate that reflects the market value of the cash flow), r , the transition rates µij (including mortality rates), costs and other relevant technical parameters etc.. These assumptions are meant to lead to premiums and reserves that include a high safety loading that hedges against worst case scenarios. The first order premiums and reserves are usually higher than experience based or historically observed values. This means that a systematic surplus is created by the company and, by law, it should be redistributed to the policyholder is terms of bonuses that are usually allocated but not distributed until the termination of the policy. Here we face a model risk. I Second order technical basis also called experience (or market) basis sets values of the parameters based on realistic scenarios collected based on the history of the policy. The company updates the reserves on a regular basis and adjusts for the parameters using the bonus fund created using the first order basis. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A typical example of adjustments to be made under the experience (market) basis is compensation for a possible non-equivalence of the first order payments i.e. V0 (0− ) 6= 0, is . The insurance company compensates for this by adding dividend payments D to the first order payments. D has usually the following form: X X Ij (t)dDj (t) + dD(t) := δjk (t)dNjk (t) , (47) j k;k6=j dDj (t) := δj (t)dt + (Dj (t) − Dj (t − )) = δj (t)dt + ∆Dj (t). (48) The coefficients δj , ∆Dj and δij are stochastic processes adapted to the ”demographic-economic” history F with a more complex structure than the coefficients related to the payment processes A. The dividend process D is chosen (constrained) to attain the ultimate equivalence (fairness): Z n Rs e − t r (u)du d(A + D)(s) | Fn = 0. E (49) 0− Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview In the Black and Scholes world, the dividend payments are provided by an asset portfolio such as the following diffusion Y modulated by the jump process X : dY (t) = rY (t)dt + σ(t, X (t), Y (t))Y (t)dW (t) + d(C − D)(t), Y (0− ) = 0, (50) where, C is the usual income (or contribution) process usually of the following form (similar to A and D): X X Ij (t)dCj (t) + dC (t) := cjk (t)dNjk (t) , (51) j k;k6=j dCj (t) := cj (t)dt + (Cj (t) − Cj (t − )) = cj (t)dt + ∆Cj (t). (52) Assuming the coefficients δj (t), ∆Dj (t) and δij (t) are functions of (t, Y (t)), the state-wise prospective reserve is Vi (t, x) := E [V = i, Y (t) = x] i hR(t)|X (t) Rs n = E t e − t r (u)du d(A + D)(s)|X (t) = i, Y (t) = x (53) satisfies a more complex ”Thiele’s” PDE (cf. Steffensen (2006) and Fahrenwaldt (2013)). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Graduation techniques- Estimation of the mortality rates We start with statistical inference of the mortality rate µ which is the only jump intensity in the simplest life insurance contract: Single life with one cause of death (one absorbing state) i.e. E = {0, 1}, where state 0 = alive, state 1= dead (absorbing state). The underlying Markov chain counts the number of deaths: X (t) = 11{T ≤t} , t ∈ [0, n], where, T denotes the life length of a person with survival probability p00 (s, t) = Rt F̄ (t) = e − s µ(u)du , F̄ (s) 0 ≤ s ≤ t ≤ n. In actuarial practice one often considers the remaining life length Tx of an insured of age x. The corresponding survival probability over a time period of length t ≥ 0 is P(Tx > t) := P(T > x + t|T > x) = e − Boualem Djehiche KTH Royal Institute of Technology, Stockholm R x+t x µ(u)du = e− Rt 0 µ(x+u)du . (54) Statistical estimation techniques in life and non-life insurance An overview In a more general framework where ’stochastic mortality’ modeling can be incorporated, consider (the possibly random) force (or rate) of mortality at t for individual aged x at time 0, µ(x, t). Then, the survival index is Z t S(x, t) := exp − µ(x + s, s)ds 0 the probability of survival of an individual aged x during the time interval [0, t], given the mortality force µ(x, s) i.e. P(Tx > t) = E [S(x, t)] In Eq.(54), µ(x, t) = µ(x + t). The main goal of this section is to estimate the mortality force µ(x, s), given historical mortality data of a population of insured individuals. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview I. An Age-specific model: Gompertz-Makeham graduation formula This model captures the evolution of mortality in mutually exclusive age cohorts but disregards a possible common risk factor that links all cohorts together. Consider an insured population of ages xi , i = 1, 2, . . . , n. Let Nx denote the exposure i.e. the number of individuals of the same age x and Dx denotes the number of individuals dead during the interval (x, x + 1). Assuming that the remaining survival lengths of all individuals are independent, and the insured population is homogeneous in the sense that the survival probability of all individuals is the same. A stochastic model based on a ”crude approximation” of the Binomial distribution by the Poisson distribution suggests that Dxi ∼ independent Poisson(µxi Nxi ). (55) Then mortality rate (or force) µxi for a population of age xi , i = 1, 2, . . . , n can be estimated by the so-called ”central or crude death rate” µ̂xi = Dxi , Nxi Boualem Djehiche KTH Royal Institute of Technology, Stockholm i = 1, 2, . . . , n. (56) Statistical estimation techniques in life and non-life insurance An overview Gompertz and later Makeham famous graduation formula suggests a mortality rate of the form µx := α + βe γx (57) where, the parameters α, β and γ which satisfy α + β > 0, β > 0 and γ ≥ 0 are estimated using the insured population data. When α = 0 we get Gompertz mortality law. A fairly standard way to perform the parameter estimation is to use a weighted least squares method: minimize Q= n X wxi (µ̂xi − α − βe γxi ) 2 (58) i=0 w.r.t. the parameters α, β and γ, where the weight is the inverse of the variance of µ̂xi : Nx2i Nx2i Nx wxi = = = i Var (Dxi ) Nxi µ̂xi µ̂xi (59) so that Q is approximately χ2 -distributed. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview In practice, one fixes a value for γ ’based on experience’ and finds the optimal values of α and β. In the Swedish life insurance business, there is a Central Mortality Committee that estimates these parameters to be used by insurance companies and pension funds. For example, in the so-called M90 investigation, the committee suggested that µx = α + βe γ(x−f ) where, the parameter f adjusts for mortality of females among the insured population. Values f = 4 or 5 years are used. For M90, α = 0, 001, β = 0, 000012 and γ = 0.044. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview II. Gompertz graduation formula with a view towards GLM Recall Gompertz’ graduation formula: µx := βe γx (60) or log µx , which is linear in age: log µx = log β + log e γx := a + γx. This can be extended to a quadratic or a polynomial form log µx = a + bx + cx 2 , log µx = a0 + a1 x + a2 x 2 + . . . + ap x p . GLM means that we perform a regression of log µx with respect to a basis {1, x}, {1, x, x 2 }, {1, x, . . . , x p }, or any other carefully chosen ’spline’ basis {B1 (x), B2 (x), . . . , Bp (x)} such that µx = p X Bj (x)aj j=1 Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview and estimate the coefficients a0 , a1 , . . . , ap which maximize the penalized log-likelihood function: 1 L(a) − λa0 B 0 Ba 2 where, L(a) is the log- likelihood of the model Dxi ∼ independent Poisson(µxi Nxi ), , i = 1, . . . , n, (61) (62) and λ > 0 is a smoothing parameter. A similar approach can be applied to obtain a smooth year (or period) specific mortality: maximize the penalized log-likelihood function L(θ) − 1 0 0 λθ P Pθ, 2 (63) where, L(θ) is the log- likelihood of the model Dti ∼ independent Poisson(µti Nti ), Boualem Djehiche KTH Royal Institute of Technology, Stockholm t = tmin , . . . , tmax . (64) Statistical estimation techniques in life and non-life insurance An overview The smoothing parameter λ can be estimated using the Akaike Information Criterion (AIC) (Akaike (1987)), the Bayesian Information Criterion (BIC) (Schwartz (1978)) or the Generalized Cross-Validation (GCV) (Craven and Wahba, 1979). Below, we will suggest another criterion (see the HP-filter section below). An age-period model: Lee-Carter graduation formula Lee and Carter (1992) suggest a Gompertz type graduation formula for the full mortality rate µ(x, t): log µ(x, t) := α(x) + β(x)κ(t), subject to the constraints X β(x) = 1, x X κ(t) = 0, (65) (66) t fitting X (log µobs (x, t) − α(x) + β(x)κ(t))2 . x,t Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview This model captures the evolution of mortality in mutually exclusive age cohorts while at the same time includes a possible common risk factor k(t) that links all cohorts together over time. The parameters a(x) and b(x) are age-specific while k(t) is time (period) dependent only and should capture the random period effect of the mortality rate. k(t) is usually modeled as a time series or a random walk with drift. Lee and Carter (1992) suggest an ARIMA (discretized diffusion process) for κ of the form k(t + 1) = k(t) + a1 + a2 ξ + σz(t) where, z(t) is white noise and ξ ∈ {0, 1} is a dummy variable that captures major outbreaks of disease leading to a huge mortality wave such as the 1918 worldwide flu outbreak or the 2008 earthquake in China etc.. Estimation is usually performed w.r.t. each dimension: x and time (period) t. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Here are some suggestions (see Lee and Carter (1992), Brouhns et al. (2002), Currie, Richards and co-authors (2003-2012), Samuelsson (2008) etc..). I Given κ(t) = κ̂(t), fit a GLM with regressor κ̂: log µ(x, t) := α(x) + β(x)κ̂(t) I Given α(x) = α̂(x), β(x) = β̂(x), fit a GLM with offset α̂(x) and regressor β̂(x): log µ(x, t) := α̂(x) + β̂(x)κ(t) I Perform a regression w.r.t. a 2-d spline basis Ba (x) ⊗ By (t) for age and time dimensions (x, t). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Building blocks of the MLE for the Lee-Carter model Following Brouhns et al. (2002), the MLE approach to the Lee-Carter model is based on the Poisson distribution assumption that Dx,t ∼ Poisson(µ(x, t)Nx,t ), with log µ(x, t) := α(x) + β(x)κ(t), (67) x = xmin , . . . , xmax , t = tmin , . . . , tmax . The parameters α(x), β(x) and κ(t) are estimated by maximizing the log-likelihood X L(α, β, κ) := (Dx,t (α(x) + β(x)κ(t)) − Nx,t exp (α(x) + β(x)κ(t))) + Cst. x,t The nonlinear term β(x)κ(t) does not allow for a closed form of the maximizing parameters. One instead uses an iterative method such as the Newton-Raphson updating scheme (or any more efficient numerical optimization algorithm): θ(n+1) = θ(n) − ∂L(n) /∂θ , ∂ 2 L(n) /∂θ2 which numerically solves ∂L(n) /∂θ = 0. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview (0) (0) (0) α̂x = 0, β̂x = 1, κ̂t = 0, (0) 1 tmax −tmin +1 alternatively α̂x = (0) κ̂t = P (0) x β̂x P (0) t log(µ̂(x, t)), β̂x = (0) log(µ̂(x, t)) − α̂x (n) (n) (n) (n) t (Dx,t −D̂x,t ) P (n) − t D̂x,t , 1 , tmax −tmin +1 , D̂x,t = Nx,t exp (α̂x + β̂ (n) κ̂t ), (n) P (n) P (n+1) = α̂x − (n+2) = κ̂t − α̂x κ̂t (n+3) β̂x (n+2) = β̂x (n+1) β̂x (n+1) (n+1) )β̂x t (Dx,t −D̂x,t P (n+1) (n+1) 2 D̂ (β̂x ) t x,t − − , (n+2) n+2 )κ̂ (t) t (Dx,t −D̂x,t P (n+2) − t D̂x,t (κ̂n+2 )2 t (n) = β̂x , (n+2) α̂x P Boualem Djehiche KTH Royal Institute of Technology, Stockholm , (n+1) = α̂x (n+3) α̂x (n+1) κ̂t (n+2) , β̂x (n+2) = α̂x (n) = κ̂t , (n+3) , κ̂t (n+1) = β̂x , (n+2) = κ̂t , Statistical estimation techniques in life and non-life insurance An overview The parameters are standardized in each step of the iteration to satisfy the constraints X X β(x) = 1, κ(t) = 0, (68) x t by letting α̂x(n+1) = α̂x(n) + Aβ̂x(n) , (n) κ̂(n+1) = (κ̂t − A)B, x β̂x(n+1) = β̂x(n) /B, (69) where, A= X (n) 1 κ̂ , tmax − tmin t t B= X β̂x(n) . (70) x The estimated values of κ(t), t = tmin , . . . , tmax are used to fit it to a dynamical model (see HP-filter below). We mentioned above that Lee and Carter fit κ(t) to an ARIMA model of the form k(t + 1) = k(t) + a1 + a2 ξ + σz(t) where, z(t) is white noise and ξ ∈ {0, 1} is a dummy variable that captures major mortality changes. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Mortality among Swedish insured (cf. Swedish Research Board for Actuarial Science (Samuelsson (2008)) Alfa, kvinnor Alfa, män −1 −1 −2 −2 −3 −3 −4 −4 −5 −5 −6 −6 −7 −7 −8 −8 −9 30 40 50 60 Ålder 70 80 90 −9 30 40 50 60 Ålder 70 80 90 Figure: The αx parameter for ages 30-90 years (females and males). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Beta, kvinnor 0.04 Beta, män 0.04 Orginal Utjämnad Orginal Utjämnad 0.035 0.035 0.03 0.03 0.025 0.025 0.02 0.02 0.015 0.015 0.01 0.01 0.005 0.005 0 30 40 50 60 Ålder 70 80 90 0 30 40 50 60 Ålder 70 80 90 Figure: Estimated and smoothed βx parameter for ages 30-90 years (females and males). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Kappa, kvinnor 15 10 10 5 5 0 0 −5 −5 −10 −10 −15 −15 −20 1985 1990 1995 Year Kappa, män 15 2000 2005 −20 1985 1990 1995 År 2000 2005 Figure: Estimated and linearized κ(t) parameter for data 1985-2005 (females and males). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview κt, kvinnor 15 Kappa, kvinnor 15 Orginal Utjämnad Orginal Utjämnad 10 10 5 5 0 0 −5 −5 −10 −10 −15 −15 −20 1985 1990 1995 År 2000 2005 −20 1985 1990 1995 År 2000 2005 Figure: Estimated and smoothed κ(t) parameter for data 1985-2005 (females and males). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Mortality jumps, due to e.g. new life standards or medical development etc., are also important to capture in a mortality model, despite the serious difficulties to perform reliable estimation. Cox et al. (2010) suggest two types of mortality jump events to the Lee-Carter model: log µ(x, t) := α(x) + β(x)κ(t) − G (x, t) + H(x, t), where I G (x, t) captures a permanent longevity jump and takes the form G (x, t) := K (x, t) + D(x, t), with K (x, t) := ∞ X yj Aj (x)11{t≥ηj } = Jump reduction component, j=1 and D(x, t) := ∞ X ζi (t−νi )Fi (x)e −ξi (t−νi ) 11{t≥νi } = Trend reduction component. j=1 I K (x, t) captures temporary adverse mortality jumps and takes the form H(x, t) := ∞ X bj Bj (x)e −κi (t−τi ) 11{t≥τj } . j=0 Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview An age-period-cohort model: Extending Lee-Carter graduation formula The Lee-Carter model captures the age-period effect, but does not reflect the possible cohort effect (calender year-age = t − x). A simple model that would simultaneously capture the age-period-cohort effect would be log µ(x, t) := α(x) + κ(t) + γ(t − x). Renshaw and Haberman (2009) suggested the following extension of the Lee-Carter model to capture the cohort effect (calender year-age = t − x): log µ(x, t) := β1 (x) + β2 (x)κ(t) + β3 (x)γ(t − x). (71) A generalization of this mortality model for data divided into N components reads N X log µ(x, t) := βj (x)κj (t)γj (t − x). j=1 In a series of papers the Edinburgh teams including Currie, Richards and co-authors (2003-2012) and Cairns and co-authors (2006-2012) suggest other extensions and perform deep statistical analysis that seem tune the age-period-cohort effect when applied to mortality data from England and Wales, and USA. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview An infinite dimensional approach to mortality modeling The mortality rate can be viewed as an (infinite dimensional) curve of (x, t). To capture the high level of uncertainty in projections of future mortality one is tempted to translate the ”machinery” developed for ”forward” interest rate yields such as ”the HJM-model under the Musiela parametrization etc..” to mortality rates. One is tempted to translated the calibration techniques of interest rate yield curves, to perform hopefully more accurate projections of future mortality (though with limited data points). See e.g. Biffis (2005), Biffis and Millossovitch (2006), Biffis and Denuit (2006), Biffis, Denuit and Devolder (2010), and Tappe and Weber (2013) for further details. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References- Incomplete list Aalen, Odd O.; Hoem, Jan M (1978): Random time changes for multivariate counting processes. Scand. Actuar. J. no. 2, 81101. Andersen, P.K:, Borgan, Ø., Gill, R.D., Keiding, N. (1993): Statistical models based on counting processes, Springer-Verlag, New York, Berlin, Heidelberg. Brouhns, N., Denuit, D., Vermunt, J. K. (2002): A Poisson log-bilinear regression approach to the construction of projected lifetables. Insurance: Mathematics and Economics, 31(3), pp. 373-393. Barberin, J. ()2008): Heath-Jarrow-Morton modelling of longevity bonds and the risk minimization of life insurance portfolios. Insur. Math. Econom. 43 (1), pp. 41-55. Bauer, D. (2008): Stochastic mortality modeling and securitization of mortality risk. IFA-Schriftenreihe, Ulm. Bauer, D., Benth, F.E, and Kiesel, R. (2010): Modeling the forward surface of mortality (Preprint, Univ. Ulm). Biffis, E. (2005): Affine processes for dynamic mortality and actuarial valuation. Insur. Math. Econom. 37(3), pp. 443-468. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References II Biffis, E. and Denuit, M. (2006): Lee-Carter goes risk-neutral: An application to the Italian Annuity Market. Giornale dell’Istituto Italiano degli Attuari, Vo. LXIX, pp.33-53. Biffis, E., Denuit, M. and Devolder, p. (2010): Stochastic mortality under measure changes. Scand. Actua. J. (4), pp. 284-311. Biffis, E.and Millosovitch, P. (2006): A bidimensional approach to mortality risk. Decision in Economics and Finance, 29(2), pp. 71-94. Cairns, A.J.G., Blake, D., Dowd, K., Coughlan, G., Epstein, D., Ong, A., Balevich, I., (2009): A quantitative comparison of stochastic mortality models using data from England and Wales and the United States. North American Actuarial Journal 13(1), pp. 1-35. Cox, S.H, Lin, Y. and Pedersen, H. (2010): Mortality risk modeling: Applications to insurance securitization. Insurance: Mathematics and Economics 46(1), pp. 242-253. Christiansen, M. C., Denuit, M. M. and Lazar, D. (2012): The Solvency II square-root formula for systematic biometric risk. Insurance: Mathematics and Economics, 50 (2), pp. 257-265. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References III Currie, I. D. (2013): Smoothing constrained generalized linear models with an application to the Lee-Carter model Statistical Modelling, 13, 69-93. http://doi:10.1177/1471082X12471373 Currie, I. D. (2012): Forecasting with the age-period-cohort model? (pdf) Proceedings of 27th International Workshop on Statistical Modelling, Prague, pp. 87-92. Currie, I. D. (2012): Forecasting with the age-period-cohort model? (pdf) Proceedings of 27th International Workshop on Statistical Modelling, Prague, pp. 87-92. Currie, I. D. (2011): Modelling and forecasting the mortality of the very old. (pdf) ASTIN Bulletin, 41, pp. 419-427. Currie, I. D., M Durban, M., and Eilers, P H C (2004): Smoothing and forecasting mortality rates Statistical Modelling, 4, 279-298. Dowd, K., Cairns, A.J.G., Blake, D., Coughlan, G.D., Epstein, D., Khalaf-Allah, M. (2010): Evaluating the goodness of fit of stochastic mortality models. Insurance: Mathematics and Economics 47, PP. 255-265. Fahrenwaldt, M. A. (2013): Sensitivity of life insurance reserves via Markov semigroups (preprint). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References IV Haberman, S. and Renshaw, A. (2009): On age-period-cohort parametric mortality rate projections. Insurance Math. Econom. 45, no. 2, 255270. Haberman, S. and Renshaw, A. (2012): Parametric mortality improvement rate modelling and projecting. Insurance Math. Econom. 50, no. 3, 309333. Hainaut, D. (2012): Multidimensional LeeCarter model with switching mortality processes. Insurance: Mathematics and Economics 50 (2), pp. 236-246 Hoem, Jan M. (1969): Markov chain models in life insurance. Blätter Deutsch. Gesellschaft Vers. math., Vol. 9 (No. 2), pp. 91-107. Hoem, Jan M. (1969):Purged and partial Markov chains. Scand. Actuar.J., 52, pp. 147-155. Hoem, Jan M. (1972): On the statistical theory of analytic graduation Sixth Berkeley symposium, Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability (Univ. California, Berkeley, Calif., 1970/1971), Vol. I: Theory of statistics, pp. 569600. Univ. California Press, Berkeley, Calif. Hoem, Jan M. and Aalen, O.O. (1978): Actuarial values of payment streams. Scand. Actuar. J. pp. 38-47. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References V Hoem, Jan M.(1984): A contribution to the statistical theory of linear graduation. Insurance Math. Econom. 3, no. 1, 117. Hyndman, Rob J. and Shahid Ullah, Md. (2007): Robust forecasting of mortality and fertility rates: A functional data approach. Comput. Stat. Data Analys. 51, pp. 4942-4956. Karr, Alan F. (1986): Point processes and their statistical inference. Marcel Dekker, Inc. New York and Basel. Lee, R.D. and Carter, L. (1992): Modelling and forecasting the time series of US mortality. Journal of the American Statistical Association 87, pp. 659-671. Norberg, R. (1991): Reserves in life and pension insurance. Scand. Actuar. J. 1991, no. 1, 324. Norberg, R. (1993): Identities for present values of life insurance. Scand. Actuar. J. pp. 100-106. Norberg, R. (1999): A theory of bonus prognoses in life insurance. Finance Stoch. 3, no. 4, 373390. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References VI Norberg, R. (2001): On bonus and bonus prognoses in life insurance. Scand. Actuar. J. 2001, no. 2, 126147. Norberg, R. (2002): Basic life insurance mathematics. Lecture Notes, University of Copenhagen. Norberg, R. (2010): Forward mortality and other vital rates are they the way forward? Insurance Math. Econom. 47, no. 2, 105112. Richards, S. J. Kirkby, J. G. and Currie, I. D. (2006): The importance of year-of-birth in two-dimensional mortality data. British Actuarial Journal, 12 (I), pp. 5-61. Richards, S. J. and Currie, I. D. (2009): Longevity risk and annuity pricing with the Lee-Carter model. British Actuarial Journal, 15, pp. 317-365. Richards, S. J. (2008): Detecting year-of-birth mortality patterns with limited data. J. of the Royal Statistical Society, Series A, 171(1), pp. 279-298. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview References VII Richards, S. J. (2010) Correspondence: Detecting year-of-birth mortality patterns with limited data. J. of the Royal Statistical Society, Series A, 173(4), pp. 920-924. Richards, S. J. (2012): A handbook of parametric survival models for actuarial use. Scand. Actua. J. 4, pp. 233-257. Samuelsson, E. (2008): Mortality among Swedish insured. Scan. Actua. J. Volume 2008, Issue 2-3, pp. 184-199. Steffensen, M. (2006): Surplus-liked life insurance. Scandinavian Actuarial Journal 1, pp. 1-22. Tappe, S. and Weber, S. (2013): Stochastic mortality models: An infinite-dimensional approach (preprint). Wolthuis, H; Hoem, Jan M (1990): The retrospective premium reserve. Insurance Math. Econom. 9 (1990), no. 2-3, 229234. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A systematic graduation technique applicable to mortality and other series Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Trend of a time series as an inverse problem By graduation we mean extraction of a smooth trend from a time series of data. A smooth trend ŷ = (ŷ1 , . . . , ŷT ) of a given a time series x = (x1 , . . . , xT ) is a minimizer of the following problem (Spoerl (1943), Leser (1961), Hodrick and Prescott (1980, 1997)) T −2 T X X ŷ (α, x) = arg min (xt − yt )2 + α (yt+2 − 2yt+1 + yt )2 y t=1 ! t=1 for a given α > 0, called the smoothing parameter. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview where, ŷ (α, x) = arg min kx − y k2 + αkPy k2 y Py (t) := (yt+2 − yt+1 ) − (yt+1 − yt ), P is the ((T − 2) × T -matrix) 1 0 P := 0 t = 1, 2, . . . , T − 2, shift operator −2 1 ··· ··· 1 −2 ··· ··· ··· 1 ··· 1 ··· ··· ··· −2 0 0 . 1 The unique solution ŷ (α, x) to the optimal problem is ŷ (α, x) = (IT + αP 0 P)−1 x, Boualem Djehiche KTH Royal Institute of Technology, Stockholm (72) Statistical estimation techniques in life and non-life insurance An overview A Gaussian random walk model of the trend To determine the value of the smoothing parameter α, Hodrick and Prescott suggest the time series x follows a linear mixed model: x = y + u, Py = v , where (the prior), u v ∼ N (0, Σuv ), with covariance matrix Σuv := σu2 IT 0 Boualem Djehiche KTH Royal Institute of Technology, Stockholm 0 σv2 IT −2 , Statistical estimation techniques in life and non-life insurance An overview Since P is of rank T − 2, the signal v := Py does not determine a unique y but rather the set of solutions y := {P 0 (PP 0 )−1 v + Z γ; γ ∈ R 2 } where the T × 2-matrix Z satisfies −1 PZ = 0, P 0 PP 0 P + ZZ 0 = IT , Z 0 Z = I2 . (Orthogonal Parametrization) Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview (x, y ) is normally distributed: x Z ∼N γ, Σxy , y Z with covariance matrix Σxy := σu2 IT + σv2 Q σv2 Q σv2 Q σv2 Q , where, Q := P 0 (PP 0 )−1 (PP 0 )−1 P, Boualem Djehiche KTH Royal Institute of Technology, Stockholm QZ = 0, Z 0 Q = 0, Statistical estimation techniques in life and non-life insurance An overview and Q[ σu2 σ2 I + Q]−1 = [ u2 IT + Q]−1 Q, 2 T σv σv which yields that Q[ σu2 IT + Q]−1 Z = 0. σv2 The maximum likelihood estimator of γ is argminγ (x − Z γ)0 [σu2 IT + σv2 Q]−1 (x − Z γ) = Z 0 x. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Noise-to-Signal Ratio Theorem 1 (Hodrick and Prescott (1997)) α = σu2 /σv2 . Hodrick and Prescott derived this equality using ŷ (α, x) ≈ E [y |x], from signal-extraction techniques which is widely used in Signal Processing. First to prove Theorem 1 is (Schlicht (2005): Theorem 2 (Schlicht (2005)) α = σu2 /σv2 and γ = Z 0 x (ML-estimator of γ) if and only if ŷ (α, x) = E [y |x]. 0 (Recall ŷ (α, x) := (IT + αP P)−1 x) Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Proposition (Dermoune-D-Rahmania(2009)) α∗ = σu2 /σv2 = arg min kE [ y | x] − y (α, x)k2 . α>0 Moreover, the error (optimal gap) E [ y | x] − y (α∗ , x) = Z γ − Z 0 x is a centered Gaussian vector with covariance matrix cov Z Z 0 x − γ = σu2 ZZ 0 . In particular, E [kE [ y | x] − y (α∗ , x)k2 ] = E [kZ (γ − Z 0 x) k2 ] = σu2 trace(ZZ 0 ) = 2σu2 . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview For US real GNP time series, the smoothing parameter α is determined in two ways: I Hodrick and Prescott (1980, 1997) finds α∗ = 1600 using a prior view based on macroeconomics arguments: σu = 5% and σv = 1/8% in the rate of the growth in a quarter. I Schlicht (2005) finds α∗ = 2.40, using a complicated ML-estimator which is a fixed point of a highly nonlinear matrix equation. The consistency of the ML-estimator is still open. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A consistent estimator of the noise-to-signal ratio α We use observations from the centered Gaussian and stationary (thanks to P) time series Px: Px = v + Pu ∼ N (0, σv2 IT −2 + σu2 PP 0 ). The elements of the covariance matrix are σv2 δij + σu2 (PP 0 )ij = r|i−j| , where, 2 σv + 6σu2 , −4σu2 , rk = σ2 , u 0, Boualem Djehiche KTH Royal Institute of Technology, Stockholm if k = 0; if k = 1; if k = 2; otherwise. Statistical estimation techniques in life and non-life insurance An overview A classical fact that (see e.g. Chapter 7 in Brockwell and Davis (1991)) r̂k = TX −2−k 1 Px(j)Px(j + k), (T − 2) − k j=1 k = 0, 1, 2, is an unbiased estimator of rk = E [Px(s)Px(s + k)] in the sense that E [r̂k ] = rk . Theorem The following statistics 1 α b=− 4 !−1 P −2 (T − 3) Tj=1 Px(j)2 3 + P −3 2 (T − 2) Tj=1 Px(j)Px(j + 1) (73) based on the time series of observation Px, is a consistent estimator of the smoothing parameter α. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Performance of different estimates of the statistics α̂ • α = 0.5 based on 5000 simulations. T-2 1000 2500 5000 mean(α̂) 0.512 0.503 0.501 standard deviation(α̂) 0.09 0.06 0.04 • α = 10 based on 5000 simulations. T-2 1000 7000 14000 percentage of (α̂ < 0) 0.255 0.044 0.010 standard deviation(α̂) 548 226 65 Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Estimating α for US real GNP time series under the period 01-01-1947–01-01-2006 For US real GNP time series, the smoothing parameter α is estimated as follows. • Using Eq. (1), α∗ = 0.41. • Hodrick and Prescott (1980, 1997) find α∗ = 1600. • Schlicht (2005) finds α∗ = 2.40. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview US Real GNP Estimated trend Trend using Schlicht estimator Hodrick−Prescott trend 4200 4000 3800 3600 3400 3200 3000 2800 2600 65 70 75 Boualem Djehiche KTH Royal Institute of Technology, Stockholm 80 85 90 95 100 Statistical estimation techniques in life and non-life insurance An overview 1 0.5 0 −0.5 Autocorrelation function of the noise component using alpha=1600. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview 1 0.8 0.6 0.4 0.2 0 −0.2 −0.4 Autocorrelation function of the noise component using alpha=2.4028. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview 1 0.5 0 −0.5 Autocorrelation function of the noise component using alpha=0.4137. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Calibration Estimated values of the smoothing parameter α. The values of α̂ are based on the estimator given by Eq. (1) and α∗ are obtained using the Schlicht’s estimator. x US real GNP GBP Euro CHF JPY α̂ 0.4137 0.3734 0.3675 0.5896 0.4489 α∗ 2.4028 1.2998 0.67173 1.4592 1.8383 x S&P 500 Nasdaq Dow Jones FTSI Nikkei α̂ 0.4963 0.5152 0.6417 0.4125 0.4721 α∗ 8.1615 3.2756 1.1907 3.0841 2.4028 Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The Hodrick-Prescott Multivariate filter The Hordrick-Prescott Multivariate filter (HPMV) seeks to estimate the unobserved variable y as a solution to the following minimization problem arg min(||x − y ||2 + α1 ||Py ||2 + α2 ||z − βy ||2 ), y given the following dynamics: x = y + u, z := x ∗ − dX = βy + ξ, Py = v , (74) (75) where, xt∗ is another explanatory variable (the Phillips curve) that can be explained by the unobserved variable y , and X is an exogenous variable affected by the parameter d. with u ξ ∼ N (0, Σ), (76) v Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview σu2 IT Σ := 0 0 0 σξ2 IT 0 0 . 0 σv2 IT −2 the time series (x, z) can be represented in terms of (u, v , ξ) as x = u + P 0 (PP 0 )−1 v + Z γ, z = ξ + βP 0 (PP 0 )−1 v + βZ γ, (77) for some γ ∈ R2 . In view of (77), Σxyz x Z z ∼ N βZ γ, Σxzy , y Z 2 σu IT + σv2 Q βσv2 Q σv2 Q 2 2 2 2 βσ Q σ I + βσ Q βσ v v vQ ξ T = σv2 Q βσv2 Q σv2 Q (78) , where, Q := P 0 (PP 0 )−1 (PP 0 )−1 P Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The unique solution ŷ to Problem (74) is ŷ = a(IT + bP 0 P)−1 w , where, a := 1/(1 + α2 β 2 ), b := α1 /(1 + α2 β 2 ) and w := x + α2 βz. Proposition The best smoothing parameters of the HPMV filter according to Schilcht Criterion are those positive parameters α1∗ and α2∗ for which α1∗ /α1e = 1 + α2∗ β 2 (α2∗ /α2e − α1∗ /α1e ) , where, α1e = σu2 /σv2 and α2e = (79) σu2 /σξ2 . In particular, (1) α1∗ = α1e if and only if α2∗ = α2e . (2) If α1∗ /α1e < 1, then necessarily α2∗ /α2e < α1∗ /α1e . (3) If α1∗ /α1e > 1, then necessarily α2∗ /α2e > α1∗ /α1e . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The following statistics α b1e 1 =− 4 !−1 P −2 (T − 3) Tj=1 Px(j)2 3 + , P −3 2 (T − 2) Tj=1 Px(j)Px(j + 1) PT −3 j=1 Px(j)Px(j + 1) j=1 Pz(j)Pz(j + 1) α b2e = PT −3 , and βb2 = 2(T − 3) PT −2 2(T − 3) PT −2 j=1 j=1 Pz(j)2 + 3(T − 2) PT −3 Pz(j)Pz(j + 1) + 3(T − 2) PT −3 Px(j)Px(j + 1) Px(j)2 j=1 j=1 , based on the time series of observations Px and Pz, are consistent estimators of the smoothing parameters α1e , α2e and the parameter β. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Calibration T-2 500 1000 5000 (mean(α̂e1 ), std(α̂e1 )) (1.13, 0.61) (1.05, 0.33) (1.00,0.11) (mean(α̂e2 ), std(α̂e2 )) (1.01, 0.16) (1.00, 0.11) (1.00, 0.05) (mean(β̂), std(β̂)) (0.47, 0.29) (0.47, 0.22) (0.49, 0.08) Table: Performance of different estimates of the statistics on αe1 = 1, αe2 = 1 and β = 0.5 based on 1000 simulations. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview T-2 500 1000 5000 (mean(α̂e1 ), std(α̂e1 )) (1.13, 0.61) (1.05, 0.33) (1.00,0.11) (mean(α̂e2 ), std(α̂e2 )) (0.50, 0.09) (0.50, 0.06) (0.50, 0.02) (mean(β̂), std(β̂)) (2.06, 0.43) (2.01, 0.23) (2.00, 0.10) Table: Performance of different estimates of the statistics on αe1 = 1, αe2 = 0.5 and β = 2 based on 1000 simulations. T-2 500 1000 5000 (mean(α̂e1 ), std(α̂e1 )) (1.13, 0.61) (1.05, 0.33) (1.00,0.11) (mean(α̂e2 ), std(α̂e2 )) (16.30, 2.73) (16.14, 1.84) (15.96, 0.84) (mean(β̂), std(β̂)) (0.19, 0.05) (0.20, 0.03) (0.19, 0.01) Table: Performance of different estimates of the statistics on αe1 = 1, αe2 = 16 and β = 0.2 based on 1000 simulations. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Multivariate HP filter Example Consider the following two-dimensional HP-filter: x i = y i + u i , Py i = v i , i = 1, 2, where, the noises u 1 and u 2 , respectively v 1 and v 2 , may be correlated. P 0 A := , 0 P we get the following two-dimensional HP-filter: x = y + u, Ay = v . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview u v with covariance matrix ∼ N (0, Σuv ). Σuv := Σu 0 0 Σv . (x, y ) is normally distributed: x Z ∼N γ, Σxy , y Z with covariance matrix Σxy := Σu + Qv Qv Qv Qv , where, Qv := A0 (AA0 )−1 Σv (AA0 )−1 A E [ y | x] = Z γ + Qv [Σu + Qv ]−1 (x − Z γ). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The maximum likelihood estimator of γ is γ̂ = argminγ (x − Z γ)0 [Σu + Qv ]−1 (x − Z γ) = Z 0 Σu ZZ 0 [Σu + Qv ]−1 x. We suggest the following multivariate HP-filter. Definition A multivariate HP-trend minimizes the following weighted sum ŷ (Σ, Ω)) = argminy (x − y )0 Σ−1 (x − y ) + y 0 A0 Ω−1 Ay , where (Σ, Ω) is a pair of positive definite matrices with appropriate dimensions that acts as a smoothing parameter. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Multivariate noise-to-signal ’ratio’ We suggest to find the best smoothing parameter (Σ∗ , Ω∗ ) for which (Σ∗ , Ω∗ ) = argmin{(Σ,Ω)∈A} kE [ y | x] − y ((Σ, Ω), x) k2 . (80) in the following class A of admissible matrices. A = {(Σ, Ω) posi. def. s.t. ŷ − Z γ̂ is orthogonal to Z γ̂} = {(Σ, Ω) posi. def. s.t. ΣΠ = ΠΣ}. where, Π = A0 (AA0 )−1 A. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Proposition The pairs of invertible matrices (Σ∗ , Ω∗ ) ∈ A are optimal i.e. (Σ∗ , Ω∗ ) = argmin{(Σ,Ω)∈A} kE [ y | x] − y ((Σ, Ω), x) k2 . if and only if they are solutions of the following equation. −1 Σ∗ A0 Ω∗−1 Σv (AA0 )−1 A = Σu − Z Z 0 [Σu + Qv ]−1 Z Z 0. (81) (82) In particular, given Σ∗ , Ω∗−1 = 0 −1 (AA ) ∗−1 AΣ 0 Σu − Z Z [Σu + Qv ] Boualem Djehiche KTH Royal Institute of Technology, Stockholm −1 Z −1 Z 0 A0 Σ−1 v . Statistical estimation techniques in life and non-life insurance An overview Moreover, the error (optimal gap) E [ y | x]−y ((Σ∗ , Ω∗ ), x) = −1 Z γ − Z 0 [Σu + Qv ]−1 Z Z 0 [Σu + Qv ]−1 x is a centered Gaussian vector with covariance matrix = cov Z Σu ZZ 0 [Σu + Qv ]−1 x − γ −1 Z Z 0 [Σu + Qv ]−1 Z Z 0. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Solvability of Eq. (82) Lemma Eq. (82) is solvable if and only if ΠΣu (IdT − Π) = 0, (if and only if ΠΣu = Σu Π.) (83) where, Π = A0 (AA0 )−1 A In this case, (82) reduces to ΣA0 Ω−1 Σv (AA0 )−1 A = Σu Π. Boualem Djehiche KTH Royal Institute of Technology, Stockholm (84) Statistical estimation techniques in life and non-life insurance An overview Theorem Assume ΠΣu = Σu Π. Then (Σ, Ω) ∈ A is optimal if and only if (Σ, Ω) satisfies 0 0 −1 Σ−1 u ΣA = A Σv Ω. In particular, y ((Σ, Ω) , x) = y ((Σu , Σv ) , x) , for all x ∈ R dT . Moreover, the covariance matrix of the error (optimal gap) is cov (E [ y | x] − y ((Σ, Ω), x)) = ZZ 0 Σu ZZ 0 . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A final remark (a) Using, ΠA0 = A0 also yields that −1 ŷ = IdT + ΠΣu ΠA0 Σ−1 x, v A We only need the variance of Πu which is ΠΣu Π and Σv , thus similar to α in the standard HP-filter. (b) (Σu , Σv ) is optimal i.e. it satisfies (82). In particular, if Ω = Σv = σv2 Id(T −k) , we have ŷ = IdT Σu + 2 A0 A σv −1 x, suggesting an optimal smoothing parameter α = Σu /σv2 which is a matrix. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview (c) The pair of matrices (Σ∗ , Σv ) is optimal, for any invertible matrix Σ∗ that satisfies Σ∗ Π = ΣΠ = ΠΣ, is equivalent to choosing Σ∗ of the form var (Πu) Σ∗ = 0 0 L . (85) written in the basis q √ A0 g1 / λ1 , . . . , A0 gd(T −k) / λd(T −k) , Z1 , . . . , Zkd , where λi are the eigenvalues of AA0 and g1 , . . . , gd(T −k) an orthonormal set of eigenvectors corresponding to λi , i = 1, . . . , λd(T −k) . The matrix L is any arbitrary dk × dk-matrix. When L = var ((I − Π)u), Σ∗ = Σu . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Optimal Spline Smoothing of the Hodrick-Prescott Trend The HP-filter is of limited use for any analysis of the behavior of x in any of the time intervals (tj−1 , tj ) unless an appropriate smooth curve fit, such as polynomial splines, between the tj ’s is used. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Spline smoothing of the HP-trend We suggest a smooth curve fit which is consistent with the HP-filter: The chosen smooth curve y (t) is such that y (tj ) = ȳj , j = 1, . . . , n. That is, the chosen smooth curve y (t) should minimize the following cost functional Z tn 2 J(y ) = kx − ȳ k2 + λ y (2) (t) dt + ȳ T P T WP ȳ , t1 for some smoothing parameters (λ, W ), with λ > 0 and W positive definite matrix. Moreover, the smoothing parameters should be optimal: (λ∗ , W ∗ ) = arg min{λ>0,W positive definite} kE[ȳ | x] − ȳ (λ, W , x)k2 . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A smoothing cubic B-spline trend We suggest to look for a smoothing cubic spline curve y (t) which minimizes J of the form x = ȳ + u, P ȳ = v , y (t) = b(t)T τ, ȳ = B T τ, where, • b(t) is an (n × m) cubic smoothing B-spline, • B = (b(t1 ), . . . , b(tn )) ∈ Rn×m , • τ = (τ1 , . . . , τm )T ∈ Rm is the new control parameter, • ȳi := y (ti ), i = 1, . . . , n. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview With y (t) = b(t)T τ, Z tn (y (2) (t))2 dt = τ T G τ, t1 where, G ∈ Rm×m is the Gramian associated to b: Z tn 2 d b(t) d 2 b T (t) dt. G = dt 2 dt 2 t1 The cost functional J(y ) becomes J(τ ) = kx − ȳ k2 + λ tn Z 2 y (2) (t) dt + ȳ T P T WP ȳ t1 = x T x − 2τ T B T x + τ T Lτ, L := λG + B T (In + P T WP)B is positive definite (see Kono et al. (2005)). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The unique minimizer of J is τ̂ = L−1 B T x, which, in terms of (λ, W ) and x, reads τ̂ (λ, W , x) = (λG + B T (In + P T WP)B)−1 B T x. Hence, our cubic B-spline optimal trend is ŷ (t) = b(t)T (λG + B T (In + P T WP)B)−1 B T x, Boualem Djehiche KTH Royal Institute of Technology, Stockholm t ∈ [t1 , tn ]. Statistical estimation techniques in life and non-life insurance An overview Using E[ ȳ | x] = ȳ ( σu2 , x) = σv2 In + σu2 T P P σv2 −1 x. (λ∗ , W ∗ ) = arg min{λ>0,W positive definite} kE[ȳ | x] − ȳ (λ, W , x)k2 . if and only if (λ∗ , W ∗ ) satisfy the following balance equation ∗ T T ∗ −1 B(λ G + B (In + P W P)B) T B x= σ2 In + u2 P T P σv !−1 x. or W∗ = σu2 In−2 − λ∗ (PP T )−1 P(BB T )−1 BGB T (BB T )−1 P T (PP T )−1 . σv2 Theorem (D., Marcus and Rahmania (2011)) • The balance equation has always has a feasible solution. • The cubic B-spline optimal trend is ŷ (t) = b(t)T B T (BB T )−1 (In + Boualem Djehiche KTH Royal Institute of Technology, Stockholm σu2 T −1 P P) x. σv2 Statistical estimation techniques in life and non-life insurance An overview A numerical example We use three different estimators of the noise-to-signal ratio σu2 /σv2 : 2 2 • αDDR = σ\ u /σv proposed in Dermoune et al. (2008), 2 2 σ\ u /σv n 1 = max 0, − 4 !−1 P 2 o (n − 3) n−2 3 j=1 ξj , + Pn−3 2 (n − 2) j=1 ξj ξj+1 (86) where, ξ = Px. • The Generalized Cross Validation (GCV) estimator as solved efficiently in Wienert (2007) and Weinert (2009). • The one proposed by Hodrick and Prescott (1997), αHP = 1600, for quarterly time series. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview αDDR αGCV αHP US GNP 0.321 0.240 1600 GBP/USD 0.446 0.198 1600 S&P500 0.267 3.65e-3 1600 Table: estimated values of the noise-to-signal ratio σu2 /σv2 . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Figure: US quarterly GNP from 1947 Q1 to 2010 Q2. Index number 2005=100. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Figure: Daily GBP/USD close prices from 03/31/03 to 10/01/10. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Figure: Monthly close prices of S&P500 from Dec 1987 to Aug 2010. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Main References I Dermoune, A., Djehiche, B. and Rahmania, N. (2008): Consistent estimators of the smoothing parameter in the Hodrick-Prescott Filter. J. Japan Statist. Soc., Vol. 38 (No. 2), pp. 225-241. Dermoune, A., Djehiche, B. and Rahmania, N. (2009): Multivariate extension of the Hodrick-Prescott Filter- Optimality and characterization. Studies in Nonlinear Dynamics and Econometrtics., (13), pp. 1-33. Djehiche, B., Marcus, M., and Rahmania, N. (2011): On a Graduation Problem involving both the Hodrick-Prescott Filter and Optimal Spline Smoothing. Far East J. Theoretical Statistics, 36(1), 1-19. Djehiche, B. and H. Nassar, H. (2012): A Functional Hodrick-Prescott filter (preprint). Djehiche, B. and Rahmania, N. (2013): Modeling and estimating correlated growth and business cycles in a multivariate Hodrick-Prescott filter. Far East J. Theoretical Statistics, (42)1, 41-70. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Main References II Hoem, Jan M. (1972): On the statistical theory of analytic graduation Sixth Berkeley symposium, Proceedings of the Sixth Berkeley Symposium on Mathematical Statistics and Probability (Univ. California, Berkeley, Calif., 1970/1971), Vol. I: Theory of statistics, pp. 569-600. Univ. California Press, Berkeley, Calif. Hoem, Jan M.(1984): A contribution to the statistical theory of linear graduation. Insurance Math. Econom. 3, no. 1, 1-17. Hodrick, R. and Prescott, E. C. (1997): Postwar U.S. business cycles: An empirical investigation. Journal of Money, Credit and Banking. 29(1), 1-16. Kano, H., Nakata, H. and Martin, C. (2005): Optimal curve fitting and smoothing using normalized uniform B-splines: A tool for studying complex systems. Appl. Math. and Comput. 169(1), 96-128. Schlicht, E. (2005): Estimating the smoothing parameter in the so-called Hodrick-Prescott filter. J. Japan Statist. Soc., Vol. 35 No. 1, 99-119. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Stochastic claims reserving in disability insurance Outline I Disability inception model I Recovery model I Stochastic claims reserving Based on I H. Aro, B. Djehiche and B. Löfdahl (2013): Stochastic modelling of disability insurance in a multi-period framework (to appear in Scandinavian Actuarial Journal). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception A plot of age vs. year vs. inception frequencies for females is displayed below. 60 60 50 2000 40 2005 30 2010 50 2000 40 2005 30 2010 Inception seems to be strongly time- and age-dependent. How can we model this behaviour? Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception I Let Ex,t denote the number of healthy individuals with age in [x, x + 1) at the beginning of time period t I Let Dx,t denote the number of individuals among Ex,t with disability inception in the interval [t, t + 1) I Goal: model inception over time, t = 0, 1, 2, . . . I Assume Dx,t is binomially distributed given Ex,t : Dx,t ∼ Bin(Ex,t , px,t ) where px,t is the inception probability of an x-year-old. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception I In order to reduce the dimensionality of the problem and achieve some level of smoothness, we employ logistic regression: px,t := log n px,t X i i = νt φ (x), 1 − px,t i=1 where φi (x) are age-dependent basis functions and νti time-varying stochastic risk factors. I Changing notation, pνt (x) = px,t , we invert the expression above, obtaining Pn i i exp i=1 νt φ (x) P pνt (x) = 1 + exp( ni=1 νti φi (x)) I This guarantees that the probabilities pνt (x) ∈ (0, 1). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception Given historical values of Dx,t and Ex,t , and a set of basis functions {φi }, the log-likelihood function for yearly values of νt ∈ Rn can be written l(νt ) = Xh x∈X Dx,t n X νti φi (x) − Ex,t log 1 + exp i=1 n X i νti φi (x) + ct . i=1 I If the basis functions are linearly independent, −l(νt ) is strictly convex. I Minimizing using methods from numerical optimization yields unique estimates of νt . I Question: How do we determine the basis functions? Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception I I The basis functions can be chosen by the user, according to some criteria Desired properties of pνt (·), e.g. continuity or smoothness w.r.t. x, is achieved by choosing continuous or smooth φi (·) I I I Population characteristics can be taken into account Suitable choices of basis functions gives the risk factors concrete interpretations Alternatively, an optimal basis can be extracted from the data using functional principal component analysis. I Yields better model fit, but harder to interpret results. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception I Consider the model pνt (x) = νt1 φ1 (x) + νt2 φ2 (x), where the basis functions are linear on x ∈ [25, 64]: 1 φ (x) = I 64 − x 39 , 2 φ (x) = x − 25 39 A linear combination of φ1 and φ2 is then also linear. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception 1 1 1 0.9 0.9 0.9 0.8 0.8 0.8 0.7 0.7 0.7 0.6 0.6 0.6 0.5 0.5 0.5 0.4 0.4 0.4 0.3 0.3 0.3 0.2 0.2 0.2 0.1 0.1 0 25 30 35 40 45 50 55 60 0 25 0.1 30 35 40 45 50 55 60 0 25 30 35 40 45 50 55 60 Figure: Left: two basis functions. Centre: basis functions scaled with risk factor values 0.4 and 0.6. Right: the resulting linear combination. Note: φ1 (25) = φ2 (64) = 1, and φ1 (64) = φ2 (25) = 0. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception I Under this model, the logistic inception probability of a 25-year old is given by pνt (25) = νt1 φ1 (25) + νt2 φ2 (25) = νt1 . I Similarly, for a 64-year old we have pνt (64) = νt2 . I An x-year old can be seen as a convex combination of a 25-year old and a 64-year old. I Inception for the population described by only νt1 and νt2 . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Disability inception 60 60 50 2000 40 2005 30 2010 50 2000 40 2005 30 2010 Figure: Left: Inception frequencies per 5-year age groups, females. Right: Model surface. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Recovery from disability I Recovery from disability is slightly more complicated I I I The probability of recovering from illness depends on the amount of time spent in the ’ill’ state. This is known as a semi-Markov property. We extend the disability inception model above to the semi-Markov case, and apply it to recovery modelling. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Recovery model I Let Ex,d,t denote the number of individuals with disability inception age in [x, x + 1) and disability duration d at some point in the time period [t, t + 1). I Let Rx,d,t denote the number of individuals among Ex,d,t that recover during [d, d + ∆d) and [t, t + 1). I Goal: model recovery over time, t = 0, 1, 2, . . . I Assume Rx,d,t is binomially distributed given Ex,d,t : Rx,d,t ∼ Bin(Ex,d,t , px,d,t ) where px,d,t is the probability that an individual, with disability inception age in [x, x + 1) and disability duration d at some point in [t, t + 1), recovers before duration d, d + ∆d. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Recovery model I We propose the following logistic regression model: pνt (x, d) = n X φi (x) i=1 k X νti,j ψ j (d), j=1 where φi and ψ j , are age and duration dependent basis functions, respectively, and νti,j are stochastic risk factors. I Compare with inception model: pνt (x) = n X νti φi (x). i=1 I Basically the same model, extended one dimension. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Recovery model I I Likelihood also has the same structure as before. Strict convexity if I I I {φi } lin. indep. {ψ i } lin. indep. Again, we estimate νt using numerical optimization. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Recovery model I Consider the model pνt (x, d) = φ1 (x) 3 X νt1,j ψ j (d) + φ2 (x) j=1 3 X νt2,j ψ j (d) j=1 where φ and ψ are given by: 64 − x , 39 x − 25 , 39 1 = 2 = 1 = 1, 2 = 3 = d, √ d. φ (x) φ (x) ψ (d) ψ (d) ψ (d) Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Recovery model I The recovery probabilities for a 25-year old are given by pνt (25, ·) = φ1 (25) 3 X νt1,j ψ j (·) + φ2 (25) j=1 = 3 X 3 X νt2,j ψ j (·) j=1 νt1,j ψ j (·), j=1 determined by νt1,1 , νt1,2 , νt1,3 . I Similarly, the recovery probabilities for a 64-year old determined by νt2,1 , νt2,2 , νt2,3 . I An x-year old can be seen as a convex combination of a 25-year old and a 64-year old. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Recovery model I Let an x-year old’s illness duration be the r.v. D. I The probability that the illness lasts longer than d years is given by d/∆d−1 λ(x, d) = Pνt (D > d) = Y (1 − pνt (x, n∆d)). n=0 I These are analogous to survival curves. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Recovery model 2006 0 0 60 5 60 5 55 55 50 50 45 10 40 45 10 40 35 35 30 15 25 30 15 25 Figure: Left: Conditional recovery probabilities. Right: Recovery surface, females, calendar year 2006. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Claims reserving Let Xt denote the state of an individual at time t, and let i denote the ’ill’ state. I Classical approach: estimate νt and use the measure Pνt for computing premiums and reserves. I The initial reserve is given by the expected value of the discounted monthly payments, i hX V = E I {Xtk = i}e −rtk X0 = i, ν = ν̂ k = X P(Xtk = i|X0 = i, ν = ν̂)e −rtk . k I Assuming independence, all systematic risk vanishes. I Let portfolio grow to infinity and be happy (LLN). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Stochastic claims reserving I New approach: model future probabilities of inception and recovery through the modelling of the risk factors νt . I Can be done by assuming a stochastic process form of ν. I Introduces systematic risk to the model. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Stochastic claims reserving I Let V denote the present value of all future cashflows from an infinitely large portfolio I Conditional on ν, V is the random variable hX i V = E I {Xtk = i}e −rtk X0 = i, ν k = X P(Xtk = i|X0 = i, ν)e −rtk . k I The distribution of V determined by the distribution of ν. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Stochastic claims reserving I A simple candidate for ν is the multidimensional random walk with drift. I Easy to simulate V by simulating ν. I Numerical example: I I I ν 6-dimensional Brownian Motion with drift. Drift and covariance matrix estimated from fitted values of ν for the years 2000-2011 10,000 trajectories for ν simulated over T time steps. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Stochastic claims reserving A histogram of the reserve is displayed below. The fitted curve is a Generalized Extreme Value Distribution (Frechet case). V fit 1 1.2 1 Density 0.8 0.6 0.4 0.2 0 2 4 6 8 10 12 14 Data Figure: Reserve distribution and fitted GEV. E [V ] = 1.6V0 , EC0.005 (V ) = 10V0 , EC0.01 (V ) = 9V0 , V0 = BE. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Main References Aalen, Odd O.; Hoem, Jan M (1978): Random time changes for multivariate counting processes. Scand. Actuar. J. no. 2, 81-101. H. Aro, B. Djehiche and B. Löfdahl (2013): Stochastic modelling of disability insurance in a multi-period framework (to appear in Scand. Actuar. J.). Efron, B. (1988): Logistic regression survival analysis and the Kaplan-Meier curve. J. Amer. Stat. Assoc. 82(402), pp. 414-425. Hyndman, R. and Ullah, M.S. (2007): Robust forecasting of mortality and fertility rates: A functional data approach. Comput. Stat. Data Analys. 51, pp. 4842-4956. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview II. Non-life insurance We give a very short overview on claims reserving for a non-life insurance product. We will only focus on the Chain-Ladder techniques for claims reserving. Credibility theory and related topics are not covered in this short course. Non-life insurance (also called General insurance (UK) or Property and Casualty Insurance (USA) usually includes the following lines of business: I Motor insurance (motor third party liability, motor hull) I Property insurance (private and commercial property against fire, water, flooding etc..) I Liability insurance (private and commercial liability including director and officers (D & O) liability insurance). I Accident insurance (personal and collective accident incl. compulsory accident insurance and workmen’s compensation). I Health insurance I Marine insurance I Travel, legal protection, credit insurance, epidemic insurance etc.. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The history of a typical claim may look as follows: Reporting Closing Accident Claims payments Closing Reopening Payment ? ? ? ? ?? ? ? ? ? - time Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview A framework to the claim reserving problem We assume that the data are adjusted for inflation and exposure. Assume that we have N claims with reporting dates T1 ≤ T2 ≤ . . . ≤ TN . Fix the i-the claim. Then Ti = Ti,0 , Ti,0 , . . . , Ti,Ni is the sequence of dates where some action on the claim i is observed at time Ti,j , new information available at time Ti,j for claim i Ii,j = 0, else, payment at time Ti,j for claim i Xi,j = 0, else, (with Xi,j = 0, Ii,j = Ø if Ti,j = ∞.). Payment process of claim i. The settlement process of the claim i, (Ti,j , Ii,j , Xi,j )j≥0 , defines I The cumulative payment process X Ci (t) = Xi,j (87) j:Ti,j ≤t I The total ultimate claim amount Ci (∞) = Ci (Ti,Ni ) = X Xi,j , (88) j≥0 Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview I The outstanding claim at time t: total claims reserves for claim i at time t for the future liabilities: X Ri (t) = Ci (∞) − Ci (t) = Xi,j . (89) j:Ti,j >t I The aggregated processes of all claims are FtN = σ{(Tij , Ii,j , Xi,j )i≥1,j≥0 : Ti,j ≤ t}, Ft = σ(FtN ⊗t ), (if additional exogenous information is available), C (t) = PN i=1 Ci (t), P R(t) = C (∞) − C (t) = N i=1 Ri (t) X = Ri (t)11{Ti ≤t} + i X Ri (t)11{Ti >t} . i | {z } reserves for reported claims | {z } reserves for IBNR claims (90) where, IBNR claim stands for ”incurred but not yet reported” claim. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Consider the process that counts the number of reported claims: X Nt = 11{Ti ≤t} . i≥1 Then R rep (t) = E[ P i P t Ri (t)11{Ti ≤t} |Ft ] = E[ N i=1 Ri (t)|Ft ], P P R IBNR (t) = E[ i Ri (t)11{Ti >t} |Ft ] = E[ N i=Nt +1 Ri (t)|Ft ] are the best estimates of the expected future payments of the reported (at the individual claim’s level) respect. IBNR claims (available only at the collective (aggregated) level). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Best estimates of outstanding loss liabilities The claims reserving problem aims at estimating I the conditional distributions (very difficult!) µt (·) = P[C (∞) ∈ ·|Ft ], I the conditional expectation (which is a martingale) Mt := E[C (∞)|Ft ], I (91) (92) the conditional variance (which is a supermartingale) Vt := Var (C (∞)|Ft ) = E[C 2 (∞)|Ft ] − (E[C (∞)|Ft ])2 (93) At this level one could use the martingale representation theorem (or filter theory) to determine e.g. M, through an ”innovation gain process” (Malliavin derivative). But, we have scarce information and statistically few data points to characterize the gain process. Thus, we will use more ”adhoc” though intuitively sound and simpler models that we fit to data. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The Chain-ladder method We first introduce the so-called claims development triangles and some notation: i = accident year, year of occurrence, j = development year, development period. Let Xi,j denotes all payments in development period j for claims with accident year i i.e. payments for claims with accident year i done in accounting year i + j. The cumulative payments are Ci,j := j X Xi,k , (94) k=0 the payment in a single accounting year is (the (k+1)-st diagonal) X Xk = Xi,k , (95) i+j=k the outstanding loss liabilities (or claims reserve) for accident year i at time j we have to estimate are (lower triangle) Rij = ∞ X Xi,k = Ci,∞ − Ci,j . (96) k=j+1 Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Accident year i Development year j 1 2 3 ... j ... J 0 1 . . . I+1-J I+2-J Realizations of Ci,j , Xi,j . . . I+i-J . . . . . . I-2 Predicted Ci,j , Xi,j I-1 I Table: Development triangle Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview We assume that the last development period is J: Xi,j = 0 for j > J, and the last observed accident year is I (I ≥ J). We have observations in the upper triangle DI = {Xi,j , i + j ≤ I }, k = {Xi,j , i + j ≤ I , 0 ≤ j ≤ k} ⊆ DI , (J = DI ), and the payments in the lower triangle we need to estimate: DIc = {Xi,j , i + j > I }. There at least three models of claims reserves. Below, we will review the following models (preferred by actuaries). I The chain-ladder model (Mack) (distribution-free), I The Bornhuetter-Ferguson model, I The Poisson model. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The Chain-ladder model Model Assumptions I Different accident years i are independent. I (Ci,j )j is a Markov chain such that there exist factors (CL or age-to-age 2 factors) f0 , f1 , . . . , fJ−1 > 0 and variance parameters σ02 , σ12 , . . . , σJ−1 such that, for all 0 ≤ i ≤ I , 1 ≤ j ≤ J E[Ci,j |Ci,1 , . . . , Ci,j−1 ] = E[Ci,j |Ci,j−1 ] = fj−1 · Ci,j−1 , (97) 2 Var [Ci,j |Ci,1 , . . . , Ci,j−1 ] = Var [Ci,j |Ci,j−1 ] = σj−1 · Ci,j−1 . (98) From these assumptions it immediately follows that Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Lemma (a) The process (Ci,j · Πj−1 k=0 fk )j≥0 is a martingale. In particular −1 E[Ci,j ] = ΠJ−1 k=j fk E[Ci,J ], (99) −1 i.e. ΠJ−1 is the ”deflator” or ”discount factor” that describes the k=0 fk proportion already paid from E[Ci,J ] after j development periods. (b) For all I − J + 1 ≤ i ≤ I , E[Ci,J |DI ] = E[Ci,J |Ci,I −i ] = Ci,I −i · fI −i · · · fJ−1 . (100) and Var (Ci,J |DI ) = (E[Ci,J |Ci,I −i ])2 J−1 X k=I −i Boualem Djehiche KTH Royal Institute of Technology, Stockholm σk2 /fk2 . E[Ci,k |Ci,I −i ] (101) Statistical estimation techniques in life and non-life insurance An overview For known factors fj , the best estimate of the expected claims reserves of accident year i based on DI is given by E[Ci,J |DI ] − Ci,I −i = Ci,I −i (fI −i · · · fJ−1 − 1). (102) Chain-ladder estimators • The CL factors fj−1 are estimated with PI −j k=0 Ck,j b . fj−1 = PI −j k=0 Ck,j−1 (103) • The CL estimator of E[Ci,J |DI ] is given by CL bi,j b i,J |DI ] = Ci,I −i · b C = E[C fI −i · · · b fj−1 . (104) • The CL estimator of the variance factor σj2 is σ bj2 = 2 I −j−1 X 1 Ci,j+1 b Ci,j − fj . I − j − 1 i=0 Ci,j Boualem Djehiche KTH Royal Institute of Technology, Stockholm (105) Statistical estimation techniques in life and non-life insurance An overview Lemma (a) The estimator b fj of fj is j+1 . Given j , b fj is unbiased: E[b f j |j ] = f j . b b (b) The estimator fj for fj is unbiased: E[fj ] = fj , (c) b f0 , · · · b fJ−1 are uncorrelated (though dependent!): E[b f0 · · · b fJ−1 ] = E[b f0 ] · · · E[b fJ−1 ], for 1 ≤ j ≤ J − 1, b fj−1 and b fj are, given j−1 , negatively correlated: 2 Cov (b fj−1 ,b fj 2 |j−1 ) < 0; (d) Given j , the estimator b fj satisfies the following minimum conditional variance property: σj2 Var (b fj |j ) = PI −j−1 i=0 Boualem Djehiche KTH Royal Institute of Technology, Stockholm I −j−1 Ci,j = min αi ∈R X i=0 αi Ci,j+1 , Ci,j (106) Statistical estimation techniques in life and non-life insurance An overview Lemma (a) Given j , σ bj2 is an unbiased estimator of σj2 : E[b σj2 |j ] = σj2 . (b) σ bj2 is an unconditionally unbiased estimator of σj2 : E[b σj2 ] = σj2 . Lemma CL bi,j (a) The estimator C is an unbiased estimator for E[Ci,J |DI ] = E[Ci,J |Ci,I −i ]: CL bi,j E[C |CI −i ] = E[Ci,J |DI ], CL bi,j E[C ] = E[Ci,J ]. (b) An estimator of the conditional variance process for a single accident year i is J−1 2 X σ bk2 /b fk2 CL d (Ci,J |DI ) = C bi,j Var , (107) b CL C k=I −i i,k or, in a recursive form, CL 2 2 d (Ci,J |DI ) = Var d (Ci,J−1 |DI ) · b bi,J−1 Var fJ−1 +σ bJ−1 ·C . Boualem Djehiche KTH Royal Institute of Technology, Stockholm (108) Statistical estimation techniques in life and non-life insurance An overview A time series model (conditional resampling techniques) The Ci,j ’s follow an auto-regressive model: We assume that there exists fj > 0, σj > 0 and random variables i,j+1 such that for all i ∈ {0, . . . , I } and j ∈ {0, . . . , J − 1} we have p Ci,j+1 = fj · Ci,j + σj · Ci,j · i,j+1 , (109) where, given 0 , the random variables i,j+1 are independent with E[i,j+1 |0 ] = 0, for i ∈ {0, . . . , I }, E[2i,j+1 |0 ] = 1, P[Ci,j+1 > 0|0 ] = 1, j ∈ {0, . . . , J − 1}. This model have many nice properties that we leave to the reader to investigate (see e.g. Wüttrich and Merz (2008)). Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The Bornhuetter-Ferguson model This is a robust method that does not consider outliers in the observations. Many actuaries prefer this model. Model Assumptions I Different accident years i are independent. I There exist parameters µ0 , . . . µI > 0 and a pattern β0 , . . . βJ > 0 with βJ = 1 such that for all i ∈ {0, . . . , I }, j ∈ {0, . . . , J − 1} and k ∈ {0, . . . , J − j}, E[Ci,0 ] = µi · β0 , E[Ci,j+k |Ci,0 , . . . , Ci,j ] = Ci,j + µi · (βj+k − βj ). (110) In particular, we have E[Ci,J ] = µi , E[Ci,j ] = µi · βj = βj · E[Ci,J ]. (111) Compared with (99) in the Chain Ladder model, βj plays the role of the deflator −1 ΠJ−1 k=j fk . (112) Therefore, we can use the estimators b fj of fj to estimate βj : b−1 βbj := ΠJ−1 k=j fk . Boualem Djehiche KTH Royal Institute of Technology, Stockholm (113) Statistical estimation techniques in life and non-life insurance An overview The Bornhuetter-Ferguson estimators The BF-estimator is given by BF bi,j b i,J |DI ] = Ci,I −i + (1 − βbI −i ) · µ C = E[C bi J−1 b−1 = Ci,I −i + 1 − Π f ·µ bi , (114) k=I −i k where, µ bi = is an a priori estimator of E[Ci,J ]. CL bi,j To compare this estimator with the CL-estimator C , we rewrite (104) in the form (114) as follows: CL bi,j b i,J |DI ] = Ci,I −i · ΠJ−1 b C = E[C k=I −i fk J−1 b = Ci,I −i + Ci,I −i · Πk=I −i fk − 1 b CL C b = Ci,I −i + J−1i,J b · ΠJ−1 k=I −i fk − 1 Πk=I −i fk CL b−1 · C bi,J = Ci,I −i + 1 − ΠJ−1 . k=I −i fk Boualem Djehiche KTH Royal Institute of Technology, Stockholm (115) Statistical estimation techniques in life and non-life insurance An overview Hence, the difference lies in the fact that the BF-estimator relies on an a priori CL bi,J which is completely estimate of µi , whereas the CL estimator relies on C derived from observations. I Often an apriori estimate of µi is often a (subjective) plan value from a strategic business plan. I The ”still-to-come” factor 1 − βI −i is often estimated using (113) through the observations driven chain-ladder technique. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The Poisson model The Poisson model is mainly used to count claims. The remarkable thing with this model is that it leads to the same reserve as the chain-ladder model. Mack (1991) was first to prove that the chain-ladder reserves are obtained as maximum likelihood estimates of the reserves for the Poisson model. Model Assumptions I There exist parameters µ0 , . . . , µI > 0 and γ0 , . . . , γJ > 0 such that the incremental payments Xi,j are independent Poisson distributed with mean E[Xi,j ] = µi · γj , PJ for all i ≤ I and j ≤ J, and j=0 γj = 1. The cumulative payment in accident year i, Ci,J = distributed with mean E[Ci,J ] = µi . Boualem Djehiche KTH Royal Institute of Technology, Stockholm (116) PJ k=0 Xi,k is then Poisson (117) Statistical estimation techniques in life and non-life insurance An overview We have I µi stands for the expected number of claims in accident year i. I i,j γi = E[Ci,J defines an expected reporting/cash flow pattern over different ] development periods j. Moreover, E[X ] E[Xi,j ] γj = E[Ci,0 ] γ0 is independent of i. I The Poisson model satisfies the assumptions of the Bornhuetter-Ferguson model with pattern factors β0 = γ0 and βj = j X γk , k=0 where, P E[Ci,j+k |Ci,0 , . . . , Ci,j ] = Ci,j + kl=1 E[Xi,j+l |Ci,0 , . . . , Ci,j ] Pk = Ci,j + µi · l=1 γj+l = Ci,j + µi · (βj+k − βj ) (118) and E[Ci,0 ] = E[Xi,0 ] = µi · β0 . Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview The Poisson estimators In view of (118), a way to estimate the parameters (µi )i and (γj )j is to use the BF-estimators. But, the Poisson distribution assumption makes it natural to use the maximum likelihood technique as follows. The likelihood function for random variables DI = {Ci,j ; i + j ≤ I , j ≤ J} is given by e −µi ·γj (µi γj )Xi,j LDI (µ0 , . . . , µI ; γ0 , . . . , γJ ) := Πi+j≤I Xi,j ! Maximizing this function w.r.t. the parameters (µi ) and (γj )j , we obtain the following system (which admits a unique solution) Pmin(I −i,J) Pmin(I −i,J) µbi · γbj = j=0 Xi,j = Ci,min(I −i,J) , j=0 PI −j bi · γbj = i=0 µ (119) PI −j i=0 Xi,j , for all 0 ≤ i ≤ I and 0 ≤ j ≤ J. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview We have the following Lemma The ML estimator of the Poisson model is given for i + j > I by Poi bi,j b i,j ] = µbi · γbj , X = E[X Poi bi,J b Poi b i,J |DI ] = Ci,I −i + P C = E[C −i+1 Xi,j j=IP P I −i = Ci,I −i + j=I −i+1 1 − j=0 γbj · µbi . Boualem Djehiche KTH Royal Institute of Technology, Stockholm (120) Statistical estimation techniques in life and non-life insurance An overview The log-normal distribution model and the GLM This model was suggested by Kremer (1982) and further discussed by Verall (1990) based on a subtle result by Finney (1941). Model Assumptions I The incremental payments Xi,j are independent and log-normally distributed, i.e. Yi,j := log Xi,j , Yi,j ∼ N (E[Yi,j ], Var (Yi,j )) . (121) We want to estimate the mean θi,j := E[Xi,j ] along with the corresponding standard errors. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Kremer (1982) suggested a so-called General Linear Model (GLM) for the mean θi,j when Yi,j := β0 + β1 xij1 + . . . + βp−1 xijp−1 + i , where, the matrix (xijk )i,k is called the design matrix and i are assumed to be white noise with mean zero and the same variance as Yi,j . We skip this framework and instead focus on a direct estimation of θi,j . The eventual estimator θbi,j should be unbiased. Since in claim reserving, the number of claims is generally not large and an asymptotically unbiased estimator (but is biased) may induce quite fatal estimates of the claims reserve. This problem arises also when calibrating the Geometric Brownian motion to observed returns of financial assets and contains some subtle details. We only look at the one-dimensional case. Extension to our set up is merely straightforward. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Parameter estimation of the log-normal distribution Suppose X1 , . . . , Xn are i.i.d. observations which are log-normally distributed. We want to find an unbiased estimator of the mean θ := E[Xi ]. Set Yi = log(Xi ) ∼ N(µ, σ 2 ), Then θ = exp (µ + i = 1, . . . , n. 1 2 σ ). 2 (122) The maximum likelihood estimates of µ and σ 2 are the independent r.v. P 2 µ̂n := n1 nj=1 Yi ∼ N(µ, σn ) σˆn 2 := 1 n Pn j=1 (Yi − µ̂)2 ∼ σ2 n · χ2n−1 , where, we know that σ̂n is a biased estimator of σ, but sn2 := n σ̂n2 n−1 is an unbiased estimator of σ. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview An estimator of θ would be θ̂n = exp (µ̂n + 1 2 σ̂n ). 2 (123) The moment generating functions of µ̂n and σ̂n2 are 1 σ2 t 2 n E[e µ̂n t ] = e µt+ 2 2 σ 2 − n−1 2 E[e σ̂n t ] = 1 − 2 t . n , Hence, 1 1 σ2 n 2 E[θ̂n ] = E[e µ̂n ] E[e 2 σ̂n ] = e (µ+ 2 1 2 = e (µ+ 2 σ ) e − 1 2 n−1 σ 2 n 2 → e (µ+ 2 σ ) e − σ2 2 e 1− σ2 2 ) 1− n−1 σ2 − 2 n = θ, n−1 σ2 − 2 n >θ n → ∞, i.e. θ̂ is only an asymptotically unbiased estimator of θ. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview To correct for the bias, Finny (1941) introduced the function gm (t), where gm (t) := ∞ X k=0 mk (m + 2k) tk . m(m + 2) · · · (m + 2k) k! (124) (Note that gm (t) → e t , m → ∞.) Finny (1941) showed that with m = n − 1, 1n−1 2 sn ) θ˜n := e µ̂n gn−1 ( 2 n (125) is an unbiased estimator of θ. Moreover, since gm (t) → e t , m → ∞, θ˜n is also asymptotically unbiased. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview Analysis of the standard errors We would like to find an unbiased estimator of the variance of θ˜n : 2 en := Var (θ˜n ) = E[θ˜n ] − E[θ˜n ]2 . We have 2 2 1 2 E[θ˜n ]2 = θ2 = e (µ+ 2 σ ) = e (2µ+σ ) , whose unbiased estimator is e 2µ̂n gn−1 ( n−1 2 sn ). n Thus an unbiased estimator of en is 2 ( ên := e 2µ̂n [gn−1 1n−1 2 n−1 2 sn ) − gn−1 ( sn )]. 2 n n Boualem Djehiche KTH Royal Institute of Technology, Stockholm (126) Statistical estimation techniques in life and non-life insurance An overview Main references Björkwall, S. (2011): Stochastic claims reserving in non-life insurance: Bootstrap and smoothing models. PhD. Stockholm University, Stockholm. Wüthrich, Mario V. and Merz, M. (2008): Stochastic Claims Reserving Methods in Insurance. John Wiley & Sons. Taylor, Creg C. (2000): Loss Reserving: An Actuarial Perspective. Kluwer Academic Publishers. Verrall, R.J. (1991): On the estimation of reserves from loglinear models. Insurance: Mathematics and Economics. Vol. 10(1) pp. 75-80. Boualem Djehiche KTH Royal Institute of Technology, Stockholm Statistical estimation techniques in life and non-life insurance An overview
© Copyright 2024 Paperzz