A combined stochastic programming and optimal control approach to personal finance and pensions Agnieszka Karolina Konicza∗ David Pisingera [email protected] [email protected] Kourosh Marjani Rasmussena Mogens Steffensenb [email protected] [email protected] April 4, 2013 a DTU Management Engineering, Management Science, Technical University of Denmark, Produktionstorvet 426, 2800 Kgs. Lyngby, Denmark b Department of Mathematical Sciences, University of Copenhagen, Universitetsparken 5, 2100 Copenhagen, Denmark Abstract The paper presents a model that combines a dynamic programming (stochastic optimal control) approach and a multi-stage stochastic linear programming approach (SLP), integrated into one SLP formulation. Stochastic optimal control produces an optimal policy that is easy to understand and implement. However, explicit solution may not exist, especially when we want to deal with constraints, such as the limits on the portfolio composition, the limits on the insured sum, an inclusion of transaction costs or taxes on capital gains, which are important issues regularly mentioned in the scientific literature. Two applications are considered: (A) optimal investment, consumption and insured sum for an individual maximizing the expected utility of consumption and bequest, and (B) optimal investment for a pension saver who wishes to maximize the expected utility of retirement benefits. Numerical results show that among the considered practical constraints, the presence of taxes affects the optimal controls the most. Furthermore, the individual’s preferences, such as impatience level and risk aversion, have even a higher impact on the controlled processes than the taxes on capital gains. Keywords: dynamic programming, multi-period stochastic linear programming, power utility, personal finance, retirement. JEL Classification: C61, D14, D91, G11, G22 1 Introduction The purpose of this paper is to define a framework for solving various multi-period financial optimization problems. The framework is based on a model which combines two optimization technologies: dynamic programming (stochastic optimal control), which is a classical approach for continuous time financial problems, and stochastic programming, which is broadly applied in operations research and allows us to focus on the practicalities of the problem. These two methods complement each other, especially in the areas where one or the other does not perform well on its own, see e.g. Mulvey (2004). Stochastic optimal control produces an optimal policy that is easy to understand and implement. However, explicit solution may not exist, especially, when we want to deal with constraints, such as limits on portfolio composition, limits on the insured sum, ∗ Corresponding author at: DTU Management Engineering, Management Science, Technical University of Denmark, Produktionstorvet 426, 2800 Kgs. Lyngby, Denmark. Tel.: +45 4525 3109; Fax: +45 4525 3435; email: [email protected] 1 an inclusion of transaction costs or taxes on capital gains, which are important issues regularly mentioned in the scientific literature. Stochastic programming (SLP) is a general purpose decision model with an objective function that can take a variety of forms. It is based on the scenario trees that represent the range of possible outcomes for the uncertainties. Rather than finding a generic optimal policy, the optimal solution is computed numerically at each node in the scenario tree, for the specified decision variables. SLP can easily address realistic considerations and constraints, as long as they have an algebraic form. The main drawback of this optimization method is that the problem size grows quickly as a function of the number of periods and scenarios. In particular, taking into consideration the entire lifetime of an individual can be challenging in terms of computational feasibility. The problem of optimal consumption and investment has been widely investigated. In terms of dynamic programming approaches, probably the most influential papers have been written by Merton (1969, 1971) who defined the original problem of optimizing utility of consumption and terminal wealth over a fixed time horizon for an investor. This work has inspired many researchers who later either expanded the original model or investigated different objective functions, by introducing for instance the lifetime uncertainty, insured sum and the labor income, Richard (1975), stochastic interest rate, Munk and Sørensen (2004), salary uncertainty, Cairns et al (2006), multi-person household, Bruhn and Steffensen (2011), borrowing constraints, Byung and Yong (2011), or constant linear taxation, Bruhn (2012). Applications within defined contribution pension scheme with focus on the investment strategy either during the accumulation or post retirement phase (decumulation) together with the optimal time of annuitization, have been considered by Milevsky and Young (2007) and Gerrard et al (2004, 2012). The stochastic programming approach is more common within portfolio management and asset-liability management than within the areas concerning the individual investors. See, for instance, Mulvey et al (2006, 2007), who argue that multi-period investment models combined with Monte Carlo simulation can address significant issues for long-term investors, and Ferstl and Weissensteiner (2010), who present a stochastic linear programming model for optimal asset allocation in a situation where a financial company wishes to minimize the conditional value at risk. Nevertheless, applying stochastic programming or other numerical approaches to find the optimal decisions from the individual investors’ point of view can also be found in the literature. See, e.g. Horneff et al (2008), considering an individual characterized by Epstein-Zin preferences, Cai and Ge (2012), investigating the asset allocation for an investor with a loss aversion objective, a predetermined objective and a greedy objective, Consigli et al (2012), comparing investment opportunities offered by traditional pension products and unit-linked contracts with variable life annuities, and Blake et al (2013), deriving the optimal investment for a loss averse pension saver with an interim and final target. Finally, Geyer et al (2009a) argue that two main optimization technologies, namely, dynamic programming and stochastic linear programming can be combined and both solutions can be integrated into one SLP formulation. The authors show that it is possible to accurately replicate the closed-form solution obtained by Richard (1975) by using a multi-stage stochastic programming approach. However, they do not include the insured sum making the model less applicable. To the best of our knowledge, this is the only paper that combines both methodologies. We apply the idea of combining stochastic optimal control with multi-period stochastic programming presented in Geyer et al (2009a) and analyse two optimization problems: (A) Optimal investment, consumption and insured sum for an individual maximizing the utility of consumption and a terminal utility of leaving a positive amount of money upon death over an uncertain lifetime, Richard (1975). This problem has been considered in Geyer et al (2009a), but in contrast to their work we have formulated the SLP model including an optimal control for the insured sum. (B) A pension saver wishes to maximize the utility of the future retirement benefits by controlling the optimal investment both before and after retirement and optimal benefits only after retirement. This problem has not to our knowledge been considered in the literature, and we find it particularly relevant for a pension product design in a defined contribution plan. The optimal controls help the pension fund design a payout profile for a retiree based on her impatience level and risk preferences. Except for working with different optimization problems, our paper differs from Geyer et al (2009a) in several aspects, among which the main ones are: • We derive the explicit formula for the optimal value function and optimal controls for problem (B) using 2 Hamilton-Jacobi-Bellman techniques. This formulation has not to our knowledge been considered in the scientific literature. • We focus on the optimal controls during the individual’s entire lifetime. Not only the decisions at the beginning of the contract are of our interest, but particularly their development with time. • We use a different method for the linearization of the objective function based on a non-linear optimization problem. We provide the evidence that the method is more suitable for our analysis and allows for controlling the linearization error better. • We follow a standard actuarial procedure and model the lifetime uncertainty in a continuous time setting with parameters calibrated to the Danish mortality rates. We find that: • The results from the model combining two optimization methodologies are consistent with the optimal controls that can be calculated explicitly using stochastic dynamic approach. The model defines a solid framework for various life long optimization decisions, while allowing for incorporating a number of practical constraints. • Among limits on portfolio composition, limits on the insured sum, transaction costs, and taxes on capital gains, only the latter has a significant impact on the optimal controls. • The individual’s preferences, such as risk aversion and impatience level, have a higher impact on the optimal controls, and especially, on the size and development of the benefits, than limits on the portfolio composition, limits on the insured sum, transaction costs or taxes on capital gains. • The presence of transaction costs in contrast to other considered constraints suggests a non-constant investment in risky assets in the situation where there is no income (such as the period after retirement). All other constraints are consistent with the classical result, which claims the the optimal proportion in risky assets should be constant. The paper is organized as follows. Section 2 describes problems (A) and (B) in more detail together with their optimal solutions obtained via dynamic programming. Section 3 explains how to build a bridge between dynamic programming and stochastic linear programming by incorporating both optimization methods in one SLP formulation. Section 4 focuses on defining the SLP model, i.e. the objective function and the constraints, the linearization of the objective and scenario tree generation. In Section 5 we present the numerical results of the optimal controls obtained via two different optimization methods, and the impact on the controls of the modifications considered during the first years of the contract. Finally, in Section 6 we summarize the paper and suggest the future work. The paper also includes four appendices consisting of: derivation of the optimal value function and controls for problem (B), App. A, the mutual fund theorem, App. B, derivation of the expected values of the savings, App. C, and the comparison of two linearization methods, App. D. 2 Model description The section develops the general model settings relevant for problems (A) and (B). • We denote by Xt the amount of savings that belong to an individual interpreted as general personal savings (problem (A)), and the level of savings on the pension account (problem (B)). • The savings are invested in N underlying assets: one risk-free (bonds) and N − 1 risky assets (an equity fund). The economy is represented by a standard Brownian motion W , which is defined on the measurable space (Ω, F), where F is the natural filtration of W . The asset prices Sti are modelled by Black-Scholes dynamics: dSti = αi Sti dt + σi Sti dWti , S0i = (1) si0 , 3 where αi and σi are constants. The risk-free asset is defined by α3 = r and σ3 = 0. Merton (1971) shows that without loss of generality we can assume that all risky assets are included in one mutual fund, the prices of which follow the dynamics dSt = αSt dt + σSt dWt , where α and σ are defined in App. B. • The payments connected to the savings account are formalized by the continuous processes: – a payment process lt that, depending on the problem, is interpreted either as the labor income that is contributed to the account or the percentage of the labor income that is paid to the retirement savings (premiums), – a payment process ct determining the consumption of the savings for the private purposes or the benefits that the pension fund pays to the retiree. • The individual has an uncertain lifetime characterized by a finite state Markov chain Z, defined on a measurable space (Ω, F). The jump intensities of the process Z, µ and µ∗ , denote the mortality rates under the objective probability measure P and under the pricing probability measure P∗ , respectively. P and P∗ are equivalent probability measures on (Ω, F) and the jump intensities are defined in Sec. 5. • The individual is risk averse and has a utility function u characterized by a constant relative risk aversion 1 − γ, constant elasticity of intertemporal substitution 1/(1 − γ) (EIS) and time dependent weights wt reflecting the importance of present consumption in contrast to future consumption characterized by the impatience factor ρ: u(t, c) = γ1 wt1−γ cγ , where wt1−γ = e−ρt . Parameter γ is defined for (−∞, 1) \ {0}, whereas for γ = 0 we have the case of the logarithmic utility. Below, we define the model setup specific for the particular problems. 2.1 Problem (A) - optimal investment, consumption and insured sum We keep the original settings defined in Richard (1975) and recall only the most important assumptions and results that are crucial for this paper. Savings dynamics. The individual has a bequest motive and upon death her heirs receive an amount on the savings account plus the insured sum It . We assume that life insurance policies are tradable financial contracts and the individual has access to a life insurance market. She buys a life insurance with a insured sum It , and for that coverage she pays a premium at rate µ∗t It , where µ∗t is the natural premium intensity decided by the life insurance company, also called pricing mortality. In particular, we allow for negative It , which means that the person sells the amount It to the pension fund and purchases life annuities. The wealth dynamics while the person is alive develop as follows: dXt = r + πt (α − r) Xt dt + πt σXt dWt + lt dt − ct dt − µ∗t It dt, (2A) X0 = x0 , where r is the return on the risk-free asset, α the return on the risky mutual fund, σ the volatility of the mutual fund, π is the proportion invested in the mutual fund, and 1 − π in the risk-free asset. Optimization problem. Since the individual has a bequest motive, she obtains the utility from the amount that she will leave upon death to her heirs: U (t, x) = γ1 vt1−γ xγ , where v is the weight for this utility. It sounds reasonable to define v in terms of the weights for the utility of consumption w, hence v denotes the weight put on her heir’s consumption relative to her own. vt1−γ = λ−γ wt1−γ , 4 where λ is a constant. The individual can control consumption, investment and insured sum, in order to maximize the utility of consumption and bequest. Given the wealth dynamics, X, the problem is mathematically formulated as: # "Z e T R A − ts µτ dτ u(s, c) + µs U (s, Xs + Is ) ds , (3A) V (t, x) = sup Et,x e t π,c,I∈Q[t,Te) V A (Te, x) = 0, where Et,x is the conditional expectation under P, given that the person is alive at time t and holds wealth Xt = x, and Q[t, Te) is the set of control processes for the time [t, Te) which are admissible at time t. Te is aR fixed time point at which the investor is dead with certainty. Both utilities are multiplied by factor s e− t µτ dτ denoting the probability that the individual survives until time s > t, given she has survived until time t. Furthermore, the utility of bequest is multiplied by the mortality intensity rate µs , which represents the probability that the person dies within a short period after time s. The utility functions u and U are assumed to be strictly concave in c and X + I, respectively. The optimal value function for this problem is given by V A (t, x) = γ 1 fA (t)1−γ x + gA (t) , γ (4A) where Z Te fA (t) = e 1 R − 1−γ ts µτ −γ(µ∗ τ +ϕ) dτ t Z Te gA (t) = e− Rs t (r+µ∗ τ )dτ " ws + µs (µ∗s )γ # 1/(1−γ) vs ds, ϕ=r+ (α−r)2 2σ 2 (1−γ) , ls ds, (5A) (6A) t and the optimal investment, consumption and insured sum are of the form: πt∗ = 2.2 α−r σ 2 (1−γ) Xt +gA (t) , Xt c∗t = wt fA (t) It∗ = Xt + gA (t) , µt µ∗ t 1/(1−γ) vt fA (t) (Xt + gA (t)) − Xt . (7A) Problem (B) - optimal investment with optimal life annuities The individual is considered specifically as a pension saver who wishes to maximize the expected utility function of her future retirement benefits. She purchases a pension contract at time t0 with an assumption of retiring at time T and receiving the benefits until time Te. For Te < ∞, the pension saver is interested in temporary life annuities, whereas for Te → ∞ she prefers life annuities. Furthermore, we assume that the benefits are directly linked to the market and no guarantees are provided. One can think of this model as a special case of problem (A), where the individual is a pension saver with no bequest motive. In case of her death, the pension fund inherits all her savings. Moreover, she obtains no utility from consumption before retirement. We interpret the consumption process specifically as a pension benefit process, and process lt as pension contributions (premiums). Savings dynamics. We split the problem into a before retirement (accumulation) phase and an after retirement (decumulation) phase. In practise the pension saver is not allowed to withdraw the pension savings for the consumption reasons or receive the benefits before retirement. Thus, ct = 0 for t ∈ [t0 , T ). It is also reasonable to assume that the payment process lt denoting the premiums paid to the pension fund is set to 0 after retirement, i.e. lt = 0 for t ∈ [T, Te). Since the person has no bequest motive, she has an additional income of µ∗t Xt , which is the amount that the pension fund pays to be her only inheritor. With these assumptions the definition of savings dynamics are of the form (r + πt (α − r) + µ∗t ) Xt dt + πt σXt dWt + lt dt, t ∈ [t0 , T ), dXt = (2B) (r + πt (α − r) + µ∗t ) Xt dt + πt σXt dWt − ct dt, t ∈ [T, Te), Xt = x0 . 5 Optimization problem. The pension saver wishes to maximize the utility of pension benefits, or in other words, the utility of consumption after retirement. She obtains no utility from consumption before retirement, i.e. u(t, c) = 0 for t ∈ [t0 , T ), hence: "Z e # T V B (t, x) = sup e− Et,x π∈Q[t,Te), c∈Q[max(t,T ),Te) Rs t µτ dτ u(s, c)ds , (3B) max(t,T ) V B (Te, x) = 0. where, as before, Et,x is the conditional expectation under P, given that the person is alive at time t and holds savings Xt = x, and Q[t, Te) is the set of control processes for time [t, Te) which are admissible at time t. The control processes in this problem are the proportion in the risky fund πt and the benefits ct . Note that even though the individual obtains no utility from consumption before retirement, the investment process πt is controlled both before and after retirement. Also, as in problem (A), we take into consideration the R − ts µτ dτ . At time Te the uncertain lifetime of the individual by multiplying the utility function by the factor e investor is assumed to be dead with certainty. The utility function u is assumed to be strictly concave in c. To our knowledge this problem has not been considered in the literature, therefore we derive the optimal value function and the optimal controls. We refer the reader to Appendix A for details. The optimal value function is of the form: γ (4B) V B (t, x) = γ1 fB (t)1−γ x + gB (t) , where 1 RT R Te − 1 RTs (µ −γ(µ∗ − τ +ϕ))dτ 1−γ fB (t) = e 1−γ t τ T e 1 Rs ∗ R e µ −γ(µτ +ϕ) dτ T − fB (t) = t e 1−γ t τ ws ds, ( µτ −γ(µ∗ τ +ϕ) dτ ws ds, t ∈ [t0 , T ), (5B) t ∈ [T, Te), Rs RT ∗ gB (t) = t e− t (r+µτ )dτ ls ds, t ∈ [t0 , T ), gB (t) = 0, t ∈ [T, Te), (6B) and the optimal controls are given by πt∗ = α−r Xt +gB (t) , σ 2 (1−γ) Xt t ∈ [t, Te), c∗t = w(t) fB (t) Xt , t ∈ [T, Te). (7B) Since the utility function u is concave in c, the retirement benefits must be non-negative. Furthermore, we must have that XTe = 0, which ensures that all the savings that belong to the pension saver have been paid out during the distribution phase. In both problems the functions gA (t) and gB (t) are of the same form. The first function have been defined by Richard (1975) as human capital and represents the expected present value of future earnings, or, in other words, what an individual’s labor force is worth in the financial market. Function gB (t) represents the present value of the future premiums. 3 Combined SLP and optimal control One of the main contributions of the paper is constructing a model which combines two optimization methodologies: dynamic programming and stochastic programming. We are motivated by the fact that the explicit solutions only exist for simple models, hence often cannot be applied in practise. As soon as we add some realistic constraints, such as limits on portfolio composition, limits on the insured sum, transaction costs or capital gains taxes, explicit solutions do not exist. The SLP approach fills this gap and allows for incorporating a number of constraints in an easy way. However, the main drawback of SLP is limited ability to handle many time periods under sufficient uncertainty. The scenario tree grows exponentially with each time period, and solving the problem becomes soon computationally infeasible. Therefore, we split the individual’s lifetime into two periods. We apply the SLP approach to obtain the optimal decisions for the first years, t ∈ [t0 , TSLP ), whereas the decisions for the remaining lifetime of the individual, t ∈ [TSLP , Te), are solved 6 Figure 1: The first years decisions are solved by SLP model, whereas the decision over the remaining lifetime of the individual are solved using HJB techniques. explicitly. Fig. 1 presents the concept. The mathematical formulation for problem (A), defined in eq. (3A) can be rewritten using its recursive properties in the following way: Z TSLP RT R − t SLP µτ dτ A A − ts µτ dτ V (TSLP , XTSLP ) . u(s, c) + µs U (s, Xs + Is ) ds + e e V (t, x) = sup Et,x π,c,I∈Q[t,TSLP ) t (8A) The above definition is convenient because it separates the problem into two periods, which allows for applying different optimization methodologies in each period. The crucial part of the model is to insert the optimal value function derived explicitly into the objective function of the SLP model, thus we can ensure that the SLP formulation incorporates the decisions for the entire lifetime and not only for the first years. We start with the latest period, t ∈ [TSLP , Te), and calculate the optimal value function at time TSLP , i.e. V A (TSLP , XTSLP ) according to formula (4A). Then, insert this function into eq. (8A) and solve the optimization problem using multi-period stochastic linear model. In this way we construct one SLP formulation that covers the entire lifetime of the individual. We proceed analogously with problem (B). The mathematical formulation for the optimization problem defined in eq. (3B) can be rewritten as V B (t, x) = Z sup TSLP Et,x π∈Q[t,TSLP ), c∈Q[max(t,T ),TSLP ) e− Rs t µτ dτ u(s, c)ds + e− R TSLP t µτ dτ V B (TSLP , XTSLP ) , max(t,T ) (8B) where V B (TSLP , XTSLP ) is given in eq. (4B). From here, we can focus on modelling the first years decisions via SLP. 4 SLP formulation Stochastic programming is a method for the study of optimal decision making under uncertainty, Birge and Louveaux (1997). It can be compared with dynamic programming, though it allows for more flexibility when defining an objective function and emphasizes constraints accompanying the problem. A multi-period stochastic linear program with recourse combines anticipative and adaptive models in a common mathematical framework. For example, the investor specifies the composition of a portfolio by taking into account both possible future movements of asset returns (anticipation) and that the portfolio will be rebalanced (taking recourse decisions) as prices change (adaptation), Zenios (2008). To start with, let us formulate a two-stage version of the problem with recourse. We need two vectors for decision variables to distinguish between the anticipative and adaptive policy: • y0 - a vector of first-stage decisions, which are made before the random variables are observed; the decisions does not depend on the future observations but anticipate possible future realizations of the random vector, • ỹ1 = y1 (y0 , ω) - a vector of second-stage decisions which are made after the random variables ω have been observed. They are constrained by decisions y0 and depend on the realizations of the random vector ω. 7 The objective is to maximize/minimize a certain function f , whose value depends on the decisions y0 and y1 (y0 , ω) taken at stages t0 and t1 , accordingly. We keep the notation from Geyer et al (2009a) and formulate the problem as follows: Maximize f (y0 ) + E[f (y1 (y0 , ω))], A0 y0 ≤ b0 , s.t. A1 (ω)y1 (y0 , ω) ≤ b1 (ω), where A(ω) and b(ω) are the model parameters, which can also be dependent on the random variable ω. Multi-period stochastic programs have a similar structure. Observations are made at TSLP different stages, hence the decision vectors taken at stage t, ỹt = yt (ω1 , . . . , ωt , y0 , . . . , yt−1 ), depend on the decisions taken in all previous stages and also on the history of the random variables (ω1 , . . . , ωt ). In this way, the problem has the following structure Maximize f (y0 ) + TX SLP E[ft (ỹt )], t=1 At ỹt ≤ bt , s.t. t = t0 , . . . , TSLP , and the information constraints are given as decide y0 → observe ω1 , A(ω1 ), b(ω1 ) → → observe ωT , A(ωTSLP ), b(ωTSLP ) ... → decide ỹ1 → → observe ω2 , A(ω2 ), b(ω2 ) decide ỹTSLP . The probability distribution of the random variable ωi , which in our case reflects the distribution of the returns on risky assets, is described in terms of a discrete number of outcomes (scenarios). A scenario tree consists of nodes nν(t) . Each of them is assigned to a certain period t and state ν(t), whereas the links between the nodes indicate possible transition between states as time evolves. Node n0 is the root node and it has no predecessor, whereas every other node has a unique predecessor from the previous set of states. Each sequence {nν(t0 ) , nν(t1 ) , . . . , nν(TSLP ) }, where nν(tj ) is a predecessor of nν(tj+1 ) for j = 0, . . . , TSLP − 1 defines a scenario with the associated probability P rnν(tj ) ≥ 0. The total number of scenarios in the tree is specified by the product of the number of outcomes in each stage t. An example showing a three-stage problem is illustrated on Fig. 2. Figure 2: A 3-period scenario tree with a constant branching factor bf = 3. 4.1 Objective and constraints The stochastic programming model determines the optimal decisions at each node of the event tree, given the information available at that point. An important assumption is that randomness in asset returns Rni ν(t) is exogenous and not affected by decisions. The following decision variables and parameters are used in the model formulation: Indices: t, stage (point in time), t = t0 , . . . , TSLP , ν(t), state in stage t, ν(t) = 1, . . . , Kt , 8 nν(t) , node corresponding to stage t and state ν(t), i, available asset, i = 1, . . . , N . Parameters: Kt , number of nodes (states) in period t, N , number of available assets, lt , labor income/premiums paid to the savings account at time t, x0 , initial value of the savings, µt , mortality rate for a y+t-year old individual at time t (probability to die between time t and t + 1), µ∗t , pricing mortality rate for a y+t-year old individual at time t (probability to die between time t and t + 1 used by the pension fund for pricing life insurance / life annuities), Rni ν(t) , return on asset i at node nν(t) , obtained via scenario generation, see Sec. 4.4, P rnν(t) , probability of being in node nν(t) obtained via scenario generation, see Sec. 4.4. Decisions variables: Peni ν(t) ≥ 0, amount of asset i purchased at node nν(t) , Seni ν(t) ≥ 0, amount of asset i sold at node nν(t) , eni , total holdings of asset i at node nν(t) , X ν(t) en− , total holdings in all assets at node nν(t) including the accumulated capital gains before any payments X ν(t) i en en− = x0 1{t=t } + PN 1 + Rni 1 . X or trades are made, i.e. X 0 i=1 ν(t−1) {t>t0 } ν(t) ν(t) e Cnν(t) ≥ 0, consumption/benefit at node nν(t) , Ien , insured sum at node nν(t) (problem (A) only). ν(t) Expression 1{(·)=t} denotes an indicator function equal to 1 if (·) = t and 0 otherwise. The entire SLP formulation that replicates the assumptions made in dynamic programming model setup, consists of the objective function and three constraints: the budget constraint, the asset inventory constraint and the constraint on positivity of certain variables. With only these three constraints we can replicate the continuous time models presented in Sec. 2. Afterwards, in Sec. 4.2, we modify these equations in order to investigate the impact of various factors, such as limits on portfolio composition, limits on the insured sum, transaction costs, and taxes on capital gains. 4.1.1 Problem (A) The problem of optimizing the expected utility function of consumption and a terminal utility of leaving a positive amount of money upon death, where the investment, consumption and insured sum are the controlled processes, can be modeled with the set of the following equations. The objective function is obtained by substituting the integrals with the sums and the expectation operator E with its discrete definition in the eq. (8A). The budget constraint (10A), or alternatively, the cash flow balance constraint specifies that the amount invested in the purchase of new securities plus consumption must be equal to the amount gained from the sale of the securities plus labor income that is paid to the savings account plus any initial savings x0 that the person has at the beginning of the contract. Moreover, we add the term µ∗t Ienν (t) to the left hand side of the equation, which is the price for the life insurance that the investor pays at each period. The inclusion of the insured sum variable Ienν(t) in the budget constraint and in the objective function for problem (A) is an important part of the model that distinguishes our work from Geyer et al (2009a). Constraint (11A) calculates the value of the savings on the pension account. It is equal to the accumulated capital gains/losses on the assets held in portfolio from the previous period plus any amount purchased in the current period minus sold securities. The continuous time versions of the problems assume the utility function u to be strictly concave in c, and U to be strictly concave in Xt + It . Furthermore, for γ < 0, u(0) = U (0) = −∞, therefore we define the positivity constraints (12A). For γ ∈ (0, 1) we have u(0) = 0 and U (0) = 0, so the positivity constraints are substituted with non-negativity constraints. 9 This leads to the model: max TSLP X−1 X s=t0 nν(s) X + P rnν(s) · e P rnν(TSLP ) · e− − Rs t0 µτ dτ R TSLP t0 h µτ dτ i en ) + µs U s, X e − + Ien u(s, C nν(s) ν(s) ν(s) en− · V A TSLP , X ν(T , SLP ) (9A) nν(TSLP ) N X en (t) + µ∗t Ien (t) = x0 1{t=t } + Peni ν (t) + C ν ν 0 i=1 N X Seni ν (t) + lt , (10A) i=1 for t = t0 , . . . , TSLP − 1, ν(t) = 1, . . . , Kt , i eni en X = 1 + Rni ν(t) X 1 + Peni ν(t) − Seni ν(t) , ν(t) ν(t−1) {t>t0 } (11A) for t = t0 , . . . , TSLP − 1, ν(t) = 1, . . . , Kt , i = 1, . . . , N, nν(t−1) is the ancestor of nν(t) . e − + Ien > 0, X nν(t) ν(t) en C > 0, ν(t) 4.1.2 for t = t0 , . . . , TSLP − 1, ν(t) = 1, . . . , Kt . (12A) Problem (B) The problem of maximizing the expected utility of retirement benefits, where the investment process is controlled both before and after retirement, whereas the benefits are controlled only after retirement, can be modelled using SLP formulation as follows. The objective function (9B) is a discrete version of eq. (8B) where the integrals are substituted with the sums and the expectation operator E with its discrete definition. The budget constraint (10B) specifies that the amount invested in the purchase of new securities plus consumption must be equal to the amount gained from the sale of the securities plus premium that is paid to the savings account plus any savings x0 that the person has at the beginning of the contract. We also add en (t) to the right hand side of the balance equation, which is the price that the pension fund the term µ∗t X ν pays the pension saver to be her only heir, and the pension benefit to the side of the outgoing payments for t ≥ T. Constraint (11B) calculates the value of the savings on the pension account. This constraint is identical to the asset inventory balance for problem (A). Similarly as in problem (A), the utility function u is assumed to be strictly concave in c, and for γ < 0, we have u(0) = U (0) = −∞. Therefore, we assume that the benefits are positive, (12B), or, for γ ∈ (0, 1), non-negative. This leads to the following SLP formulation: max TSLP X−1 X P rnν(s) · e − Rs t0 µτ dτ en ) u(s, C ν(s) s=max(t0 ,T ) nν(s) X + P rnν(TSLP ) · e− R TSLP t0 µτ dτ en− · V B TSLP , X ν(T SLP ) , (9B) nν(TSLP ) N X en (t) 1{t≥T } = x0 1{t=t } + Peni ν (t) + C ν 0 i=1 N X Seni ν (t) + lt + µ∗t i=1 N X eni , X ν(t) (10B) i=1 for t = t0 , . . . , TSLP − 1 ν(t) = 1, . . . , Kt . i eni en X = 1 + Rni ν(t) X 1 + Peni ν(t) − Seni ν(t) , ν(t) ν(t−1) {t>t0 } for (11B) t = t0 , . . . , TSLP − 1, ν(t) = 1, . . . , Kt , i = 1, . . . , N, nν(t−1) is the ancestor of nν(t) , en (t) > 0, C ν for t = max(t0 , T ), . . . , TSLP − 1, ν(t) = 1, . . . , Kt : 10 (12B) 4.2 Additional constraints The reason for modelling the first years decisions with SLP is that this optimization method can easily capture the practical constraints such as limits on portfolio composition, limits on the insured sum, transaction costs, and taxes on capital gains. Below, we present how to modify the original constraints or add other constraints to the SLP formulation presented in Sec. 4.1. Limits on portfolio composition. A feature that is interesting from the point of view of a private investor, a pension saver and a pension fund, is the limit on the portfolio composition. For instance, no pension fund allows for borrowing or short selling of the assets. Alternatively, if the investor has certain preferences about the minimum and maximum percentage of her wealth invested in a certain asset, we would like to include it in the optimization model. The limits on the portfolio composition can be incorporated in the SLP formulation by adding the following constraints for t = t0 , . . . , TSLP , ν(t) = 1, . . . , Kt and i = 1, . . . , N : eni ≥ di X ν(t) N X eni , X ν(t) eni X ≤ ui ν(t) i=1 N X eni , X ν(t) (16) i=1 where di and ui are the lower and upper limits for the holding of asset i. In particular, di and ui can be extended to be time dependent, which would be suitable for a person with specific preferences only for a certain year. Limits on the insured sum. Richard (1975)’s model does not assume any constraint on the size or the sign of the insured sum It . Including such a constraint in the dynamic programming approach is definitely not trivial to solve and to our knowledge the explicit solution for this case has not been derived. The SLP model allows for adding the limits on the insured sum in a straightforward way: Ienν(t) ≥ dins , Ienν(t) ≤ uins . (17) As shown later in Sec. 5.1, being able to control the sign of the insured sum is especially important from the practical point of view. A negative insured sum means that each period the pension fund pays the amount µ∗t Ienν(t) in order to receive a part of the individual’s savings equal to Ienν(t) upon her death. This situation describes purchasing life annuities. In reality the individual is not allowed to purchase life annuities before she retires, so being able to set dins = 0 for t < T is an important extension. Transaction costs. Similarly, the presence of transaction costs q i makes the considered problems more realistic. Investigating how the transaction costs affect the optimal controls is of practical importance. We add the costs as a percentage of the traded amount by modifying the budget equations (10A) and (10B). We subtract the costs from the amount of the assets sold in a given period, and add the costs to the amount purchased. The modified budget equations are defined as follows: N X en (t) + µ∗ Ien (t) = x0 1{t=t } + Peni ν (t) (1 + q i ) + C t ν ν 0 i=1 N X N X Seni ν (t) (1 − q i ) + lt , (10A’) i=1 en (t) 1{t≥T } = x0 1{t=t } + Peni ν (t) (1 + q i ) + C ν 0 i=1 N X Seni ν (t) (1 − q i ) + lt + µ∗t i=1 N X eni , X ν(t) (10B’) i=1 for t = t0 , . . . , TSLP − 1 and ν(t) = 1, . . . , Kt . Taxes on capital gains. Worth consideration are also taxes on capital gains τ i . Generating a scenario tree for the SLP model allows us to subtract taxes only from the positive capital gains, i.e. ( Rni ν(t) (1 − τ i ), Rni ν(t) > 0, i (18) net Rnν(t) = Rni ν(t) , Rni ν(t) ≤ 0. This requires changing the asset inventory balance constraints (11A) and (11B) as follows: i eni en X = 1 + net Rni X 1{t>t } + Peni − Seni , ν(t) ν(t) ν(t−1) 0 ν(t) ν(t) 11 (11A’,11B’) for t = t0 , . . . , TSLP − 1, ν(t) = 1, . . . , Kt and i = 1, . . . , N . This is also convenient in contrast to the stochastic optimal control approach, where the asset prices are modelled by the Black Scholes model, and one can only adjust the expected returns and volatility of the assets by introducing αi,net = αi (1 − τ i ) and σi,net = σi (1 − τ i ). The taxes are then subtracted from both positive (gains) and negative (losses) returns. The latter can be interpreted as a possibility to deduct the taxes from the negative capital income. Other modifications. There are plenty of economic factors whose impact on the optimal control would be interesting to investigate but are beyond the scope of this paper. Probably the most relevant from a practical perspective is modelling the asset returns using a different model than Black Scholes. Geyer et al (2009b) extended their previous work by applying the first-order unrestricted vector autoregressive process (VAR) to model asset returns. They find that there is a substantial difference in asset allocation which reflects the impact of time-varying investment opportunities, which, not surprisingly, shows the sensitivity of the model to the assumptions in the scenario generation. Note that all the constraints are linear, though the objective function is not. We prefer the linear solvers due to their robustness and speed, therefore in the next section we describe how to linearize the objective function. 4.3 Linearization To be able to use linear programming solvers for solving problems (A) and (B), the objective function in each problem needs to be linearized. Alternatively, the problems could be solved using the nonlinear optimization techniques, however the linear solvers are preferable. One of the differences between our work and Geyer et al (2009a) lies in the method of linearization. Our approach differs in two ways: • the position of breakpoints, • the formula for the linear approximation. In this section we describe our method of linearization, whereas in Appendix D we provide the evidence that this method is more suitable for our problem. Breakpoints. The linearization of the objective function requires choosing the number and the position of breakpoints. This choice can affect the standard errors of the SLP optimization results. The parameter m denoting the number of breakpoints is chosen freely, as well as the minimum and maximum breakpoint. To obtain the positions of the remaining breakpoints we solve a non-linear optimization problem, where the objective is to maximize the area underneath the power function: X X (19) max (bpm − bpj ) h(bpj ) − h(bpj−1 ) + 0.5 (bpj − bpj−1 ) h(bpj ) − h(bpj−1 ) , bpj ,j=1,...,m j6=m ∀j h(bpj ) = j6=1 γ 1 γ bpj . (20) The code for the optimization problem is accessible in the online GAMS library, Rasmussen (2011). The solution to this problem is locally optimal which is sufficient for our purposes. In a situation, when the solution consists of the repetitive breakpoints (∃i6=j bpj = bpi ), we use a heuristic method and rerun the model for different minimum and maximum breakpoints, bp1 and bpm , in order to find m different points. The choice of bp1 and bpm is crucial (next to the choice of the number of breakpoints m) and reflects how well we can reproduce the closed-form solutions for the optimal controls. We estimate bp1 and bpm using the properties of geometric Brownian motion, which allow for deriving the explicit formulae for the expectation of the savings process with incorporated optimal controls, E[Xt∗ ], see App. C, hence also the expected values of E[c∗t ], E[πt∗ ] and E[It∗ ]. These values can guide us in specifying the possible range of the outcomes of the SLP variables, i.e. the length of the intervals [bp1 , bpm ]. 12 Exact utility vs. linearized utility of consumption 4 −3 x 10 exact utility linearized utility −4 −5 −6 −7 −8 −9 n0 n30 n60 n90 n120 n150 n180 nodes n210 n240 n270 n300 n330 Figure 3: Exact utility versus linearized utility of consumption for the nodes associated with the first five periods in problem (A). When the cross and circular markers cover each other, the linearization is accurate. Minimum breakpoint, bpc1 = 0.7E[c∗T ]= SLP ] = 0.042, number of breakpoints, m = 40. 0.015, maximum breakpoint, bpcm = 2E[c∗T SLP Approximating function. Since the utility functions and the optimal value functions are concave, they can be approximated as closely as desired by m−1 linear segments between the breakpoints bpj (j = 1, . . . , m), Hillier et al (2010). The argument x is defined in terms of non-negative decision variables zj associated with each segment x=ε+ m X zj , (21) j=1 where ε ≥ 0 is the lower limit of the argument of the function chosen as small as possible for γ ∈ (−∞, 0) and ε = 0 for γ ∈ (0, 1), subject to constraints: 0 ≤ z1 ≤ bp1 − ε, 0 ≤ zj ≤ bpj − bpj−1 , j = 2, . . . , m − 1, 0 ≤ zm , with a special restriction zj+1 = 0 whenever zj < bpj − bpj−1 . (22) For any concave function h(x), its slope decreases as zj is increased, and the algorithm for maximizing h(x) automatically gives the highest priority to using z1 , the next highest priority to using z2 , and so on. Thus, the special restriction is implicitly included in the model and all the constraints can be modelled without the use of indicator variables. Using the above, any concave function h(x) is approximated by: h(x) ≈ h(ε) + m X h(bp j )−h(bpj−1 ) zj . bpj −bpj−1 (23) j=1 As explained in App. D the results of the linerization are sensitive to the choice of the minimum and maximum breakpoints, the length of the interval, and the number of breakpoints. The approximations are best for short intervals and for small values of x such as x ∈ [0.01, 1]. Therefore, to ensure a good approximation, we rescale all input data regarding payments, e.g. initial savings, x0 , and labor income/premiums, lt . Fig. 3 shows an example of the accurate approximation of the utility of consumption for each node associated with the first five years of the contract (341 nodes). 4.4 Scenario tree The uncertainty associated with the market returns is modeled by an N -dimensional random process. The multivariate return process evolves in discrete time, and the underlying probability distributions are approximated by discrete distributions in terms of a scenario tree. 13 Our goal is to reproduce the model assumptions that lead to the explicit solutions in the stochastic optimal control approach. The prices for the securities follow the Black Scholes model defined in eq. (1), therefore, we aim for constructing a multi-period scenario tree with a discrete representation of a normal distribution N αi − σi 2 /2, σi . There are plenty of scenario generation methods for stochastic programming, whose evaluation has been described in Kaut and Wallace (2005). To find a discrete representation of a normal distribution, while keeping the minimum number of nodes, we choose to follow the approach presented in Høyland and Wallace (2001), i.e. we match the first four moments plus the correlations of the simulated processes. This algorithm has been improved in Høyland et al (2003), where a scenario tree is obtained by a number of transformations. According to the authors, the new algorithm is efficient for large trees, and does not require solving a nonlinear optimization problem, which can be time consuming. For our work, however, we choose the older algorithm, i.e. Høyland and Wallace (2001). In our experiments it performs better for small trees, and allows to define the probabilities of each branch as variables (in contrast to Høyland et al (2003) which requires the pre-specified probabilities). The approach is improved by a priori adding the non-arbitrage constraints suggested by Klaasen (2002). 5 Numerical results In this section we first present realistic examples illustrating problems (A) and (B) in the original model setup (continuous time) and the SLP solutions obtained by the SLP model presented in Sec. 4.1. Afterwards, we add or modify the particular constraints as described in Sec. 4.2 and investigate their impact on the optimal controls during the first years of the contract. Parameters. If not specified in the captions of the tables and figures, the following parameters have been chosen for testing the models: • Market: the number of assets, N = 3; two risky and one risk-free asset, the expected returns are, respectively, α1 = 0.05, α2 = 0.07, r = α3 = 0.02, the volatility of the assets, σ1 = 0.2, σ2 = 0.25, σ3 = 0, and the correlation between the risky assets is corr = 0.5; all parameters are adjusted for inflation. • Utility function: risk aversion, 1 − γ = 4, corresponding to the optimal proportion in risky asset after retirement πt∗ equal to 25%, the impatience factor for the utility weights, ρ = 0.04, intuitively chosen such that ρ ≥ r, and the weight on the utility of bequest relatively to the utility of consumption, λ = 5, implying that the optimal amount which the inheritors receive upon death is approximately equal to five years of the person’s consumption while she is alive, Xt∗ + It∗ ≈ 3.5c∗t (λ is only relevant for problem (A)). • Contract: age at the beginning of the contract, age0 = 45, retirement age, ageT = 65, the final age at which the person is assumed to be dead with certainty, Te = 110, type of retirement benefits (problem (B)): life annuity. • Lifetime uncertainty: the mortality intensity model is of the form µt = µ∗t = θ + 10β+δ(y+t)−10 , where y is the age of the person at time t0 , and θ, β and δ are constants2 . The model has been calibrated to the mortality rates among Danish women in 2010 over age 40, Finanstilsynet (2010), where θ = 0.0, β = 4.59364, δ = 0.05032. • Scenario tree: number of SLP stages, TSLP = 6, branching factor in a single tree (number of nodes), bf = 4, number of trees notree = 50, which implies in total 45 × 50 = 51.200 scenarios. • Linearization: number of breakpoints, m = 40, minimum and maximum breakpoints for the consumption, bpc1 = 0.7E[c∗TSLP ], bpcm = 2E[c∗TSLP ], minimum and maximum breakpoints for the savings plus the x+g = 2E[XT∗SLP + gTSLP ], present value of labor income/premiums, bp1x+g = 0.5E[XT∗SLP + gTSLP ], bpm x+ins minimum and maximum breakpoints for the savings plus insured sum, bp1 = 0.8E[XT∗SLP + ∗ x+ins ∗ ∗ ITSLP ], bpm = 1.5E[XTSLP + ITSLP ], 2 In principle the mortality intensity model does not assume that an individual is dead at time T e with probability 1. However, for Te = 110, this error is negligible. 14 • Payments: – Problem (A). Labor income lt = 27.000 EUR, corresponding to the average Danish disposable income (after taxes) for a 45-year old individual, Danmarks Statistik (2010), average savings of a 45-year old individual, x0 = 60.000 EUR, – Problem (B). Before retirement: premiums, lt = 4.000 EUR, corresponding to 15% of the salary. Assuming that the person has been contributing to the pension account for around 15 years, accumulated with some capital gains, gives the value of the initial savings of x0 = 75.000 EUR; After retirement: premiums, lt = 0, the value of the savings is estimated from the expected savings B for a 70 year old person (see before retirement case), i.e. x0 = E[X25 ] = 225.000 EUR. Similarly as Geyer et al (2009a), we have chosen to consider the small-scale optimization problems rather than the large-scale problems. On the first place, we are interested in replicating the optimal controls obtained from the continuous time models. The small-scale models are sufficient to evaluate whether a similar solution can be achieved by running the SLP model. With only three assets: one risk-free and two risky assets, the first four moments of a normal distribution can be approximated with 4 nodes. 6 periods with 4 nodes each give in total 45 = 1.024 scenarios. Additionally, we rerun each program notree times for different scenario trees, and present the results in terms of means and standard errors from sampling. We have tested the results for the various number of trees, and choose notree = 50, since for ntree > 50 the results remain unchanged. Each problem has been implemented using Matlab 7.12.0 (R2011a) for calculation of the explicit solutions, GAMS 23.7.3 with CONOPT 3 (3.15A) solver for scenario generation and finding the position of breakpoints, and GAMS with CPLEX 12.3.0.0 solver for the SLP formulation and solution of the stochastic programming models. The running time of each model for one scenario tree takes approximately 20 seconds on a Dell computer with an Intel Core i5-2520M 2.50 GHz processor and 4 GB RAM. 5.1 Numerical results for Problem (A) We start with analyzing the optimal controls obtained explicitly by solving Richard (1975)’s model and the SLP solutions replicating this model (no additional constraints). Fig. 4 (top left and right) shows the development in the expected values of the savings account, Xt∗ , optimal consumption, c∗t , and insured sum, It∗ , in 1000 EUR (left figure), as well as the optimal proportion in both risky assets, πt∗ , and optimal proportion in the first risky asset πt∗1 (right figure) during the entire lifetime of the individual. The markers, which represent the optimal decisions obtained from the SLP model, closely follow the continuous lines representing the explicit solutions. Table 1 shows the exact numbers plotted on the figure for the first five years of the contract. Since we compare a continuous time model with a combined continuous and discrete time model, we expect the discrepancies. The explicit solutions are calculated based on the assumption that the labor income, consumption, insured sum, as well as capital gains occur simultaneously. In the SLP solutions, we first account for the payments, and then make the investment decisions based on the value of the savings after the payments. Therefore, the investment control obtained in the SLP formulation should be compared with the average continuous time investment over the two adjacent time periods. Furthermore, with each time period the discretization error accumulates, which can be seen on the development of the values of savings, Xt∗ , hence the discrepancies in all optimal controls are expected to occur. The standard deviations calculated from testing different scenario trees are negligibly small, which means that increasing the number of scenarios will not necessarily improve the precision of the results. Richard (1975)’s model implies that the initial proportion in risky assets for our choice of parameters is given by πt∗0 = 181%, where the distribution between the risky assets is specified by the mutual fund theorem as πt∗1 = 0.33πt∗0 and πt∗2 = 0.67πt∗0 , see Appendix B. The optimal solution favours the asset with a higher 0 0 return but also a higher volatility, therefore πt∗1 < πt∗2 , ∀t . These proportions decrease smoothly over time as the individual manages to increase her savings due to labor income and capital gains. After retirement πt∗ remains constant, which is a classical result known in the literature. The SLP solution closely follow the explicit solutions, though it consistently remains a few percentages below the original solutions. In particular, the SLP solution slightly favors the first risky asset - with a higher return and higher volatility - and the ratio πt∗1 /πt∗ = 36%. The optimal consumption rate increases at a very slow pace over the entire lifetime. The increase is caused by the choice of impatience factor ρ and risk tolerance γ in relation to the choice of market parameters αi , σi , and mortality rates, as explain later in the analysis of Figure 6. An interesting fact is that for our parameters choice the insured sum It∗ is positive only during the first year of the contract. Afterwards, It∗ is negative, meaning that it is optimal to purchase the life annuities which would increase 15 Expected value of the savings, optimal consumption and insured sum Optimal investment in risky assets X*t − expl. X*t − SLP −I*t − expl. −I*t − SLP c*t − expl. c*t − SLP 350 300 250 200 150 π*1 − SLP t 2 π*t − SLP π*1 − expl. t 1.5 π*t − expl. 1 100 0.5 50 0 45 65 − retirement 0 45 85 55 Optimal investment in risky assets, π*t 1.8 Optimal investment in the first risky asset, π*1 t 1.8 π*t : orig. constraints 1.6 1.2 π*t : qi=0.5% 1.4 π*t : τi=20% 1.2 1 1 0.8 0.8 0.6 0.6 0.4 0.4 0.2 0.2 0 45 46 47 48 49 π*t : orig. constraints 1.6 π*t : πt ≤ 1 and It ≥ 0 1.4 65 − retirement 0 50 π*t : πt ≤ 1 and It ≥ 0 π*t : qi=0.5% π*t : τi=20% 45 46 47 48 49 50 Optimal insured sum, I*t 10 0 −10 −20 −30 I*t : original constraints −40 I*t : qi=0.05% −50 I*t : πt≤ 1 and It ≥ 0 I*t : τi=20% 45 46 47 48 49 Figure 4: Problem (A). Explicit solution and SLP solution for the expected value of the savings, optimal consumption and insured sum (left, top, in 1000 EUR). Optimal investment in both risky assets and in the first risky asset (right, top, πt∗ = πt∗1 + πt∗2 ). The SLP solution for the expected optimal investment in both risky assets (left, middle) and in the first risky asset (right, middle) given different constraints. The SLP solution for the expected optimal insured sum given different constraints (bottom, in 1000 EUR). the individual’s savings rather than to purchase a life insurance from the pension company. This result is also consistent with Richard (1975) conclusions. Recall that the amount that is paid out to the inheritors upon the person’s death is not equal to the insured sum, but to the value of the savings plus the insured sum. Therefore It∗ can be negative as long as Xt∗ + It∗ > 0. The size and the sign of insured sum depends on parameter λ. Choosing λ = 1 means that the individual puts an equal weight on her consumption and the bequest motive, therefore the optimal legacy amount is approximately equal to the optimal consumption while the person is alive, i.e. Xt∗ + It∗ ≈ c∗t . Fig. 4 and Table 1 show the case for λ = 5, where the optimal death sum corresponds to approximately five years of optimal consumption while the person is alive, i.e. Xt∗ + It∗ ≈ 3.5c∗t . The example takes as input some realistic parameters, however, not all the optimal controls are actually realistic. First, the insured sum is negative for t > t0 . In practise the life annuities can first be offered to a person when she retires, and not 20 years before retirement. Secondly, πt∗ > 1 for t ≤ t3 . It may happen that the individual does not have an opportunity to borrow money and short the risk-free asset. We have 16 control model expl. orig. constraints πt ≤ 1 and It ≥ 0 Xt∗ q i = 0.5% τ i = 20% expl. orig. constraints πt ≤ 1 and It ≥ 0 πt∗ q i = 0.5% τ i = 20% expl. orig. constraints πt ≤ 1 and It ≥ 0 πt∗1 q i = 0.5% τ i = 20% expl. orig. constraints π t ≤ 1 and It ≥ 0 c∗t q i = 0.5% τ i = 20% expl. orig. constraints πt ≤ 1 and It ≥ 0 It∗ q i = 0.5% τ i = 20% t0 60.0 60.0 (0.0) 60.0 (0.0) 60.0 (0.0) 60.0 (0.0) t1 72.3 72.9 (0.1) 70.9 (0.0) 71.3 (0.0) 68.5 (0.2) t2 84.9 85.9 (0.2) 82.6 (0.1) 83.9 (0.1) 77.4 (0.4) t3 97.6 99.0 (0.3) 94.9 (0.1) 97.0 (0.2) 86.5 (0.5) TSLP − 1 110.5 112.3 (0.4) 107.6 (0.1) 110.3 (0.2) 95.8 (0.7) 1.81 1.79 (0.02) 1.00 (0.00) 1.44 (0.02) 0.81 (0.09) 1.51 1.49 (0.02) 1.00 (0.00) 1.39 (0.01) 0.70 (0.08) 1.30 1.26 (0.01) 1.00 (0.00) 1.25 (0.01) 0.61 (0.07) 1.12 1.10 (0.01) 0.96 (0.01) 1.12 (0.01) 0.54 (0.07) 0.99 0.96 (0.02) 0.89 (0.02) 1.03 (0.02) 0.48 (0.06) 0.60 0.64 (0.02) 0.14 (0.03) 0.46 (0.02) 0.06 (0.06) 0.50 0.53 (0.02) 0.21 (0.01) 0.44 (0.01) 0.06 (0.05) 0.43 0.46 (0.02) 0.26 (0.02) 0.42 (0.01) 0.05 (0.05) 0.37 0.40 (0.02) 0.28 (0.02) 0.39 (0.01) 0.04 (0.05) 0.33 0.35 (0.02) 0.28 (0.03) 0.37 (0.01) 0.04 (0.05) 20.8 20.7 (0.0) 20.7 (0.0) 20.7 (0.0) 20.7 (0.0) 20.8 20.8 (0.1) 20.6 (0.1) 20.6 (0.1) 20.4 (0.1) 20.8 20.9 (0.1) 20.7 (0.0) 20.7 (0.0) 20.3 (0.1) 20.8 20.9 (0.1) 20.7 (0.0) 20.7 (0.0) 20.3 (0.1) 20.8 21.0 (0.1) 20.7 (0.0) 20.7 (0.0) 20.1 (0.1) 9.5 9.8 (0.4) 9.3 (0.0) 9.0 (0.2) 8.8 (0.3) -2.8 -3.2 (0.2) 4.8 (0.3) -1.9 (0.2) -0.2 (0.3) -15.2 -16.0 (0.3) 3.1 (0.2) -14.5 (0.2) -9.4 (0.4) -27.9 -29.1 (0.4) 1.7 (0.1) -27.4 (0.2) -18.8 (0.6) -40.8 -42.3 (0.5) 0.9 (0.1) -40.6 (0.2) -28.4 (0.7) Table 1: Problem (A). Explicit solution and SLP solution with various modifications for the expected value of the savings, optimal consumption and insured sum (in 1000 EUR) and optimal investment in both risky assets and in the first risky asset (πt∗ = πt∗1 + πt∗2 ). The numbers are presented in terms of means and standard errors (in parenthesis). mentioned that one of the drawbacks of dynamic programming is that the explicit solutions exist only if we consider simple models, such as the one above. As soon as we add realistic constraints, such as, not allowing for purchasing a life annuity before retirement, or not allowing for being short in any asset, the explicit solutions cannot be derived. Being able to add realistic constraints is the main reason to include the SLP 17 part, and hence to combine the stochastic optimal control approach with stochastic programming approach into one formulation. We have shown that the closed-form solutions can be replicated well. The numbers are not the exact match, but the patterns and the level of the optimal controls are consistent with the continuous time models. Therefore, we find the model reliable and use it as a tool to analyze the optimal controls under the effects of various modifications during the first years of the contract. A borrowing constraint and an annuity purchase constraint. Figure 4 (middle left and right, and bottom plot) and Table 1 show the optimal solutions that simultaneously account for two additional constraints: (i) the individual does not have a possibility to borrow money in order to invest more than she has in other assets (this constraint is defined in eq. (16) where ui = 100%), and (ii) the individual is not allowed to purchase a life annuity before retirement, It > 0 for t < T , as defined in eq. (17). As expected, the constraints directly affect the optimal investment and insured sum. The optimal investment πt∗ is now equal to its upper limit 100% and decrease faster than in the original model setup. The constraint naturally also implies a change in the distribution between the risky assets. Before, the ratio of the first risky asset to the total risky investment was given by πt∗1 /πt∗ ≈ 36%. With the new constraints the ratio starts with 14% at t0 and with time converges to the original optimal ratio, which implies that πt∗1 increases until the optimal ratio between the risky assets is reached. The optimal insured sum is positive and converges to its lower limit 0. Finally, the value of the savings, and as follows, the optimal consumption, is lower, due to more conservative investment and positive costs for the insured sum µ∗t It . Transaction costs. Secondly, we investigate the effect of transaction costs q i . These have been defined as a constant percentage of a traded amount and are the same for purchases and sales and for all assets, see eq. (10A’) As a result, the amount invested in risky assets is lower during the first few years than in the original problem, but after a few years stays above the original πt∗ . We conclude that introducing the transaction costs implies that the development of the risky investment over time is smoother, i.e. it decreases as in all other cases but the difference between the maximum and minimum value is smaller. We explain it by the fact that rebalancing portfolio is expensive. A smoother optimal investment implies the lower traded amount, hence lower costs. Other optimal controls, i.e. consumption and insured sum, are not much affected by the transaction costs. As well as the total savings, the optimal values are slightly lower (lower absolute value for the insured sum). Fig. 4 (middle, left and right) shows the results for q i = 0.5%, but we have also investigated higher costs, q i = 1%, and the patterns for optimal controls remain similar. Taxes on capital gains. Finally, we are interested in the impact of capital gains taxes on the optimal decisions. We have integrated a linear taxation on the positive returns on asset i, τ i , as defined in eq. (18). In comparison with other constraints, the taxes on capital gains are the most influential for the optimal decisions. Only τ i = 20% causes already a loss of 17% of the expected savings after 5 years relatively to the original problem. Less savings implies less money to consume and to be paid for the life annuity. Both the consumption and the absolute value of the insured sum decrease. But the biggest impact can be seen on the investment controls. The optimal percentage in stocks is only πt∗0 = 81% comparing to the original πt∗0 = 179%, while the proportion in the first risky asset is only πt∗1 = 6%. Subtracting taxes only from 0 the positive returns causes that the scenario tree is asymmetric and favours the risk-free assets (the positive outcomes are lower, whereas the negative outcomes are as severe as they were before). A similar conclusion has been drawn by Geyer et al (2009a). The SLP solutions are also more volatile for this case than for other considered constraints. Especially, the investment controls varies significantly for different scenario trees. 5.2 Numerical results for problem (B) This optimization problem is relevant for a pension product design in a defined contribution pension scheme. In practice the individual cannot control the size of the pension benefits. This decision belongs to the pension fund. However, the pension fund can take into consideration the pension saver’s risk preferences γ, the market parameters hidden under the variable ϕ, the impatience factor ρ and both regular and pricing mortality intensities µt and µ∗t and design the benefits (payout profile) such that the individual’s preferences are included. t Xt , see eq. (7B). Inserting the definitions of The optimal pension benefits are of the form c∗t = fBw(t) 18 functions wt and fB (t) leads to ct = Xt āy+t , t ∈ [T, Te) (24) where āy+t is a traditional single-life annuity that pays 1 EUR per year continuously to an individual who is y + t years old, i.e. Z Te R − ts r̄+µ̄τ dτ āy+t = e ds, t where r̄ = 1 1−γ ρ − γ 1−γ ϕ, ϕ=r+ (α−r)2 2σ 2 (1−γ) , µ̄τ = 1 1−γ µτ γ ∗ 1−γ µτ . − Parameters r̄ and µ̄t are the utility adjusted interest rate and mortality rate, respectively. Figures 5 and 6 explain the concept of optimal annuity design and show the expected optimal life annuities c∗t for the individuals with different risk aversion 1 − γ and impatience factor ρ. Fig. 5 shows the initial value of the benefit at time of retirement, c∗T , and its sensitivity on γ and ρ. Given the value of the savings account XT = 265.000 EUR and no premiums after retirement, the size of the life annuities for t = T varies between 12.800 EUR and 19.500 EUR. The lowest initial benefit is for patient persons who prefer more risky investment, i.e. those characterized by f.ex. ρ = 0.0 and γ = 0, whereas the highest benefits will be received still by those who prefer more risky investment but are impatient, i.e. ρ = 0.04 and γ = 0. The tendency is similar for the risk averse retirees - the impatient ones will receive the highest annuities in the beginning of the retirement. 19 18 17 16 15 14 13 0.04 0.02 0 0 −0.5 −1 −1.5 −2 −2.5 −3 γ ρ Figure 5: Sensitivity of the optimal life annuities at the beginning of retirement, c∗T , to risk aversion 1 − γ and impatience factor ρ. The numbers are in 1000 EUR. xT = 265. Other parameters as specified in Sec. 5. Fig. 6 shows the development of the size of the benefits during retirement (the payout profile) for different values of γ and ρ. We immediately notice that the optimal payout profile is different for individuals with different patience levels and risk preferences. The annuity payments can be constant, decreasing or increasing. To be precise, it is the value of the impatience factor ρ relatively to the average return on investment adjusted for the risk aversion level 1−γ that drives the wealth process, which controls the development of the payments. Impatient persons (f.ex. with ρ = 0.08) wish to receive the highest benefits in the first years of the retirement, whereas those who are indifferent between the benefits received just after retirement or later in the future (ρ = 0.0) receive lower benefits in the beginning. The latter allows for larger capital gains during retirement, thus, those who are patient and survive until age 95, will on average receive between 2.5-5 times higher benefit at age 95 (depending on their risk aversion) than upon retirement. Parameter γ affects mostly the size of the benefits. The more risky investment gives on average a higher return, hence higher benefits. Furthermore, risk averse persons (f.ex. γ = −3) will receive each year smoother benefits (the difference between c∗65 and c∗95 is smaller) than for a person with a low risk aversion, such as γ = 0. 19 model t0 t1 t2 t3 TSLP − 1 Xt∗ expl. orig. constraints πt∗ expl. orig. constraints πt∗1 expl. orig. constraints 75.0 75.0 (0.0) 0.45 0.44 (0.01) 0.15 0.16 (0.01) 82.0 82.2 (0.0) 0.42 0.42 (0.01) 0.14 0.15 (0.01) 89.1 89.6 (0.0) 0.40 0.40 (0.01) 0.13 0.15 (0.01) 96.5 97.2 (0.1) 0.38 0.39 (0.01) 0.13 0.14 (0.01) 104.1 105.1 (0.1) 0.37 0.36 (0.01) 0.12 0.13 (0.01) 225.0 225.0 (0.0) 225.0 (0.0) 225.0 (0.0) 225.0 (0.0) 216.9 216.7 (0.0) 216.7 (0.0) 215.7 (0.1) 214.3 (0.1) 208.7 208.5 (0.1) 208.6 (0.2) 207.4 (0.1) 203.8 (0.1) 200.6 200.2 (0.2) 200.3 (0.2) 198.9 (0.1) 193.4 (0.2) 192.5 191.8 (0.2) 192.0 (0.2) 190.4 (0.1) 183.0 (0.3) 0.25 0.25 (0.00) 0.29 (0.00) 0.26 (0.00) 0.11 (0.01) 0.25 0.25 (0.00) 0.29 (0.00) 0.25 (0.00) 0.11 (0.01) 0.25 0.25 (0.00) 0.29 (0.00) 0.25 (0.00) 0.11 (0.01) 0.25 0.25 (0.01) 0.29 (0.00) 0.25 (0.00) 0.11 (0.01) 0.25 0.25 (0.01) 0.29 (0.00) 0.24 (0.00) 0.11 (0.01) 0.08 0.09 (0.00) 0.15 (0.00) 0.09 (0.00) 0.01 (0.01) 0.08 0.09 (0.00) 0.15 (0.00) 0.09 (0.01) 0.01 (0.01) 0.08 0.09 (0.00) 0.15 (0.00) 0.09 (0.00) 0.01 (0.01) 0.08 0.09 (0.01) 0.15 (0.00) 0.09 (0.00) 0.01 (0.01) 0.08 0.09 (0.01) 0.15 (0.00) 0.08 (0.00) 0.01 (0.01) 17.8 17.8 (0.0) 17.8 (0.0) 17.7 (0.1) 17.3 (0.0) 17.9 17.7 (0.1) 17.7 (0.1) 17.6 (0.1) 17.1 (0.1) 17.9 17.7 (0.0) 17.7 (0.0) 17.6 (0.0) 17.1 (0.0) 17.9 17.7 (0.0) 17.7 (0.0) 17.6 (0.0) 17.0 (0.0) 17.9 17.7 (0.0) 17.8 (0.0) 17.6 (0.0) 16.9 (0.0) control before retirement after retirement expl. orig. constraints π ∗1 ≥ 15% Xt∗ q i = 0.5% τ i = 20% expl. orig. constraints π ∗1 ≥ 15% πt∗ q i = 0.5% τ i = 20% expl. orig. constraints π ∗1 ≥ 15% πt∗1 q i = 0.5% τ i = 20% expl. orig. constraints π ∗1 ≥ 15% c∗t q i = 0.5% τ i = 20% Table 2: Problem (B) before and after retirement. Explicit solution and SLP solution with various modifications for the expected value of the savings and optimal benefits (in 1000 EUR) and optimal investment in both risky assets and in the first risky asset (πt∗ = πt∗1 + πt∗2 ). The numbers are presented in terms of means and standard errors (in parenthesis). 20 50 ρ=0.00, γ=−3 ρ=0.00, γ=−1 ρ=0.00, γ=0 40 30 20 10 65 − retirement 70 75 80 85 90 30 ρ=0.04, γ=−3 ρ=0.04, γ=−1 ρ=0.04, γ=0 20 10 65 − retirement 95 70 75 80 85 90 30 95 ρ=0.08, γ=−3 ρ=0.08, γ=−1 ρ=0.08, γ=0 20 10 65 − retirement 70 75 80 85 90 95 Figure 6: Optimal benefits c∗t for the first 30 years of retirement. Sensitivity of the benefits to various choices of the impatience factors ρ = {0.00, 0.04, 0.08} and risk preferences, γ = {−3, −1, 0}. The numbers are in 1000 EUR. xT = 265. Other parameters are as specified in Sec. 5. Basic constraints. Analogously as in Sec. 5.1 we present the results for problem (B) for the time before retirement, t < T , and after retirement, t > T , on Fig. 7 and Table 2. Fig. 7 shows the development in the expected values of the savings account, Xt∗ , and optimal life annuities, c∗t (in 1000 EUR), before retirement (top, left) and after retirement (middle, left). The figures on the right show the optimal proportion in the first risky asset πt∗1 during the lifetime of the individual for the case before retirement (top, right) and after retirement (middle, right). Given that at age 45 the individual has x0 = 75.000 EUR on her savings account and she continues to pay the premiums of lt = 4.000 EUR every year, which are invested in a conservative way, i.e. γ = −3, she should expect that her savings at retirement will reach 265.000 EUR, which will define the pension benefits at almost c∗t = 18.000 EUR at age 65. The size and development of the benefits is optimal given she puts more weight on the benefits just after retirement than on those in the future, and given she is risk averse (the percentage in risky assets at retirement is only about 25%), i.e. ρ = 0.04 and γ = −3. The investment before retirement is more conservative than in problem (A). The process lt denoting the pension contributions is much lower than in the previous case, where lt was defined as the labor income. The optimal investment control that we derive in problem (B) follows the classical result saying that the optimal investment is proportional to the sum of the value of the savings and the present value of the pension Xt +gB (t) contributions relatively to the value of savings, recall eq. (7B), πt∗ = σ2α−r . Therefore, πt∗ is much (1−γ) Xt lower than in problem (A) starting with 45% at age 45 and decreasing afterwards. After retirement, the optimal proportion in risky assets remains constant at 25%. The ratio of the investment in the first risky asset to the total risky assets according to the mutual fund theorem remains the same, i.e. πt∗1 /πt∗ = 33%, both before and after retirement. Similarly as in problem (A) we conclude that the SLP solution closely follows the optimal controls derived explicitly. The value of the savings obtained from the SLP formulation is slightly higher (before retirement) or lower (after retirement) with each time period, whereas as in problem (A) the ratio between the risky assets is a few percentage higher than for the continuous time model. As mentioned before, we expect the discrepancies due to the discretization. The SLP model, in contrast to the continuous time model, assumes that the decision on investment is made after all the payments occur, and not simultaneously. In contrast to problem (A), we find that the model provides a realistic solution that can be applied by a pension fund. Nevertheless, there is a number of constraints whose effect on the optimal control is worth investigating. Incorporating the pension saver’s preferences, such as the limits on the portfolio composition, or practical constraints, e.g. transaction costs and taxes on capital gains in a dynamic programming approach is non trivial. Therefore, we have included the SLP part in our model, which adds flexibility to the model 21 Expected value of the savings and optimal benefits Optimal investment in risky assets X*t − expl. X*t − SLP c*t − expl. 300 250 π*1 − SLP t 0.5 π*t − SLP π*1 − expl. t 0.4 π*t − expl. 200 0.3 150 0.2 100 0.1 50 0 45 65 − retirement 0 45 85 55 Expected value of the savings and optimal benefits 65 − retirement Optimal investment in risky assets X*t − expl. X*t − SLP c*t − expl. c*t − SLP 200 150 π*1 − SLP t π*t − SLP 0.25 π*1 − expl. t π*t − expl. 0.2 0.15 100 0.1 50 0.05 0 70 75 0 70 80 75 Optimal investment in risky assets, π*t 80 Optimal investment in the first risky asset, π*1 t π*t : orig. constraints π*t : orig. constraints 0.3 π*t : π1t ≥ 15% 0.3 π*t : π1t ≥ 15% 0.25 π*t : qi=0.5% 0.25 π*t : qi=0.5% π*t : τi=20% 0.2 0.15 0.15 0.1 0.1 0.05 0.05 0 45 46 47 48 49 π*t : τi=20% 0.2 50 0 45 46 47 48 49 50 Figure 7: Problem (B) before and after retirement. Explicit solution and SLP solution for the expected value of the savings and optimal benefits before retirement (left, top, in 1000 EUR). Explicit solution and SLP solution for the expected value of the savings and optimal benefits after retirement (left, middle, in 1000 EUR). Optimal investment in risky assets before retirement (right, top) and after retirement (right, middle). The SLP solution for the expected optimal investment after retirement given different constraints in both risky assets (bottom, left) and in the first risky asset (bottom, right). πt∗ = πt∗1 + πt∗2 . 22 and allows for investigating the desired factors. Limits on the portfolio composition. Imagine a 70 year old pension saver who simply is not satisfied with the way the pension fund invests her savings. In particular, she believes that the first risky asset is more valuable and does not agree that after retirement only 9% of her savings are invested in this asset (see Table 2, original constraints). She prefers a higher percentage in the first risky asset and defines her new preferences, for instance, as minimum 15% should be invested in the first risky asset. Such a constraint can easily be incorporated in the SLP model by adding eq. (16), where constant d1 is set to be equal 15%. The optimal solution for this case is shown on Figure 7 (middle left and right) and Table 2. Both the total investment in risky assets and the investment in each of the risky assets remain constant as before. Though, both values are now higher. The pension saver’s preferences are taken into account and πt∗1 raises from 9% in the original model setup to 15%, whereas the investment in both risky assets increases from 25% to 29%. The fact the πt∗ increases as well can be explained by the fact that there exists an optimal ratio between the risky assets which the model tries to remain as closely as possible. The SLP formulation allows for adding various limits on the portfolio. The limits can be both from above and below, and separately for each asset class. We have investigated various combinations of limits on a particular asset, and without presenting the numbers, we came to the following conclusions. None of these constraints have much effect on the overall value of the savings or the optimal benefits, but adding a limit just to one asset class directly affects the distribution between all three asset classes. Imposing a higher percentage on an asset class than the percentage given by the optimal control in the case without constraints, results in setting this asset exactly to this limit. And, vice versa, limiting the amount of savings in a given asset class from above when the original optimal investment suggest a higher amount, results in new optimal investment equal to the limit. Furthermore, the limit set on one of the risky assets, triggers the change in the same direction of the percentage of the second risky assets. Transaction costs. In problem (A) we concluded that adding transaction costs, defined as a percentage of a traded amount, implies a smoother development in the proportion invested in risky assets. In this case, where the original optimal investment is constant, the development of πt∗ is not constant as well, but actually decreases very slowly. The reason for that is that while the value of the savings decreases, trying to rebalance the portfolio each year to a constant proportion is more expensive than rebalancing to the decreasing proportion. As before, the changes in the optimal benefits are so small that it is hard to conclude how the transaction costs affect the benefits, except for the obvious fact, that the savings are slightly lower, which implies lower benefits. Taxes on capital gains. Finally, to strengthen the conclusions we drew from investigating the effect of adding taxes on the positive returns in problem (A), we repeat this investigation for problem (B). The results are similar: the impact of the taxes on the optimal investment is significant. Without taxes the optimal investment in both risky assets was given πt∗ = 25% and in the first risky asset πt∗1 = 9%. With the presence of taxes, the optimal controls are down to πt∗ = 11% and πt∗1 = 1%. Again, we explain this fact by the asymmetry in the scenario tree caused by subtracting taxes only from the positive capital gains. It is interesting though that taxes affect the size of the investment but not its development, i.e. the proportions remain constant as in the case of the original problem. Furthermore, the SLP solutions are not as volatile as for problem (A). The standard deviations are low, which shows a stability for different scenario trees. In terms of other controls, similarly as in problem (A) the taxes significantly affect the value of the savings, implying lower and decreasing pension benefits. 6 Conclusions and future work We have presented a model combining two optimization technologies: multi-period stochastic linear programming and dynamic programming, to obtain optimal decisions related to various problems within personal finance and retirement. The optimization methods complement each other very well. SLP allows for building flexible and practical models, which are difficult to solve with the stochastic optimal control approach. The explicit formulas, however, can handle any number of periods and are useful for the validation of the results obtained by the SLP. 23 The numerical results show that among considered limits on portfolio composition, limits on the insured sum, transaction costs and taxes on capital gains, the presence of taxes affects the optimal controls the most, and the changes in both the value of savings and the controlled processes are significant. Except for the contribution regarding computational methods, the paper introduces an important aspect regarding designing pension benefits in a defined contribution pension scheme. Problem (B) suggests how the pension fund could decide on the initial size of the life annuities and their development during the decumulation phase such that the individual’s risk and impatience preferences are captured. Interestingly, the pension saver’s preferences have a higher impact on the size and development of the benefits than limits on portfolio composition, transaction costs or taxes on capital gains. The model can be improved in various ways. The most interesting would be the extension of the SLP part to even more periods while still keeping the running time of the program within a couple of minutes. Another suggestion is to introduce the bond market instead of the risk-free asset and/or the uncertainty on the labor income/premiums. Finally, since the stochastic programming approach allows the objective function to take a variety of forms, it could be of interest to consider a different utility function, for instance, loss aversion framework, where an individual can control both the gains and losses in savings relative to a pre-defined target, or a multi-objective function, taking into account other preferences than risk aversion and impatience. References Birge JR, Louveaux F (1997) Introduction to Stochastic Programming, corrected edn. Springer Series in Operations Research and Financial Engineering, Springer Blake D, Wright D, Zhang Y (2013) Target-driven investing: Optimal investment strategies in defined contribution pension plans under loss aversion. Journal of Economic Dynamics and Control 37(1):195–209 Bruhn K (2012) Consumption, investment and life insurance under different tax regimes. Annals of Actuarial Science FirstView:1–26, DOI 10.1017/S1748499512000279, URL http://dx.doi.org/10.1017/ S1748499512000279, http://journals.cambridge.org/article_S1748499512000279 Bruhn K, Steffensen M (2011) Household Consumption, Investment and Life Insurance. Insurance: Mathematics and Economics 48(3):315–325 Byung HL, Yong HS (2011) Optimal investment, consumption and retirement decision with disutility and borrowing constraints. Quantitative Finance 11(10):1581–1592 Cai J, Ge C (2012) Multi-objective private wealth allocation without subportfolios. Economic Modelling 29(3):900–907 Cairns AJG, Blake D, Dowd K (2006) Stochastic Lifestyling: Optimal Dynamic Asset Allocation for Defined Contribution Pension Plans. Journal of Economic Dynamics and Control 30(5):843–877 Consigli G, Iaquinta G, Moriggia V, Tria MD, Musitelli D (2012) Retirement planning in individual assetliability management. IMA Journal Management Mathematics 23(4):365–396 Ferstl R, Weissensteiner A (2010) Cash management using multi-stage stochastic programming. Quantitative Finance 10(2):209–219 Finanstilsynet (2010) Benchmark for levetidsforudsaetninger. Accessed January 20, 2013, fom http://www. finanstilsynet.dk/levetider Gerrard R, Haberman S, Vigna E (2004) Optimal Investment Choices Post-retirement in a defined contribution pension scheme. Insurance: Mathematics and Economics 35(2):321–342 Gerrard R, Højgaard B, Vigna E (2012) Choosing the optimal annuitization time post-retirement. Quantitative Finance 12(7):1143–1159 Geyer A, Hanke M, Weissensteiner A (2009a) Life-cycle Asset Allocation and Consumption Using Stochastic Linear Programming. The Journal of Computational Finance 12(4):29–50 24 Geyer A, Hanke M, Weissensteiner A (2009b) A stochastic programming approach for multi-period portfolio optimization. Computational Management Science 6(2):187–208 Hillier FS, Lieberman GJ, Hillier F, Lieberman G (2010) Introduction to Operations Research, 9th edn. McGraw-Hill Science/Engineering/Math Horneff WJ, Maurer RH, Stamos MZ (2008) Life-cycle asset allocation with annuity markets. Journal of Economic Dynamics and Control 32(11):3590–3612 Høyland K, Wallace SW (2001) Generating Scenario Trees for Multistage Decision Problems. Management Science 48(11):1512–1516 Høyland K, Kaut M, Wallace SW (2003) A Heuristic for Moment-Matching Scenario Generation. Computational Optimization and Applications 24(2-3):169–185 Kaut M, Wallace SW (2005) Evaluation of scenario generation methods for stochastic programming. Pacific Journal of Optimization 3(2):257–272 Klaasen P (2002) Comment on: ”Generating Scenario Trees for Multistage Decision Problems”. Management Science 47(2):295–307 Merton RC (1969) Lifetime portfolio selection under uncertainty: the continuous-time case. The Review of Economics and Statistics 51(3):247–257 Merton RC (1971) Optimum consumption and portfolio rules in a continuous-time model. Journal of Economic Theory 3(4):373–413 Milevsky, Young (2007) Annuitization and asset allocation. Journal of Economic Dynamics and Control 31(9):3138–3177 Mulvey JM (2004) Applying Optimization Technology to Portfolio Management. Journal of Portfolio Management 30(SUPPL.):162–168+14 Mulvey JM, Simsek KD, Zhang Z (2006) Improving Investment Performance for Pension Plans. Journal of Asset Management 7(2):93–108 Mulvey JM, Ural C, Zhang Z (2007) Improving Performance for Long-Term Investors: Wide Diversification, Leverage, and Overlay Strategies. Quantitative Finance 7(2):175–187 Munk C, Sørensen C (2004) Optimal consumption and investment strategies with stochastic interest rates. Journal of Banking and Finance 28(8):1987–2013 Rasmussen KM (2011) Finding optimal breakpoints when linearizing a power utility function. Retrieved January 20, 2013, from http://www.gams.com/modlib/libhtml/cclinpts.htm Richard SF (1975) Optimal consumption, portfolio and life insurance rules for an uncertain lived individual in a continuous time model. Journal of Financial Economics 2(2):187–203 Danmarks Statistik (2010) Personindkomster. Accessed January 20, 2013, from http://www.dst.dk/da/ Statistik/emner/indkomster/personindkomster.aspx Zenios S (2008) Practical Financial Optimization: Decision Making for Financial Engineers. Wiley Appendices A Derivation of the optimal controls for problem (B) To our knowledge problem (B) has not been considered in the literature. We derive here the optimal value function and the optimal controls for the problem of maximizing the utility of consumption, where the control processes are specified over different time periods: investment is controlled both before and after retirement, 25 πt ∈ Q[t0 , Te), benefits are controlled only after retirement, ct ∈ Q[max(t0 , T ), Te). The problem is solved first for the period after retirement, where we refer to Bruhn and Steffensen (2011), and then for the period before retirement, which is one of the contributions of this paper. In the latter we insert the optimal value function obtained in after retirement period as the boundary condition at time T . After retirement period. The problem of optimizing expected utility of consumption after retirement in the case where the person controls investment πt∗ , life insurance purchase It∗ and consumption c∗t , and has no bequest motive has been solved in Bruhn and Steffensen (2011). The authors derive the optimal value function V (t, x) = γ1 ft1−γ xγ , (25) and the explicit formulas for the optimal controls πt∗ Xt = where Z ft = c∗t = α−r σ 2 (1−γ) Xt , Te e It∗ = 0, wt ft Xt , 1 R − 1−γ ts µτ −γ(µ∗ τ +ϕ) dτ ws ds, t ϕ=r+ (α−r)2 2σ 2 (1−γ) . (26) Before retirement period. We apply Hamilton-Jacobi-Bellman techniques to solve the problem. We first define the HJB equation for the considered period, based on the savings dynamics defined in eq. (2B): n o 2 (t,x) (t,x) ∂V (t,x) − µt V + sup lt ∂V∂x + r + πt (α − r) + µ∗t x ∂V∂x + 12 πt2 σ 2 x2 ∂ V∂x(t,x) = 0, 2 ∂t V (T, x) = π 1 1−γ γ XT . γ fT The boundary condition is equal to the optimal value function at time T given by eq. (25). To solve the problem we first guess the solution and verify that it is a correct guess: γ V (t, x) = γ1 ft1−γ x + gt . We find the functions ft and gt by plugging in the derivatives to the HJB equation: γ γ−1 ∂gt 1−γ −γ ∂ft + ft1−γ x + gt γ ft ∂t x + gt ∂t γ−1 1−γ γ 1 1−γ = µt γ ft (x + gt ) − lt ft x + gt − (r + µ∗t )(x + gt )ft1−γ (x + gt )γ−1 + (r + µ∗t )gt ft1−γ (x + gt )γ−1 − 2 1−γ 1 (α−r) (x 1−γ 2σ 2 ft + gt )γ , and obtain for t ∈ [t0 , T ): Z T Rs ∗ gt = e− t (r+µτ )dτ ls ds, (27) t ft = e 1 R − 1−γ tT (µτ −γ(µ∗ τ +ϕ))dτ fT = e 1 R − 1−γ tT (µτ −γ(µ∗ τ +ϕ))dτ Z Te e 1 R − 1−γ Ts µτ −γ(µ∗ τ +ϕ) dτ ws ds. (28) T The optimal investment is then given by: ∂ ∂π B : π = − α−r σ2 x ∂V (t,x) ∂x ∂ 2 V (t,x) ∂x2 = 1 α−r x+gt 1−γ σ 2 x . Mutual fund Theorem 1 (Merton (1971)). Given n assets with prices Si whose changes are log-normally distributed, then (1) there exists a unique pair of ”mutual funds” constructed from linear combinations of these assets such that, independent of preferences (i.e., the form of utility function), wealth distribution, probability of distribution of lifetime individuals will be indifferent between choosing from a linear combination of these two funds or a linear combination of the original n assets. 26 Corollary 1 (Merton (1971)). If one of the assets is ”risk-free” [assume with the index i = N ], then the proportions of each asset held by the mutual funds are ∀i=1,...,N −1 θi = PN −1 [σij ]−1 (αi −r) PN −1i=1 PN −1 . −1 (αj −r) i=1 j=1 [σij ] Hence, without loss of generality we can work with only two assets, one risk-free and the other a mutual fund of risky assets with a log-normally distributed price St defined by dSt = αSt dt + σSt dWt , with α= N −1 X σ2 = θi αi , i=1 C N −1 N −1 X X θi θj σij , dW = i=1 j=1 N −1 X θi σσi dWi . i=1 Derivation of the expected values of the savings The expected values of the savings account, optimal investment, optimal consumption and optimal insured shown in the plots in Sec. 5 can be derived explicitly. Knowing these values is not only useful for analyzing the numerical results, but also for estimating the minimum and maximum breakpoints when linearizing the utility function, 4.3. Denote Zt := Xt∗ + g(t), then dZt = dXt∗ + ∂g(t) ∂t and ∂g(t) ∂t = (r + µ∗t )g(t) − l(t). Problem (A). The dynamics of process Z are given dZt = dXt∗ + ∂g(t) ∂t . Inserting the dynamics of process X together with the optimal controls gives 1/(1−γ) (α−r)2 ∗ ∗ µt vt wt α−r dZt = r + µt + σ2 (1−γ) − fA (t) − µt µ∗ fA (t) Zt dt + σ(1−γ) Zt dWt t 1/(1−γ) vt α−r t = ψt − fAw(t) − µ∗t µµ∗t fA (t) Zt dt + σ(1−γ) Zt dWt , t Z0 = x0 + gA (0), 2 with ψt = r + µ∗t + σ(α−r) 2 (1−γ) . The solution to the equation is given by: Z t 1/(1−γ) s Zt = Z0 exp ψs − fAw(s) − µ∗s µµ∗s s 0 vs fA (s) − (α−r)2 2σ 2 (1−γ)2 ds + α−r σ(1−γ) Wt , (29) implying that Xt > −gA (t), ∀t . Process Z is a geometric Brownian motion, whose expected value is given explicitly: Z t 1/(1−γ) ∗ µs ws vs E[Zt ] = Z0 exp ψs − fA (s) − µs µ∗ fA (s) ds . 0 s From the definition of Z we can calculate the expected value of the savings process: E[Xt∗ ] = E[Zt ] − gA (t). Problem (B). Analogically, we derive the expected value of savings for problem (B) before and after retirement: nR o t x0 + gB (0) exp ψ ds − gB (t), t ∈ [t0 , T ), s 0 nR o E[Xt∗ ] = t s x0 exp ψs − fBw(s) ds , t ∈ [T, Te). 0 27 D Comparison of linearization methods In this section we compare the linearization method presented in our work with the method suggested in Geyer et al (2009a). There are two main differences between our approaches: finding the position of breakpoints and defining the formula for the linear approximation. First, we compare the position of breakpoints defined by the method described in Geyer et al (2009b) and in our work defined by eq. (19)-(20). The first method defines the minimum and maximum breakpoints and uses the curvature of the utility function to obtain optimal positions of the remaining breakpoints. We refer the reader to Geyer et al (2009a) for details. Figures 8 and 9 show, respectively, the curvature of the considered utility functions and the utility function with the most defined curvature (γ = −3) with the optimal positions of the breakpoints for different minimum and maximum breakpoints, as well as for the different lengths of the intervals. The plots to the left in Fig. 6 present the position of breakpoints obtained from the curvature method, and the plots to the right, breakpoints given by the optimization method applied in our work. In the comparison we have used m = 40 breakpoints. γ=−3 γ=−1 γ=0 −0.2 −0.4 κ(x) −0.6 −0.8 −1 −1.2 −1.4 1 2 3 4 5 x 6 7 8 9 10 Figure 8: Curvature of utility functions with γ ∈ {−3, −1, 0}. We have indicated two main differences between the two methods. Firstly, the curvature of the utility function does not inherit one of the important properties of the utility functions, rescalability, i.e. the shape of the utility function remains the same over all intervals A × [bpmin , bpmax ], ∀A > 0, where A is a constant. Therefore, the optimal position of the breakpoints in the curvature method is different for intervals Aj × [bpmin , bpmax ] and Ak × [bpmin , bpmax ], where Aj 6= Ak . On the contrary, the position of the breakpoints obtained from the optimization method is rescaled, and remains the same for the mentioned intervals, as can be seen on Fig. 9a and 9b, where we investigate the positions of the breakpoints over different intervals, x ∈ [1, 10] and x ∈ [0.1, 1] = 0.1[1, 10]. Secondly, the breakpoints in the curvature method are concentrated in the interval, where the absolute value of the curvature is the highest, whereas the breakpoints in the optimization method are more spread out, though with more points around the arguments for which the absolute value of the curvature is the highest. This is shown on Fig. 9c, where the chosen interval is longer, x ∈ [0.1, 10]. Before we can compare the approximations based on the positions of breakpoints we have to define the formula for the approximation. Geyer et al (2009b) approximate the utility function by h̃(x) ≈ h(bp2 )−h(bp1 ) z1 bp2 −bp1 + m X h(bp j )−h(bpj−1 ) zj bpj −bpj−1 + h(bpm )−h(bpm−1 ) zm+1 , bpm −bpm−1 (30) j=2 where x= m+1 X zj , 0 ≤ z1 ≤ bp1 , 0 ≤ zj ≤ bpj − bpj−1 , j = 2, . . . , m, 0 ≤ zm+1 . j=1 Both the above formula and the one defined in eq. (23) approximate a concave function by a certain number of segments between the breakpoints, and define argument x in terms of non-negative decision variables zj associated with each segment. The reason we changed eq. (30) to eq. (23) is to be able to control the 28 0 0 b40 −0.05 −0.1 −0.1 −0.15 −0.15 u(x) u(x) −0.05 b40 −0.2 −0.2 −0.25 −0.25 −0.3 −0.3 b1 −0.35 1 b1 2 3 4 5 6 7 8 9 10 −0.35 1 2 3 4 5 x 6 7 8 9 10 x (a) x ∈ [1, 10] 0 b39 −50 −100 −100 −150 −150 u(x) u(x) 0 −50 −200 −200 −250 −250 −300 b40 −300 b1 −350 0.1 b1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 −350 0.1 1 0.2 0.3 0.4 0.5 x 0.6 0.7 0.8 0.9 1 x (b) x ∈ [0.1, 1] 0 b40 −50 −100 −100 −150 −150 u(x) u(x) 0 −50 −200 −200 −250 −250 −300 −300 b1 −350 0 b40 b1 1 2 3 4 5 x 6 7 8 9 10 −350 0 1 2 3 4 5 x 6 7 8 9 10 (c) x ∈ [0.1, 10] Figure 9: Positions of the breakpoints for the power utility function u(x) = (plots to the left) and the optimization method (plots to the right). 1 γ x , γ γ = −3, obtained via the curvature method linearization error. Notice, that all the considered utility functions are negative, and u(0) = −∞, whereas function h̃(x) always returns a positive number. Therefore, we added a small number ε in our formulation, 0 < ε < bpmin , to have a starting point for which the approximated function is negative. Table 3 compares the values of the utility function with their approximations based on two methods for finding the position for the breakpoints: the optimization method (columns (3) and (5)) and the curvature method (columns (4) and (6)) calculated accordingly to formula (23) (columns (3) and (4)) and formula (30) (columns (5) and (6)). As also shown on Fig. 9, we have investigated three different intervals x ∈ [1, 10], x ∈ [0.1, 1] and x ∈ [0.1, 10], and for each interval we choose five values of x equally spaced over the interval. We first analyze columns (3) and (4), which differ with the positions of the breakpoints. The numbers in bold show the approximation closest to the value of the utility function. Both optimization methods have the same minimum breakpoint, hence for x = bp1 we obtain the same result. Out of 12 samples, the curvature method outperforms the optimization method only twice. The results over interval x ∈ [0.1, 1] are particularly interesting, since for the small values of x, such as x = 0.325 and x = 0.550 the approximation based on the breakpoints obtained via the curvature method is very imprecise, whereas for x = 0.775 and x = 1.0 it slightly outperforms the optimization method. Fig. 9b (left) supports these results. We notice that all the remaining breakpoints in the curvature method are placed in the interval (0.65, 1.0), therefore the approximation is good for x ≥ 0.65 and imprecise for x ≤ 0.65. The running time of finding positions of breakpoints for both considered methods is a couple of seconds. 29 As for the linearization error - compare columns (2) with (3) and (4) - we conclude that both methods are sensitive to the choice of the minimum and maximum breakpoints, as well as to the length of the interval. The approximations are the best and can be precise for short intervals, such as x ∈ [0.1, 1] and for the small values of x. Therefore, to ensure a good approximation, we rescale all input data regarding payments, i.e. initial savings, x0 , and labor income/premiums, lt . It is hard to say much about the results in columns (5) and (6). As mentioned before, we chose formula (23) over (30) for the ease of controlling the linearization error. We believe that these two formulae approximate the utility function similarly well because there is only a small difference between them - the starting point. We have tested both formulae, and when it comes to the optimization, the values of decision variables zj are in both cases the same. formula for approximation h(x), eq. (23) h̃(x), eq. (30) opt. curv. opt. curv. x u(x) x ∈ [1, 10] 1.000 3.250 5.500 7.750 10.000 -0.333 -0.010 -0.002 -7.16100E-4 -3.33333E-4 -0.312 -0.162 -0.181 -0.123 -0.124 -0.312 -0.174 -0.312 -0.312 -0.250 0.941 1.265 1.272 1.274 1.274 0.972 1.276 1.286 1.295 1.305 x ∈ [0.1, 1] 0.100 0.325 0.550 0.775 1.000 -333.333 -9.710 -2.004 -0.716 -0.333 -333.312 -9.703 -2.090 -0.675 -0.436 -333.312 -198.354 -63.422 -0.726 -0.426 941.122 1264.735 1272.432 1273.739 1274.122 59.970 194.902 329.834 392.587 392.970 x ∈ [0.1, 10] 0.100 2.575 5.050 7.525 10.000 -333.333 -0.020 -0.003 -7.82275E-4 -3.33333E-4 -333.312 -0.026 -0.047 -0.060 0.062 -333.312 -0.172 -0.312 -0.250 -0.250 928.620 1261.933 1261.950 1261.952 1261.953 65.364 398.651 398.666 398.682 398.697 Table 3: Comparison of the value of the utility function with its approximations based on two methods for finding the position for the breakpoints: the optimization method (column (3) and (5)) and the curvature method (column (4) and (6)) calculated accordingly with formulae (23) (column (3) and (4)) and formula (30) (column (5) and (6)) for different intervals x ∈ [1, 10], x ∈ [0.1, 1] and x ∈ [0.1, 10], and different values of x equally spaced over the interval. The numbers in bold shows the approximation closest to the value of the utility function. 30
© Copyright 2026 Paperzz