Dynamic Financial Hedging Strategies for a Storable Commodity with Demand Uncertainty Panos Kouvelis Olin School of Business, Washington University in St. Louis, US, [email protected] Zhan Pang Department of Management Science, Lancaster University, Lancaster, United Kingdom [email protected] Qing Ding Lee Kong Chian School of Business, Singapore Management School, Singapore [email protected] We consider a firm purchasing and processing a storable commodity in a volatile commodity price market. The firm has access to both a commodity spot market and an associated financial derivatives market. The purchased commodity serves as a raw material which is then processed into an end product with uncertain demand. The objective of the firm is to coordinate the replenishment and financial hedging decisions to maximize the mean-variance utility of its terminal wealth over a finite horizon. We employ a dynamic programming approach to characterize the structure of optimal time-consistent policies for inventory and financial hedging decisions of the firm. Assuming unmet demand is lost, we show that under forward hedges the optimal inventory policy can be characterized by a myopic state-dependent base-stock level. The optimal hedging policy can be obtained by minimizing the variance of the hedging portfolio, the value of excess inventory and the profit-to-go as a function of future price. In the presence of a continuum of option strikes, we demonstrate how to construct custom exotic derivatives using forwards and options of all strikes to replicate the profit-to-go function. The financial hedging decisions are derived using the expected profit function evaluated under the optimal inventory policy. These results shed new light into the corporate risk and operations management strategies: inventory replenishment decisions can be separated from the financial hedging decisions as long as forwards are in place, and the dynamic inventory decision problem reduces to a sequence of myopic optimization problems. In contrast to previous results, our work implies that financial hedges do affect optimal operational policies, and inventory and financial hedges can be substitutes. Finally, we extend our analysis to the cases with backorders, price-sensitive demand, and variable transaction costs. Key words : Finance-Operations Interface, commodity price risk, inventory management, financial hedging. History : May 2013 1. Introduction Commodity price volatility has a significant impact on raw material purchasing cost and inventory management in many industries. For example, in chemical industry, manufacturers’ (e.g., BASF) cost structures largely depend on the price of crude oil which experienced tremendous fluctuations in the past several years, ranging from around $30 in 2003 up to a July 2008 high of $147.30 but then down to a December 2008 low of $32 (Berling and Martı́nez-de-Albéniz 2011). Managing commodity 1 2 material price risk has become one of the biggest challenges for supply chain management. More and more firms have realized the strategic importance of integrating the financial risk management and operational management in their supply chains. For example, in December 2001, Ford announced a loss of $1 billion on its precious-metals inventory and forward-contract agreements. On the contrary, HP was able to cut its purchasing price risk for flash memory using its procurement risk management strategy based on a contract portfolio approach (Nagali et al. 2008). The survey of Bodnar et al. (1998) shows that 50% of U.S. non-financial firms were using financial derivatives as risk management instruments. There is a growing research interest in studying the financial risk management in supply chain and operational contexts. We refer to Kleindorfer (2008a,b) for excellent overviews of the recent developments and the research challenges to use financial instruments (e.g., forwards, futures and options) in production and supply chain management. Our paper adds to this literature by addressing a joint dynamic financial hedging and inventory control problem for a risk-averse firm with mean-variance utility of the terminal wealth. We attempt to answer the following questions: (1) How to find a time-consistent integrated inventory and financial hedging policy under the meanvariance framework? (2) What is the structure of the optimal joint replenishment and financial hedging policy? (3) What is the distinguishing role of inventory and financial hedges in managing commodity price and demand risks? and (4) Are inventory and financial hedges complements or substitutes? More specifically, we consider a firm purchasing a storable commodity from a volatile (physical) spot market with the access to associated financial derivatives. This commodity serves as a raw material which is then processed into an end product with fixed selling prices. The demand of the end product is random and depends on spot price. Unmet demand is treated as lost sales. The spot price evolves as a Markov process. The physical spot market and the related financial derivatives market (with the financial instruments written on the spot price) are assumed to be liquid in that no transaction cost is incurred. Moreover, we assume that there is no arbitrage opportunity in both the physical spot market and financial market. The objective of the firm is to maximize the mean-variance utility of terminal wealth. We formulate this problem as a stochastic dynamic program. Note that the mean-variance criterion is not time consistent which will induce a decision maker to deviate from the previous decision at a later time. This is because the variability of terminal wealth that the decision maker perceives at any point of time is higher than that of a later date. Inspired by Basak and Chabakauri (2010), we apply the law of total variance (Weiss 2005) to develop the recursive equation by decomposing the conditional mean-variance utility at any date into two parts. The first part is the expected future mean-variance utility and the second the conditional variance of expected future cash flow. 3 The second part serves as an additional risk adjustment due to the greater perceived variability compared to the variability that will be perceived in next period, which provides an incentive to deviate from the decisions made in previous periods. Note that although the mean-variance criterion is not time-consistent, the derived policy in our model is time-consistent, namely, the optimal decisions in the future are also optimal now. We analyze two types of financial hedging strategies. We first assume that only the vanilla derivative contracts such as forwards and simple options (e.g., European puts and calls) are used. We show that, in the presence of a forward contract maturing at next period, the joint replenishment and hedging optimization problem can be decomposed into two sub-problems: One is a single-period inventory decision problem and the other involves the financial hedging decisions. Therefore, myopic state-dependent base-stock policies are optimal. On the other hand, the dynamic financial hedging decisions can be obtained by minimizing the variance associated with the expected future cash flow conditioned on the next-period spot price. We then show how to construct exotic derivative contracts with European calls and puts of all strikes to fully replicate the expected future cash flow conditioned on the next period’s price. Note that exotic derivatives refer to all those nonstandard and tailor-made derivative contracts which normally cannot be replicated by a finite number of standard exchange-traded vanilla derivatives, and mostly traded in over-the-counter (OTC) markets (Briys et al. 1998). Thus the continuum of option strikes also belongs to the exotic derivatives. In order to examine whether the insight applies to more general settings, we study several important extensions: (1) demand backordering; (2) price-sensitive demand; and (3) transaction costs. When demand is backlogged, we show that the replenishment decisions are still separable from the financial hedging decisions, while the hedging decision must take into account the optimal pricing and inventory policies when predicting future cash flows. When demand is price-sensitive, we show that the joint pricing and replenishment decisions can still be made myopically and separated from financial hedging decisions. When there exist variable transaction costs, the myopic policies are no longer optimal, but we can still use the exotic contracts to replicate the profit-to-go, which implies that the replenishment decisions can still be separated from the financial hedging decisions. Our contributions to the literature are two-fold. First, we develop a dynamic programming framework to derive the time-consistent optimal inventory and financial hedging policies under an inter-period mean-variance utility. Second, our analysis sheds new light into effective commodity risk management practices: Inventory is used to hedge demand (quantity) risk while financial derivatives are used to hedge the price risk. The inventory replenishment decisions can be separated from financial hedging decisions if the portfolio of hedging derivatives is structured properly. 4 Financial hedging does affect inventory policies and often simplifies them. However, optimal financial hedging strategies are heavily dependent on inventory policies as they determine future cash flows. In other words, inventory decisions can be separated from financial hedging decisions, but not the other way around. This paper is organized as follows. Section 2 briefly reviews the related literature. Section 3 presents the model and the dynamic programming formulation. Section 4 characterizes the optimal joint replenishment and hedging policies under different hedging strategies. Sections 5, 6 and 7 discuss the relationship between financial hedges and inventory hedges, elaborate on how financial hedges create value, and illustrate numerically some of the obtained insights. Section 9 concludes the paper. All the proofs are in the Appendix. 2. Related Literature This section reviews the related literature. We first briefly review the most relevant financial literature and then focus on the operations management literature. Starting from the seminar paper of Markowitz (1952), the mean-variance analysis has long been seen as the cornerstone of modern portfolio theory and has been widely adopted in both academia and industry. There is an extensive literature about the mean-variance analysis of portfolio optimization. We refer to Basak and Chabakauri (2010) for a comprehensive review of its recent developments. In a complete market, it has been shown that a (static) pre-commitment portfolio strategy chosen at an initial date also solves the one with a quadratic objective for some specific parameters (e.g., Li and Ng 2000). However, Basak and Chabakauri (2010) point out that an investor may subsequently find it optimal to deviate from the initial policy if market conditions change in the future. They solve a continuous-time dynamic asset allocation problem of a meanvariance optimizer in a complete market setting and provide a simple dynamic portfolio policy. Their work appears to be the first to obtain in an incomplete market environment a closed-form solution of the dynamically optimal policy, from which the investor has no incentive to deviate, namely, time-consistent policies. In a similar setting, Basak and Chabakauri (2011) study how to hedge a non-tradable asset with a tradable asset. They obtain time-consistent hedges by dynamic programming and show that their dynamically optimal hedges typically outperform their static and myopic counterparts. Inspired by the developments of Basak and Chabakauri (2010, 2011), we develop a discrete time dynamic programming framework to address the joint replenishment and financial hedging problem. Our model differs from their models mainly in two aspects: (1) We explicitly incorporate and address demand uncertainty. (2) We consider the integration of dynamically optimal operational (inventory) hedges and financial hedges, while there are no operational decisions in their model. (3) We consider a discrete time setting as opposed to their continuous 5 time model, which allows us to decompose the inventory and financial hedging decisions and better explore the structural insights. Our work also relates to the optimal positioning problem in derivative securities. Carr and Madan (2001) show that a twice-differentiable payoff function can be replicated by a risk-less asset (the bond), a single risky asset (the stock) and European options of all strikes (derivatives). Oum et al. (2006) employ Carr and Madan’s (2001) approach to develop the optimal static hedging strategy in a competitive electricity wholesale market when both demand and price are uncertain. Our model further extends Carr and Madan’s (2001) analysis to construct the dynamic replication strategies in an inventory system. Our paper is in line with the literature on the joint operational and financial risk management; see Kleindorfer (2008a,b), Zhu and Kapuscinski (2011) and Kouvelis et al. (2013) for reviews of recent developments. Compared to the financial literature, there is relatively less attention that has been paid to the financial risk management in operational environments. Although some of the tools in finance, such as risk averse decision criterion (e.g., mean-variance analysis of Sobel 1994, expected utility framework of Bouakiz and Sobel 1992, Eeckhoudt et al. 1995 and Agrawal and Seshadri 2000), portfolio optimization (e.g., Martı́nez-de-Albéniz and Simchi-Levi 2005) and financial option theory (e.g., Ding et al. 2007), have been introduced into operations research, they do not take into account the unique aspects of commodity risk management in uncertain demand environments. Next, we review the related literature in the single-period (newsvendor) and multi-period inventory setting, respectively. Recent developments on the interfaces of operational and financial risk management mainly focus on the single-period (newsvendor) setting. Gaur and Seshadri (2005) address a joint inventory ordering and financial hedging problem for a retailer with demand being correlated to the price of a financial asset. They first show how to construct single-period hedging strategies in both the mean-variance and the more general utility-maximization frameworks. They then investigate the impact of financial hedges on the expected utility and the inventory levels. They show that financial hedging increases the optimal inventory level for the risk averse newsvendor, and brings it closer to the risk-neutral profit-maximization decision. With weather sensitive demand, Gao et al. (2011) study a joint inventory ordering and weather derivative hedging decision problem under a CVaR framework and obtain similar insights. Ding et al. (2007) consider the integration of operational and financial hedging strategies for a global firm facing exchange rate risk. Chod et al. (2010) examine the relationship between the operational flexibility and financial hedging in capacity investment decisions. They show that product flexibility and financial hedging tend to be complements (substitutes) when product demands are positively (negatively) correlated, whereas postponement flexibility is a substitute to financial hedging. This literature has also been extended 6 to supply chain settings (e.g., Caldentey and Haugh 2009). Caldentey and Haugh (2009) address the impact of financial hedging on the supply chain performance and contract design, and show that in a typical two-stage supply chain the supplier always prefers the flexible contract with financial hedging to the flexible contract without financial hedging whereas the buyer might or might not prefer the flexible contract with hedging. There are only a few papers addressing the joint financial hedging and operational decisions in multi-period settings. With a quadratic utility function, Caldentey and Haugh (2006) provide a modeling framework that allows continuous trading in the financial market to hedge against a cash flow driven by an operational policy. With an additive exponential utility function, Smith and Nau (1995) address a project valuation problem and show that the operational and financial hedging decisions can be separated from each other. Applying the same framework of Smith and Nau (1995), Chen et al. (2007) propose a multi-period consumption model that integrates pricing, inventory control, consumption and financial hedging decisions. Assuming that financial hedging decisions are made after observing the realized demand, they show that the inventory and pricing decisions can be separated from the consumption and financial hedging decisions. Their results indicate that the financial hedging does not change the inventory and pricing decisions when compared to the system without financial hedges. In a similar setting, Geman and Ohana (2008) address the problem of managing a storable commodity portfolio that includes physical assets and hedging positions in spot and forward markets. Their discussion focuses on the time-consistency of the utility functions. Note that the utility functions used by Chen et al. (2007) and Geman and Ohana (2008) are both intra-period (see Kouvelis et al. 2013 for a detailed discussion), as opposed to our inter-period utility. Kouvelis et al. (2013) is the first paper to employ an inter-period utility for the joint inventory control and financial hedging problem for a storable commodity in a multi-period setting. More specifically, they consider a risk-averse buyer procuring a storable commodity from a spot market and a fixed-price long-term contractual channel, and having access to the financial derivatives. In each period, the buyer maximizes the mean-variance utility of the net present value (NPV) of the total cash flow over the remaining periods, taking into account the future decisions under the same criterion. However, it is notable that since the mean-variance utility is not time-consistent, the future decisions under the same criterion may not be optimal for the objective function of current period. Our paper addresses the same issue of Kouvelis et al. (2013), but without the presence of longterm contract, and with the objective of maximizing the mean-variance utility of the terminal wealth. We employ the dynamic programming approach to obtain a time-consistent optimal policy, explicitly accounting for the fact that the mean-variance utility itself is not time-consistent. Our 7 analysis shows that the joint replenishment and hedging policy can be decomposed as follows: (1) the optimal inventory policy is derived from solving a sequence of myopic optimization problems, and thus myopic base-stock policies are optimal; and (2) the financial hedging portfolio is to offset the price risk associated with the inventory carried over to the next period and the profit-to-go as a function of future price. Our result confirms that the operational (inventory) decisions can be separated from financial decisions. Differing from results in Chen et al. (2007), we show that financial hedges do affect optimal inventory policies (myopic policies are no longer optimal without financial (forward) hedges). In the last period, financial hedges increase inventory order-up-to level, whereas in previous periods financial hedges can reduce inventory order-up-to level in comparison to the unhedged solution. We further extend our analysis to account for backlogs, price-sensitive demand, and transaction costs. In particular, in the presence of transaction costs, the myopic inventory policy is no longer optimal, but the inventory decision is still separable from financial hedging decisions when exotic hedges with forwards and options of all strikes may be used. 3. 3.1. The Model Problem Description Consider a risk averse firm purchasing a storable commodity from a volatile spot market. This commodity serves as a raw material which can be processed into an end product with uncertain demand and fixed selling prices. Assume that there exists a spot market and an associated financial market for financial derivative contracts written on the spot price. The inventory is reviewed periodically and the periods are indexed by t = 0, 1, 2, · · · , T + 1. Period 0 is the beginning period and period T + 1 is the ending period. The firm serves the demand occurring in period t = 1, · · · , T , denoted by dt , with a fixed unit selling price rt . Unmet demand is lost. The excess inventory is carried over to the next period which incurs a holding cost ht for each unit of inventory. We assume that this commodity product can be sold or bought from spot market with current spot price at the beginning of each period. The replenishment leadtime is zero. Assume that the commodity spot market is liquid in the sense that there is no transaction cost and the bid-ask spread is zero. Let St be the risk-adjusted spot price at the beginning of period t. Denote by s the realization of the spot price. The spot price process {St , t = 0, 1, · · · , T } is a Markov process. Demand in each period may depend on the spot price. Without loss of generality, we assume that the support of demand is R+ . Let Φt (·|s) and ϕt (·|s) be the distribution and density functions of demand given a spot price s. Assume that Φt (ξ, s) is twice differentiable in (ξ, s). As Gaur and Seshadri (2005), one may specify the demand as the form dt = a + bs + ϵt , 8 where ϵt is an error term independent of s such that Et [ϵt ] = 0 and E[ϵ2 ] < ∞. We denote by Ft the filtration generated by (St , dt ). Throughout the paper, we make the no arbitrage assumption and assume a risk-neutral probability measure Q (see, e.g., Geman and Ohana 2008). For convenience, let Et [·] = E[·|Ft ] = E[·|St = s] denote the conditional expectation operator under Q. According to the competitive storage theory, the commodity price satisfies the following noarbitrage arguments (see, e.g., Working 1948, Williams and Wright 1991, Routledge et al. 2000). Assumption 1. The spot price and holding cost satisfy the no-arbitrage condition, i.e., s + ht − αE[St+1 |S(t) = s] ≥ 0, where α is the risk-free discount factor. The financial market is assumed to be complete. All the financial contracts (e.g., forwards/futures and options) are written on spot price and fairly priced. For example, the forward prices are determined by the expected spot price under a risk neutral probability measure at maturity of the contract. Let ft,τ (s) be the forward price quoted in period t and for delivery in period τ ≥ t given the spot price at t, s. In particular, ft,τ (s) = Et [Sτ ] and ft,t = St . The no arbitrage assumption implies that the forward prices are martingale under P, i.e., Et [ft,τ ] = ft,t = St . Let Ht (St+1 ) be the net profit (or marked-to-market payoff) of a portfolio under the hedging strategy in period t with a random payoff depending on the realization of the spot price St+1 in period t + 1. Without loss of generality, we let function Ht represent a hedging strategy. Let Ht be the set of feasible hedging strategies. We impose the following assumption on the payoff function of the heding portfolio. Assumption 2. For all t and for any Ht ∈ Ht , the expected profit from the hedging strategy is zero, i.e., Et [Ht (St+1 )] = 0. This assumption implies that the financial derivatives are fairly priced and thus the expected profit earned from the hedging portfolio is zero. This is a common assumption in finance literature (see, e.g., Brown and Toft 2002). 3.2. Dynamic Programming Formulation Note that in period 0 only hedging decision is made. From period 1 to T , a joint inventory and hedging decision is made. Let xt be the inventory level at the beginning of period t and yt be the order-up-to level in period t such that yt ∈ R+ . The sequence of events in each period t > 0 is as following: (1) observe the spot price St = s and obtain the payoff of last period’s financial hedge Ht−1 (St ); (2) place an order yt − xt (bid if yt > xt or ask if yt < xt ) from spot market, and choose 9 a hedging portfolio Ht which matures at the next period; (3) demand dt occurs; and (4) holding costs are incurred for ending inventory of the level (yt − dt )+ . Let U be the set of admissible joint replenishment and financial hedging policies. Under any policy u ∈ U , the dynamics of inventory level are xt+1 = (yt − dt )+ , t = 1, · · · , T. The operational profit function of each period is denoted by π̃t (xt , yt , St ) = rt min(yt , dt ) − St (yt − xt ) − ht (yt − dt )+ , t = 1, 2, · · · , T, π̃T +1 (xT +1 , ST +1 ) = xT +1 ST +1 . Note that π̃0 = 0. For simplicity, we assume that x1 = x0 = 0. Let W̃t be the initial wealth at the beginning of period t. Under a joint inventory and hedging strategy, the dynamics of wealth are represented by W̃t+1 = α−1 [W̃t + π̃t (xt , yt , St )] + Ht (St+1 ), t = 1, · · · , T and the terminal wealth is expressed as W̃T +1 = α−1 [W̃T + π̃T (xT , yT , ST )] + ST +1 xT +1 + HT (ST +1 ) T ∑ −(T +1) =α W̃0 + α−(T +1−t) [π̃t (xt , yt , St ) + Ht−1 (St )] + ST +1 xT +1 + HT (ST +1 ), t=1 where W0 is the initial wealth level which is an exogenously given positive constant. For convenience, we redefine the operational profit associated with period t’s operation as πt (yt , St , St+1 ) = rt min(yt , dt ) − St yt − ht (y − dt )+ + αSt+1 (yt − dt )+ , = (rt − St )yt − (rt + ht − αSt+1 )(yt − dt )+ , with π0 = 0 and πT +1 = 0. This profit function counts the total profit from the sales revenue less the holding cost and plus the clear value of the ending inventory at the beginning of the next period. Since the market is liquid and there is no backlog, the excess inventory can be sold back to the market at the spot price of the next period without any transaction cost, and thus the inventory replenishment decision of the next period does not depend on how much inventory is on hand. This treatment effectively allows us to separate the cash flow related to the current period’s inventory decision from the cash flow related to the next period’s inventory decision. Redefine the dynamics of wealth process as Wt+1 = α−1 [Wt + πt (yt , St , St+1 ) + αHt (St+1 )], t = 0, 1, · · · , T. (1) 10 The terminal wealth can be expressed as WT +1 = T ∑ α−(T +1−t) [πt (yt , St , St+1 ) + αHt (St+1 )]. t=0 Notice that WT +1 = W̃T +1 . The firm’s objective is to maximize the mean-variance utility of the terminal wealth at the end of the planning horizon. In particular, the dynamic joint replenishment and financial hedging problem of the firm is given by max{Ut := Et [WT +1 ] − λV art [WT +1 ]}, u∈U (2) subject to the wealth dynamics (1). Remark 1 (Time Consistency). The variance risk measure is time-inconsistent while the expectation operation is time-consistent. To see this, we apply the law-of-total-variance (e.g., Weiss 2005) such that V art (WT +1 ) = Et [V art+1 (WT +1 )] + V art [Et+1 [WT +1 ]], t < T + 1. Clearly, the period-(t + 1) variance is larger than the expected variance at period-(t + 1). Thus, the hedging strategy in period t should account not only for the expected period-(t + 1) variance of the terminal wealth, but also the variance of period-(t + 1) expected terminal wealth. Then, given any admissible policy u, the mean-variance utility Ut satisfies the recursive representation Ut = Et [Ut+1 ] − λV art [Et+1 [Wt+1 ]]. That is, the expected mean-variance utility in period t consists of the expected future utility and an adjustment. This adjustment enables us to determine the optimal policy by backward induction. This derived optimal policy is time-consistent in the sense that the firm optimally chooses the policy taking into account that he or she will act optimally in the future. We refer to Basak and Chabakauri (2010) for a more detailed discussion about the dynamic mean-variance analysis framework and the time consistence of the derived optimal policy. Let J˜t (Wt , s) as the mean-variance utility under the optimal policy u∗ which solves (2) given the wealth level and spot price at the beginning of period t. The following proposition presents the dynamic programming formulation for J˜t . Proposition 1. J˜0 (W0 , s) satisfies the following optimality equation: [ ] J˜0 (W0 , s) = E0 [J˜1 (W1 , S1 )] − λ min V ar0 α−T H0 (S1 ) + v1 (S1 ) . H0 ∈H0 (3) 11 For t = 1, · · · , T , J˜t (Wt , s) satisfies the following optimality equation: { } [ −(T +1−t) ] ˜ ˜ Jt (Wt , s) = max Et [Jt+1 (Wt+1 , St+1 )] − λV art α [πt + αHt (St+1 )] + vt+1 (St+1 ) ,(4) yt ≥0,Ht ∈Ht where vt (s) is the expected profit-to-go function (accounted to period T + 1) under the optimal policy and satisfies the following recursive equation: vt (s) = Et [α−(T +1−t) πt (yt∗ (s), St , St+1 )] + Et [vt+1 (St+1 )], (5) with the terminal value vT +1 (s) = 0. Here, yt∗ (s) is the optimal order-up-to level in period t when the spot price is s. Equation (3) shows that in period 0 only the hedging decision is made. For t = 1, ..., T , the optimality equation (4) states that the period-t mean-variance of terminal wealth under the optimal policy equals the maximum of expected period-(t + 1) mean-variance of terminal wealth less the product of λ and the variance of period-(t + 1) expected terminal wealth. The second term serves as a time-consistent adjustment term which enables us to determine the optimal policy by backward induction. The derived optimal policy is time-consistent in that the firm optimally chooses the decisions taking into account that she will act optimally in the future periods. The following proposition shows that the formulation (4) for the problem (2) can be transformed into a simpler equivalent problem by separating the wealth at the beginning of each period from the objective function. Proposition 2. The optimal mean-variance utility function J˜t (Wt , s) can be decomposed into two parts as J˜t (Wt , s) = Jt (s) + α−(T +1−t) Wt , where Jt (s) satisfies the following optimality equations: J0 (s) = E0 [J1 (S1 )] − λα−2T min V ar0 [αT v1 (S1 ) + H0 (S1 )], (6) Jt (s) = max{α−(T +1−t) gt (y, s)} + Et [Jt+1 (St+1 )], t = 1, ..., T, (7) H0 ∈H0 y≥0 where JT +1 (s) = 0 and [ −(T +1−t) ] At (s)V ar[(y − dt ) ] + α ψt (y, s) , gt (y, s) = Et [πt ] − λα { } + T −t ψt (y, s) = min V art [St+1 Et [(y − dt ) ] + α vt+1 (St+1 ) + Ht (St+1 )] . + 2 Ht ∈Ht (8) (9) where At (s) = Et [(rt + ht − αSt )2 ] = (rt + ht − αft (s))2 + α2 V art [St+1 ]. The equations (6) and (7) are equivalent to (3) and (4). Note that Jt (s) is the period-t expected mean-variance of the terminal wealth with subtraction of the initial wealth in period t. The separation of the initial wealth in the objective function allows us to separate Jt+1 from the optimization 12 problem in each period and concentrate on the expected single period profit Et [πt ] and the timeconsistent adjustment term. The optimality equation (7) implies that in each period t the optimal policy is independent of the initial wealth of this period. The inventory and financial hedging decisions depend only on the initial price and thus the current inventory decision will not affect next period’s decisions. Moreover, the time-consistent term is divided into two parts: the first one is related to the variance of overage inventory which is carried over to the next period, and the second one is about the variance of the cash flows related to the spot price: the value of expected overage inventory carried to next period, the period-(t + 1) expected profit-go-to, and the financial hedging term. This decomposition implies that the financial hedges are used to mitigate the price risk (by minimizing the variance of the cash flows related to the spot price of the next period), while the risk due to demand uncertainty (i.e., the variance of overage inventory) cannot not be financially hedged. In other words, demand risk is hedged with physical inventory and price risk can be hedged by financial instruments. In general, the demand risk cannot be perfectly hedged. The optimal policy can be solved by backward induction with (7). The optimality equation (7) also indicates that the joint inventory and financial hedging decision can be solved by two sequential steps. First, given any inventory decision y, the optimal hedging strategy is derived by solving the problem (9) from which we can see that the financial hedges should be constructed to offset the cash flow at the beginning of the next period, St+1 Et [(y − dt )+ ], and the discounted profit-to-go αT −t vt+1 (St+1 ). Second, the optimal inventory decision is obtained by solving the problem in (7), given the optimal inventory level dependent hedging strategy. Before closing this section, we introduce the evolution of the variance. Let Vt (s) be the conditional variance of the terminal wealth in period t, t = 0, 1, ..., T . The preceding analysis implies that Vt (s) satisfies the following recursive equations: V0 (s) = E0 [V1 (S1 )] + α−2T V ar0 [αT v1 (S1 ) + H0 (S1 )], Vt (s) = Et [Vt+1 (St+1 )] + α−2(T +1−t) [At (s)V ar[(yt∗ − dt )+ ] + α2 ψt (yt∗ , s)], t = 1, ..., T. 4. Optimal Policies This section first addresses the optimal hedging and inventory control policy with forwards and then develop the replication strategy when there exist options of all strikes. 4.1. Hedging with Vanilla Derivatives Vanilla derivatives refer to the standard derivative contracts such as futures, forward and simple options. Futures and forward contracts are the most commonly traded derivative contracts in commodity markets. The trading mechanisms of the futures and forwards are quite similar except 13 the fact that futures are traded on exchanges and forwards are traded in over-the-counter (OTC) market. For the sake of simplicity, we ignore the difference in trading details between the futures and forwards. The following theorem characterizes the structure of the integrated inventory replenishment and hedging policies with vanilla derivatives. Theorem 1 (Forward Hedge). Assume that Ht contains at least a forward contract maturing at next period. For all t, (a) a myopic base-stock policy is optimal and the optimal base-stock level is obtained by solving the following sub-problem: { } max Et [πt ] − λα−(T +1−t) At (s)V ar[(y − dt )+ ] , t = 1, ..., T. y≥0 (10) In particular, if rt ≤ s, the least optimal inventory level yt∗ (s) = 0; otherwise, yt∗ (s) is the unique solution of the following equation: [ ] rt + ht − αft (s) − 2α−(T +1−t) λE[(y − dt )+ ]At (s) Φ̄t (y |s) − (s + ht − αft (s)) = 0. (11) (b) the optimal hedging portfolio consists of a position of shorting E[(yt∗ (s) − dt )+ ] units of forwards maturing at next period and a portfolio H̃t∗ which is the solution of the following subproblem: min V art [H̃t (St+1 ) + αT −t vt+1 (St+1 )] H̃t ∈Ht (12) (c) in particular, if only forward contracts maturing at next period are available, then H̃t∗ = −αT −t and ψt (y, s) = α 2(T −t) Covt [St+1 , vt+1 (St+1 )] [St+1 − ft (s)], V art [St+1 ] [ ] (Covt [St+1 , vt+1 (St+1 )])2 V art [vt+1 (St+1 )] − . V art [St+1 ] Theorem 1 characterizes the optimal policy in the presence of forward contracts. Specifically, part (a) shows that a myopic base-stock policy is optimal. Part (b) shows that the optimal hedging position of the forward contracts consists of two parts: Short E[(yt∗ (s) − dt )+ ] units of forward and long −αT −t Covt [St+1 ,vt+1 (St+1 )] V art [St+1 ] (short if it is negative) units of forwards that mature next period. The first part offsets the variation due to the excess inventory carried over to next period and the second part is to hedge against the risk associated with the future cash flow, vt+1 (St+1 ). The final hedging position is the combination of the two parts: ( ) ∗ ∗ + T −t Covt [St+1 , vt+1 (St+1 )] Ht = − E[(yt (s) − dt ) ] + α [St+1 − ft (s)]. V art [St+1 ] 14 It is a long (short) position if E[(yt∗ (s) − dt )+ ] + αT −t Covt [St+1 ,vt+1 (St+1 )] V art [St+1 ] is negative (positive). This theorem implies that, in the presence of the forward contract maturing at the next period, one could make the inventory decision independently, knowing that the risk associated with the expected excess inventory will be eliminated by a forward hedge. Then Jt can be rewritten as Jt (s) = α−(T +1−t) ḡt (s) − λα−2(T −t) ψ̄t (s) + Et [Jt+1 (St+1 )], { } ḡt (s) = max Et [πt ] − λα−(T +1−t) At (s)V art [(y − dt )+ ] , y≥0 [ ] ψ̄t (s) = min V art αT −t vt+1 (St+1 ) + H̃t (St+1 ) . H̃t ∈Ht (13) (14) That is, the joint optimization problem (7)-(9) can be decomposed into two sub-problems: One only involves inventory decisions and the other only involves hedging decisions. This decomposition allows us to compute the original dynamic program in four steps: (1) Derive the optimal myopic base-stock level for each period by solving the inventory-related sub-problem (13); (2) Compute the expected profit-to-go function vt for all the periods; (3) Compute the optimal portfolio H̃t∗ of (14) using the vt ; and (4) Obtain the optimal hedging portfolio by combing the short position related to the expected excess inventory and H̃t∗ . Note that although we restrict our attention to the forward contracts, it is straightforward to observe from (14) that this decomposition applies to the general setting where the hedging portfolio contains not just forwards but also other derivatives contracts (e.g., options). Remark 2. The optimality of the myopic base-stock policy relies on the existence of the forwards (maturing in next period). Without the forward contract maturing at next period, the risk associated with overage inventory cannot be perfectly replicated and therefore the determinant of the optimal base-stock level yt∗ (s) involves the profit-to-go vt+1 , which implies that the optimal base-stock policy may not be myopic any more and the joint optimization problem may not be decomposed into two separate sub-problems. This decomposition has an important practical implication. In practice, inventory decisions are made by operations managers while the financial hedging decisions are made by financial managers. Our result suggests that the management can allow the operations manager to manage the inventory decision separately and myopically. But, when making the hedging decisions, the risk manager must take into account the optimal inventory policy to forecast the resulting future cash flow. Therefore, the financial hedges simplify the operations manager’s task and help clarify the relationship between operations management and financial risk management within an organization. The following corollary is a direct application of Theorem 1. 15 Corollary 1. Suppose the demand is perfectly correlated to the spot price and has the specification dt = at + bt s for constants at , bt such that dt ≥ 0. Then, for t = 1, · · · , T , (a) yt∗ (s) = a + bs if rt > s and yt∗ (s) = 0 if rt ≤ s. ∑T (b) vt (s) = Et [ i=t αi−t (rt − Si )+ (ai + bi Si )]. (c) If only the forwards contracts maturing at next period are available, then the optimal hedging portfolio is Ht∗ = − Covt [St+1 , αT −t vt+1 (St+1 )] [St+1 − ft (s)]. V art [St+1 ] (d) If options contracts maturing at any future periods are available and b = 0, i.e., dt = a, then the optimal hedging strategies is to short ai units of put options with strike price rt maturing at future periods i = t + 1, ..., T . Corollary 1 shows that when the demand is perfectly correlated to the spot price, i.e., only price risk exists, the ordering quantity of each period is simply to order the demanded quantity. But vt (s) is still nonlinear in s (due to the term (rt − St )+ ), which implies that the price risk cannot be perfectly hedged by forward contracts. For deterministic demand, the price risk can be perfectly hedged by entering the short positions in put options. We next examine the impact of risk-aversion on the inventory policy. Let yt∗ (s; λ) be the optimal inventory decision with respect to λ. The following corollary shows the effect of risk aversion on the optimal inventory decision. Corollary 2. For all t, yt∗ (s; λ) is decreasing in λ. This result shows that the more risk averse the decision maker is the less inventory the firm orders, and thus the fewer forwards to short. In particular, as λ = 0, the yt∗ (s; λ) reduces to a newsvendor solution (if rt > s): yt∗ = Φ̄−1 t ((s + ht − αft (s))/(rt + ht − αft (s))). Let πt∗ , Ut∗ , Et [WT∗+1 ] and V art [WT∗+1 ] be the period-t expected operational profit, mean-variance, the expected terminal wealth and variance of the terminal wealth under the optimal policy. We have the following intuitive result. Corollary 3. For all t, E[πt∗ ], Ut∗ , Et [WT∗+1 ] and V art [WT∗+1 ] are all decreasing in λ. 16 4.2. Hedging with Continuum of Option Strikes In financial markets, since option contracts traded in exchanges normally have limited number of strikes, exotic derivatives are structured to develop more sophisticated hedging or trading strategies. In financial literature, it is common to assume that there exists a continuum of options of all strikes as a proxy to a frictionless financial market. As argued by Carr and Madan (2001), the assumption of a continuum of strikes is essentially the counterpart of the standard assumption of continuous trading, which serves as a reasonable approximation to a market environment where there is a large but finite number of European options strikes (e.g., options for S&P 500). In a single-period hedging model, they show that a twice-continuously differentiable payoff function can be replicated by a portfolio that consists of a spectrum of European put and call options with all strike prices, forwards and risk-less bonds. This type of derivative contracts belong to the exotic derivative contracts. In the following analysis, we will follow Car and Madan’s (2001) approach to construct the optimal exotic hedging strategies. Note that Car and Madan’s (2001) approach requires the payoff function to be twice differentiable. To employ their approach, we assume that the spot price satisfies the following condition. Assumption 3. For all t and any real function v(·) which is twice differentiable almost everywhere, the conditional expectation E[v(St+1 )|St = s] is twice differentiable in s almost everywhere. For example, for the geometric Ornstein-Uhlenbeck process (also called Schwartz’s One-Factor model, Schwartz 1997): d ln S = κ(α∗ − ln S)dt + σdz ∗ (t) (15) where κ is the mean-reverting rate, α∗ = η − σ 2 /2 − γ is the risk adjusted long-run mean (η is the long-run mean and γ is the market price of risk) and z ∗ is a Brownian motion under the risk neutral measure. Assume the length of each period is equal to ∆. It follows from Schwartz (1997) that the discrete-time dynamics of St+1 are of the form ( ( ) ) St+1 = exp α∗ 1 − e−κ∆ + e−κ∆ ln St + ϵt+1 , ) ( √ 1−e−2κ∆ σ . where ϵt ∼ N 0, 2κ (16) ′′ Apparently, for any twice differentiable function v with finite E[v ′ (S)] and E[v (S)], v(St+1 ) is twice differentiable in St and then the conditional expectation E[v(St+1 )|St = s] = [ ( −κ∆ )] E v se exp (α∗ (1 − e−κ∆ ) + e−κ∆ + ϵt+1 ) is twice differentiable in s. We now proceed to present our key result of this section. Theorem 2 (Exotic Hedge). Suppose that there exist forwards and European calls and puts of all strikes, and Assumption 3 holds. For all t, 17 (a) the myopic base-stock policy is optimal. If rt ≤ s, the least optimal base-stock level yt∗ (s) = 0; otherwise, yt∗ (s) is the unique solution of the following FOC (first-order-condition): [ ] rt + ht − αft (s) − 2α−(T +1−t) λE[(y − dt )+ ]At (s) Φ̄t (y |s) − (s + ht − αft (s)) = 0. (17) ′ (b) the optimal hedging portfolio is to short Et [(yt∗ (s) − dt )+ ] + αT −t vt+1 (s) units of forwards, ′′ ′′ αT −t vt+1 (K)dK puts for all strikes K < s and αT −t vt+1 (K)dK calls for all strikes K > s maturing at next period. That is, [ ′ ∗ Ht (St+1 ) = − (Et [(yt∗ (s) − dt )+ ] + αT −t vt+1 (s))[St+1 − ft (s)] ] ∫ s ∫ ∞ T −t ′′ + T −t ′′ + + α vt+1 (K)[(K − St+1 ) − Pt (K)]dK + α vt+1 (K)[(St+1 − K) − Ct (K)]dK , 0 s where Pt (K) = Et (K − St+1 )+ and Ct (K) = Et (St+1 − K)+ . Theorem 2 characterizes the structure of the optimal joint inventory control and hedging strategies with complete term structure. Part (a) shows that the optimal ordering policy is described by a myopic state-dependent base-stock level, yt∗ (s). In any period t = 1, · · · , T , if rt > s, then order up/sell down to yt∗ (s); otherwise, no order is placed. The ordering policy is myopic since the optimal base stock level in a period does not depend on the decisions to make in the future. This significantly simplifies the computation of the optimal solution. Part (b) describes the structure of the optimal hedging strategy. The hedging portfolio consists of two parts that are associated with the inventory decision of current period and the profit-togo function of the next period, respectively. First, the optimal hedging strategy requires shorting Et [(yt∗ (s) − dt )+ ] units of forwards that mature next period. The intuition behind this is as follows. In the first part, given the order-up-to inventory level, yt∗ (s), the expected inventory carried over to next period is Et [(yt∗ (s) − dt )+ ]. Thus, the associated risky payoff is Et [(yt∗ (s) − dt )+ ]αSt+1 ] with the variance being (Et [(yt∗ (s) − dt )+ ])2 V art (St+1 ). Shorting Et [(yt∗ (s) − dt )+ ] units of forwards offsets the variation caused by the excess inventory carried over to next period. The second part of the hedging strategy is related to vt+1 (St+1 ), the profit-to-go function which represents the expected total profit from next period. We follow Carr and Madan (2001) to decompose the profit-to-go function into three components which can be interpreted by the portfolio of ′ (s) the forwards and options. Specifically, the optimal hedging decision requires to short αT −t vt+1 ′′ units of forward (in addition to those related to inventory decision), αT −t vt+1 (K)dK puts for all ′′ strikes K < s and αT −t vt+1 (K)dK calls for all strikes K > s maturing at next period. As a result, under the optimal hedging strategy, the next-period expected profit vt+1 (St+1 ) is completely offset by the proposed portfolio. 18 Now, under the optimal hedging strategies, the equation (8) is equal to gt (y, s) = Et [πt ] − λα−(T +1−t) At (s)V ar[(y − dt )+ ], where the second term involves the variance associated with dt . This implies that even with complete term structure of the financial instruments, we are not able to completely hedge all the risk. In fact, we can only mitigate risks related to market spot price (i.e., St+1 Et [(yt∗ (s) − dt )+ ] and vt+1 (St+1 )) with the financial instruments. The risk associated with the demand is hedged with physical inventory. This demonstrates the different roles of inventory and financial hedge in managing the demand and market price risks. 5. Inventory Hedges versus Financial Hedges This section discusses the setting without the financial hedges, namely inventory hedge. Without the financial hedges, for t = 1, ..., T , the optimality equation becomes Jt (s) = max {α−(T +1−t) ĝt (y, s)} + Et [Jt+1 (St+1 )], y≥0,z≥0 (18) where [ ĝt (y, s) = Et [πt ] − λα −(T +1−t) ] At (s)V ar[(y − dt ) ] + α ψt (y, s) , + 2 ψt (y, s) = V art [St+1 Et [(y − dt )+ ] + αT −t vt+1 (St+1 )]. (19) (20) Let ŷt∗ be the optimal inventory decision without the financial hedges. We first analyze the last period’s problem. Proposition 3. In period T , without financial hedges, (a) if rt ≤ s, the optimal inventory decision ŷT∗ (s) = 0; otherwise, the optimal inventory decision ŷT∗ (s) satisfies the following first-order condition: [ ] 0 = rt + hT − αfT (s) − 2α−1 λE[(y − dT )+ ]AT (s) Φ̄T (y, s) − (s + hT − αfT (s)) −2λαV arT [ST +1 ]ΦT (y, s)ET [y − dT ]+ . (21) (b) in particular, when rt > s, ŷT∗ (s) < yT∗ (s), i.e., the unhedged inventory level is lower than hedged inventory decision. Part (a) characterizes the optimal inventory solution. Part (b) states that the financial hedges lead to a higher inventory level compared to the system without financial hedges and therefore provides a better service level. This result is consistent with the numerical findings in Kouvelis et al. (2013) that the financial hedges increase customer service level by increasing the inventory level. In other 19 words, the inventory hedges are complementary to the financial hedges. Similar insights are also found in newsvendor models (see, e.g., Gaur and Seshadri 2005 and Gao et al. 2011). Although the relationship between inventory and financial hedges in the last period are essentially the same as that in the newsvendor model, it may be different in the previous periods. The following proposition gives a condition that the hedged inventory decision is less than the unhedged. Proposition 4. For any period t = 1, · · · , T − 1, suppose that yt∗ (s) > 0. Then, ŷt∗ (s) > yt∗ (s) if and only if E[(yt∗ (s) − dt )+ ] < −αT −t Covt (St+1 , vt+1 (St+1 )) . V art (St+1 ) This proposition shows that the optimal unhedged inventory decision may not necessarily be less than the optimal inventory decision under financial hedges when the spot price is negatively correlated to the expected next-period profit-to-go (as a function of the next period’s price). Intuitively, the expected profit is a decreasing function of the purchasing price and thus the profit-to-go is likely to be negatively correlated to the spot price. To justify this intuition, we consider the following special case. Proposition 5. If the demand is perfectly negatively correlated with the spot price with the form s+ht −αft dt = a − bs, a > 0, b > 0, then ŷt∗ > dt if and only if λ < − 2αCovt (S . t+1 ,vt+1 (St+1 )) ∑T Note that vt (s) = Et [ i=t αi−t (rt − Si )+ (ai − bi Si )] which is a strictly decreasing function of St = s. Then it is clear that St+1 and vt+1 (St+1 ) are negatively correlated, i.e., Covt (St+1 , vt+1 (St+1 )) < 0. This proposition shows that, when the demand is perfectly negatively correlated with spot price and the risk-averse degree is sufficiently high, the unhedged strategy orders more inventory in period t to hedge the future inventory risk. Recall that, in the presence of financial hedges, the optimal inventory decision is to order dt . The discussion above sheds new light into the relationship between physical inventory and financial hedges: Financial hedges can be a substitute of physical inventory in a multi-period inventory system, whereas it is a complement to the inventory in a single-period inventory system where financial hedges lead to a higher inventory and thus a higher service level. This is the key difference between the storable commodities (e.g., oil and gas) and perishable commodities (e.g., electricity). The implication gained from the newsvendor model may no longer apply to the dynamic model of storable commodity inventories though in conventional inventory models many results found in newsvendor model generally hold in the corresponding dynamic models. This justifies the importance of explicitly addressing the dynamic hedging and inventory control problem. 20 6. How Financial Hedges Create Value A conventional thought is that hedging under no arbitrage condition does not increase or decrease profit. Recall that for a fixed λ, exotic hedges have the same expected terminal wealth and inventory level as that of forward hedges, though the variance of exotic hedges is smaller. For convenience, we use use superscripts e, f and u to indicate these strategies (or market profiles): hedging with forwards and options with all strikes (exotic hedges), hedging with forwards only (forward hedges) and unhedged inventory policy. The following lemma compares the meanvariance utilities, expected terminal wealth and variances under difference hedging strategies for the same λ. The proof is straightforward and is therefore omitted. Lemma 1. For any given λ, wealth and price levels at time t, Ut∗e ≥ Ut∗f ≥ Ut∗u , but Et [WT∗e+1 ] = Et [WT∗f+1 ] and V art [WT∗e+1 ] ≤ V art [WT∗f+1 ]. Note that the risk-averse parameter λ reflects the risk tolerance level of a decision maker. A higher risk tolerance implies a smaller λ. In practice, instead of giving λ directly, one normally specifies the maximum variance level that he/she can tolerate while maximizing the expected return, i.e., maxω∈U E0 WTω+1 , subject to V ar [W ω ] ≤ δ 0 T +1 where ω is a policy and U is the set of feasible policies, and δ > 0 is a given variance level. Let λ be the Lagrange multiplier. Then the problem becomes min max{E0 WTω+1 − λ[V ar0 [WTω+1 ] − δ]}. λ ω (22) For any i ∈ {e, f, u}, let λ∗i be the corresponding optimal Lagrange multiplier derived from the Lagrange problem (22) Let δ0 be the variance of the terminal wealth under the risk-neutral solution and WT0+1 the expected risk-neutral terminal wealth. The following theorem characterizes the efficient frontier of the three hedging strategies. Theorem 3. Given the same wealth level and spot price at the beginning of the planning horizon, and a tolerance level δ < δ0 , the following assertions hold. (a) Exotic hedges dominate forward hedges which then dominate unhedged inventory policies, i.e., U0∗e ≥ U0∗f ≥ U0∗u . (b) For any i ∈ {e, f, u}, λ∗i > 0 and V ar0 [WT∗i+1 ] = δ but E[WT∗e+1 ] ≥ E[WT∗f+1 ] ≥ E[WT∗u+1 ]. 21 (c) λ∗i is decreasing in δ and then E[WT∗i+1 ] is increasing in δ. In particular, as δ → δ0 , E[WT∗i+1 ] converges to risk-neutral expected terminal wealth WT0+1 . (d) λ∗e ≤ λ∗f and thus yt∗e ≤ yt∗f . Theorem 3 shows that the efficient frontiers are increasing and converge to the expected riskneutral terminal wealth level (see parts (a) and (c)). The efficient frontier under exotic hedge dominates that under forward hedge and the efficient frontier under forward hedge dominates that of unhedged inventory policies. Part (b) and (d) imply that for any given tolerance level, the exotic hedges provide a higher expected terminal wealth level than forward hedges do; and the forward hedges in turn provide a higher expected terminal wealth level than unhedged inventory policies do. This indicates that financial hedging is profitable. Finally, part (d) shows that the complete hedges yield a higher inventory level (service level) than the forward hedge. These results have important practical implications. First, the risk-aversion parameter λ is rarely specified directly in practice. Instead, it is natural to identify the risk-tolerance level δ, which then determines the selection of λ as a Lagrange multiplier to penalize the variance. From this perspective, one can argue that the risk-aversion parameter can be endogenously selected. Second, for any given risk-tolerance level, exotic hedges dominate the vanilla (forward or simple option) hedges and the optimal selection of the Lagrange multiplier binds the variance constraint, which implies that the expected terminal wealth level under exotic hedges is higher than that under a less complete hedge. Third, exotic hedges yield a higher inventory level than forward hedges do, which also explains why hedging can be profitable. Fourth, exotic hedges have a lower risk-aversion parameter λ than forward hedges do, which implies that compared to forward hedges, exotic hedges allow one to be less sensitive to the risk and therefore to be able to gain more profit. 7. Numerical Example This section presents a numerical example to demonstrate and compare the structures of the optimal integrated financial hedging and replenishment policies and their mean and variance performances under different hedging strategies: (1) forward hedges, (2) exotic hedges, and (3) no (financial) hedge. Specifically, we consider a two-period model, i.e., T = 2, with the length of each period being ∆ = 1. Assume that the risk-adjusted spot price process follows the Geometric Mean-Reverting process (16). We use the calibration of Schwartz (1997) for crude oil price to specify the parameters of the price process: κ = 0.428, µ = 2.991, σ = 0.257, γ = 0.002. The demand is expressed as dt = µd + ϵt where µd is the mean of the demand and the error terms ϵt are i.i.d. normally distributed with mean 0 and variance σd2 . This implies that the demand is 22 normally distributed with mean µd and variance σd2 . Set µd = 10 and σd = 5. The initial wealth level is assumed to be zero. The other cost and model parameters are rt = 50, ht = 5, α = 0.9 and λ = 0.01. The optimal policy is computed by applying the value iteration approach on the corresponding optimality equations. The results are demonstrated in figures (1)-(3). Figures 1 demonstrates the optimal order-up-to levels with or without financial hedges. Recall that as long as the forward contracts are used, the risk associated with the inventory carried over to the next period is perfectly hedged and the firm solves the same optimization problem regardless what other derivative contracts are used. Thus, the inventory decisions under forward hedges and exotic hedges are identical. When comparing the hedged and unhedged inventory decisions, one can see the relationship between the hedged and unhedged inventory order-up-to levels changes when moving from the last period to the previous periods: In the last period t = 2, the hedged order-up-to level is greater than the unhedged order-up-to level whereas in period 1 the hedged is smaller than the unhedged. In other words, financial hedge increases the order-up-to level in the last period but decreases the order-up-to level in the first period. Recall that newsvendor models predict that financial hedges induce the firm to order more and thus financial hedges and inventory are complements to each other (see, e.g., Gaur and Seshadri 2005). However, in a dynamic system like ours, such a prediction is no longer true. In fact the opposite relationship is more likely to hold in periods before the last period. This can be explained by the relationship between the profit-to-go function vt (s) and the spot price s. As shown in Figure 3, the profit-to-go functions are decreasing in the spot price, which implies that in each period t < T the next period’s price St+1 and the profit-to-go vt+1 (St+1 ) tend to be negatively correlated. Since the negative correlation implies that to offset the variation of the expected future cash flow it is optimal to enter a long position in forward (or to buy forward contracts) if forwards are used, when there is no financial hedge, the inventory carried over to the next period can play the role of forward hedges. That is, the financial hedges and inventory are substitutes for each other. Figure 2 demonstrates the hedging solutions. Note that the positive (negative) sign indicates a long (short) position. We first discuss the solution under the forward hedge. In the last period, the optimal hedging policy should offset the risk associated with the excess inventory, which implies a short position. Thus the sign is negative. But in the previous period, the hedging positions are all positive. This echoes previous discussion that the negative correlation between the spot price and profit-to-go suggests that the forward hedger should enter a long position. More over, the forward hedgers will buy (long) more in period 0 than in period 1. This can be explained by the fact that the downward slop of v1 is greater than v2 (the change of v1 is greater the price increases from 1 to 40), i.e., the St+1 and Vt+1 are more negatively correlated in period 0. Under the exotic hedge, we have the similar observations. Comparing the forward positions under the forward hedge and 23 optimal order−up−to level (t=2) 7 6.5 6.5 2 6 order−up−to level y* order−up−to level y*1 optimal order−up−to level (t=1) 7 5.5 5 4.5 4 3.5 3 0 unhedged hedged 10 6 5.5 5 4.5 unhedged hedged 4 20 spot price s 30 Figure 1 3.5 0 40 10 20 spot price s unhedged futures hedge perfect hedge 30 40 Ordering decisions exotic hedge at the same period, one can observe that the forward positions under the forward hedge are higher than that under the exotic hedge. This is can be explained by the fact that some of the risks are hedged by the options. Recall that only the period 0 and period 1 need the options. Different from the corresponding forward positions, the period 0 has a lower position than period 1, implying that it is more likely to enter a short position in period 0 than period 1. Note that the short position is due to the convexity of the profit-to-go function (the second order derivative is positive). In general, compared to the forward position, the number of options to long or short is relatively smaller, which can be explained by the fact the profit-to-go functions are decreasing almost linearly. T−t ’’ options hedging curve of strikes α hedging positions of forwards 14 v (K) 0.8 forward hedge (t=0) 12 0.6 10 0.4 number of options forward hedge (t=1) 8 6 exotic hedge (t=0) 4 2 0 −2 0 30 Figure 2 exotic hedge (t=0) −0.4 forward hedge and exotic hedge (t=2) 20 spot price s exotic hedge (t=1) 0 −0.2 exotic hedge (t=2) 10 0.2 40 −0.6 0 Hedging decisions 10 20 strike price K 30 40 24 4 x 10 variance of terminal wealth (t=0) 4 0 460 variance V terminal wealth level v0 expected terminal wealth (t=0) 470 450 440 430 420 20 22 24 26 28 3 2 1 20 30 400 350 26 28 30 4 x 10 variance of terminal wealth (t=1) 2 1.5 300 20 22 24 26 28 1 20 30 expected terminal wealth (t=2) 22 24 26 28 30 variance of terminal wealth (t=2) 200 unhedged hedged unhedged futures hedge exotic hedge 2 6000 variance V terminal wealth level v2 24 1 2.5 variance V terminal wealth level v1 expected terminal wealth (t=1) 450 22 150 5000 4000 100 20 22 24 26 spot price s Figure 3 28 30 20 22 24 26 spot price s 28 30 Means and variances of terminal wealth Finally, we discuss the mean and variance performances under hedging or unhedged strategy. As mentioned above, the left panels of Figure 3 show that the profit function is decreasing in the spot price. The profits under the forward hedges and exotic hedges are identical and they are very close to the profit without hedges. In the last period, the hedged profit is slightly higher than the unhedged profit but the opposite is observed in the previous periods. This can be explained as follows: The order-up-to level under financial hedges in the last period is higher (and thus the profit is closer to the risk neutral one). But in the previous period the order-up-to levels under financial hedges is lower, which leads to lower profits. Nevertheless, in this example, the difference in profit is very small. The right panels of Figure 3 show that in periods 0 and 1 the unhedged variances are greater than the variances under forward hedges and the later are greater than that under exotic hedges. In the last period, the forward hedge and exotic hedge have the same variance whereas the unhedged 25 variance is smaller than the hedged variance. This is due to the different optimal inventory decisions under hedged and unhedged strategies. In summary, this example demonstrates and compares the structural relationships between inventory control and financial hedging under different hedging strategies. The dynamic system does behave differently from the static models such as the newsvendor model and the difference between them sheds new light into the commodity inventory management. 8. Extensions This section extends the analysis for the basic lost-sales case to several important extensions: backorder, price-sensitive demand, and transaction costs. In these extensions, we focus on examining the relationship between the inventory control and financial hedges. 8.1. Backorders In many B2B businesses, unmet demand is often backlogged due to the contractual obligation to meet all the demand, which may incur backlog costs. Let bt be the unit backlog cost of period t. Assume that each backorder occurs in a period must be met in the next period, which is subject to the contractual arrangement. That is, if there is backlog at the beginning of a period, the firm will order at least as much as the backlog to meet the backlog even when the spot price is high. Then the inventory order-up-to level of each period can be any nonnegative value, which implies that the profit-to-go function vt+1 only depends on spot price. The backordering model yields the same Bellman equation as (7) except that the operational profit of period t, πt , is replaced by πt (yt , St , St+1 ) = rt dt − St yt − ht (yt − dt )+ − bt (dt − y)+ + αSt+1 (yt − dt ), (23) Hence, the analysis of the lost-sales model applies directly to the backordering case and similar insights can be obtained. 8.2. Price-Sensitive Demand In the preceding analysis, the demand is exogenously given. However, in many industries, especially B2C businesses, customers may have certain degree of price sensitivity though the underlying products are commodities or made of commodity materials (e.g., retail gasoline and heating oils, food, etc.) . This section aims to extend the preceding analysis to the case of price-sensitive demand. Specifically, we assume that the demand dt is of the additive form: dt = µt (r, s) + ϵt , where µt (r, s) is a deterministic and strictly decreasing function of the price r and dependent on the spot price, and ϵt is a random variable with zero mean. The distribution function of ϵt can be 26 dependent of the spot price of period t, St . The price level is chosen from an interval [r, r]. Then the operational profit in period t is πt (r, y, St , St+1 ) = (r − St )y − (r + ht − αSt+1 )(y − dt )+ . The profit-to-go function vt depends only on the spot price. Following the analysis of Section 3.2, we can obtain the Bellman equation: For t = 1, ..., T , Jt (s) = max {α−(T +1−t) gt (r, y, s)} + Et [Jt+1 (St+1 )]}, (24) y≥0,r∈[r,r] where −(T +1−t) [ ] + 2 At (r, s)V art [(y − dt ) ] + α ψt (r, y, s) , gt (r, y, s) = E[πt ] − λα { } + T −t ψt (r, y, s) = min V art [St+1 E[(y − dt ) ] + α vt+1 (St+1 ) + Ht (St+1 )] , Ht ∈Ht (25) where At (r, s) = Et [(r + ht − αSt )2 ]. Then, similar to the analysis of Section 4, the inventory and pricing decisions can be isolated as long as a forward maturing at next period is in used (with a short position of E[(y − dt )+ ] for the chosen price and inventory levels). Hence, the pricing and inventory decisions can be obtained by simply solving the myopic joint pricing and inventory decision problems for all periods: max {E[πt ] − λα−(T +1−t) At (r, s)V art [(y − dt )+ ]}. y≥0,r∈[r,r] Letting y = z + µt (r, s), the joint pricing and inventory decision optimization can be written as { } + −(T +1−t) + max max{(r − s)(zt + µt (r, s)) − (r + ht − αSt+1 )(z − ϵt ) − λα At (r, s)V art [(z − ϵt ) ]} . r∈[r,r] y≥0 Clearly, for any fixed r, the objective function is unimodal in y and thus the optimal ordering solution satisfies the first order condition. The optimal price can be searched linearly. Using the optimal pricing and inventory decisions, the hedger can estimate the expected future cash flows and obtain the profit-to-go function vt . Finally, the the optimal hedging decision can be obtained by solving the problem (25). Thus, the insight obtained in Section 4 can be strengthened by incorporating the pricing decisions: In the presence of financial hedges (in particular, the forward contract maturing at next period), both the pricing and inventory decisions can be obtained myopically, which implies that the marketing (pricing) and operational (inventory) decisions can be made independently. But the financial hedging decision must take into account of the inventory and pricing decisions which influence the prediction of future cash flows. 27 8.3. Transaction Costs Trading in the spot market often incurs transaction costs for both buying and selling activities. The transaction costs include the market cost (bid/ask spread) and operational cost (e.g., transportation cost). Our analysis can be extended to include all relevant variable transaction costs. Let kb and ks be the unit transaction costs for purchases or sales in the spot market. In the presence of the transaction costs, it is no longer costless to sell all the on-hand inventory to the market and then buy up to the optimal inventory level. Then, at the beginning of each period, the on-hand inventory does influence the optimal inventory decision. But, for consistency, we still represent the operational profit of period t in a similar way of Section 3.2: πt (xt , yt , St , St+1 ) = (rt − St )yt − (rt + ht − αSt+1 )(yt − dt )+ − kb (yt − xt )+ − ks (xt − yt )+ . Then the Bellman equation is Jt (x, s) { = max Et [Jt+1 (y − dt , St+1 )] + α−(T +1−t) Et [πt ] y≥0 −α −2(T +1−t) λEt [V ardt [πt + α T +1−t } vt+1 ((y − dt ) , St+1 )|St+1 ]] + (26) − α−2(T +1−t) λ min V art [αSt+1 Edt [(y − dt )+ ] + αT −t Edt [vt+1 ((y − dt )+ , St+1 )] + Ht (St+1 )],(27) Ht ∈Ht and the profit-to-go, vt (x, s), has the following dynamics vt (x, s) = Et [α−(T +1−t) ] + Et [vt+1 (yt∗ − dt , St+1 )] where the the terminal profit function is vT +1 (x, s) = x(s − ks )+ − xs = −x min(s, ks ). In the objective of the optimal hedging decision problem, since the profit-to-go is also a function of y, the risk associated with the excessive inventory Edt [(y − dt )+ ] cannot be perfected hedged by simply entering a short position in forwards. To isolate the inventory decision problem from the joint optimization, we need sophisticated derivative portfolio. Under similar conditions to subsection 4.2, we can use forwards and options of all strikes (exotic derivatives) to replicate the expected profit-to-go Edt [vt+1 ((y − dt )+ , St+1 )] as a function of y and St+1 . Note that the expectation is taken on the demand dt . Then, the optimal inventory decision can be obtained by solving the following Bellman equation: { Jt (x, s) = max Et [Jt+1 (y − dt , St+1 )] + α−(T +1−t) Et [πt ] y≥0 −α −2(T +1−t) λEt [V ardt [πt + α T +1−t } vt+1 ((y − dt ) , St+1 )|St+1 ]] . + 28 Thus, although the myopic inventory policy is no longer optimal when there are transaction costs, the inventory decision can still be isolated from the financial hedging decision. With the optimal inventory policies derived from the above dynamic program, one can then evaluate and replicate the profit-to-go as a function of price and the current inventory decision. The above discussion implies that if there are sufficient derivative instruments in the market then the operations manager can make inventory decisions separately whereas the risk manager must use the inventory policy to predict the future cash flows. This concurs with insight previously obtained. 9. Concluding Remarks This paper addresses the joint inventory and financial hedging decision problem for a commodity inventory system with lost sales. We provide a novel dynamic programming formulation which allows us to derive a time-consistent optimal policy. We characterize the structure of the optimal policies under both forward hedges and exotic hedges. We show that as long as the forward contract is used the inventory decisions can be separated from the financial hedging decisions and myopic base-stock policies are optimal. However, to derive the optimal hedging decision, one needs to evaluate the profit-to-go based on the optimal (state-dependent) base-stock levels in the future. When there exist forwards and options of all strikes, a hedging portfolio can be constructed to replicate the profit-to-go, which allows us to separate the inventory and financial decision in a very general context. We also show that different from those newsvendor models in literature, financial hedges may lead to lower order-up-to level. We further show that these results hold in rather general frameworks (e.g., backlog, price-dependent demand and transaction cost). These results provide new insights into the coordinated risk and operational management strategies. Firstly, operational (inventory) decisions and financial hedging decisions interplay with each other. On the one hand, financial hedges allow us to decompose a dynamic inventory decision problem into a sequence of myopic decision problems, which significantly simplifies the inventory decision process. On the other hand, financial hedging decisions rely on inventory strategies, since the future cash flow depends on the inventory policies. Although some of these points have been shown by Kouvelis et al. (2013), our results show that they hold in more general frameworks even when myopic inventory policies may no longer be optimal (e.g., when there are transaction costs). In real business world, the operational decisions and financial decisions are often made separately, partially because of the difficulty in coordinating the decisions between operational and financial managers. Our model suggests that the operational managers can still enjoy their independence (of the financial decisions) while financial managers must be able to understand the cash flow implications of operational strategies. 29 Secondly, financial hedges could be substitutes to inventory. In the single-period newsvendor model, or the last period of the dynamic model, the financial hedges and inventory levels are complementary. However, in the previous periods of the dynamic model, financial hedges may lead to lower inventory levels, which implies that the financial hedges could be a substitute for inventory. In practice, a firm tends to order more inventory in anticipation of a higher price in the future. Our result suggests that financial derivatives are better instruments to hedge the future price risk, while the inventory decisions focus on the current demand (quantity) risk which cannot be hedged by financial instruments. Thirdly, financial hedging does create values. We show that given a risk constraint, financial hedges lead to a higher terminal wealth. This is also quite intuitive: Financial hedging reduces the risk, which allows the firm to focus more on its value maximization decisions. In summary, integrating operational and financial risk management strategies is the best way to address comodity risk in an uncertainty demand environment. Our analysis framework provides us a useful conceptual tool to explore these insights. References Agrawal, V., S. Seshadri. 2000. Impact of uncertainty and risk aversion on price and order quantity in the newsvendor problem. Manufacutring Service Oper. Management 2(4) 410–423. Basak, S., G. Chabakauri. 2011. Dynamic hedging in incomplete markets: A simple solution. Forthcoming in Review of Financial Studies. Basak, S., G. Chabakauri. 2010. Dynamic mean-variance asset allocation. Review of Financial Studies, 23(8) 2970–3016. Berling, P., V. Martı́nez-de-Albéniz. 2011. Optimal inventory policies when purhcase price and demand are stochastic. Oper. Res., Articles in Advance, 1–16. Bodnar, G. M., G. S. Hayt, R. C. Marston. 1998. 1998 Wharton Survey of Financial Risk Management by U.S. Non-Financial Firms. Financial Management 27 70-91. Bouakiz, M., M.J. Sobel. 1992. Inventory control with an exponential utility criterion. Oper. Res. 40 603–608. Brown, G. W., K. B. Toft. 2002. How firms should hedge. Review of Financial Studies 14 1283–1324. Briys, E., M. Bellalah, H. M. Mai, F. D. Varenne. 1998. Options, Futures, and Exotic Derivatives: Theory, Application, and Practice, John Willey & Sons, New York. Caldentey, R., M. Haugh. 2006. Optimal control and hedging of operations in the presence of financial markets. Math. Oper. Res. 31(2) 285–304. Caldentey, R., M. Haugh. 2009. Supply contracts with financial hedging. Oper. Res. 57(1) 47–65. Carr, P., D. Madam. 2001. Optimal positioning in derivative securities. Quantative Finance 1(1) 19–37. 30 Chen, X., M. Sim, D. Simchi-Levi, P. Sun. 2007. Risk aversion in inventory management. Oper. Res. 55(5) 828–842. Chod J., N. Rudi, J. A. Van Mieghem. 2010. Operational flexibility and financial hedging: Complements or substiutes. Management Sci. 56(6) 1030–1045. Ding Q., L. Dong, P. Kouvelis. 2007. On the integration of production and financial hedging decisions in global markets. Operations Research 55(3) 470–489. Eeckhoudt, L., C. Gollier, H. Schlesinger. 1995. The risk-averse (and prudent) newsboy. Management Sci. 41(5) 786–794. Gaur V., S. Seshadri 2005. Hedging inventory risk through market instruments. Manufacturing & Service Operations Management 7(2) 103–120. Geman, H., S. Ohana. 2008. Time-consistency in managing a commodity portfolio: A dynamic risk measure approach. J. of Banking and Finance 32 1991–2005. Gao, F., F.Y. Chen, X. Chao. 2011. Joint optimal ordering and weather hedging decisions: mean-CVaR model. J. Flexible Services and Manufacturing 23(1) 1–25. Kleindorfer, P. R., L. Li. 2005. Multi-period VaR-constrained portfolio optimization with applications to the electric power sector. Energy Journal 26(1) 1–26. Kleindorfer, P. R. 2008a. Integrating physical and financial risk management in supply management. Book chapter in Risk Management for Commodity Markets edited by H. Geman. Kleindorfer, P. R. 2008b. Hedging commodity risks in supply management. Note for Encyclopedia of Quantitative Finance edited by Rama Cont, forthcoming from J. Wiley & Sons, New York. Kleindorfer, P. R., D. J. Wu. 2003. Integrating long- and short-term contracting via business-to-business exchanges for capital-intensive industries. Management Sci. 49(11), 1597–1615. Kouvelis, P., R. Li, Q. Ding. 2013. Managing storable commodity risks: Role of inventories and financial heges. Forthcoming in Manufacturing Service Oper. Management. Li, D., W. L. Ng. 2000. Optimal dynamic portfolio selection: Multiperiod mean-Variance formulation. Mathematical Finance 10 387–406. Markowitz, H. M. 1952. Portfolio selection. Journal of Finance 7 77–91. Martı́nez-de-Albéniz, V., D. Simchi-Levi. 2005. A portfolio approach to procurement contracts. Production Oper. Management 14 90–114. Nagali, V., J. Hwang, D. Sanghera, M. Gaskins, M. Pridgen, T. Thurston, P. Mack- enroth, D. Branvold, P. Scholler, G. Shoemaker. 2008. Procurement Risk Management (PRM) at Hewlett-Packard Company. Interfaces 38(1) 51–60. Oum, Y., S. Oren, S. Deng. 2006. Hedging quantity risks with standard power options in a competitive wholesale electricity market. Naval Research Logistics 53 697–712. 31 Oum, Y., S. Oren. 2006. Optimal static hedging of volumetric risk in a competitive wholesale electricity market. Decision Analysis 7(1) 107–122. Rolfo J. 1980. Optimal hedging under price and quantity uncertainty: The case of cocoa producer. Journal of Political Economy 88(1) 100–116. Routledge, B. R., D. J. Seppi, C. S. Spatt. 2000. Equilibrium forward curves for commodities. Journal of Finance 55(3) 1297–1338. Schwartz, E.S. 1997. The stochastic behavior of commodity prices: Implications for valuation and hedging. Journal of Finance 52(3) 923–973. Schwartz, E.S., and James E. Smith. 2000. Short-term variations and long-term dynamics in commodity prices. Management Science 46 893–911. Smith, J. E., R. F. Nau. 1995. Valuing risky projects: Option pricing theory and decision analysis. Management Sci. 51(3) 452-466. Sobel, M. 1994. Mean-variance tradeoffs in an undiscounted MDP. Oper. Res. 42 175–183. Topkis, D. M. 1998. Supermodularity and Complementarity, Princeton University Press. Weiss, N. 2005. A Course in Probability, Addison-Welsley. Williams, J.C., and B.D. Wright, 1991. Storage and Commodity Markets. Cambridge: Cambridge Unversity Press. Working, H. 1948. The theory of the price of storage. American Economic Review 39 1254–1262. Zhu, W., R. Kapuscinski. 2011. Optimal operational versus finacnial hedging for a risk-averse firm. Working paper, Singapore Management University, Singapore. Appendix. Proofs of Statements Proof of Proposition 1 By induction, we assume that the optimality equation (4) holds for t + 1, · · · , T . Let πi∗ denote the profit function in period i under the optimal policy, i = t + 1, · · · , T . Taking expectation on WT +1 in period t yields Et [WT +1 ] = α−(T +1−t) Wt + vt (s) = Et [α−(T +1−t) (Wt + πt (yt , St , St+1 ) + αHt (St+1 )) + vt+1 (St+1 )] = Et [Et+1 [α−(T −t) Wt+1 + vt+1 (St+1 )]] = Et [Et+1 [WT +1 ]], where vt+1 (St+1 ) = Et [ ∑ T i=t+1 ] α−(T +1−i) [πi∗ (yi∗ (Si ), Si , Si+1 )] , t = 0, 1, · · · , T. 32 It is not hard to see that vt (s) satisfies the recursive equation (5). Therefore, given the initial wealth level in period t, the expected terminal wealth level depends only on current spot price s (as the inventory level information has been included in Wt ). By the law-of-total-variance (e.g., Weiss 2005), the variance of the terminal wealth is given by [ ] T +1 ∑ α−(T +1−i) [πi∗ + αHi (Si+1 )] V art Wt + α−(T +1−t) [πt + αHt (St+1 )] + [ i=t+1 [ = Et V art+1 α −(T +1−t) [πt + αHt (St+1 )] + T ∑ α −(T +1−i) ]] [π + αHi (Si+1 )] ∗ i i=t+1 [ [ ]] T ∑ + V art Et+1 α−(T +1−t) [πt + αHt (St+1 )] + α−(T +1−i) [πi∗ + αHi (Si+1 )] [ = Et V art+1 [ ∑ T i=t+1 ]] α −(T +1−i) ∗ i [π + αHi (Si+1 )] i=t+1 [ [ ∑ ]] T +V art α−(T +1−t) [πt + αHt (St+1 )] + Et+1 α−(T +1−i) πi∗ [ = Et V art+1 [ [ ∑ T i=t+1 i=t+1 ]] α −(T +1−i) ] ∗ i [π + αHi (Si+1 )] [ = Et V art+1 [WT +1 ] + V art α −(T +1−t) ]] [ −(T +1−t) + V art α [πt + αHt (St+1 )] + vt+1 (St+1 ) ]] [πt + αHt (St+1 )] + vt+1 (St+1 ) . Notice that J˜t+1 (Wt+1 , St+1 ) = Et+1 [WT +1 ] − λV art+1 [WT +1 ]. Then the equation (4) holds. Proof of Proposition 2 Observe that the variance of the terminal wealth conditional on period t does not depend on the initial wealth Wt in period t and the expected terminal wealth is summation of α−(T +1−t) Wt and vt (s). Replacing J˜t (Wt , s) by Jt (s) + α−(T +1−t) Wt in (4) yields { Jt (s) + α−(T +1−t) Wt = max α−(T +1−t) Wt + α−(T +1−t) Et [πt ] + Et [Jt+1 (St+1 )] yt ≥0,Ht ∈Ht } [ −(T +1−t) ] −λV art α [πt + αHt (St+1 )] + vt+1 (St+1 ) . Subtracting α−(T +1−t) Wt from both sides of above equation leads to { Jt (s) = max Et [Jt+1 (St+1 )] yt ≥0,Ht ∈Ht ]} [ [ ] , +α−(T +1−t) Et [πt ] − α−(T +1−t) λV art πt + αHt (St+1 ) + αT +1−t vt+1 (St+1 ) { = max Et [Jt+1 (St+1 )] yt ≥0 [ ]} { [ ]} −(T +1−t) −(T +1−t) T +1−t +α Et [πt ] − α λ min V art πt + αHt (St+1 ) + α . (28) vt+1 (St+1 ) Ht ∈Ht Note that dt and St+1 are independent for any given St = s. Applying the law of total variance yields V art [πt + αHt (St+1 ) + αT +1−t vt+1 (St+1 )] = Et [V art [πt + αHt (St+1 ) + αT +1−t vt+1 (St+1 )|St+1 ]] + V art [Et [πt + αHt (St+1 ) + αT +1−t vt+1 (St+1 )|St+1 ]] 33 = Et [V art [πt |St+1 ]] + V art [(rt − s)y − (rt + ht − αSt+1 )Et [(y − dt )+ ] + αHt (St+1 ) + αT +1−t vt+1 (St+1 )] = Et [(rt + ht − αSt+1 )2 V art [(y − dt )+ ]] + V art [αSt+1 Et [(y − dt )+ ] + αHt (St+1 ) + αT +1−t vt+1 (St+1 )] = Et [(rt + ht − αSt+1 )2 ]V art [(y − dt )+ ] + V art [αSt+1 Et [(y − dt )+ ] + αHt (St+1 ) + αT +1−t vt+1 (St+1 )]. (29) Here the first equality applies the law of total variance directly, the second is due to the fact that the payoff of the hedge, αHt (St+1 ), becomes a constant when St+1 is given (conditioned on), and the third equality is by the fact that V art [πt |St+1 ] = V art [(rt − s)y − (rt + ht − St+1 )(y − dt )+ |St+1 ] = (rt + ht − St+1 )2 V art [(y − dt )+ ]. Replacing the variance term in the RHS of (28) by the RHS of (29) yields (7). Proof of Corollary 2 Note that ∂ 2 gt (y, s) = −2α−(T +1−t) E[(y − dt )+ ]At (s)Φ̄t (y|s) ≤ 0, ∂y∂λ which implies that gt (y, s) is submodular in (y, λ). By Theorem 2.8.1 of Topkis (1998), yt∗ (s; λ) is decreasing in λ. Proof of Corollary 3 First, we know that E[πt (y, s, St+1 )] is concave in y, which implies that it is increasing on the left hand side of the risk-neutral inventory level yt∗ (s; 0). For any λ > 0, it follows from Corollary 2 that yt∗ (s; λ) is decreasing in λ, which implies that yt∗ (s; λ) ≤ yt∗ (s; 0). Then, combining the concavity of E[πt (y, s, St+1 )] and monotonicity of yt∗ (s; λ), we know that E[πt∗ ] is decreasing in λ, which in turn implies that Et [WT∗+1 ] is also decreasing in λ. It is obvious that Ut is decreasing in λ for any given policy. Then, Ut∗ must be decreasing in λ. We argue that V art [WT∗+1 ] must be decreasing in λ. Suppose, otherwise, there exist λ1 > λ2 such that the corresponding variances, denoted by Vt1 and Vt2 respectively, satisfy Vt1 > Vt2 . Let u1 and u2 be the corresponding optimal 1 2 1 2 policies and WTu+1 and WTu+1 the terminal wealth levels under policies u1 and u2 . Note that WTu+1 ≤ WTu+1 . Then, we have 1 1 2 2 Utu1 = Et [WTu+1 ] − λ1 V art [WTu+1 ] < Et [WTu+1 ] − λ1 V art [WTu+1 ] = Utu2 , that is policy u2 is strictly better than the optimal policy u1 with respect to λ1 , which leads to a contradiction. The desired result holds. A Technical Lemma The Fundamental Theorem of Calculus states that: A function f : [a, b] → R is absolutely continuous if and only if it is differentiable almost everywhere, its derivative f ′ ∈ L1 [a, b] and for any t ∈ [a, b], ∫ t f ′ (s)ds. f (t) = f (a) + a 34 Lemma 2. Suppose that function f : R+ → R is first-order differentiable and f ′ is continuous in (0, ∞), ′′ ′ and differentiable in (0, r), |f (r−)| < ∞ and f ≡ 0 in (r, ∞). Then, f ′ is differentiable almost everywhere, ′′ f ∈ L1 [0, ∞) and, for any S > s, ∫ S f ′ (S) = f ′ (s) + ′′ f (u)du. s That is, f ′ is absolutely continuous. ′′ ′ Proof. Note that f (s) = 0 for s > r and f is differentiable in (0, r). Then, f ′ is differentiable almost ′′ ′′ ′′ everywhere. Moreover, |f (r−)| < ∞ and f (r+) = 0, which implies that f ∈ L1 [0, ∞). For any S > s ≥ 0, if S ≤ r, by the Fundamental Theorem of Calculus, we have ∫ S ′′ ′ ′ f (S) = f (s) + f (u)du. s If S > r > s, ∫ r f ′ (S) = f ′ (r) = f ′ (s) + ∫ ′′ r f (u)du = f ′ (s) + s ∫ ′′ S s If S > s > r, ∫ ′′ r ∫ S f ′ (S) = f ′ (s) = f ′ (s) + S f (u)du = f ′ (s) + f (u)du + ′′ f (u)du. s ′′ f (u)du. s The desired result holds. Q.E.D. Proof of Theorem 2 The proof is by induction. Note that vT +1 (s) = 0. For any t, suppose that vt+1 (·) is twice differentiable almost everywhere. Then, in a similar way to Carr and Madan (2001), applying Lemma 2 and the Fundamental Theorem of Calculus, vt+1 (St+1 ) can be decomposed as ′ ′ vt+1 (St+1 ) = [vt+1 (s) − vt+1 (s)s] + vt+1 (s)St+1 ∫ s ∫ ′′ + vt+1 (K)(K − St+1 )+ dK + 0 ∞ ′′ vt+1 (K)(St+1 − K)+ dK. s That is, the vt+1 (St+1 ) can be perfectly replicated by bonds, forwards and options. For any y, with complete term and strike structure, the optimal hedging decision is to offset St+1 Et [(y − dt )+ ] and αT −t vt+1 (St+1 ) so that ψt (y, s) = 0. That is, [ ′ Ht (St+1 ) = − (Et [(y − dt )+ ] + αT −t vt+1 (s))[St+1 − ft (s)] ∫ s ∫ ′′ + αT −t vt+1 (K)[(K − St+1 )+ − Pt (K)]dK + 0 ∞ α T −t ′′ t+1 v ] (K)[(St+1 − K) − Ct (K)]dK . + s ′ That is, given any inventory level y, the optimal hedging strategy is to short Et [(y − dt )+ ] + αT −t vt+1 (s) ′′ ′′ units of forwards, αT −t vt+1 (K)dK puts for all strikes K < s and αT −t vt+1 (K)dK calls for all strikes K > s maturing at next period. Then, gt (y, s) = Et [πt ] − λα−(T +1−t) At (s)V ar[(y − dt )+ ]. Taking derivative with respect to y yields ∂gt (y, s) = (rt − s) − (rt + ht − αft (s))Φt (y|s) − 2λα−(T +1−t) E[(y − dt )+ ]At (s)Φ̄t (y|s), ∂y [ ] = rt + ht − αft (s) − 2α−(T +1−t) λE[(y − dt )+ ]At (s) Φ̄t (y|s) − (s + ht − αft (s)). (30) 35 Note that both factors of of the first term in the RHS of the second equality are decreasing in y. By Assumption 1, s + ht − αft (s) ≥ 0. If the first term is negative at some point y 0 , then gt (y, s) is decreasing in y ≥ y 0 . This implies that the optimal point yt∗ ≤ y 0 . When the first term is nonnegative, it is clear that ∂gt (y,s) ∂y is decreasing in y, i.e., gt (y, s) is concave in y. More precisely, it is strictly concave in y. Then, gt (y, s) is unimodal in y for any s. Note that limy→0+ ∂gt (y,s) ∂y = rt − s. Then, if rt > s, yt∗ (s) is the unique maximizer of gt (·, s) which satisfies (17). If rt ≤ s, the unimodularity of gt implies that gt is decreasing in y, which implies that the least maximizer yt∗ (s) = 0. Notice that all the terms of the RHS of (30) are differentiable in s, which implies that ∂gt (y,s) ∂y is also differentiable in s. Taking derivatives on both sides of (30) with respect to s and y, respectively, and letting y = yt∗ (s), we have ∂ 2 gt (yt∗ (s), s) ′ = −1 + αft (s)Φt (yt∗ (s), s) − (rt + ht − αft (s))ϕt (yt∗ (s), s) ∂y∂s −2λα−(T +1−t) E[(yt∗ (s) − dt )+ ][A′t (s)Φ̄t (y|s) − At (s)ϕt (yt∗ (s), s)], 2 ∗ [ ] ∂ gt (yt (s), s) = − (rt + ht − αft (s)) − 2λα−(T +1−t) E[(yt∗ (s) − dt )+ ]At (s) ϕt (yt∗ (s), s) ∂y 2 −2λα−(T +1−t) At (s)Φt (yt∗ (s))Φ̄t (yt∗ (s), s) s + ht − αft (s) ϕt (yt∗ (s), s) − 2λα−(T +1−t) At (s)Φt (yt∗ (s))Φ̄t (yt∗ (s), s), =− Φ̄t (yt∗ (s), s) (31) (32) where the second equality is by the first-order condition. When s < rt , we know that ∂ 2 gt (yt∗ (s),s) ∂y 2 < 0. This implies that yt∗ (s) is differentiable in s and satisfies ∂ 2 gt (y ∗ (s),s) t dyt∗ (s) ∂y∂s = − ∂ 2 gt (y∗ (s),s) . t ds 2 ∂y It is clear that ∂ Φt (yt∗ (s),s) ∂y is differentiable in s since Φt is twice differentiable and yt∗ (s) is differentiable. Then, all the terms of the RHS of (31) are differentiable. Similarly, all the terms of the RHS of (32) are differentiable. Then dyt∗ (s) ds is differentiable in s < rt , i.e., yt∗ (s) is twice differentiable in s < rt . When s > rt , we have shown that yt∗ (s) = 0, which implies that the first- and second-order derivatives of yt∗ (s) are both zero as s > rt . However, as rt = s = αft (s) − ht , ∂ 2 gt (yt∗ (s),s) ∂y 2 = 0 which implies that yt∗ (s) may not be differentiable at the point s = rt . Thus, yt∗ (s) is twice differentiable in s almost everywhere. Note that ∫ Et [πt ] = (rt − s)y − (rt + ht − αft (s)) y Φt (ξ, s)dξ 0 is twice differentiable in s by Assumption 3 since ft (s) = E[St+1 |St = s] and Φt (ξ, s) are twice differentiable in s. Let ψt (s) = Et [πt (yt∗ (s), s, St+1 )]. We next show that ψt (s) is twice differentiable in s almost everywhere. First, for s < rt ∂ψt (s) = −yt∗ (s) + αft′ (s) ∂s ∫ yt∗ (s) ∫ yt∗ (s) Φt (ξ, s)dξ − (rt + ht − αft (s)) 0 +[(rt − s) − (rt + ht − αft (s))Φt (yt∗ (s), s)] ϕt (ξ, s)dξ 0 ∂yt∗ (s) . ∂s Since yt∗ (s) is twice differentiable in s < rt , then Et [πt (yt∗ (s), s, St+1 )] is twice differentiable in s < rt . 36 When s > rt , we know that yt∗ (s) = 0, which infers that ψt (s) = Et [πt (yt∗ (s), s, St+1 )] = 0, and therefore, ψt (s) is twice differentiable in s for s > rt . It is obvious that lims→rt + ′ t ∂ψt (s) ∂s = 0. Then, ψt (s) is differentiable ′ t in s for all s > 0 and ψ (s) is continuous in s with ψ (rt ) = 0. Thus, ψt (s) = Et [πt (yt∗ (s), s, St+1 )] is twice differentiable in s almost everywhere. By induction assumption that vt+1 (s) is twice differentiable in s almost everywhere and Assumption (3), we know that Et [vt+1 (St+1 )] is twice differentiable in s almost everywhere as well. By recursive equation (5), we know that vt (s) is twice differentiable in s almost everywhere as well. This completes the induction. Proof of Theorem 1 For any t, let Ht (St+1 ) = −Et [(y − dt )+ ](St+1 − ft (s)) + H̃t (St+1 ). Then, we have ψ(y, s) = max {V art [αT −t vt+1 (St+1 ) + H̃t (St+1 )]}, H̃t ∈Ht which is independent of y. Then, we have ] ∂gt (y, s) [ = rt + ht − αft (s) − 2α−(T +1−t) λE[(y − dt )+ ]At (s) Φ̄t (y|s) − (s + ht − αft (s)). ∂y In a similar way of the Proof of Theorem 2, we can show Parts (a) and (b). If only forward contracts maturing at next period are available, letting H̃t = q[St+1 − ft (s)] and solving min V art [q[St+1 − ft (s)] + αT −t vt+1 (St+1 )] q yields the optimal solution qt∗ (s) = − Covt [St+1 , αT −t vt+1 (St+1 )] . V art [St+1 ] Proof of Proposition 3 Note that vT +1 (s) = 0. Taking first order derivative for ĝT (y, s) yields [ ] ∂ĝT (y, s) = rt + hT − αfT (s) − 2α−1 λE[(y − dT )+ ]AT (s) Φ̄T (y, s) − (s + hT − αfT (s)) ∂y −2λαV arT [ST +1 ]ΦT (y, s)ET [y − dT ]+ . Observe that the first and the third terms of the RHS of above equation are decreasing in y. Similar to the proof of Theorem 2, we can show that ĝT (y, s) is unimodual in y. When rt ≤ s, ĝT (y, s) is decreasing in y and thus ȳT∗ (s) = 0. Otherwise, ĝT (y, s) is unimodual in y and the optimal solution satisfies (21). By Theorem 2 and Theorem 1, the hedged optimal inventory decision yT∗ (s) satisfies [ ] rt + hT − αfT (s) − 2α−1 λE[(y − dT )+ ]AT (s) F̄T (y, s) − (s + hT − αfT (s)) = 0. This implies that ∂ĝT (yT∗ , s) = −2λαV arT [St+1 ]FT (yT∗ (s), s)ET [yT∗ (s) − dT ]+ < 0. ∂y The unimodularity of ĝT (y, s) in y implies that ŷT∗ (s) < yT∗ (s) as rt > s. 37 Proof of Proposition 4 Differentiating ĝt with respect to y yields [ ] ∂ gt (y, s) = rt + ht − αft (s) − 2α−(T +1−t) λE[(y − dt )+ ]At (s) Φ̄t (y|s) − [s + ht − αft (s)] ∂y [ ] −(T −t−1) + T −t Covt (St+1 , vt+1 (St+1 )) −2α Φt (y|s)V art (St+1 ) E[(y − dt ) ] + α . V art (St+1 ) (33) Not that the sum of the first two terms of RHS of (33) is strictly positive if 0 < y < yt∗ while the third term is decreasing in y. If E[(yt∗ (s) − dt )+ ] < −αT −t implies that ∂ ∂y implies that , then the third term is positive, which ĝt (y, s) > 0 as y ≤ y and therefore the optimal solution ŷt∗ (s) > yt∗ (s). If, on the contrary, ∗ t E[(yt∗ (s) − dt )+ ] ≥ −αT −t ∂ ∂y Covt (St+1 ,vt+1 (St+1 )) V art (St+1 ) Covt (St+1 ,vt+1 (St+1 )) V art (St+1 ) , then the third term is negative and decreasing in y, which ĝt (y, s) ≤ 0 as y ≥ yt∗ and therefore the optimal solution ŷt∗ (s) ≤ yt∗ (s). Proof of Proposition 5 Let z = y − dt . Then, gt (dt + z, s) = (rt − s)(dt + z) − (rt + ht − αft )z + − λα−(T −1−t) V art [St+1 z + + αT −t vt+1 (St+1 )]. When z > 0, we have ∂ gt (z, s) = −(s + ht − αft ) − 2λα−(T −1−t) V art (St+1 )[z + αT −t Covt (St+1 , vt+1 (St+1 ))/V art (St+1 )], ∂z which is decreasing in z. This implies that gt (dt + z, s) is concave in z as z > 0. It is clear that for any s, ∂ ∂z gt (z, s) → −∞ as z → ∞. When z = 0, ∂ gt (dt , s) = −(s + ht − αft ) − 2λαCovt (St+1 , vt+1 (St+1 )). ∂z −αft If Covt (St+1 , vt+1 (St+1 )) < − s+h2tλα , then ∂ ∂z gt (dt , s) and the optimal inventory decision must be greater than dt ; otherwise, the optimal inventory decision must be less than dt . Proof of Theorem 3 First, the inequalities of part (a) follow directly from Lemma 1 since the inequalities can be preserved under the minimization operations. Part (a) holds. Note that the optimal lagrange multipliers must be nonnegative. We argue that they must be positive as δ < δ0 . If, otherwise, λ∗i = 0, the resulted variance must be equal to δ0 , which violates the constraint. By KKT theorem, a positive lagrange multiplier implies that the constraint is binding. Then, by part (a), we know that the inequalities of part (b) hold. Note that the objective function of the lagrange problem is supermodular in (λ, δ). Then, under the minimization operation with respect to λ, the supermodularity implies that λ∗i must be decreasing in δ. Since E[WTi +1 ] is decreasing in λ, we know that E[WT∗i+1 ] is increasing in δ. It is clear that as δ → δ0 , E[WT∗i+1 ] converges to risk-neutral expected terminal wealth. Part (c) holds. Recall that for any λ, E0 WTe +1 = E0 WTf +1 but V ar0 [WTe +1 ] ≤ V ar0 [WTf +1 ] (by Lemma 1). Then, when λ = λ∗f , E0 WTe +1 = E0 WT∗f+1 but V ar0 [WTe +1 ] ≤ V ar0 [WT∗f+1 ] = δ. By Corollary 3, we know that to bind the variance constraint, one must lower the value λ, i.e., λ∗e ≤ λ∗f . Applying Corollary 2, we know that yt∗e ≤ yt∗f . Part (d) holds.
© Copyright 2026 Paperzz