The Expected Value of Perfect Information in the Optimal Evolution of Stochastic Systems Dempster, M.A.H. IIASA Working Paper WP-81-055 April 1981 Dempster, M.A.H. (1981) The Expected Value of Perfect Information in the Optimal Evolution of Stochastic Systems. IIASA Working Paper. IIASA, Laxenburg, Austria, WP-81-055 Copyright © 1981 by the author(s). http://pure.iiasa.ac.at/1706/ Working Papers on work of the International Institute for Applied Systems Analysis receive only limited review. Views or opinions expressed herein do not necessarily represent those of the Institute, its National Member Organizations, or other organizations supporting the work. All rights reserved. Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage. All copies must bear this notice and the full citation on the first page. For other purposes, to republish, to post on servers or to redistribute to lists, permission must be sought by contacting [email protected] NOT FOR QUOTATION WITHOUT PERMISSION OF THE AUTHOR THE EXPECTED VALUE OF PERFECT INFORMATION I N THE OPTIMAL EVOLUTION OF STOCHASTIC SYSTEMS M.A.H. Dempster A p r i l , 1981 WP-81-55 W o r k i n g P a p e r s a r e i n t e r i m r e p o r t s o n work o f t h e I n t e r n a t i o n a l I n s t i t u t e f o r Applied Systems Analysis a n d h a v e r e c e i v e d o n l y l i m i t e d review. Views or o p i n i o n s e x p r e s s e d h e r e i n do n o t n e c e s s a r i l y r e p r e s e n t t h o s e o f t h e I n s t i t u t e o r o f i t s N a t i o n a l Member Organizations. INTERNATIONAL INSTITUTE FOR APPLIED SYSTEMS ANALYSIS A-2361 L a x e n b u r g , A u s t r i a To appear in: S t o c h a s t i c D i f f e r e n t i a l S y s t e m s ; P r o c e e d i n g s o f t h e IFIP Working C o n f e r e n c e , V i s i g r a d , Bungary, S e p t e m b e r 1 9 8 0 . M . Arato & A.V. Balakrishnan, eds. Springer-Verlag Lecture Notes in Control and Information Science. ABSTRACT This paper uses abstract optimization theory to characterize and analyze the stochastic process describing the current m a r g i n a l e x p e c t e d v a l u e o f p e r f e c t i n f o r m a t i o n in a class of discrete time dynamic stochastic optimization problems which include the familiar optimal control problem with an infinite planning horizon. Using abstract Lagrange multiplier techniques on the usual nonanticipativity constraints treated e x p l i c i t l y in terms of adaptation of the decision sequence, it is shown that the marginal expected value of perfect information is a nonanticipative supermartingale. For a given problem, the statistics of this process are of fundamental practical importance in deciding the necessity for continuing to take account of the stochastic variation in the evolution of the sequence of optimal decisions. CONTENTS 1. Introduction 2. The dynamic recourse problem as an abstract optimization problem 3. Characterization of optimal solutions 4. The marginal expected value of perfect information supermartingale 5. Possible extensions 6. Acknowledgements References This paper uses abstract optimization theory to characterize and analyze the stochastic process describing the current marginal expected value o f perfect information in a class of discrete time dynamic stochastic optimization problems.which include the familiar optimal control problem with an infinite planning horizon. Using abstract Lagrange multiplier techniques on the usual nonanticipativity constraints treated zxplicitly in terms of adaptation of the decision sequence, it is shown that the marginal expected value of perfect information in a nonanticipative supermartingale. For a given problem, the statistics of this process are of fundamental practical importance in deciding the necessity for continuing to take account of the stochastic variation in the evolution of the sequence of optimal decisions. ' Let x = Cxtjt=l be a sequence of d e c i s i o n s in Rn and let be a discrete time stochastic process in ( Z , C , p ) of sub_F = i;t)t=l sequent obscr2ations. A ?olicy ( d e c < s i o n r u l e or r e c o u r s e f u n c t i o n ) is a meesurable nap x : S - c x ( S ) Consider the problem - . (where nt 2 n T = n , mt ;mT = m) and the n o n a n t i c i p a 5 i v e condition that the current decision xt depends only on the sequence of observations 51,52,..., St-l , and realised decisions x1,x2,..., x ~ -(and ~ thus 5 ) to date. Here ft: = x Xtll nnt+IEl is assumed measurable in its first argumsnt and Bore1 mehsurable in its second and (A full set of technical f (XI: = ft ( ,x ( ) ) , and similarly for qt (5) -t assumptions will be introduced in $$2 and 3.) The problem (RPI--termed the d y n a m i c r e c o u r s e p r o b i e m - - h a s a number of important applications in the mathematical sciences ( c j . Dempster, 1980). Special cases include stochastic dynamic linear or quadratic programming formulations of e n e r g y - e c o n o n i c p l a n n t n g models, 3irge (1980), Louveaux and Smeers I1980,1981); t n v e n t o r g z g n t r o l models, see e.g. Veinott (1966); Mcrkov d e c i s i o n p r o c e s s e s with random transition matrices for m s n p o w e r p l a n n i n g , Grincld (1976,1980) and the classical d i s c r e e e t i m e o p t i m a l c o n t r o l model. To see the last assertion in more detail, make the following substitutions in (,W) : - - - . Then (RP) reduces to the familiar c o n t r o l p r o b l e m Control and state space constraints are easily added to (C) by suitable definition of Pt, t=l,...,T Characterization of the (optimal) solutions to the general problem (RP) for finite has been treated by Rockafellar and Wets(1976aIb, 1978) for the convex case under a Slater r e g u i a r i t y c o n d i t i o n ( c o n s t r a i n t a u ~ t i , + ' i ~ a t t o n ) using the duality theory of convex conjugate functions. :lore recently (19811, they have given a similar treatment of the convex Bolza problem--a special case of (C)--for finite r Hiriart-Urruty (1978,1981) has considered a more. general class of nonlinear special cases of (RP) for finite T He applied a theory characterizing in terms of generalized gradients the minimum of an integral fmctional involving a measurable locally Lipschitz integrand subject to a measurable closed valued multifunction constraint. The version of (Re) he treated has some nondifferentiable and 'some differentiable constraints and a correspondingly mixed Slater/llangasarian-Fromowitz constraint qualification is enforced. This paper discusses the use of recently developed abstract mathematical programing theory (Dempster,l976;Zowe and Kurcyusz,l979; Brokate,l980) to extend these results to the i n f t n i t e h o p i t o n ( r infinite) problem for the general nonlinear case under appropriate regularity conditions involving problem functions in LThe s a p p o r t r e p r e s e n $ a ? i o n problem (Dempster,l976) for (RP) is addressed to obtain the agropriate stochastic maximum principle and nonanticipative supermartingale representation of the L1 multiplier process corresponding to the nonanticipative constraints. The general results were announced in Dempster (1980); details and proofs will appear elsewhere. In this paper, emphasis is placed on precise problem formulation, results and interpretation ( $ 5 2 and 3 ) and attention is focused on the supermartingale multiplier process corresponding to the nonanticipative constraints ($4). Technical limitations to extending the analysis of this marginal expected value of perfect information process to continuous time systems --involving say diffusion or jump dynamics--are discussed briefly in $ 5 with a view to their possible relaxation jn future work using recent theories of ~athwiseintegration of stochastic differential equations (Sussman,l978;Marcus,l981). . . . . 2. THE DYNMIIC RECOURSE PROBLEM AS AN ABSTRACT OPTIMIZATIOX PROBLEM The purpose of this section is to set up the problem (RP) as an abstract mathematical programme of the form (PI supxeP f (2;) s.t. g (x) E: '2 where P and Q are sets in appropriate linear topological spaces U and V , f:U + R and g:U + V A natural assunption to make in the applications cited in $1 is that feasible policies should be e s s s n t t c l l y bounded--i.e. in Lm -on (S,Z,p) , c j . Rockafellar and Wets (1976,1978,1981), although a similar treatment of policies in L (1 ;p cm) is also possible, c f . P Eisner and Olsen (1975,1980), Hiriart-Urruty (1978,1981). Hence assume Z completed with respect to u and define . and equipping U with the usual equivalent of the product topology defined (we shall be party to the usual in t e m s of the norm (ul = suptlutlm abuse of terminology by referring to the equivalence class elements of Banach function spaces as functions and subject ?o the standard analyst/ probabilist's schizophrenia by denoting the elements of. for example, . - IJa, as both u and u depending on whether the analytic or probabilistic interpretation is to the fore.) The abstract objective function of (PI is obviously but a little more analysis will be necessary in order to define the abstract constraint function g and its range (image) space V The problem lies with the e z ~ l i c i tcharacterization of nonanticipation. Its solution was first proposed in the context of stochastic i i n e c r programming by Eisner and Olsen (1975,1'980) Let {Zt) be the usual increasing tower of a-fields, . . generated b y the process -5 . Cvzry feasible policy x- is assumed to n After canonically embedding Lm define the closed projections for t=l,...,r . in - Then x is adapted to n L_ -5 in the obvious wag. we may if, and only if, Here (when we make the usual assumption that x l is deterministic) , where K 1 denotes the linear sgan of the constants in ": ="1 n embedded in Lm . La, We may now define the constraint function of (PI as such that with V equipged with the normed-defined equivalent of the product topology (as with U ) . All these considerations apply equally well to In the sequel we shall consider only the case finite or infinite r of infinite r ; necessary changes for the simpler case of finite T are easily supplied by the reader. Further. define ttIxt to be the ,~c,...,x-t , of the observation, respechisrory, i.e. 51,;2,...,5t; tively policy, process to tine t We shall specialize in what follows to the case--relevant to all the practical examples of $1--of t r i ~ n g u t a ~ (RP), for which current constraints depend only on observation and decision histories to date, i.e. . . (Thus the measurability properties for the strictec? to It rather than : . ) gt cited in $1 may be re- This completes the basic set of assumptions needed to specify and analyze the dynamic recourse problem (RP) over an infinite horizon. Further(i1lustrative)technical assumptions will be introduced as required to complete the analysis in $3. The characterization of optimal solutions for the abstract mathematical programming problem (P) necessitates consideration (perhaps only implicitly) of its L a n g r a n ~ i c n f u n c t i o n for X E U and m u l t i p z i e r vectors ~ ' E V ,' the duaZ space of V (consisting of all linear functionals on V continuous in the given topology for V). * We shall thus here need the following characterization of L- , due essentially to Yosida and Hewitt (1952) (see also Dubovitskii and Ililyutin, 1965, Valadier, 1974, and Denpster, 1976). A finitely additive R "' on (1.1) is p u r e l y f i n i t e l y row n-vector valued measure T ' : L a Z d i t t 3 e ( p . j . a . ) if, and only if, for all countably additive real valued measures v on (:,XI and for all E>O there exists AEcE S U C ~that n' i. e. a p. f a. measure is carried by i v (AL)I < L and 1' ( hE) = O I L R sets assigned arbitrarily small measure by any countably additive measure. (Prime is used to denote a dual e1ement;in the finite dimensional case n* the this is consistent with vector transposition.) Denote by Ln (as defined in (2.1)1 , by ':L (Banach) dual spaca of Lthe space of (coordinatewise) absolutely integrable row n-vector valued functions on (E,1) and by pn' the space of purely finitely additive row n-vector valued measures on ( Z X > + . 8 . F'ropositiox: 3.1. Given the neasure space (f,Z,u) with respect to the u-finite measure p , then , if Here = denotes isometric isomorphism and the action of n-vector valued function xr~: is given by (3.2) y'x: = :-,y ' (;)x (2:) i - /-;?.'(dS)x(~) X is complete n* ylcLm on an The f i r s t i n t e g r a l i n ( 3 . 2 ) i s s i m p l y an a b s t r a c t Lebesque i n t e g r a l ; t h e second r e q u i r e s t h e a n a l o g o u s i n t e g r a t i o n t h e o r y d e v e l o p e d f o r f i n f t e Z y a d d i t i v e measures by Dunford and Schwartz ( 1 9 5 6 ) . In f a c t , Valadier ( 1 9 7 4 ) e x t e n d e d t h e r e s u l t o f P r o p o s i t i o n 3.1. t o t h e a - f i n i t e c a s e from t h e E i n i t e c a s e e s t a b l i s h e d by Yosida and H e w i t t (1952), whileDubovitskii and C l i l y u t i n ( 1 9 6 7 ) i n d e p e n d e n t l y gave a c o m p l e t e t r e a t m e n t o f * in Lm (y;- of ( 3 . 2 ) w i t h o u t r e f e r e n c e t o t h e i r i n t e g r a l r e p r e s e n t a t i o n s . A f i n e r c h a r a c t e r i z a t i o n o f L: i n t e r m s of n a t u r a l s u b s p a c e of p n r a p p e a r s i n Dempster (1976) terms of s i n g u L a r functior.aZs . tJe a r e now i n a p o s i t i o n t o ma!ce p r e c i s e s e n s e of t h e L a n s r a n g i a n . According t o ( 2 . 4 ) w e a r e i n t e r e s t e d i n r e p r e mt* n N n N s e n t a t i o n of t h e d u a l s p a c e , XtP1 L, x (L,) ) o f V (=xt_Y L? x (L,) where IN denota-s t h e n a t u r a l numbers. A s t r a i g h t f o r w a r d a p p i i c a t i o n of function (3.1) f o r (P) P r o p o s i t i o n 3.1 Q yields a s g i v e n by u s i n g t h e f a c t (Yosida and H e w i t t , -. 1952) t h a , t a l l p . f . a . m e a s u r e s on IN ( w i t h c o u n t i n g measure % t a k e n a s ground measure) a r e c a r r i e d by neighbourhoods of XI m I n (3.3) E P [ ( I x ~E X , Y ( N ) Iu X # ) ; =m+n ' 1 t h e s p a c e o f row (m+n)-vector v a l u e d measures on t h e p r o d u c t a - f i e l d shown, a n d i n t h e c o r r e s p o n d i n g integral gt h a s been c a n o r . i c a l l y embedded i n lRm Next we c h a r a c t e r i z e an optimum x of 0 c o n c e p t of d e r i v a t i v e s of t h e L a g r a n g i a n @ . (PI i n t e r m s of a s u i t a b l e g i v e n by ( 3 . 3 ) . Rather than u s e minimal c o n c e z t s and i n t r o d u c e h i g h l y t e c h n i c a l , c o n d i t i o n s on ( P ) , we s h a l l by way o f i l l u s t r a t i o n u s e ~ r g c h e td e r i v a t i v e s and g i v e regul a r i t y c o n d i t i c n s o:iliv f o r ; 2 P ) sufficient t o e n c c r e c h e t r u t h 02 t h e f o l l o w i n g : : . i ; : , 6 - J s c k z r, ' ; ~ ~ c i l ~ f omr (P), P.=. Zowe and :turcyusz (1979) . Suffice it to say here that versions of Proposition 3.2 below are available involving both generalized derivatives (cj. Niriart-Urruty, 1981) and (one-sided) Gateaux directionpl derivatives (Dempster, 1976) under m i n i m a l regularity conditions (cf. Dempster, 1976; Brokate, 1980) for (P) posed in locally convex Hausdorff topological vector spaces. We shall need the concept of the Jz.4~1 coze Q ' C V ' of a set Q C V as and similarly for sets in U. P r o p o s i t i o n : 3.2. Let U and V be Banach spaces and the problem functions f and g of (P) be Frdchet differentiable with derivatives V f and Vg respectively. Then under suitable regularity conditions on (P), xO an optimum for ( - 0 ) implies that there exists y i Q' ~ such that Vue P Conditions (3.5) are termed Kuhn-Tucker (necessary 1 c o n d i t i o n s for an optimum of (PI. 1f' 0 e P C U and n e a s condit'ions 0E QCV , the last two imply the c o m p Zementary s l o c k - Applying Proposition 3.2 under suitable regularity assumptions on (RP) yields a (necessary) characterization of its optimal policies in terms of the a b s t r a c t Lagrangian of (PI given by (3.3). However, inspection of (3.3) raises the question of conditions under which this characterization remains valid if the awkward terms involving integrals with respect to purely finitely additive measures are dropped. This is a special case of the ger~ersls ~ ; F J ~ * _ r; z ~ ~ l ~ ~ ~p r on j ltz ran t(Denpster,l975) ~ ~ n I which has appeared in the control (Dubovitskii and tlilyutin,1965), economics (e.g. Frescott and Lucss ,1372) 2nd o~;irr,iz~tic;n(Rockalcllzr ~ n d Wets,1976a,b,1378) literature. To solve the support representation problem for (RP) we must give conditions on the problem sufficient to make both the stochastic p.f.a. measures T t' and $I;, t=1,2,..., and the inter'enporal p.f.a. measure :X of (3.3) vanish. The followinc conditions are a distillation of the literature cited above. Some terms and definitions will be needed. A policy history xt is termed f e a s i b l e if it satisfies the constraint structure of (RP) to time t, i.e. if we have for its components Define "t xt: = lxt E I :xt = xt (6) , 6 E I , xt -. feasible 1, and Clearly Ct essentially lies in Xt, but the analytical (computational) intractability of (RP) arises from the fact that this inclusion is in general essentially strict. The c o n t r o l l a b i i i t y (or relatively cornpletz r e c o u r s e ) condition ensures almost sure decision recourse at all times from any realization of the observation (and decision) process to date and forces the optimal stochastic p.f.a. measure T t' of Proposition 3.2 to vanish. Rockafellar and Wets (1976a) obtain more technicai sufficient conditions. They show by example that, without the ezplicit introduction into the problem of the constraints which bind at an optimum induced on Ct by lhter stages, the support representation problem in terms of L1 multipliers for (RP) is insolubZe. Since a nonanticipative constraint (2.3) cannot lead to infeasibilities of subsequent nonanticipative constraints, we can conclude immediately that the stochastic p.f.a. measures $,; t=l,Z,. always vanish at the optimum. To ensure that the optimal intertemporal p.f.a. measure X: vanishes,'it suffices .to assume that 0 EPt, t=1,2, and the finite horizon a p p r o = i . ~ a t t o n conditioa: .. ... (3.8) for some r (211, x feasible implies (xt,O,O,. . . ) feasible for all t 1 r . Examples of nontrivial p.5.a. measnres (in the absence of this condition) a r e known ( s e e P r e s c o t t a n d L u c a s , 1 9 7 2 ) . N e x t we m u s t s t a t e s u i t a b l e r e g u l a r i t y c o n d i t i o n s o n (RP) s u f f i c i e n t t o ensure t h e c o n c l u s i o n s of P r o p o s i t i o n 3.2. i n terms o f L, m u l t i p l i e r s . a n d Q t c IRmt a r e Hence w e a s s u m e ( b y way o f i l l u s t r a t i o n ) t h a t P t C IF?t c l o s e d convex c o n e s , 0 c Q t and t h e p r o b l e m . f u n c t i o n s f t , g t are d i f f e r e n t i a b l e w i t h r e s p e c t t o t h e i r p o l i c y c o m p o n e n t s w i t h g r a d i e n t s V f t , Vgt, t=1,2, Given a set C ( i n a l i n e a r t o p o l o g i c a l s p a c e ) and a p o i n t X E C d e f i n e t h e inner approximation coze ... . w h e r e t h e i n t e r s e c t i o n is t a k e n o v e r n e i g h b o u r h o o d s x o f x . assume t h a t f o r a n o p t i m a l p o l i c y We shail xy ( l i n e a r ) p r o j e c t i o n s I -!I o f ( 2 . 3 ' ) a r e c l o s e d , ( 3 . 9 ) i s t s u f f i c i e n t t o e n s u r e t h a t t h e a b s t r a c t problem f u n c t i o n g o f (2.4) Since the satisfies i n terms o f i t s F r e c h e t d e r i v a t i v e a t a n optimum xo o f (PI f o r t h e c l o s e d c o n v e x c o n e Q C V , cf. D e m p s t e r ( 1 9 7 6 ) , Zowe a n d K u r c y u s z ( 1 9 7 9 ) F i n a l l y w e a r e i n a p o s i t i o n t o a p p l y P r o p o s i t i o n 3.2 t o obtain a Zocal v e r s i o n o f t h e Kuhn-Tucker n e c e s s a r y c o n d i t i o n s ( 3 . 5 ) f o r a n o p t i mal p o l i c y 2O o f ( R P ) , i n terms o f a s t o c h a s t i c mazimum p r i n c i p l e i n v o l v i n g t h e L m u l t i p l i e r p r o c e s s e s y' a n d p' , o f t h e f o r m 1 " - . I n t h e c a s e t h a t a l l problem f u n c t i o n s a r e concave i n t h e p o l i c y v a r i ables, conditions (3.11) a r e a l s o sufficient--in general, they a r e not. I n t h e n e x t s e c t i o n , w e t u r n t o an a n a l y s i s and i n t e r p r e t a t i o n of t h e (optimal) m u l t i p l i e r process p ' corresponding t o t h e nonanticipative c o n s t r a i n t s (2.3) of (RP) . 4. - THE MARGINAL EXPSCTED VALUE O F PERFECT 1YT'ORI.IATION SUPERMARTINGALE - I n t h i s s e c t i o n we s h a l l assume t h a t a f i x e d o p t i m a l p o l i c y x f o r 0 (RP) i s s p e c i f i e d . ( V a r i o u s f u r t h e r a s s u m p t i o n s may be adduced t o t h e problem t o g u a r a n t e e e x i s t e n c e and even u n i q u e n e s s o f t h e o p t i m a l p o l i c y , b u t t h e s e w i l l n o t c o n c e r n u s h e r e . ) W e s h a l l a p p l y modern p e r t u r b a t i o n t h e o r y f o r t h e a b s t r a c t programme (PI o f $ 2 , see e.g. Lempio and Maurer (1980), t o study t h e nonanticipative c o n s t r a i n t m u l t i p l i e r process p ' c o r r e s p o n d i n g t o t h e chosen o p t i m a l p o l i c y xo f o r (RP). Of i n t e r e s t a r e p e r t u r b a t i o n s t o t h e n o n a n t i c i p a t i v e c o n s t r a i n t s ( 2 . 3 ) o f t h e form - - where t h e zt a r e a r b i t r a r y n - v e c t o r v a l u e d f u n c t i o n s m e a s u r a b l e w i t h r e s p e c t t o Z s ( t ) ( s ( t > t ) and hence r e p r e s e n t i n g information on t h e fueuz-e of t h e o b s e r v a t i o n p r o c e s s 5 More s p e c i f i c a l l y , f i x t and an a r b i t r a r y n - v e c t o r v a l u e d f u n c t i o n z,L m e a s u r a b l e w i t h r e s p e c t t o f o r s o m e f i x e d s > t and c o n s i d e r p e r t u r b a t i o n s o f t h e tth nonIS a n t i c i p a t i v e c o n s t r a i n t of t h e form -. . f o r a E [ O , 6 1 ( 6 > 0) Denote t h e o p t i m i z a t i o n problem r e s u l t i n g from t h e p e r t u r b a t i o n ( 4 . 2 ) a s ( R P [ a z t l ) and i t s a b s t r a c t e q u i v a l e n t as D e f i n e t h e a b s t r a c t p e r t u r b a t i o n f u n c t i o n n : V + I R U I-} as (P[ a z t ] ) . Then, u n d e r c o n d i t i o n s on t h e problem d a t a o f (RP) s u c h t h a t t h e o r i g i n a l problem h a s an o p t i m a l s o l u t i o n , t h e p e r t u r b e d problems P [ a z t l w i l l have f e a s i b l e s o l u t i o n s . W e s h a l l assume t h a t w e nay f i n d a c u r v e x ( a ) Then, s u c h t h a t x (a) i s f e a s i b l e f o r P [ a z t ] and l i m o s o x ( a ) = x O E U s i n c e t h e c l o s e d p r o j e c t i o n ( I - !I t ) d e f i n s a s u b s p a c e of La , t h e n' Lagrange m u l t i p l i e r P; E: L, f o r t h e c o n s t r a i n t ( 2 . 3 ) i s an a n i h i l a t o r ( s u p p o r t i n g h y y e r ? l a ? e ) o f t h i s subs?aco. Under Dur a s s u ~ p t i o n s ( a p p l y i n 9 Theorem 4 . 3 , L e c p i o and biaurer,l9SC) we zay t h u s c o n c l u d e t h a t we . ' may choose where Vta denotes theFr6chet derivative of the perturbation function (4.3) of the abstract problem (PI evaluated at 0 under perturbations of the form (0.2) at time t That is, the current state E; of the nonanticipative constraint multiplier process in Ln1 ' represents the marginal expected value of perfect i n f o r m a t i o n (EVPI) at time t with respect to future states of the observation process E We first establish that this marginal EVPI process p ' --like the optimal policy process itself-is adapted to the observation process . _pi -. zO Lemma: 4.1. The process ..' in 1:' P - is nonantiripatiue, i.e. This fact follows from the observation that expression (4.4) for P: does not depend on any particular perturbation (4.2) representing some future knowledge of the observation process E m. Next we show that the process o ' has the s u p e r m a r t i n g a t e property. This reflects the fact that the earlier information on the future observation process F, is available, the more its marginal expected worth to optimal decision making. C -. - - Theorem: 4.2. (4.6) !?t'> The process E(.P;I Ltl By virtue of (4.5 - P' n' in L , a,s. is a s u p e r m a r t i n g a l e , i,e. for (12) t < s . we must show for fixed t and s> t that But a further consequence of (4.5) is that for all and hence (4.6) is equivalent to showing that s$t - But information on the future of 5 after time s-1, as represented by an n-vector valued perturbation function z measurable with respect to ZU for u ~ ,s cannot be -worth less in expectation the earlier it is known, i.e. Indeed, an optimal policy for the problem perturbed where zS.. - zt: = z at time t can iake this information into account earlier than a corresponding policy for the problem perturbed at s Hence, subtracting ~ ( 0 ) from each side of (4.7), dividing by a > 0 and passing to the limit as a + 0 , yields . . Since integration is nonnegativity preserving 5. POSSIBLE EXTENSIONS As noted in the introduction, th2 marginal expected value of perfect information (EVPI) process 9' is of considerable jotential importance for-stochastic systems of the dynamic recourse type arising in practice n' : = (see $1). If this nonanticipative supermartingale process in L1 Li [ (I,L ,u ) ; IRn' ] remains in a ball of (problem dependent) radius i > 0 for all t after some time s 2 l , then the stochastic elements of the problem are practically inessential from time s onward and a deterministic model--and simpler computational procedure--should suffice. Of course, this statenent raises the knotty problems of prior numerical computation of the marginal EVPI process, or--more realistically--of bounds on this process, etc. (in this context, see Birge,lSBO). Nevertheless, it'would be interesting to have theoretical results similar to those derived in $4 for familiar optimization of stochastic system problems in contiquous time involving dynamics driven by semimartingales (see e.g. Shiryaev,l980). The difficulty in attempting an analogue of the analysis presented in this paper for such systems is that the corresponding pertxrbed abstrzct problem (as utilized in :4) must make sense. Put 5ifferrntly, the ori~inelstochastic gptimization problem must remain well defined when nonanticipativity is r a e Jsing the Ito calculus a F r ~ % c ; :I3r.2 1-2-. z1csnt t~;t~r,s.'c.-.s - -. to semimzrtingslzs 3ener~:ir.; mixec C i 2 i ~ r i c z3 2 2 j..-- dynamics) this - -I., is not possible, since the rigorous analytic i n t e g r a l form of the dynar n i c s r e q u i r e s n o n a n t i c i p a t i v i t y of the integra~ldin the stochastic integrals involved. This t e c h z S c s Z requirement of the stochastic integration theories utilized has been relaxed for integration of G a u s s i o n processes but with respect to similar processes by Enchev and Stoyanov (19'80)~ this setting is of insufficient generality for many systems of interest. More promising is the application to the problem at hand of the recent p a t h w i s e theory of stochastic integration introduced for the study of stochastic differential equations whose integrals are driven by processes with continuous sample paths by Sussman (1978) and developed for semimartingales with jumps, for example, by Marcus (1981). In the case of successful application of the approach of this paper to optinization of stochastic systems in continuous t h e , with differential dynamics in lRn of the form ;= f - .(x) " , it may be conjectured that the f u l l expected value of perfect information process u" in L1 may be recovered from the m a r g i n a l EVPI process " p a in 1;' by (Lebesque) integration as - zS . for an appropriate definition of This is again a statement of'some potential practical importance for stochastic system modelling. 6. ACKNOWLEDGEMENTS I would like to express my gratitude to J-M. Bismut, who first pointed out to me the technical difficulties discussed in $4 regarding extension of the present analysis to stochastic systems in continuous t h e , and to M.H.A. Davis, whose conversation lead to the suggestions made therein for surmounting them. REFERENCES . 1. J. Birge (1981 ) Stochastic dynamic linear programs. Ph .D. Thesis Operations Research Department, Stanford University. 2. M. Brokate (1980). A regularity condition for optimization in Banach spaces: Counter examples. Appl.Math.Optim. 6, 189-192. 3. M.A.H. Dempster (1976). Lectures on Abstract Optimization and Its Applications. Department of Matheaatics, Melbourne University. 4 . M.A.H. Derr,pster (1980). Introduction to.stochastic programming. In: Dempster, ed. Academic, London. S t o c h a s t i c P r o g r a m r i n g . M.A.H. 3-59. 5. A . Y . Dubovitskii & A.A. Milyutin (1965). Extremum problems in the presence of restrictions. USSi? Comput. Math. & N c t h . P h y s . 5 . 3 , 1-80. 6. N. Dunford & J.J. Schwartz (1956). L i n e a r O p e r a t o r s . Vol.1. Interscience, Mew York. 7. M.J. Sisner & P. Olsen (1975). Duality for stochastic programming interpreted as L.P. in L -space. SIAM J.Appl.Math. 28, 779-792. P 8. M.J. Eisner 6 P. Olsen (1980). Duality in probabilistic programming. In: S t o c h a s t i c Programming, o p . c i t . , 147-158. 9. O.B. Enchev & J.M. Stoyanov (1980). Stochasti-c integrals for Gaussian rzndom functions. S t o c h a s t i , n s 3 , 277-289. 10. R.C. Grinold (1976). Operations Res. Manpower planning with uncertain demands. 387-399. 24, 11. R.C. Grinold (1980). A class of constrained linear control problems with stochastic coefficients. In: S t o c h a s t i c Programming, op.cit., 97-1 08. 12. J-B. Hiriart-Urruty (1978). Conditions necessaires d'optimalitg pour un programme stochastique avec recours. SIAM J . ControZ O p t i n . 1 6 . 2 , 317-329. 13. J-B. Hiriart-Urruty (1981). Extension of Lipschitz integrands and minimization of nonconvex integral functionals: Applications to the optimal recourse problem in discrete time. P r o b a b i l i t y c n d 5 t c t i s ; t c s I , to appear. 14. F. Lempio & A. Maurer (1980). Differential stability in infinitedimensional nonlinear programming. A?p I . Mcth. O p t i m . 6 , 139-152. 15. F. ~ouvea& & Y. Smeers (1981a). A stochastic node1 for electricity generation. In: ModeZZ<ng o f L a r g e - S c a l e Energy S y s t e m s . W. Haefele & L.K. Kirchmayer, eds. IIASA Proc.Series No.12. Pergamon, Oxford, 313-320. 16. F. Louveaw & Y. Smeers (1981b). Optimization of a stochastic model for slectricity generation. S t o c h a s t i c a . Special issue containing the Proceedings of the IIASA Task Force Meeting on Stochzstic Optimization, 9-13 December, 1980. To appear. 17. S.I. Marcus (1981). 1.1odelling and approxination of stochastic differential equations driven by semimartingales. S t o c h a s t i c s 4.3, 223-246. 18. E.C. Prescott & R.E. Lucas, Jr. (1972). A note on price systems in infinite dimensional space. 1 n t Z . E c o n . B e v . 1 3 , 416-422. 19. R.T. Rockafellar & R.J-B. Wets (1976a). Stochastic convex programming: Relatively complete recourse and induced feasibility. SIAM J.ControZ Op5im. 1 4 , 574-589. 20. R.T. Rockafellar & R . J-3. ICets ! 1976b) . ?:onanticipativity and L ' martingales in stochastic optimization problems. ~ , ' z t i . 2 r o g r c x ~ t x ; S t x d i e s 6 , 170-187. 21. R . T . Rockafellar & R.J-B. Wets (1978). The o p t i m a l r e c o u r s e problem i n d i s c r e t e time: L 1 - m u l t i p l i e r s f o r i n e q u a l i t y cons t r a i n t s . SIAM J . C o n r r o Z O p t i m . 1 6 . 1 , 16-36. 22. R . T . Rockafellar & R.J-B. Wets (1981). S t o c h a s t i c o p t i m i z a t i o n problems o f Bolza t y p e i n d i s c r e t e t i m e . S t o c h a s t i c s . IIASA S p e c i a l I s s u e , o p . c i 5 . To a p p e a r . 23. A.N. S h i r y a y e v (1980). M a r t i n g a l e s : Recent Developments, R e s u l t s , Applications. L e c t u r e n o t e s d i s t r i b u t e d a t t h e 1980 European Meeting of S t a t i s t i c i a n s , B r i g h t o n , England. 24. H . J . Sussmann (1978). On t h e gap between d e t e r m i n i s t i c and s t o c h a s t i c o r d i n a r y d i f f e r e n t i a l e q u a t i o n s . Ann.Prob. 6 . 1 , 19-41. , 25. M. V a l a d i e r (1974). A n a t u r a l supplement of L1 i n t h e d u a l of L Seminaire d l A n a l y s e Convexe, M o n t p e l i e r . 26. A.F. V e i n o t t (1966). The s t a t u s of m a t h e m a t i c a l i n v e n t o r y t h e o r y . Management S c i . 1 2 , 745-777. 27. X. Yosida & E . H e w i t t (1952). A . M . S . 7 2 , 46-66. F i n i t e l y a d d i t i v e measures. T r a n s . 28. J . Zowe & S. Kurcyusz (1979). R e g u l a r i t y and s t a b i l i t y f o r t h e m a t h e m a t i c a l programming problem i n Banach s p a c e s . AppZ.Math. O p t s m . 5, 49-62. .
© Copyright 2026 Paperzz