Smooth Optimal Decision Strategies for Static Team Optimization Problems and Their Approximations Giorgio Gnecco1,2 and Marcello Sanguineti2 1 Department of Computer and Information Science (DISI), University of Genova Via Dodecaneso, 35, 16146 Genova, Italy 2 Department of Communications, Computer, and System Sciences (DIST) University of Genova Via Opera Pia 13, 16145 Genova, Italy [email protected], [email protected] Abstract. Sufficient conditions for the existence and uniqueness of smooth optimal decision strategies for static team optimization problems with statistical information structure are derived. Approximation methods and algorithms to derive suboptimal solutions based on the obtained results are investigated. The application to network team optimization problems is discussed. Keywords: Team utility function, value of a team, statistical information structure, approximation schemes, suboptimal solutions, network optimization. 1 Introduction Decision makers (DMs) cooperating to achieve a common goal, expressed via a team utility function, model a variety of problems in engineering, economic systems, management science and operations research, in which centralization is not feasible and so distributed optimization processes have to be performed. Each DM has at disposal various possibilities of decisions generated via strategies, on the basis of the available information that it has about a random variable, called state of the world. In the model that we adopt, the information is expressed via a probability density function, so we have a statistical information structure [13, Chapter 3]. In general, one centralized DM that, relying on the whole available information, maximizes the common goal, provides a better performance than a set of decentralized DMs, each of them having partial information. However, centralization is not always feasible. For example, each DM may have access only to local information that cannot be instantaneously exchanged. Alternatively, the cost of making the whole information available to one single DM may be too high with respect to having several DMs with different information. This is often the case, e.g., in communication and computer networks extending in large J. van Leeuwen et al. (Eds.): SOFSEM 2010, LNCS 5901, pp. 440–451, 2010. c Springer-Verlag Berlin Heidelberg 2010 Smooth Optimal Decision Strategies 441 geographical areas, production plants, energy distribution systems, and traffic systems in large metropolitan areas divided into sectors. In the team optimization problems that we address in this paper, the information of each DM depends on the state of the world but is independent of the decisions of the other DMs. These are called static teams, in contrast to dynamic teams, for which each DM’s information can be affected by the decisions of the other members. However, many dynamic team optimization problems can be reformulated in terms of equivalent static ones [28]. Static teams were first investigated by Marschak and Radner [22,23,24], who derived closed-form solutions for some cases of interest. Then, dynamic teams were studied [5]. Unfortunately, closed-form solutions to team optimization problems can be derived only under quite strong assumptions on the team utility function and the way in which each DM’s information is influenced by the state of the world (and, in the case of dynamic teams, by the decisions previously taken by the other DMs). In particular, most available results hold under the so-called LQG hypotheses (i.e., linear information structure, concave quadratic team utility, and Gaussian random variables) and with partially nested information, i.e, when each DM can reconstruct all the information available to the DMs that affect its own information [7,12]. However, as remarked in [9], these assumptions are often too simplified or unrealistic. For more general problems, closed-form solutions are usually not available, so one has to search for suboptimal solutions. In this paper, we derive sufficient conditions for the existence and uniqueness of smooth optimal decision strategies, for static team optimization problems with statistical information structure. Then, we show that a sufficiently high degree of smoothness of the optimal decision strategies is a useful property when searching for suboptimal solutions. The paper is organized as follows. Section 2 introduces definitions and assumptions and formulates the family of static team optimization problems under consideration. Section 3 investigates existence and uniqueness of smooth optimal strategies for such problems. Section 4 examines some consequences of the obtained results in developing approximation methods and algorithms to derive suboptimal solutions. Section 5 discusses the application of our results to static network team optimization problems. 2 Problem Formulation The context in which we shall formalize the optimization problem and derive our results is the following. – Static team of n decision makers (DMs), i = 1, . . . , n. – x ∈ X ⊆ Rd0 : vector-valued random variable, called state of the world, describing a stochastic environment. The vector x models the uncertainties in the external world, which are not controlled by the DMs. – yi ∈ Yi ⊆ Rdi : vector-valued random variable, which represents the information that the DM i has about x. 442 G. Gnecco and M. Sanguineti – si : Yi → Ai ⊆ R: measurable strategy of the i-th DM. – ai = si (yi ): decision that the DM i chooses on the basis nof the information yi . n n – u : X × Πi=1 Yi × Πi=1 Ai ⊆ RN → R, where N = i=0 di + n: real-valued team utility function. – The information that the n DMs have on the state of the world x is modelled by an n-tuple of random variables y1 , . . . , yn , i.e., by a statistical information structure [6] represented by a joint probability density ρ(x, y1 , . . . , yn ) on the n set X × Πi=1 Yi . We shall address the following family of static team optimization problems. Problem STO (Static Team Optimization with Statistical Information). Given the statistical information structure ρ(x, y1 , . . . , yn ) and the team utility function u(x, y1 , . . . , yn , a1 , . . . , an ), find sup v(s1 , . . . , sn ) , s1 ,...sn where n v(s1 , . . . , sn ) = Ex,y1 ,...,yn {u(x, {yi }n i=1 , {si (yi )}i=1 )} . The quantity sups1 ,...sn v(s1 , . . . , sn ) is called the value of the team. Throughout the paper, we make the following three assumptions. For Ω ⊆ Rd , by C(Ω) we denote the space of continuous functions on Ω; for a positive integer m > 0, by C m (Ω) we denote the spaces of functions on Ω, which are continuous together with their partial derivatives up to the order m. A1. The sets X, Y1 , . . . , Yn are compact, and A1 , . . . , An are bounded closed intervals. For an integer m ≥ 2, the team utility u is of class C m on an open set n n Yi × Πi=1 Ai , and ρ a (strictly) positive probability density containing X × Πi=1 n on X × Πi=1 Yi , which can be extended to a function of class C m on an open set n containing X × Πi=1 Yi . A concave function f defined on a convex set Ω has concavity at least τ > 0 if for all u, v ∈ Ω and every supergradient1 pu of f at u one has f (v) − f (u) ≤ pu · (v − u) − τ v − u2 . If f is of class C 2 (Ω), then a necessary condition for its concavity at least τ is supu∈Ω λmax (∇2 f (u)) ≤ −τ , where λmax (∇2 f (u)) is the maximum eigenvalue of the Hessian ∇2 f (u). n Yi × A2. There exists τ > 0 such that the team utility function u : X × Πi=1 n Πi=1 Ai is separately concave in each of the decision variables, with concavity at least τ (i.e., if all the arguments of u are fixed except the decision variable ai , then the resulting function of ai has concavity at least τ ). Assumption A2 is motivated by tractability reasons and encountered in practice. For example, in economic problems it can be motivated by the “law of diminishing returns”, i.e., the fact that the marginal productivity of an input usually diminishes as the amount of output increases [23, p. 99 and p. 110]. 1 For Ω ⊆ Rd convex and f : Ω → R concave, pu ∈ Rd is a supergradient of f at u ∈ Ω if for every v ∈ Ω it satisfies f (v) − f (u) ≤ pu · (v − u) . Smooth Optimal Decision Strategies 443 A3. For every n-tuple {s1 , . . . , sn } of strategies, the strategies defined as n ŝ1 (y1 ) = argmax Ex,y2 ,...,yn |y1 {u(x, {yi }n i=1 , a1 , {si (yi )}i=2 )} ∀y1 ∈ Y1 , a1 ∈A1 ... n−1 ŝn (yn ) = argmax Ex,y1 ,...,yn−1 |yn {u(x, {yi }n i=1 , {si (yi )}i=1 , an )} ∀yn ∈ Yn an ∈An do not lie on the boundaries of A1 , . . . , An , respectively. The interiority condition in Assumption A3 can be imposed a-priori, by strongly penalizing the team utility function on the boundary. Simple examples of problems for which Assumptions A1, A2 and A3 hold simultaneously can be constructed by starting from a problem in which there is no interaction among n the DMs (i.e., u(x, y1 , . . . , yn , s1 (y1 ), . . . , sn (yn )) = i=1 ui (x, yi , . . . , yn , s1 (yi )), so that the assumptions are easy to impose), then adding to the team utility function a sufficiently small smooth interaction term. 3 Existence and Uniqueness of Smooth Optimal Strategies The next theorem (which takes the hint from [13, Theorem 11, p. 162], and extends it to a higher degree of smoothness) gives conditions guaranteeing that Problem STO has a solution made of an n-tuple of strategies that are Lipschitz continuous together with their partial derivatives up to a certain order. For limitations of space, we only sketch the proof for n = 2; details can be found in [8, Chapter 5]. Theorem 1. Let Assumptions A1, A2 and A3 hold. Then Problem STO admits an n-tuple (so1 , . . . , son ) of C m−2 optimal strategies with partial derivatives that are Lipschitz up to the order m − 2. Sketch of proof. Let n = 2. Consider a sequence {sj1 , sj2 } of pairs of strategies, indexed by j ∈ N+ , such that limj→∞ v(sj1 , sj2 ) = sups1 ,s2 v(s1 , s2 ) (such a sequence exists by the definition of supremum). From the sequence {sj1 , sj2 }, we generate another sequence {ŝj1 , ŝj2 } defined by ŝj1 (y1 ) = argmax M1j (y1 , a1 ) ∀y1 ∈ Y1 , a1 ∈A1 ŝj2 (y2 ) = argmax M2j (y2 , a2 ) ∀y2 ∈ Y2 , a2 ∈A2 where for every (y1 , a1 ) ∈ Y1 × A1 and (y2 , a2 ) ∈ Y2 × A2 , we let M1j (y1 , a1 ) = Ex,y2 |y1 {u(x, y1 , y2 , a1 , sj2 (y2 ))} , M2j (y2 , a2 ) = Ex,y1 |y2 {u(x, y1 , y2 , ŝj1 (y1 ), a2 )} . Since the probability density ρ(x, y1 , y2 ) is of class C m and strictly positive on an open set containing X ×Y1 ×Y2 , we obtain that the conditional density ρ(x, y2 |y1 ) 444 G. Gnecco and M. Sanguineti is of class C m on the compact set X × Y1 × Y2 and the team utility function u is of class C m on the compact set X × Y1 × Y2 × A1 × A2 . So M1j , as an integral dependent on parameters, is of class C m on the compact set Y1 × A1 , with upper bounds on the sizes of its partial derivatives up to the order m independent of y1 , a1 , and j. In particular, it is easy to show that M1j has concavity at least τ in a1 . By such continuity and concavity properties of M1j with respect to a1 , for all y1 ∈ Y1 the set argmaxa1 ∈A1 M1j (y1 , a1 ) consists of exactly one element. An analogous conclusion holds for argmaxa2 ∈A2 M2j (y2 , a2 ). So ŝj1 and ŝj2 are well-defined. Let y 1 , y 1 ∈ Y1 . By the definition of ŝj1 , exploiting the concavity τ of M1j with respect to a1 and taking the supergradient 0 of M1j with respect to the second variable at (y 1 , ŝj1 (y 1 )) and (y 1 , ŝj1 (y 1 )), respectively, we get and M1j (y 1 , ŝj1 (y 1 )) − M1j (y 1 , ŝj1 (y 1 )) ≤ −τ |ŝj1 (y 1 ) − ŝj1 (y 1 )|2 (1) M1j (y 1 , ŝj1 (y 1 )) − M1j (y 1 , ŝj1 (y 1 )) ≤ −τ |ŝj1 (y 1 ) − ŝj1 (y 1 )|2 . (2) By (1) and (2) we obtain |M1j (y 1 , ŝj1 (y 1 )) − M1j (y 1 , ŝj1 (y 1 ))| + |M1j (y 1 , ŝj1 (y 1 )) − M1j (y 1 , ŝj1 (y 1 ))| ≥ 2τ |ŝj1 (y 1 ) − ŝj1 (y 1 )|2 . (3) Let L1 > 0 (which can be chosen independently of j) be an upper bound on the Lipschitz constant of M1j . Then by (3) we obtain 2L1 y 1 − y 1 ≥ 2τ |ŝj1 (y 1 ) − ŝj1 (y 1 )|2 , i.e., L1 j j |ŝ1 (y 1 ) − ŝ1 (y 1 )| ≤ y 1 − y 1 , (4) τ which proves the Hölder continuity of ŝj1 , hence its continuity and measurability. Continuity and measurability of ŝj2 can be proved in the same way. Then it makes sense to evaluate v(ŝj1 , ŝj2 ), and by construction we have v(ŝj1 , ŝj2 ) ≥ v(sj1 , sj2 )). Let us focus on the strategies of the first DM. The next step consists in showing that there exists a subsequence of {ŝj1 } that converges uniformly to a strategy so1 ∈ C m−2 (Y1 ) with Lipschitz (m − 2)-order partial derivatives. By ∂M j Assumption A3, for every y1 ∈ Y1 ŝj1 (y1 ) is interior, so ∂a11 = 0. j a1 =ŝ1 (y1 ) ∂ŝj 1 Then, by the Implicit Function Theorem, for every k = 1, . . . , d1 we get ∂y1,k = −1 ∂ 2 M1j ∂ 2 M1j ∂ 2 M1j − , where ≤ −τ < 0 by the concavity at least τ j2 j2 ∂ŝj ∂y ∂ŝ1 1 1,k ∂ŝ1 of M1j in a1 and its smoothness. As M1j is of class C m , by taking higher-order partial derivatives we conclude that ŝj1 (y1 ) is locally of class C m−1 . As this holds for every y1 ∈ Y1 , it is of class C m−1 on all Y1 . Since M1j has upper bounds on the sizes of its partial Smooth Optimal Decision Strategies 445 derivatives up to the order m that are independent of y1 , a1 , and j, then for every (i1 , . . . , id1 ) such that i1 + . . . + id1 = m − 1, there exists a finite upper bound m−1 j ∂ ŝ1 on i1 , which is independent of y1 and j. So, one can easily show id ∂y1,1 ,...,∂y1,d11 that for every 1 ) such that i1 + . . . + id1 = m − 2, the functions of the (i1 , . . . , id ∂ m−2 ŝj1 id i1 ∂y1,1 ,...,∂y1,d1 1 sequence are equibounded and have the same upper bound on their Lipschitz constants, so they are uniformly equicontinuous on Y1 . Hence, by Ascoli-Arzelà’s Theorem [1, Theorem 1.30, p. 10], such sequence admits a subsequence that converges uniformly to a function defined on Y1 , which is also Lipschitz, with the same upper bound on its Lipschitz constant as above. By integrating m − 2 times, we conclude that also the integrals of these subsequences converge uniformly to the integrals of the limit functions. Therefore, there exists a subsequence of {ŝj1 } that converges uniformly to a strategy so1 ∈ C m−2 (Y1 ) with Lipschitz (m − 2)-order partial derivatives. Similarly, one proves that there exists a subsequence of {ŝj2 } that converges uniformly to so2 ∈ C m−2 (Y1 ) with Lipschitz (m − 2)-order partial derivatives. By the continuity of the functional v(s1 , s2 ) on C(Y1 ) × C(Y2 ) with the respective sup-norms, finally we obtain v(so1 , so2 ) = limj→∞ v(ŝj1 , ŝj2 ) = sups1 ,s2 v(s1 , s2 ) . The next theorem show that, under additional conditions, the optimal n-tuple of smooth strategies is unique. We denote by C(Yi , Ai ) the set of continuous functions from Yi to Ai with the sup-norm. Without loss of generality, we restrict the spaces of admissible strategies to C(Yi , Ai ), as one can show that under the assumptions of Theorem 1 any optimal strategy coincides almost everywhere with a continuous function. To simplify the statement, in Theorem 2 we consider the case of n = 2 DMs, but it can be extended to n ≥ 2 DMs. Theorem 2. Let the assumptions of Theorem 1 hold with m ≥ 3 and n = 2, ∂2 β1,2 and let also τ < 1, where β1,2 = max(a1 ,a2 )∈A1 ×A2 ∂a1 ∂a2 u(x, y1 , y2 , a1 , a2 ). Then (so1 , so2 ) given in Theorem 1 is the unique optimal pair of strategies in C(Y1 , A1 ) × C(Y2 , A2 ). Sketch of proof. Inspection of the first part of the proof of Theorem 1 shows that there exists a (possibly nonlinear) operator T : C(Y1 , A1 ) × C(Y2 , A2 ) → C(Y1 , A1 ) × C(Y2 , A2 ) such that T1 (s1 , s2 )(y1 ) = argmax Ex,y2 |y1 {u(x, y1 , y2 , a1 , s2 (y2 ))} ∀y1 ∈ Y1 , a1 ∈A1 T2 (s1 , s2 )(y2 ) = argmax Ex,y1 |y2 {u(x, y1 , y2 , T1 (s1 , s2 )(y1 ), a2 )} ∀y2 ∈ Y2 . a2 ∈A2 Let (so1 , so2 ) ∈ C(Y1 , A1 ) × C(Y2 , A2 ) be an optimal pair of strategies. Then it is easy to see that (so1 , so2 ) = T (so1 , so2 ) is a necessary conditions for its 446 G. Gnecco and M. Sanguineti optimality. By Assumption A3 and the compactness of Y1 and Y2 , for any (s1 , s2 ) ∈ C(Y1 , A1 ) × C(Y2 , A2 ) the strategies T1 (s1 , s2 ) and T2 (s1 , s2 ) belong respectively to the interiors of C(Y1 , A1 ) and C(Y2 , A2 ). So, Problem STO is reduced to an unconstrained infinite-dimensional game theory problem, for which one can apply the techniques developed in [17] to study the stability of Nash equilibria. This can be done since every pair of optimal strategies for Problem STO constitutes a Nash equilibrium for a two-player game, for which the individual utilities J 1 and J 2 are the same and are equal to v(s1 , s2 ). By using the norm Eyi {(si (yi ))2 } on C(Y1 , A1 ) and C(Y2 , A2 ) (instead of the usual sup-norms), computing the Frechét derivatives of the integral functional v up to the second order and applying [17, Theorem 1, formula (1)], one can show that, for m ≥ 3, T is a contraction operator with contraction constant bounded from above by 2 β1,2 τ2 < 1. So, T has at most a unique fixed point (so1 , so2 ) ∈ C(Y1 , A1 )×C(Y2 , A2 ), which by Theorem 1 coincides with (so1 , so2 ). 4 Approximation Methods and Algorithms In this section, we discuss how the existence and uniqueness of an optimal ntuple of strategies with a sufficiently high degree of smoothness can be exploited when searching for suboptimal solutions to Problem STO. 4.1 Estimates of the Accuracy of Suboptimal Solutions by Nonlinear Approximation Schemes In [10, Propositions 4.2 and 4.3] we have shown that, for a degree of smoothness m in Assumption A1 that is linear in maxi {di } (i.e., the maximum dimension of the information vectors yi ), the smooth optimal strategies so1 , . . . , son (whose existence is guaranteed by Theorem 1), can be approximated by suitable nonlinear approximation schemes modelling one-hidden-layer neural networks [11] with Gaussian and trigonometric computational units, with upper bounds on the approximation errors of order k −1/2 , where k is the number of computational units used in such schemes. This is an instance of the socalled blessing of smoothness [21]. We are currently investigating the extension of such results to other nonlinear approximation schemes with sigmoidal and spline computational units. The numerical results in [2,3,4] show that often these approximation schemes (which belong to the wider family of variable-basis approximation schemes [15,16]) are able to find accurate suboptimal solutions to team optimization problems with high-dimensional states, using a small number of parameters to be optimized. Variable-basis approximation schemes have been successfully exploited also in other optimization tasks (see the references in [30,31]). 4.2 Application of Quasi-Monte Carlo Methods Another consequence of a sufficiently high degree of smoothness of the optimal strategies is that it allows the application of quasi-Monte Carlo methods [20] Smooth Optimal Decision Strategies 447 and related ones (such as Korobov’s method; see [29] and [14, Chapter 6]) for the approximate computation of the multidimensional integrals v(so1 , . . . , son ) and v(s̃1 , . . . , s̃n ), where s̃1 , . . . , s̃n are smooth approximations of so1 , . . . , son . For example, upper bounds on the error in the approximate evaluation of a multidimensional integral by quasi-Monte Carlo methods can be obtained via KoksmaHlawka’s inequality [20, p. 20], which requires that the integrand has a finite variation in the sense of Hardy and Krause [20, p. 19]. Considering, e.g., the case of an integrand f defined on a r-dimensional unit-cube [0, 1]r , the most common formula [20, p. 19, formula (2.5)] used to prove that f has a finite variation in the sense of Hardy and Krause requires that f ∈ C r ([0, 1]r ) (i.e., its degree of smoothness has to be at least equal to the number of variables). With the obvious changes n in notation, Theorem 1 provides such a degree of smoothness, for m ≥ i=0 di + 2. 4.3 Algorithms for Suboptimal Solutions Finally, we investigate some implications of our results in the development of algorithms to find suboptimal solutions to Problem STO. For simplicity of exposition, we consider the case of n = 2 agents. Recall that under the assumptions of Theorem 2, the operator T defined in the proof of such theorem is a contraction operator. Then, given any initial pair of smooth suboptimal strategies (s̃01 , s̃02 ) and the unique (and a-priori unknown) optimal one (so1 , so2 ), for every positive integer M one has o M (y )|, max |s (y ) − s̃ (y )| max max |so1 (y1 ) − s̃M 1 2 1 2 2 2 ≤ 2 β1,2 τ2 y1 ∈Y1 M y2 ∈Y2 max max y1 ∈Y1 |so1 (y1 ) − s̃01 (y1 )|, max y2 ∈Y2 |so2 (y2 ) − s̃02 (y2 )| , (5) where M M 0 0 (s̃M (s̃1 , s̃2 ) 1 , s̃2 ) = T (6) 2 β1,2 τ2 and < 1. So, for the algorithm (6), the upper bound (5) shows that the rate of convergence to the optimal pair of strategies is exponential in M . In practice, however, the operator T itself has to be replaced by a finitedimensional approximating operator. Consider, e.g., an approximation scheme in which one searches for suboptimal strategies of the form s̃1 = h j=1 cj,1 φj,1 and s̃2 = h cj,2 φj,2 , j=1 where the positive integer h and the basis functions {φj,1 }hj=1 and {φj,2 }hj=1 are fixed, and {cj,1 }hj=1 , {cj,2 }hj=1 are real coefficients to be optimized. Let ũ(x, y1 , y2 , {cj,1 }, {cj,2 }) = u(x, y1 , y2 , s̃1 (y1 ), s̃2 (y2 )) . 448 G. Gnecco and M. Sanguineti Then, one can replace (6) by 0 M M {cj,1 }, {c0j,2 } , ({cM j,1 }, {cj,2 }) = T̃h (7) where one chooses the approximating operator T̃h such that T̃h,1 ({cj,1 }, {cj,2 }) = argmax Ex,y1 ,y2 {ũ(x, y1 , y2 , {ĉj,1 }, {cj,2 })} , (8) {ĉj,1 } T̃h,2 ({cj,1 }, {cj,2 }) = argmax Ex,y1 ,y2 {ũ(x, y1 , y2 , Th,1 ({cj,1 }, {cj,2 }), {ĉj,2 })} (9) {ĉj,2 } (for simplicity, we are assuming that there exist unique maxima). Exploiting Assumption A2, one can show that finding the argmax in (8) and (9) for fixed M {cM j,1 }, {cj,2 } requires one to solve two stochastic finite-dimensional concave optimization problems, to which the information-based-complexity results [27] and the efficient algorithms described in [19, Chapter 14] may be applied. Subjects of future research include studying the properties of the abovedefined operator T̃h and of other approximating operators. In particular, it is of interest finding conditions under which – the operator T̃h is a contraction operator (like T ); – the minimum positive integer h and the minimum number of elementary operations of the algorithms described in [19, Chapter 14], required to find a suboptimal solution to Problem STO with an error at most ε > 0, grow “slowly” with respect to 1/ε. 5 Network Team Optimization For static network team optimization problems [10], our smoothness results take on a simplified form. For these problems, the team utility function u can be written as the sum of a finite number of individual utility functions ui , each one associated with a single DM (e.g., a router) or with a shared resource in the network (e.g., a communication link). In addition, each ui depends only on a subset of the DMs. This situation can be described by a multigraph, where the DMs are the nodes and there is an edge between two DMs if and only if both appear in a same individual utility function. Figure 1 gives an idea of a network team optimization problem modeling a store-and-forward packet-switching telecommunication network (see [2,4]). Suppose that the DMs are n routers acting as members of a same team (i.e., they aim to maximize a common objective, decomposable into the sum of several individual objectives related, e.g., to the congestion of the links). Each router has at its disposal some private information (e.g., the total lengths of its incoming packet queues). Assume also that the traffic flows can be described by continuous variables. Then, on the basis of its private information, each router decides how to split the incoming traffic flows into its output links. The network team optimization problem consists in finding optimal (or nearly optimal) n-tuples of Smooth Optimal Decision Strategies 449 DM4 DM1 DM3 DM2 DM5 Fig. 1. An example of store-and-forward packet-switching telecommunication network (left). An example of graph model with buffers at the nodes (right). strategies according to some given optimality criterion (for simplicity we ignore any dynamics in the problem, and we model it as a static one). Compared with a general instance of Problem STO, the particular structure of a static network team optimization allows various simplifications: – For any n-tuple of strategies, the integral v(s1 , . . . , sn ) can be decomposed into the sum n of a finite number of integrals, each usually dependent on smoothness less than i=0 di real variables. So, the minimum degree m of n required to apply [20, p. 19, formula (2.5)] is usually less than i=0 di + 2 (compare with the general case in Section 4). – Since the strategy of each DM is influenced only by those of its neighbors in the network, Assumption A3 may be easier to impose. – One can show that an extension of Theorem 2 to n > 2 DMs can be formulated in terms of interaction terms βi,j , where (i, j) are pairs of different DMs in the team. For a static network team optimization problem, usually most of the βi,j are equal to 0 (since the interaction of each DM is limited to its neighbors in the graph), so such extension takes a simplified form. As to specific applications to static network team optimization problems, our smoothness results may be applied, e.g., to stochastic versions of the congestion, routing, and bandwidth allocation problems considered in [18, Lectures 3 and 4], which are stated in terms of smooth and concave individual utility functions. Acknowledgement. The authors were partially supported by a grant “Progetti di Ricerca di Ateneo 2008” of the University of Genova, project “Solution of Functional Optimization Problems by Nonlinear Approximators and Learning from Data”. 450 G. Gnecco and M. Sanguineti References 1. Adams, R.A.: Sobolev Spaces. Academic Press, New York (1975) 2. Baglietto, M., Parisini, T., Zoppoli, R.: Distributed-Information Neural Control: The Case of Dynamic Routing in Traffic Networks. IEEE Trans. on Neural Networks 12, 485–502 (2001) 3. Baglietto, M., Parisini, T., Zoppoli, R.: Numerical Solutions to the Witsenhausen Counterexample by Approximating Networks. IEEE Trans. on Automatic Control 46, 1471–1477 (2001) 4. Baglietto, M., Sanguineti, M., Zoppoli, R.: The Extended Ritz Method for Functional Optimization: Overview and Applications to Single-Person and Team Optimal Decision Problems. Optimization Methods and Software 24, 15–43 (2009) 5. Basar, T., Bansal, R.: The Theory of Teams: A Selective Annotated Bibliography. Lecture Notes in Control and Information Sciences, vol. 119. Springer, Heidelberg (1989) 6. Blackwell, D.: Equivalent Comparison of Experiments. Annals of Mathematical Statistics 24, 265–272 (1953) 7. Chu, K.C.: Team Decision Theory and Information Structures in Optimal Control Problems – Part II. IEEE Trans. on Automatic Control 17, 22–28 (1972) 8. Gnecco, G.: Functional Optimization by Variable-Basis Approximation Schemes. PhD Thesis in Mathematics and Applications, Dept. of Mathematics, University of Genova (2009) 9. Hess, J., Ider, Z., Kagiwada, H., Kalaba, R.: Team Decision Theory and Integral Equations. J. of Optimization Theory and Applications 22, 251–264 (1977) 10. Gnecco, G., Sanguineti, M.: Suboptimal Solutions to Network Team Optimization Problems. In: CD-Proc. of the Int. Network Optimization Conf. 2009 (INOC 2009), Pisa, Italy (April 2009) 11. Haykin, S.: Neural Networks. A Comprehensive Foundation. MacMillan, New York (1994) 12. Ho, Y.C., Chu, K.C.: Team Decision Theory and Information Structures in Optimal Control Problems - Part I. IEEE Trans. on Automatic Control 17, 15–22 (1972) 13. Kim, K.H., Roush, F.W.: Team Theory. Ellis Horwood, Chichester (1987) 14. Krommer, A.R., Ueberhuber, C.W.: Computational Integration. SIAM, Philadelphia (1998) 15. Kůrková, V., Sanguineti, M.: Error Estimates for Approximate Optimization by the Extended Ritz Method. SIAM J. of Optimization 18, 461–487 (2005) 16. Kůrková, V., Sanguineti, M.: Geometric upper Bounds on Rates of Variable-Basis Approximation. IEEE Trans. on Information Theory 54, 5681–5688 (2008) 17. Li, S., Basar, T.: Distributed Algorithms for the Computation of Noncooperative Equilibria. Automatica 23, 523–533 (1987) 18. Mansour, Y.: Computational Game Theory. Tel Aviv University, Israel (2006), http://www.cs.tau.ac.il/~mansour/course_games/2006/course_games_05_ 06.htm 19. Nemirovski, A.: Efficient Methods in Convex Programming. Technion, Haifa, Israel (1994), http://www2.isye.gatech.edu/~nemirovs/Lect_EMCO.pdf 20. Niederreiter, H.: Random Number Generation and Quasi-Monte Carlo Methods. SIAM, Philadelphia (1992) 21. Poggio, T., Smale, S.: The Mathematics of Learning: Dealing with Data. Notices of the American Mathematical Society 50, 537–544 (2003) Smooth Optimal Decision Strategies 451 22. Marschak, J.: Elements for a Theory of Teams. Management Science 1, 127–137 (1955) 23. Marschak, J., Radner, R.: Economic Theory of Teams. Yale University Press, New Haven (1972) 24. Radner, R.: Team Decision Problems. Annals of Mathematical Statistics 33, 857– 881 (1962) 25. Rantzer, A.: Using Game Theory for Distributed Control Engineering. In: 3rd World Congress of the Game Theory Society Games (2008); (Tech. Rep. ISRN LUTFD2/TFRT- -7620–SE, Dept. of Aut. Contr., Lund Univ., Sweden) (July 2008) 26. Segall, A.: The Modeling of Adaptive Routing in Data–Communication Networks. IEEE Trans. on Communications 25, 85–95 (1977) 27. Traub, J.F., Wasilkowski, G.W., Woźniakowski, H.: Information Based Complexity. Academic Press, New York (1988) 28. Witsenhausen, H.S.: Equivalent Stochastic Control Problems. Mathematics of Control, Signals, and Systems 1, 3–11 (1988) 29. Zinterhof, T.: High Dimensional Integration: New Weapons Fighting the Curse of Dimensionality. Physics of Particles and Nuclei Letters 5, 145–149 (2008) 30. Zoppoli, R., Parisini, T., Baglietto, M., Sanguineti, M.: Neural Approximations for Optimal Control and Decision. Springer, London (in preparation) 31. Zoppoli, R., Sanguineti, M., Parisini, T.: Approximating Networks and Extended Ritz Method for the Solution of Functional Optimization Problems. J. of Optimization Theory and Applications 112, 403–439 (2002)
© Copyright 2026 Paperzz