Distributed Optimization Yen-Ling Kuo Der-Yeuan Yu May 27, 2010 Outline [Yu] • Optimized Sensing: From Water to the Web • Distributed Dynamic Programming • Distributed Solutions to Markov Decision Problems Optimized Sensing • Problem Statement • Greedy Algorithms and Submodularity • Robust Sensing Optimization with Saturate Algorithm • Application in Blogs Problem Statement • How do we detect contamination in drinking water distribution networks? • Which blogs should we read to learn about the biggest, newest stories on the Web? • Fundamental Question: How can we get the most useful information at minimum cost (limited resources)? Solutions to Optimized Sensing • Covers fields of statistics, machine learning, sensor networks, and robotics • With partially observable Marko decision processes, we can get optimal solutions • But it is difficult to scale POMDP to large problems • Introducing a new algorithm based on submodularity Formulation • Sensing quality function F(A) – A: the set of sensor locations Si (i=1~k) – V: the set of all locations • We can also have cost constraints – Total cost of sensor deployment no greater than the budget • Goal: Find A* – This is NP-hard already Greedy Algorithm • Iteratively find Si • This naïve algorithm actually performs pretty well – Why? Submodularity – We get near-optimal solutions • Submodularity: diminishing returns Diminishing Returns Cost-Effective Lazy Forward-Selection (CELP) • Greedy algorithm • Lazy evaluations – Delaying computation until the result is required – A computational technique Robust Sensing Optimization • Idea: Protect system against adversaries that know of our deployment of sensors • Goal: Maximize the worst-case detection performance • Approach • Unfortunately, this naïve extension can fail Failure of Greedy Algorithm on WorstCase Scenarios • I1, I2: two contamination events • S1, S2, S3: three possible sensor locations – S1: detect I1 immediately, but never I2 – S2: detect I2 immediately, but never I1 – S3: detect both I1 and I2, but only after a long time • We can only place two sensors • Greedy would pick S3 first and then either S1 or S2 • But we know the optimal solution should be S1 and S2 • Solution? Saturate algorithm Saturate Algorithm • Idea: reduce the non-submodular worst-case objective to a submodular optimization problem – Transform non-submodular to submodular • Transformation – Guess optimal solution value C using binary search – Try to find A such that F(A) is no less than C Performance of Saturate From Water to the Web Blog Reading • Problem: Information cascading Improvements • Number-of-posts (NP) model – Reading a big blog can be time-consuming, so they define the cost to be the number of posts • CELP tends to choose blogs with many posts • NP model tends to choose summarizer blogs – But stories appear in summarizer blogs a little late Other Thoughts • What if we are looking for stories to read instead of blogs to read? – We can reverse our information management goal – Find posts instead of blogs – Ref. 10 • End of Paper Distributed Dynamic Programming for Path Planning • Asynchronous Dynamic Programming • Learning Real-Time A* Asynchronous Dynamic Programming • Propagate costs from target to start locations Learning Real-Time A* (LRTA*) LRTA*(n) • LRTA with n agents • Faster – Agents break ties differently – They can share the same h-value table LRTA*(2) Distributed Solutions to Markov Decision Problems • As previously mentioned in the Water to Web paper, MDPs can be difficult to scale to big problems • Solution: Exploit independence properties • We address the modularity of actions Action Selection in multiagent MDPs Implementation Subtask Distribution Problem • A global problem is broken down into subtasks • Subtasks are distributed among agents • Each agent has different capabilities 25 Contract Net • Stages – – – – Recognition Announce Bidding Awarding & Expediting • Initial assignment: Not optimal • Anytime property – Improve assignment in negotiation process 26 Assignment problem • Problem definition – A set N of n agents – A set X of n objects – A set M ⊆ N × X of possible assignment pairs, and – A function v : M → R X N M • Find optimal assignment 27 Corresponding Linear Program • Linear program (LP) formulation Profit maximization Resource constraint Optimal solution • Any LP can be solved in polynomial time O(n3) 28 Competitive Equilibrium • Consider a price vector p = (p1, …, pn) – The utility from an assignment j to agent i is u(i, j) = v(I, j) - pj • A feasible assignment S and a price vector p are in competitive equilibrium when for every pairing (i, j) ∈ S it is the case that ∀ k, u(i, j) ≥ u(i, k) Every agent will not change its selection S is a optimal solution 29 Naïve Auction Algorithm • Round-robin style • Bid increment is the difference between the utility to i of the best and second-best object The agent will not overbid 30 Problem in Naïve Auction • When more than one object offers maximal utility for an agent – Bid increment is zero 31 Terminating Auction Algorithm • Modify the bid increment – ε-competitive equilibrium: u(i, j) + ε ≥ u(i, k) Agents may overbid some objects 32 Scheduling Problem • Problem definition – – – – N is a set of n agents X is a set of m discrete and consecutive time slots q = (q1, . . . , qm) is a reserve price vector v = (v1, . . . , vn), where vi is the valuation function of agent I F • Find optimal allocation 33 Corresponding Integer Program • Integer program (IP) formulation • IPs are not solvable polynomial time 34 Competitive Equilibrium – General Form • Definition – For all i ∈ N it is the case that Fi = argmaxT ⊆ X (vi(T) − ∑j|xj∈T pj) – For all j such that xj ∈ F∅ it is the case that pj = qj – For all j such that xj ∈ F∅ it is the case that pj ≥ qj • May not exist competitive equilibrium Has a competitive equilibrium solution ↕ The LP relaxation of the associated integer program has a integer solution. 35 Ascending Auction Algorithm • Center advertise an ask price • Bid increment is constant 36 Problem in Ascending Auction • If the increment is too large • May not converge to optimal solution 37 Social Laws and Conventions • Social law – A restriction on the given strategies of the agents – Induce a sub-game • Social convention – The sub-game consists of a single strategy for all agent • Other topics – Social goal negotiation – Social norm negotiation – …. 38
© Copyright 2026 Paperzz