Task Planning and Multi-Agent Systems! •! •! •! •! •! Robert Stengel! Robotics and Intelligent Systems, MAE 345, Princeton University, 2015 Decision making Task decomposition, communities, and connectivity Cooperation, collaboration, competition, and conflict Path planning (see Lecture 5) Multi-agent architectures Copyright 2015 by Robert Stengel. All rights reserved. For educational use only. http://www.princeton.edu/~stengel/MAE345.html 1 Task Planning Goals •! Accomplish an objective •! Determine and follow a path •! Work toward a common goal –! –! –! –! –! Central Pacific and Union Pacific Railroads meet in Promontory, Utah, 1869 Make a decision Gather information Build something Analyze something Destroy something –! Minimize time or cost –! Take the shortest path –! Avoid obstacles or hazards –! Integrate behavior with higher objectives –! Do not impede other agents Driving The Golden Spike 2 More Task Planning Goals •! Provide leadership for other agents –! Issue commands –! Receive and decode information •! Provide assistance to other agents –! Coordinate actions –! Respond to requests •! Defeat opposing agents –! Compete and win •! Path planning –! See Lecture 5 3 Common Threads in Task Accomplishment •! •! •! •! •! •! •! •! Optimize a cost function Satisfy or avoid constraints Exhibit desirable behavior Tradeoff individual and team goals Use resources effectively and efficiently Negotiate Cooperate with team members Overcome adversity and ambiguity 4 Task Planning •! Situation awareness •! Decomposition and identification of communities •! Development of strategy and tactics Objective Tactical (short-term) Strategic (long-term) Phase Process Situation Assessment Comprehension Outcome Situation Awareness Understanding 5 Boyd s OODA Loop for Combat Operations •! Derived from air-combat maneuvering strategy •! General application to learning processes other than military 6 Elements of Situation Awareness Endsley, 1995 Important Dichotomies in Planning Strength, Weakness, Opportunity, and Threat (SWOT) Analysis Knok-Knoks and Unk-Unks 8 7 Strategy/Tactics Development and Deployment •! Development of long- and short-term actions/activities for implementation and operation •! Sequence of procedures to be executed –! fixed or adaptive •! Exposition of approach –! Rules of engagement –! Concept of Operations (CONOPS) •! Spectrum of flexibility –! Rigid sequence <---> Learning systems •! Think Expert System 9 Planning Tools! 10 Program Management: Gantt Chart •! •! •! •! Project schedule Task breakdown and dependency Start, interim, and finish elements Time elapsed, time to go 11 Program Evaluation and Review Technique (PERT) Chart •! •! •! •! •! •! Milestones Path descriptors Activities, precursors, and successors Timing and coordination Identification of critical path Optimization and constraint 12 Task Decomposition: Community Identification •! Connectivity of individuals •! Individuals assemble in communities or clusters •! Complex networks Fully connected Small world ring lattice –! Random networks –! Small-world networks –! Scale-free networks Random Clustered small world •! Degrees of separation Strogatz, 2001 Community <-> Communication 13 Communities and Networks! 14 Scale-Free Networks Frequency and cumulative distributions of cluster sizes, k, inversely proportional to kx, x ~ –2 or –3 No “knee” that implies a scale in the distribution Scale-Free Strogatz, 2001 https://en.wikipedia.org/wiki/Scale-free_network 15 Community Examples •! Families •! Classmates •! Neighbors •! Social Networks –! Facebook –! LinkedIn •! Media Networks •! Corporations •! Employees •! Customers •! Sports Leagues –! Teams •! Managers –! Players –! Trainers •! Airlines •! Cities •! Associations •! Governments –! Agencies •! Laboratories –! Managers –! Scientists •! Military organizations –! Army •! Corps –! Division »! Brigade –! Regiment •! Battalion –! Company »! Platoon –! Squad •! Soldier –! Special Operations •! Terrorist organizations 16 Multi-Agent Systems •! Specialized vs. general-purpose agents •! Organizational models •! Cooperators –! Leader/follower (hierarchical) –! Equal members •! Collaborators –! Air, ground, and sea traffic –! Customers •! Competitors –! Individual game players –! Sports teams –! Political/military organizations •! Negotiators –! Politicians –! Employer/employee representatives 17 Multi-Agent Systems •! Cooperation and collaboration should lead to win-win (non-zero-sum) solutions •! Competition should lead to winlose (zero-sum) solutions •! Negotiation should lead to win-win but may lead to win-lose solutions 18 Typical Characteristics of MultiAgent Architectures •! Federated (centralized) problem solving –! –! –! –! –! –! –! –! –! –! –! Doctrinaire Coupled Synchronous Fragile Complex Strategic Information-rich Unified Integrated Top-down Globally optimal •! Distributed problem solving –! –! –! –! –! –! –! –! –! –! –! Autonomous Independent Asynchronous Robust Simple Tactical Parsimonious Idiosyncratic Modular Bottom-up Locally optimal 19 Hierarchical Tree or Hub-and-Spoke Network? 20 What is the Nature, Quality, and Significance of Connections? •!Communication •!Collaboration •!Coordination •!Negotiation •!Competition •!Conflict 21 Connections May Connote Different Relationships •!Communication •!Collaboration •!Coordination •!Negotiation •!Competition •!Conflict 22 Competition 23 Conventional Conflict 24 Unconventional ( Asymmetric) Conflict 25 System Analysis of the 9/11 Terrorist Network •!Hijackers –!AA11 –!AA77 –!UA93 –!UA175 •!Accomplices http://pear.accc.uic.edu/ojs/index.php/fm/article/view/941/863 26 27 Multi-Agent Scenarios Modeled as Optimal Control Problems! 28 A Federated Optimization Problem •! Dynamic models for two agents, A and B, are coupled to each other and expressed as a single system ! F A x! (t) = Fx(t) + Gu(t) = # B # FA " •! FBA $ ! x A &# FB & #" x B % $ ! GA &+# B &% #" G A $ & &% Cost function minimizes performance-control tradeoff &( 1 t f *( E(J ) = E ' % !" xT (t)Qx(t) + uT (t)Ru(t) #$dt + () 2 to (, & tf ! ! Q QA # ! x # (1 A B . - A . + ! u AT = E ' % - ! xTA xTB # - B " $ 2 . -" x B .$ " Q Q B () to " $ " A •! G AB $ ! u A &# G B & #" u B % ! R A u BT # - B $- R " A R AB # ! u A .R B . -" u B $ # # *( . .dt + .$ . ( $ , Optimal feedback control laws are coupled to each other " uA u(t) = !Cx̂(t) = $ $# u B " C % A ' = !$ B $ CA '& # C AB % " x̂ A '$ C B ' $# x̂ B & % ' '& 29 A Distributed Optimization Problem •! Coupling between actions of two agents, A and B, is negligible ! FA x! (t) = Fx(t) + Gu(t) = # #" 0 0 $ ! xA &# FB & # x B %" $ ! GA &+# &% #" 0 &( 1 t f T *( E(J ) = E ' % !" x (t)Qx(t) + uT (t)Ru(t) #$dt + () 2 to (, & tf ! ! QA 0 # ! x A # (1 .. + ! u AT = E ' % - ! xTA xTB # " $ " 0 Q x 2 -" B .() to " $ " B .$ •! •! 0 $ ! uA &# G B & # uB %" ! RA u BT # $- 0 " $ & &% 0 # ! uA .R B . - uB $" # # *( . .dt + .$ .$ ( , Each sub-system can be optimized separately Each control depends only on separate sub-state " RA u(t) = ! $ $# 0 !1 " uA 0 % ' GT Sx̂(t) = !Cx(t) = $ RB ' $# u B & % " CA ' = !$ '& $# 0 0 % " x̂ A '$ C B ' $ x̂ B &# % ' '& 30 Pursuit-Evasion:! A Competitive Optimization Problem •! Pursuers goal: minimize final miss distance •! Evaders goal: maximize final miss distance •! Linear model with two competitors, P and E ! x! P x! (t) = Fx(t) + Gu(t) = # #" x! E •! $ ! FP &=# &% #" 0 0 $ ! xP &# FE & # x E %" $ ! GP &+# &% #" 0 0 $ ! uP &# G E & # uE %" $ & &% Example of a differential game, Isaacs (1965), Bryson & Ho (1969) 31 Pursuit-Evasion:! A Competitive Optimization Problem •! Quadratic minimax (saddle-point) cost function tf 1 (& 1 (* E(J ) = E ' !" xT (t f )S(t f )x(t f ) #$ + % "! xT (t)Qx(t) + uT (t)Ru(t) #$dt + 2 2 to )( ,( & ! S S PE # ( 1 ! xTP (t f ) xTE (t f ) # - P . . $ - S EP S E . (2 " $f " ( = E' ! QP QPE ( 1 tf ! (+ % - ! xTP (t) xTE (t) # $ - QEP QE () 2 to -" " " ! x P (t f ) - x E (t f ) " # . . $ # ! x P (t) # ! RP . + ! uPT (t) uET (t) # ." $ .$ -" x E (t) .$ -" 0 0 /R E * ( ( ( + # ! uP (t) # # ( . .dt ( ..$ -" uE (t) .$ .$ ( , •! Optimal control laws for pursuer and evader ! u P (t) $ ! C P (t) C PE (t) $ ! x̂ P (t) $ & = '# &# & u(t) = # #" u E (t) &% #" C EP (t) C E (t) &% #" x̂ E (t) &% 32 Coordination 33 Collaboration 34 Decomposition into Fast and Slow Models! 35 Reduction of Dynamic Model Order •! Separation of high-order models into loosely coupled or decoupled lower order approximations " !!x fast $ $# !!x slow % " Ffast '=$ slow '& $ Ffast # " Ff =$ $# small fast % " !x fast Fslow '$ Fslow ' $# !x slow & small % " !x f '$ Fs ' $ !x s &# % " G fast '+$ '& $ G fast slow # % " Gf '+$ '& $# small G slow fast % " !u fast '$ G slow ' $# !u slow & small % " !u f '$ G s ' $ !u s &# % ' '& % ' '& 36 Truncation of a Dynamic Model •! Dynamic model order reduction when –! Two modes are only slightly coupled –! Time scales of motions are far apart –! Forcing terms are largely independent " !!x f $ $# !!x s % " Ff '=$ '& $ Ffs # Fsf % " !x f '$ Fs ' $# !x s & " Ff =$ $# small " Ff ($ $# 0 •! G sf % " !u f '$ G s ' $# !u s & % " Gf '+$ '& $ G sf # small % " !x f '$ Fs ' $ !x s &# % " Gf '+$ '& $# small 0 % " !x f % " G f '$ '+$ Fs ' $ !x s ' $ 0 &# & # % ' '& small % " !u f '$ G s ' $ !u s &# % ' '& 0 % " !u f % '$ ' G s ' $ !u s ' &# & Approximation: Modes can be analyzed and control systems can be designed separately !!x f = Ff !x f + G f !u f !!x s = Fs !x s + G s !u s 37 Residualization of a Dynamic Model •! Dynamic model order reduction when –! Two modes are coupled –! Time scales of motions are separated –! Fast mode is stable " !!x f $ $# !!x s % " Ff '=$ '& $ Ffs # " Ff =$ $# small •! Fsf % " !x f '$ Fs ' $# !x s & small % " !x f '$ Fs ' $ !x s &# % " Gf '+$ '& $ G f s # % " Gf '+$ '& $# small G s f % " !u f '$ G s ' $# !u s & % ' '& small % " !u f '$ G s ' $ !u s &# % ' '& Approximation: Motions can be analyzed separately using different clocks –! Fast mode reaches steady state instantaneously on slow-mode time scale –! Slow mode produces slowly changing bias disturbances on fast-mode time scale 38 Residualized Fast Mode Fast mode dynamics !!x f = Ff !x f + G f !u f + ( Fsf !x s + G f s !u s )"Bias If fast mode is not stable, it could be stabilized by inner loop control ( !!x f = Ff !x f + G f !u c " C f !x f ( + F !x s + G !u s ( f s ) f s ) ) " Bias = Ff " G f C f !x f + G f !u fc ( + F !x s + G sf !u s Fast Mode Inner Loop Control Law f s ) " Bias 39 Fast Mode in Quasi-Steady State Assume that fast mode reaches steady state on a time scale that is short compared to the slow mode 0 ! Ff "x f + Fsf "x s + G f "u f + G sf "u s "!x s = Ffs "x f + Fs "x s + G s "u s + G sf "u f Algebraic solution for fast variable 0 ! Ff "x f + Fsf "x s + G f "u f + G sf "u s Ff "x f = #Fsf "x s # G f "u f # G sf "u s ( "x f = #Ff#1 Fsf "x s + G f "u f + G sf "u s ) 40 Residualized Slow Mode Substitute quasi-steady fast variable in differential equation for slow variable ( ) !!x s = "Ffs #$ Ff "1 Fsf !x s + G f !u f + G f s !u s %& + Fs !x s + G s !u s + G s f !u f = #$ Fs " Ffs Ff "1Fsf %& !x s + #$G s " Ffs Ff "1G sf %& !u s + #$G sf " Ffs Ff "1G f %& !u f Residualized equation for slow variable !!x s = FsNEW !x s + G sNEW " !u f $ $# !u s % ' '& Slow Mode Outer Loop Control Law Control law can be designed for reduced-order slow model, assuming inner loop has been stabilized separately 41 Air Traffic Management: A Collaborative Multi-Agent System https://www.flightradar24.com 42 Elements of Principled Negotiation [Fisher, Ury (1981) Fry (1991)] •! •! •! •! •! Example of decision-making Separate agents* from the problem Focus on interests, not positions Invent options for mutual gain Insist on using objective criteria * people, organizations, entities, … 43 Intelligent Agents in Air Traffic Management 44 Principled Negotiation Flow Chart 45 Expert System Diagram for Principled Negotiation (Wangermann and Stengel) •! •! •! •! Separate agents* from the problem Focus on interests, not positions Invent options for mutual gain Insist on using objective criteria 46 Graphical Representation of Knowledge: Principled Negotiation in Air Traffic Management 47 Principled Negotiation: Getting Past No (Ury, 1991) •! Prepare by identifying barriers to cooperation, options, standards, and your Best Alternative to a Negotiated Agreement (BATNA) •! Understand your goals, limits, and acceptable outcomes •! Buy time to think , deflect attacks •! Know your hot buttons •! Acknowledge opposing arguments •! Agree when you can without conceding •! Express your views without provoking •! I statements, not you statements •! Negotiate the rules of the game •! Reframe the negotiation •! Build a golden bridge that allows opponent to retreat gracefully •! Engage third-party mediation or arbitration •! Aim for mutual satisfaction, not victory •! Forge a lasting agreement 48 Supplementary Material 49 Intelligent Aircraft/Airspace System Flow Control 50 Intelligent Aircraft/Airspace System Departure Control 51 A Cooperative MultiAgent System 52 Multi-Agent Control Example Based on Linear-Quadratic-Gaussian ! (LQG) Optimal Control •! Linear dynamic model x! (t) = Fx(t) + Gu(t) + Lw(t) •! Quadratic cost function tf + ' ) )! "# x(t f ) $% + & L [ x(t), u(t)]dt ) ) to E(J ) = E ( , tf + 1 ') T ) ) ) = (x (t f )S f x(t f ) + & "# xT (t)Qx(t) + uT (t)Ru(t) $%dt , ) ) 2 *) to -) )*) 53 Conclusion •! Robots and Robotics •! Intelligent Systems •! Robots + Intelligent Systems = Intelligent Robotics –! –! –! –! Mechanical devices Design of mechanical devices Use of mechanical devices Control processes, sensors, and algorithms used in humans, animals, and machines –! Systems to perform useful functions driven by goals and current knowledge –! Systems that emulate biological and cognitive processes –! Systems that process information to achieve objectives –! Systems that learn by example –! Systems that adapt to a changing environment –! Optimization 54 MAE 345 Course Learning Objectives! !! !! !! !! !! !! !! !! !! !! Dynamics and control of robotic devices.! Cognitive and biological paradigms for system design. !! Estimate the behavior of dynamic systems. !! Apply of decision-making concepts, including neural networks, expert systems, and genetic algorithms.! Components of systems for decision-making and control, such as sensors, actuators, and computers. !! Systems-engineering approach to the analysis, design, and testing of robotic devices. !! Computational problem-solving, through thorough knowledge, application, and development of analytical software. !! Historical context within which robotics and intelligent systems have evolved.! Global and ethical impact of robotics and intelligent systems in the context of contemporary society. !! Oral and written presentation.! 55
© Copyright 2026 Paperzz