Necessary and Sufficient Graphical Conditions 1 for Formation Control of Unicycles Zhiyun Lin* Bruce Francis Manfredi Maggiore [email protected] [email protected] [email protected] Tel: 416-978-6289 Tel: 416-978-6343 Tel: 416-946-5095 Department of Electrical and Computer Engineering University of Toronto 10 King’s College Road Toronto, Canada, M5S 3G4 Fax: 416-978-0804 Submitted for publication November 6, 2003 Abstract The feasibility problem is studied of achieving a specified formation among a group of autonomous unicycles by local distributed control. The directed graph defined by the information flow plays a key role. It is proved that formation stabilization is feasible if and only if the sensor graph has a globally reachable node. A similar result is given for formation stabilization to a line and to more general geometric arrangements. Keywords: Multiagent system, distributed control, time-varying feedback, nonholonomic mobile robots. I. I NTRODUCTION The problem of coordinated control of a group of autonomous wheeled vehicles is of recent interest in control and robotics due to the broad range of applications of multi-vehicle systems in space missions, operations in hazardous environments, and military missions. Distributed vehicles can readily exhibit the attractive characteristics of structural flexibility, reliability through redundancy, and simple hardware as compared to sophisticated individual vehicles. * Corresponding author 2 Despite the many advantages inherent in distributed multi-vehicle systems, there are challenges in coordination and control due to the absence of a centralized supervisor and global information. Individual vehicles in a distributed system must be capable of collectively accomplishing tasks using only locally sensed information and little or no direct communication. Coordinated control of multi-vehicle systems includes many aspects, one of the most important and fundamental being formation control. Over the past decade, many researchers have worked on formation control problems with differences regarding the types of agent dynamics, the varieties of the control strategies, and the types of tasks demanded. In 1990, Sugihara and Suzuki [24] proposed a simple algorithm for a group of point-mass type robots to form approximations to circles and simple polygons. And in the years following, distributed algorithms were presented in [3], [2], and [25] with the objective of getting a group of such robots to congregate at a common location. Moving synchronously in discrete-time steps, the robots iteratively observe neighbors within some visibility range and follow simple rules to update their positions. Also, in [18], stability of asynchronous swarms with a fixed communication topology is studied, where stability is used to characterize the cohesiveness of a swarm. Convergence to a common point is an example of an agreement problem: Agents initially without a common reference frame eventually come to agree upon a reference point by virtue of having arrived at one. Besides being of relevance to the rendezvous problem, convergence to a common point is also of interest because if it is feasible, then so is convergence to other formations, as shown for example in [17]. Jadbabaie et al. [12] studied a different agreement problem: Getting autonomous agents in the plane to move in a common direction. In addition to the references mentioned so far, there have been many results on the mathematical analysis of formation control. In [21], formation stabilization of a group of agents with linear dynamics (double-integrator) is studied using structural potential functions. An alternative is to use artificial potential functions and virtual leaders as in [16]. The approach known as leader-following has been widely used in maintaining a desired formation while moving, e.g., [6], [7]. Similarly to [12], reference [17] focuses on the problem of achieving a specified formation among a group of mobile autonomous agents by distributed control when the communication topology is dynamic because agents come into and go out of sensor range. Reference [13] studies achievable equilibrium formations of unicycles 1 each moving at 1 By a unicycle we mean a nonholonomic system moving in the plane and with one angle of orientation. It’s sometimes also referred to as a shopping cart, the kinematics being identical. 3 unit speed and subject to steering control and presents stabilizing control laws, wherein each unicycle senses all others. And in [19], a circular formation is achieved for a group of unicycles using the strategy of cyclic pursuit. Motivated by the fact that no continuous time-invariant feedback control law can stabilize a nonholonomic system to a point, in 1991 Samson and Ait-Abderrahim [23] showed that smooth time-varying feedback can stabilize a unicycle. Since then, much work (for instance, [20], [22]) has focused on smooth time-varying feedback control of nonholonomic systems, and averaging theory has been used to study stability [20] and also motion planning [10]. On the question of formations, in [27], [28] a time-varying feedback control law is proposed for multiple Hilare-type mobile robots and averaging theory is used to analyze stability. As a natural extension of our previous work [19], [17], and motivated by the proposed strategy in [28], [27], in this paper the problem is studied of achieving a specified formation among a group of unicycles by distributed control. Each unicycle is equipped with an onboard sensor, by which it can measure relative displacements to certain neighbors; in particular, we do not assume that the unicycles possess a common reference frame. Central to a discussion of formation control is the nature of the information flow throughout the formation. This information flow can be modeled by a sensor digraph (directed graph), where a link from node i to node j indicates that vehicle i can sense the position of vehicle j —but only with respect to the local coordinate frame of vehicle i. In this paper, we assume the sensor digraph is static—the dynamic case, where ad hoc links can be established or dropped, is harder. Our analysis relies on several tools from algebraic graph theory [9], non-negative matrix theory [5], [15], and averaging theory [1], [14], [20]. We introduce a new concept for our analysis: H(α, m) stability of the Laplacian of the digraph. Our first main result is that formation stabilization to a common point is feasible if and only if the sensor digraph has a globally reachable node (a node to which there is a directed path from every other node). That is, there exists at least one unicycle that is viewable, perhaps indirectly by hopping from one unicycle to another, by all other unicycles. This is precisely the degree of connectedness required and is much weaker than strong connectedness of the sensor digraph (as in cyclic pursuit [19], for example). The key to this main result is a new result in graph theory: A digraph has a globally reachable node iff 0 is a simple eigenvalue of the Laplacian of the digraph. Our proof of sufficiency is constructive: We present an explicit smooth periodic feedback controller, and prove convergence using averaging theory. 4 In addition, we show how formation stabilization to a common point can be adapted to any geometric pattern. Our second main result concerns formation stabilization to a line. This turns out to be feasible if and only if there are at most two disjoint closed sets of nodes in the sensor digraph. Again, our proof of sufficiency is constructive. Finally, we introduce a special sensor digraph which guarantees that all vehicles converge to a line segment, equally spaced. This is an extension to unicycles of a line-formation scheme of Wagner and Bruckstein [26]. II. M ATHEMATICAL P RELIMINARIES In this section, we review some basic notions in graph theory [4], [8], algebraic graph theory [9], and matrix theory [5], [15] that are useful in linking graph-theoretic and control-theoretic concepts. Finally, an averaging result is given that will be the key tool in the stability analysis throughout the paper. A. Graph Theory A directed graph or digraph G = (V, E) consists of a finite nonempty set V of nodes and a (possibly empty) set E of ordered pairs of nodes of G called edges or links, where e ij = (vi , vj ) ∈ E and vi , vj ∈ V . If U is a nonempty subset of V , then the digraph (U, E ∩ (U × U)) is termed an induced subgraph. If each edge is associated with a positive number, called the weight of the edge, then the digraph is named a weighted digraph. An unweighted digraph is just a weighted digraph with each of the edges bearing weight 1. The in-degree and out-degree of a node are the sum of the weights of the edges at its head and tail, respectively. A path on a digraph G = (V, E) of length N from vj0 to vjN is an ordered set of distinct nodes {vj0 , vj1 , . . . , vjN } such that (vji−1 , vji ) ∈ E, ∀ i = 1, 2, . . . , N . If there is a path in G from one node v i to another vj , then vj is said to be reachable from vi , written vi → vj . If not, then vj is said to be not reachable from vi , written vi 9 vj . If a node vi is reachable from every other node in the digraph, then we say it is globally reachable. A digraph is strongly connected if every two of its nodes, say u and v , are such that u is reachable from v and v is reachable from u. So a digraph with at least two nodes is strongly connected if and only if each node is globally reachable. For a digraph with only one node, the node is globally reachable, but the digraph is not strongly connected if the node has no self-loop. For a digraph G = (V, E), if U is a nonempty subset of V and u 9 v for all u ∈ U and v ∈ V − U , then U is said to be closed. 5 PSfrag replacements As an example, in Fig. 1 the digraph (a) is strongly connected but the digraphs (b) and (c) are not. The globally reachable node sets of digraphs (b) and (c) are {1, 2, 6} and {6} respectively. 1 1 6 2 6 2 6 2 5 3 5 3 5 3 4 (a) Fig. 1. 1 4 (b) 4 (c) Three digraphs. B. Matrix Theory We now turn to matrices associated with graphs. For this purpose, we first give some notions and properties of a class of matrices. A matrix A is said to be nonnegative (positive) if all elements are nonnegative (positive). An n × n matrix A is cogredient to a matrix E if for some permutation matrix P , P AP T = E . The matrix A is reducible if it is cogredient to B 0 , E= C D (q) where B and D are square matrices, or if n = 1 and A = 0. Otherwise, A is irreducible. Let a ij denote the (i, j) element of Aq . A nonnegative matrix A is irreducible if and only if for every (i, j) there exists (q) q ≥ 1 such that aij > 0. Suppose that the nodes of a weighted digraph G = (V, E) are enumerated and denoted {v 1 , . . . , vn }. The adjacency matrix of G , denoted A, is a square nonnegative matrix with rows and columns indexed by the nodes of G , such that the ij -entry of A is the weight associated with the edge if (v i , vj ) ∈ E , and is zero otherwise. Let D be the diagonal matrix with the out-degree of each node along the diagonal and call it the degree matrix of G . The Laplacian of the digraph is defined as L = D − A. Taking the digraph (b) in Fig. 1 as an example, suppose each edge bears weight 1. Then we have, 6 respectively, 0 1 0 0 0 1 A= 0 0 0 0 1 0 0 0 0 0 1 0 0 0 0 0 0 1 0 0 0 0 1 0 0 2 0 1 0 0 , D = 0 0 0 0 0 1 0 0 0 0 1 0 0 1 0 0 0 0 0 0 0 1 −1 0 0 0 0 0 1 0 0 0 −1 0 0 0 0 −1 2 −1 0 0 0 0 0 . , L = 0 0 0 1 −1 0 1 0 0 0 0 −1 0 2 −1 0 2 0 −1 0 0 0 0 1 0 0 1 On the other hand, given a nonnegative matrix, we can define a corresponding digraph. The associated digraph G(A) of an n × n matrix A = (aij ) consists of n nodes v1 , v2 , . . . , vn , where an edge leads from (q) vi to vj if and only if aij 6= 0. Then aij > 0 if and only if there exists a sequence of q edges from v i to vj . Furthermore, A is irreducible if and only if G(A) is strongly connected. An important class of nonnegative matrices are row stochastic matrices. A nonnegative matrix is called row stochastic, or stochastic for short, if all row sums are equal to one. In addition, a matrix A of the form A = sI − B (s > 0, B a nonnegative matrix) for which s ≥ ρ(B) is called an M -matrix. Next we will review some properties of the Kronecker product. If the orders of the matrices involved are such that all the operations below are defined, then (a) (A + B) ⊗ C = A ⊗ C + B ⊗ C ; (b) (A ⊗ B)T = AT ⊗ B T ; (c) If A, C ∈ Rm×m and B, D ∈ Rn×n , then (A ⊗ B)(C ⊗ D) = (AC) ⊗ (BD); (d) If λ1 , . . . , λm are the eigenvalues of A ∈ Rm×m and µ1 , . . . , µn are the eigenvalues of B ∈ Rn×n , then the eigenvalues of A ⊗ B are the mn numbers λr µs , r = 1, . . . , m, s = 1, . . . n. Finally a definition from [11] will be modified in order to better suit our application. Let α = {α1 , α2 , . . . , αp } be a partition of {1, 2, . . . , n}. A block diagonal matrix with diagonal blocks indexed by α1 , α2 , . . . , αp is said to be α-diagonal. A scalar multiple of the identity matrix is said to be a scalar matrix. An α-diagonal matrix D is said to be an α-scalar matrix if each block matrix in D is a scalar matrix. For α = {{1, 2}, 3}, simple examples of α-diagonal matrix 2 1 3 0 A1 = 2 2 0 and A2 = 0 0 0 0 5 and α-scalar matrix are, respectively, 0 0 2 0 . 0 5 7 Definition 1: Let α = {α1 , α2 , . . . , αp } be a partition of {1, 2, . . . , n} and m ≥ 0 an integer. An n × n matrix A is said to be H(α, m)-stable if (a) 0 is an eigenvalue of A of algebraic and geometric multiplicity m, while all other eigenvalues have negative real part, (b) for every α-diagonal positive definite symmetric matrix R, 0 is an eigenvalue of RA of algebraic and geometric multiplicity m, while all other eigenvalues have negative real part. Obviously, an H(α, 0)-stable matrix is asymptotically stable, but the converse does not hold in general. A simple counterexample is A= −5 −4 4 3 , and α = {{1}, {2}}. Then A is asymptotically stable with eigenvalues{−1, −1} but RA is unstable with the choice of α1 0 . diagonal positive definite symmetric matrix R = 0 2 C. Averaging Theory In this subsection we introduce an averaging result that is useful for analyzing slowly time-varying systems. More details can be found in [1], [14], [20]. Consider the system ẋ = f (t, x, ) (1) where f (t, x, ) is T -periodic in t for some T > 0. We associate with (1) an autonomous averaged system ẋ = fav (x), where 1 fav (x) = T Z (2) T f (τ, x, 0)dτ. 0 Then we arrive at the following theorem. Theorem 1: [14] Let f (t, x, ) be continuous and bounded, and have continuous and bounded derivatives up to the second order with respect to (x, ) for (t, x, ) ∈ [0, ∞) × D × [0, 0 ], where D is a neighborhood of the origin. Suppose f is T -periodic in t for some T > 0 and that f (t, 0, ) = 0 for all t ≥ 0 and ∈ [0, 0 ]. If 0 is an exponentially stable equilibrium point of the averaged system (2), then there is a positive constant ∗ ≤ 0 such that for all 0 < < ∗ , 0 is an exponentially stable equilibrium point of (1). 8 III. P ROBLEM S TATEMENT AND M AIN R ESULTS Before treating unicycles, it is perhaps illuminating to give a result for the much simpler case of point masses. Consider n “point-mass robots” whose positions are modeled by complex numbers, z 1 , . . . , zn , in the plane. Assume a kinematic model of velocity control: ż i = ui . Assume each robot senses the relative positions of a subgroup, Ni , of the other robots. Let yi denote the vector whose components are the relative positions zm − zi , as m ranges over Ni . Thus yi , a vector of dimension the cardinality of Ni , represents the information available to ui . We allow controllers of the form ui = Fi yi , or ui = 0 if Ni is empty. Thus, yi = 0 =⇒ ui = 0 =⇒ żi = 0; that is, robot i does not move if all robots it senses are collocated with it (or if it does not sense any other robot). The problem of convergence to a common point is this: Problem 1: Find, if possible, F1 , . . . , Fn such that (∀ z1 (0), . . . , zn (0))(∃ zss ) lim z1 (t) = · · · = lim zn (t) = zss . t→∞ t→∞ Now define the sensor graph G for this setup: There is a directed edge from node i to node m if and only if m ∈ Ni . Theorem 2: 1) Problem 1 is solvable if and only if G has a globally reachable node. 2) When Problem 1 is solvable, one solution is h Fi = 1 · · · The proof is omitted. 1 i . Now we turn to the main topic of unicycles. We can identify the real plane, R 2 , and the complex plane, C, by identifying a column vector, zi , and a complex number, zi . Now consider a wheeled vehicle PSfrag replacements with coordinates (xi , yi , θi ) with respect to PSfrag a globalreplacements frame g Σ (see Fig. 2). vi ri ωi θi yi iΣ gΣ Fig. 2. si Wheeled vehicle. iΣ zi gΣ xi Fig. 3. Frenet-Serret frame. trajectory 9 The location of the vehicle in the plane is xi zi = yi or zi = xi + jyi . The vehicle has the nonholonomic constraint of pure rolling and non-slipping and is described kinematically as ẋ = vi cos(θi ), i ẏi = vi sin(θi ), θ̇ = ω , i or i ż = v ejθi , i i θ̇ = ω . i (3) i Following [13], we construct a moving frame i Σ, the Frenet-Serret frame, that is fixed on the vehicle (see Fig. 3). Let ri be the unit vector tangent to the trajectory at the current location of the vehicle (r i is the normalized velocity vector) and let si be ri rotated by π/2. Since the vehicle is moving at speed vi , vi ri = żi , and so in complex form ri = ejθi , Thus ṙi = d jθi = jejθi θ̇i = si ωi , e dt si = jri . ṡi = j ṙi = jsi ωi = −ri ωi . The kinematic equations using the Frenet-Serret frame are ż = vi ri , i ṙi = ωi si , ṡ = −ω r . i (4) i i Now consider n wheeled vehicles, indexed by i. We refer to the individual vehicles as nodes and the information flows as links. Although the vehicles in the group are dynamically decoupled, meaning the motion of one vehicle does not directly affect any of the other vehicles, they are coupled through the sensor information flow. A natural way to model the interconnection topology is a digraph G = (V, E) with V = {1, 2 . . . , n} in which the vehicles are the nodes and the links are directed edges. The directed edge from node i to node m is one of the graph’s edges just in case vehicle i can sense vehicle m. (Thus, information flows in the direction opposite to the orientation of the edges.) We refer to this as a sensor digraph. Let Ni denote the set of labels of those vehicles sensed by vehicle i. In this paper, we assume N i is time-invariant, meaning the information flow topology is static. In the control law that we study, no PSfrag replacements mΣ yim xim zm zi gΣ Fig. 4. 10 iΣ Local information. vehicle can access the absolute positions of other vehicles or its own. Specifically, vehicle i can measure only the relative positions of sensed vehicles with respect to its own Frenet-Serret frame (see Fig. 4), xim = (zm − zi ) · ri , m ∈ Ni , (5) y = (z − z ) · s , im m i i where dot denotes dot product. This leads to the following definition. Definition 2: A controller (vi , wi ), i = 1, . . . n, is said to be a local information controller if v = g (t, x , y )| i i im im m∈Ni , i = 1, . . . , n ω = h (t, x , y )| i i im im m∈Ni where gi , hi are smooth functions of their arguments, and gi is such that {(∀ m ∈ Ni ) zm = zi } ⇒ {vi = 0}. Notice that in our definition a vehicle does not translate (but it can rotate) when either it cannot sense any other vehicle or its neighbors have all converged to its position. In what follows, we present the two main problems investigated in this paper, together with necessary and sufficient conditions for their solution. Problem 2: (Formation Stabilization to a Point) Find, if possible, a local information controller such that for all (xi (t0 ), yi (t0 ), θi (t0 )) ∈ R3 , i = 1, . . . n, and all t0 ∈ R, there exists zss ∈ R2 such that lim zi (t) = zss , ∀ i. t→∞ Theorem 3: (Section IV) Problem 2 is solvable if and only if the sensor digraph has a globally reachable node. Problem 3: (Formation Stabilization to a Line) Find, if possible, a local information controller such that for all (xi (t0 ), yi (t0 ), θi (t0 )) ∈ R3 , i = 1, . . . n, and all t0 ∈ R, all vehicles converge to form a line. 11 Theorem 4: (Section V) Problem 3 is solvable if and only if there are at most two disjoint closed sets of nodes in the sensor digraph. In Section V we also introduce a special sensor digraph which guarantees that all vehicles converge to a line segment, equally spaced. In Section VI we show how our solution to Problem 2 can be employed to achieve formation stabilization to any geometric pattern. IV. F ORMATION S TABILIZATION TO A P OINT In this section we prove Theorem 3. The proof requires the following lemmas. Lemma 1: A digraph G = (V, E) with |V| ≥ 2 has no globally reachable node if and only it has two disjoint closed subsets of V . Proof: (Sufficiency) The sufficient condition follows directly. (Necessity) We prove the necessary condition by means of a constructive algorithm. Firstly, select any node, say vi1 , in V and partition V as V = {vi1 } ∪ V1 ∪ V10 , where every node in V1 can reach vi1 and no node in V10 can reach vi1 . Then V10 is closed. Also, V10 6= φ, since vi1 is not globally reachable. Secondly, select any node, say vi2 in V10 . Since V10 is not empty, we can always find one. Check if the node vi2 is globally reachable in the induced subgraph G(V 10 ). If so, then partition V as V = W1 ∪ W10 ∪ V10 , where every node in W1 can reach (some node in) V10 and no node in W10 can reach V10 . Then W10 is closed. Also, W10 6= φ, since vi2 is not globally reachable. Thus V10 and W10 are two disjoint closed subsets for the digraph G . If instead the condition above is false, partition V 10 as V10 = {vi2 } ∪ V2 ∪ V20 , where every node in V2 can reach vi2 and no node in V20 can reach vi2 . Then V20 is closed and nonempty. Next, select any node, say vi3 , in V20 and check if vi3 is globally reachable in the induced subgraph G(V 20 ). If so, then the conclusion follows by the same argument as above. If it does not, repeat this procedure again until this condition holds. Since the digraph has a finite number of nodes and V k0 is getting smaller each step, eventually the condition must hold and two disjoint closed subsets will have been constructed. Next, we present a set of useful lemmas regarding the graph Laplacian. In what follows, let L be the Laplacian of a digraph G = (V, E) with n nodes. Lemma 2: 0 is an eigenvalue of L, while all other eigenvalues have positive real part. 12 Proof: By the definition of Laplacian matrix and the Geršgorin disk theorem, the conclusion follows immediately. Lemma 3: If the digraph is strongly connected, then 0 is a simple eigenvalue of L. Furthermore, L has a positive right-eigenvector 1 = [1 · · · 1]T and a positive left-eigenvector β = [ β1 β2 · · · β n ]T corresponding to 0. Proof: We write the Laplacian matrix as L = D−A, where D , A are the degree matrix and the adjacency matrix, respectively. By definition, the Laplacian L has an eigenvalue at 0 and a right eigenvector 1 = [1 · · · 1]T . Now we will show that 0 has algebraic multiplicity 1 and its corresponding left eigenvector e := D −1 L = I − D −1 A =: I − A e (D is invertible since each node has nonzero is positive. Define L e have the same distribution out-degree, since the digraph is strongly connected). The matrices A and A e is also strongly connected and of zero entries and nonzero entries, and so the associated digraph G( A) e is irreducible. In addition, A e is row stochastic by its definition, so it follows that ρ( A) e = 1. therefore A e = 1 is an eigenvalue of algebraic multiplicity 1 and its Hence, by the Perron-Frobenius theorem ρ(A) fn ]T . Hence, 0 is a simple corresponding left eigenvector is positive, denoted βe = [ βe1 βe2 · · · β e , and so of L. Furthermore, βe is the left eigenvector of L e corresponding to 0. As a eigenvalue of L e −1 is a left eigenvector of L corresponding to the 0 eigenvalue, which is positive. consequence, β = βD Lemma 4: The digraph has a globally reachable node if and only if 0 is a simple eigenvalue of L. Proof: (Necessity) Let V 0 be the set containing all the globally reachable nodes. Since the digraph has a globally reachable node, V 0 contains either all n nodes or r (1 ≤ r < n) nodes. In the first case that, V 0 = V , the digraph G is strongly connected. Then by Lemma 3, 0 is an eigenvalue of L of algebraic multiplicity 1. Next, for the case when V 0 contains r (1 ≤ r < n) nodes, we have that u → v and v 9 u for all u ∈ V − V 0 , v ∈ V 0 . If necessary, we can renumber the nodes of the digraph G so that V 0 = {1, 2, . . . , r}. So D, A, L have the form A1 0 D1 0 , , A = D= A2 A3 0 D3 L= L1 0 L2 L3 = D1 0 0 D3 − A1 0 A2 A3 , 13 where L1 ∈ Rr×r , L3 ∈ R(n−r)×(n−r) . In particular, when r = 1, L1 = D1 = A1 = 0. Since D may be singular, it’s convenient to define 1, if r = 1, Ds1 = D , if 1 < r < n, 1 and As1 1, if r = 1, = A , if 1 < r < n, 1 and rewrite L as (L doesn’t change) A 0 D 0 L1 0 . − s1 = s1 (6) L= A2 A3 0 D3 L2 L3 Ds1 0 is invertible. Then by The associated digraph G(As1 ) is strongly connected and Ds = 0 D3 Lemma 3, L1 = Ds1 − As1 has a simple eigenvalue at 0. Define e e A 0 A 0 L 0 e := D −1 s1 e := D −1 L = 1 e = 1 , and L = I − A. A s s e e e2 L e3 A2 A3 A2 A3 L e is row stochastic, and so is A ek (the k th power of A e). Furthermore, A ek can be Thus, the matrix A expressed in the form ek = A ek A 1 0 e(k) A ek A 3 2 e(k) depending upon the matrices A e1 , A e2 and A e3 . Since every node in V − V 0 can reach for some matrix A 2 e(n) must have a positive entry. Thus the one of the nodes in V 0 after n steps, it follows that each row of A 2 en is less than one and therefore ρ(A e3 ) < 1. Hence, L e3 = I − A e3 is nonsingular maximal row sum for A 3 and so is L3 . In conclusion, 0 is an eigenvalue of L of algebraic multiplicity 1. (Sufficiency) Suppose by way of contradiction that the digraph has no globally reachable node. By Lemma 1, it follows that there exist two disjoint closed subsets of V . So if necessary, we can renumber the nodes so that L has the form L11 0 0 L = 0 L22 0 L31 L32 L33 . By Lemma 2, L11 has a zero eigenvalue and L22 has a zero eigenvalue, too. Hence, 0 is not a simple eigenvalue of L, a contradiction. The sensors produce a kind of symmetry in the system equations, in that sensory capabilities with respect to x and y coordinates are identical. For this reason, the Laplacian L leads to the matrix L ⊗ I 2 , which we now study. 14 Lemma 5: Let L(2) = L ⊗ I2 . If the digraph is strongly connected, then −L(2) is H(α, 2) stable, where α = {{1, 2}, {3, 4}, . . . , {2n − 1, 2n}}. Proof: By Lemma 3, L1 = 0 and β T L = 0, where β is positive. Then for P = diag(β1 , β2 , . . . , βn ), P is a positive definite matrix. Let Q = LT P + P L. Then we have Q1 = LT P 1 + P L1 = LT β = 0 and Q = LT P + P L = (D − A)T P + P (D − A) = (D T P + P D) − (AT P + P A), where D T P +P D is a positive diagonal matrix, and AT P +P A is a nonnegative irreducible matrix since the sum of an irreducible matrix and a nonnegative matrix is still irreducible. If we treat A T P + P A as the adjacency matrix of a weighted digraph G1 and D T P + P D as its degree matrix, then we know G1 is strongly connected and Q is its Laplacian matrix. By Lemma 2 and Lemma 3, 0 is an eigenvalue of Q of algebraic multiplicity 1, while the others are in the open right half-plane, implying that Q is positive semi-definite. Furthermore, 1 is its right eigenvector corresponding to 0. Let α = {{1, 2}, {3, 4}, . . . , {2n − 1, 2n}}. Thus, for the matrix L (2) = L ⊗ I2 , we can define the positive-definite α-scalar matrix P(2) = diag(β1 I2 , β2 I2 , . . . , βn I2 ) = P ⊗ I2 , and then by properties (a), (b) and (c) of the Kronecker product we have LT(2) P(2) + P(2) L(2) = (L ⊗ I2 )T (P ⊗ I2 ) + (P ⊗ I2 )(L ⊗ I2 ) = (LT P ) ⊗ I2 + (P L) ⊗ I2 = (LT P + P L) ⊗ I2 = Q ⊗ I2 =: Q(2) . By property (d) of the Kronecker product, it follows that both L (2) and Q(2) have 2 zero eigenvalues and the others are in the open right half-plane and therefore Q (2) is positive semi-definite. In addition, ker(Q(2) ) = ker(L(2) ). Given an α-diagonal positive definite symmetric matrix R, R −1 is also α-diagonal, positive definite and symmetric. Since P(2) is a positive definite α-scalar matrix, P(2) R−1 = R−1 P(2) is an α-diagonal positive definite matrix, too. It now follows that (−RL(2) )T (R−1 P(2) ) + (P(2) R−1 )(−RL(2) ) = −LT(2) P(2) − P(2) L(2) = −Q(2) , 15 which is negative semi-definite. Hence, by Lyapunov theory, the eigenvalues of −RL (2) are in the open left half-plane or on the imaginary axis. Further, from the spectral properties of −L (2) we can infer that 0 is an eigenvalue of −RL(2) of algebraic and geometric multiplicity 2. Therefore, to show −L (2) is H(α, 2) stable, we only need to show that no other eigenvalues of −RL (2) , except the 2 zero eigenvalues, are on the imaginary axis. Suppose by way of contradiction that λ = jω (ω 6= 0) is one of the eigenvalues with corresponding eigenvector x. Then 0 = x∗ (RL(2) )T (R−1 P(2) ) + (P(2) R−1 )(RL(2) ) − Q(2) x = −jωx∗ (R−1 P(2) )x + jωx∗ (P(2) R−1 )x − x∗ Q(2) x = −x∗ Q(2) x. It follows that x ∈ ker(Q(2) ) = ker(L(2) ) = ker(RL(2) ) and therefore ω = 0, a contradiction. Lemma 6: Let L(2) = L ⊗ I2 . If and only if the digraph has a globally reachable node, −L (2) is H(α, 2) stable, where α = {{1, 2}, {3, 4}, . . . , {2n − 1, 2n}}. Proof: (Sufficiency) Let V 0 be the set containing all the globally reachable nodes. It is not empty but contains either all n nodes or r (1 ≤ r < n) nodes. In the former case, the digraph G is strongly connected and therefore −L(2) is H(α, 2) stable by Lemma 5. In the latter case, from the proof of Lemma 4, we have that L can be expressed as in (6), i.e., L1 0 Ds1 0 As1 0 = − , L= L2 L3 0 D3 A2 A3 where the associated digraph G(As1 ) is strongly connected and L3 is nonsingular. Hence, by Lemma 5, −L1(2) is H(α, 2) stable. Furthermore, one can easily verify that L 3 is a nonsingular M -matrix. Then it follows from the characterization of nonsingular M -matrices (page 136, [5]) that there exists a positive diagonal matrix P = diag(p1 , p2 , . . . , p(n−r) ) such that Q = LT3 P + P L3 is positive definite. Applying the Kronecker product with I 2 to both sides of the above equation and using properties (a), (b) and (c) of the Kronecker product yields Q(2) = LT3(2) P(2) + P(2) L3(2) , where P(2) is α-scalar positive definite and Q(2) is positive definite. Given an α-diagonal positive definite symmetric matrix R2 , we let P̄ = P(2) R2−1 = R2−1 P(2) , which is positive definite. Then (−R2 L3(2) )T P̄ + P̄ (−R2 L3(2) ) = −Q(2) < 0, 16 which meansthat −L3(2) is H(α, 0) stable. Notice that given an α-diagonal positive definite symmetric R1 0 R L 0 , RL(2) = 1 1(2) . Therefore, −L(2) is H(α, 2) stable. matrix R = 0 R2 R2 L2(2) R2 L3(2) (Necessity) Since −L(2) = −L ⊗ I2 is H(α, 2) stable, it follows that −L(2) has a 0 eigenvalue of algebraic multiplicity 2 and then by property (d) of the Kronecker product, −L has a simple eigenvalue at 0. Using Lemma 4, the digraph has a globally reachable node. Proof of Theorem 3: (Sufficiency) We show it by presenting the explicit smooth time-periodic feedback controller P P vi (t) = k (zm (t) − zi (t)) · ri (t), xim (t) = k i = 1, 2, . . . , n. m∈Ni m∈Ni ω (t) = cos(t), i (7) We begin by noticing that {(∀ m ∈ Ni ) zm = zi } ⇒ {vi = 0}. Further, for any t0 ∈ R, θi (t) = θi (t0 ) + sin(t), i = 1, . . . , n, which is periodic with period 2π . Next, using the identity (z · r)r = (rr T )z , we get żi = vi ri = k X [(zm − zi ) · ri ] ri = k ri riT M (θi (t)) := ri riT = and H(θ(t)) := cos2 (θi (t)) cos(θi (t)) sin(θi (t)) cos(θi (t)) sin(θi (t)) sin2 (θi (t)) M (θ1 (t)) 0 0 0 0 M (θ2 (t)) 0 0 0 0 .. 0 0 0 Thus, the overall position dynamics read as where z ∈ R2n (zm − zi ) . m∈Ni m∈Ni Define X . 0 M (θn (t)) . (8) (9) ż = −kH(θ(t))L(2) z, (10) iT h and L(2) = L ⊗ I2 (L is the Laplacian of is the position vector z = z1T z2T · · · znT the sensor digraph). The averaged system of (10) is ż = −kHav L(2) z, (11) 17 where Hav = diag(M̄1 , . . . , M̄n ) and R 2π 1 2 1 2 mi mi 2π 0 cos (θi (τ ))dτ = M̄i := R 2π 1 m2i m3i 2π 0 cos(θi (τ )) sin(θi (τ ))dτ 1 2π R 2π 0 1 2π 2 cos(θi (τ )) sin(θi (τ ))dτ . R 2π 2 0 sin (θi (τ ))dτ Then by the Cauchy-Schwarz inequality, m1i m3i ≥ (m2i ) . Since θi (t) is not constant, the inequality holds strictly. It follows that M̄i is positive definite and therefore Hav is positive definite. More exactly, we say Hav is α-diagonal positive definite with α = {{1, 2}, {3, 4}, . . . , {2n − 1, 2n}}. Since the sensor digraph has a globally reachable node, by Lemma 6 it follows that −L (2) is H(α, 2) stable. So there is a similarity transformation F such that −F −1 L(2) F = −L11 0 0 02×2 , where −L11 is Hurwitz column vectors of F must be in the null space of L (2) , i.e. the last two and a 2n , a, b ∈ R . Without loss of generality, we can choose the last two column ξ ∈R ξ =1⊗ b vectors of F to be 1 ⊗ I2×2 . Applying the transformation e = ė1 ė2 e1 e2 = F −1 z to system (10) yields = −kF −1 H(θ(t))L(2) F e = −kF −1 H(θ(t))F =: k A11 (t) 0 A12 (t) 02×2 e1 e2 L11 0 0 02×2 e1 e2 . And correspondingly, for the averaged system (11), we have e L 0 ė 1 1 = −kF −1 Hav L(2) F e = −kF −1 Hav F 11 e2 0 02×2 ė2 =: k Ā11 0 Ā12 02×2 e1 e2 . Thus, the reduced averaged system ė1 = k Ā11 e1 is exponentially stable since −L(2) is H(α, 2) stable. Then, by Theorem 1, there exists a positive constant k ∗ such that, for all 0 < k < k ∗ , global exponential stability of the reduced original system ė1 = kA11 (t)e1 is established. Also, since ė2 = kA12 (t)e1 and 18 A12 (t) is uniformly bounded, it follows that ė2 → 0 exponentially when t → ∞. This implies that e2 tends to some finite constant vector, say [xss yss ]T . In conclusion, e1 (t) xss =1⊗ , lim z(t) = lim F t→∞ t→∞ e2 (t) yss showing that the overall position dynamics globally exponentially converge to an invariant subspace, thus solving Problem 2. (Necessity) Assume formation stabilization to a point by local information controller is feasible. By way of contradiction, suppose the sensor digraph has no globally reachable node. Then it follows from Lemma 1 that there are two disjoint closed sets of nodes in the sensor digraph G = (V, E), say V 1 and V2 . Given the initial conditions satisfying zi (0) = zss1 , i ∈ V1 and zj (0) = zss2 , j ∈ V2 , then for each vehicle i in V1 , (∀ m ∈ Ni ) zm = zi and so żi = 0. Meanwhile, for each vehicle j in V2 , (∀ m ∈ Nj ) zm = zj and so żj = 0. Hence, if zss1 6= zss2 , they can not gather at the same point, a contradiction. Remark 1: In [28], a general average of M (θi (t)) in (8) R R 1 T 1 T 2 (θ (τ ))dτ cos cos(θ (τ )) sin(θ (τ ))dτ i i i T 0 Mav (T ) = R T 0 R 1 T 1 T 2 cos(θ (τ )) sin(θ (τ ))dτ sin (θ (τ ))dτ i i i T 0 T 0 is considered and it is claimed that det(Mav ) > 0, where Mav = lim Mav (T ), if and only if θi (t) is T →∞ not constant. This is only true if T is finite. Specifically, if T < ∞ and θ i (t) is not constant, then Z T 2 Z T Z T 1 1 1 2 2 cos (θi (τ ))dτ sin (θi (τ ))dτ > cos(θi (τ )) sin(θi (τ ))dτ . T 0 T 0 T 0 When T → ∞ the above strict inequality may not hold. To see that, choose θ i (t) = sin−1 (e−t ). Then, RT lim T1 0 sin2 (θi (τ ))dτ = 0 and T →∞ lim T →∞ 1 T Z T 2 cos (θi (τ ))dτ 0 1 T Z 0 T 2 sin (θi (τ ))dτ = lim T →∞ 1 T Z T cos(θi (τ )) sin(θi (τ ))dτ 0 2 = 0, and thus det(Mav ) = 0. Remark 2: An alternative choice of controller for formation stabilization to a point is P vi (t) = xim (t), m∈Ni i = 1, 2, . . . , n. ω (t) = γ cos(γt), i (12) By applying a time scaling, τ = γt , one can use Theorem 3 and conclude that if the sensor digraph has a globally reachable node, there exists a positive constant γ ∗ such that, for all γ ∗ < γ < ∞, (12) achieves formation stabilization to a point. 19 Remark 3: The above results show that all the wheeled vehicles exponentially converge to a point formation through simple local actions, or we say they achieve an agreement about a common point. Notice that the sensor digraph is required to have just one globally reachable node (as in Fig. 1(c)). So if we treat a beacon placed at the proper location as one member of the group of vehicles and it is the globally reachable node, the local actions of each individual vehicle result in the group gathering at the beacon. Fig. 5 depicts simulation results for formation stabilization to a point of ten wheeled vehicles using the smooth time-varying feedback control law (7) with the choice of k = 1. The initial conditions are randomly produced and the sensor digraph is given in Fig. 6. For example, node 5 is globally reachable, while node 1 is not. PSfrag replacements Fig. 5. Fig. 6. Ten wheeled vehicles gather at a common position. 1 3 5 7 9 2 4 6 8 10 The sensor digraph of a group of ten wheeled vehicles. 20 V. F ORMATION S TABILIZATION TO A L INE We begin this section with the proof of Theorem 4. Proof of Theorem 4: (Sufficiency) By the condition that there are at most two disjoint closed sets of nodes in the sensor digraph, by Lemma 1, either the sensor digraph has a globally reachable node, or there are exactly two disjoint closed sets of nodes in it. In the first case, by Theorem 3, formation stabilization to a point is feasible, which is a special instance of line formation. In the second case, we let V 1 , V2 be the two disjoint closed sets of nodes in the sensor digraph G = (V, E), and let V 3 = V − V1 − V2 . Thus, the induced subgraphs G1 = (V1 , E ∩ (V1 × V1 )), G2 = (V2 , E ∩ (V2 × V2 )) both have a globally reachable node. (To see this point, suppose one of these two induced subgraphs has no globally reachable set. Then by Lemma 1 there are two disjoint closed sets in it and therefore there are three disjoint closed sets in G , a contradiction.) If V3 is empty, then by Theorem 3 each group of vehicles whose indices are in Vi , i = 1, 2 can converge to a point and therefore the whole group of wheeled vehicles form a line. On the other hand, if V3 is not empty, then every node in V3 can reach either V1 or V2 . Without loss of generality, we assume the graph Laplacian has the form L 0 0 11 L = 0 L22 0 L31 L32 L33 , where L11 and L22 are the Laplacian matrices corresponding to the induced subgraph G 1 and G2 . One can easily verify that L33 is a nonsingular M -matrix. By using the controller (7), the overall position dynamics are given by 1 1 1 ż = −kH(θ (t))L11(2) z , ż 2 = −kH(θ 2 (t))L22(2) z 2 , ż 3 = −kH(θ 3 (t)) L 2 , 1 3 33(2) z + L31(2) z + L32(2) z where z i , θ i , i = 1, 2, 3 are the corresponding position vector and orientation vector respectively. From the ∗ ∗ ∗ proof of Theorem 3, that, for all 0 < k < k1 , that there exist positive constants k 1 and k2 such we have x xss1 and for all 0 < k < k2∗ , lim z 2 (t) = 1 ⊗ ss2 with exponential lim z 1 (t) = 1 ⊗ t→∞ t→∞ yss2 yss1 convergence rate. The change of variables ς = L33(2) z 3 + L31(2) z 1 + L32(2) z 2 yields ς˙ = −kL33(2) H(θ 3 (t))ς − kL31(2) H(θ 1 (t))L11(2) z 1 − kL32(2) H(θ 2 (t))L22(2) z 2 . (13) 21 Since L33 is a nonsingular M -matrix, by the same argument as in the proof of Lemma 6, −L 33(2) and −LT33(2) are both H(α, 0) stable. By Theorem 1, there exists a positive constant k 3∗ such that, for all 0 < k < k3∗ , the origin of the nominal system ς˙ = −kL33(2) H(θ 3 (t))ς is globally exponentially stable. Furthermore, notice that the other two terms in (13) both exponentially converge to zero. Hence (13) can be viewed as an exponentially stable system with an exponentially vanishing input, and thus its origin is exponentially stable. Let k ∗ = min{k1∗ , k2∗ , k3∗ }. Hence, for all 0 < k < k∗ , lim z 3 (t) = − L33(2) t→∞ = − L33(2) h −1 −1 L31(2) lim z 1 (t) − L33(2) t→∞ L31(2) · 1 ⊗ = − L−1 33 · L31 · 1 ⊗ i xss1 yss1 xss1 yss1 −1 L32(2) lim z 2 (t) t→∞ − L33(2) −1 − L−1 33 · L32 · 1 ⊗ L32(2) · 1 ⊗ xss2 yss2 xss2 yss2 . −1 · 1 = 0 and so − L−1 33 · L31 · 1 − L33 · L32 · 1 = 1. Hence, all xss1 xss2 and , which means the wheeled vehicles (zi (t))i∈V3 approach a convex combination of yss1 yss2 with indices in V3 eventually move to the line formed by two points (x ss1 , yss1 ) and (xss2 , yss2 ) in the Notice that L31 L32 L33 plane. (Necessity) Suppose by way of contradiction that there are three disjoint closed sets of nodes in the sensor digraph, say V1 , V2 and V3 . Then let the initial conditions of the vehicles in V j , j = 1, 2, 3 be chosen such that zi (0) = zssj , i ∈ Vj . Hence, for each vehicle i in Vj , (∀ m ∈ Ni )zm = zi and so żi = 0. Then three groups of vehicles form a geometric pattern specified by three points z ss1 , zss2 and zss3 . These three points can be arbitrarily set that may not form a line, a contradiction. Theorem 4 has an interesting special case when the two disjoint closed sets of nodes in the sensor digraph both have only one member, say nodes 1 and n. Vehicles 1 and n are called edge leaders. The edge leaders here are not necessarily wheeled vehicles. They can be virtual beacons or landmarks. But the vehicles respond to these edge leaders much like they respond to real neighbor vehicles. The purpose of the edge leaders is to introduce the mission: to direct the vehicle group behavior. We emphasize that 22 the edge leaders are not central coordinators. They do not broadcast instructions. They only play the role of individual vehicles, but cannot sense other vehicles or communicate with them. As for the remaining PSfrag replacements vehicles, i, i = 2, . . . , n − 1, we assume that each agent can sense agents i − 1 and i + 1. This gives the sensor digraph in Fig. 7. It is readily seen that the digraph in Fig. 7 has exactly 2 disjoint closed sets of nodes. We now show that in this special case all vehicles converge to a uniform distribution on the line segment specified by the two edge leaders. 2 1 Fig. 7. n−2 3 n−1 n The sensor digraph for a group of wheeled vehicles with two edge leaders. Theorem 5: Consider a group of n wheeled vehicles with two stationary edge leaders labeled 1 and n. Then, there exists a positive constant k ∗ such that for all 0 < k < k ∗ , the following smooth time-varying feedback control law P vi (t) = k xij (t), Ni = {i − 1, i + 1}, i = 2, . . . , n − 1 j=Ni ω (t) = cos(t), i (14) guarantees that all the vehicles converge to a uniform distribution on the line segment specified by the two edge leaders. Proof: First we observe that the Laplacian of the sensor digraph in Fig. 7 is given by 0 0 0 0 ··· 0 −1 2 −1 0 · · · 0 0 −1 2 −1 · · · 0 L= . .. . .. .. .. .. . . . . . . . 0 · · · 0 −1 2 −1 0 Let z = h z1 z2 · · · zn iT ··· 0 0 0 0 . It follows from Theorem 4 that L(2) z(∞) = 0. Consider the following partition of {1, 2, . . . , n}, {m1 , m2 , . . . , m2n } = {1, 3, 5, . . . , 2n−1, 2, 4, 6, . . . , 2n}. Then the associated permutation matrix P has the unit coordinate vectors e m1 , em2 , . . . , em2n for its columns. Now observe 23 that the matrix P performs the transformation where x = h P T (L ⊗ I2 )P = I2 ⊗ L = x1 L 0 0 L iT and P T z = x y , iT h and y = . Thus, x2 · · · x n y1 y2 · · · y n x(∞) L 0 L 0 P T z(∞) = P 0 = L(2) z(∞) = P y(∞) 0 L 0 L implies Lx(∞) = 0 and Ly(∞) = 0. Also note that 0 1 Ker(L) = span ξ1 = 2 .. . n−1 , ξ2 = n − 2 n−3 , .. . 0 n−1 so x(∞) = α1 ξ1 + α2 ξ2 and y(∞) = β1 ξ1 + β2 ξ2 . Since x1 (∞) = x1 (0), xn (∞) = xn (0) and y1 (∞) = y1 (0), yn (∞) = yn (0), so we solve for α1 = In conclusion, we have x1 (∞) x2 (∞) .. = . xn−1 (∞) xn (∞) x1 (0) (n−2)x1 (0)+xn (0) n−1 .. . x1 (0)+(n−2)xn (0) n−1 xn (0) , xn (0) n−1 , α2 = x1 (0) n−1 y1 (∞) y2 (∞) .. . yn−1 (∞) yn (∞) = and β1 = yn (0) n−1 , y1 (0) (n−2)y1 (0)+yn (0) n−1 .. . y1 (0)+(n−2)yn (0) n−1 yn (0) This shows that all vehicles asymptotically approach a uniform distribution on the line. β2 = y1 (0) n−1 . . Fig. 8 shows the simulation result of four wheeled vehicles and two stationary edge leaders to form a uniform distribution on a line using the control law (14) with the choice of k = 1. VI. F ORMATION S TABILIZATION TO A NY G EOMETRIC PATTERN In this section, we turn our attention to the problem of formation stabilization to any geometric pattern. Following [25], we let Π be a predicate describing a geometric pattern, such as a point, a regular polygon, a line segment, etc. Such a predicate specifies a formation up to translation and rotation. By formation stabilization of a group of n vehicles to Π, we mean that the vehicles (globally exponentially) converge to a distribution satisfying Π. 24 Fig. 8. Formation stabilization to a line of four wheeled vehicles and two edge leaders. We suppose that a group of wheeled vehicles have a common sense of direction, represented by the angle ψ in Fig. 9 . For instance, each vehicle carries a navigation device such as a compass. Alternatively, all vehicles initially agree on their orientation and use it as the common direction. The common direction may not coincide with the positive x-axis of the global frame. Let φ i = θi − ψ (see Fig. 9). We assume PSfrag replacements that vehicle i can measure its own φi . φk ψ k φj ψ φi ψ j i gΣ Fig. 9. A group of wheeled vehicles have a common sense of direction. There are two ways to describe a geometric pattern in the plane. One way is by inter-node distances, dij , as in the rigid formation framework of [21]. The other way is by specifying the position vector, c i , of 25 each node with respect to a common coordinate frame. The two equivalent representations are illustrated in Figure 10(a) and (b) for a square formation. d12 2 2 1 1 d41 d23 d24 3 c4 c3 3 4 d34 4 (a) Fig. 10. c1 c2 d13 (b) An example for a square formation. It is worth noting that, for all (R, b) ∈ SE(2), the vectors ĉ i = Rci + b describe the same geometric formation as the one specified by ci . So given a desired geometric formation pictured by c i , i = 1, . . . , n, our objective is to stabilize the position state z i of each vehicle to ĉi = Rci + b, i = 1, . . . , n for some R and b. To achieve a desired geometric formation characterized by c = [c T1 · · · cTn ]T , we can simply translate the formation vector c into a control offset d = L (2) c so that the forward control velocity is 0 PSfrag whenreplacements the group of vehicles has achieved a formation. We denote the offset for each vehicle by d xi or di = dxi + jdyi . di = d yi As an example, consider the simple sensor digraph shown in Fig. 11(a) and a triangle formation described by ci , i = 1, 2, 3. The corresponding control offsets di , i = 1, 2, 3 are shown in Fig. 11(b). 2 1 d1 c2 c1 d2 2 3 (a) Sensor digraph Fig. 11. An example of a simple triangle formation. c3 1 d3 3 (b) Illustration for ci and di 26 Next, we show that the time-varying control law ) ( h i P vi (t) = k xim (t) , 1 0 R(−φi (t))di + i = 1, 2, . . . , n, m∈Ni ω (t) = cos(t), i where R is a rotation matrix defined by R(φ) = cos(φ) − sin(φ) sin(φ) cos(φ) achieves formation stabilization to Π. (15) , Theorem 6: Let Π be a desired geometric formation described by c. Suppose a group of n wheeled vehicles have a common sense of direction and formation stabilization to a point is feasible. Then there exists a positive constant k ∗ such that for all 0 < k < k ∗ , the smooth time-varying feedback control law (15) with d = L(2) c guarantees global exponential formation stabilization to Π. Proof: Notice that vi = k =k ( ( h 1 0 i R(−φi (t))di + R(ψ)di + xim m∈Ni P (zm − zi ) m∈Ni Thus, we have the following closed-loop system ( P żi = kM (θi (t)) R(ψ)di + X ) ) · ri . ) (zm − zi ) , m∈Ni or in vector form, we have ż = kH(θ(t)) −L(2) z + (In ⊗ R(ψ)) d . By the property of Kronecker product, it follows that (In ⊗ R(ψ))(L ⊗ I2 ) = L ⊗ R(ψ) = (L ⊗ I2 )(In ⊗ R(ψ)). Furthermore, since d = L(2) c, we obtain ż = −kH(θ(t))L(2) {z − (In ⊗ R(ψ))c} . Under the coordinate transformation ς = z − (In ⊗ R(ψ))c, we get ς˙ = −kH(θ(t))L(2) ς. By the proof of Theorem 3, lim ς(t) = 1 ⊗ t→∞ xss yss for some constant position (xss , yss ). Hence, lim zi (t) = R(ψ)ci + t→∞ 27 xss yss , i = 1, . . . , n, which means that the group of vehicles form a geometric formation specified by c. Remark 4: Notice that θi (t) = θi (t0 ) + sin(t), so if a group of n wheeled vehicles achieve an agreement on their initial orientation θi (t0 ) and choose it as their common direction, the control law (15) becomes ) ( P vi (t) = k Re e−j sin(t) di + xim (t) , ω (t) = cos(t), i m∈Ni i = 1, 2, . . . , n. (16) The agreement on their orientation can be implemented by an alignment strategy as shown in [12]. However, the control law (16) is not robust in practice. Fig. 12. Ten wheeled vehicles form a circle formation. As an example, the simulation result for a circle formation of ten wheeled vehicles with the sensor digraph in Fig. 6 is shown in Fig. 12. The circle formation is described by c i = 75e(j The initial conditions are x(0) = [10, 10, 10, 0, 0, 0, 0, −10, −10, −10] T , 2(i−1)π 10 ) , i = 1, . . . , 10. 28 T y(0) = [−5, 0, 5, 10, 3, −3, −10, −5, 0, 5] , h π π π π π π π π π π iT θ(0) = , , , , , , , , , , 6 6 6 6 6 6 6 6 6 6 and the control law (16) is used with the parameters k = 1, d 1 = 61.74 − j107.72, d2 = 78.26 − j25.43, d3 = 91.63 + j66.57, d4 = 35 + j25.43, d5 = j82.29, d6 = −48.37 + j66.57, d7 = −78.26 + j25.43, d8 = −78.26 − j25.43, d9 = −35 − j25.43, d10 = −j82.29. R EFERENCES [1] D. Aeyels and J. Peuteman, “On exponential stability of nonlinear time-varying differential equations,” Automatica, vol. 35, pp. 1091–1100, 1999. [2] H. Ando, Y. Oasa, I. Suzuki, and M. Yamashita, “Distributed memoryless point convergence algorithm for mobile robots with limited visibility,” IEEE Transaction on Robotics and Automation, vol. 15, no. 5, pp. 818–828, 1999. [3] H. Ando, I. Suzuki, and M. Yamashita, “Formation and agreement problems for synchronous mobile robots with limited visibility,” in Proceedings of the 1995 IEEE International Symposium on Intelligence Control, Monterey, CA, 1995, pp. 453–460. [4] J. Bang-Jensen and G. Gutin, Digraphs Theory, Algorithms and Applications. Springer, 2002. [5] A. Berman and R. J. Plemmons, Nonnegative Matrices in the Mathematical Sciences. Philadelphia: Classics in Appl. Math. 9, SIAM, 1994. [6] J. P. Desai, J. P. Ostrowski, and V. Kumar, “Modelling and control of formation of nonholonomic mobile robots,” IEEE Transaction on Robotics and Automation, vol. 17, no. 6, pp. 905–908, 2001. [7] M. Egerstedt and X. Hu, “Formation constrained multi-agent control,” IEEE Transaction on Robotics and Automation, vol. 17, no. 6, pp. 947–951, 2001. [8] L. R. Foulds, Graph Theory Applications. Springer-Verlag, 1992. [9] C. Godsil and G. Royle, Algebraic Graph Theory. Springer, 2001. [10] L. Gurvits and Z. X. Li, “Smooth time-periodic feedback solutions for nonholonomic motion plannning,” in Nonholonomic Motion Planning, Z. X. Li and J. F. Canny, Eds. Kluwer Academic Publishers, 1993, pp. 53–108. α [11] D. Hershkowitz and N. Mashal, “P -matrices and Lyapunov scalar stability,” The Electronic Journal of Linear Algebra, vol. 4, pp. 39–47, 1998. [12] A. Jadbabaie, J. Lin, and A. S. Morse, “Coordination of groups of mobile autonomous agents using nearest neighbor rules,” IEEE Transaction on Automatic Control, vol. 48, no. 6, pp. 988–1001, 2003. [13] E. W. Justh and P. S. Krishnaprasad, “Steering laws and continuum models for planar formations,” in Proceedings of the 42nd IEEE Conference on Decision and Control, Maui, Hawaii USA, December 2003. [14] H. K. Khalil, Nonlinear Systems, 2nd ed. Prentice Hall, 1996. [15] P. Lancaster and M. Tismenetsky, The Theory of Matrices with Applications, 2nd ed. Academic Press, 1985. [16] N. E. Leonard and E. Fiorelli, “Virtual leaders, artificial potentials and coordinated control of groups,” in Proceedings of the 40th IEEE Conference on Decision and Control, Orlando, Florida USA, December 2001, pp. 2968–2973. [17] Z. Lin, M. Broucke, and B. A. Francis, “Local control strategies for groups of mobile autonomous agents,” in Proceedings of the 42nd IEEE Conference on Decision and Control, Maui, Hawaii USA, December 2003. 29 [18] Y. Liu, K. M. Passino, and M. M. Polycarpou, “Stability analysis of m-dimensional asynchronous swarms with a fixed communication topology,” IEEE Trans. Automatic Control, vol. 48, no. 1, pp. 76–95, 2003. [19] J. Marshall, M. Broucke, and B. A. Francis, “A pursuit strategy for wheeled-vehicle formations,” in Proceedings of the 42nd IEEE Conference on Decision and Control, Maui, Hawaii USA, December 2003. [20] R. T. M’Closkey and R. M. Murray, “Nonholonomic systems and exponential convergence: some analysis tools,” in Proceedings of the 32th IEEE Conference on Decision and Control, San Antonlo, Texas USA, December 1993, pp. 943– 948. [21] R. O. Saber and R. M. Murray, “Distributed cooperative control of multiple vehicles formations using structural potential functions,” in 2002 IFAC World Congress, 2002. [22] C. Samson, “Control of chained systems application to path following and time-varying point-stabilization of mobile roobts,” IEEE Transactions on Automatic Control, vol. 40, no. 1, pp. 64–77, 1995. [23] C. Samson and K. Ait-Abderrahim, “Feedback stabilization of a nonholonomic wheeled mobile robot,” in IEEE/RSJ International Workshop on Intelligent Robot and Systems IROS’91, Osaka, Japan, November 1991, pp. 1242–1247. [24] K. Sugihara and I. Suzuki, “Distributed motion coordination of multiple robots,” in Proceedings of IEEE Int. Symp. on Intelligent Control, 1990, pp. 138–143. [25] I. Suzuki and M. Yamashita, “Distributed anonymous mobile robots: formation of geometric patterns,” SIAM J. Comput., vol. 28, no. 4, pp. 1347–1363, 1999. [26] I. A. Wagner and A. M. Bruckstein, “Row straighening via local interactions,” Circuits, Systems, and Signal Processing, vol. 16, no. 3, pp. 287–305, 1997. [27] H. Yamaguchi, “A distributed motion coordination strategy for multiple nonholonomic mobile robots in cooperative hunting operations,” Robotics and Autonomous Systems, vol. 43, pp. 257–282, 2003. [28] H. Yamaguchi and J. W. Burdick, “Time-varying feedback control for nonholonomic mobile robots forming group formations,” in Proceedings of the 37th IEEE Conference on Decision and Control, Tampa, Florida USA, December 1998, pp. 4156–4163.
© Copyright 2026 Paperzz