The k-assignment Polytope, Phylogenetic Trees, and

Linköping Studies in Science and Technology.
Dissertations, No. 1544
The k-assignment Polytope,
Phylogenetic Trees,
and
Permutation Patterns
Jonna Gill
Division of Mathematics and Applied Mathematics
Department of Mathematics
Linköping University, SE–581 83 Linköping, Sweden
Linköping 2013
The k-assignement Polytope, Phylogenetic Trees, and Permutation Patterns
Jonna Gill
Matematiska institutionen
Linköpings universitet
SE-581 83 Linköping, Sweden
[email protected]
Linköping Studies in Science and Technology.
Dissertations, No 1544
ISBN 978-91-7519-510-0
ISSN 0345-7524
http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-98263
Copyright © 2013 Jonna Gill, unless otherwise noted.
Printed by LiU-Tryck, Linköping, Sweden 2013
Abstract
In this thesis three combinatorial problems are studied in four papers.
In Paper 1 we study the structure of the k-assignment polytope, whose vertices are the m × n (0,1)-matrices with exactly k 1:s and at most one 1 in each row
and each column. This is a natural generalisation of the Birkhoff polytope and
many of the known properties of the Birkhoff polytope are generalised. A representation of the faces by certain bipartite graphs is given. This representation is
used to describe the properties of the polytope, such as a complete description
of the cover relation in the face poset of the polytope and an exact expression
for the diameter of its graph. An ear decomposition of these bipartite graphs is
constructed.
In Paper 2 we investigate the topology and combinatorics of a topological
space, called the edge-product space, that is generated by a set of edge-weighted
finite semi-labelled trees. This space arises by multiplying the weights of edges
on paths in trees, and is closely connected to tree-indexed Markov processes
in molecular evolutionary biology. In particular, by considering combinatorial
properties of the Tuffley poset of semi-labelled forests, we show that the edgeproduct space has a regular cell decomposition with face poset equal to the
Tuffley poset.
The elements of the Tuffley poset are called X-forests, where X is a finite
set of labels. A generating function of the X-forests with respect to natural
statistics is studied in Paper 3 and a closed formula is found. In addition, a
closed formula for the corresponding generating function of X-trees is found.
Singularity analysis is used on these formulas to find asymptotics for the number
of components, edges, and unlabelled nodes in X-forests and X-trees as | X | →
∞.
In Paper 4 permutation statistics counting occurrences of patterns are studied. Their expected values on a product of t permutations chosen randomly from
Γ ⊆ Sn , where Γ is a union of conjugacy classes, are considered. Hultman has
described a method for computing such an expected value, denoted EΓ (s, t), of
a statistic s, when Γ is a union of conjugacy classes of Sn . The only prerequisite
is that the mean of s over the conjugacy classes is written as a linear combination of irreducible characters of Sn . Therefore, the main focus of this paper is
to express the means of pattern-counting statistics as such linear combinations.
A procedure for calculating such expressions for statistics counting occurrences
of classical and vincular patterns of length 3 is developed, and is then used to
calculate all these expressions.
iii
Populärvetenskaplig sammanfattning
I denna avhandling, som består av fyra artiklar, studeras tre olika områden inom
diskret matematik.
I den första artikeln studeras en generalisering av en polytop som kallas Birkhoffpolytopen. En polytop är ett geometriskt objekt med hörn och platta sidor,
som t.ex. ett mjölkpaket, en pyramid eller ett A4-papper. Hur en polytop ser ut
bestäms av koordinaterna för dess hörn. En polytops diameter är längsta avståndet mellan två hörn, där avståndet mellan två hörn är det minsta antalet kanter
som förbinder hörnen. Vad en sida i en polytop är beskrivs bäst av ett exempel:
Ett mjölkpaket har åtta 0-dimensionella sidor (hörnen), tolv 1-dimensionella sidor (kanterna), sex 2-dimensionella sidor (det vi vanligtvis kallar sidor), och en
3-dimensionell sida (hela mjölkpaketet). Diametern av ett mjölkpaket är 3.
Birkhoffpolytopen är polytopen vars hörns koordinater i n2 dimensioner ges
av alla n × n-matriser med exakt en etta i varje rad och i varje kolonn, och nollor
i övrigt. Polytopen som studeras här betecknas M (m, n, k), och koordinaterna
i m × n dimensioner för dess hörn ges av alla m × n-matriser med k ettor och
resten nollor, varav högst en etta i varje rad och varje kolonn. Om k = m = n fås
Birkhoffpolytopen.
En graf består av ett antal noder (punkter) och bågar som förbinder dem.
En bipartit graf är en graf där man kan dela upp noderna i två mängder, så
att alla bågar går mellan den ena mängden och den andra. En beskrivning av
M(m, n, k):s sidor som en speciell sorts bipartita grafer ges. Denna beskrivning
används sedan för att härleda ett exakt uttryck för M (m, n, k):s diameter.
I den andra och tredje artikeln studeras en partiellt ordnad mängd (en mängd
där vissa, men inte alla, element går att jämföra). Den kallas Tuffleypomängden,
och den är relaterad till en matematisk modell för hur mutationer i DNA sker.
En graf kallas ett träd om den inte innehåller några cykler (man kan inte gå
runt i cirkel i den). En graf som består av flera träd kallas en skog. En X-skog,
där X är en mängd märken, är en skog där man placerat ut märkena från X på
noderna enligt vissa regler. Tuffleypomängden består av alla X-skogar, och en
X-skog sägs vara mindre än en annan X-skog om den förra kan fås från den
senare genom att ta bort eller dra ihop bågar på ett visst sätt.
Det visas att Tuffleypomängden har något som kallas en rekursiv koatomordning, och att detta medför mycket trevliga egenskaper hos den matematiska modellen för mutationer.
En genererande funktion för X-skogar (som räknar antalet X-skogar med avseende på antalet märken, träd, bågar och noder utan märken) undersöks och
beskrivs som en komplex funktion. Denna funktion analyseras sedan för att ge
t.ex. medelvärde och varians för antalet bågar, noder utan märken och träd för
X-skogar med n märken (då n är stort).
I den fjärde artikeln studeras funktioner som räknar förekomsten av vissa
permutationsmönster. En permutation av talen {1, 2, . . . , n} är en uppräkning av
v
vi
Populärvetenskaplig sammanfattning
talen i någon ordning. En permutation π av {1, 2, . . . , n} kan skrivas π1 π2 . . . πn .
Ett permutationsmönster är en sekvens p1 p2 . . . pk , där k är längden på mönstret,
och mönstret sägs förekomma i permutationen π om det finns en delsekvens
πi1 πi2 . . . πik i π med samma relativa ordning som p1 p2 . . . pk . T.ex. förekommer
mönstret 132 i permutationen 624531, eftersom delsekvensen 243 har samma
relativa ordning som 132 (minst, störst, mittemellan). Totalt sett förekommer
detta mönster två gånger i permutationen (som delsekvenserna 243 och 253).
Man kan dela upp alla permutationer av talen {1, 2, . . . , n} i vissa naturliga
grupper som kallas konjugatklasser. Det finns också en slags multiplikation av
permutationer, vars resultat är en ny permutation. En fråga man kan studera
är följande: Om man slumpmässigt väljer t stycken permutationer från samma
konjugatklass och multiplicerar dem, vad är det förväntade antalet förekomster av ett visst permutationsmönster i resultatet? Den frågan kan besvaras om
ett slags medelvärdesfunktion för antalet förekomster av mönstret skrivs i en
speciell bas.
En metod för att finna sådana uttryck för alla mönster av längd 3 samt för ett
slags generaliserade mönster av längd 3 (där man t.ex. kan kräva att delsekvensen ska börja med π1 ) beskrivs, och alla deras medelvärdesfunktioner skrivs i
den speciella basen.
Acknowledgements
First of all I would like to thank my original supervisor Svante Linusson for his
support and enthusiasm for the research problems. I would also like to thank
my second supervisor Olle Axling, who has taught me a lot about teaching.
Secondly, I would like to thank my present supervisor Jan Snellman for taking over the supervision when Svante Linusson left Linköping University for
KTH Royal Institute of Technology. I would also like to thank my present second supervisor Axel Hultman for all help and support, and especially for his
thorough reading of and many comments on this manuscript.
Thanks also to the Swedish Research Council which has been supporting a
part of my graduate studies.
I would also like to thank all wonderful colleagues at the Department of
Mathematics who have made these 12 years of Ph.D. studies so pleasant.
Finally I would like to thank Jesus Christ for giving me my talent and love
for mathematics, and my family for their massive support during my studies. I
thank my parents and parents-in-law for all their help with babysitting and other
things. I also thank my five children for their patience during my writing and
for making my life so marvellous. At last I thank my beloved husband Johan for
more than it is possible to mention.
vii
Contents
I Introduction
1
Introduction
1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2
Posets . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3
Graphs . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4
Polytopes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
4.1
Basic theory of polytopes . . . . . . . . . . . . . . . . . . . .
4.2
Transportation polytopes and network flow polytopes . . .
4.3
The Birkhoff polytope . . . . . . . . . . . . . . . . . . . . . .
5
The edge-product space and its face poset . . . . . . . . . . . . . .
5.1
Finite CW complexes . . . . . . . . . . . . . . . . . . . . . . .
5.2
The edge-product space of phylogenetic trees . . . . . . . .
5.3
The Tuffley poset . . . . . . . . . . . . . . . . . . . . . . . . .
6
The Lambert W function . . . . . . . . . . . . . . . . . . . . . . . . .
7
Permutations . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
7.1
The group Sn of permutations . . . . . . . . . . . . . . . . .
7.2
Representations of Sn and their characters . . . . . . . . . .
7.3
Permutation statistics . . . . . . . . . . . . . . . . . . . . . .
8
Overview of the papers . . . . . . . . . . . . . . . . . . . . . . . . .
Paper 1: The k-assignment polytope . . . . . . . . . . . . . . . . . .
Paper 2: A regular decomposition of the edge-product space of
phylogenetic trees . . . . . . . . . . . . . . . . . . . . . . . .
Paper 3: A generating function for X-forests . . . . . . . . . . . . .
Paper 4: Pattern containment in random permutations . . . . . . .
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
ix
3
3
3
5
7
7
8
9
10
10
11
12
13
13
13
14
15
17
17
17
18
19
20
x
Contents
II Papers
1 The k-assignment polytope
1
Introduction . . . . . . . . . . . . . . . . . . . . . . .
2
Some basic properties of the k-assignment polytope
3
Description of the face poset . . . . . . . . . . . . .
4
The diameter of M (m, n, k) . . . . . . . . . . . . . .
5
Ear decomposition . . . . . . . . . . . . . . . . . . .
Appendix: Proof of Lemma 5.8 . . . . . . . . . . . . . . .
References . . . . . . . . . . . . . . . . . . . . . . . . . . .
23
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
25
27
28
30
36
43
45
48
2 A regular decomposition of the edge-product space of phylogenetic
trees
1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2
Trees, forests and the Tuffley poset . . . . . . . . . . . . . . . . . .
3
Recursive coatom orderings . . . . . . . . . . . . . . . . . . . . . . .
3.1
Preliminaries and definitions . . . . . . . . . . . . . . . . . .
3.2
Outline of proof of Theorem 3.1 . . . . . . . . . . . . . . . .
3.3
There is a recursive coatom ordering for [0̂, Γ] . . . . . . . .
3.4
An example of a coatom ordering satisfying (V3) . . . . . .
4
The edge-product space is a regular cell complex . . . . . . . . . .
5
Proof of some combinatorial lemmas . . . . . . . . . . . . . . . . . .
5.1
Reformulation of (V1) with implications . . . . . . . . . . .
5.2
Common elements of [0̂, α1 ] and [0̂, α2 ] . . . . . . . . . . . .
5.3
A and B are compatible with (V3) . . . . . . . . . . . . . . .
5.4
The order of the coatoms near two vertices . . . . . . . . . .
Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Supplement . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
51
54
55
57
58
60
60
62
62
65
66
66
67
71
71
73
77
3 A generating function for X-forests
1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . .
2
A generating function for X-forests . . . . . . . . . . . .
3
Recurrence relations for the coefficients . . . . . . . . . .
4
The generating functions . . . . . . . . . . . . . . . . . . .
5
Statistics . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5.1
Statistics of generating functions . . . . . . . . . .
5.2
Preliminaries on singularity analysis . . . . . . .
5.3
Singular expansions of derivatives of F and T . .
5.4
Asymptotic statistics of the generating functions
Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
A.1
Computation of the constants p j in Lemma 5.11 .
A.2
The functions f used in Section 5.3 . . . . . . . .
79
81
83
85
87
89
89
90
93
95
98
98
99
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
xi
Contents
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101
4 Pattern containment in random permutations
1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2
Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3
Mean statistics of 3-patterns . . . . . . . . . . . . . . . . . . . .
4
A procedure for calculating the means of vincular 3-patterns
5
Expected values . . . . . . . . . . . . . . . . . . . . . . . . . . .
Appendix . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
103
105
107
108
114
120
122
126
Part I
Introduction
1
Introduction
1 Introduction
This thesis consists of four papers.
First some basic theory about the subjects in the papers will be introduced,
and then an overview of the papers follows.
2 Posets
In all the following it will be assumed that all posets are finite. The notations
and definitions in this section can be found in e.g. [26].
Definition 2.1. A partially ordered set (or poset) is a set S equipped with a relation
≤, satisfying the following:
(i) For all x ∈ S, x ≤ x. (reflexivity)
(ii) If x ≤ y and y ≤ z, then x ≤ z. (transitivity)
(iii) If x ≤ y and y ≤ x, then x = y. (antisymmetry)
That x ≤ y and x ̸= y is denoted x < y. The element y covers x if x < y and
there is no z ∈ S such that x < z < y. For elements x, y ∈ S with x ≤ y, the
interval between x and y is [ x, y] := {z ∈ S : x ≤ z ≤ y}.
The Hasse diagram of a finite poset S is the graph whose nodes are the elements of S, whose edges are the cover relations, and such that if x < y then the
node y is drawn above x. In Figures 1 and 2, the Hasse diagrams of posets with
different properties are drawn.
There is a special property of posets, sometimes called the diamond property,
sometimes called thinness. A poset has this property if the interval [ x, y] has
3
4
Introduction
exactly four elements for each pair x, y ∈ S such that there exists a z ∈ S that
covers x and is covered by y.
A poset S is bounded if there are a unique minimal element 0̂ ∈ S and a
unique maximal element 1̂ ∈ S such that 0̂ ≤ x ≤ 1̂ for all x ∈ S. It is possible to
add an artificial 0̂ and/or 1̂ if needed.
Figure 1: Pure poset with the diamond property, bounded poset, graded poset.
A chain C ⊆ S is a sequence x0 < x1 < · · · < xn . The length of the chain is
ℓ(C ) = n. The length of a poset S is ℓ(S) := max{ℓ(C ) : C is a chain of S}. A
chain in S is maximal if no element in S can be added to the chain. The poset S is
pure if all maximal chains in S have the same length, and graded if it in addition
has a 0̂ and a 1̂. In a pure poset S there is a unique integer-valued rank function
r on S such that r ( x ), the rank of x, is 0 if x is a minimal element of S, and
r (y) = r ( x ) + 1 if y covers x in S.
An upper bound of x and y in S is an element z ∈ S such that x, y ≤ z. A lower
bound of x and y is an element w ∈ S such that w ≤ x, y.
Definition 2.2. A poset is a lattice if every two elements x, y ∈ S have a unique
minimal upper bound, called the join x ∨ y, and a unique maximal lower bound,
called the meet x ∧ y.
It follows that all (finite) lattices must have a 0̂ and a 1̂. If S is a lattice, then
the minimal elements of S ∖ 0̂ are called atoms, and the maximal elements of
S ∖ 1̂ are called coatoms.
Figure 2: A graded lattice, a non-pure lattice, and a poset which is not a lattice.
3 Graphs
5
Definition 2.3. A graded poset S is said to admit a recursive coatom ordering if the
length of S is 1 or if the length of S is greater than 1 and there is an ordering
α1 , . . . , αt of its coatoms that satisfies the following two conditions:
(i) For all i < j and γ < αi , α j there is a k < j and an element β such that β is
covered by αk and α j and γ ≤ β.
(ii) For all j = 1, . . . , t, the interval [0̂, α j ] admits a recursive coatom ordering in
which the coatoms that come first in the ordering are those that are covered
by some αk where k < j.
Recursive coatom orderings and properties of posets having such orderings
are described in [6, Chapter 4.7].
3 Graphs
The nodes in a graph are often called vertices, but here they will be called nodes
to avoid confusion, since polytopes (described in next section) have another kind
of vertices. Most of the theory in this section can be found in [21].
The nodes in a graph G connected by an edge x are called the endpoints of
x, and they are said to be incident to x. Two edges sharing a node are called
adjacent. A path in G is a sequence of distinct adjacent edges which connect a
sequence of distinct nodes. If every two nodes of a graph G are joined by a path,
then G is connected. A cycle is a path with at least two edges, together with an
edge joining the first and the last node in the path.
Definition 3.1. The diameter of a graph G will be denoted δ( G ) and is defined as
the smallest number δ such that any two nodes in G can be connected by a path
with at most δ edges. If G is not connected, the diameter is defined to be ∞.
The sets of vertices and edges of a graph G are often denoted V ( G ) and E( G ),
respectively. The graph H is a subgraph of G if V ( H ) ⊆ V ( G ) and E( H ) ⊆ E( G ).
An edge-induced subgraph of a graph G consists of some of the edges of G and
their endpoints. The degree deg(v) of a vertex v ∈ V ( G ) is the number of edges
incident to v. A weighted graph associates a weight (usually a real number) with
every edge in the graph.
Definition 3.2. A graph without cycles is called a forest. If it is also connected,
it is called a tree. Vertices with degree 1 are called leaves. A binary tree is a tree
where all vertices except the leaves have degree 3.
Definition 3.3. A matching in a graph is a subset of its edges such that no two
edges share an endpoint. A perfect matching is a matching that matches all nodes
of the graph, i.e. all nodes are incident to exactly one edge.
Recall that a bipartite graph is a graph whose nodes can be divided into two
disjoint sets V1 and V2 , such that every edge connects a node in V1 to one in
6
Introduction
Figure 3: A forest, a graph not being a forest, a tree, and a binary tree.
V2 . The complete bipartite graph Km,n is the bipartite graph with |V1 | = m and
|V2 | = n, where there is an edge (v1 , v2 ) for each pair of nodes v1 ∈ V1 and
v2 ∈ V2 .
Definition 3.4. A bipartite graph G is said to be elementary if each edge of G lies
in some perfect matching of G.
Figure 4: Matching, perfect matching, elementary graph, not elementary graph (the
highlighted edge does not lie in any perfect matching).
In [21] an elementary graph is required to be connected, but that is not necessary here. Each component of an elementary graph here will be elementary
according to the original definition.
Now the concept of ear decompositions will be described. The following notation is used: If E1 and E2 are subsets of E( G ), then E1 + E2 denotes the subgraph
of G induced by E1 ∪ E2 .
Definition 3.5. Let x be an edge. Join its endpoints by a path E1 (not containing
x) of odd length. Then a sequence of bipartite graphs can be constructed as
follows: If Gs−1 = x + E1 + · · · + Es−1 has already been constructed, add a new
ear Es by picking any two nodes that are connected by an odd path in Gs−1
and joining them by an odd path Es having no other node (and no edge) in
common with Gs−1 . The decomposition Gs = x + E1 + · · · + Es will be called an
ear decomposition of Gs , and Ei will be called an ear (i = 1, . . . , s).
Theorem 3.6. A bipartite graph G is elementary if and only if each component of G has
an ear decomposition.
7
4 Polytopes
4 Polytopes
The following is mainly from [11] and [27]. Only convex polytopes will be
considered, so the word convex will often be omitted.
4.1 Basic theory of polytopes
Definition 4.1. A (convex) polytope is a subset P ⊆ Rd that is the convex hull of a
finite point set,
P = Conv(V1 , . . . , Vn )
for some V1 , . . . , Vn ∈ Rd
or, equivalently, a subset P ⊆ Rd that is a bounded intersection of half-spaces,
P = {x ∈ Rd : ci · x ≤ zi , i = 1, . . . , m}
for some ci ∈ Rd , zi ∈ R.
That P is bounded means that P does not contain a ray {x + ty : t ≥ 0} for
any y ̸= 0.
Points, lines, planes, and so forth, are affine subspaces of Rd (they are not
required to include the origin), also called flats. The affine hull of a finite point
set is the intersection of all affine flats that contain the set. The dimension of a
polytope is the dimension of its affine hull.
Definition 4.2. Let P ⊆ Rd be a convex polytope. A face of P is any set of the
form
F = P ∩ { x ∈ Rd : c · x = c0 }
where c · x ≤ c0 for all x ∈ P.
Since 0 · x ≤ 0 for all x ∈ P, P is a face of itself. The other faces, satisfying
F ⊂ P, are called proper faces. The empty set, ∅, is always a face of P since
0 · x ≤ 1 for all x ∈ P.
The faces of dimensions 0, 1, dim( P) − 2, and dim( P) − 1 are called vertices,
edges, ridges, and facets, respectively. The set of vertices is denoted vert( P).
Theorem 4.3. Every polytope is the convex hull of its vertices. If a polytope can be
written as the convex hull of a finite point set, then the set contains all the vertices of the
polytope.
Theorem 4.4. Let P ⊆ Rd be a polytope, and let V := vert( P). Suppose that F is a
face of P. Then the following statements are true.
(i) The face F is a polytope, with vert( F ) = F ∩ V.
(ii) Every intersection of faces of P is a face of P.
(iii) The faces of F are exactly the faces of P that are contained in F.
Definition 4.5. The face poset of a convex polytope P is the poset L( P) of all faces
of P, partially ordered by inclusion (i.e. the relation ’≤’ is ⊆).
8
Introduction
P
1245
3
123
135
234
345
P
5
12
1
13
4
5
2
1
15
24
23
2
3
4
34
35
4
45
5
3
1
2
Ø
Figure 5: A polytope P, its face lattice L( P), and its graph G ( P).
Theorem 4.6. Let P be a convex polytope. The face poset L( P) is a graded lattice of
length dim( P) + 1, with rank function r ( F ) = dim( F ) + 1. (Hence L( P) is often
called the face lattice of P.) The face lattice L( P) has the diamond property.
Definition 4.7. Let P be a convex polytope. The graph of P, denoted G ( P), is
the graph formed by the vertices and the edges of P. The diameter of P is the
diameter of its graph G ( P) and will be denoted δ( P).
4.2 Transportation polytopes and network flow polytopes
The transportation problem is a classic problem in optimisation. Suppose that a
product is to be transported from m warehouses to n customers. The i:th warehouse produces ri > 0 units of the product per time unit, and the j:th customer
requires a j > 0 units of the product per time unit. The cost for transporting one
unit of the product from the i:th warehouse to the j:th customer is cij , and the
number of units transported is xij . The goal is to minimise the total transportation cost. Hence the transportation problem of order m × n is to minimise the
linear function
m
n
∑ ∑ cij xij
i =1 j =1
subject to the conditions
n
∑ xij = ri ,
i = 1, . . . , m,
j =1
m
∑ xij = a j ,
j = 1, . . . , n,
i =1
xij ≥ 0,
i = 1, . . . , m, j = 1, . . . , n.
The set of matrices ( xij )m×n satisfying these conditions is called the transportation
polytope, usually denoted T (r, a).
9
4 Polytopes
Network flows are described in [1]. Let G = ( N, A) be a directed network
(directed graph) defined by a set N of nodes and a set A of directed edges. Each
edge (i, j) ∈ A has a capacity uij that denotes the maximum flow on the edge,
and a lower bound ℓij that denotes the minimum flow on the edge. Each node
has a number bi representing its supply/demand. An example of a graph of a
network flow is shown in Figure 6. The variable xij denotes the flow on the edge
(i, j) ∈ A. The set of possible solutions to
∑
{ j:(i,j)∈ A}
xij −
∑
{ j:( j,i )∈ A}
for all i ∈ N,
x ji = bi
ℓij ≤ xij ≤ uij
for all (i, j) ∈ A,
is a network flow polytope.
When minimising or maximising a linear function of the flows on the edges,
an optimal solution can always be found in a vertex of the network flow polytope.
b3
3
4
(l45 ,u45)
b4
2 −a2
r3
3
(0, )
8
(l24 ,u24)
1 −a1
8
b2
5 b5
(0, )
8
2
r2
(0, )
(0, )
2
(0, )
(0, )
(l35 ,u35)
(l43 ,u43)
(l34 ,u34)
1
8
(l12 ,u12)
r1
8
1
(l13 ,u13)
8
b1
Figure 6: A graph of a network flow, and the transportation problem of order 3 × 2
represented as a network flow.
It is rather obvious that all transportation polytopes are network flow polytopes. The m warehouses are represented by m supply nodes with bi = ri and the
n customers are represented by n demand nodes with b j = − a j . Each warehouse
has distribution channels to each customer, represented by directed edges from
the supply nodes to the demand nodes. All lower bounds are 0 and all capacities
are infinite. An example is given in Figure 6.
4.3 The Birkhoff polytope
The Birkhoff polytope has many names, such as the permutation polytope, the
assignment polytope, the polytope of doubly stochastic matrices, the perfect matching
polytope, and so forth. It can be defined using permutations. (Permutations and
the symmetric group Sd are described in Section 7.1.)
10
Introduction
Definition 4.8. For every permutation σ ∈ Sd , construct a d × d matrix X σ by
{
1 if σ (i ) = j
σ
Xij =
0 otherwise.
The matrices X σ are the 0/1-matrices with exactly one 1 in each row and exactly
one 1 in each column. They can be seen as 0/1-vectors in Rd×d , and their convex
hull forms a 0/1-polytope (a polytope where all vertex coordinates are 0 or 1)
called the Birkhoff polytope:
Bd := Conv{ X σ : σ ∈ Sd } ⊆ Rd×d .
The Birkhoff polytope Bd has d! vertices, d2 facets, and dimension (d − 1)2 .
Two vertices X σ and X π are the vertices of an edge if and only if the permutation
σ−1 π has exactly one cycle of length greater than 1. The diameter δ( Bd ) is 1 if
d ≤ 3, and 2 if d ≥ 4. The points in Bd are precisely
{
X∈R
d×d
: xij ≥ 0 for all i, j,
d
d
i =1
j =1
∑ xij = 1 for all j, ∑ xij = 1 for all i.
}
Hence the Birkhoff polytope Bd is the transportation polytope T (r, a) where
m = n = d, all ri = 1, and all a j = 1.
In [4], the following bijection between the faces of Bd and the elementary
graphs with 2d nodes is given. Every vertex V of Bd corresponds to a perfect
matching where the edge (i, j) is in the matching if and only if vij = 1. A face
of Bd corresponds to the elementary graph G that is the union of the perfect
matchings corresponding to the vertices of the face. If the face corresponding to
an elementary graph G is denoted F B ( G ), then the vertices of F B ( G ) are exactly
the vertices that correspond to all perfect matchings P such that P ⊆ G. In that
way the face poset of Bd is isomorphic to the lattice of all elementary subgraphs
of Kd,d ordered by inclusion.
Remember that all components of elementary graphs have ear decompositions. The following relationship between the dimension of a face in Bd and ear
decompositions of the corresponding graph is proved in [4].
Theorem 4.9. If G is an elementary bipartite graph, then the total number of ears in
ear decompositions of all the components of G is equal to the dimension of F B ( G ).
For more information about the Birkhoff polytope, see e.g. [4], [7], [8], and
[11].
5 The edge-product space and its face poset
5.1 Finite CW complexes
The following definitions can be found in any book on point set topology and
[5].
11
5 The edge-product space and its face poset
The closed d-ball Bd is defined to be the set { x ∈ Rd : | x | ≤ 1} (where | · | is
the standard norm in Rd ).
A topological space Y is a Hausdorff space if, for each x, y ∈ Y such that x ̸= y
there are disjoint open sets U, V with x ∈ U and y ∈ V.
If f : X → Y is bijective and f and f −1 are both continuous, f is called a
homeomorphism, and X and Y are said to be homeomorphic.
Let Y be a Hausdorff space. A subset σ is called an open d-cell if there exists
a continuous mapping ψ : Bd → Y whose restriction to the interior of the d-ball
is a homeomorphism ψ : Int Bd → σ. This defines the dimension dim σ = d
uniquely. The closure σ is the corresponding closed cell. In fact, σ = ψ( Bd ). Let
δ(σ ) denote the boundary σ ∖ σ.
Definition 5.1. Suppose that there is a finite collection C = {σα : α ∈ A}
of disjoint open cells whose union is Y, with corresponding maps ψα . The
space Y is a finite CW complex and the collection C is a cell decomposition of Y if
δ(σα ) ⊆ C <dim σα (the union of all cells in C of dimension less than dim σα ) for all
α ∈ A.
The face poset of a finite CW complex Y is the collection of closed cells σα
partially ordered by inclusion.
If each mapping ψα : Bdim σα → Y can be chosen to be a homeomorphism on
all of Bdim σα , then C is a regular cell decomposition of Y. An important property
of a regular CW complex Y is that it is homeomorphic to the so-called geometric
realisation of its face poset. This means that the topological properties of Y are
given by the combinatorial properties of its face poset.
5.2 The edge-product space of phylogenetic trees
The edge-product space of phylogenetic trees is defined in [22], as follows:
Take a finite set X. The binary trees T with the elements of X as their leaves
are all the possible “binary phylogenetic trees” for X. See [24]. If these trees are
given edge weights in the interval
( )[0, 1], they give rise to a space E ( X ) which
will now be described. (The set X2 denotes the set of all 2-element subsets of
X.)
Definition 5.2. Let λ be a map from E( T ) to [0, 1]. Define a new map p(T,λ) from
(X)
to [0, 1] by
2
p(T,λ) ( x, y) =
∏ λ ( e ),
e∈ P( T;x,y)
where P( T; x, y) is the set of edges in the path in T from x to y.
X
Let E ( X, T ) ⊂ [0, 1]( 2 ) denote the image of the map
X
Λ T : [0, 1] E(T ) → [0, 1]( 2 ) , λ 7→ p(T,λ)
X
and let E ( X ) denote the union of the subspaces E ( X, T ) of [0, 1]( 2 ) over all binary
trees T with X as their set of leaves. Then E ( X ) is called the edge-product space
for trees on X.
12
Introduction
In [22] it was shown that E ( X ) is a finite CW complex, and that the Tuffley
poset S( X ) is isomorphic to the face poset of E ( X ).
5.3 The Tuffley poset
The elements in the Tuffley poset are called X-forests. They and their partial
order relation will be defined here. More details about X-trees, X-forests and
the Tuffley poset can be found in [22] and [24].
Definition 5.3. An X-tree T is a pair ( T; ϕ), where T is a tree and ϕ : X → V ( T )
is a map with the property that all vertices of T of degree at most two belong to
ϕ( X ). The vertices in V ( T ) ∖ ϕ( X ) are called unlabelled.
Definition 5.4. An X-forest is a collection F = {( A, T A ) : A ∈ µ} where µ is a
set partition of X (a collection of disjoint subsets of X whose union is X) and T A
is an A-tree for each A ∈ µ.
4
2
1,3,8,11
5
1
6
4,7,10
10
5
7
9
11
6
2, 9
8
3
Figure 7: An X-tree and an X-forest, where X = {1, 2, . . . , 11}.
To remove an edge e = (u, v) from an X-forest T = ( T; ϕ) and identify u and
v, labelling the new vertex ϕ−1 (u) ∪ ϕ−1 (v), is called a contraction of the edge e.
Let S( X ) denote the set of X-forests. A partial order relation on S( X ) is
defined by F2 ≤ F1 if F2 can be obtained from F1 by contracting certain edges,
and deleting certain other edges, with any resulting unlabelled vertices of degree
2 being suppressed. The poset S( X ) is called the Tuffley poset on X.
Now the cell decomposition of E ( X ) given in [22] will be described. To
an X-tree T , associate the closed ‘cube’ B(T ) = [0, 1] E(T ) and the open ‘cube’
(
)
Int B(T ) = (0, 1) E(T ) . Then for an X-forest F = {( A, T A ) : A ∈ µ}, define
(
)
(
)
B(F ) = ∏ A∈µ B(T A ), so that Int B(F ) = ∏ A∈µ Int B(T A ) . The sets B(F )
(
)
and Int B(F ) are homeomorphic to a closed ball and an open ball, respectively,
of dimension ∑ A∈µ | E(T A )| (this quantity will be called the dimension of F ).
Given
( )an X-tree T = ( T; ϕ) and a map λ : E( T ) → [0, 1], define the map
p(T ,λ) : X2 → [0, 1] by
p(T ,λ) ( x, y) =
∏
e∈ P( T;ϕ( x ),ϕ(y))
λ ( e ),
(
)
where P T; ϕ( x ), ϕ(y) is the set of edges in the path in T from the node labelled
x to the node labelled y. If ϕ( x ) = ϕ(y), then p(T ,λ) ( x, y) := 1.
13
6 The Lambert W function
For an X-forest F = {( A, T A ) : A ∈ µ} and a map λ = (λ A , A ∈ µ), let
X
ψF : B(F ) → [0, 1]( 2 ) be defined by
{
p(T A ,λ A ) ( x, y)
ψF (λ)( x, y) =
0
if ∃ A ∈ µ such that x, y ∈ A,
otherwise.
The edge-product space E ( X ) is a CW complex with cell decomposition
{ (
)
}
ψF Int(B(F )) } : F ∈ S( X ) .
{ (
)
}
The face poset ψF B(F ) } : F ∈ S( X ) ordered by inclusion is isomorphic
to the Tuffley poset.
6 The Lambert W function
Definition 6.1. The Lambert W function (W (ζ )) is defined by W (ζ )eW (ζ ) = ζ.
There are several solutions to this equation, and they are the different branches
of the Lambert W function.
The derivative is given by
∂
∂ζ W ( ζ )
= (
W (ζ )
ζ 1 +W ( ζ )
) if ζ ̸= 0. The principal branch
W0 (ζ ) of the Lambert W function is the only branch defined at zero. It is analytic
in the whole complex plane except in the real interval ] − ∞, − 1e ], it is defined at
− 1e and real on the interval [− 1e , ∞[, and has derivative 1 at zero.
The Lambert W function is related to a generating function of trees. Let tn
be the number of rooted trees on n labelled nodes. The exponential generating
zn
function is T (z) = ∑∞
n=1 tn n! . It is known that the function T ( z ) = −W (− z ) and
that tn = nn−1 .
More about the Lambert W function can be found in [9] and [10].
7 Permutations
Most of the theory in this section can be found in [23] and [26].
7.1 The group Sn of permutations
Definition 7.1. A permutation of the set [n] := {1, 2, . . . , n} is a linear ordering
π1 π2 . . . πn of the elements of [n]. The expression π1 π2 . . . πn is called a word,
and the elements πi are consequently called letters. The permutation π can also
be seen as a bijective function π : [n] → [n] given by π (i ) = πi .
The inverse π −1 of a permutation π = π1 π2 . . . πn is given by π −1 (πi ) = i,
and the product of two permutations π and τ is defined to be the composition of
them as functions — that is (πτ )(i ) = π (τ (i )).
14
Introduction
The permutations of [n] with the operation multiplication form a group. This
group is called the symmetric group and will be denoted Sn .
Permutations can also be written in cycle notation. If (i1 i2 i3 . . . ik ) is a cycle
of π, then π (i1 ) = i2 , π (i2 ) = i3 , . . . , π (ik−1 ) = ik , and π (ik ) = i1 . Obviously,
this cycle can be regarded as identical to the cycle (i2 i3 . . . ik i1 ). It is easy to
see that every element of [n] appears in a unique cycle of π.
As an example, if π ∈ S7 is written as the word 3 2 1 7 5 4 6, then it can be
written as (2)(6 4 7)(5)(3 1) in cycle notation.
Definition 7.2. A partition of a positive integer n is a sequence of positive integers
λ = (λ1 , λ2 , . . . , λk ) such that ∑kj=1 λ j = n and λ1 ≥ λ2 ≥ · · · ≥ λk . That λ is a
partition of n is denoted λ ⊢ n.
The sequence of the lengths of all cycles in π in weakly decreasing order is
called the cycle type of π. Hence the cycle types of permutations in Sn are the
partitions of n. The permutation in the example above has cycle type (3, 2, 1, 1).
Two permutations are conjugate if and only if they have the same cycle
type.
Suppose that π,
(
) σ ∈ Sn and that (i1 i2 . . . ik ) is( a cycle
) of (π. Then
)
σ (i1 ) σ (i2 ) . . . σ (ik ) is a cycle of σπσ−1 , since (σπσ−1 ) σ(i1 ) = σ π (i1 ) =
σ (i2 ) and so on. Thus there is a bijection between partitions of n and conjugacy
classes of Sn .
Definition 7.3. A class function on Sn is a function from Sn to C that is constant
on conjugacy classes. A class function can also be seen as a function from the
partitions of n to C.
7.2 Representations of Sn and their characters
Now the representations of Sn and their characters will be briefly described. For
the full details, see [23]. Let GLd denote all invertible d × d matrices of complex
numbers, and let id denote the identity permutation, i.e. id = 1 2 . . . n.
Definition 7.4. A (matrix) representation of Sn is a map ρ : Sn → GLd such that
ρ(id) = Id (the identity matrix), and ρ(πτ ) = ρ(π )ρ(τ ) for all π, τ ∈ Sn . The
character of the representation ρ is the map χ : Sn → C which is defined by
χ(π ) = tr ρ(π ), where tr denotes the trace of a matrix.
Two matrix representations ρ1 and ρ2 of Sn are equivalent if and only if there
exists a fixed matrix T such that ρ2 (π ) = Tρ1 (π ) T −1 for all π ∈ Sn . In that case
their characters are identical.
It is easy to see that all characters of representations of Sn are class functions,
since χ(σπσ−1 ) = tr ρ(σπσ−1 ) = tr ρ(σ )ρ(π )ρ(σ)−1 = tr ρ(π ) = χ(π ).
Some of the representations of Sn are called irreducible representations (see
for example [23] for the exact definition). All the other representations, which
are said to be reducible, can be constructed using the irreducible representations
as building blocks. The characters of the irreducible representations are called
irreducible characters.
15
7 Permutations
There are as many irreducible representations of Sn as there are conjugacy
classes, and there is a standard bijection between them. The irreducible representation corresponding to the conjugacy class with cycle type λ will be denoted
ρλ , and the character of that representation will be denoted χλ .
Definition 7.5. Let f and g be any two functions from Sn to C. The inner product
of f and g is
1
f (π ) g(π )∗ ,
⟨ f , g⟩ =
n! π∑
∈S
n
where g(π )∗ is the complex conjugate of g(π ).
Theorem 7.6. The irreducible characters of Sn form an orthonormal basis for the space
of all class functions on Sn , with respect to the inner product defined above.
7.3 Permutation statistics
There are many interesting statistics on permutations, and some of them will be
described here. See for example [3].
Definition 7.7. A function s : Sn → N is called a permutation statistic.
The following definition from [18] constructs a class function s from any
permutation statistic s. If s is a class function, then s = s.
Definition 7.8. The mean statistic s is the class function which computes the mean
of s over conjugacy classes. If Cλ is the conjugacy class with cycle type λ, then
s(λ) =
1
s ( π ).
|Cλ | π∑
∈C
λ
By Theorem 7.6, every mean statistic s can be written as a linear combination
of irreducible characters.
Here follow some examples of common permutation statistics, and their
means will be expressed in the basis of irreducible characters. Suppose that
π = π1 π2 . . . πn is a permutation in Sn :
• The index i is a descent of π if πi > πi+1 . The permutation statistic counting
the number of descents of π is often denoted des(π ). The mean can be
1 (n)
written as des = n−
− n1 χ(n−1,1) − n1 χ(n−2,1,1) (see [16]).
2 χ
• The index i is an ascent of π if πi < πi+1 . The number of ascents of
π, asc(π ), is given by n − 1 − des(π ) = (n − 1)χ(n) (π ) − des(π ). Hence
1 (n)
asc = n−
+ n1 χ(n−1,1) + n1 χ(n−2,1,1) .
2 χ
• The pair (πi , π j ) is an inversion of π if i < j and πi > π j . The mean of the
number of inversions is inv =
[18]).
n ( n −1) ( n )
χ
4
−
n+1 (n−1,1)
6 χ
− 61 χ(n−2,1,1) (see
16
Introduction
• A fixed point of π is an index i such that πi = i. The number of fixed points
of π is the number of 1-cycles, so this is a class function and can be written
as χ(n) + χ(n−1,1) (see [19]).
Definition 7.9. A permutation π that swaps two elements is called a transposition. This means that π (i ) = i for all i ∈ [n] except two, call them i1 and i2 , for
which π (i1 ) = i2 and π (i2 ) = i1 . If i2 and i1 are adjacent, then π is called an
adjacent transposition.
The set of all transpositions is the conjugacy class with cycle type (2, 1, . . . , 1).
Definition 7.10. An occurrence of a (classical) pattern ϕ = ϕ1 ϕ2 . . . ϕk ∈ Sk in a
permutation π = π1 π2 . . . πn ∈ Sn is a subsequence in π of length k whose
letters are in the same relative order as those in ϕ.
For example, an occurrence of the classical pattern 123 in π ∈ Sn is a subsequence πi1 πi2 πi3 , where i1 < i2 < i3 , such that πi1 < πi2 < πi3 .
A generalisation of permutation patterns was described in [2]. Such patterns
are now called vincular patterns and are defined as follows:
Definition 7.11. A vincular pattern ϕ is written as a permutation in Sk enclosed by
brackets which may have dashes between adjacent letters. If two adjacent letters
are not separated by a dash, then the corresponding letters in an occurrence
of ϕ in π ∈ Sn must be adjacent. If ϕ begins with a square bracket then any
occurrence of ϕ in π must begin with π1 , and if ϕ ends with a square bracket
then any occurrence of ϕ in π must end with πn .
As an example, an occurrence of the vincular pattern [1-23) in π ∈ Sn is a
subsequence πi1 πi2 πi3 , where i1 < i2 < i3 , i1 = 1, and i3 = i2 + 1, such that
π i1 < π i2 < π i3 .
Let the number of occurrences of a pattern ϕ in π be denoted patϕ (π ). Many
permutation statistics can be written as sums of statistics counting occurrences
of vincular patterns. Consider for example the following statistic: A letter πi in
a permutation π is a peak if πi−1 < πi > πi+1 . If peak(π ) is the number of peaks
in π, it is easy to see that peak(π ) = pat(132) (π ) + pat(231) (π ).
Definition 7.12. If a permutation statistic s has the same distribution on Sn as
inv, then s is called Mahonian. That is, s is Mahonian if for all integers n ≥ 1
and k ≥ 0 the number of permutations π ∈ Sn with inv(π ) = k is equal to the
number of permutations τ ∈ Sn with s(τ ) = k.
A pattern function is a linear combination of statistics that count occurrences
of patterns. If each of these patterns have length at most d, then the pattern
function is called a d-function.
In [2] it is shown that many of the known Mahonian permutation statistics
are pattern functions, and all Mahonian 3-functions (up to some simple equivalences) are listed as such linear combinations.
8 Overview of the papers
17
8 Overview of the papers
Paper 1: The k-assignment polytope
The first paper is a study of a polytope called the k-assignment polytope. This
polytope is a generalisation of the well-known Birkhoff polytope Bn which has
the n × n permutation matrices as its vertices.
A natural generalisation of permutation matrices occurring both in optimisation and in theoretical combinatorics comes from k-assignments. A k-assignment
is k entries in a matrix that are required to be in different rows and columns. This
can also be described as placing k non-attacking rooks on a chess-board. The
k-assignment polytope M(m, n, k) is defined to be the polytope whose vertices are
the m × n (0, 1)-matrices with exactly k 1:s, and at most one 1 in each row and
each column.
In this paper, a description of the points in M(m, n, k) is given, and M(m, n, k)
is also described as a facet of a transportation polytope, and as a projection of
a network flow polytope. It is indicated how the description as a network flow
polytope can be used for linear optimisation over M(m, n, k).
The face poset of M(m, n, k) is investigated. A representation of the faces as
certain bipartite graphs, here called doped elementary graphs, is given. (There is an
equivalent representation of the faces as certain (0, 1)-matrices.) The representation as doped elementary graphs is used to describe the cover relation in the face
lattice of the polytope, and to give an exact expression for the diameter, which
turns out to be 1 when m, n ≤ (k + 1) if (m + n −(k ) ≤ 3 and 2 if (m) + n − k) ≥ 4.
If max(m, n) ≥ (k + 2), then the diameter is min max(m, n) − k, k .
Finally the concept of ear decompositions of bipartite graphs is generalised
to fit this problem, and an ear decomposition of the doped elementary graphs
is constructed. It is shown how this decomposition can be used to compute the
dimensions of the faces of M(m, n, k ).
This paper is a joint work with Svante Linusson. It is published in Discrete
Optimization, volume 6 (2009), pages 148–161.
Paper 2: A regular decomposition of the edge-product space of
phylogenetic trees
The second paper studies the edge-product space E ( X ) for trees on X, where X
is a fixed finite set.
One reason for investigating these spaces is that they are closely connected
to tree-indexed Markov processes in molecular evolutionary biology, see [22]. In
[22] it was shown that E ( X ) has a natural CW complex structure for any finite
set X, and a combinatorial description of the associated face poset was given.
This combinatorial description is the Tuffley poset S( X ) of X-forests.
In this paper it is shown that the edge-product space is a regular cell complex.
Here is an outline of the proof:
(
)
Using the notation in Section 5, it remains to show that the set ψF B(F )
is homeomorphic to [0, 1]dim F for all F ∈ S( X ). First it is concluded that it
18
Introduction
is enough to show the above for all X-trees T . Then
induction
on dim T is
(
)
used, with the induction hypothesis that the set ψF B(F ) is homeomorphic to
[0, 1]dim F for all( F ∈ S( X )) such that dim F < d. If dim T = d, it is shown that
the boundary δ ψT (B(T )) is a regular CW complex with face poset isomorphic
to S( X )<T (all X-forests in S( X ) less than T ). The poset [0̂, T ] is obtained by
adding a 0̂ and a 1̂ = T . In [22] it was shown that [0̂, T ] is graded
( and thin.
)
If [0̂, T ] also has a recursive coatom ordering, it follows that ψT δ(B(T )) is
(
)
(
)
homeomorphic to δ [0, 1]d . That ψT B(T ) is homeomorphic to [0, 1]d now
(
)
follows from the fact that for each y ∈ ψT B(T ) , ψT−1 (y) is a contractible regular
cell complex (which is shown in [22]).
The main ingredient of the proof is to conclude that all intervals [0̂, F ], where
F ∈ S( X ), have recursive coatom orderings. The method to show this, is not to
find one coatom ordering that is valid in each step of the recursion, but to find
a set of coatom orderings such that in each step of the recursion it is possible to
choose one valid coatom ordering from the set.
This paper is a joint work with Svante Linusson, Vincent Moulton and Mike
Steel, and my main contribution is to prove that each interval [0̂, F ] has a recursive coatom ordering. It is published in Advances in Applied Mathematics,
volume 41 (2008), pages 158–176.
Some rather straightforward proofs, similar to other proofs in the article,
were omitted in the journal version of the paper. For completeness, they are
included here in a supplement in the end of the article, and footnotes are added
at the references to the full proofs.
Paper 3: A generating function for X -forests
The third paper studies a generating function for the elements of the Tuffley
poset. The elements are semi-labelled forests, called X-forests, where X is a
finite set of labels.
Since the edge-product space E ( X ) has a regular cell decomposition with
face poset given by the Tuffley poset S( X ), it is natural to be interested in the
poset itself.
Interesting questions concern, e.g., the number of X-trees or X-forests with
a fixed number of edges, or the number of trees in the X-forests. Also, statistics
like the average number of edges in all X-forests or X-trees with | X | = n, or the
distribution of the number of trees in X-forests, could be interesting to study.
To be able to answer such questions, a useful tool is to have a generating
function which counts the number of edges, labelled nodes, components, and
labels in X.
It happens to be that it is easier to study a generating function counting
unlabelled nodes instead of labelled, but it gives as much information, as the
number of labelled nodes is the same as the number of edges plus the number
of components minus the number of unlabelled nodes.
The generating function studied looks like this:
19
8 Overview of the papers
∞
1+
∑
(
∑
n=1 F ∈S([n])
x |C(F )| y|E(F )| z|V (F )|
) wn
n!
,
where x, y, and z count the number of components, edges, and unlabelled nodes,
respectively, in the [n]-forests.
A closed formula for this generating function is found. It is
(
yz + 1
exp x
2y
(
( ( y(ew − 1 + z)
)2
yz
1
− yz+1 )
−
W
−
e
+
1
0
yz + 1
(yz + 1)2
))
,
where W0 is the principal branch of the Lambert W function.
In addition, closed formulas are found for the generating function of X-trees,
and for the generating functions of X-trees and X-forests with the restriction of
having at most one label on each node.
Since this generating function is analytic in a neighbourhood of (w, x, y, z) =
(0, 1, 1, 1), it is possible to use the powerful tool of singularity analysis to analyse
its coefficients as n → ∞.
Singularity analysis of generating functions is a method that can be used to
compute asymptotic expressions of the coefficients of a generating function by
analysing the singularities of the function. This method is described in [15].
In that way the asymptotic mean, variance, etc. are calculated for the number of edges, components, and unlabelled nodes in X-trees and X-forests as
| X | → ∞.
The asymptotic distributions of edges and unlabelled nodes seem to be normal distributions, while the asymptotic distribution of the number of components amazes with nice rational values on asymptotic mean, variance etc..
Paper 4: Pattern containment in random permutations
In this paper permutation statistics counting occurrences of patterns are studied
on a certain kind of random permutations. Let s be a permutation statistic.
Consider the product π of t permutations chosen uniformly at random from a
subset Γ of the symmetric group Sn . Now s(π ) is a random variable, and an
important characteristic is of course the expected value of s(π ), which will be
denoted EΓ (s, t).
This is interesting for example in phylogenetics, where Γ is mostly taken to
be some set of transpositions. See for example [12], [13], [14], and [19].
There is a method, developed by Hultman in [18], that makes it easy to compute EΓ (s, t) when Γ is a union of conjugacy classes of Sn . The only prerequisite
for using the method is that the mean s of s over the conjugacy classes has to be
expressed as a linear combination of irreducible characters of Sn .
This paper is focused on expressing the means of statistics counting occurrences of classical and vincular patterns as linear combinations of the irreducible
characters.
20
Introduction
A procedure for calculating these expressions when the patterns have length
3 is developed, and is then used to write the means of all statistics counting
occurrences of classical and vincular patterns of length 3 as linear combinations
of irreducible characters. It turns out that only five irreducible characters are
needed in all these expressions.
It is exemplified how the expressions of the means can be used to find the
expected values EΓ (s, t), where Γ is the set of all transpositions in Sn , for all
statistics s counting occurrences of classical patterns of length 3.
References
[1] R. K. Ahuja, T. L. Magnanti, and J. B. Orlin, Network flows: theory, algorithms,
and applications, Prentice Hall, Upper Saddle River, NJ, 1993.
[2] E. Babson and E. Steingrímsson, Generalized permutation patterns and a classification of the Mahonian statistics, Sém. Lothar. Combin. 44 (2000), Art. B44b.
[3] M. Bóna, Combinatorics of permutations, Discrete Mathematics and its Applications 72, CRC Press, Boca Raton, FL, 2012.
[4] L. J. Billera and A. Sarangarajan, The combinatorics of permutation polytopes,
Formal power series and algebraic combinatorics (New Brunswick, NJ,
1994) DIMACS Ser. Discrete Math. Theoret. Comput. Sci. 24, Amer. Math.
Soc., Providence, RI, 1996, 1–23.
[5] A. Björner, Topological methods, Handbook of Combinatorics, Vol. 2, MIT
Press, Cambridge, MA, 1995, pp. 1819–1872.
[6] A. Björner, M. Las Vergnas, B. Sturmfels, N. White, and G. Ziegler, Oriented
matroids, Cambridge University Press, Cambridge, 1993.
[7] R. A. Brualdi and P. M. Gibson, Convex polyhedra of doubly stochastic matrices,
I. Applications of the permanent function, J. Combin. Theory, Ser. A 22 (1977),
no. 2, 194–230.
[8] R. A. Brualdi and P. M. Gibson, Convex polyhedra of doubly stochastic matrices,
II. Graph of Ωn , J. Combin. Theory, Ser. B 22 (1977), no. 2, 175–198.
[9] R. M. Corless, G. H. Gonnet, D. E. G. Hare, D. J. Jeffrey, and D. E. Knuth,
On the Lambert W function, Adv. Comput. Math. 5 (1996), no. 4, 329–359.
[10] R. M. Corless, D. J. Jeffrey, and D. E. Knuth, A sequence of series for the
Lambert W function, International symposium on symbolic and algebraic
computation (Kihei, HI, 1997), ACM, New York, 1997, pp. 197–204.
[11] V. A. Emeličev, M. M. Kovalev, and M. K. Kravtsov, Polytopes, graphs and
optimisation, Cambridge University Press, Cambridge, 1984.
References
21
[12] N. Eriksen, Expected number of inversions after a sequence of random adjacent transpositions — an exact expression, Discrete Math. 298 (2005), no. 1–3,
155–168.
[13] N. Eriksen and A. Hultman, Estimating the expected reversal distance after a
fixed number of reversals, Adv. in Appl. Math. 32 (2004), no. 3, 439–453.
[14] H. Eriksson, K. Eriksson, and J. Sjöstrand, Expected number of inversions after
a sequence of random adjacent transpositions, Formal power series and algebraic combinatorics (Moscow, 2000), Springer, Berlin, 2000, pp. 677–685.
[15] P. Flajolet and R. Sedgewick, Analytic Combinatorics, Cambridge University
Press, Cambridge, 2009.
[16] J. Fulman, The distribution of descents in fixed conjugacy classes of the symmetric
groups, J. Combin. Theory Ser. A 84 (1998), no. 2, 171–180.
[17] J. Gill, The k-assignment polytope and the space of evolutionary trees, Licentiate
thesis, Linköping Studies in Science and Technology. Theses No. 1117,
Linköping University, 2004. Available from Liu Electronic Press at
http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-5677
[18] A. Hultman, Permutation statistics of products of random permutations,
arXiv:1301.0430 [math.CO], 2013.
[19] A. Jönsson, Evolutionary fixed point distance problems, M.Sc. Thesis, Chalmers
University of Technology, Gothenburg, 2009.
[20] D. E. Knuth, The art of computer programming, Vol. 1, 2nd ed., AddisonWesley, Reading, MA, 1973.
[21] L. Lovász and M. D. Plummer, Matching theory, North-Holland Mathematics Studies 121, Annals of Discrete Mathematics 29, North-Holland, Amsterdam, 1986.
[22] V. Moulton and M. Steel, Peeling phylogenetic ‘oranges’, Adv. in Appl.
Math. 33 (2004), no. 4, 710–727.
[23] B. E. Sagan, The symmetric group: representations, combinatorial algorithms, and
symmetric functions, 2nd ed., Graduate Texts in Mathematics 203, SpringerVerlag, New York, 2001.
[24] C. Semple and M. Steel, Phylogenetics, Oxford Lecture Series in Mathematics
and its Applications 24, Oxford University Press, Oxford, 2003.
[25] R. Simion and F. W. Schmidt, Restricted permutations, European J. Combin. 6
(1985), no. 4, 383–406.
[26] R. P. Stanley, Enumerative combinatorics, Vol. 1, Cambridge Studies in
Advanced Mathematics 49, Cambridge University Press, Cambridge, 1997.
[27] G. M. Ziegler, Lectures on polytopes, Graduate Texts in Mathematics 152,
Springer-Verlag, New York, 1995.
Included Papers
The articles associated with this thesis have been removed for copyright
reasons. For more details about these see:
http://urn.kb.se/resolve?urn=urn:nbn:se:liu:diva-98263