Control of cancellations that restrain the growth of a binomial recursion

J Geom Anal
DOI 10.1007/s12220-014-9489-y
Control of Cancellations that Restrain the Growth
of a Binomial Recursion
Magnus Aspenberg · Rodrigo Pérez
Received: 18 August 2010
© Mathematica Josephina, Inc. 2014
Abstract We study a recursion that generates real sequences depending on a parameter x. Given a negative x the growth of the sequence is very difficult to estimate due
to canceling terms. We reduce the study of the recursion to a problem about a family
of integral operators, and prove that for every parameter value except −1, the growth
of the sequence is factorial. In the combinatorial part of the proof we show that when
x = −1 the resulting recurrence yields the sequence of alternating Catalan numbers,
and thus has exponential growth. We expect our methods to be useful in a variety of
similar situations.
Keywords
Catalan numbers · Factorial growth · Integral operators
Mathematics Subject Classification
05A10 · 45P05
1 Introduction
Fix an arbitrary real number x = 0, and consider the sequence defined by the recursive
expression1
1 Indices r less than
n
2
are omitted, as they return vanishing binomials.
Communicated by Jeffrey Diller.
M. Aspenberg
Centre for Mathematical Sciences, Lund University, Box 118, 221 00 Lund, Sweden
e-mail: [email protected]
R. Pérez (B)
Department of Mathematical Sciences, LD-224R IUPUI, 402 N. Blackford St.,
Indianapolis, IN 46202, USA
e-mail: [email protected]
123
M. Aspenberg, R. Pérez
a1 = x,
an = x
n−1 r
ar .
n −r
n
r = 2 (1)
For x = 1, for instance,
we obtain the sequence 1, 1, 2, 7, 34, 214, 1652, . . .. Note
that the last summand n−1
1 an−1 guarantees that an ≥ (n − 1)!. This means that {an }
grows very fast since n! > (n/e)n . We prove
Theorem 1 For any real x = −1, 0 the sequence {an } defined by (1) grows superexponentially.
This is an interesting behavior, and not altogether obvious because when x < 0,
there are a lot of cancellations. In fact, when x = −1, the positive and negative terms
exactly balance out to yield a surprising contrast:
Theorem 2 When x = −1, formula (1) produces the sequence (−1)n Cn of Catalan
numbers with alternating signs, and therefore grows exponentially.
This seems to be a brand new formula for Catalan numbers, which is compelling
all by itself. However, our main interest is in calling attention to our method of proof
for Theorem 1. Here is why:
Let f : C → C be an analytic function with f (0) = 0 and f (0) = eπ iθ , where θ
is an irrational angle. A simple computation shows that there is always a formal power
series ϕ that conjugates f and its linear part z → eπ iθ z, but the coefficients of this
series can grow fast enough for its radius of convergence to be 0.
Under what conditions on θ is it possible to control the growth of the coefficients
so that ϕ results analytic? Since each coefficient is a sum of products of the form
1
(see [6]), the answer depends on the number-theoretic properties of
π
ikθ
e
− eπ iθ
θ . This is because when θ is nearly rational, some of these quotients can become very
large. This is an example of a small denominator problem.
In a famous 1942 paper [7], C. L. Siegel gave a lower bound on the radius of
convergence of the conjugating map when θ is Diophantine. In such situation it was
enough to estimate the growth of the largest product that appears in the formula for the
coefficients of ϕ. Now, it is possible for ϕ to have positive radius of convergence even
if the largest product grows super-exponentially. But for this to happen it is necessary
to detect large cancellations between products, so the coefficients of ϕ have a chance
to grow at a slower exponential rate. Our method of proof for Theorem 1 is tailored
for the detection of these kinds of cancellations.
Formula (1) is a modified version of the recursion that generates the coefficients
of the conjugating map ϕ when f is quadratic. The main simplification is that the
constant x replaces the small denominator terms. This allows us to concentrate on the
structural properties of the conjugating equation that make possible the presence of
cancellations.
The phenomenon at play is very interesting. In the first part of the paper some
combinatorial constructions will allow us to prove the results when x > 0 and when
x ≤ −1. However, when x ∈ (−1, 0), the cancellations and the small size of x
conspire to render elementary arguments ineffective. In the second part we introduce
123
Control of Cancellations that Restrain...
functional analytic methods to control the effect of the cancellations in that case. An
application to Siegel disks will be investigated in [1]; also, we expect these ideas to
be useful in a variety of similar situations.
1.1 Structure
The paper has two parts. In the first (Sects. 2 to 4) we prove Theorem 2 and the situation
x∈
/ [−1, 0] of Theorem 1. Section 2 presents some basic facts about hypercube graphs
and the Catalan numbers; Section 3 defines the combinatorial structure we use; and
Sect. 4 contains the proofs.
The second part (Sects. 5 to 7) tackles the case x ∈ (−1, 0) of Theorem 1. Section 5
uses the combinatorial knowledge gained in the first part to derive an alternative
expression for an as the sum of a sequence of numbers Sn (1), . . . , Sn (n−1) constructed
recursively. This sequence is translated into a function sn ∈ L 2 [0, 1], and the recursion
is interpreted as an integral operator. Section 6 contains the proof of the theorem
assuming the statement of Lemma 11, and Sect. 7 is devoted to the proof of Lemma 11.
2 Basic Combinatorial Facts
2.1 Hypercubes
The hypercube graph Hn is the graph whose set of vertices Vn consists of all n-vectors
with coordinates 0 or 1 [3]. Two vertices are adjacent whenever they differ in one
coordinate. There is a natural stratification of Vn by the number of coordinates of each
value in a vertex; accordingly, let Vn,r ⊂ Vn denote the vertices with r coordinates
equal to 1. There are other equivalent definitions of hypercube graphs. The advantage
of the definition in terms of binary coordinates is that the following facts become
obvious; compare Fig. 1.
(H1) |Vn | = 2n. (H2) |Vn,r | = nr .
(H3) If n = m 1 + m 2 , then Hn = Hm 1 × Hm 2 .
items (H1) and (H2) give a succinct proof of the binomial identity
n nIncidentally,
n (Fig. 1) . If instead of just counting vertices, they are assigned weight
=
2
r =0 r
x r , item (H3) furnishes a recursive proof of Newton’s binomial formula:
n
n r
n
(H4)
r x = (1 + x) .
r =0
000
00
0
010
001
01
10
100
011
110
1
11
101
111
Fig. 1 The hypercube graphs H1 , H2 , H3 , and a decomposition of the latter as H1 × H2
123
M. Aspenberg, R. Pérez
Fig. 2 The C3 = 5 monotone paths from (0, 0) to (3, 3)
2.2 Catalan Numbers and Lattice Paths
One of many equivalent definitions of the Catalan numbers 1, 1, 2, 5, 14, 42, 132, . . .
[8, A000108] is via the formula
2n Cn+1 =
n
(n+1) .
The exponential rate of growth of the sequence {Cn } follows easily from Stirling’s
formula:
Cn+1
2n
√
2π(2n) 2n
(2n)!
4n
22n
e
∼ √
=
.
∼
=
√
√
2
2
n n
(n!) (n + 1)
π n 3/2
π n(n + 1)
2π n e
(n + 1)
Definition A lattice path is a path in the lattice Z × Z that moves one horizontal
or vertical unit at every step without self-intersections. We consider monotone paths,
which never move left nor down (Fig. 2). Note that a monotone path from (0, 0) to
(m, n) requires m + n steps. Choosing one such path is tantamount to deciding which
of these steps will be the m horizontal
steps, so the number of monotone lattice paths
.
from (0, 0) to (m, n) is m+n
m
Lemma 3 The number of monotone paths from (0, 0) to (n, n) that do not cross over
the diagonal {y = x} is equal to Cn [9].
Proof of Lemma 3 A monotone path γ from (0, 0) to (n, n) that crosses over the
diagonal will pass through a point ( j, j + 1). Let P be the first such point, and γ the portion of γ going from P to (n, n). Reflecting γ on the diagonal {y = x + 1}
transforms γ into a monotone path from (0, 0) to (n − 1, n + 1). This operation is
bijective because such paths must cross over the diagonal. Therefore, the number of
monotone paths from (0, 0) to (n, n) that do not cross over the diagonal equals the
number of all monotone paths from (0, 0) to (n, n), minus the number of monotone
paths from (0, 0) to (n − 1, n + 1); i.e.,
2n 2n
2n
2n
2n
n
= n .
−
=
−
n+1 n
n+1
n
n−1
n
123
Control of Cancellations that Restrain...
3 Structure of an
3.1 Signatures and Binomial Products
The internal structure of the expression an is better understood by separating the
different contributions of weight x r . After expanding the recursive expressions in (1),
the first few terms are
a1 =
a2 =
a3 =
a4 =
a5 =
= x,
= 11 x 2 ,
x 11 a1
= 21 11 x 3 ,
x 21 a2
x 22 a2 + 31 a3 = 22 11 x 3 + 31 21 11 x 4 ,
x 23 a3 + 41 a4 = 23 21 11 x 4 + 41 22 11 x 4
+ 41 31 21 11 x 5 .
(2)
This symbolic manipulation makes it clear that an is the sum of all products of the
form
bs
bs−1
b2
b1
···
· x s+1 ,
(3)
n − bs bs − bs−1
b3 − b2
1
such that
n =: bs+1 > bs > bs−1 > · · · > b1 = 1 , and b j+1 ≤ 2b j ( j = 1, . . . , s). (4)
The last condition is a consequence of the fact that the sum in (1) starts at r =
Note that this condition forces b2 = 2.
n
2
.
Definition A tuple σ = (n, bs , . . . , b1 ) satisfying (4) is called an n-signature. The
n-signature that contains all the numbers from 1 to n is called canonical.
Note Compare the recursion (1) with the similar looking α1 = x, αn = x rn−1
=n/2 αr
in which the binomial coefficients have been removed. From the above discussion we
see that αn is the sum of weights x s+1 , taken over all signatures (n, bs , . . . , b1 ).
Replacing x with 1 shows that the number of distinct n-signatures is given by the
recursion
N1 = 1,
Nn =
n−1
r =
n
2
Nr .
The numbers {Nn } = {1, 1, 1, 2, 3, 6, 11, 22, 42, . . .} form the Narayana–Zidek–
Capell sequence [8, A002083].
123
M. Aspenberg, R. Pérez
5
5
5
5
5
5
4
4
4
4
4
4
3
3
3
3
3
3
2
2
2
2
2
2
1
1
1
1
1
1
(6,3,2,1)
(6,4,2,1)
(6,4,3,2,1)
(6,5,3,2,1)
(6,5,4,2,1)
(6,5,4,3,2,1)
Fig. 3 The towers associated with all 6-signatures. The second tower, for instance, has blocks at positions
1, 2, and 4. The rightmost signature is the canonical one
3.2 Arrays and Blocks
When faced with an expression made of binomial coefficients, the natural thing to
ask is what kind of combinatorial object is being counted. To a given signature
σ = (n, bs, . . . ,b1 ) we
b1 will assign a tower that can be filled with an array of numbers
bs
·
·
·
in exactly n−b
1 ways.
s
Definition Given σ = (n, bs , . . . , b1 ), consider a tower of n −1 square cells split into
blocks of lengths (n − bs ), (bs − bs−1 ), . . . , (b3 − b2 ), (b2 − b1 ) from top to bottom
as in Fig. 3. The position of a block is the height b j of its lowest cell, so the signature
condition b j+1 ≤ 2b j implies that a block is never taller than its position. An array
associated with σ is an assignment of numbers to every cell in the tower of σ such that
the numbers in the j-th block (at position b j ) are chosen from the set {1, 2, . . . , b j }
and appear in descending order. An array associated with the canonical signature is
also called canonical.
Lemma 4
(a) The number
b1 of n-arrays associated with a signature σ = (n, bs , . . . , b1 ) is
bs ·
·
·
n−bs
1 .
(b) The number of canonical n-arrays is (n − 1)!
(c) The total number of n-arrays is given by formula (1) when x = 1.
Proof The tower associated with σ has s blocks. The j-th block is based at position
b j and its length is b j+1 − b j (for the topmost block the length is n − bs ). Therefore,
the j-th block can be filled with an arbitrary choice of b j+1 − b j numbers between
j
possibilities. This proves (a), from which item (b) follows
1 and b j ; i.e., b b −b
j+1
j
immediately. To prove (c), assign the weight x s to every n-array with s blocks, and
note from (3) that an represents the sum of weights over all n-arrays. Thus, when
x = 1, an simply counts the number of n-arrays as claimed.
Definition The sequence of numbers that specifies an array is called a pattern. Our
convention is to read patterns from the bottom up; thus, for instance, the rightmost
array in Fig. 4 has pattern [1232].
Lemma 5 A tuple of numbers [t1 , . . . , tn−1 ] is realized as the pattern of some array
if and only if
123
Control of Cancellations that Restrain...
4
1
4
1
4
2
4
1
4
1
4
2
3
2
3
3
3
3
3
2
3
3
3
3
2
1
2
1
2
1
2
2
2
2
2
2
1
1
1
1
1
1
1
1
1
1
1
1
Fig. 4 The 5-signature (5, 3, 2, 1) has six associated arrays: The block at position 2 can hold either a 1 or
a 2, while the block at position 3 can hold any descending combination of the numbers 1, 2, 3
t j ≤ j for all 1 ≤ j ≤ n − 1.
Proof In a canonical array every block has length one. This means that the position
of the j-th block is b j = j, and the number in this block is t j . Thus, in this case, the
pattern condition is equivalent to t j ≤ b j = j, proving the result for canonical arrays.
In a non-canonical array, the cell at position j belongs to a block at position i ≤ j.
The pattern condition states that the number t j in that cell must be at most i, and the
result follows.
3.3 Array Hypercubes
Definition If an array has a block at position p with more than one cell, the block
can be split into two shorter blocks. The result is a valid array since the blocks have
positions p and p + η > p (η is the location of the split within the original block),
and the numbers in both blocks are all at most p. We call this operation on arrays a
split. Note that an array can usually be split in several ways, all of which commute.
Moreover, repeated splitting eventually results in a canonical array.
The reverse operation is also well defined. If an array has two consecutive blocks
at positions p and p + η, and the numbers contained in both blocks run together in
descending order, the two blocks can be combined into a single one. This is because
the new block is at position p and contains numbers in descending order, which
means that the length of the new block cannot exceed its position. In other words,
condition (4) is satisfied. This operation on arrays is called a merge. As with splitting,
merge operations are commutative, and repeated merging must terminate. An array
where no pair of blocks can be merged is called primitive.
Definition The graph Gn on the set of n-arrays is defined by joining any two arrays
related by a single split/merge operation.
Note that arrays with the same pattern of numbers are connected by successive
splits to the same canonical array, and thus this condition is equivalent to belonging in
the same connected component of Gn . Moreover, a split/merge is possible at a given
position if and only if the numbers at that position are in descending order. In particular,
splitting/merging does not depend on the structure of blocks in an array, but only on
the pattern of numbers. This yields the following lemma.
Lemma 6 Every connected component of G n is isomorphic (in the graph-theoretical
sense) to a hypercube graph.
123
M. Aspenberg, R. Pérez
Fig. 5 The pattern [12142] has
2 descents, and thus determines
four 6-arrays connected by
split/merge operations into a
square H2
5
2
4
4
3
1
2
2
5
2
1
1
4
4
(6,4,2,1)
3
2
1
1
5
(6,4,3,2,1)
5
2
4
4
1
3
1
2
2
2
2
1
1
4
4
(6,5,4,2,1)
3
1
2
2
1
1
(6,5,4,3,2,1)
Proof Consider an array A ∈ Gn . The connected component C of A consists of all
arrays with the same pattern of numbers as A. This pattern has descents (locations
where the numbers are in descending order). Now view such locations as placeholders
for a symbol 1 or 0 depending on whether two blocks of A meet at that location or
not. This puts the arrays of C in correspondence with vertices of the hypercube graph
H ; see Fig. 5. Since a split/merge depends only on the pattern of numbers, all edges
of H are included and C is isomorphic to H .
Observation Every hypercube C ⊂ Gn has a unique primitive array and a unique
canonical array. Since canonical arrays are in one-to-one correspondence with valid
patterns, the numbers of hypercubes in Gn and of primitive n-arrays are both equal
to (n − 1)! by Lemma 5. Also, if is as in the proof above, the primitive array has
s = n − − 1 blocks (because the canonical array has n − 1), so C is isomorphic to
Hn−s−1 .
4 The First Proofs
Each an is a polynomial in x. When written with the monomials ξr x r in ascending
order by degree, we say an is in basic format. Since every n-array with r − 1 blocks
contributes x r to the total an , the coefficient ξr counts the number of such arrays.
Example The first few an in basic format are (compare (2)):
a1 = x,
a2 = x 2 ,
a3 = 2x 3 ,
123
Control of Cancellations that Restrain...
a4 = x 3 + 6x 4 ,
a5 = 10x 4 + 24x 5 ,
a6 = 8x 4 + 86x 5 + 120x 6 .
(5)
When n = 6, for instance, we know that the towers with 4 blocks correspond to
the three
signatures
(6, 4,3,2,1),(6,
5, 3, 2, 1), and
(6,
5,
4, 2, 1) (see Fig. 3). These
have 24 31 21 11 = 36, 51 23 21 11 = 30, and 51 41 22 11 = 20 associated arrays
respectively, and we see that each of these 86 arrays contributes x 5 to the value of a6 .
Recall that the array graph Gn consists of isolated hypercube components. We can
also break an down as a sum of contributions by hypercubes. Let A be a primitive narray with s = r − 1 blocks. We know that A contributes x r to an . Since the connected
component C of G containing A is homeomorphic to Hn−r , items (H2) and (H4) in
Sect. 2.1 give
a∈C
(H2)
x (#blocks of a)+1 =
n−r n −r
j=0
j
(H4)
x r + j = x r (1 + x)n−r .
(6)
Let primn (r ) be the number of primitive arrays with r − 1 blocks; this is also the
number of components of Gn homeomorphic to Hn−r . Equation (6) shows that the
total contribution to an of all arrays in all such hypercubes is primn (r ) · x r (1 + x)n−r ,
so
n
(7)
an =
primn (r ) · x r (1 + x)n−r .
r =log2 n (Condition (4) implies that the least possible number of blocks is log2 n .)
When an is written in that form, we can read that each of the primn (r ) components
of Gn with dimension n − r contributes x r (1 + x)n−r to the total sum an . We will say
that an is in binomial format.
Example Let us compute the binomial format of a6 . In Eq. (5) we found that the lowest
degree monomial of a6 is 8x 4 . This indicates that the least number of blocks in a 6array is 3, and that there are 8 arrays with 3 blocks. All of these are clearly primitive,
so prim6 (4) = 8. These arrays (together with those obtained by splitting) determine 8
copies of H2 in G6 . Gathering the monomials of all arrays in these hypercubes gives
a6 = 8x 4 + 86x 5 + 120x 6
= (8x 4 + 16x 5 + 8x 6 ) + 70x 5 + 112x 6
= 8x 4 (1 + x)2 + 70x 5 + 112x 6 .
The remaining 70 arrays with 4 blocks must be primitive since they were not
incorporated into an H2 ; thus, prim6 (5) = 70. These arrays (together with those
obtained by splitting) determine 70 copies of H1 in G6 . Thus,
8x 4 (1 + x)2 + 70x 5 + 112x 6 = 8x 4 (1 + x)2 + 70x 5 (1 + x)1 + 42x 6 .
123
M. Aspenberg, R. Pérez
The first few an in binomial format are:
a1 = x 1 y 0 ,
a2 = x 2 y 0 ,
a3 = 2x 3 y 0 ,
a4 = x 3 y 1 + 5x 4 y 0 ,
a5 = 10x 4 y 1 + 14x 5 y 0 ,
a6 = 8x 4 y 2 + 70x 5 y 1 + 42x 6 y 0 ,
where y stands for 1 + x. This convention makes for cleaner looking expressions, and
will be used consistently in the rest of the paper.
Now that the combinatorial structure is in place, the proofs of Theorem 2 and the
case x ∈
/ [−1, 0] of Theorem 1 are straightforward.
Proof of Theorem 2 Since x = −1, the only non-zero term in (7) occurs when r = n.
In other words,
an = (−1)n primn (n).
Now, primn (n) is the number of n-arrays with n − 1 blocks; that is, arrays that are both
primitive and canonical. By definition, these are arrays with non-decreasing patterns
of n − 1 numbers. Any such pattern can be associated with a monotone lattice path
from (0, 0) to (n − 1, n − 1) that does not cross over the diagonal. Simply subtract
1 from each entry in the pattern and interpret the results as heights of the horizontal
steps of a monotone path (compare Fig. 2). This procedure is bijective, so by Lemma 3,
primn (n) = Cn .
Proof of Theorem 1 when x ∈
/ [−1, 0] Depending on the sign of x, one of the two
formats for an displays no cancellations.
x > 0: All monomials ξr x r in the basic format of an are positive because the coefficient ξr counts arrays with r − 1 blocks. In particular, an is larger than the highest
order monomial. The coefficient of this monomial is the number of arrays with the
most blocks; i.e., canonical arrays. By Lemma 4,
an > (n − 1)! · x n .
x < −1: Since y < 0, all terms primn (r ) · x r y n−r in the binomial format of an have
the same sign, and do not cancel each other. Also, |x| > |y|, so
|an | =
r
primn (r ) · |x r y n−r | >
primn (r ) · |y n | = (n − 1)! · |y n |,
r
where the last equality follows from the observation after Lemma 6.
In both cases, |an | is larger than (n − 1)! · w n for some positive w, and the result
holds.
123
Control of Cancellations that Restrain...
5 The Case x ∈ (−1, 0)
The situation when x ∈ (−1, 0) is more delicate because the terms in both the basic
and binomial formats of an have alternating signs. Our strategy in this second part
involves a different representation of an (Eqs. (8) and (9)). We will interpret the
sequences {Sn (1), . . . , Sn (n − 1)} as functions sn ∈ L 2 [0, 1] and the recursion (9) as
a sequence of integral operators An : sn → sn+1 . Then we deduce some facts about
the shape of the graph of sn , and about the limit operator T = limn→∞ An . We will
use this information to show that the largest eigenvalue λ of T bounds from below the
exponential rate of decay of an /(n − 2)!
5.1 A New Recursion
So far we have established that an is the sum of contributions of the form x r over
a large set of arrays (for each array, r − 1 is the number of blocks). We grouped
arrays with the same number pattern into hypercube graphs, and showed that an is
the sum of contributions x n− (1 + x) = x n− y over hypercubes C, where is
the dimension of each C. This dimension is the number of descents in the associated
pattern, so we can abandon arrays and express an directly as a sum of contributions over
patterns:
n−log2 n −1
an =
=0
patterns with
descents
x n− y .
This allows us to sort the contributions to an made by individual patterns. To this
end, consider an n-pattern π = [t1 , . . . , tn−1 ]. If the truncated pattern [t1 , . . . , tn−2 ]
contributes x a y b to an−1 , then π contributes x a y b+1 or x a+1 y b to an depending on
whether tn−1 < tn−2 or not (i.e., on whether π has one extra descent or not at the last
position). This motivates the following definition.
Definition For n ≥ 2 let Sn (r ) denote the sum of contributions of all patterns
[t1 , . . . , tn−1 ] such that tn−1 equals r (thus, r can take values in {1, . . . , n − 1}).
In particular, S2 (1) = x 2 , and
n−1
an =
Sn ( j).
(8)
j=1
By the previous argument, Sn+1 (r ) can be computed from the contributions of
n-patterns:
r
n−1
Sn ( j) + y ·
Sn ( j)
(9)
Sn+1 (r ) = x ·
j=1
j=r +1
(when r = n−1 or n, there is
no descent in the last position, so (9) should be interpreted
to mean Sn+1 (n − 1) = x · n−1
j=1 Sn ( j) = Sn+1 (n)).
123
M. Aspenberg, R. Pérez
Table 1 Entries of the
sequences Sn for 2 ≤ n ≤ 5,
computed according to the
recursion (9)
n \r
1
2
x2
3
x3
x3
4
x 4 + yx 3
2x 4
2x 4
5
x 5 + 5yx 4
3x 5 + 3yx 4
5x 5 + yx 4
2
3
4
5x 5 + yx 4
5.2 Sinusoidal Shape of Sn
The last term of the left sum in (9) is x · Sn (r ). Substituting this with the equivalent
expression (y − 1) · Sn (r ), we obtain
Sn+1 (r ) = x ·
r −1
Sn ( j) + y ·
r −1
j=1
Sn ( j) + (y − 1) · Sn (r )
j=r +1
j=1
=x ·
n−1
Sn ( j) + y ·
n−1
Sn ( j) − Sn (r )
j=r
= Sn+1 (r − 1) − Sn (r ),
(10)
valid for 2 ≤
r ≤ n − 2. Moreover, Eq. (10) remains valid for r = 1 if we define
Sn (0) = y · n−2
j=1 Sn−1 ( j). In this manner we derive a more practical method of
computing the sequence Sn :
• S2 (1) = x 2 .
For n ≥ 3:
• Sn (0) = y n−2
j=1 Sn−1 ( j),
• Sn (r ) = Sn (r − 1) − Sn−1 (r ), (1 ≤ r ≤ n − 2),
• Sn (n − 1) = Sn (n − 2).
Observation Sn (0) is not part of the original sequence, but including this auxiliary
term makes the definition of the sequence considerably simpler. It is vital to the coming
arguments that Sn (0) = yan−1 and Sn (n − 2) = xan−1 are both 0 or have opposite
signs.
For illustration purposes Table 2 shows the first few sequences Sn (2 ≤ n ≤ 5),
including the auxiliary values Sn (0) (compare with the expressions in Table 1). Also,
Fig. 6 shows a sample plot of values for Sn .
Definition Let n ≥ 3. Given a fixed x, the sequence Sn has
• a sign change at (A, B) if 0 ≤ A < B ≤ n − 1, all terms Sn ( j) are equal to 0 for
every index A < j < B (perhaps an empty set), and either
Sn (A) < 0 < Sn (B) (an up-change) or
Sn (A) > 0 > Sn (B) (a down-change);
123
Control of Cancellations that Restrain...
Table 2 Entries of the sequences Sn for 2 ≤ n ≤ 5, computed according to the recursion (10)
n \r
0
1
2
—
x2
3
yx 2
(y − 1)x 2
(y − 1)x 2
4
(2y 2 − 2y)x 2
(2y 2 − 3y + 1)x 2
(2y 2 − 4y + 2)x 2
(2y 2 − 4y + 2)x 2
5
(6y 3 −11y 2 +5y)x 2
(6y 3 −13y 2 +8y−1)x 2
(6y 3 −15y 2 +12y−3)x 2
(6y 3 −17y 2 +16y−5)x 2
2
3
4
(6y 3 −17y 2 +16y−5)x 2
Substituting y = 1 + x confirms that these values are identical to those in Table 1
Fig. 6 The sequence S20 when x = −2/7
• an extreme at (A, B) if Sn−1 has a sign change at (A + 1, B). Note that there is at
least one index between A and B, and Sn ( j) equals some fixed value M for every
A < j < B;
If Sn (A) < M > Sn (B), the extreme value Mis a local maximum;
If Sn (A) > M < Sn (B), the extreme value Mis a local minimum;
• an inflection at (A, B) if Sn−1 has an extreme at (A + 1, B). Note that there are at
least two indices between A and B.
The sign change/extreme/inflection is located 2 at B − 1.
Observation From (10) we get Sn−1 (r ) = Sn (r − 1) − Sn (r ), so the sequence Sn−1
acts as a “discrete derivative” of the sequence Sn . A corollary of property (ShB)
below is that in our inflections the “slope” is always steeper in the middle than
on the sides. Also, note that Sn (n − 1) cannot be part of a change, extreme, or
inflection because Sn (n − 1) = Sn (n − 2) (we allow this range in the definition solely to make the proofs easier to write). Thus, in practice, a sign change
has B ≤ n − 2 and, since the definition of extreme at level n depends on a sign
change at level n − 1, an extreme has B ≤ n − 3. Similarly, an inflection has
B ≤ n − 4.
2 The proof of (ShE) below illustrates the convenience of this extra definition.
123
M. Aspenberg, R. Pérez
Proposition 7 For all n ≥ 3 the sequence Sn satisfies
For some r in the range 1 to n − 2 we have Sn (r ) = 0.
There are at most one sign change, one extreme, and one inflection.
A local maximum M is positive. A local minimum M is negative.
There is at most one r in the range 1 to n − 2 such that Sn (r ) = 0.
At least one of the two values minr Sn (r ) and maxr Sn (r ) (r = 0, . . . , n − 2)
is attained at an endpoint: Sn (0) or Sn (n − 2).
(ShE) Assume Sn has an extreme and an inflection. Let their locations be E ≥ 1 and
I ≥ 2, respectively. Then
(ShO)
(ShA)
(ShB)
(ShC)
(ShD)
(a) E < I ⇒ |Sn−1 (1)| ≤ |Sn−1 (0)| and |Sn−1 (n − 3)| ≥ |Sn−1 (n − 2)|,
(b) E ≥ I ⇒ |Sn−1 (1)| ≥ |Sn−1 (0)| and |Sn−1 (n − 3)| ≤ |Sn−1 (n − 2)|.
Proof of (ShO) Suppose that Sn (r ) = 0 for 1 ≤ r ≤ n − 2. Then Sn (0) = xy Sn (n −
2) = 0 too, and we find from (10) that Sn−1 (r ) = 0 for every 1 ≤ r ≤ n − 3. This
leads to a contradiction since S3 (1) = x 3 < 0.
Proof of (ShA) By the definition of extreme and inflection we only need to prove that
Sn has at most one sign change for n ≥ 3. From Table 2 it is clear that S3 has a sign
change (indeed, a down-change) for any value of x. Let us assume then that n ≥ 4.
If Sn has multiple sign changes, they must alternate between up- and down-changes.
Say Sn has an up-change followed by a down-change (the other case is analogous).
In between these two sign changes there is a (positive) local maximum. Now, at least
one of Sn (0) and Sn (n − 2) is non-negative (both are zero or have opposite signs), so
Sn also has a (negative) local minimum.
Two extremes for Sn mean two sign changes for Sn−1 . Thus, we have found that
multiple sign changes for Sn would imply multiple sign changes for Sn−1 . But S4 can
only have one sign change between S4 (0) and S4 (2) = xy S4 (0).
Proof of (ShB) Suppose Sn has a local minimum M ≥ 0 at (A, B) (a local maximum
M ≤ 0 is ruled out in an analogous way). Since there are no other extremes, both Sn (0)
and Sn (n − 2) must be strictly larger than Sn (A + 1) = M ≥ 0, which contradicts the
fact that they have opposite signs.
Proof of (ShC) The statement is trivially true for n = 3 and follows from (ShO) when
n = 4, so we can assume n ≥ 5.
Note that it is possible for Sn to have two zeros in the range 0 to n − 2; namely,
when Sn (0) = Sn (n − 2) = 0. In this situation (ShO) forces Sn to have a non-zero
extreme at E (1 ≤ E ≤ n − 3), and consequently, Sn−1 has a sign change at E. It
follows that Sn−1 (0) and Sn−1 (n − 3) are different from 0; otherwise, the sign change
of Sn−1 would imply two extremes, contradicting (ShA).
Still assuming Sn (0) = Sn (n − 2) = 0, consider what would happen if Sn had one
more zero:
– If Sn (n − 3) = 0, then Sn−1 (n − 2) = 0 (using (10)), and we saw above that this
contradicts the sign change of Sn−1 at E.
123
Control of Cancellations that Restrain...
– If Sn (1) = 0, we would have Sn−1 (1) = 0 (again, using (10)). As above, Sn−1 (0)
cannot be 0, so let us compare the sign of Sn−1 (0) with the sign immediately to the
left of the sign change at E:
The signs cannot be the same because Sn−1 (1) = 0 would be an extreme, contradicting (ShB). But the signs cannot be opposite either because that would imply a
second sign change to the left of E, contradicting (ShA).
– If Sn (1) = 0 and Sn (n − 3) = 0, a zero in between would imply two extremes,
giving again a contradiction.
When Sn (0) and Sn (n − 2) are different from 0 the signs of these endpoints are
opposite and Sn has (exactly) one sign change. It remains to consider the possibility
that this sign change consists of a long run of consecutive zeros. Accordingly, assume
that Sn (r ) = · · · = Sn (r +k) = 0 (r, k ≥ 1), with Sn (r −1) < 0 and Sn (r +k +1) = 0
(the case Sn (r − 1) > 0 is analogous). By (ShB) Sn (r + k + 1) must be positive. But
then
Sn−1 (r ) < 0 = Sn−1 (r + 1) = · · · = Sn−1 (r + k) > Sn−1 (r + k + 1),
so Sn−1 would have a local maximum M = 0, contradicting (ShB).
Proof of (ShD) Assume Sn−1 (1) ≤ 0 so Sn (1) ≥ yan−1 (the case Sn−1 (1) > 0 is
analogous). It follows that Sn−1 cannot have a down-change at (A, B) unless A = 0,
and therefore Sn cannot have a local minimum. As a consequence, the value minr Sn (r )
is attained at either r = 0 or r = n − 2, and the conclusion follows.
Proof of (ShE) We can assume that the extreme of Sn is a local maximum (the local
minimum case is analogous). Then Sn−1 has an up-change at E, and an extreme at I .
Note in particular that yan−2 = Sn−1 (0) < 0. There are two cases:
(a) The up-change of Sn−1 is located before its extreme. Hence the extreme is a local
maximum, and therefore it lies above Sn−1 (n − 2) = xy Sn−1 (0) > 0. By (ShD),
the minimum of Sn−1 is Sn−1 (0), so
Sn−1 (0) ≤ Sn−1 (1) ≤ Sn−1 (E) ≤ 0.
(The last inequality holds because E is the last point before Sn−1 becomes
positive.)
(b) The up-change of Sn−1 is located at, or after its extreme. Hence the extreme is
a local minimum, and therefore it lies below Sn−1 (0). Now, all values Sn−1 (r )
for 0 ≤ r ≤ I are negative because the up-change is located at E ≥ I . We
claim further that these values are non-increasing; otherwise, Sn−1 would have a
negative local maximum. Since Sn−1 has an up-change, Sn−1 (n − 2) = xan−2 is
non-negative, and we have
0 ≥ yan−2 = Sn−1 (0) ≥ Sn−1 (1).
The proofs of the inequalities at the right extreme of Sn−1 are analogous.
123
M. Aspenberg, R. Pérez
5.3 Sn Becomes a Step Function
Formula (9) induces a linear operator An : Rn−1 −→ Rn . Here we will embed An as
an integral operator An : L 2 [0, 1] −→ L 2 [0, 1], and find an operator T which is the
limit of {An } in the operator norm. The goal will be to link the growth of {an } to the
spectral properties of T .
For n ≥ 2, the r -th entry of the column vector
⎤
⎡
Sn (1)/(n − 2)!
⎥
⎢
..
n−1
sn = ⎣
⎦∈R
.
Sn (n − 1)/(n − 2)!
represents the average contribution to an of patterns with last entry tn−1 = r (of
which there are (n − 2)!). With this notation, Eq. (9) can be interpreted as a linear
transformation
sn+1 = (An · sn )/(n − 1),
(11)
where An is the n × (n − 1) matrix whose (i, j)-entry is x if i ≥ j, and y otherwise.
the standard basis vector e j
Let E n : Rn−1 −→ L 2 [0, 1] be the linearmap that sends
j−1
j
n−1 , n−1
. The vector sn maps to the step
j−1
j
.
function sn = E n (sn ), such that sn (u) = Sn ( j)/(n − 2)! whenever u ∈ n−1
, n−1
to the characteristic function of the interval
The maps {E n } embed the linear operators An : Rn−1 −→ Rn into linear operators
An : L 2 [0, 1] −→ L 2 [0, 1]. In particular, Eq. (11) takes the form
1
sn+1 (u) = [An (sn )] (u) =
αn (u, v) · sn (v)dv,
(12)
0
where the kernel αn is a piecewise constant function whose value at (u, v) ∈
j−1
j
× n−1
is (Fig. 7)
, n−1
x if i ≥ j
αn (u, v) =
i.e., equal to (An )i j .
y otherwise
i−1
n
, ni
Observation The factor 1/(n − 1)in (11) is hidden as a normalization factor in (12).
i
Indeed, when u lies in the interval i−1
n ,n ,
1
[An (sn )] (u) =
αn (u, v) · E n (sn )(v)dv
0
=
n−1
(An )i j
j=1 j−1
j
n−1 , n−1
123
Sn ( j)
dv
(n − 2)!
Control of Cancellations that Restrain...
Fig. 7 The kernel α10 (the
shaded region is 10 ). Note that
the unit square is divided into
1 by 1
rectangles of size 10
9
v
y
x
u
n−1
Sn ( j)
1
=
(An )i j
·
,
(n − 2)! n − 1
j=1
which is the i-th entry of sn+1 . In particular, (compare Eq. (8)):
1
an = (n − 1)!
sn (v)dv.
0
To prove the theorem we need to show that there is a sequence of indices {n k } so
1
the rate of exponential decay of the integrals 0 sn k (v) dv is bounded from below. The
bound will be dictated by the largest eigenvalue λ of the limit operator of An .
5.4 The Limit Operator T
Here we define the limit operator T of the sequence {An }, and establish some of its
basic properties [4], [5, Sect. 11].
Let T : L 2 [0, 1] −→ L 2 [0, 1] by
1
κ(u, v) · f (v)dv,
(T f )(u) =
0
with kernel (Fig. 7)
κ(u, v) =
x
y
if u ≥ v
otherwise.
123
M. Aspenberg, R. Pérez
Lemma 8 The operator T is the limit of {An } in the operator norm:
1
T − An 2 ≤ √ .
n
Proof The kernel of T − An is the function κ − αn . Since y is just shorthand for 1 + x,
we see that κ − αn is the characteristic function of the staircase region n in the unit
square, consisting
of the upper triangle {0 ≤ u, v ≤ 1 | u < v}, minus those rectangles
i−1 i j−1 j n , n × n−1 , n−1 such that i < j. Then the Lebesgue measure of n is
μ(n ) =
1
1
1 (n − 2)(n − 1)
−
·
= .
2
2
n(n − 1)
n
It follows that
1/2
1
T − An 2 ≤ μ(n )
=√ .
n
5.5 Eigenfunctions of T
The operator T can be expressed as follows:
1
κ(u, v) · f (v) dv
(T f )(u) =
0
1
=x·
1
f (v)dv + y ·
0
f (v) dv
0
= −F(u) + y · F(1) − x · F(0),
where F is any primitive of f . To find the eigenvalues of T , set
(λ f )(u) = (T f )(u) = −F(u) + y · F(1) − x · F(0),
and differentiate to obtain the ODE
f (u) =
− f (u)
,
λ
with general solution
f (u) = Ce−u/λ .
123
(13)
Control of Cancellations that Restrain...
A primitive of f is F(u) = −λCe−u/λ , so substituting in (13) gives
λ Ce−u/λ = λCe−u/λ − λC y · e−1/λ − x .
Thus, λ is an eigenvalue if and only if
e−1/λ =
Note that
x
y
x
.
y
< 0 exactly when x ∈ (−1, 0). Then we can write the eigenvalues as
λm =
log
x
y
−1
−1
=
for all m ∈ Z,
+ 2mπ i
log xy + (2m + 1)π i
and the eigenfunction corresponding to λm is
u
f m (u) = xy e(2m+1)π iu .
Definition For ease of notation, we write the absolute values of the two largest eigenvalues as λ := |λ−1 | = |λ0 | and μ := |λ−2 | = |λ1 |.
5.6 Weighted 2-Norm
Definition The family of functions { f m }m∈Z is not orthonormal with respect to the
1
standard inner product 0 f m (v)gn (v) dv. To turn it into one (and to simplify our
computations) we introduce the weighted inner product
1
f, g :=
x −2v
f (v)g(v) dv,
(14)
y
0
and its associated 2-norm f 2 := f, f 1/2 . We will not use the standard inner
product and 2-norm in the core of the proof, so this choice of notation should create
no confusion. However, as a constant reminder we will use the term weighted 2-norm.
Lemma 9 There is a finite constant G > 1 such that the standard 2-norm of a function
f is bounded between G1 f 2 and G f 2 (the weighted 2-norm).
u
Proof It follows since the function xy is bounded away from 0 and ∞ when
0 ≤ u ≤ 1.
As a corollary, we get
Lemma 8 The operator T is the limit of {An } in the weighted operator norm:
G
T − An 2 ≤ √ .
n
123
M. Aspenberg, R. Pérez
With the new notation in place, we can interpret the family of eigenfunctions of T
as follows.
Lemma 10 The family { f m } forms a Hilbert basis of L 2 [0, 1].
Proof Let
an arbitrary function in L 2 [0, 1]. Since x ∈ (−1, 0), the function
y fu be
−π
iu
g(u) = x e
is continuous and bounded from above, so f g ∈ L 2 [0, 1]. After
rescaling the standard basis of L 2 [−π, π ], we obtain the representation
( f g)(u) =
cm · e2mπ iu ,
m∈Z
which implies
f (u) =
u
cm · xy e(2m+1)π iu =
cm · f m (u).
m∈Z
m∈Z
Thus, the family { f m } is complete, and since it is orthonormal the claim follows
because L 2 [0, 1] with the weighted product is a Hilbert space (see, e.g., [2, Theorem
4.13]).
Note that for m ≥ 0 the pair of functions f −(m+1) , f m are complex conjugate and
their eigenvalues have the same magnitude. As a consequence, a convenient basis for
the subspace L 2R [0, 1] ⊂ L 2 [0, 1] of real-valued functions is
x u cos (2m + 1)π u , x u sin (2m + 1)π u
y
y
m≥0
.
6 The Functional Approach
In this section we establish the lower bound on the exponential rate of decay of the
sequence an /(n − 2)!. The long proof of Lemma 11 interferes with the flow of logic,
and is consequently deferred to the next section.
Definition The eigenfunctions f −1 and f 0 with largest eigenvalue λ span a complex
two-dimensional subspace of L 2 [0, 1]. Let E ⊂ L 2R [0, 1] denote the real slice of this
u
u
subspace generated by x cos π u , x sin π u . The space E ⊥ spanned by all
y
y
other eigenfunctions is orthogonal to E, so that (by Lemma 10) L 2R [0, 1] = E ⊕ E ⊥ .
The projections onto E and E ⊥ (with respect to our new weighted inner product) are
denoted P and P ⊥ , respectively. By Parseval’s theorem we can define the angle θn (in
the range [0, π/2]) by any of the three equivalent formulas
sin θn :=
123
P ⊥ sn 2
Psn 2
P ⊥ sn 2
, cos θn :=
, tan θn :=
.
sn 2
sn 2
Psn 2
Control of Cancellations that Restrain...
Intuitively, the closer θn is to 0, the better sn resembles a function in E.
Now we can describe the strategy of the proof:
Step 1: We use the shape properties of the sequence Sn to show that the angles θn are
bounded away from π/2.
Step 2: The sequence {θn } converges to 0, so the functions sn become progressively
sinusoidal.
Step 3: There is a sequence of indices {n k } such that {|an k |} is comparable to {sn k 2 }.
Meanwhile, sn 2 ≥ (λ−ε)n for arbitrarily small ε, and the result will follow.
6.1 Step 1 (θn ≤ < π/2)
Fix n. If Sn has a sign change, let (a, b) denote its locus. Then the value z n
:= a/(n − 1) ∈ [0, 1) is such that sn (u) · sin(π(u − z n )) never changes sign. When
there is no sign change, z n := 0 trivially satisfies the same condition. We will show
there is a K > 0 such that for all n,
sn (u), x u sin π(u − z n ) > K sn 2 .
y
(15)
u
The projection Psn is larger than |sn (u), xy sin π(u − z n )|/ √1 , and thus the angle
2
θn is bounded away from π/2 by
√
θn < := arccos( 2K ) < π/2.
The proof of (15) follows from Corollary 12 and Lemma 13 below, using
K = C C .
Lemma 11 The sequences {sn 1 } and {sn ∞ } are comparable in the sense that
there is a constant C such that for all n,
sn ∞ ≥ sn 1 ≥ Csn ∞ .
This is the main technical lemma, and its proof is deferred to Sect. 7.
Corollary 12 The sequences {sn 1 } and {sn 2 } are comparable in the sense that
there is a constant C such that for all n,
1
C sn 2
≥ sn 1 ≥ C sn 2 .
Proof The left side follows from the Cauchy–Schwarz inequality together with
Lemma 9. On the right we have
1
sn 21
≥C
2
sn 2∞
≥C
sn2 (v) dv,
2
0
123
M. Aspenberg, R. Pérez
where C is as in Corollary 12. The integral is the square of the standard 2-norm of sn ,
C 2
sn 22 .
so the last quantity is larger than G
Lemma 13 Let z n be defined as above. Then there is a constant C such that
sn (u), x u sin π(u − z n ) > C sn 1 .
y
Proof For every z ∈ [0, 1], let Mz,ε be the maximum of the function | xy |−u ·| sin π(u −
z)| in a small interval [z − ε, z + ε] ∩ [0, 1]. This quantity is strictly positive and it
varies continuously on both z and ε. Given a fixed z ∈ [0, 1], the Lebesgue measure
of the set
L z,ε := {u ∈ [0, 1] s.t. | xy |−u · | sin π(u − z)| < Mz,ε }
varies continuously on ε because | xy |−u · | sin π(u − z)| is nowhere constant. Then we
can choose ε (depending on z) so that the measure of L z,ε is C/2, where C ≤ 1 is the
constant of Lemma 11. Since [0, 1] is compact, the lower bound M := inf z∈[0,1] Mz,ε
is positive.
Recall that z n is chosen so that sn (u) · sin π(u − z n ) has constant sign. For simplicity,
let
us assume that this sign is positive. Then the weighted inner product
sn (u), x u sin π(u − z n ) equals (compare (14)):
y
1
| xy |−v sn (v) · sin π(v − z n ) dv ≥
| xy |−v sn (v) · sin π(v − z n ) dv >
L z n ,ε
0
M
L z n ,ε
But
⎛
⎞
1
⎜
⎟
|sn (v)| dv = M ⎝ |sn (v)| dv −
|sn (v)| dv ⎠.
0
L z n ,ε
|sn (v)| dv < μ(L z n ,ε )sn ∞ =
C
2 sn ∞
< 21 sn 1
L z n ,ε
by Lemma 11, so we get
sn (u), x u sin π(u − z n ) >
y
M
2 sn 1 .
6.2 Step 2 (θn → 0)
We will show in Lemma 14 that when n is large enough, the sequence {tan θn } enters a
decreasing regime that makes it eventually converge to 0. This establishes the desired
result.
123
Control of Cancellations that Restrain...
First note that
P ⊥ An sn 2
P ⊥ (T sn + (An − T )sn )2
=
P An sn 2
P(T sn + (An − T )sn )2
⊥
μP ⊥ sn 2 + P ⊥ (An − T )sn 2
P T sn 2 + P ⊥ (An − T )sn 2
= ≤ P T sn 2 − P(An − T )sn 2 λPsn 2 − P(An − T )sn 2 tan θn+1 =
because T commutes with the projections P and P ⊥ . Moreover, since θn < , the two
terms in the denominator are comparable to λsn 2 and (An − T )sn 2 , respectively.
In particular, Lemma 8 allows us to remove the absolute value in the denominator by
assuming n is large enough.
We will need two variations (inequalities (16) and (17)) on this basic estimate for
the tangent. Using Lemma 8 again we find that
√
μP ⊥ sn 2 + sn 2 G/ n
√
λPsn 2 − sn 2 G/ n
√
nμ sin θn + G
= √
nλ cos θn − G
⎛
⎞
μ + √n G
sin θn
⎠ tan θn .
=⎝
G
λ − √n cos
θ
tan θn+1 ≤
(16)
(17)
n
Lemma 14 Let n be larger than both constants Ca
9G 2
(λ−μ)2
G2
,
λ2
=
9G 2
(λ−μ)2 cos2 and
+
and suppose further that θn < . Then there are constants
Cb =
0 < ε < 1 and R > 0 such that
√
3G
a) If n sin θn ≥ λ−μ
, then
tan θn+1 < (1 − ε) tan θn .
b) If
√
n sin θn ≤
3G
λ−μ ,
then
R
tan θn+1 < √ .
n
In other words, when n is sufficiently large, each step in the sequence {tan θn }
affords a definite relative decrease, or a slower but absolute decrease.
Proof of Lemma 14
a) The assumptions θn < and n > Ca imply
G
λ−μ
G
.
>λ− √
>λ−
λ− √
3
n cos θn
n cos 123
M. Aspenberg, R. Pérez
The condition
√
n sin θn ≥
3G
λ−μ
is equivalent to
G
λ−μ
μ+ √
<μ+
,
3
n sin θn
so (17) is smaller than
#
μ+
λ−
λ−μ
3
λ−μ
3
$
tan θn =
2μ + λ
tan θn .
2λ + μ
2μ+λ
2λ+μ
< 1, this case is proved.
%
√
√
3G
b) When n sin θn ≤ λ−μ
, we also have n cos θn ≥ n −
(16) gives
Since
9G 2
.
(λ−μ)2
Substituting in
3G
+G
μ λ−μ
tan θn+1 < %
,
9G 2
λ n − (λ−μ)
−
G
2
where the denominator is positive since n > Cb . The result follows.
6.3 Step 3 (| sn k | > cst (λ − ε)n k )
We are ready to prove that there is a constant such that
1
sn (v) dv ≥ (λ − ε)n
(18)
0
along a subsequence of indices. The immediate consequence is factorial growth of |an |,
1
since an = (n − 2)! · 0 sn (v) dv. Equation (18) follows at once from Propositions 15
and 16.
Proposition 15 There is an infinite integer sequence n 1 < n 2 < · · · , and a constant
0 < W ≤ 1 such that for all k,
1
sn (v) dv ≥ W sn 2 .
k
k
0
Proposition 16 For every ε > 0 there are constants H, N > 0 such that for n > N ,
sn 2 > H (λ − ε)n .
The proof of Proposition 15 uses the following auxiliary result:
123
Control of Cancellations that Restrain...
Lemma 17 There is a constant 0 < K < 1, and an infinite integer sequence n 1 <
n 2 < · · · such that for all k,
1
1
Psn (v) dv ≥ K |Psn (v)| dv.
(19)
k
k
0
0
We prove Lemma 17 first, and then Propositions 15 and 16.
Proof of Lemma 17 The projection Psn (u) lies in the eigenspace E generated by the
functions | xy |u · sin(π u) and | xy |u · cos(π u). Therefore, it has the form
Psn (u) = Hn | xy |u · sin(π u + ωn ),
(20)
for some positive constant Hn and phase shift ωn ∈ R/(2π Z). We need some facts
about the weighted sine function f ω (u) := | xy |u sin(π u + ω) (ω ∈ R/(2π Z)):
a) Since
1
0
| f ω (u)| du > 0, the function
1
0 f ω (u) du h(ω) := 1
0 | f ω (u)| du
is well defined for all ω and satisfies |h(ω)| ≤ 1. The equality is attained only at
ω = 0, π ; i.e., at the angles for which f ω (u) is a strictly non-positive or strictly
non-negative function.
b) A two-step integration by parts shows that
log | xy | sin(π u + ω) − π cos(π u + ω)
x u
+ c.
f ω (u) du = | y |
π 2 + log2 | xy |
The linear combination of trigonometric functions in the numerator can be written
as
%
± π 2 + log2 | xy | · sin(π u + ω + ϕ),
(21)
x (or
where the sign is negative exactly when log | xy | < 0, and ϕ = arctan −π
log y &%
ϕ = π/2 when x = −1/2) depends only on | xy |. Since |λ| = 1
π 2 + log2 | xy |,
one antiderivative of f ω (u) is thus given by
±|λ|| xy |u · sin(π u + ω + ϕ).
It follows that there are exactly two phase angles '
ω and '
ω + π for which h(ω) = 0.
The angle '
ω is different from 0, π because h(0) = h(π ) = 1. Thus, for ε0 sufficiently small we can find closed intervals I1 and I2 = I1 + π , both of length π/2, so
◦
◦
that 0, π ∈ I1 ∪ I2 , but '
ω and '
ω + π are more than a distance ε0 away from I1 ∪ I2 . Let
123
M. Aspenberg, R. Pérez
I be the union of I1 , I2 , and the four closed intervals of length ε0 at their endpoints.
Clearly, there is a uniform bound h(ω) > K > 0 for every ω ∈ I. We will prove the
lemma by finding arbitrarily large indices n k for which ωn k ∈ I.
First, let us compute the effect of T on any function in the eigenspace E. Recall
that E is generated by the eigenfunctions f −1 and f 0 , and that
| xy |u cos(π u) = ( f −1 + f 0 )/2 , | xy |u sin(π u) = ( f 0 − f −1 )/2i,
so
#
T (| xy |u
sin(π u)) = (λ0 f 0 − λ−1 f −1 )/2i =
=
=
=
where as in (21), ϕ = arctan
−π
x
log | y |
−| xy |u eiπ u
−| xy |u e−iπ u
$
(
−
2i
log | xy | + π i log | xy | − π i
|λ|2 | xy |u Im (− log | xy | + π i)eiπ u
|λ|2 | xy |u − log | xy | sin(π u)+π cos(π u)
±|λ|| xy |u sin(π u + ϕ),
(or ϕ = π/2 when x = −1/2). A similar
computation yields the same phase ϕ when T is applied to | xy |u cos(π u). By linearity,
T effects the same phase shift on every function f ω ∈ E.
In particular, consider the following. Since the operators T and P commute,
Psn+1 = T Psn + P(An − T )sn = ±Hn |λ|| xy |u sin(π u + ωn + ϕ) + P(An − T )sn ,
where Hn is the constant from (20) modulo a choice of sign as in (21). Now,
P(An − T )sn ∈ E, so the above can be written as
Psn+1 = Hn |λ|| xy |u sin(π u + ωn + ϕ + εn ).
Comparing this with (20), we see that Hn = Hn+1 and
ωn+1 = ωn + ϕ + εn .
√
But θn < < π/2, so Hn = 2Psn 2 (by (20)) is comparable to sn 2 , and
P(An − T )sn 2 is not larger than (An − T )sn 2 . Since Lemma 8 implies that
(An −T )sn 2
approaches 0, we see that the error εn will become arbitrarily small.
sn 2
We have |ϕ| ≤ π/2, so there is a number N , depending only on ϕ, so that the angles
{ϕ, 2ϕ, . . . , N ϕ} cut the circle in N intervals of length ≤ π/2. It follows that for any
x ∈ R/(2π Z), at least one of the angles {x, x + ϕ, x + 2ϕ, . . . , x + N ϕ} must lie in
I1 ∪ I2 .
Now, given any n large enough for
|εn | + · · · + |εn+N | < ε0
123
Control of Cancellations that Restrain...
to hold, choose 0 ≤ r ≤ N so that ωn + r ϕ ∈ I1 ∪ I2 . Then
ωn+r = ωn + r ϕ + (εn + · · · + εn+r )
is in the set I of good angles.
With the above result we are ready to prove Propositions 15 and 16, establishing
(18) and our main result.
Proof of Proposition 15 Consider the sequence {n k } from Lemma 17, truncated in the
beginning so that the last expression in
1
1
1
sn (v) dv = Psn (v) dv + P ⊥ sn (v) dv k
k
k
0
0
0
1
1
⊥
≥ Psn k (v) dv − P sn k (v) dv 0
(22)
0
is positive. Let us evaluate both terms. By Lemma 17,
1
Psn (v) dv ≥ K Psn 1 ≥ K W Psn 2 ,
k
k
k
0
where 0 < W ≤ 1 is a lower bound on the quotient of the 1-norm and weighted
u
2-norm of xy sin(π u + φ) (for an arbitrary phase shift φ). On the other hand, the
triangle and Cauchy–Schwarz inequalities give
1
P ⊥ sn (v) dv ≤ P ⊥ sn 2 .
k
k
0
If n k is sufficiently large, then tan θn k < K W/2 ≤ 1/2, where W is as before, and we
get
|P ⊥ sn k 2 = tan θn k Psn k 2 ≤
KW
2 Psn k 2 .
Plugging these estimates back into (22) gives
1
sn (v) dv ≥ K W Psn 2 −
k
k
KW
KW
KW
2 Psn k 2 = 2 Psn k 2 = 2
cos θn k sn k 2 .
0
But K 2W cos θn k has a positive lower bound W > 0 since tan θn k < 1/2, thus
proving the result.
123
M. Aspenberg, R. Pérez
Proof of Proposition 16 For large enough n, the right-hand side of
sn+1 2 = T sn + (An − T )sn 2 ≥ T sn 2 − (An − T )sn 2
is positive. In fact, the two terms on the right have the bounds
T sn 2 ≥ P T sn 2 = T Psn 2 = λ cos θn sn 2
and
(An − T )sn 2 ≤
G
√
s n n 2
by Lemma 8 , so we arrive at
sn+1 2 ≥ λ cos θn −
G
√
n
sn 2 .
Since θn → 0, the result follows.
7 Proof of Lemma 11
The inequality sn 1 ≤ sn ∞ is trivial, but the opposite direction requires estimates
based on the particular shape of the sequences Sn . Recall that sn is defined as a step
function using the rescaling
sn
j+1/2
n−1
=
Sn ( j)
.
(n − 2)!
For technical reasons we will work instead with a continuous piecewise linear function
sn that approximates sn . First, let
2(n−1)−1
2(n−1) ,
n :=
1
2(n−1)
, rn := 1 − n =
and define sn at the points
2 j−1
2(n−1)
( j = 1, . . . , n − 1) by
sn
2 j−1
2(n−1)
=
Sn ( j)
.
(n − 2)!
The function sn is defined in [n , rn ] by linear interpolation. Note that the interval
[n , rn ] covers almost all of [0, 1], but not quite. In particular, while the interval of
sn is [n , rn ].
integration used to compute sn 1 is [0, 1], the corresponding interval for
However, even if the 1-norms are not necessarily the same, we do have
rn
rn
sn =
n
123
sn .
n
Control of Cancellations that Restrain...
Lemma 11 There exists some N > 0 (only depending on x) such that if n ≥ N then
sn 1 ≥ C
sn ∞ .
This implies Lemma 11 because the 1-norms of sn and sn differ by at most
> 0. Indeed, the integrals of |s | and |
C
sn | in any
C sn ∞/n for some constant
n
2 j−1
2 j+1
interval 2(n−1) , 2(n−1) coincide, except at the left and right endpoints, and around
the (unique) sign-change. Since the discrepancy between integrals in each of these
sn follows. The
three intervals is of the order of n1 , the comparison between sn and sn ∞ , gives
above, together with Lemma 11 , and the fact that sn ∞ = sn 1 ≥ sn 1 − C sn ∞
sn ∞
≥ C
sn ∞ − C = sn ∞ C −
n
n
C
n
,
establishing Lemma 11.
To prove Lemma 11 we will move back and forth between the language of
sn . All statements about the shape of
sequences Sn and the language of functions sn .
the sequence Sn can be interpreted as applying to the function Definition Let the sign change of Sn be located at Z , the extreme at E, and the
inflection at I (these are integer indices between 0 and n − 3). To avoid carrying
1
we follow the convention that indices from 1 to n − 1 are represented
factors of n−1
by capital letters, and their counterparts in the interval [0, 1] by the corresponding
I +1/2
lowercase letter. In particular, we let e := E+1/2
n−1 , i := n−1 , and we let z correspond
Z +1/2
to the zero of sn near n−1 . If Sn has no sign change, no extreme, or no inflection,
we let z = n , e = n , or i = rn as necessary.
Idea of the proof Our definition of inflection yields a natural notion of intervals
of concavity for sn . We will find a suitable lower bound for the integral of |
sn | within
each of these intervals as illustrated in Fig. 8. These bounds are linear combinations of
sn (e)|, and |
sn (rn )|, and they suffice to obtain a bound of the form sn 1 >
|
sn (n )|, |
const.
sn ∞ because
y
|
sn (rn )|,
– |
sn (n )| = |x|
sn (n )|, |
sn (rn )|},
– property (ShD) implies |
sn (e)| ≥ min{|
sn (e)|, |
sn (rn )|} = sn ∞ (= sn ∞ ).
– max{|
sn (n )|, |
For later reference let us call this the Maximum-is-Equivalent-to-Extreme Principle,
or meep for short.
Note The proof of Lemma 18 below uses property (ShE) in an essential way. This
assumes that Sn does have an extreme and an inflection. Sequences Sn with no extreme
or no inflection are taken care of before Lemma 18, in the observation preceding
inequality (23).
Basic assumption If there is an extreme, we will assume that it is a local minimum
(a local maximum is treated in an analogous manner). If there is also an inflection,
we assume that E < I . This yields the three cases of Fig. 8, and elicits Lemma 18
123
M. Aspenberg, R. Pérez
eiz
ezi
zei
Fig. 8 These continuous functions are caricatures of the piecewise linear function sn in the different
situations of cases 1, 2, 3. The 1-norm is bounded from below by the shaded areas limited by the linear
functions n . We show that these areas are comparable to the maximum sn ∞
in the form presented here, which discusses inflections near the far left (this uses the
inequalities near r = 0 in (ShE)). The situation E ≥ I involves the mirror image of
Fig. 8 and a discussion of inflections near the far right (this uses the inequalities near
r = n −2 in (ShE)). It is entirely analogous to the discussion for E < I . (The situation
E = I is included in the other case simply because of how we defined the location of
inflections and extremes.)
Note from the basic assumption on Sn that Sn−1 has a down-change at E and an
extreme at I . In particular, Sn−1 is negative to the right of E (so the extreme at I is a local
minimum by (ShB)), and it is positive to the left of E, so Sn−1 (0) = xy Sn−1 (n−2) > 0.
Then Sn−1 (1) < Sn−1 (0) by property (ShE), and Sn−1 (1) > 0 because it is at, or to
the left of, the down-change.
Definition For given n and a, b ∈ [0, 1] we denote by n [a, w 1 , b,
w2 ] : R −→ R
the linear function whose graph is the straight line from a, w1 to b, w2 . Also, let
μ(a, b) be the length b − a of the interval [a, b].
The function sn can adopt one of three forms depending on the order of e, i, and z
(keep in mind that e = z).
In each case we split [0, 1] into the same three intervals [n , e], [e, i], and [i, rn ],
and describe linear functions on these intervals that bound |
sn | from below (compare
Fig. 8).
Case 1 (e < i ≤ z): The linear functions are
(a) On [n , e]: [n , 0, e,
sn (e)]. The area of the triangle is
1
2
· |
sn (e)| · μ(n , e)
(b) On [e, i]: [e,
sn (e), i, 0]. The area of the triangle is
1
2
123
· |
sn (e)| · μ(e, i)
Control of Cancellations that Restrain...
(c) On [i, rn ]: [z, 0, rn ,
sn (rn )]. The slope is
triangles are
|
sn (rn )|
·
μ(z, rn )
μ(i, z)2
μ(z, rn )2
+
2
2
|
sn (rn )|
μ(z,rn )
≥
so the areas of the two
|
sn (rn )| μ(i, rn )2
·
μ(z, rn )
4
≥
1
4
· |
sn (rn )| · μ(i, rn )
The first inequality above follows from a 2 + b2 ≥ (a + b)2 /2.
Case 2 (e < z < i): The linear functions are
sn (e)]. The area of the triangle is
(a) On [n , e]: [n , 0, e,
· |
sn (e)| · μ(n , e)
1
2
(b) On [e, i]: [e,
sn (e), z, 0]. The slope is
are
|
sn (e)|
·
μ(e, z)
|
sn (e)|
μ(e,z) .
μ(e, z)2
μ(z, i)2
+
2
2
≥
The areas of the two triangles
|
sn (e)| μ(e, i)2
·
μ(e, z)
4
≥
1
4
· |
sn (e)| · μ(e, i)
The first inequality above follows as in Case 1(c).
sn (rn )]. The area of the triangle is
(c) On [i, rn ]: [i, 0, rn ,
1
2
· |
sn (rn )| · μ(i, rn )
Case 3 (z < e < i): The linear functions are
|
sn (e)|
sn (e)]. The slope is μ(z,e)
so the areas of the two triangles
(a) On [n , e]: [z, 0, e,
are
|
sn (e)|
·
μ(z, e)
μ(n , z)2
μ(z, e)2
+
2
2
≥
|
sn (e)| μ(n , e)2
·
μ(z, e)
4
≥
1
4
· |
sn (e)| · μ(n , e)
(b) On [e, i]: [e,
sn (e), i, 0]. The area of the triangle is
1
2
· |
sn (e)| · μ(e, i)
n−2+1/2
(c) On [i, rn ]: [a,
sn (a), b,
sn (b)], where a := n−3+1/2
n−1 , b :=
n−1 .
The region bounded by this consists of a rectangle of base μ(i,
r
n
) and
n−1
b , where
·
sn−1 '
height sn (rn ), plus a triangle of base μ(i, rn ) and slope n−2
123
M. Aspenberg, R. Pérez
'
b :=
n−3+1/2
n−2 .
The area of sn is at least
|
sn (rn )| · μ(i, rn ) +
n−1
n−2
sn−1 '
b ·
· μ(i,rn )2
2
In each of the three cases, sn 1 is bounded from below by a sum of three expressions. A trivial weakening of these expressions allows us to consolidate cases Case 1
and 2 into one inequality:
sn (e)| · μ(n , i) + |
1&2 : If (e < z), then sn 1 ≥ 41 |
sn (rn )| · μ(i, rn ) .
n−1
sn (e)| · μ(n , i) + |
3 : If (z < e), then sn 1 ≥ 41 |
sn (rn )| · μ(i, rn ) + n−2
·
sn−1 '
b · μ(i, rn )2 .
Observation Cases 1 and 2 allow the possibility that Sn has no extreme. In that
sn (n )| and |
sn (rn )|. Here,
situation e = n , and the bound for both cases depends on |
the conclusion of Lemma 11 follows directly from the meep (Idea of the proof). For
the rest of the proof we can assume now that Sn has an extreme.
sn 1 ≥
If Sn has no inflection then i = rn and the consolidated estimates give 1
follows as well. For the rest of the proof
|
s
(e)|.
Then
the
conclusion
of
Lemma
11
n
4
we can assume now that Sn has an inflection. In particular, we are at freedom to use
property (ShE) from now on.
, then we can neglect the
If the interval [n , i] has definite size, say i ≥ min{|x|,y}
6
portion containing μ(i, rn ) in both bounds, and simplify all three cases to
sn 1 ≥ 14 μ(n , i)|
sn (e)| ≥
1 min{|x|,y}
|
sn (e)|,
4
12
(23)
1
since μ(n , i) ≥ min{|x|,y}
− 2(n−1)
≥ min{|x|,y}
, when n is large enough. Equation (23)
6
12
yields the conclusion of Lemma 11 in this situation, again because of the meep.
To finish the proof we have to consider what happens when μ(n , i) is small so
. In this situation we neglect the portion of the bounds 1&2 and 3
that i < min{|x|,y}
6
n−1 sn−1 '
b satisfies one of
that contains μ(n , i), and show that either |
sn (rn )| or n−2
·
the lower bounds (24), (27), (28), all of which have the form constant times |
sn (e)|.
sn ∞ , and we will be
Just as above, this implies a lower bound for sn 1 in terms of done.
As a note of caution, we remark that for this final step we revert to the language
n−1
b in
sn (rn ) and n−2
·
sn−1 '
of sequences Sn . Thus, instead of seeking bounds for terms of sn (e), we get equivalent bounds (with normalizing constants) for Sn (n − 2)
and Sn−1 (n − 3) in terms of Sn (E). Note here that the value μ(i, rn ) is rescaled to
(n − 2) − I .
Lemma 18 below uncovers the key relationship between Sn and Sn−1 . The proof is
delayed till the end of this section so as not to interfere with the proof of Lemma 7.
)
yn *
1
Lemma 18 If I < min |x|n
6 , 6 then |Sn−1 (1)| ≤ 2 |Sn (0)|.
123
Control of Cancellations that Restrain...
Recall from (10) that Sn (1) = Sn (0) − Sn−1 (1), so under the conclusion of
Lemma 18 we get |Sn (0)| ≥ 23 |Sn (1)|. Hence, when |Sn (1)| ≥ |Sn 2(E)| we get
|Sn (n − 2)| =
y
2 y
y
|Sn (0)| ≥
|Sn (1)| ≥
|Sn (E)|.
|x|
3 |x|
3|x|
(24)
It remains only to consider what happens when
|Sn (1)| ≤
|Sn (E)|
.
2
(25)
We will give two estimates for this situation. The first (inequality (27)) holds for
cases 1&2. The second (inequality (28)) holds for case 3.
Case 1 &2: Since there is no change in convexity on (I, n − 2), we have
|Sn (n − 2) − Sn (I )|
≥ |Sn−1 (n − 2)|.
(n − 2) − I
Since the extreme lies lower than the inflection (Sn (E) ≤ Sn (I )), and Sn (E) < 0 <
Sn (n − 2), we can replace the numerator on the left with
y
y
|Sn (n − 2)| + |Sn (E)|
≥ |Sn−1 (n − 2)| =
|Sn−1 (0)| ≥
|Sn−1 (1)|,
(n − 2) − I
|x|
|x|
where the last inequality is due to property (ShE). Again, by constant convexity on
(1, E) (recall E < I ), the above is larger than
y |Sn (E) − Sn (1)|
,
|x|
E
and we have
y (n − 2) − I
y (n − 2) − I
|Sn (E) − Sn (1)| ≥
|Sn (E)|
|x|
E
|x|
2E
(26)
by (25).
To
estimate
the
factor
next
to
|S
(E)|
we
use
n
≥
6
and
the
fact
that
E
<
I <
n
)
*
min |x|n/6, yn/6 . For ease of notation, let m := min{|x|, y}, so E, I < mn/6.
|Sn (n − 2)| + |Sn (E)| ≥
y (1 − 2/n − m/6)
y 4−m
4−m
y (n − 2) − I
≥
≥
≥
|x|
2E
|x|
2m/6
|x| 2m
2|x|
3−x
since y ≥ m. The last expression above equals −2x
when x ∈ (−1, −1/2], and
4+x
equals −2x when x ∈ [−1/2, 0). Both functions are continuous and increasing for all
x < 0, so substituting x = −1 into the first and x = −1/2 into the second shows that
4−m
2|x| > 2. Substituting this back into (26) gives
|Sn (n − 2)| ≥ 2|Sn (E)| − |Sn (E)| = |Sn (E)|.
(27)
123
M. Aspenberg, R. Pérez
Case 3: Here, property (ShE) gives a different bound:
|Sn−1 (n − 3)| =
y
y
y |Sn (E)|
|Sn−1 (0)| ≥
|Sn−1 (1)| ≥
,
|x|
|x|
|x| E − Z
and this concludes the proof of Lemma 11 .
(28)
Proof of Lemma 18 Throughout this section the basic assumption has been that
Sn (E) < 0 and E < I , so that Sn−1 has a local minimum at I and a down-change at E.
Recall that this implies Sn−1 is negative to the right of E, and 0 < Sn−1 (1) < Sn−1 (0).
Moreover, suppose that |x| ≤ y so that the hypothesis of Lemma 18 reads I < |x|n/6
(the complementary case is similar). This inequality, together with E < I , gives the
straightforward
y E + x(n − 1 − I ) < y I + x(n − 1 − I ) = I + x(n − 1) < I +
xn
< 0,
6
(29)
which we will use to derive inequality (31). Since y − x = 1 and n/2 ≤ n − 1,
I <
−xn
−x(n/2)
−x(n − 1)
=
≤
,
6
2+y−x
2+y−x
so
−x(n − 1 − I )
≥ 1.
(2 + y)I
The left expression increases if I is replaced by E in the denominator, giving
− 2E ≥ y E + x(n − 1 − I ).
(30)
As stated in (29), the right expression in (30) is negative, so after multiplying by y,
we get
y
−y E
≤ .
(31)
0<
y E + x(n − 1 − I )
2
Using inequality (31) we proceed to prove Lemma 18 as follows. The sequence
Sn−1 is positive and decreasing from 1 to E, so the average in this span is smaller than
the leftmost term Sn−1 (1). Similarly, Sn−1 is negative and increasing from I to n − 2,
so the average of |Sn−1 | in this span exceeds the rightmost value |Sn−1 (n − 2)|. Thus,
E
j=1
Sn−1 ( j) < E · Sn−1 (1) < E · Sn−1 (0)
n−2
y
yE Sn−1 ( j) ≤
= E |x| |Sn−1 (n − 2)| < |x|'
n j=I
123
n−2
yE Sn−1 ( j) , (32)
|x|'
n j=E+1
Control of Cancellations that Restrain...
yE
where '
n := n − 1 − I . Notice that (29) implies 0 < |x|'
n < 1, so the sum of positive
terms (those to the left of E) is less in absolute value than the sum of negative terms
(those to the right of E):
n−2
E
−
S
(
j)
Sn−1 ( j)
n−1
j=E+1
j=1
n−2
yE ≥ 1 − |x|'
Sn−1 ( j) .
n j=E+1
n−2
Sn−1 ( j) =
j=1
(33)
Now, using (32), (33), and (31),
E
|Sn−1 (1)| < Sn−1 ( j) <
j=1
n−2
−y E = y E−|x|'
Sn−1 ( j) ≤
n j=1
n−2
n−2
yE
|x|'
n
yE <
S
(
j)
S
(
j)
n−1
n−1
|x|'
n yE
j=E+1
1 − |x|'
j=1
n
n−2
y Sn−1 ( j) = 21 |Sn (0)|.
2 j=1
Acknowledgments We wish to give thanks to F. Przytycki and the Polish Academy of Sciences for their
hospitality at the conference center in Be˛dlewo, Poland; to the Institut Mittag-Leffler for its hospitality
during the final phase of this project; to M. Benedicks for conversations that led to the project this work
stems from; and to P. Bleher for suggesting the use of integral operator theory as a tool for estimating
asymptotic growth. We are also highly indebted to the anonymous referee for a very thorough reading
of the manuscript, and in particular for calling our attention to a problem with the original statement of
Proposition 7. The referee spotted many places where mathematical, typographical, and stylistic corrections
enhanced considerably the final version. We are appreciative of the fact that someone took the time to read
our work with such thoroughness. Rodrigo Pérez was supported by NSF grant DMS-0701557.
References
1. Aspenberg, M., Pérez, R.: Catalan, Siegel, and Fibonacci. Work in progress.
2. Conway, John B.: A Course in functional analysis. Graduate texts in mathematics, vol. 96, 2nd edn.
Springer, New York (1990)
3. Harary, Frank, Hayes, John P., Wu, Horng-Jyh: A survey of the theory of hypercube graphs. Comput.
Math. Appl. 15(4), 277–289 (1988)
4. Kato, T.: Perturbation theory for linear operators. Classics in mathematics. Springer, Berlin (1995)
5. Misiurewicz, M., Nitecki, Z.: Combinatorial patterns for maps of the interval. Mem. Am. Math. Soc.
94(456), vi+112 (1991)
6. Pérez, Rodrigo A.: A brief but historic article of Siegel. Not. Am. Math. Soc. 58, 558–566 (2011)
7. Siegel, C.L.: Iteration of analytic functions. Ann. Math. 2(43), 607–612 (1942)
8. Sloane, N. J. A.: The on-line encyclopedia of integer sequences, 2010. Published electronically at www.
research.att.com/~njas/sequences/. Accessed 10 Aug 2012
9. Stanley, R.P.: Enumerative combinatorics. Cambridge studies in advanced mathematics, vol. 2. Cambridge University Press, Cambridge (1999)
123