MODULAR FORMS WEI ZHANG NOTES TAKEN BY PAK-HIN LEE Abstract. Here are the notes I took for Wei Zhang’s course on modular forms offered at Columbia University in Spring 2013 (MATH G4657: Algebraic Number Theory). Hopefully these notes will appear in a more complete form during Fall 2014. I recommend that you visit my website from time to time for the most updated version. Due to my own lack of understanding of the materials, I have inevitably introduced both mathematical and typographical errors in these notes. Please send corrections and comments to [email protected]. Contents Classical Modular Forms 1. Lecture 1 (January 23, 2013) 1.1. Introduction 1.2. Basic Definitions 1.3. Eisenstein series 2. Lecture 2 (January 28, 2013) 2.1. Eisenstein series 3. Lecture 3 (January 30, 2013) 3.1. Modular Curves 4. Lecture 4 (February 4, 2013) 4.1. Modular Curves 4.2. Elliptic curves with level structures 5. Lecture 5 (February 6, 2013) 5.1. Heegner points 5.2. Dimension formulas 6. Lecture 6 (February 11, 2013) 6.1. Hecke operators 7. Lecture 7 (February 13, 2013) 7.1. Hecke operators for level 1 7.2. Hecke operators for level N 7.3. Petersson inner product 8. Lecture 8 (February 18, 2013) 8.1. Petersson inner product 8.2. L-functions 9. Lecture 9 (February 20, 2013) Last updated: September 6, 2014. 1 3 3 3 5 6 6 9 9 13 13 15 16 16 17 19 19 21 21 23 25 25 25 27 29 9.1. Hecke operators 9.2. Atkin-Lehner theory 10. Lecture 10 (February 25, 2013) 10.1. Atkin-Lehner theory 10.2. Rationality and Integrality 10.3. Plan Warning: The following lectures have not been edited. 11. Lecture 11 (February 27, 2013) 12. Lecture 12 (March 4, 2013) 13. Lecture 13 (March 6, 2013) 14. Lecture 14 (March 11, 2013) 15. Lecture 15 (March 13, 2013) p-adic Modular Forms 16. Lecture 16 (March 25, 2013) 17. Lecture 17 (March 27, 2013) 18. Lecture 18 (April 1, 2013) 19. Lecture 19 (April 3, 2013) 20. Lecture 20 (April 8, 2013) 21. Lecture 21 (April 10, 2013) 22. Lecture 22 (April 15, 2013) 23. Lecture 23 (April 17, 2013) 24. Lecture 24 (April 22, 2013) 25. Lecture 25 (April 24, 2013) 26. Lecture 26 (April 29, 2013) 27. Lecture 27 (May 1, 2013) 2 29 31 32 32 34 35 35 35 35 35 35 36 36 36 36 36 36 36 36 36 36 36 36 1. Lecture 1 (January 23, 2013) 1.1. Introduction. Last year this course was about class field theory. This year we will focus on modular forms. The two main topics are: (1) classical (holomorphic) modular forms for the full modular group SL2 (Z) and its congruence subgroups. We will be interested in both the analytic and arithmetic theory. (2) p-adic properties and p-adic modular forms. We will not follow any specific textbook closely, but a list of references includes: (1) Diamond, Shurman, A First Course in Modular Forms. Homework will mainly be assigned from this book (among other resources). (2) Serre, A Course in Arithmetic. The last part of this book has a concise explanation of modular forms for SL2 (Z). (3) Shimura, Introduction to the Arithmetic Theory of Automorphic Functions. (4) Lang, Introduction to Modular Forms. 1.2. Basic Definitions. We begin with some basic definitions. Definition 1.1. The upper half plane is H = {τ ∈ C : Im(τ ) > 0}. We will reserve the letter zfor any complex number not necessarily contained in H. a b Any γ ∈ SL2 (R) = : a, b, c, d ∈ R, det = 1 acts on H via c d aτ + b . cτ + d Check that this defines a group action. The key point is Im(τ ) Im(γ · τ ) = |cτ + d|2 γ·τ = using the fact that det(γ) = 1. Moreover this action defines a biholomorphic automorphism of H. The set of biholomorphic automorphisms of H is Aut(H) ∼ = PSL2 (R) = SL2 (R)/{±I}. We will be interested in discrete subgroups of SL2 (Z). More specifically, Definition 1.2. Congruence subgroups are subgroups Γ ⊂ SL2 (Z) that contain Γ(N ) for some positive integer N , where Γ(N ) is the principal congruence subgroup of level N defined by a b a b 1 0 Γ(N ) = ∈ SL2 (Z) : ≡ (mod N ) . c d c d 0 1 We can define various other subgroups that have finite index in SL2 (Z), for example, by changing the condition into a b a b ∗ ∗ Γ0 (N ) = ∈ SL2 (Z) : ≡ (mod N ) . c d c d 0 ∗ Definition 1.3. A weakly modular function of weight k is a function f : H → C such that: (1) (meromorphicity) f is meromorphic on H; 3 (2) (transformation rule) f (γ · τ ) = f aτ + b cτ + d a b = (cτ + d) f (τ ) for all γ = ∈ c d k SL2 (Z). The group SL2 (Z) is generated by two elements, which are the matrices corresponding to translation τ 7→ τ + 1 and inversion τ 7→ − τ1 . 1 1 0 −1 Lemma 1.4. SL2 (Z) = , . 0 1 1 0 With this we can show Lemma 1.5. A fundamental domain of H modulo SL2 (Z) is 1 1 1 D = τ ∈ H : − ≤ Re(τ ) < , |τ | > 1 ∪ τ ∈ H : − ≤ Re(τ ) ≤ 0, |τ | = 1 2 2 2 i.e. under SL2 (Z)-transformation, every point in H is equivalent to exactly one point in D. (Diagram here) A proof can be found in Serre’s book. The key idea is to maximize Im(γ · τ ). Maximum is achieved precisely when it lies in the fundamental domain. Since −I acts trivially on H, we consider PSL2 (Z) = SL2 (Z)/{±I}. Suppose f is a weakly modular function that is holomorphic on H. Transformation by 1 1 γ= shows that f satisfies f (τ + 1) = f (τ ), i.e. f has period 1. We can then consider 0 1 its Fourier expansion: for any τ = x + iy, viewing f as a function of x gives X f (x + iy) = an (y)e2πinx n∈Z where the Fourier coefficients are given by Z 1 Z −2πinx −2πny f (x + iy)e dx = e an (y) = 0 1+iy f (τ )e−2πinτ dτ 0+iy So far we have not used the holomorphicity Rof f . By contour integration over the rectangle 1+iy with vertices iy, 1+iy, iy 0 , 1+iy 0 , we have that 0+iy f (τ )e−2πinτ dτ is independent of y because the integral over the two vertical edges cancel. Hence we can write X f (τ ) = an e2πinτ . n∈Z It is convenient to introduce q = e 2πiτ . Then the Fourier expansion of f is given by X f (τ ) = an q n . n∈Z Geometrically, the map τ 7→ q = e2πiτ sends the upper half plane H into D − {0} where D = {z ∈ C : |z| < 1} is the open unit disk. In fact this induces a biholomorphism H/Z ∼ = D − {0}. Here we see that the singularity around the origin is important. A priori we could have any Fourier expansion. 4 Definition 1.6. f is meromorphic at τ = ∞ if there exists M ∈ Z such that an = 0 for all n < M . f is holomorphic at τ = ∞ if an = 0 for all n < 0. This is equivalent to saying that f (q) extends to a meromorphic (holomorphic) function on the unit disk D. Definition 1.7. A modular form of weight k is a function f : H → C such that: (1) f is holomorphic on H∪{∞} (i.e. the Fourier expansion only has nonnegative powers of q); a b (2) f (γ · τ ) = (cτ + d)k f (τ ) for all γ = ∈ SL2 (Z). c d Let us look at some examples, otherwise we would just be doing function theory. 1.3. Eisenstein series. We associate to τ ∈ H the lattice Λτ = Z + Zτ ⊂ C, which is free abelian of rank 2 since τ has positive imaginary part. Define X X0 Gk (τ ) = w−k = (mτ + n)−k (m,n)∈Z2 w∈Λτ −{0} P where 0 means we are summing over (m, n) 6= (0, 0). Gk satisfies the following transformation property: if we replace τ 7→ γ · τ = aτ +b , cτ +d then Gk (γ · τ ) = (cτ + d)k Gk (τ ). Gk is identically zero when k is odd, so it is only interesting when k is even. We have not addressed convergence issue yet. Lemma 1.8. Gk (τ ) is absolutely convergent if k ≥ 4, and uniformly convergent on any compact subset of H, so it defines a holomorphic function on H. Proof (Sketch). To prove convergence, we can assume τ ∈ D is in the fundamental domain. Then τ ∈D |w|2 = |mτ + n|2 = m2 τ τ + 2mn Re(τ ) + n2 ≥ m2 − mn + n2 = |mρ + n|2 (where ρ is a cube root of unity). Therefore we can bound X0 X0 |mρ + n|−k |Gk (τ )| ≤ |w|−k = m,n m,n which is convergent when k ≥ 3. This proves convergence and uniform convergence over compact subsets. By absolute convergence, we can evaluate limits term-wise. X0 Lemma 1.9. lim Gk (τ ) = n−k = 2ζ(k) where ζ is the Riemann zeta function. τ →∞ n∈Z Corollary 1.10. For k ≥ 4 even, Gk is a modular form of weight k, called the Eisenstein series of weight k. 5 Later we will see that these are essentially the only modular forms. Next we consider the Fourier expansion of Gk (τ ). It is well-known that ∞ 1 1 cos πz 1 X + + = π cot πz = π z d=1 z + d z − d sin πz which has simple at the integers and is holomorphic elsewhere. Be careful not to write P poles 1 the sum as d∈Z z+d because this is not convergent! For a proof of this identity, Diamond-Shurman gives the hint Y z2 sin πz = πz 1− 2 n n≥1 but this is like cheating because the two identities are equivalent – the identity we want is just the logarithmic derivative of this one. Instead we invoke a general theorem. Lemma 1.11 (Special case of Mittag-Leffler). If f : C → C is a meromorphic function having simple poles a1 , a2 , · · · , ai , · · · with residues b1 , b2 , · · · , bi , · · · respectively such that (1) the sequence 0 < |a1 | ≤ |a2 | ≤ · · · ≤ |ai | ≤ · · · goes to ∞; (2) there exist closed contours Cm with length lm and distance Rm to 0 such that lm /Rm < N1 , Rm → ∞, and |f |Cm | < N2 for given positive numbers N1 , N2 , then ∞ X 1 1 f (z) = f (0) + bi + z − ai ai i=1 The proof uses the Cauchy integral formula and is left as an exercise. 2. Lecture 2 (January 28, 2013) 2.1. Eisenstein series. Last time we defined the space of modular forms and gave the example of Eisenstein series X0 X0 (cτ + d)−2k ω −2k = G2k (τ ) = (c,d)∈Z2 ω∈Λτ where Λτ = Z + Zτ . For k ≥ 2, this is absolutely convergent and uniformly convergent on compact sets. Thus G2k ∈ M2k , the space of modular forms of weight 2k. Also recall that G2k (∞) = 2ζ(2k). We want to find the Fourier expansion. We make use of the following identity ∞ ∞ 1 X 1 1 1 X 2z π cot πz = + + = + z d=1 z − d z + d z d=1 z 2 − d2 (the way to remember this is to look at the zeroes and poles of cot πz) which is convergent. Multiplying z on both sides gives ∞ ∞ X X cos πz z2 πz =1+2 = 1 − 2 ζ(2k)z 2k 2 − d2 sin πz z d=1 k=1 ∞ X z2 z2 1 z2 k where we expanded at z = 0 using the geometric series 2 = − = − ( ) . 2 z − d2 d2 1 − dz 2 d2 k=1 6 Incidentally we found a way to evalute the Riemann zeta function! This relates to the Bernoulli numbers as follows. The left hand side of the above equation can be written as t πiz −πiz = 2t eet +1 where t = 2πiz, so πiz eeπiz +e −e−πiz −1 ∞ X t 2 ζ(2k) 2k 1+ t =1−2 t 2 e −1 (2πi)2k k=1 Recall that Definition 2.1. The Bernoulli numbers are defined by ∞ X Bk k t = t . t e − 1 k=0 k! As a corollary, ζ(2k) = − (2πi)2k B2k ∈ π 2k Q× 2 (2k)! for k ≥ 1. Let us go back to the question of finding the Fourier expansion of the Eisenstein series ∞ X X0 X 1 G2k (τ ) = (cτ + d)−2k = 2ζ(2k) + 2 . 2k (cτ + d) c=1 d∈Z (c,d) The series expansion of π cot πz becomes 1 X −1 (z + d) = π cot πz = πi 1 + d∈Z 2 e2πiz − 1 = πi −1 − 2 ∞ X ! e2πinz n=1 2πiz which is valid for Im(z) > 0 (so that |e | < 1). Taking (2k − 1) times derivative with respect to z, we have ∞ X X −2k 2k (z + d) = −(2πi) n2k−1 e2πinz . −(2k − 1)! n=1 d∈Z Taking z = cτ and suming over all positive integers c, we obtain ∞ (2πi)2k X G2k (τ ) = 2ζ(2k) + σ2k−1 (n)q n (2k − 1)! n=1 P where the divisor sum function is defined as σk (n) = 1≤m|n mk . The normalized Eisenstein series of weight 2k is defined to be ∞ G2k (τ ) 2k X E2k (τ ) = =1− σ2k−1 (n)q n . 2ζ(2k) B2k n=1 For example, E4 = 1 + 240 ∞ X σ3 (n)q n ∈ Z[[q]]; n=1 1The left hand side is written this way for simplicity. There will not be any convergence issues after differentiation. 7 E6 = 1 − 504 ∞ X σ5 (n)q n ∈ Z[[q]]. n=1 In general we only have E2k (τ ) ∈ Q[[q]]. The fact that E2k has rational coefficients is important in the second half of the course when we discuss p-adic modular forms. X χ(d) We can generalize modular forms for SL2 (Z) to Γ(N ), for example . (cτ + d)k c≡0 (mod N ) We will prove that, in a certain sense, the Eisenstein series generate all the modular forms. If f is a weakly modular function of weight k for SL2 (Z) that is meromorphic everywhere on H ∪ {∞}, we define its order of vanishing vτ (f ) at τ ∈ H ∪ {∞} as follows. At τ ∈ H, this is just the routine definition for complex-valued functions. For τ = ∞, consider the Fourier expansion ! X X an q n f (τ ) = an q n = q n0 an0 + n>n0 n−∞ where an0 6= 0. Then the order of vanishing is defined to be v∞ (f ) = n0 . We can check that vτ depends only on the SL2 (Z)-orbit of τ . Lemma 2.2. X mτ vτ (f ) = τ ∈D∪{∞} where mτ = k 12 (1) 1 2 1 3 if τ = i if τ = ρ 1 if τ = ∞ or τ ∈ D\{i, ρ} Remark. The weights are present because i and ρ have non-trivial stabilizers. Later we will give a more geometric proof using modular curves and Riemann-Roch. Proof (Sketch). Use contour integration along the boundary of D truncated by a horizontal segment which bounds all the zeroes and poles in D, detouring around ρ, i, ρ + 1 along small circular arcs. We first assume there are no zeroes or poles on the boundary. By the argument principle, Z 0 X 1 f (τ ) dτ = vτ (f ). 2πi f (τ ) τ ∈D\{i,ρ} 1 π Around τ = ρ, ρ + 1, the contribution from the two arcs is 2(− 2π v (f )) = − 31 vρ (f ). 3 ρ Similarly, around τ = i, the contribution is − 12 vi (f ). k Under τ 7→ − τ1 , f (− τ1 ) = τ k f (τ ). The arc on the unit circle has contribution 12 . 2πiτ Under τ 7→ q = e , the horizontal path gives −v∞ (f ). If there are zeroes or poles on the boundary, we detour along small circular arcs such that each equivalence class of zeroes and poles is counted exactly once inside the contour. This handy formula can help us find the dimension and basis of modular forms for SL2 (Z). Theorem 2.3. 8 (1) The graded algebra of all modular forms for SL2 (Z) is ∞ M Mk = C[E4 , E6 ] k=0 i.e. any modular form is a polynomial in E4 and E6 . 1 (E 3 − E62 ) is a cusp form of weight 12. ∆(τ ) 6= 0 for τ ∈ H and has a (2) ∆ = 1728 4 simple zero at ∞. Proof. If k ≤ 2, there is no integral solution to Equation (1), so there are no non-zero modular forms of weight ≤ 2 for SL2 (Z). The same fact for Γ0 (2) is important to the proof of Fermat’s Last Theorem! If k = 4, the only solution is vρ (E4 ) = 1 and E4 doesn’t vanish at any other point. We have M4 = CE4 . If k = 6, the only solution is vi (E6 ) = 1. We have M6 = CE6 . If k = 8, the only solution is vρ (f ) = 2. Since E42 and E8 have the same constant term 1, we have E42 = E8 (which implies a relation between σ3 and σ7 ). If k = 10, we have E4 E6 = E10 . If k ≥ 12, we see that Mk = Sk ⊕ CEk where Sk is the space of cusp forms of weight k. f If f ∈ Sk , by definition v∞ (f ) ≥ 1. But v∞ (∆) = 1 and vτ (∆) = 0 for all τ ∈ H, so ∆ is holomorphic everywhere and the map Mk−12 → Sk given by f 7→ f ∆ is an isomorphism. Thus Mk = ∆Mk−12 ⊕ CEk , so it suffices to prove Ek is a polynomial of E4 and E6 , and the theorem will follow by induction on k. But the above splitting is not unique. We only required that Ek is not a cusp form. Since the equation 4a + 6b = k always has a solution in non-negative integers if k ≥ 4 is even, we rewrite Mk = ∆Mk−12 ⊕ CE4a E6b which finishes the proof of (1). (2) is trivial. We set E4 (τ )3 1 = + ··· . ∆(τ ) q Then j is holomorphic for τ ∈ H with v∞ (j) = −1. Next time we will see that it defines an isomorphism SL2 (Z)\H ∪ {∞} ∼ = P1 . j(τ ) = 3. Lecture 3 (January 30, 2013) 3.1. Modular Curves. We will discuss modular curves, which are Riemann surfaces over C. So far we have studied modular forms over SL2 (Z), but the definition works if we replace it by congruence subgroups2, for example Γ(N ) ⊂ Γ1 (N ) ⊂ Γ0 (N ) 2Holomorphicity at cusps requires some care. See Chapter 1.2 of Diamond-Shurman for details. 9 for any N ≥ 1, where a Γ(N ) = c a Γ1 (N ) = c a Γ0 (N ) = c b a ∈ SL2 (Z) : d c b a ∈ SL2 (Z) : d c b a ∈ SL2 (Z) : d c b 1 ≡ d 0 b 1 ≡ d 0 b ∗ ≡ d 0 0 1 (mod N ) ; ∗ 1 ∗ ∗ (mod N ) ; (mod N ) . Note that the principal congruence subgroup Γ(N ) is the kernel of the map mod N SL2 (Z) −→ SL2 (Z/N ). More generally, if we define the Borel subgroup ∗ ∗ B0 (Z/N ) = ⊂ SL2 (Z/N ) 0 ∗ and denote 1 ∗ B1 (Z/N ) = ⊂ SL2 (Z/N ); 0 1 1 0 B(Z/N ) = ⊂ SL2 (Z/N ); 0 1 then Γ∗ (N ) is the preimage of B∗ (Z/N ) ⊂ SL2 (Z/N ), where ∗ = 0, 1 or nothing. These subgroups all have finite indices. To find [SL2 (Z) : Γ∗ (N )] for ∗ = 0, 1 or nothing, note that it is equal to the indices e ∗ (Z/N )] [SL2 (Z/N ) : B∗ (Z/N )] = [GL2 (Z/N ) : B e ∗ is the corresponding subgroup of GL2 (Z/N ). It is a standard exercise to where the latter B find the order of GL2 (Z/N ). For N = p a prime, the order is (p2 − 1)(p2 − p), the number of bases of (Z/p)2 . For N = pk a prime power, the order is ( Np )4 (p2 − 1)(p2 − p) = N 4 (1 − p−1 )(1 − p−2 ). Q In general, | GL2 (Z/N )| = N 4 p|N (1 − p−1 )(1 − p−2 ). e 0 (Z/pk )| = (N (1 − p−1 ))2 · N = N 3 (1 − On the other hand, for N = pk a prime power, |B Q −1 2 e 0 (Z/N )| = N 3 p−1 )2 , so in general |B p|N (1 − p ) . Therefore, we have Y e 0 (Z/N )] = N [SL2 (Z) : Γ0 (N )] = [GL2 (Z/N ) : B (1 + p−1 ). p|N Let Γ ⊂ SL2 (Z) be a congruence subgroup containing ±I. We equip YΓ = Γ\H with the quotient topology given by π : H → Γ\H, i.e. U ⊂ YΓ is open if and only if π −1 (U ) ⊂ H is open. It follows that π is an open map. Lemma 3.1. Under this topology, YΓ is Hausdorff. 10 Proof (Sketch). We show that the action of Γ on H is properly discontinuous, i.e. for any τ1 , τ2 ∈ H (possibly τ1 = τ2 ), there exist neighborhoods U1 and U2 of τ1 and τ2 respectively such that for γ ∈ Γ, γU1 ∩ U2 = ∅ if and only if γτ1 = τ2 . Thus π(U1 ) and π(U2 ) separate any π(τ1 ) 6= π(τ2 ). If this is true for Γ = SL2 (Z), then it is true for any subgroups. If τ1 and τ2 are in the interior of the fundamental domain, then it is clearly true. If they are on the boundary, we use two half-disks. Definition 3.2. The stabilizer group of τ ∈ H is Γτ = {γ ∈ Γ : γτ = τ } ⊂ Γ. Lemma 3.3. Γτ is finite cyclic. Proof. Consider H = SL2 (R)/ SO(2) (SO(2) is a compact abelian group isomorphic to S 1 ). Then Γτ = Γ ∩ γ −1 SO(2)γ is the intersection of a discrete group and a compact group, hence is finite. But any finite subgroup of S 1 must be cyclic. We will see later that Γτ /{±I} is isomorphic to one of {1}, Z/2, Z/3. Up to SL2 (Z)equivalence, the only points with non-trivial stabilizers are i and ρ. Definition 3.4. τ ∈ H is an elliptic points if Γτ / ± I is nontrivial. Lemma 3.5. There are only finitely many elliptic points modulo Γ. Proof. This is true for SL2 (Z), hence true for any subgroup of finite index. Recall that a Riemann surface is defined to be 1-dimensional complex manifold, and charts are local coordinates for each point of YΓ which are compatible. It is a completely routine process to check that YΓ is a Riemann surface. Every point of YΓ is of the form Γτ for τ ∈ H. If τ is not elliptic, applying the proof of Lemma 3.1 to τ1 = τ2 = τ shows that τ has a neighborhood U such that γU ∩ U 6= ∅ implies γτ = τ . Then U gives a local coordinate at τ . If τ is elliptic, locally its neighborhood is D/µ2 or D/µ3 with local coordinate given by z 7→ z 2 or z 3 . (Diagram here) Example 3.6. For Γ = SL2 (Z), YΓ = SL2 (Z)\H. Recall that the j-function is defined as E43 . Viewing j as a function on YΓ , the order of vanishing is given by ∆ 1 vπ(ρ) (j) = vρ (j). 3 We will simply accept compatibility of these local charts... YΓ is a non-compact Riemann surface. For Γ = SL2 (Z), j is holomorphic everywhere on YΓ . We want to compatify YΓ . Recall “one-point compactification” from topology: if X is a e = X ∪ {∞} where the open sets of X e are the open sets of X topological space, consider X e is compact. together with {∞}∪ cocompact open sets of X. This implies X We will do some sort of one-point compactification for H modulo SL2 (Z). Define H∗ = H ∪ P1 (Q) = H ∪ Q ∪ {∞}. 11 Note SL2 (Z) acts on P1 (Q): for [x, y] ∈ P1 (Q), a b x ax + by = c d y cx + dy so P1 (Q) = SL2 (Z) · ∞. The topology on H∗ is generated by open sets of H, sets of the form RM = {∞} ∪ {τ ∈ H : Im(τ ) > M } for all M > 0, and all possible SL2 (Z)-translates of RM . Then H∗ is compact. In terms of the fundamental domain, D ∪ {∞} is compact (same proof as onepoint compactification). This implies XΓ := Γ\H∗ = YΓ ∪ (Γ\P1 (Q)). The points of Γ\P1 (Q) are called cusps. Since π : H∗ → XΓ is surjective, XΓ is compact for Γ = SL2 (Z). For congruence subgroups Γ, XΓ is a union of finitely many translations of D ∪ {∞}, hence compact as well. consider the local coordinate at ∞. Since the stabilizer SL2 (Z)∞ is given by Next we 1 ∗ 1 NZ ± , in general we have Γ∞ = Γ ∩ SL2 (Z)∞ ⊃ for some positive 0 1 0 1 1 h∞ integer N . Let h∞ be the minimal positive integer such that ∈ Γ∞ , called the 0 1 width or period. Give local coordinate at ∞ by q = e2πiτ /h∞ . In general, for τ ∈ P1 (Q), hτ is defined by moving τ to ∞ by SL2 (Z). To summarize, we have shown that Γ\H = YΓ ⊂ XΓ = Γ\H∗ are Riemann surfaces, and XΓ is compact. We are interested in understanding the invariants of XΓ as a Riemann surface, e.g. its genus. Notation. For Γ = Γ∗ (N ) where ∗ = 0, 1 or nothing, we denote Y∗ (N ) = YΓ and X∗ (N ) = XΓ . Theorem 3.7. The map j : X0 (1) → P1 sending SL2 (Z)τ 7→ j(τ ) and ∞ 7→ ∞ is an isomorphism. Proof. This is a degree 1 map between two compact Riemann surfaces. We use this to study the genus of modular curves via coverings. Suppose ±I ∈ Γ1 ⊂ Γ2 and write X1 = XΓ1 , X2 = XΓ1 . Then φ : X1 → X2 is a covering map of degree deg(φ) = [Γ2 : Γ1 ], since for a non-elliptic point τ we have #(Γ2 \Γ1 τ ) = [Γ2 : Γ1 ], and there are only finitely many elliptic points. To calculate the genus, we use the Riemann-Hurwitz formula. It is enough to calculate ramifications, which is where we need to use the local coordinates. 2g1 − 2 = (2g2 − 2) deg(φ) + deg Ram P P where deg Ram = x∈X1 (ex − 1) = y∈X2 deg Ry , deg Ry = x∈φ−1 (y) (ex − 1), and ex is the ramification degree at x ∈ X. Note that the ramification points are a subset of the fibers over elliptic points of X2 (but the preimage of an elliptic point of X2 may or may not be an elliptic point of X1 since Γ1 is smaller). Thus the ramification degrees must divide 2 or 3, the only possible orders of elliptic points. If there were elliptic points of order 4 we would be dead! To compute the genus of any modular curve, we take Γ2 = SL2 (Z) and let d = deg(φ). P 12 For h = 2, 3, define eh to be the number of elliptic points in the fiber over the order h elliptic point Ph of SL2 (Z)\H∗ , where P2 = i and P3 = ρ. Then the number of non-elliptic h h points in the fiber over Ph is d−e , so Rh = (h − 1) d−e . h h (Diagram here) P Define e∞ to be the number of cusps in X1 . Then R∞ = (ex − 1) = d − e∞ . Plugging in everything, X h−1 1 2 (d − eh ) = −2d + (d − e2 ) + (d − e3 ) + (d − e∞ ) 2g1 − 2 = −2d + h 2 3 h=2,3,∞ So we have proved Theorem 3.8. g(XΓ ) = 1 + d 1 1 1 − e2 − e3 − e∞ . 12 4 3 2 Next time we will compute this X0 (N ), somewhat the most useful case. In this case, the elliptic points are interesting. We will apply the so-called “moduli interpretation” using elliptic curves. In general it is hard to calculate eh . It is easy if Γ is a normal subgroup, e.g. Γ(N ), because every point in a fiber will either be simultaneously elliptic or non-elliptic. But Γ0 (N ) is not normal. 4. Lecture 4 (February 4, 2013) 4.1. Modular Curves. Last time we constructed the modular curve XΓ , which is a compact Riemann surface with genus given by d e2 (Γ) e3 (Γ) e∞ (Γ) − − − 12 4 3 2 where d is the degree of the natural map XΓ → XSL2 (Z) , e2 , e3 are the number of elliptic points of order 2, 3 respectively, and e∞ is the number of cuspidal points of XΓ . If Γ is normal, e.g. Γ(N ), everything is easy to compute. Today we will do the example of X0 (N ) = Γ0 (N )\H∗ . g(XΓ ) = 1 + Theorem 4.1. For Γ = Γ0 (N ), we have ( 0 if 4 | N, e2 (Γ0 (N )) = Q −4 if 4 - N, p|N 1 + p ( √ 0 if (Q( −1), N ) does not satisfy the Heegner condition, √ = 2#{odd p|N } if (Q( −1), N ) satisfies the Heegner condition, ( 0 e3 (Γ0 (N )) = Q if 2 | N or 9 | N, p|N 1+ ( 0 = 2#{odd p|N } −3 p otherwise, √ if (Q( −3), N ) does not satisfy the Heegner condition, √ if (Q( −3), N ) satisfies the Heegner condition, 13 −4 2 where we define = 0; otherwise p∗ is the Legendre symbol, and X N e∞ (Γ0 (N )) = φ gcd d, , d = 0 and −3 3 d|N where φ(n) = P c|n 1. Definition 4.2. The “Heegner condition for (K, N )” means for every p | N , either p is split in K or p is ramified with p || N . Remark. This condition is important for the development of the Gross-Zagier formula. To prove these formulas it is better to use the “moduli interpretation of Y0 (N )”, at least as a set. For N = 1, we can give meaning to SL2 (Z)\H as the “moduli” of elliptic curves over C, i.e. the set of 1-dimensional complex tori C/Λ for some lattice Λ modulo isomorphism classes as Riemann surfaces. Equivalently, this is the set of lattices Λ ⊂ C modulo homothety, namely Λ1 ∼ Λ2 if and only if there exists λ ∈ C× such that λΛ1 = Λ2 . Indeed, we can a bijection by sending τ ∈ SL2 (Z)\H to Λτ = Z + Zτ . Replacing establish a b τ by γτ where ∈ SL2 (Z), the lattice becomes c d Z+Z aτ + b 1 1 = (Z(cτ + d) + Z(aτ + d)) = (Z + Zτ ), cτ + d cτ + d cτ + d i.e. 1 Λτ cτ + d and so the map is well-defined. Bijectivity can then be verified easily. Λγτ = Remark. A function f on H satisfying the weight k condition f (γτ ) = (cτ + d)k f (τ ) can be interpreted as a function g on the set of lattices of homogeneous degree −k g(λΛ) = λ−k g(Λ). Given g, we can define f on H by f (τ ) = g(Λτ ). Then 1 Λτ ) = (cτ + d)k g(Λτ ) = (cτ + d)k f (τ ) f (γτ ) = g(Λγτ ) = g( cτ + d and vice versa. We can further interpret this as a function h on the set of equivalence classes (E, ω) of elliptic curves with a 1-form 0 6= ω ∈ H 0 (E, Ω1E/C ) such that h(E, λω) = λk h(E, ω). Given a function h, how do we get f ? To any given τ ∈ H we associate the elliptic curve E = C/Λτ and the canonical 1-form dz, which descends to E because it is invariant under translation, and define f (τ ) = h(E, dz). Then C/ cτ1+d Λτ is isomorphic to C/Λτ via multiplication by cτ + d, so 1 h(C/Λγτ , dz) = h(C/ Λτ , dz) = h(C/Λτ , (cτ + d)dz) = (cτ + d)k h(C/Λτ , dz), cτ + d 14 i.e. f (γτ ) = (cτ + d)k f (τ ). Note that we have not put any holomorphic conditions on H or at the cusps. We are just trying to interpret the weight k condition by considering the points of the modular curves as a set. 4.2. Elliptic curves with level structures. By definition, an elliptic curves with Γ∗ (N )level structure (∗ = 0, 1 or nothing) is a pair (E, ι∗ ) where if ∗ = 0, cyclic subgroup of order N (of E[N ]) 3 ι∗ = order N point (of E[N ]) if ∗ = 1, a Z/N -basis of E[N ]: (α, β) ∈ E[N ]2 with Z/N α + Z/N β = E[N ] if ∗ = nothing, where E[N ] is the subgroup of N -torsion. If E = C/Λ with group structure induced by the complex numbers, then E[N ] = N1 Λ/Λ ∼ = (Z/N )2 as abelian groups. The equivalence relation is given in the obvious way: two pairs (E1 , G1 ) and (E2 , G2 ) are φ isomorphic if there is an isomorphism E1 → E2 with φ(G1 ) = G2 . ∼ Lemma 4.3. There is a bijection between Y∗ (N ) and {(E, ι∗ )} modulo equivalence given by Γ∗ (N )τ 7→ (C/Λτ , ι∗ ) where (For E = C/Λτ , E[N ] = 1 N + Λτ ι∗ = N1 + Λτ ( 1 , τ ) N N 1 Λ /Λτ N τ = ( N1 Z + if ∗ = 0, if ∗ = 1, if ∗ = nothing. 1 Zτ )/Λτ .) N Proof (Sketch). Let us check this for the Γ0 (N )-level structure, which is the is well-defined a b only case we will use. Let γ = ∈ Γ0 (N ). We have c d 1 1 cτ + d d 1 cτ +d C/ Λτ , −→ C/Λτ , = C/Λτ , = C/Λτ , cτ + d N N N N since N | c and gcd(d, N ) = 1. In fact the same calculation proves injectivity as well: if τ and γτ go to the same pair, then γ must be in Γ0 (N ). Surjectivity can be proved by changing basis. Now we return to the interpretation of elliptic points of X0 (N ) or Y0 (N ). The correspondence above implies that the stabilizer group of any point in the orbit Γ0 (N )τ is isomorphic to Γ0 (N )τ ∼ = Aut(E, G) where G is an order N cyclic subgroup. In the case N = 1, this corresponds to the fact that Aut(E) can only be Z/2, Z/4, Z/6, with the last two cases occurring when E has complex multiplication structure by the Gaussian integers Z[i] or the Eisenstein integers Z[e2πi/3 ] respectively. In general we have 3A Γ(N )-level structure is also called the full level structure. In fact the basis (α, β) is required to have Weil pairing equal to a fixed root of unity. See Section 1.5 of Diamond-Shurman. 15 Theorem 4.4. There is a bijection between the set of elliptic points of X0 (N ) of order h = 2 and the set of ideals N ⊂ Z[i] such that Z[i]/N ∼ = Z/N , and a bijection between the set of elliptic points of X0 (N ) of order h = 3 and the set of ideals N ⊂ Z[e2πi/3 ] such that Z[e2πi/3 ]/N ∼ = Z/N . Thus the number of elliptic points is equal to the number of ideals with certain properties, which are easy to count since Z[i] and Z[e2πi/3 ] are Dedekind domains (even PID!). Assuming this, we can give a proof of Theorem 4.1. Proof of Theorem 4.1 (Sketch). It suffices by the Chinese Remainder Theorem to consider N = pa . For a quadratic field K, we want to find ideals N such that OK /N ∼ = Z/pa is cyclic. There are three situations, depending on whether p is ramified, inert or split. If p is ramified, then p = p2 and N = pb . If b ≥ 2, then p | N and OK /N cannot be cyclic, since OK /(p) is (Z/p)2 . Thus N = p. Now p must exactly divide N because OK /p ∼ = Z/p. 2 ∼ If p is inert, then pOK is a prime ideal with OK /(p) = (Z/p) , so OK /N cannot be cyclic. If p is split, then pOK = pp. Then the only choice of N is either pa or pa , because if N = pi pj then it is contained in (p)min(i,j) , but Ok /(p) is not cyclic. This is how we get precisely the factor 1 + ( dp ), where d is the discriminant of K. The cusps of P1 (Q) are counted by brute-force. Now we explain how to get the bijectivity in Theorem 4.4. The key point is that the extra automorphisms of the pair (E, G) force certain lattices to be fractional ideals. Proof of Theorem 4.4 (Sketch). In the order 2 case, the lattice is Λ = Z[i] = OK for the quadratic field K = Q(i). Our goal is to count the number of subgroups G ⊂ C/OK such that G is cyclic of order N and the pair (C/OK , G) admits non-trivial automorphisms. Since the automorphism group of the lattice OK is generated by multiplication by i (iOK = OK = Z[i]), we want [i] : C/OK → C/OK to preserve G. Writing G = M/OK , where M ⊂ K = Q ⊗ OK is contained in N1 OK , the condition is translated as iM = M. Hence M is a Z[i]-module of K, i.e. a fractional ideal. Therefore, it suffices to count the number of ideals N = M−1 ⊂ OK such that OK /N = M/OK = Z/N . Corollary 4.5. If N = p is a prime, then ( b p+1 c − 1 if p ≡ 1 (mod 12), 12 g(X0 (p)) = p+1 b 12 c otherwise. Note √ the congruence condition on p precisely depends on its decomposition in Q(i) and Q( −3). Corollary 4.6. g(X0 (N )) = 1 if and only if N ∈ {11, 14, 15, 17, 19, 20, 21, 24, 27, 32, 36, 49}. These are important because they are both modular curves and elliptic curves. 5. Lecture 5 (February 6, 2013) 5.1. Heegner points. Last time we used the moduli interpretation on the modular curve. Let us talk a bit more about the Heegner condition defined last time. The points of Y0 (N ) = Γ0 (N )\H can be interpreted as elliptic curves with level structure, i.e. pairs (E, G) where E/C is an elliptic curve and G ⊂ E[N ] is a cyclic subgroup of order 16 N . If E = C/Λ, then the set of N -torsion points is E[N ] = N1 Λ/Λ. Write G = Λ0 /Λ where Λ0 ⊂ N1 Λ. An equivalent but more symmetric way is to consider pairs of elliptic curve (E1 , E2 ) with an (surjective) isogeny φ : E1 → E2 with ker(φ) ∼ = Z/N . Under the notations above, this is C/Λ → C/Λ0 . Define EndC (E) to be {φ : E → E : φ(0) = 0}. Then EndC (C/Λ) = {λ ∈ C : λΛ ⊂ Λ} (so that λ induces the multiplication √ map [λ] : C/Λ √ → C/Λ). Last time we considered K = Q( −1) or Q( −3). In general if K is any imaginary quadratic field with ring of integers OK , then EndC (C/OK ) = {λ ∈ C : λOK ⊂ OK } = OK , since (λ)OK ⊂ OK means λ ∈ OK by inverting ideals. The modular curve X0 (N ) contains pairs of the form (C/OK , C/Λ0 ) where OK ⊂ Λ0 ⊂ 1 O and Λ0 /OK ∼ = Z/N . N K Definition 5.1. A Heegner point attached to OK on X0 (N ) is such a pair (C/OK , C/Λ0 ) where both curves have endomorphism ring OK . This condition can be translated into Lemma 5.2. EndC (C/Λ0 ) = OK if and only if Λ0 is an ideal of K, i.e. a fractional ideal of OK . Proof. If Λ0 is an ideal, then λΛ0 ⊂ Λ0 ⇔ λ ∈ OK by inverting ideals. The converse is easy, since {λ ∈ C : λΛ0 ⊂ Λ0 } ⊃ OK means Λ0 is an ideal. We are reduced to the question of finding ideals Λ0 such that Λ0 /OK ∼ = Z/N . Last time we showed Lemma 5.3. A fractional ideal Λ0 such that Λ0 /OK ∼ = Z/N exists if and only if p | N implies p ramifies with p || N or p is split. In particular, if p has an inert factor, then Λ0 does not exist. This is called the Heegner condition. We summarize our work last time as follows. Let Eh be the set of elliptic points on X0 (N ) of order h. We proved Theorem 5.4. E2 (resp. E3 ) corresponds to the set of Heegner points attached to Z[i] (resp. Z[e2πi/3 ]). Remark. X0 (N ) has a canonical model over Q. X0 (N ) → E/Q is always parametrized by a modular form of weight 2 (using the modularity theorem as a black box!). Mapping the Heegner points into E shows that they are defined over the Hilbert class field of K, and taking trace pushes them down to Q. This is the only systematic way of producing rational points on elliptic curves over Q and provides evidence for the BSD conjecture. 5.2. Dimension formulas. We derive the dimension formulas for modular forms and cusp forms, which is almost trivial after the genus formula, at least for even weights. The idea is to interpret the space of modular forms as the space of global sections of line bundles over Riemann surfaces. For simplicity, assume k is even. Fix a congruence subgroup Γ ⊂ SL2 (Z), and X = XΓ which is a compact Riemann surface. We will use local charts to compute orders of vanishing. Let ΩX be the canonical line bundle (holomorphic differential 1-forms on X), Mk = Mk (Γ) (resp. Sk = Sk (Γ)) be the space of modular forms (resp. cusp forms) of weight k. 17 Lemma 5.5 (Key Lemma). ⊗k/2 (1) The map Mk → H 0 (X, ΩX (k )) given by f 7→ ωf = f (τ )(2πidτ )k/2 , where k := b k4 c2 + b k3 c3 + k2 ∞ ∈ Div(X), is an isomorphism. More precisely, this is the space of meromorphic forms ω of degree k2 with poles div(ω) ≥ −k . ∼ ⊗k/2 (2) The same map gives an isomorphism Sk → H 0 (X, ΩX (k − ∞ )). Proof. For any γ ∈ Γ ⊂ SL2 (Z), define the automorphy factor j(γ, τ ) = cτ + d. Then d(γτ ) = j(γ, τ )−2 dτ . Since f (γτ ) = j(γ, τ )k f (τ ), ωf is invariant under Γ and indeed descends to a meromorphic form of degree k2 on X. Let us express the condition of f being holomorphic in terms of ωf . Choose local coordinate z at elliptic points so that τ = z 1/h . Then 1 k f (τ )(dτ )k/2 = f (z 1/h )(dz 1/h )k/2 = z ( h −1) 2 f (z 1/h )(dz)k/2 . k c. (Note that invariance Hence f (τ ) is holomorphic on H if and only if vz (ωf ) ≥ −b h−1 h 2 under stabilizers shows that f (τ ) has only h-th powers of τ .) ∼ At the cusps, let us only consider ∞. Let h be the width at ∞, i.e. Γ∞ /{±I} = 1 h . The local coordinate is given by q = e2πiτ /h , so dq/q = 2πi/hdτ . If f has 0 1 Fourier expansion f (τ ) = g(q) = a0 + a1 q + · · · , then ωf = f (τ )(2πidτ )k/2 = g(q)(h dq k/2 ) = Cq −k/2 g(q)(dq)k/2 . q Therefore, f holomorphic (resp. zero) at ∞ if and only if vq=0 (ωf ) ≥ − k2 (resp. − k2 + 1). Corollary 5.6. S2 ∼ = H 0 (X, ΩX ), so dim S2 = gX . With this identification, we can apply Riemann-Roch. For any line bundle L on X, χ(L) = h0 (L) − h0 (Ω ⊗ L−1 ) = deg(L) + (1 − gX ) where h0 (L) = dim H 0 (X, L). For k ≥ 2, we have k k k k deg Ωk/2 (k ) = (2g − 2) + b ce2 + b ce3 + e∞ ≥ (2g − 2) + e∞ > 2g − 2. 2 4 3 2 For k ≥ 4 (to study cusp forms), we have k k k k (2g − 2) + b ce2 + b ce3 + ( − 1)e∞ ≥ 2g − 2 + e∞ > 2g − 2. 2 4 3 2 k/2 k/2 Applying to L = Ω (k ) and Ω (k − ∞ ) respectively, we conclude deg Ωk/2 (k − ∞ ) = k k k Corollary 5.7. For k ≥ 2, dim Mk = (k − 1)(g − 1) + b ce2 + b ce3 + e∞ . 4 3 2 k k k For k ≥ 4, dim Sk = (k − 1)(g − 1) + b ce2 + b ce3 + ( − 1)e∞ . 4 3 2 For k odd, the same idea works but the calculations are more complicated. It is most difficult to apply Riemann-Roch to weight 1 forms. 18 6. Lecture 6 (February 11, 2013) 6.1. Hecke operators. Today we will discuss Hecke operators. For congruence subgroups Γ (so Γ(N ) ⊂ Γ ⊂ SL2 (Z) for some N ≥ 1), we have defined Mk and Sk , the spaces of modular forms and cusp forms of weight k respectively. We want to study the endomorphisms of these vector spaces. Denote GL2 (Q)+ = {γ ∈ GL2 (Q) : det γ > 0}. It defines an action on H and has the following property: if γ ∈ GL2 (Q)+ and Γ is a congruence subgroup, then γΓγ −1 ∩ SL2 (Z) is still a congruence subgroup (possibly for a bigger N ), and in particular has finite index. Consider the pairs (G, H) where H is a subgroup of G satisfying the following hypothesis: for every γ ∈ G, γ −1 Hγ and H are “commensurable”, i.e. γ −1 Hγ ∩ H is of finite index in H and γ −1 Hγ. Example 6.1. (GL2 (Q)+ , Γ), where Γ is a congruence subgroup. Example 6.2. (GL2 (Qp ), GL2 (Zp )), where p is a prime number. Let C[H\G/H] = C[G//H] be the space of functions φ : G → C which are bi-H-invariant (i.e. φ(hgh0 ) = φ(g) for all h, h0 ∈ H and g ∈ G) and supported on finitely many double cosets. The last condition is the analogue for φ to have compact support when G is equipped with a topology, e.g. Example 6.2. We can define an algebra structure on C[G//H] by X (φ ∗ ϕ)(g) = φ(gx−1 )ϕ(x) x∈H\G Z which can be interpreted as the integral φ(gx−1 )ϕ(x)dx where H\G is given the counting G measure 4. Then (C[G//H], ∗) is an algebra with identity 1H , the characteristic function of H. In general this may not be commutative. In the case where C[G//H] is commutative, there is essentially only one idea to prove so, which is the Gelfand trick. Theorem 6.3. For (G, H) = (GL2 (Qp ), GL2 (Zp )), C[G//H] is commutative. Proof. The idea is to find an anti-involution σ : G → G, i.e. (xg)σ = g σ xσ , such that every double coset has a representative fixed by σ. In the case of GL2 (Qp ), there is a natural choice – take σ to be transposition on GL2 (Qp ). α a p 0 Lemma 6.4. GL2 (Qp ) = GL2 (Zp ) GL2 (Zp ). 0 pβ α≥β∈Z This immediately implies the second requirement that every double coset has a representative fixed by σ. Note σ defines an action on C[G//H] as well by φσ (g) = φ(g σ ), and is an anti-involution (φ ∗ ϕ)σ = ϕσ ∗ φσ . Since each double coset has an invariant element, σ acts trivially on C[G//H]. Combining these, we conclude that φ ∗ ϕ = ϕ ∗ φ, so C[G//H] is commutative. 4The hypothesis on (G, H) implies every double coset HgH is a finite union of cosets in H\G. See Lemma 5.1.2 of Diamond-Shurman. 19 Remark. (G, H) is called a Gelfand pair, i.e. for all irreducible representations π of G, the space HomH (π, C) is at most 1-dimensional. How do we deal with Example 6.1 where (G, H) = (GL2 (Q)+ , SL2 (Z))? We will only consider the subalgebra generated by functions supported in M2 (Z) ∩ GL2 (Q)+ . This set is certainly bi-H-invariant. Theorem 6.5. This subalgebra is commutative. Lemma 6.6. For n ≥ 1, M [n] = M2 (Z)[n] = {γ ∈ M2 (Z) : det γ = n} is equal to the disjoint unions a α 0 a n/β x Γ Γ= Γ 0 β 0 β β|α>0 αβ=n β|n 0≤x<β where Γ = SL2 (Z). In particular, we have #Γ\M [n] = X β = σ1 (n). β|n (In order to find a double coset representative for any matrix in M [n], note that β will be the gcd of the four entries.) Proof (Sketch). The same proof works by taking σ to be transposition again. Consider the characteristic function 1M [n] for n ≥ 1. Theorem 6.7. (1) 1M [n] is multiplicative, i.e. 1M [n] ∗ 1M [m] = 1M [nm] if gcd(n, m) = 1. (2) For p prime, 1M [p] ∗ 1M [pn ] = 1M [pn+1 ] + p1pM [pn−1 ] as equality in C[G//H]. Proof. (1) We can write M [nm] = M [n] · M [m] 0 −1 0 γm has deter, then γn0−1 γn = γm in an essentially unique way: if γ = γn γm = γn0 γm 1 1 minant 1 and is contained in n M2 (Z) ∩ m M2 (Z) = M2 (Z), hence is in Γ = SL2 (Z). 0 Thus γm ∈ Γγm differ by an element in Γ. (2) We can easily check 1M [p] ∗ 1M [pn ] ≥ 1M [pn+1 ] + p1pM [pn−1 ] , so it is enough to prove they have the same integral, i.e. vol(M [pn ]) vol(M [p]) = vol(M [pn+1 ]) + p vol(M [pn−1 ]) ⇔ ⇔ σ1 (pn )σ1 (p) = σ1 (pn+1 ) + pσ1 (pn−1 ) pn+2 − 1 pn − 1 pn+1 − 1 p2 − 1 · = +p· . p−1 p−1 p−1 p−1 Today we will study Mk and Sk for the full modular group Γ = SL2 (Z). For γ ∈ GL2 (Q)+ , define the weight-k γ-operator f [γ]k (τ ) = (f |γ, k)(τ ) = (det γ)k−1 j(γ, τ )−k f (γτ ) a b where for γ = , j(γ, τ ) = cτ + d is the automorphy factor. Using this notation, the c d weight-k condition for γ ∈ SL2 (Z) means f [γ]k = f . We can verify that f [αβ]k = (f [α]k )[β]k . 20 The underlying fact is that the automorphy factor satisfies the cocycle condition j(αβ, τ ) = j(α, βτ )j(β, τ ). For φ ∈ C[GL2 (Q)+ //Γ], define X f [φ]k = f [γ]k φ(γ) Γ\ GL2 (Q)+ Z which is a finite sum. Again this can be interpreted as a formal integral f [γ]k φ(γ)dγ, GL2 (Q)+ by putting the counting measure on Γ\ GL2 (Q)+ . Lemma 6.8. [φ]k defines operators Mk → Mk and Sk → Sk . Proof (Sketch). If f [γ]k = f for all γ ∈ Γ, then f [φ]k satisfies the same property. We can also check holomphicity at the cusp. Definition 6.9. The Hecke operator Tn is defined to be the operator [φ]k for φ = 1M [n] . Thus Tn is contained in End(Mk ) and End(Sk ). Note that φ = 1pM [pn−1 ] corresponds to p Tpn−1 because k−2 f [pγ]k = det(pγ)k−1 j(pγ, τ )−k f (γτ ) = p2(k−1) p−k f [γ]k since γ and pγ act on H in the same way. With this, Theorem 6.7 translates into Theorem 6.10. (1) Tn Tm = Tnm for gcd(n, m) = 1. (2) For p prime, Tp Tpn = Tpn+1 + pk−1 Tpn−1 . P n Lemma 6.11 (Effect on q-expansion). If f = ∞ n=0 an (f )q ∈ Mk , then Tm f ∈ Mk with n-th Fourier coefficient given by X an (Tm f ) = dk−1 anm/d2 (f ). d|(n,m) In particular, we have a1 (Tm f ) = am (f ). Example 6.12. Let ∆ = q ∞ Y n 24 (1 − q ) n=1 = ∞ X τ (n)q n = q + · · · , where τ is known as the n=1 Ramanujan tau-function. Since S12 ∼ = C∆ is one-dimensional, ∆ must be an eigenform for Tm with eigenvalue τ (m), i.e. Tm ∆ = τ (m)∆ . The eigenvalues must satisfy the properties in Theorem 6.10, so ( τ (mn) = τ (m)τ (n) if (m, n) = 1, τ (pn+1 ) = τ (p)τ (pn ) − p11 τ (pn−1 ). This is Ramanujan’s conjecture, proved by Mordell. 7. Lecture 7 (February 13, 2013) 7.1. Hecke operators for level 1. Last time we considered the double coset space Γ\ GL2 (Q)+ /Γ, where Γ = SL2 (Z). Theorem 7.1. Z[Γ\ GL2 (Q)+ /Γ] is a commutative algebra. 21 We defined the Hecke operator X Tn (f ) = f [β]k β∈Γ\M [n] (which is a finite sum) for f ∈ Mk and n ≥ 1, where f [β]k (τ ) = (det β)k−1 j(β, τ )−k f (β(τ )) P∞ n Lemma P 7.2 (Effect on q-expansion). Let f = ∈ Mk (SL2 (Z)), and denote n=0 an q ∞ n Tm (f ) = n=0 an (Tm f )q . Then X an (Tm f ) = dk−1 anm/d2 . d|(m,n) In particular, a1 (Tm f ) = am and a0 (Tm f ) = σk−1 (m)a0 . From this, we can show an (Tm Tm0 f ) = an (Tm0 Tm f ) which verifies that the subalgebra of End(Mk (SL2 (Z)) generated by Tn for all n ≥ 1 is commutative. Note this is an immediate consequence of Theorem 7.1. Proof of Lemma 7.2. Recall M [m] = {γ ∈ M2 (Z) : det(γ) = m} can be decomposed as a a b Γ . 0 d ad=m 0≤b≤d−1 Hence Tm (f )(τ ) = mk−1 X j(β, τ )−k f (β(τ )) β a b . Since β(τ ) = where β runs through the coset representatives 0 d aτ +b e2πin d and j(β, τ ) = d, we have Tm (f )(τ ) = mk−1 ∞ X X d−k ad=m n=0 k−1 =m X X d−1 X aτ aτ +b , d e2πinβ(τ ) = b an e2πin( d + d ) b=0 −k aτ d dan e2πin( d ) ad=m d|n ∞ X X m k−1 0 = adn0 e2πin aτ d ad=m n0 =0 ∞ X X = qn ak−1 adn0 . n=0 n0 a=n ad=m Example 7.3 (Eisenstein series). Recall that the Eisenstein series of weight k is given by ∞ X (2πi)k X −k Gk (τ ) = w = 2ζ(k) + σk−1 (n)q n (k − 1)! n=1 w∈Λτ −{0} 22 where Λτ = Z + Zτ . It turns out to be an eigenform for Tm , i.e. Lemma 7.4. Tm (Gk ) = σk−1 (m)Gk . Proof. It is clear that if Gk is indeed an eigenform, then the eigenvalue must be σk−1 (m) by comparing the constant terms. In fact we can directly check the formula nm X dk−1 σk−1 = σk−1 (m)σk−1 (n). d2 d|(n,m) Remark. Recall that functions on H satisfying the weight-k condition can be interpreted as functions on lattices of homogeneous of degree −k. With this, we can give a simpler definition of Hecke operator X f (Λ0 ). (Tn f )(Λ) = nk−1 Λ0 ⊂Λ [Λ:Λ0 ]=n For a fixed lattice Λ = Λτ = Z + Zτ ⊂ C, there is a bijection {Λ0 ⊂ Λ of index n} ←→ SL2 (Z)\M [n] a b where we associate to each β = ∈ M [n] the lattice βΛ = span(a + bτ, c + dτ ), which c d is indeed a sublattice of Λ of index n. 7.2. Hecke operators for level N . So far we have been working over SL2 (Z). We can extend Hecke operators to Γ∗ (N ) where ∗ = 0, 1 or nothing. It is enough to study Γ1 (N ), because Γ1 (N ) ⊂ Γ0 (N ) and −1 N 0 N 0 = Γ1 (N 2 ) Γ(N ) 0 1 0 1 a b a b/N ( is sent to ). Recall Γ1 (N ) is normal in Γ0 (N ): there is an exact c d cN d sequence 1 → Γ1 (N ) → Γ0 (N ) → (Z/N )× → 1 a b where the last map is given by 7→ d (mod N ). c d Let χ : (Z/N )× → C× be a fixed Dirichlet character, called the Nebentypus. Define Mk (Γ1 (N ), χ) = {f ∈ Mk (Γ1 (N )) : f [β]k = χ(β)f for all β ∈ Γ0 (N )} where χ is lifted to Γ0 (N ) → C× . Then Mk (Γ0 (N )) can be identified as Mk (Γ1 (N ), χ0 ) for the trivial character χ0 . It is almost trivial to see that M Mk (Γ1 (N )) = Mk (Γ1 (N ), χ). χ:(Z/N )× →C× Remark. Here are three interpretations of Hecke operators for Γ0 (N ). (1) For (n, N ) = 1, we can view f ∈ Mk (Γ0 (N )) as a function {(Λ1 , Λ2 ) : Λ1 ,→ Λ2 with Λ2 /Λ1 ∼ = Z/N } → C. 23 Then X (Tn f )(Λ1 , Λ2 ) = nk−1 f ((Λ02 ∩ Λ1 ), Λ02 ). Λ02 ⊂Λ2 [Λ2 :Λ02 ]=n (2) Equivalently, in the language of elliptic curves, viewing f : {(E, G, ω)}/ ∼→ C gives X f (E/H, (G + H)/H, ωH ) Tn (f )(E, G, ω) = nk−1 H⊂E subgroup of order n (because (n, N ) = 1) where ωH is the pullback of ω under the dual isogeny of E → E/H. a b ∗ ∗ (3) In terms of double cosets, let MN = ≡ (mod N ) and MN [n] = c d 0 ∗ {γ ∈ MN : det γ = n} for (n, N ) = 1. Then we have the coset decomposition a a b Γ0 (N ) MN [n] = . 0 d ad=n 0≤b≤d−1 We define Hecke operators for Γ = Γ∗ (N ) where ∗ = 0, 1 as follows. For p prime, define 1 0 Tp = Γ Γ 0 p and Tpi = Tp Tpi−1 − pk−1 hpiTpi−2 (the diamond operator hpi will be defined below). Note when p - N , the second equation reduces to Tpi = (Tp )i . We then extend the definition to Tn by multiplicativity. We can the algebra generated by these is commutative, by using the involution prove −1 N 0 N 0 Xt σ(X) = on Γ∗ (N ). 0 1 0 1 Lemma 7.5. Let f ∈ Mk (Γ1 (N ), χ) with Nebentypus χ : (Z/N )× → C× . Then X an (Tm f ) = χ(d)dk−1 anm/d2 d|(n,m) (as usual, we extend χ to Z → C by 0 at integers not coprime to N ). Now we define the diamond operator, which acts on the space Mk (Γ1 (N )). Since Γ1 (N ) is a normal subgroup of Γ0 (N ), any coset satisfies Γ1 (N )γ = γΓ1 (N ) for γ ∈ Γ0 (N ) and hence Γ1 (N )γΓ1 (N ) = Γ1 (N )γ = γΓ1 (N ). This defines a double coset operator which depends only on the image of γ under Γ0 (N ) → (Z/N )× → 1 a b sending γ = to d (mod N ). This double coset operator is denoted by hdi and called c d the diamond operator. Theorem 7.6. The operators Tn and hdi, where n ≥ 1 and d ∈ (Z/N )× , are commutative on Mk (Γ1 (N )). 24 7.3. Petersson inner product. If f, g ∈ Mk (Γ∗ (N )), then f (τ )g(τ ) is of weight (k, k) in the following sense. Definition 7.7. φ : H → C has weight (m, n) if n φ(γτ ) = j(γ, τ )m j(γ, τ ) φ(τ ). Example 7.8. Im(τ ) = y = 1 (τ 2i − τ ) is of weight (−1, −1) since Im(γτ ) = Im(τ ) j(γ, τ )j(γ, τ ) . Define the Petersson inner product: for f, g ∈ Sk (Γ∗ (N )), set Z dτ dτ hf, giΓ = f (τ )g(τ )y k 2 y Γ\H Claim. If one of f or g is a cusp form, then limy→∞ f (τ )g(τ )y k = 0. Claim. hGk , Gk i diverges. Note that the Eisenstein series can be given by X X0 (cτ + d)−k = j(γ, τ )−k Gk = γ∈Γ∞ \Γ c,d 1 ∗ . Then for any g ∈ Mk (Γ), 0 1 Z Z X dτ dτ k dτ dτ Gk (τ )g(τ )y j(γ, τ )−k j(γ, τ )−k g(γτ ) Im(τ )k 2 = 2 y y Γ\H Γ\H γ∈Γ \Γ ∞ Z X dτ dτ g(γτ ) Im(γτ )k 2 = y Γ\H γ∈Γ \Γ ∞ Z dτ dτ g(τ ) Im(τ )k 2 = y Γ \H Z ∞ dτ dτ = g(τ )y k 2 y Γ∞ \H where Γ = SL2 (Z) and Γ∞ = is divergent unless g(τ ) → 0 as y → ∞, i.e. unless g is a cusp form. This implies the second claim. Next time we will study the inner product space (Sk (Γ∗ (N )), h·, ·i), and show that Hecke operators are normal, i.e. they commute with their adjoints X ∗ X = XX ∗ . This implies X is diagonalizable. Therefore we can simultaneously diagonalize the family of Hecke operators and decompose the space as direct sum of Hecke eigenforms. 8. Lecture 8 (February 18, 2013) dxdy on H which is y2 invariant under SL2 (R). If f and g are modular forms of weight k for a congruence subgroup 8.1. Petersson inner product. Consider the hyperbolic measure 25 Γ, then the product f (τ )g(τ ) Im(τ )k is Γ-invariant. Define Z dxdy f (τ )g(τ )y k 2 . hf, giΓ = y Γ\H Lemma 8.1. If one of f or g is a cusp form, then the integral converges absolutely. Proof. Let us check this for Γ = SL2 (Z) and D the fundamental domain (in general the fundamental domain will be finitely many copies of D). Write f (τ ) = a0 + a1 q + · · · and g(τ ) = b0 + b1 q + · · · . Since a0 b0 = 0, there exists some constant C > 0 such that |f (τ )g(τ )y k | < Ce−2πy y k . Hence the integral converges on y 0. The remaining region is compact. Remark. If f and g are both not cusp forms, then the integral diverges whenever k ≥ 1 since Z dxdy y k 2 does. y y0 Therefore, hf, giΓ is a well-defined inner product on Sk (Γ), the space of cusp forms, so it makes sense to talk about adjoint operators. For Γ = Γ∗ (N ) where ∗ = 0, 1, consider the Hecke operators Tn for (n, N ) = 1. (Morally speaking, they already determine Tp for primes p dividing N . Also we don’t lose too much just by considering Γ0 (N ), since Γ1 (N ) is normal in Γ0 (N ) with abelian quotient. Last time we introduced the diamond operators hdi for (d, N ) = 1. In fact we don’t lose too much just by considering SL2 (Z). The same proof techniques work. Anyway we’ll assume Γ = Γ∗ (N ) where ∗ = 0, 1.) Let Tn∗ be the adjoint operator of Tn on Sk (Γ), characterized by hTn f, giΓ = hf, Tn∗ giΓ . Theorem 8.2. Let Tα be the operator associated to ΓαΓ, where α ∈ M2 (Z) with det(α) 6= 0. Then Tα∗ = Tα∗ where α∗ = det(α)α−1 ∈ M2 (Z). Corollary 8.3. For Γ = Γ∗ (N ) and (n, N ) = 1, we have Tn∗ = hni−1 Tn . In particular, Tn is self-adjoint if ∗ = 0 (because the diamond operators are trivial on Γ0 (N )). Thus Tn is “almost” self-adjoint. In fact Tn is normal. Assuming the theorem, let us deduce the corollary. Proof of Corollary 8.3. For n = p prime with (p, = 1, Tp corresponds N) to the double coset 1 0 p 0 ΓαΓ and Tp∗ corresponds to Γα∗ Γ, where α = and α∗ = . We claim that 0 p 0 1 p 0 1 0 ∈ Γ1 (N ) Γ0 (N ). 0 1 0 p 0 −1 (Note: In SL2 (Z) we could have simply conjugated by .) 1 0 26 If β ∈ Γ1 (N ) and γ ∈ Γ0 (N ) are such that α∗ = β −1 αγ, then αγ(α∗ )−1 = β. Writing a b γ= ∈ Γ0 (N ), we want c d −1 1 0 a b p 0 a/p b = 0 p c d 0 1 c pd to be contained in Γ1 (N ). We can pick a = p, and solve for N | c with pd = bc + 1. Therefore, Tp∗ corresponds to the double coset 1 0 1 0 1 0 Γ γΓ = Γ Γγ = Γ ΓγΓ 0 p 0 p 0 p where γ ∈ Γ0 (N ), i.e. Tp∗ = Tp hγi = Tp hdi = Tp hp−1 i. We will not prove the theorem, which amounts to changing basis. The complexity of the theory is just about finding double cosets. Corollary 8.4. The operators Tn for (n, N ) = 1 are a commuting family of normal operators, so there exists an orthogonal eigenbasis of Sk (Γ). Definition 8.5. f ∈ Sk (Γ) is an eigenform if it is an eigenvector for each Hecke operator Tn and hdi, where (n, N ) = 1 and (d, N ) = 1. (For Γ0 (N ), we just need to consider Tn since hdi is trivial.) Remark. For Γ = SL2 (Z), Mk (SL2 (Z)) = CEk ⊕ Sk (SL2 (Z)). Thus Ek is an eigenform with eigenvalue σk−1 (n) for Tn . The same conclusion in Corollary 8.4 holds for Mk (SL2 (Z)), with “orthogonal” suitably interpreted (since hEk , Ek iΓ diverges). 8.2. L-functions. Next we will an application to L-functions. Let f ∈ Mk (Γ) Pdiscussion ∞ n have Fourier expansion f (q) = n=1 an q . Define L(s, f ) = ∞ X an n−s . n=1 To address convergence issues, note that we have the following trivial bounds on Fourier coefficients. Lemma 8.6. If f is a cusp form, then |an | ≤ Cnk/2 for some constant C. In general, we have |an | ≤ Cnk . Proof. For simplicity, assume Γ = Γ∗ (N ) where ∗ = 0, 1, so the local parameter is given by q = e2πiτ (for general congruence subgroups we need to take into account the width of Γ at ∞). Then Z Z 1 1 −n dq an = f (q)q = f (x + iy)e−2πin(x+iy) dx 2πi q 0 is independent of y > 0 by holomorphicity. Choosing y = n1 , we have Z 1 i 2π an = e f x+ e−2πinx dx n 0 27 Recall that |f (τ )|2 | Im(τ )|k is invariant under Γ. If f is cuspidal, |f (τ )|2 | Im(τ )|k ≤ C, so |f (x + ni )| ≤ C( n1 )−k/2 = Cnk/2 and so |an | ≤ Cnk/2 . For the Eisenstein series Ek for SL2 (Z), we have X an = σk−1 (n) = dk−1 ≤ Cnk . d|n Since any modular form for SL2 (Z) can be written as a sum of an Eisenstein series and a cusp form, we obtain the trivial bound. We omit the proof for general congruence subgroups. ( 1 + k2 if f is cuspidal and ∗ = 0, 1, Corollary 8.7. L(s, f ) converges for Re(s) > 1 + k if f ∈ Mk (Γ). We can consider the Euler product for L(s, f ), just like for the Riemann zeta-function. By definition, we say f is normalized if a1 = 1. Theorem 8.8. For f ∈ Sk (SL2 (Z)), the following are equivalent: (1) f is a normalized eigenform; (2) L(s, f ) has an Euler product Y L(s, f ) = (1 − ap p−s + pk−1−2s )−1 . p prime The property of having an Euler product is obviously not stable under addition. Being an eigenform means having a simple spectrum. Euler product thus reflects the spectrum. Proof. (1) ⇒ (2). Suppose f is an eigenform with a1 = 1. For Tm where m ∈ Z≥1 , recall that X an (Tm (f )) = dk−1 anm/d2 . d|(m,n) This implies that the eigenvalue of Tm is am . Since Tm Tn = Tmn for (m, n) = 1 and Tpi+1 = Tp Tpi − pk−1 Tpi−1 , we have the same relations ( am an = amn if (m, n) = 1, (2) k−1 api+1 = ap api − p api−1 if i ≥ 1. Hence, L(s, f ) = Y ∞ X p prime k=1 ! apk p−ks . pi The sequence bi = a has recursive relation bi+1 = ap bi − pk−1 bi−1 , hence characteristic polynomial T 2 − ap T + pk−1 . Thus ∞ X apk p−ks = (1 − ap p−s + pk−1−2s )−1 . k=1 28 (2) ⇒ (1). The same proof goes backwards. The Euler product implies that the Fourier coefficients an satisfy the relations (2). It suffices to verify Tp f = ap f for primes p (since Tp generate the Hecke algebra), which is equivalent to an (Tp f ) = an ap for all n, i.e. X dk−1 anp/d2 = an ap . d|(n,p) ( anp But the left hand side is just anp + pk−1 an/p if (n, p) = 1, which is equal to an ap by (2). if p | n, 9. Lecture 9 (February 20, 2013) 9.1. Hecke operators. We have studied the Hecke theory for the level 1 case Γ = SL2 (Z) using double cosets. For all n ≥ 1, we defined the Hecke operator Tn . Fix k ≥ 1 and consider the space of cusp forms Sk (Γ). Then Tn ∈ End(Sk (Γ)). Consider the subalgebra T = C[Tn ]n≥1 ⊂ End(Sk (Γ)), which is a commutative C-algebra. Since each operator Tn commutes with its adjoint, i.e. is normal, T is semisimple and hence isomorphic to C × · · · × C. We can extend this to level N . Let Γ = Γ∗ (N ), where ∗ = 0, 1. We start by defining Tp for all primes (p, N ) = 1. Tp corresponds to the double coset a 1 0 1 j a b p 0 Γ Γ= Γ tΓ 0 p 0 p N p 0 1 0≤j≤p−1 a b ∈ Γ0 (N ). N p We can define hdi for (d, N ) = 1. We can extend the definition to Tpi by using the recursive relation where Tpi := Tp Tpi−1 − pk−1 hpiTpi−2 and to Tn for (n, N ) = 1 by Tm Tn = Tmn whenever (m, n) = 1. It remains to define Tp for p dividing N . This operator is often called Up , but still Tp in Diamond-Shurman. In this case, we have the double coset decomposition a 1 0 1 j Γ Γ= Γ . 0 p 0 p 0≤j≤p−1 We use the same recursive relations as above to define Tn , and define hdi = 0 if (d, N ) > 1. Remark. All the above relations can be combined into ∞ X Y Y Tn n−s = (1 − Tp p−s + hpipk−1−2s )−1 · (1 − Tp p−s )−1 . n=1 p-N p|N 29 Therefore, we have defined Tn and hni for all n ≥ 1, which are endomorphisms of Sk (Γ). The Hecke algebra T generated by them is still a commutative C-algebra, but not necessarily semisimple. Lemma 9.1. Let (n, N ) = 1. Then the adjoint of Tn with respect to the Petersson inner product is Tn∗ = hni−1 Tn . In particular, Tn is a normal operator if (n, N ) = 1. This5 suggests we should look at the subalgebra T0 ⊂ T given by T0 = C[Tn , hni](n,N )=1 , which is semisimple by the lemma. What happens to Tp = Up when p | N ? We can still find a formula for its adjoint. Recall 1 0 that Tp is associated to the double coset Γ Γ, so Tp∗ is associated to the double coset 0 p p 0 Γ Γ. These two are in general not the same double coset. Let us introduce one more 0 1 operator. For Γ = Γ∗ (N ), define the normalizer N (Γ) = NGL2 (R)+ (Γ) = {γ ∈ GL2 (R)+ : γ −1 Γγ = Γ}. The quotient N (Γ)/Γ is interesting because it induces automorphisms on the modular curve, i.e. we have a map N (Γ)/Γ → Aut(Γ\H∗ ). It is non-trivial to find a normalizer, but here is one. Let 0 −1 1 0 0 −1 wN = = N 0 0 N 1 0 which is contained in N (Γ), so that the double coset ΓwN Γ is just a coset ΓwN . Then we can check that f 7→ f [wN ]k maps into the same space of cusp forms, so we can consider wN = [wN ]k ∈ End(Sk (Γ)). 2 = −N , we see that wN is an involution on the modular curve Γ\H∗ , but it is Since wN not quite an involution on Sk (Γ). Lemma 9.2. 2 (1) wN acts by (−1)k N k−2 on Sk (Γ). (2) For all n, we have ( −1 Tn∗ = wN Tn wN , −1 ∗ hni = wN hniwN . Proof. 2 2 k k−2 (1) wN = [wN ]k = (N 2 )k−1 (−N )−k = (−1) N . 1 0 p 0 and differ by conjugation by wN . (2) The key point is that the matrices 0 p 0 1 5We can also check that hni∗ = hni−1 for (n, N ) = 1, so hni is indeed normal. 30 Corollary 9.3. For Γ = Γ0 (N ) and (n, N ) = 1, we have −1 Tn∗ = Tn = wN Tn wN . Thus wN commutes with Tn for (n, N ) = 1. 9.2. Atkin-Lehner theory. To understand the relationship between T0 and T, we need the Atkin-Lehner theory (also known as the newform theory), and later we will give an example where T is not semisimple. The idea is to study the newforms of Sk (Γ1 (N )), which are the “primitive” forms in some sense. When M | N , we always have the inclusion Sk (Γ1 (M )) ,→ Sk (Γ1 (N )). Consider the span d 0 N . In fact it is enough of f (dτ ) = d1−k f where f ∈ Sk (Γ1 (M )), M | N and d | M 0 1 k to consider forms of level dividing N/p where p | N is a prime: Skold (Γ1 (N )) = span{f (dτ ) : f ∈ Sk (Γ1 (N/p)), d | p, p | N } ⊂ Sk (Γ1 (N )) and define the newspace Sknew (Γ1 (N )) = Skold (Γ1 (N ))⊥ . Note that we are not calling this the “space of newforms” yet. Newform will have a specific meaning – it is an eigenform. Atkin and Lehner proved that the newspace behaves like the level 1 case, where we just need to look at T0 . Before we make this more precise, let us prove a few lemmas. Lemma 9.4. The spaces Skold and Sknew are T-stable, where T is the full Hecke algebra. Proof (Sketch). Introduce T∗ , the algebra generated by the adjoints Tn∗ and hni∗ . It suffices −1 to prove that Skold is both T- and T∗ -stable. But T∗ = wN TwN , so it is enough to prove Skold is T-stable and wN -stable. Define the operator Vp : Sk (Γ1 (N/p)) →P Sk (Γ1 (N )) by f 7→ f (pτ ). This is a one-sided n inverse of the Up operator. Indeed, for f = ∞ n=0 an q , X X X τ +j 1 j −1 =p f Up (f ) = f = an q n/p 0 p k p 0≤j≤p−1 0≤j≤p−1 p|n whereas Vp (f ) = ∞ X an q pn . n=0 It is clear that Up Vp = Id. It remains to show that the sum of the images of Sk (Γ1 (N/p)) ,→ Sk (Γ1 (N )) and Vp : Sk (Γ1 (N/p)) → Sk (Γ1 (N )) is stable under the operators Tq , hqi and w for all primes q. For example, consider the diagram (note the two Tp ’s are different!) Sk (Γ1 (N/p)) Vp / Tp Sk (Γ1 (N )) Sk (Γ1 (N/p)) Vp / Tp Sk (Γ1 (N )) which is not quite commutative. Since p | N , the Tp on the right is equal to Up and so Tp Vp (f ) = f for all f ∈ Sk (Γ1 (N/p)). This shows Tp preserves Skold . 31 The other operators for other primes can be checked similarly6. Example 9.5. For p - N , Up acts on Sk (Γ0 (p3 N )). Let f ∈ Sk (Γ0 (N )) be an eigenform of Tp . Consider the space spanned by fi = f (pi τ ) for 0 ≤ i ≤ 3. We claim that Up acts non-semisimply on it. Since Vp fi = fi+1 for 0 ≤ i ≤ 2, we have Up fi = fi−1 for 1 ≤ i ≤ 3. Using the formula X Tp (f ) = an q n/p + pk−1 hpif (pτ ) p|N and the fact that f = f0 is an eigenform, we see that there exists λ ∈ C such that7 λf0 = Tp f0 = Up f0 + pk−1 f1 , i.e. Up f0 = λf0 − pk−1 f1 . λ 1 −pk−1 0 1 , which is not semisimple8. Thus Up has matrix 0 1 0 Theorem 9.6 (Atkin-Lehner). If f ∈ Sknew (Γ1 (N )) is an eigenform for T0 , then f is also an eigenform for the full Hecke algebra T. We are not going to prove this, since the proof is just some combinatorics and not inspirational. There is a more conceptual proof using representation theory. Definition 9.7. f ∈ Sk (Γ1 (N )) is a newform if it is contained in Sknew (Γ1 (N )), an eigenform for T and normalized such that a1 (f ) = 1. Newforms have a rigid meaning. In particular, newforms do not form a vector space, just a set! Corollary 9.8 (Multiplicity one). If f is a newform with eigenvalue λ : T → C, then dim{g ∈ Sk (Γ1 (N )) : T g = λ(T )g for all T ∈ T} = 1. 10. Lecture 10 (February 25, 2013) 10.1. Atkin-Lehner theory. We were discussing the Aktin-Lehner theory about newforms, and trying to explain the failure of semisimplicity of Hecke algebras, as well as to study the structure of the Hecke module Sk (Γ) for Γ = Γ∗ (N ) where ∗ = 0, 1. There is no need to consider the principal congruence subgroup Γ(N ) because it is conjugate to Γ1 (N 2 ). Consider T = C[Tn , hdi : n ≥ 1, (d, N ) = 1] ⊂ EndC (Sk (Γ)) (recall that the operators depend on the level N and weight k, although the notation doesn’t suggest so!). Literature also considers EndC (Mk (Γ)), but we restrict ourselves to the simpler case of cusp forms. Consider the subalgebra T0 = C[Tn , hdi : (n, N ) = 1, (d, N ) = 1] ⊂ T. Last time we saw that T0 acts semisimply, but T does not. 6Refer to Proposition 5.6.2 of Diamond-Shurman for details. (d, N ) = 1, the diamond operator hdi acts trivially on Sk (Γ0 (N )), so hpif0 (pτ ) = f0 (pτ ) = f1 . Also, note that Up ∈ End(Sk (Γ0 (p3 N )) but Tp ∈ End(Sk (Γ0 (N )). 8This matrix has characteristic polynomial X 2 (X 2 − λX + pk−1 ), but its kernel is only 1-dimensional. 7For 32 an q n , X Up f = anp q n Write Tp = Up if p | N . Then for f = P n≥1 n≥1 and Vp f = p 1−k X p 0 f = an q np 0 1 k n≥1 so that Up is a left inverse to Vp . The guiding example is as follows. For p - N , let f 6= 0 be an eigenform for Tp of level N , with Tp f = λf . Consider the span of ei = (Vp )i f where i = 0, 1, · · · , d, which are linearly independent. Then Up , as a Hecke operator of level N pd , is given by ( Up ei = ei−1 for 1 ≤ i ≤ d, k−1 Up e0 = λe0 − p e1 , so its matrix is of the form λ 1 −pk−1 0 1 . Up,d = 0 1 0 If d ≥ 3, then this matrix is not diagonalizable, i.e. the action of Up is not semisimple. This follows from an easy exercise in linear algebra. Exercise. The minimal polynomial of Up,d is X d−1 (X 2 − λX + pk−1 ). The Atkin-Lehner theory takes into account these operators Up . Lemma 10.1. The map T × Sk (Γ) → C defined by (T, f ) 7→ a1 (T f ) is a perfect pairing. Proof. Recall that a1 (Tm f ) = am (f ). If (T, f ) = 0 for all T , then in particular (Tn , f ) = 0, so an (f ) = a1 (Tn f ) = 0 for all n and f = 0. If (T0 , f ) = 0 for all f ∈ Sk (Γ), we want to show T0 = 0, i.e. T0 f = 0. Indeed, for all n, an (T0 f ) = a1 (Tn T0 f ) = a1 (T0 Tn f ) = (T0 , Tn f ) = 0. If f ∈ Sk (Γ) is an eigenform of T, then T f = λf (T )f satisfies λf (Tn Tm ) = λf (Tn )λf (Tm ) and so defines a character λf : T → C. Define Sk (Γ)[λf ] to be the λf -eigenspace, i.e. Sk (Γ)[λf ] = {g ∈ Sk (Γ) : T g = λf (T )g for all T ∈ T}. Lemma 10.2 (Multiplicity one). If f ∈ Sk (Γ) is an eigenform of T, then dim Sk (Γ)[λf ] = 1. Proof. For g ∈ Sk (Γ)[λf ], we have an (g) = a1 (Tn g) = λf (Tn )a1 (g) for all n ≥ 1. Theorem 10.3 (Atkin-Lehner). (1) If f ∈ Sknew (Γ) is an eigenform of T0 , then it is an eigenform of T. (2) Let f be a newform of level Nf dividing N , and Sf be the span of the linearly independent elements Vd f = f (dτ ) where d | N/Nf . Then Sf = {g ∈ Sk (Γ∗ (N )) : T g = λf (T )g for all T ∈ T0 }. In particular, Sf is stable under T. 33 (3) There is a decomposition Sk (Γ∗ (N )) = M M Sf M |N f newform of level M as Hecke modules. Remark. By Lemma 10.2, if M = N , then dim Sf = 1. In general, Sf has dimension P 1. d|N/Nf Remark (Strong multiplicity one9). The association {f newform of level dividing N } → {non-zero algebra homomorphism λ : T0 → C} is injective (in fact bijective). Corollary 10.4. (1) If f ∈ Sk (Γ∗ (N )) with Fourier coefficients an (f ) = 0 for all (n, N ) = 1, then X f∈ Vp (Sk (Γ∗ (N/p))). p|N (2) If f, g ∈ Sknew (Γ) with an (f ) = an (g) for all (n, N ) = 1, then f = g. We will not prove any of these theorems. In fact, Corollary 10.4 is known as the Main Lemma and proved first in Diamond-Shurman. There are better proofs, e.g. in Casselman. Example 10.5. Recall that dim S2 (Γ0 (N )) = g(X0 (N )). For example, using the genus formula, we get dim S2 (Γ0 (11)) = 1 and dim S2 (Γ0 (22)) = 2. For any non-zero f ∈ S2 (Γ0 (11)) (necessarily an eigenform on level 11), we have S2 (Γ0 (22)) = Cf (τ ) ⊕ Cf (2τ ). λ2 1 With respect to this basis, U2 has matrix , where T2 f = λ2 f in S2 (Γ0 (11)). We −2 0 can check that10 λ2 = −2, so U2 is semisimple. Thus, T0N =22 ∼ = C and T = T0 [U2 ] = C × C. 10.2. Rationality and Integrality. We move the story over C to Q. Define TQ = Q[Tn , hdi : n ≥ 1, (d, N ) = 1] ⊂ EndC (Sk (Γ)) and similarly for TZ . We do not know if they are even finitely generated. Example 10.6. Let V be a 1-dimensional vector space over C, and set A = 2, B = π. Then Q[A, B] ⊂ EndC (V ) ∼ = C is not a finitely generated Q-algebra. We want to give a rational or integral structure to the space of modular forms. Theorem 10.7. For R = Q, Z, there exists Sk,R (Γ) ⊂ Sk,C (Γ) := Sk (Γ) which is TR -stable, and such that Sk,C (Γ) = Sk,R (Γ) ⊗R C. 9This statement is stronger than “multiplicity one” above, but not as strong as the one in automorphic representations. Perhaps we should only call this “stronger multiplicity one”. 10Or look up Cremona’s table! 34 In fact we can describe Sk,R (Γ) explicitly in terms of Fourier coefficients. For any subring R ⊂ C, define Sk,R (Γ) = {f ∈ Sk,C (Γ) : an (f ) ∈ R for all n} and similarly Mk,R (Γ) = {f ∈ Mk,C (Γ) : an (f ) ∈ R for all n}. Today let us only consider the level 1 case Γ = SL2 (Z). Recall the graded algebra M Mk,C = C[E4 , E6 ] k∈Z where Ek = 1 − immediately get k Bk P∞ n=1 σk−1 (n)q n for k ≥ 4 even. Since Ek has rational coefficients, we Corollary 10.8. Mk,Q ⊗ C ∼ = Mk,C . The basis element E4a E6b is contained in Mk,Q , and this space is stable under TQ since an (Tm f ) = X dk−1 anm/d2 (f ). d|(n,m) Thus TQ is a subalgebra of EndQ (Mk,Q ), hence finitely generated over Q, i.e. a finitedimensional Q-vector space. Similarly, we have Sk,Q ⊗ C ∼ = Sk,C , and TQ × Sk,Q → Q is a perfect pairing. 10.3. Plan. The plan before spring break is to study the rational and integral structures of spaces of modular forms, and prove a result of Shimura on the algebraicity of special values of L-functions. We will start discussing p-adic modular forms after the break. 11. Lecture 11 (February 27, 2013) 12. Lecture 12 (March 4, 2013) 13. Lecture 13 (March 6, 2013) 14. Lecture 14 (March 11, 2013) I was away for the Arizona Winter School. 15. Lecture 15 (March 13, 2013) I was away for the Arizona Winter School. 35 16. Lecture 16 (March 25, 2013) 17. Lecture 17 (March 27, 2013) 18. Lecture 18 (April 1, 2013) 19. Lecture 19 (April 3, 2013) 20. Lecture 20 (April 8, 2013) 21. Lecture 21 (April 10, 2013) 22. Lecture 22 (April 15, 2013) 23. Lecture 23 (April 17, 2013) 24. Lecture 24 (April 22, 2013) 25. Lecture 25 (April 24, 2013) 26. Lecture 26 (April 29, 2013) 27. Lecture 27 (May 1, 2013) 36
© Copyright 2026 Paperzz