1 QUANTUM FIELD THEORY II (PHYS7652) LECTURE NOTES Lecture notes based on a course given by Maxim Perelstein. We begin with discussing the path integral formalism in Quantum Mechanics and move on to it’s use in Quantum Field Theory. We then study renormalization and running couplings in abelian and non-abelian gauge theories in detail. Presented by: MAXIM PERELSTEIN LATEX Notes by: JEFF ASAF DROR 2013 CORNELL UNIVERSITY Contents 1 Preface 1.1 Conventions . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Path Integrals in Quantum Mechanics 2.1 Harmonic Oscillator . . . . . . . . . . . 2.2 Anharmonic Oscillator . . . . . . . . . 2.3 Correlation Functions . . . . . . . . . . 2.A Jacobian of the Fourier Transformation . . . . 3 Quantum Field Theory 3.1 Free Klien Gordan Field . . . . . . . . . 3.2 External Force . . . . . . . . . . . . . . . 3.3 Interacting Theory . . . . . . . . . . . . 3.4 LSZ Reduction . . . . . . . . . . . . . . 3.5 Statistical Mechanics and Path Integrals 4 Quantization of Electromagnetism 4.1 Classical Electromagnetism . . . 4.2 Quantization . . . . . . . . . . . 4.3 Path Integrals in QED . . . . . . 4.4 Correlation Functions . . . . . . . 5 Fermions with Path Integrals 5.1 Grassman Numbers . . . . . 5.2 Free Dirac Field . . . . . . . 5.3 QED . . . . . . . . . . . . . 5.4 Three-point function . . . . 5.5 e+ e− → µ+ µ− . . . . . . . . 5.6 e− µ− → e− µ− . . . . . . . 5.7 Compton Scattering . . . . 5.8 Proof of Ward Identity . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 4 . . . . 5 9 14 18 22 . . . . . 24 24 26 29 30 30 . . . . 31 31 34 35 40 . . . . . . . . 42 42 45 47 49 53 55 57 59 CONTENTS 3 6 Loop Diagrams and UV Divergences in QED 6.1 Superficial Divergences . . . . . . . . . . . . . 6.2 Electron Self-Energy . . . . . . . . . . . . . . 6.3 Classical Renormalization and Regulators . . . 6.4 Dimensional Regularization . . . . . . . . . . 6.5 Vacuum Polarization . . . . . . . . . . . . . . 6.6 Vertex Function . . . . . . . . . . . . . . . . . 6.6.1 Physics of F2 . . . . . . . . . . . . . . . . . . . . . 61 61 63 69 71 71 76 78 7 Renormalized Perturbation Theory 7.1 φ4 Theory . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7.2 QED . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 80 80 81 8 Running Couplings and Renormalization Group 8.1 “Large Logs” . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8.2 QED β-function . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 84 84 90 9 Non-Abelian Gauge Theories (“Yang-Mills”) 9.1 Looking Closely at Abelian Gauge Theories . 9.2 SU (2) . . . . . . . . . . . . . . . . . . . . . . 9.3 General Recipe . . . . . . . . . . . . . . . . . 9.4 Universal Couplings . . . . . . . . . . . . . . . 9.5 Quantum Chromodynamics . . . . . . . . . . 9.6 Quantization of Yang Mills . . . . . . . . . . . 9.7 Unitary and Ghosts . . . . . . . . . . . . . . . 9.8 Renormalized Perturbation Theory . . . . . . 9.9 RG flows (evolution) . . . . . . . . . . . . . . 10 Broken Symmetries 10.1 SSB of Discrete Global Symmetries . . 10.2 SSB of Continuous Global Symmetry . 10.3 Linear Sigma Model . . . . . . . . . . 10.4 More on the Mexican Hat Potential . . 10.5 SSB of Gauge Symmetries (“The Higg’s . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 94 94 95 98 101 102 103 106 108 114 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . Mechanism”) . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 116 116 119 121 122 123 . . . . . . . . . . . . . . . . . . . . . . . . . . . Chapter 1 Preface This set of notes are based on lectures given by Maxim Perelstein in the Quantum Field Theory II course at Cornell University during Spring 2013. The course uses both Srednicki’s Quantum Field Theory and Peskin and Schroeder’s An Introduction to Quantum Field Theory as reference texts. I wrote these notes during lectures and as such may contain some small typographical errors and sloppy diagrams. I also make notes for myself throughout the text of the form, [Q#:...]. They are just to remind to me to go back and take a look at something and should be ignored. I’ve attempted to proofread and fix as many of these problems as I can. I hope to continue to revise and add to these notes until I feel they are complete. If you have any questions or would like a copy of the LATEX file feel free to let me know at [email protected]. 1.1 Conventions In these notes we use the conventions used by Peskin and Schroeder. We use a “West coast metric”, 1 0 0 0 0 −1 0 0 gµν = (1.1) 0 0 −1 0 0 0 0 −1 the Weyl basis, µ γ = 0 σµ σ̄ µ 0 (1.2) where σ µ ≡ (1, σ i ) and σ̄ ≡ (1, −σ i ). When applying Fourier transform (FT) we put the 2π on the momenta integral such that: Z dp ˜ f (p)e−ipx (1.3) f (x) = 2π 4 Chapter 2 Path Integrals in Quantum Mechanics The primary tool of this course will be what’s known as the path integral. In nonrelativistic quantum mechanics (NRQM) it is an alternative formulation to using the Schrodinger equation. Consider a particle at xa that evolves with a Hamiltonian H = H(x, p; t) (2.1) What we want to compute is the transition amplitude for this particle to go from its initial position some other position xb in a time 2T . In QM we denote this amplitude with a braket: U = hxb , T |xa , −T i (2.2) To simplify this expression consider inserting in a completeness relation N − 1 completeness relations: Z = dx1 dx2 ... dxN −1 hxb , T |xN −1 , T − ∆ti hxN −1 , T − ∆t|xN −2 , T − 2 ∆ti ... (2.3) hx1 , −T + ∆t|xa , −T i Now if we have a braket at two nearby times then you can evolve one side to have an equal time braket1 : hx, t0 |x0 , t0 + ∆ti = hx, t0 |e−iH(t1 )·(t1 −t0 ) |x0 , t0 i = hx|e−iH(t1 )∆t |x0 i (2.4) where in the last step we rewrote the bra and ket without time dependence as the absolute value of time has no significance. Z = dx1 dx2 ... dxN −1 hxb |e−iH(T −∆t)·∆t |xN −1 i hxN −1 |e−iH(T −2∆t)·∆t |xN −2 i ... hx1 |e−iH(−T )·∆t |xa i 1 To do this properly with finite ∆t requires Dyson’s formula 5 (2.5) 6 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS Now let t1 = −T, t2 = −T + ∆t, ..., tN = T . Thus we now have Z = dx1 dx2 ... dxN −1 hxb |e−iH(tN −1 )·∆t |xN −1 i hxN −1 |e−iH(tN −2 )·∆t |xN −2 i ... hx1 |e−iH(t1 )·∆t |xa i U= = N −1 Z Y i=1 N −1 Z Y ! dxi ! dxi i=1 N −1 Y i=0 N −1 Y (2.6) ! hxi+1 |e−iH(ti )∆t |xi i (2.7) ! U (xi → xi+1 ; ∆t) (2.8) i=0 Note that technically these equations need two less integrals then the number of xi values. However often people don’t bother to write it like this (since it somewhat ruins the ascetic beauty in the formula). The point is that you write down each of the brakets with every position included, but you only integrate over x1 , ..., xN −1 , not over x0 = xa and xN = xb . Either way they don’t matter much at the end, as long as you know not to integrate over your end points... Instead this expression is more commonly written as N Z Y U= ! dxi U (xi → xi+1 ; ∆t) (2.9) i=0 Pictorially this integral is show below t T xa xb x .. . 2 1 −T This is called a path integral. For simplicity we focus on Hamiltonians are of the form H= p2 + V (x, t) 2m (2.10) 7 Consider the braket above (if we know the value of this term we almost know everything). We take ∆t to be small. p2 + V (x, ti )|xi i 2m p2 = δ(xi+1 − xi ) − i∆t hxi+1 | |xi i − i∆tδ(xi+1 − xi )V (xi , ti ) 2m Z dpi −ipi (xi+1 −xi ) p2 = e − i∆t hxi+1 | |xi i − i∆tV (xi , ti ) 2π 2m Z dpi −ipi (xi+1 −xi ) × e (no sum) 2π hxi+1 |e−iH∆t |xi i = hxi+1 |xi i − i∆t hxi+1 | By insertion of complete set of states (note also that hpi |xi i = e−xi pi ) we have, Z 0 00 2 hxi+1 |p |xi i = dp0 dp00 eixi+1 p e−ixi p p00 2 hp0 |p00 i Z dp0 ixi+1 p0 −ixi p0 0 2 = e e p 2π Z dpi i(xi+1 −xi )pi 2 2 pi (no sum) e hxi+1 |p |xi i = 2π (2.11) (2.12) (2.13) (2.14) (2.15) (2.16) Therefore, −iH(ti )∆t hxi+1 |e 2 pi dpi −ipi (xi+1 −xi ) e + V (xi , ti ) |xi i = 1 − i∆t 2π 2m Z dpi −ipi (xi+1 −xi ) −i∆tH(xi ,pi ,ti ) e = e 2π Z dpi −ipi ∆t (xi+1 −xi ) −i∆tH(xi ,pi ,ti ) ∆t e = e 2π Z (2.17) (2.18) (2.19) Note that in the last step we have combined our infinitesimal result into our exponential (in the end we take ∆t → 0). Notice that we no longer have operators; pi and Hi are numbers!The expression has only numbers, we have moved away from regular quantum mechanics. now taking the ∆t → 0 limit we have Z dpi −i∆t(pi ẋi −Hi ) −iHi ∆t e (2.20) hxi+1 |e |xi i = 2π Thus we have N −1 Z Y U= i=1 Z = dp0 2π ! N −1 Z Y ! dpi −i∆t(ẋi pi −Hi ) dxi e 2π i=0 ! Z N −1 Z Y PN −1 dpi dxi e−i i=0 ∆t(ẋi pi −Hi ) 2π i=1 (2.21) (2.22) 8 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS Taking the limit we finally have N −1 Z Y U= Z dxi i=1 dpi 2π ! e−i RT −T dt(ẋi pi −Hi ) (2.23) R where now that we have taken the limit we have just dropped the dp0 integral and the value of p0 in the exponential. Since we have an infinite number of integrals, adding one more shouldn’t effect the final result. Only when we discretize our integrals should such a distinction be of any importance. For convenience we define N −1 Z ∞ Y Z Dx = lim N →∞ i=1 Z dxi Dp = lim ; N →∞ −∞ N −1 Z ∞ Y i=1 dpi (2.24) −∞ We can now write Z U= DxDp e−i RT −T dt(ẋi pi −Hi ) (2.25) Note that the momenta integral has nothing to do with the particular dynamics of the problems and can be done in general since they are just Gaussian integrals. The integral we need is (we now go back to the discrete limit for a moment, see equation 2.22) Z Z dpi i∆t(pi ẋi − p2i −Vi ) dpi −i ∆t (pi −mẋi )2 i ∆t (mx˙i )2 −i∆tVi 2m e = e 2m e 2m (2.26) 2π 2π Z dpi −i ∆t (pi )2 i ∆t (mx˙i )2 −i∆tVi e 2m (2.27) = e 2m 2π ∆t 2 = C(∆t, m)ei 2m (mx˙i ) −i∆tVi m 2 = C(∆t, m)ei∆t( 2 x˙i −Vi ) (2.28) = C(∆t, m)ei∆tL[xi ,ti ] (2.30) (2.29) where we have denoted the value of the momenta integral by C(∆t, m). Though not obvoius, it’s value will turn out to have no impact on physical quantities. Nevertheless we include for completeness, r 2πm C(∆t, m) = (2.31) i∆t Thus we have ! Z T Z N −1 Z N Y X L(xj , t) = C Dx exp i L[x(t), t]dt U =C dxi exp i∆t i=1 Z =C iS Dx e j=1 −T (2.32) x(−T )=xa ,x(T )=xb 2.1. HARMONIC OSCILLATOR 9 where S is the action. This can be thought of as X U= eS(path) (2.33) paths We worked in natural units. Going back to regular units we have X U= eS(path)/~ (2.34) paths A classical system is one that has a large action (alternatively one can take the Bohr limit of ~ → 0). We are dealing with a rapidly oscillating integrand. The trajectory that ends up being the dominant contribution is the one that the smallest phase (S(path)/~). This is the one that has minimum S, as in classical mechanics. 2.1 Harmonic Oscillator We will now do an example. Consider a simple harmonic oscillator with an external force, f (t) . The Hamiltonian is given by H0 p2 ω 2 x2 ω z }| { + − −f (t)x H= 2 2 2} | {z (2.35) H0 (set m = 1). We consider a force that is turned on and then turned off at some time later: f (t) −t0 t0 t Initially the system is in the ground state (|0i), what’s the probability to remain in |0i after t0 ? In standard quantum mechanics we compute probabilities as P = |h0, t0 |0, −t0 i|2 (2.36) It’s unclear how this relates to the path integral formulation since so far we’ve only discussed probabilities of a particle moving from some x eigenstate to another x eigenstate.[Q 1: At the end repeat this problem using standard PT.] Consider (we use Dyson’s formula discussed above) U (xa → xa , 2T ) ≡ hxa |e−i RT −T Hdt |xa i (2.37) 10 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS with T > t0 . We split the braket into 3 parts: U (xa → xa , 2T ) = hxa |e−2iH0 T e−i R t0 −t0 = hxa |e−iH0 (T −t0 ) e−i = hxa |e−iH0 T e−i R t0 −t0 H 0 dt R t0 −t0 |xa i H0 (2.38) dt −iH0 (t0 −(−T )) e H dt −iH0 T e |xa i |xa i (2.39) (2.40) where we have used the fact that H 0 is only applied between −t0 and t0 and in the last step we were careful to write H 0 in Dysons formula not H0 . We now insert two complete set of states: Z U (xa → xa , 2T ) = dx+ dx− hxa |e−iH0 T |x+ i hx+ |e−i R t0 t0 Hdt |x− i hx− |eiH0 T |xa i (2.41) We now try to simplify the hx− |e−iH0 T |xa i term. Consider, −iH0 T e |xa i = ∞ X e−iH0 T |ni hn|xa i (2.42) ψn (xa )e−iEn T |ni (2.43) n=0 = ∞ X n=0 where we have inserted the set of eigenstates of H0 . Now comes a “dirty trick”. Take H0 → (1 − i)H (at the end we will take → 0). −iH0 (1−i)T lim lim e →0 T →∞ |xa i = lim lim →0 T →∞ ∞ X ψn (xa )e−iEn T e−T En |ni (2.44) n=0 Choose spectrum as En > 0, n > 0 and E0 = 0. If we take T → 0 then the only term that contributes from the sum is the one that isn’t effected by the damping term, e−T En (the |0i term). lim lim e−iH0 (1−i)T |xa i = lim ψ0 (xa ) |0i →0 T →∞ →0 = ψ0 (xa ) |0i (2.45) (2.46) This key step helps us take position eigenstates and relate them to what we really want which is energy eigenstates. Define Ū as U with H0 → H0 (1 − i) Z R t0 2 dx+ dx− h0|x+ i hx+ |e−i −t0 Hdt |x− i hx− |0i lim lim Ū (xa → xa , 2T ) = |ψ0 (xa )| →0 T →∞ (2.47) = |ψ0 (xa )|2 h0|e− 2 R t0 −t0 Hdt |0i = |ψ0 (xa )| h0, t0 |0, −t0 i Z (2.48) (2.49) ∞ dxa lim lim Ū (xa → xa , 2T ) = h0, t0 |0, −t0 i −∞ →0 T →∞ (2.50) 2.1. HARMONIC OSCILLATOR 11 due to the normalization in the wavefunction. We now go back to our formula for the path integral. Note that don’t go to our “final” form of our path integral equation since p2 to simplify the momentum integral we assumed H of the form 2m + V (which is not the case now that we threw in the i). Instead we go to our earlier form (see equation 2.25). Z ∞ Z dt(pẋ − (1 − i)H0 + f (t)x) (2.51) lim Ū = DxDp exp i T →∞ −∞ Z Z Y dp (1 − i) = Dx exp i∆t − p2 + 2pẋ (1 + i) − ẋ2 (1 + i)2 2π 2 2 2 2 2 (2.52) + ẋ (1 + i) + 2 (1 + i) f (t)x − ω x where we have used 1+i ≈ (1 − i)−1 , and we ignore the constant term in H0 since it has no impact on the final result. The term in the exponential can be factored into a constant term (with respect to dp) and a −(p − ẋ(1 + i))2 term. We define p0 ≡ p − ẋ(1 + i). We have the integral r Z 1 − i 2 1 2π dp exp −i∆t p = (1 + i)1/2 (2.53) 2π 2 2π i∆t r −i = (1 + i) (2.54) 2π∆t Finally we have Z ∞ Z ẋ2 ω 2 x2 dt (1 + i) − lim Ū = C Dx exp i (1 − i) + f (t)x (2.55) T →∞ 2 2 −∞ To do this integral we need a way to put ẋ and x in a similar form. We do this using a Fourier transform: Z ∞ dE −iEt x(t) = e x̃(E) (2.56) −∞ 2π Z dE ẋ(t) = (−iE)e−iEt x̃(E) (2.57) 2π Z dE −iEt ˜ f (t) = e f (E) (2.58) 2π The integration measure can be changed to Dx̃. This is because every variable in x space has a one-to-one correspondence with the variables in k space and the Jacobian to switch spaces is one (this is shown in appendix 2.A). We have Z Z Z dE ˜ dE 0 dt f (t)x = f (E) x̃(E 0 )2πδ(E + E 0 ) (2.59) 2π 2π Z dE ˜ = f (E)x̃(−E) (2.60) 2π Z dE ˜ 1 ˜ f (E)x̃(−E) + f (−E)x̃(E) (2.61) = 2 2π 12 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS Similarly, Z Z dE dE 0 dt ẋ = x̃(E) x̃(E 0 )2πδ(E + E 0 )(−1)EE 0 2π 2π Z dE = x̃(E)x̃(−E)E 2 2π Z dE 2 1 E x̃(E)x̃(−E) + x̃(−E)x̃(E) = 2 2π 2 Z 2 Z lim Ū = C T →∞ (2.62) (2.63) (2.64) 2 E −ω +i Z }| { 1 dE z 2 2 Dx̃ exp i E (1 + i) − ω (1 − i) x̃(E)x̃(−E) 2 2π ˜ ˜ + f (E)x̃(−E) + f (−E)x̃(E) (2.65) The idea is to complete the square for each dx̃. The variable that does that is ỹ(E) = f˜(E) x̃(E) + E 2 −ω 2 +i . The arguement in the exponential turns into, ! ! Z i f˜(E) f˜(−E) dE 2 2 (...) = ỹ (−E) − 2 E − ω + i ỹ (E) − 2 2 2π E − ω 2 + i E − ω 2 + i (2.66) ! ! f˜(−E) f˜(E) ˜(−E) ỹ(E) − + f˜(E) ỹ(−E) − 2 + f (2.67) E − ω 2 + i E 2 − ω 2 + i Z f˜(E)f˜(−E) dE i E 2 − ω 2 + i ỹ(−E)ỹ(E) − 2 (2.68) = 2 2π E − ω 2 + i Thus we can write, dE f˜(E)f˜(−E) lim Ū = C T →∞ 2π −E 2 + ω 2 − i (2.69) Now consider the special case of f → 0. Then we have no perturbation and the amplitude to stay in the ground is just 1. So in this case we have (the second exponential is trivial) Z Z 1 dE 2 2 1 = C Dỹ exp i ỹ(E)(E − ω + i)ỹ(−E) (2.70) 2 2π Z Z 1 dE 2 2 Dỹ exp i ỹ(E)(E − ω + i)ỹ(−E) exp 2 2π i 2 Z This is a mathematical relation completely independent of the particular system that we consider. Thus we can insert it into equation 2.69 to get a final succinct expression for the amplitude, ! Z i dE f˜(E)f˜(−E) h0, t0 |0, −t0 i = lim exp (2.71) →0 2 2π −E 2 + ω 2 + i ! 2.1. HARMONIC OSCILLATOR 13 The (E 2 − ω 2 )−1 factor is remenicient of the inverse of the simple harmonic oscillator equation. This hints that this can be rexpressed in terms of the Green’s function for simple harmonic motion defined by: ∂t2 + ω 2 G(t − t0 ) = δ(t − t0 ) (2.72) Writing both sides our using a Fourier decomposition, Z Z dE dE −iE(t−t0 ) 2 2 −iE(t−t0 ) −E + ω G̃(E)e e = 2π 2π (2.73) and so, G̃ = E2 1 − ω2 (2.74) Now taking the inverse Fourier Transform, Z 0 dE e−iE(t−t ) 0 G(t − t) = 2π −E 2 + ω 2 (2.75) As always there is an ambiguity on the path of integration. We choose the Feynman decomposition: Z 0 dE e−iE(t−t ) G = lim (2.76) →0 2π −E 2 + ω 2 − i Using this expression we can rewrite the amplitude in equation 2.71. The argument in the exponential is Z Z 0 dE eiEt e−iEt i 0 0 dtdt f (t)f (t ) 2 (2.77) (...) = 2 2π E − ω 2 + i Z i dE = dtdt0 f (t)G(t − t0 )f (t0 ) (2.78) 2 2π where we have used the symmetric property of the Green’s functions: G(t−t0 ) = G(t0 −t). Applying this into our equation above we have Z t0 i 0 0 0 dtdt f (t)G(t − t )f (t ) (2.79) h0, t0 |0, −t0 i = exp 2 −t0 Alternatively we can express the amplitude using the definition, Z T Z Z Z W [f (t)] ≡ lim Dx exp i dt(L̄0 + f x) = Dx exp i T →∞ x(T )=x(−T ) −T ∞ dt(L̄0 + f x) −∞ (2.80) where L̄0 is the free Lagrangian with all the inserted. Using Eq. 2.55 we see that we can write, h0, t0 |0, −t0 i = lim CW [f ] (2.81) →0 14 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS if f = 0 then h0, t0 |0, −t0 i = 1 ⇒C= 1 W [0] (2.82) So in our new notation the amplitude is given by, h0, t0 |0, −t0 i = W [f ] W [0] (2.83) where, " Z # ˜ ˜ dE f (E)f (−E) i W [f ] = W [0] exp 2 2π −E 2 + ω 2 − i Z i 0 0 0 = W [0] exp dtdt f (t)G(t − t )f (t ) 2 (2.84) (2.85) since W [0] just cancels away in the calculation of a transition amplitude we don’t need to calculate it explicitly. 2.2 Anharmonic Oscillator Up until now the Hamiltonian of our system was the simply quadratic free Hamiltonian with a time dependent perturbation. The eigenstates were left unchanged but the f (t)x term allowed the possibility of a state transitioning into another state. The fact that we were able to complete the square and exactly integrate the exponential was instrumental in finding a simple form for the amplitude. This was a consequence of not having any terms in the exponential that are more then quadratic in x. We now consider modifying the Hamiltonian to include such interactions. Consider the anharmonic oscillator with a driving force: p 2 ω 2 x2 + H0 = 2 2 Ha = (2.86) λx4 4! (2.87) and H = H0 + Ha − f (t)x (2.88) We have some ground state, |Ωi. We add a constant to the Hamiltonian such that Eω = 0. Our discussion earlier was rather general and we can reuse some of our earlier results hΩ, t0 |Ω, −t0 i = where Z Wλ [f ] ≡ Z Dx exp i ∞ −∞ Wλ [f ] ‘ Wλ [0] λx4 dt(L̄0 − ( )(1 − i) + f (t)x) 4! (2.89) (2.90) 2.2. ANHARMONIC OSCILLATOR 15 Due to the quartic term we cannot simply do this integral as before. There is not general way to integrate this Gaussian. What we can do, is do a small λ expansion (perturbation theory). The 1 − i is important in the initial Lagrangian (L̄0 ) to pick out the ground state and relate W to the amplitude of transition from the ground state. However it turns out that the 1 − i has no effect on the anharmonic term. [Q 2: Check Peskin.]. Thus we can take the that’s there to zero. We do a small λ expansion (i.e. Taylor expansion): Z ∞ Z Z −iλ ∞ 4 dt(L̄0 − f x) exp dtx (2.91) Wλ [f ] = Dx exp i 4! −∞ −∞ Z ∞ Z Z iλ ≈ Dx exp i dt(L̄0 − f x) 1 − dtx4 (t) 4! −∞ 2 Z λ 4 4 + dt1 dt2 x (t1 )x (t2 ) + ... (2.92) (4!)2 We can write this discretely as Wλ [f ] ≈ N −1 Z Y " dxi exp i X n i=1 # iλ X 4 ∆t xj 4! j X λ2 2 4 4 + (∆t) xj1 xj2 + ... (4!)2 j ,j ∆t(L̄0,n − fn xn ) 1− 1 (2.93) 2 or compactly as Z ∞ Z Z iλ ∞ 4 dt Dx x (t) exp i dt(L̄0 + f x) Wλ [f ] = W0 [f ] − 4! −∞ −∞ 2 Z Z Z (−i)2 λ 4 4 + dt1 dt2 Dx x (t1 )x (t2 ) exp i dt(L̄0 + f x) + ... 2! 4! (2.94) First consider W0 [f ]. We already know this function since we calculated it last class (we called it W [f ] before we had a perturbation): Z ∞ Z dt(L̄0 + f x) (2.95) W0 [f ] = Dx exp i −∞ " N # N Z Y X = dxi exp i ∆t(L̄0 j + fj xj ) (2.96) i=1 j=1 where fj = f (−T + j∆t) and xj = x(−T + j∆t). We know how to do this integral explicitly (see Eq. 2.85). We hope to rewrite the other terms in terms of W0 [f ]. To do this we define something called a “functional derivative”: δW0 1 ∂W0 ≡ δf (tj 0 ) ∆t ∂fj 0 (2.97) 16 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS In our case this is given by " N # N Z Y X δW0 = dxi ixj 0 exp i ∆t(L̄0,j + fj xj ) δf (tj 0 ) i=1 j=1 Z Z = i Dx x(tj 0 ) exp i dt(L̄0 + f x) (2.98) (2.99) So at least in this case taking the functional derivative of W0 with respect to f at a time tj 0 does the thing you would expect; it pulls out a factor of ix(j 0 ). With this in mind we can write 4 Z 1 δ iλ ∞ dt W0 [f ] Wλ [f ] = W0 [f ] − 4! −∞ i δf (t) 2 Z 4 4 (−i)2 λ 1 δ 1 δ + W0 [f ] + ... (2.100) dt1 dt2 2! 4! i δf1 i δf2 We begin by considering O(λ): 4 Z Z 1 δ i iλ 0 0 0 exp dτ dτ f (τ )G(τ − τ )f (τ ) − W0 [0] dt 4! i δf (t) 2 (2.101) For brevity we write, Z i 0 0 0 exp [...] ≡ exp dτ dτ G(τ − τ )f (τ )f (τ ) 2 We now take the functional derivatives: " # 1 δ 1 ∂ iX exp [...] = exp ∆t∆tG(τi − τj )fi fj i δf (t) i∆t ∂fj 0 2 i,j X 1 1 i 2 X ∆t ( G(τi − tj 0 )fi + G(tj 0 − τj )fj ) exp [...] = i ∆t 2 i j X X 1 = ∆t( G(τi − tj 0 )fi + G(tj 0 − tj )fj ) exp [...] 2 i j X = ∆tG(tj 0 − τj )fj exp [...] (2.102) (2.103) (2.104) (2.105) (2.106) j Z = dτ 0 G(t − τ 0 )f (τ 0 ) exp [...] (2.107) So again the functional derivative does what one would expect. It takes the derivative of the arguement of the exponential and uses the product rule on it. Continuing on gives, 2 Z 1 1 δ exp ... = G(t − t) exp ... + dτ 0 dτ 00 G(t − τ 0 )G(t − τ 00 )f (τ 0 )f (τ 00 ) exp ... i δf (t) i (2.108) 2.2. ANHARMONIC OSCILLATOR 1 δ i δf (t) 1 δ i δf (t) 17 3 Z Z 1 2 0 0 0 exp ... = G(t − t) dτ G(t − τ )f (τ ) + G(t − t) dτ 0 G(t − τ 0 )f (τ 0 ) i i Z 0 00 000 0 00 000 0 00 000 + dτ dτ dτ G(t − τ )G(t − τ )G(t − τ )f (τ )f (τ )f (τ ) exp [...] (2.109) Z 3 = G(t − t) dτ 0 G(t − τ 0 )f (τ 0 ) i Z 0 00 000 0 00 000 0 00 000 + dτ dτ dτ G(t − τ )G(t − τ )G(t − τ )f (τ )f (τ )f (τ ) exp [...] (2.110) 4 Z 3 − 3G (t − t) + G(t − t) dτ 0 dτ 00 G(t − τ 0 )G(t − τ 00 )f (τ 0 )f (τ 00 ) i Z 3 + G(t − t) dτ 0 dτ 00 G(t − τ 0 )G(t − τ 00 )f (τ 0 )f (τ 00 ) (2.111) i Z + dτ 0 dτ 00 dτ 000 dτ 0000 G(t − τ 0 )G(t − τ 00 )G(t − τ 000 )G(t − τ 0000 ) 0 00 000 0000 × f (τ )f (τ )f (τ )f (τ ) exp [...] (2.112) Z 6 2 = − 3G (t − t) + G(t − t) dτ 0 dτ 00 G(t − τ 0 )G(t − τ 00 )f (τ 0 )f (τ 00 ) i Z + dτ 0 dτ 00 dτ 000 dτ 0000 G(t − τ 0 )G(t − τ 00 )G(t − τ 000 )G(t − τ 0000 ) 0 00 000 0000 × f (τ )f (τ )f (τ )f (τ ) exp [...] (2.113) exp ... = 2 We now consider Wλ [0] (i.e. the above expression with f = 0). This is the normalization term for every amplitude. Z ∞ 3λ Wλ [0] = W0 [0] + i W0 [0] G2 (t − t) dt (2.114) 4! −∞ and in summary we have, Z iλ ∞ Wλ [f ] = W0 [f ] − W0 [f ] dt − 3G2 (t − t) 4! −∞ Z 6 + G(t − t) dτ 0 dτ 00 G(t − τ 0 )G(t − τ 00 )f (τ 0 )f (τ 00 ) i Z + dτ 0 dτ 00 dτ 000 dτ 0000 G(t − τ 0 )G(t − τ 00 )G(t − τ 000 )G(t − τ 0000 ) 0 00 000 0000 × f (τ )f (τ )f (τ )f (τ ) (2.115) 18 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS Recall that the transitions amplitudes are, h0, −T |0, T i = Wλ [f ] Wλ [0] (2.116) So to first order the transition amplitude is, Z Z iλ ∞ 6 2 h0, −T |0, T i = 1 − dt − 3G (t − t) + G(t − t) dτ 0 dτ 00 G(t − τ 0 )G(t − τ 00 ) 4! −∞ i Z × f (τ 0 )f (τ 00 ) + dτ 0 dτ 00 dτ 000 dτ 0000 G(t − τ 0 )G(t − τ 00 )G(t − τ 000 )G(t − τ 0000 ) 0 00 000 0000 × f (τ )f (τ )f (τ )f (τ ) (2.117) This calculation was quite tedius and was only to first order. Nevertheless there are two main independent ingredients that we can identify. The order of the interaction (which is always accompanied by an integration of t) and the Green’s function. You may think that the factors f (τ )dτ are also independent but that’s not true. The number of these factors and their arguements are competely determined by the form of the Green’s functions. Furthemore, as we will see for our purposes we will take these to zero at the end of the calculations. So we can now identify the pieces that will always be part of the transition amplitude calculations. These will be our “Feynman Rules”: → −iλ R∞ −∞ dt → G(t1 − t2 ) Furthermore, a time will be given to each intersection point in a diagram and a symmetry factor is also required for each diagram. So for example we can write, Wλ [0] = W0 [0] 1 + 2.3 + + + +... Correlation Functions Now that we have an idea of how to generalize our discussion to more complicated potentials we quickly go back again to consider the simple harmonic oscillator. Consider the following derivative with t1 < t2 : 1 δ 2 W0 [f ] = W0 [0] G(t1 − t2 ) (2.118) i2 δf (t1 )δf (t2 ) f →0 2.3. CORRELATION FUNCTIONS 19 Alternatively we can rewrite this expression in terms of the Lagrangian, Z Z 1 δ 2 W0 = Dx x(t1 )x(t2 ) exp i dtL̄0 i2 δf1 δf2 f →0 Z Z t1 Z T Z Z t2 Dx x2 x1 Dx dx1 = lim Dx dx2 T →∞ t −T t1 2 Z t1 Z t2 Z T dtL̄ + + × exp i −T Z T Dx = lim T →∞ Z t2 Z t2 Z = t1 Dx Dx dx1 x1 −T Z t1 Z t2 Z T dtL̄ + + × exp i dx2 x2 t1 −T Z (2.120) t2 t1 Z (2.119) t1 (2.121) t2 Z dx1 dx2 x1 x2 h0, T |x2 , t2 i hx2 , t2 |x1 , t1 i hx1 , t1 |0, −T i W0 [0] (2.122) where in the second line we have split the integration interval into the different parts. This is trivial in the exponential. However, on the outside extra spatial integrals over dx1 and dx2 appear due to the way the Dx are defined; they have the end points fixed so to split them up you need to have an integral over each midpoint. We now switch to the Heisenberg picture. We choose our reference time −T . By definition then we have |0i ≡ |0, −T i (2.123) likewise we define |xi ≡ |x, −T i . (2.124) With this reference point we can move into the Heisenberg interperpotation. The Heisenberg form of the position operator is, x̂(t) = e−iH0 (t+T ) x̂(−T )eiH0 (t+T ) . We want to think of x1 as an eigenvalue of this operator. In other words xi |xi i = x̂(ti ) |xi i This enables us to get rid of the x integrals: 1 δ 2 W = W0 [0] h0|x̂(t2 )x̂(t1 )|0i i2 δf1 δf2 f →0 (2.125) (2.126) This expression holds if t2 > t1 . If t1 > t2 everything is the same by symmetry but t1 ↔ t2 and so we have, 1 1 δ 2 W = h0|T (x̂(t2 )x̂(t1 )) |0i (2.127) W0 [0] i2 δf1 δf2 f →0 where T denotes the time ordering operator which puts the operator with the earliest time to the right. So we see that correlation functions (key objects in Quantum Field 20 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS Theory!) can be related to W by taking derivatives. In the case of the anharmonic oscillator we have already taken the first derivative of W0 [f ] (see Eq 2.107) we just need to take a second at a different time, 1 1 δ h0|T (x̂(t2 )x̂(t1 )) |0i = W0 [0] i δf1 1 = G(t2 − t1 ) i Z dτ G(t2 − τ )f (τ ) exp [...] 0 0 0 (2.128) f →0 (2.129) Our discussion did not depend much on the form of the Hamiltonian. If we had Eq 2.113 with or without λ = 0 the results would be the same as long as we label our ground state by |Ωi (the ground state of the full system (in our case the anharmonic theory), 1 δ 2 Wλ [f ] hΩ|T (x̂(t1 )x̂(t2 )) |Ωi = Wλ [0] i2 δf1 δf2 f →0 1 1 δ2 = Wλ [0] i2 δf1 δf2 1 iλ W0 − 4! (2.130) Z dt 1 δ i δf 4 ! W0 + ... f →0 (2.131) We already sketched out Wλ [0]. Calculating δ 2 Wλ δf1 δf2 f →0 (2.132) explicitly gives any possible diagram with two external points (two unconnected times, t1 and t2 ). We show this to first order. The expression for Wλ is given in Eq 2.113. Taking its derivatives gives to first order, Z 1 δ 2 Wλ iλ 1 = G(t1 − t2 ) − dt 3iG2 (0)G(t1 − t2 ) i2 δf1 δf2 f →0 i 4! + 12iG(0)G(t − t1 )G(t − t2 ) W0 [0] (2.133) The zero’th order contribution is a Green’s function starting at the initial point and ending at the second (i.e. from t1 to t2 ). The first order contributions are the a double bubble diagram (due to the G2 (0) factor) with the zero’th order Green’s function and the second term is a bubble and Green’s function going in and one going out. Now to find the amplitudes we need to divide by Wλ [0] to get the correlation functions. We calculated this quantity earlier on and to first order is given by, Z 3λi 2 Wλ [0] = W0 [0] 1 − dtG (0) 4! (2.134) 2.3. CORRELATION FUNCTIONS 21 So to first order we have, 1 G(t1 i λ 4! R dt3G2 (0)G(t1 − t2 ) + 12G(0)G(t − t1 )G(t − t2 ) R hΩ|T (x̂(t1 )x̂(t2 )) |Ωi = dtG2 (0) 1 − 3iλ 4! (2.135) R R 1 iλ iλ G(t1 − t2 ) + 4! 4G(0)G(t − t1 )G(t − t2 ) 1 − 4! dt3G2 (0) i R = dtG2 (0) 1 − 3iλ 4! (2.136) Z 1 iλ = G(t1 − t2 ) + 4G(0)G(t − t1 )G(t − t2 ) (2.137) i 4! To all orders we have, + − t2 ) + + = + + 1 + + + + + + ... + ... + ... We now briefly summarize what we have done. We have the correlation functions for anharmonic oscillator (x4 ): X 1 δ 2 Wλ [f ] hΩ|T (x̂(t1 )x̂(t2 )) |Ωi = = connected Feynman diagrams Wλ [0] δf1 δf2 f →0 (2.138) where Wλ [f ] is the generating functional and fi ≡ f (ti ). We found the propagator (to lowest order) is given by, R e−iE(t1 −t2 ) → G(t1 − t2 ) = dE 2π −E 2 +ω 2 +i and the vertex is given by x R → −iλ dt R → −iλ dt. These rules are the space time Feynman rules. By Fourier transforming both sides we have the momentum space rules: → 1 −E 2 +ω 2 +i P → −iλδ( i Ei ) 22 CHAPTER 2. PATH INTEGRALS IN QUANTUM MECHANICS 2.A Jacobian of the Fourier Transformation The discrete Fourier transform is given by 1 X −2πink/N x̃k = √ e xn N n (2.139) The Jacobian is then the matrix Jn,k = 1 ∂ x̃k = √ e−2πink/N ∂xn N (2.140) We want to show that the absolute value of the determinate of this matrix is equal to 1. Note that the determinate any orthonormal matrix is ±1 since: 1 = det A−1 A = det AT det A = (det A)2 (2.141) thus if we can show that J is orthonormal then |det J| = 1. A matrix is orthonormal if its rows and columns are orthonormal. We need to show that e−2πij/N −2πi2j/N 1 2πik/N 2πi2k/N e (2.142) hJk |Jj i = e ,e , ..., e2πik .. N . e−2πij 1 2πi(k−j)/N = e + e2πi2(k−j)/N + ...e2πi(k−j) (2.143) N is equal to 0 for all row and column numbers, k − j 6= 0. Let k − j = m ∈ Z. Then we have, N 1 X 2πijm/N = e N j=1 N/2m 1 X 2πijm/N e + = N j=1 = 1 N 2m−2 X (`+1)N/2m X (2.144) 2N/2m X N X e2πijm/N + ... + j=N/2m+1 e2πijm/N (2.145) j=(2m−1)N/2m+1 e2πijm/N (2.146) `=0 j=`N/2m+1 shifting the exponential such that each sum starts at j = 1 gives, (`+1)N/2m X j=`N/2m+1 N/2m 2πijm/N e `πi →e X j=1 e2πijm/N (2.147) 2.A. JACOBIAN OF THE FOURIER TRANSFORMATION 23 Thus we finally get that the dot product between an arbitrary row and column is given by, ! N/2m 2m−2 X X 1 = e2πijm/N e`πi (2.148) N j=1 `=0 | {z } =0 =0 (2.149) There is one loophole. If k − j = m = 0 then we can’t divide by m. In this case we get, N N 1 X 2πijm/N 1 X e = =1 N j=1 N j=1 (2.150) Thus hJj |Jk i = δjk and the Jacobian is indeed orthonormal and thus |det J| = 1 as required. (2.151) Chapter 3 Quantum Field Theory 3.1 Free Klien Gordan Field To keep things simple we consider one non-interacting real scalar field (real Klien Gordan theory). We work in 1 + 1 dimensions. Since we are working in 1 dimension we have discretize space φ(x, t) −−−−−→ φ = {φ1 (t), φ2 (t), ..., φN (t)} (3.1) x ` z}|{ √ √ where φ` (t) = φ( `∆ , t) ∆ and the ∆ is a normalization factor. The equations of motion are φ + m2 φ = 0 (3.2) To discretize them we need to find the second derivative in terms of finite differences. We have, ∂ φ(x + h/2) − φ(x − h/2) ∂ 2φ = lim 2 ∂x ∂x h→0 h 1 = lim 2 (φ(x + h) − 2φ(x) + φ(x − h)) h→0 h 1 → 2 (φ`+1 − 2φ` + φ`−1 ) ∆ So the discrete equations of motion are, 2 1 2 φ̈` + + m φ − (φ`+1 + φ`−1 ) = 0 ` ∆2 ∆2 R Since we are working in one spatial dimesion we have L = dxL with 1 1 L = (∂µ φ)(∂ µ φ) − m2 φ2 2 2 24 (3.3) (3.4) (3.5) (3.6) (3.7) 3.1. FREE KLIEN GORDAN FIELD 25 where our metrix is g µν = 1 0 0 −1 (3.8) Note that this Lagrangian is equivalent to having N of Harmonic oscillators (no sources or interaction terms) with one at each ` value. Discretizing our Lagrangian gives ! 2 N X φ − φ 1 `+1 ` L= φ̇2` − − m2 φ2` (3.9) 2 ∆ `=1 = N X 1 `=1 where 2 φ̇2` .. V ≡ 0 . N X 1 − φ` V`m φm 2 `,m=1 .. − ∆22 . − m2 0 .. . 0 .. . 0 0 0 ... 0 1 0 0 ∆2 .. .. . 0 . ... ... 0 0 0 (3.10) (3.11) is a coupling matrix between the fields at different spatial points. Note the following oscillator correspondence: Klien-Gordan t x φ` φ̂` (t) J (x,t) NR oscillator t label ` x` (t) x̂` (t) f` (t) (for the time being we use hats to denote operators). We work in the Schrodinger picture: n o φ̂ = φ̂1 , ..., φ̂N (3.12) √ and limN →∞ φ̂/ ∆ = φ̂(x). A definite field state is given by |φ, ti : φ̂ |φ0 , ti = φ0 |φ0 , ti (3.13) we are interested in a transition between two states hφf , T |φi , −T i. Each state consists of a direct product of states, one at each point in space. Thus finding the braket between two field configurations is equivalent to a product of brakets. In other words we can write, hφf , T |φi , −T i = hφ01 , T |φ1 , −T i hφ02 , T |φ2 , −T i ... hφ0N , T |φN , −T i (3.14) 26 CHAPTER 3. QUANTUM FIELD THEORY with hφ0` , T |φ` , −T i being the product of oscillator states at ∆`. But we already know what the value of each of these brakets is for a singlet oscillator. Now we are doing the same calculation but for many oscillators. It is given by (c.f. equation 2.32) " M −1 # Z N −1 M −1 X Y Y dφ`,j exp i ∆tL [φn ] (3.15) hφf , T |φi , −T i = C n=1 `=1 j=1 φ =φ `,0 `i φ`,M =φ`f where the label ` refers to spatial points and the j denotes to time slices. Furthermore, we have introduced the notation dφ`j ≡ dφ` (tj ). We now take the limit of N → ∞ and M → ∞: Z Z Z ∞ T hφf , T |φi , −T i = C Dφ exp i dt where Z Dφ ≡ N −1 M −1 Y Y dxL [φ(x, t)] , (3.16) −∞ T dφi,j . (3.17) `=1 j=1 The interpretation is similar to before. The amplitude of going from one field configuration to another is equal to the sum over all possible field configurations. Note the extension to multidimensional derivation is completely analogous. For 3 spatial dimensions: Z T Z Z ∞ 3 hφf , T |φi , −T i = C Dφ exp i dt d xL [φ(x, t)] (3.18) −T 3.2 −∞ External Force √ Consider some one dimensional external force, J(x, t) which discretized is given by J` (t) ∆ (one force for each point and since this is just a field it discretizes in the same way as the field φ). We take the force to turn on at −t0 and end at t0 . We want to compute the probability for the system to stay in the ground state,|0i. We take the energy of the state to be 0 (E0 = 0). Everything we did for a single oscillator can be done for N oscillators. Each of these oscillators has a force which is dependent on time. We denote the force by (J = {J1 (t), J2 (t), ..., JN −1 (t)}) (here Ji (t) is the f (t) for each oscillator). Y W [Ji ] W0 [J(t)] ≡ lim (3.19) h0, t0 |0, −t0 i = lim →0 W0 [0] →0 W [0] 0 i where (c.f. Eq. 2.55) W0 [J(t)] = lim T →∞ N Y Z Z Dφj exp i T −T j=1 φj (−T )= φj (T ) + N X `=1 X N N X 1 2 1 dt φ̇` (1 + i) − (1 − i)φ` V`n φn 2 2 `=1 `,n=1 φ` J` (t) (3.20) 3.2. EXTERNAL FORCE 27 here we have added the 1 + i and 1 − i factors in an analagous way to our discussion of Non-relativistic QM. While they are present for notational convenience we will just ignore them. More typically we will write Z Z N −1 Y lim Dφ` → Dφ (3.21) T →∞ j=1 φj (−T )= φ` (T ) and dt dx → d2 x such that Z Z 1 2 2 2 1 µ µ µ W0 [J(t)] = Dφ exp i d x (∂µ φ)(∂ φ) − m φ + J(x )φ(x ) 2 2 (3.22) As before then if we can do the Gaussian integral then we are set. The way you do it is to diagonalize the argument so you just a product of individual integrals. Since the term which gives us trouble connects different space points (the derivative) we can diagonalize it using a Fourier transform, Z Z dk 1 ˙ 2 1 2 2 1 2 2 ˜∗ φ̃ − k φ̃ − m φ̃ + J φ̃ W0 [J(t)] = C Dφ̃ exp i dt (3.23) 2π 2 2 2 Z Z dk 1 ˙ 2 1 2 2 ∗ φ̃ − ω(k) φ̃ + J˜ φ̃ = C Dφ̃ exp i dt (3.24) 2π 2 2 where Z dk −ikx φ(x, t) = e φ̃(k, t) , 2π ω(k)2 ≡ k 2 + m2 Z J(x) = dk −ikx ˜ e J(k, t) 2π (3.25) and we have changed our integration measure using, N −1 Z Y `=1 Dφ` = N −1 M −1 Z Y Y dφ`m = J `=1 m=1 N −1 M −1 Z Y Y dφ̃j,m (3.26) j=1 m=1 where J is a Jacobian which we can forget about since it just adds to the overall constant. Thus we can write N −1 Z N −1 Z Y Y Dφ` → Dφ̃j (3.27) `=1 j=1 Now we have (we don’t need to worry about the factor of 2π as it goes to the constant in front), Z ∞ N Z Y 1 ˙ 2 1 2 2 ∗ ˜ W0 [J(t)] = C Dφ exp i dt |φ̃k | − ωk |φ̃k | + φ̃k Jk (3.28) 2 2 −∞ k=1 =C N Y k=1 h i W0SHO J˜k (t), ωk (3.29) 28 CHAPTER 3. QUANTUM FIELD THEORY where W0SHO is a quantity that we already calculated earlier, Z i SHO 0 0 0 W0 [f, ω] = exp dt dt f (t)G(t − t )f (t ) 2 # " Z dE f˜(E)f˜(−E) i = exp 2 2π −E 2 + ω 2 − i (3.30) (3.31) We can finally write " N iX W0 [J] = W0 [0] exp 2 k=1 Z dE J¯k (E)J¯k (−E) 2π −E 2 + ωk2 − i # (3.32) R R P ikj −iEt where J¯k (E) = dtJ˜k (t)e−iEt = dt N . We have xj = ∆j and we j=1 e Jj (t)e define the Fourier conjugate variable to x as p and it must be given by k/∆. Taking the continuum limit gives Z Z √ −iEt dx ikj ¯ Jk (E) = dt (3.33) e J(x, t) ∆ e ∆ Z 1 =√ dt dxJ(x, t)ei∆jp e−iEt (3.34) ∆Z 1 d2 xJ(x, t)e−i(Et−xp) (3.35) =√ ∆Z 1 µ =√ d2 xJ(x, t)e−ipµ x (3.36) ∆ 1 ˜ E) (3.37) ≡ √ J(p, ∆ R dp P ˜ ˜ now k → ∆ 2π and (Note: We got J(−p) by relating J(−p) to J˜∗ (p). " Z # Z ˜ ˜ i dp dE J(p)J(−p) W0 [J] = W0 [0] exp (3.38) 2 2 2π 2π −E + p2 + m2 − i In 3 + 1 dimensions we have " i W0 [J] = W0 [0] exp − 2 Z ˜ J(−p) ˜ d4 p J(p) (2π)4 p2 − m2 − i # we can equally well write this equation as Z i 4 4 0 0 0 W0 [J] = W0 [0] exp − d x d x J(x)G(x − x )J(x ) 2 (3.39) (3.40) where the Green’s function is defined by (x + m2 − i)G(x − x0 ) = iδ 4 (x − x0 ) ⇒ Z 0 d4 p e−ip·(x−x ) 0 G(x − x ) = (3.41) (2π)4 p2 − m2 + i 3.3. INTERACTING THEORY 3.3 29 Interacting Theory Suppose we turn on an interaction term H = Z Wλ [J] = Dφ exp i Z iλ = W0 [J] − 4! Z iλ = W0 [J] − 4! Z iλ = W0 [J] − 4! Z d3 x 4!λ φ4 (x) → R λ 4 ` 4! φ` . P We obtain λφ4 d x L̄0 − + J(x)φ 4! Z Z 4 4 4 d x Dφ φ (x) exp i d x L̄0 + Jφ + ... 4 Z Z 1 δ 4 4 d x Dφ exp i d x L̄0 + Jφ + ... i δJ(x) 4 1 δ 4 W0 [J] + ... dx i δJ(x) 4 This equation has the exact same form as for a single harmonic oscillator. We have Z iλ Wλ [0] = W0 [0] − d4 x − 3G(x − x)2 + ... 4! (3.42) (3.43) (3.44) (3.45) (3.46) As before every term is made up of two types of terms. We have → −iλ x1 R dx x2 → G(x1 − x2 ) In Quantum Mechanics we had δ 2 Wλ [f ] hΩ|T (x̂(t1 )x̂(t2 )) |Ωi = Wλ [0] δf (t1 )δf (t2 ) f →0 1 (3.47) In field theory we have [Q 3: show] δ 2 Wλ [J] hΩ|T (φ̂(x1 )φ̂(x2 ))|Ωi = Wλ [0] δJ(x1 )δJ(x2 ) J→0 1 (3.48) (all we did was replace the quantum mechanics source to the field theory source). One can then develop perturbation theory for fields as we did for the harmonic oscillator. We get hΩ|T (φ̂(x1 )φ̂(x2 ))|Ωi = + + + ... 30 CHAPTER 3. QUANTUM FIELD THEORY 3.4 LSZ Reduction [Q 4: Maybe insert a proof here instead of just quoting the final result.] The correlation functions are defined as, (n) G (x1 , ..., x2 ) ≡ hΩ|T φ̂(x1 )...φ̂(x2 ) |Ωi (3.49) The Fourier transform is G̃(n) (p1 , ..., pn ) = Z n Y ! d4 xi eipi xi G(n) (x1 , ..., xn ) (3.50) i=1 The LSZ formula says that (n) G̃ (p1 , ..., pn ) = n Y j=1 ! √ i Z (out hp3 ...pn |p1 p2 iin + permutations) p2j − m2 + i + (terms with < n poles) (3.51) (3.52) where the product is the product over the external lines. The bottom line is once you know that correlators you can then get all the matrix elements. The scattering problem is reduced to a problem of calculating correlators. 3.5 Statistical Mechanics and Path Integrals Suppose you have a harmonic oscillator that’s in equilibrium with it’s environment. We want to treat this oscillator quantum mechanically. If you studied quantum statistical mechanics then you know that the fundamental object in statistical mechanics is the partition function which is defined as, h i X −β Ĥ Z = tr e = hn|e−β Ĥ |ni (3.53) n where β = 1/kB T . Let’s define β = it0 (since β is real we know that t0 is imaginary). With this we know that X Z= hn|e−iĤt0 |ni (3.54) n = X U (n, t = 0 → n, t = t0 ) (3.55) n Where U (n, t = 0 → n, t = t0 ) is the transition amplitude for a state to go to itself after time t0 . Using our path integral formalism we can write this as Z t0 Z (3.56) Dx exp i dtL 0 x(0)=x(t0 ) i BT t0 →− k In other words computing the partition function at some temperature is given by a path integral with imaginary time. Chapter 4 Quantization of Electromagnetism 4.1 Classical Electromagnetism The electromagnetic (EM) field is Aµ (x) = (φ(x), A(x)) (4.1) with E = −∇φ − A and B = ∇ × A. The field strength tensor is defined as Fµν = ∂µ Aν − ∂ν Aµ 0 Ex Ey Ez −Ex 0 Bz −By = −Ey −Bz 0 Bx −Ez By −Bx 0 (4.2) (4.3) The gauge symmetry is the observation that there is some redundancy in our description. The physics is given by the electric and magnetic fields. One can show that the only transformation you can do that won’t change the EM fields is Aµ → Aµ + ∂µ ξ(x) (4.4) where ξ(x) is an arbitrary function. Under this gauge symmetry, Fµν → Fµν (4.5) We want to get the equations of motion for this system (Maxwell equations) using the Lagrangian density.One possibility is to make the Lagrangian out of Fµν . We want to have a gauge invariant Lagrangian so we use Fµν . To make a Lorentz scalar out of Fµν we take the product of two Fµν (Alternatively we could take the trace, but Fµν is traceless). This gives us two possible terms 1 Lgeneral = − Fµν F µν + µναβ F µν F αβ 4 31 (4.6) 32 CHAPTER 4. QUANTIZATION OF ELECTROMAGNETISM These are the only two non-renormalizable terms. It also turns out that the second term violates parity. Since we know that electromagnetism does respect parity we don’t have the second term in EM. Thus 1 L = − Fµν F µν 4 1 = − (∂µ Aν − ∂ν Aµ ) (∂ µ Aν − ∂ ν Aµ ) 4 1 = − ∂µ Aν (∂ µ Aν − ∂ ν Aµ ) 2 (4.7) (4.8) (4.9) Note that a m2 Aµ Aµ term is not gauge invariant! So we don’t have such a term in our Lagrangian. The Euler Lagrange equations give δL =0 δ∂µ Aν Aν − ∂ν (∂ µ Aµ ) = 0 ∂µ (4.10) (4.11) Note that we have yet to make a gauge choice. We now fix the gauge to our convenience. Pictorially this is given by Aµ Aµ + dξµ gauge-fixing curve gauge orbits The gauge orbits are lines which show all the possible values of Aµ along some gauge choice. All points along the gauge orbit correspond to physically equivalent systems. All points along the gauge fixing curve correspond to the different values of Aµ with a single gauge. One of the simplest gauge choices is the Lorentz gauge: ∂µ Aµ = 0. The solutions of 4.11 in the Lorentz gauge are plane waves, Z Aµ = d4 k µ e−ik·x (2π)4 (4.12) with k 2 = 0 and µ is the polarization vector (and also the Fourier transform of the vector field). These are just wave solutions with no mass. To make sure that we are still in the 4.1. CLASSICAL ELECTROMAGNETISM 33 Lorentz gauge we require, ∂ µ Aµ = 0 Z (4.13) 4 d k µ −i(k·x) ik µ e =0 (2π)4 ⇒ k·=0 (4.14) (4.15) by uniqueness of the Fourier transform. As an example consider a photon moving the z direction: kµ = (E, 0, 0, E) (4.16) Our when dotted into kµ must give zero. A basis of polarization vectors that obeys the condition above is, 1 (1) = √ (0, 1, +i, 0) 2 1 (2) = √ (0, 1, −i, 0) 2 1 (3) = √ (1, 0, 0, 1) 2 (4.17) (4.18) (4.19) (4.20) (1) and (2) correspond to left and right handed circular polarized waves. The third polarization vector corresponds to a longitudinally polarized wave. However we know that such a wave does not exist! This extra possible state arose because we didn’t full fix the gauge of our field. To see this consider some gauge transformed field µ . This gauge transformed field also satisfies the Lorentz gauge if the field connecting the two gauge fields obeys, µ = Aµ + ∂µ ξ ⇒ξ = 0 (4.21) (4.22) To get rid of this extra freedom consider the component of the Fourier transform of ξ that matches the wave. We have Z d4 q ˜ ξ(q)e−iq·x (4.23) ξ(x) = (2π)4 ˜ and we consider the particular contribution ξ(k) with k 2 = 0. We want to know how adding this ξ to Aν changes the solution. We have, ˜ 0ν (k) = ν (k) + kν ξ(k) (4.24) 0(3) ˜ = −(3) By choosing ξ(k) = 0. If one computed the magnetic and electric ν we can get fields corresponding to this polarization they would turn out to be zero. This is an unphysical solution. With these two conditions we can fully fix the gauge. 34 4.2 CHAPTER 4. QUANTIZATION OF ELECTROMAGNETISM Quantization To zeroth order we treat Aµ as a set of 4 fields. The most naive generalization of our discussion on scalar fields is Z Z µ 4 W0 [Jµ ] = DA exp i S̄ [A] + Aµ (x)J (x) d x (4.25) R 4 where DA = DA0 DA1 DA2 DA3 and S = d x − 41 Fµν F µν . Note however that we wrote a term Aµ (x)J µ (x) which does not appear gauge invariant. Applying a gauge transformation on the source term gives, Z Z Z µ 4 µ 4 A Jµ d x → A Jµ d x + ∂ µ ξJµ d4 x (4.26) Z Z µ 4 = A Jµ d x − ξ∂ µ Jµ d4 x (4.27) If we restrict our attention to sources with no divergence, i.e. that obey ∂ µ Jµ = 0, then the extra term is in fact gauge invariant. We are going to want to evaluate this path integrals explicitly. At first sight this seems feasible since the kinetic term is quadratic and our integral appears Gaussian. However it turns out that there are small technical problems which we will discuss. We take a short detour into how we treat photons with spin s using canonical quantization. Asµ ∼ sµ e−ip·x (4.28) with p2 = 0 or equivalently, p0 = ± |p|. We have a restriction in the Lorentz Gauge on our given by · p = 0 and 0 = 0. We have an added the superscript for the polarization vector for spin, s = 1, 2. We found a particular solution. The general solution is a linear combination of the solutions we found: ! Z 2 d3 p X ip·x As (p)sµ e−ip·x + As∗ (p)s∗ (4.29) Aµ = µ e (2π)3 s=1 p0 =|p| p0 =|p| This looks identical to the KG field only we now have a spin index s and we have polarization vectors. One can go about performing the Canonical Quantization method: 1 As (p) → √ asp 2p0 , As∗ (p) → √ 1 s† a 2p0 p (4.30) where a, a† are creation and annihilation operators respectively. We would then define the vacuum, |0i such that asp |0i (4.31) for all p, s. There is then a one-photon state given by as† p |0i = |p, si (4.32) 4.3. PATH INTEGRALS IN QED 4.3 35 Path Integrals in QED Recall that Z 1 4 µν µ W0 [J ] = DA exp i d x(− Fµν F +Aµ J ) (4.33) 4 | {z } F term where we restrict our attention to sources such that ∂µ J µ = 0. We can write the exponential argument as Z 1 4 ∂µ Aν (∂ µ Aν − ∂ ν Aµ ) (4.34) F term = dx − 2 Z 1 = d4 x Aν (g µν − ∂ µ ∂ ν ) Aµ (4.35) 2 Z µ where we have integrated by parts to play with the derivatives. Now (here we write the Fourier Transform of the vector potential as Ã(k)) Z d4 k ik·x Aµ (x) = e õ (k) (4.36) (2π)4 Thus we have d4 k d4 k 0 i(k+k0 )·x dx e Ãν (k 0 )(−k 2 g µν + k µ k ν )õ (k) 4 4 (2π) (2π) Z 4 1 dk =− Ãν (−k)(k 2 g µν − k µ k ν )õ (k) 4 | {z } 2 (2π) 1 F term = 2 Z 4 Z (4.37) (4.38) Dµν The second term is given by Z 4 µ d xAµ J = Z d4 k õ (k)J˜µ (−k) (2π)4 Recall that for a scalar field the source term was, Z d4 k ˜ φ̃(k)J(k) (2π)4 (4.39) (4.40) and the kinetic term was Z d4 k φ̃(k)(k 2 − m2 + i)φ̃(−k) 4 (2π) (4.41) and we completed the square by changing variables to −1 ˜ φ̃0 (k) = φ̃(k) + k 2 − m2 + i J(k) (4.42) 36 CHAPTER 4. QUANTIZATION OF ELECTROMAGNETISM With this in mind, we define a new à such that Ã0µ = õ + (D−1 )µν J˜ν (4.43) and then our Lagrangian is given by, 1 1 L = Ã0µ Dµν Ã0ν − J˜µ (D−1 )µν J˜ν 2 2 So we have (4.44) 4 ν d k µ −1 ˜ ˜ W0 [J ] = W0 [0] exp (4.45) J (D )µν J | {z } (2π)4 Green’s function where we have our Gaussian integral included in our definition of W0 [0]. Everything seems to have worked flawlessly. The subtlety here is that Dµν is not invertible! To prove this we study the eigenvalues of the D matrix. One such eigenvector is kν = (k0 , −k) with an eigenvalue of zero: k 2 g µν − k µ k ν kν = 0 (4.46) ν i − 2 Z Since the determinant is the product of the eigenvalues it must be zero, implies that D isn’t invertible. This problem arose because while we are integrating over the gauge fixed field we are also integrating over gauge inequivalent fields. We haven’t done anthing to set these contributions to zero. We need to find a way such that we integrate only about inequivalent fields. This corresponds to a single gauge-fixing curve (All the Aµ values for a single gauge choice): The procedure is called the Faddeev-Popov procedure. Suppose we have some field A0,µ . We have a gauge transformed field parameterized by α that is given by, Aα,µ = A0,µ + ∂µ α (4.47) We define a gauge transformation function as G [A]. For the Lorentz gauge we have G [A] = ∂µ Aµ = 0 (4.48) 4.3. PATH INTEGRALS IN QED 37 We will also later use the Generalized Lorentz Gauge given by G [A] = ∂µ Aµ − ω(x) = 0 (4.49) In order to ensure we stay on our gauge we introduce a functional delta function (this can be thought of as a product of delta functions, one at each spacetime point): Z Dαδ (G [Aα,µ ]) (4.50) By inserting this into our path integral the path integral will be zero unless G [A] = 0. However, we can’t insert this into the path integral unless it’s equal to 1. Recall that if f (x) has one zero at x0 then, Z df (x) δ (f (x)) = 1 (4.51) dx dx x=x0 We want to generalize this to instead of having f (x) we have, g(a) for vectors of arbitrary size. To do this consider the Taylor expansion of g around its root (we assume it only has one root, a0 ): X ∂gi (aj − a0,j ) + ... (4.52) gi (a) = gi (a0 ) + ∂aj a0 j We want to insert this into a delta function, δ (n) (g(a)). This will only be nonzero near a = a0 . Thus we have, Y δ (g(a)) = δ (gi (a)) (4.53) i ! = Y i δ X Jij (aj − a0,j ) where Jij is the Jacobian matrix defined by Jij ≡ ∂gi . ∂aj a0 We have, ! ! δ (g(a)) = δ X (4.54) j J1j (aj − a0,j ) δ j X J2j (aj − a0,j ) ... (4.55) j We now use the identiy, δ(α(a − a0 )) = δ(a − a0 ) |α| (4.56) We choose to isolate each delta function in Eq. 4.55 for a different aj : δ (g(a)) = δ(a1 − a0,1 ) δ(a2 − a0,2 ) ... |J1,1 | |J2,2 | (4.57) 38 CHAPTER 4. QUANTIZATION OF ELECTROMAGNETISM If we take the Jacobian matrix to be greater then zero then we have the product: (J1,1 J2,2 ..)−1 = 1 det J (4.58) where we have used the fact that the determinant of J is independent of a unitary transformation. So we finally have, Z Y ! ∂gi (n) dai δ (g(a)) det =1 (4.59) ∂a j i where it is understood that the Jacobian matrix is evaluated at the root of g. We write the continuum generalization of this equation as, Z δG(Aα ) α =1 Dα(x)δ (G(A )) det δα (4.60) We consider the generalized Lorentz gauge: G(Aα ) = ∂µ Aµ + 1e ∂µ α(x) − ω(x). In this case it is straight forward to calculate the determinate explicitly. The discretized gauge transformation is (we work here in 1D just for brevity however, its easy to extend the results to four dimensions. Gi (αi ) = 1 (αi+1 − 2αi + αi−1 ) + (∂µ Aµ0 − ω(x))i ∆2 e | {z } Indepdent of αj (4.61) Taking the derivative we have, 1 ∂Gi (αi ) δij = 2 (δi+1,j − 2δi,j + δi−1,j ) δij ∂αj ∆e (4.62) which doesn’t have any A dependence so it is already evaluated at the root of G. Thus the determinant has no A or α dependence and we can write: Z Z Z δG µ α 4 µ W0 [J ] = det Dα DAδ (G [A ]) exp i S + d xAµ J (4.63) δα However J, DA, and S are gauge invariant thus we can change our equation to the gauge shifted Aαµ : Z Z Z δG µ α α α 4 α µ W0 [J ] = det Dα DA δ (G [A ]) exp i S [A ] + d xAµ J (4.64) δα We now see that the α integral is trivial, it is just the volume of the α space (we include the determinant in what we call V ol): Z Z µ 4 µ d xAµ J (4.65) W0 [J ] = V ol × DAδ (G [A]) exp i S [A] + 4.3. PATH INTEGRALS IN QED 39 where we have redefined our integration variable from Aα → A. We use the Generalized Lorentz Gauge without α as we redefined, Aα → A. (we ignore the constant in front since as we saw for a scalar field, it cancels away when calculating observables): Z Z µ µ 4 µ (4.66) W0 [J ] = DAδ (∂µ A − ω(x)) exp i S [A] + d xAµ J Z Z Z R 2 −i d4 x ω2ξ µ 4 µ = N (ξ) Dωe DAδ (∂µ A − ω(x)) exp i S [A] + d xAµ J | {z } 1 (4.67) where ξ is just a constant and we are allowed to through in this integral since we know our W0 is independent of ω (it is gauge invariant). Thus we have Z Z Z µ 2 4 µ µ 4 (∂µ A ) +i d x (L + Aµ J ) (4.68) W0 [J ] = N (ξ) DA exp −i dx 2ξ where we have performed our ω integral (which is trivial due to the delta function). Thus our “gauge-fixing procedure” is equivalent to taking L → L − 2ξ1 (∂µ Aµ )2 . Recall that we had Z d4 k 1 S= õ (−k) k 2 g µν − k µ k ν Ãν (−k) (4.69) 4 (2π) 2 Adding this extra term we have (note that we can replace 2ξ1 (∂µ Aµ )2 → − 2ξ1 (Aµ ∂µ ∂ν Aν )) Z d4 k 1 1 2 µν µ ν Ãν (k) (4.70) S= õ (−k) k g − k k 1 − (2π)4 2 ξ {z } | Dξµν We now have det Dξµν 6= 0. We finally have Z 1 d4 k ˜µ −1 µ ν ˜ W0 [J ] = W0 [0] exp − J (k)(iDξ )µν J (−k) 2 (2π)4 (4.71) where i kµ kν gµν − (1 − ξ) 2 ≡ Gµν = − 2 (4.72) k + i k This is known as the photon propagator. To see that this in indeed the inverse of Dµν just multiply it out directly: 1 1 kµ kν −1 µν 2 ν Dξ Dξ,νρ = 2 gµν − (1 − ξ) 2 k gνρ − k kρ 1 − (4.73) k k ξ 1 1 1 2 µ µ µ µ = 2 k δ ρ − k kρ 1 − − (1 − ξ)k kρ + 1 − (1 − ξ) k kρ k ξ ξ (4.74) Dξ−1 µν = δ µρ (4.75) 40 CHAPTER 4. QUANTIZATION OF ELECTROMAGNETISM Naively one would say that the propagator result depends on ξ. How can this be since we said that ξ is a mathematical convenience? Being more careful, one notices ˜ 2 . However these are zero by our choice of source that all the ξ terms look like ξ(k · J) µ µ (∂µ J = 0 ⇒ kµ J˜ = 0). Any choice of ξ is valid. There are a few popular choices. One popular choice is ξ = 1. This is called the Feynman gauge. Another popular gauge is called the Landau gauge (ξ = 0). Alternatively, you can keep ξ as a parameter and watch it’s dependence drop at the end. There is also something interesting in the sign of the photon propagator. The spatial components of Aµ propagate with the same sign as the scalar field propagator, however the time-like component has a relative negative sign. This is a consequence of the fact that A0 doesn’t actually propagate since the momentum conjugate to A0 is zero: 1 ∂ ∂L =− Fµν F µν 0 ∂0 A 4 ∂0 A0 =0 4.4 (4.76) (4.77) Correlation Functions Consider the correlation function: h0|T µ (x1 )Âν (x2 ) |0i = 1 1 δ 2 W0 [J] 2 µ ν W0 [0] i δJ (x1 )δJ (x2 ) J→0 1 = Gµν (x1 − x2 ) i (4.78) (4.79) However we constructed our J µ such that ∂µ J µ = 0. Having a δJ µ implies we can “wiggle” J µ in any direction we want but in fact we have some restrictions. We now set out to find what those are Z Aµ (x) = d3 k õ (k)e−ik·x (4.80) and Z 4 µ Z d xAµ J → d4 k õ J˜µ (4.81) Now Z d4 x1 d4 x2 ... eik1 x1 ... h0|T (Aµ (x1 )...) |0i δ δ ∝ ... W0 J˜ δ J˜µ (k) δ J˜ν (k) ˜ J→0 G̃(k1 , ...) = (4.82) (4.83) where G̃ is the Fourier transform of the correlator. This can be done for any k. We take k 2 = 0 with k = (E, 0, 0, E). 3 X õ (k) = sµ (4.84) s=0 4.4. CORRELATION FUNCTIONS 41 with a basis 1 0µ = √ (1, 0, 0, −1) 2 1 µ = (0, 1, 0, 0) 2µ = (0, 0, 1, 0) 1 3µ = √ (1, 0, 0, 1) 2 Now J˜µ (k) = X J s sµ (4.85) and kµ J˜µ = 0 ⇒ J 0 = 0 so à · J˜ = 3 X As J s sµ · sµ = A1 J 1 + A2 J 2 (4.86) (4.87) s=1 Thus what we can find out is the correlation function between A1 and A2 . We can only know h0|T A1 (k)... |0i (4.88) h0|T A2 (k)... |0i (4.89) With this we can write down the LSZ reduction formula for photons: p 2 i Zγ X s h(k, s) ...i +... G̃µ (k, ...) = 2 k + i s=1 µ | {z } S matrix (4.90) Chapter 5 Fermions with Path Integrals Electrons are fermions, thus you can’t have more then one electron in a single state. That’s why we never thought of electrons as fields until now. On the other hand photons are bosons and they can all be in the same state. That’s why we had to think of them as fields right from the start. However the concept of path integrals requires one to think of fermion fields. 5.1 Grassman Numbers It turns out the correct way to think about fermion fields is through the concept of Grassman numbers, θ, η. They are not like ordinary numbers in many ways. However they still obey rules of addition and multiplication. For Grassman numbers the addition is typical (commutative). However the multiplication of such number is anti commutative. i.e. η+θ =θ+η ηθ = −θη (5.1) (5.2) θθ = −θθ ⇒θ2 = 0 (5.3) (5.4) f (θ) = a + bθ = a + θb (5.5) This implies that Furthermore, where a and b are ordinary numbers. Note that this is analogous to a Taylor expansion however we have θ2 = 0 so all higher order terms vanish. These numbers are non-intuitive and one needs to be careful when working with them. You can define derivatives of Grassman numbers such as df =b dθ 42 (5.6) 5.1. GRASSMAN NUMBERS 43 and integration Z Z dθf (θ) = dθf (θ + η) (5.7) The definition of the integral implies that (Exercise: use equation 5.5 to derive the first equation. Maxim isn’t sure how to derive the second equation) Z dθ = 0 (5.8) Z dθθ = 1 (5.9) This means that we have Z df dθ One can generalize this discussion to complex Grassman Numbers, dθf (θ) = b 1 θ∗ = √ (θ1 + iθ2 ), 2 and 1 θ∗ = √ (θ1 − iθ2 ) 2 Z Z dθθ = 1 = θ∗ θ∗ As an example consider integration over a Gaussian Z Z ∗ −iθ∗ bθ dθ dθe = dθ∗ dθ (1 − θ∗ bθ) Z = dθ∗ dθ (θθ∗ b) =b This should be compared to the result for real numbers, r Z 2π − 12 bx2 dxe = b (5.10) (5.11) (5.12) (5.13) (5.14) (5.15) (5.16) We now move on to multidimensional integrals. First consider the real variable integral: ! Z 1X dx1 dx2 ... exp − xi Aij xj (5.17) 2 i,j Suppose that A can be diagonalized by some orthogonal transformation, A = OT AD O (5.18) xT Ax = (Ox)T AD (Ox) (5.19) = x0T AD x0 (5.20) which implies that, 44 CHAPTER 5. FERMIONS WITH PATH INTEGRALS where we have defined, Ox ≡ x0 . The Jacobian for this transformation is, Jij ≡ ∂x0i = Oij ∂xj (5.21) Thus we have, Z 1X 0 = det O dx1 ; dx2 ; ... exp − xi AD,ii x0j 2 i Z Y 1 = det O dxi exp − xi AD,ii xi 2 Y N/2 −1 = det O(2π) ( AD,ii ) ! (5.22) (5.23) (5.24) i = (2π)N/2 det O det A (5.25) We now want to find the analogous result for the Grassman integral, Z ! dn θdn θ∗ exp − X θi∗ Bij θj (5.26) i,j We again want to tranform the integration variable and diagonalize B, however we need to be a bit careful as we are working with Grassman numbers. A set of N Grassman variables obey the relations: 1 i ...i θi θi ...θiN N! 1 N 1 2 i1 i2 ...iN θ1 θ2 ...θN = θi1 θi2 ...θiN θ1 θ2 ...θN = (5.27) (5.28) These relations are a consequence of the anticommutating nature of the Grassman variables and are easy to show for N = 3 though I can’t figure out a general proof yet. If we consider a unitary transformation on the Grassman variables: θi0 = Uij θj then, Y θi0 = i 1 i1 i2 ...iN 0 0 θi1 θi2 ...θi0N N! 1 i1 i2 ...iN Ui1 ,i01 θi01 Ui2 ,i02 θi002 ...UiN ,IN0 θi0N N! ! Y 1 i1 i2 ...iN = Ui1 ,i01 Ui2 ,i02 UiN ,IN0 i01 ,i02 ,...,i0N θj N! j Y Y θi0 = det U θj = i j (5.29) (5.30) (5.31) (5.32) 5.2. FREE DIRAC FIELD 45 Thus when changing variables we pick up a determinate, analogous to the earlier situation with the real variables. With this we have, Y dθi∗ i Y dθj = (det U )∗ det U Y j dθi∗ Y i = Y dθi∗ Y i dθj (5.33) j dθj (5.34) j and so the integral is, Z ! dn θdn θ∗ exp − X θi∗ Bij θj Z = ! dn θ0 dn θ0∗ exp − i,j X 0∗ θi BD,ij θj0 (5.35) i,j Z = ! dn θdn θ∗ exp − X θi∗ BD,ii θj (5.36) i = Y BD,ii (5.37) i = det B 5.2 (5.38) Free Dirac Field The Dirac field is given as ψ1 (x) ψ2 (x) ψ(x) = ψ3 (x) ψ4 (x) (5.39) To have Fermi Dirac statistics we make ψa (x) into a complex Grassman number valued field. The free Dirac field is given by LD = ψ̄(i∂/ − m)ψ (5.40) with ψ̄ = ψ † γ 0 and ∂/ = ∂µ γ µ . We want to produce our generating functional. We need to introduce sources. We want something of the form of source multiplied by the Dirac field. We want to add a term for the source that takes the form of a source multiplied by a scalar. A simplest thing one can imagine is η1 (x) η2 (x) η= (5.41) η3 (x) η4 (x) 46 CHAPTER 5. FERMIONS WITH PATH INTEGRALS where this term transforms in the same way as the Dirac spinor. The η source is also a complex Grassman number valued field. Since we also want a Hermitian Lagrangian our source takes the form η̄ψ + ψ̄η (5.42) One can treat η and η̄ as independent as well as ψ, ψ̄ as independent. Our generating functional is Z Z 4 W0 [η, η̄] = DψDψ̄ exp i d x L̄D + η̄ψ + ψ̄η (5.43) Q where Dψ ≡ 4a=1 Dψa = ψ1 ψ2 ψ3 ψ4 . The procedure is just as before. We now complete the square in an analogous way to what we did earlier. We have, Z Z Z d4 k d4 k 0 ikx −ik0 x 4 4 ψ̄(k) (−iγµ (ik µ ) − m) ψ(k) e e d x (...) = d x (2π)4 (2π)4 + η̄(k)ψ(k) + h.c. (5.44) Z d4 k = ψ̄(k) (k/ − m) ψ(k) + η̄(k)ψ(k) + h.c. (5.45) (2π)4 Now we shift the fermion fields: (k/ + m)η − m2 + i η̄(k/ + m) ψ̄ → ψ̄ − 2 k − m2 + i ψ→ψ− k2 (5.46) (5.47) With this we have, η̄(k/ + m)(k/ − m)(k/ + m)η (k 2 − m2 )2 η̄(k/ + m)η = ψ̄(k/ − m)ψ + 2 (k − m2 ) ψ̄(k/ − m)ψ + η̄ψ + h.c. → ψ̄(k/ − m)ψ + So our path integral becomes, Z Z η̄(k/ + m)η 4 W0 [η, η̄] = DψDψ̄ exp i d k ψ̄(k/ − m)ψ + 2 k − m2 Z η̄(k/ + m)η = W0 [0] exp i d4 k 2 k − m2 (5.48) (5.49) (5.50) (5.51) In real space, Z 4 4 W0 [η, η̄] = W0 [0] exp − d x d y η̄(x)S(x − y)η(y) (5.52) 5.3. QED 47 where S is the Green’s function, given by (i∂/x − m14×4 )S(x − y) = iδ 4 (x − y)14×4 (5.53) where 14×4 is the four by four unit matrix. Note that this is really four equations. Solving this equation via Fourier transforms we get Z d4 k i(k/ + m)e−ik·(x−y) S= (5.54) (2π)4 k 2 − m2 + i We now move on to functional differentiation. One can show that 1 δ −1 δ 1 ˆ ... ... W0 (5.55) h0|T ψ̂(x1 )...ψ̄ (y1 )... |0i = W0 [0] i δ η̄(x1 ) i δη(y1 ) η,η̄→0 where δδη̄ ... that we have one functional derivative for each fermionic field being time ordered. The set of fields can be both Grassman fields and scalar fields. We have δ δ † c 0 b exp i ψ̄η = 1 + i(ψ ) (γ ) η (5.56) cb δη a δη a = −i(ψ † )c (γ 0 )cb δ ba (5.57) = −iψ̄a (5.58) ˆ 2 ))|0i is antisymmetric since the η’s are Grassman Further note that h0|T (ψ̂(x1 )ψ(x numbers. 5.3 QED Quantum Electrodynamics(QED) describes both spin 21 objects,“matter” (electrons, protons, muons, ...), and a spin-1 object “light” (photons). The most general Lagrangian is given by 1 L = ψ̄(i∂/ − m)ψ − Fµν F µν + Lint (5.59) 4 The interaction Lagrangian must be Lorentz invariant, and renormalizable. Renormalizability requires that each term must have dimensionality less than or equal to 4. By our Lagrangian we know that [ψ] = 3/2 and [Aµ ] = 1. There are very few allowed terms. We cannot have an odd number of ψ terms since then we won’t have a Lorentz invariant Lagrangian. The only way to couple them to A0 s in a renormalizable way is to have 2 ψ and 1 A. There are two potential bilinears we can use: Aµ ψ̄γ µ ψ Aµ ψ̄γ µ γ 5 ψ (5.60) (5.61) The second term violates parity. Since we know that electromagnetism conserves Parity we have, Lint = eAµ ψ̄γ µ ψ (5.62) 48 CHAPTER 5. FERMIONS WITH PATH INTEGRALS where e is a dimensionless number. It will turn out that e is the electric charge of the quantized ψ fields. One may wonder why the charge has no units. That is a result of using ~ = 1. This can be seen since α≈ 1 e2 Natural Units e2 = −−−−−−−−−−→ 137 4π~c 4π (5.63) √ and hence e = 4πα ≈ 0.3. We now derive the same result in a deeper way. The Dirac Lagrangian, LD has a “global U (1)” symmetry: ψ → eiα ψ (5.64) i.e. LD → LD . There is a Noether current associated with this symmetry given by j µ = ψ̄γ µ ψ (5.65) with ∂µ j µ = 0. Promoting this global symmetry to a local symmetry, “local U (1)”: ψ → eiα(x) ψ (5.66) ∂µ ψ → ∂µ (eiα ψ) = eiα ∂µ ψ + i∂µ αeiα ψ (5.67) LD → LD + i(∂µ α)ψ̄γ µ ψ (5.68) This is no longer a symmetry since and and our Lagrangian is not invariant. The procedure to “fix” this is to introduce a gauge field that has the following transformation: 1 local U (1) Aµ −−−−−−→ Aµ + ∂µ α e We now introduce another term known as the covarient derivative: (5.69) Dµ = ∂µ − ieAµ (5.70) local U (1) Dµ ψ −−−−−−→ eiα Dµ ψ (5.71) and we end with local U (1) ψ̄Dµ ψ −−−−−−→ ψ̄Dµ ψ (invariant under local U (1)) (5.72) Our Dirac Lagrangian becomes / − m)ψ = ψ̄(i∂/ − m)ψ + eAµ ψ̄γ µ ψ L = ψ̄(iD (5.73) Note that if we had Parity violating U (1) then we would have: ψ → eiγ5 α ψ, ψ̄ → ψ̄eiγ5 α (5.74) term we could have added a slightly different covarient derivative and gotten the γ 5 term in our final Lagrangian. [Q 5: Show this!] Our final QED Lagrangian is given by 1 L = ψ̄(i∂/ − m)ψ − Fµν F µν + eAµ ψ̄γ µ ψ 4 (5.75) 5.4. THREE-POINT FUNCTION 5.4 49 Three-point function We get our Feynman rules by looking at different correlation functions. We begin by looking at a 3-pt function (|Ωi is the messy QFT ground state). 1 1 δ 1 δ 1 δ µ hΩ|T ψ̄a (x1 )ψb (x2 )Aµ (x3 ) |Ωi = W [η, η̄, J ] e We [0] i δηa (x1 ) i δ η̄b (x2 ) i δJ µ (x3 ) η→0,J→0 (5.76) where − 14 F 2 Z Z ↑ µ µ µ 4 We [η, η̄, J ] = DψDψ̄DA exp i d x L̄D + L̄g + eAµ ψ̄γ ψ + η̄ψ + ψ̄η + J Aµ (5.77) We almost know how to calculate this. Without the interaction term we could do it explicitly. We expand in powers of e: Z 1 δ 1 δ 1 δ µ 4 µ We [η, η̄, J ] = W0 + ie dx − (γ )cd W0 + O e2 µ i δJ (x) i δηc (x) i δ η̄d (x) (5.78) where we define Z Z 1 4 4 µ ν 4 4 d w d zJ (w)Gµν (w − z)J (z) exp − d w d z η̄(w)S(w − z)η(z) W0 = exp − 2 (5.79) We now calculate this explicitly: Z 1 δ exp [...] = i d4 zSdb (x − z)ηb (z) exp [...] (5.80) i δ η̄d (x) −1 δ 1 δ exp [...] = −Sdc (0) exp [...] i δηc (x) i δ η̄d (x) Z Z 4 − d zSdb (x − z)ηb (z) · (−1) d4 wη̄a (w)Sac (w − x) exp [...] and 1 δ exp [...] = i i δJ µ (x) Z d4 yJ µ (y)Gµν (x − y) exp [...] (5.81) (5.82) which gives, Z 4 4 µ We [η, ¯, J ] = 1 − ie d xd yJ (y)Gµν (x − y) Sdc (0) Z 4 4 ν − d zd wSdb (w − z)ηb (z)η̄a (x)Sac (w − x) γcd W0 µ (5.83) 50 CHAPTER 5. FERMIONS WITH PATH INTEGRALS To calculate the correlation function to leading order we need to take three derivatives of We . This is straight forward and the answer is [Q 6: Do this calculation!] Z 4 ν Sda (x − x1 ) hΩ|T ψ̄a (x1 )ψb (x2 )Aµ (x3 ) |Ωi = ie d xGµν (x3 − x) Sbc (x2 − x)γcd ν + Sba (x2 − x1 )(γ )cd Sdc (x − x) (5.84) We develop the position space Feynman rules given by ψa (x) ψ̄b (y) Aµ (x) Aν (y) → Sab (x − y) → Gµν (x − y) d Z → −ie µ d4 xγcd c In principle we can now work out every correlation function using these Feynman rules. The momentum space Feynman rules are d a b µ ν i(p/ + m)ab p2 − m2 + i i kµ kν →− 2 gµν − (1 − ξ) 2 k + i k → µ → −ieγab momentum + conservation c Note that when the momentum flows opposite to the direction in the fermion propagator we get a negative sign in front of p. We can now calculate the correlation function, a, p1 G̃abµ (p1 , p2 , p3 ) = b, p2 p3 +unconnected diagrams 5.4. THREE-POINT FUNCTION 51 we ignore unconnected diagrams as they don’t contribute to the S matrix. This diagram is i p/2 + m i p/1 + m (−i) gνµ − (1 − ξ) p3,µp2p3,ν 3 da bc (−ie) (γ µ )cd 2 G̃abµ (p1 , p2 , p3 ) = 2 p2 − m2 + i p1 − m2 + i p23 + i (5.85) We know from our experience with scalars that in order to get the S matrix elements through LSZ reduction we need to know the poles of this expression. Note that if you try to solve the three equations: p1 + p2 = p 3 p21 = p22 = m2 p23 = 0 (5.86) (5.87) (5.88) you get a contradiction since squaring the first equation gives p1 · p2 = 0 (5.89) and if we work in the center of mass frame we have p1 = (m, 0), p2 = (E, p) which would imply mE = 0 (5.90) which is impossible since m > 0. Hence it’s impossible to have a 3 point process that conserves energy. The first real S matrix process is a four point process: γ γ e e e However in order to see the structure of the S matrix it is sufficient to consider the 3 point function. p→ Consider an incoming fermion . Fp G̃ → (...) i p/ + m p2 − m2 + i (5.91) LSZ reductions for Fermions tells us that 2 X i h...|...(p, s)i ūs (p) + terms with pole in p2 G̃ = 2 2 p − m + i s=1 (5.92) here the p2 poles are not important. [Q 7: Isn’t this a p2 pole...?] G̃ is of this form because, |(p, s)i = as† (5.93) p |0i 52 CHAPTER 5. FERMIONS WITH PATH INTEGRALS and X d3 k s† s ik·x s s −ik·x √ ψ= ak u (k)e + bk v (k)e (2π)3 2Ek s Z X s† d3 k −ik·x s ik·x s s √ ψ̄ = v̄ (k)e a ū (k)e + b k k (2π)3 2Ek s Z (5.94) (5.95) So we cannot directly P2 get the S matrix from the G̃, but we can use trace technology to simplify our result s=1 us (p)ūs (p) = p/ + m. Using this result we have, G̃ = (...) i P2 s=1 us (p)ūs (p) p2 − m + i (5.96) Comparing with the LSZ formula we can write, h...|...(p, s)i = (...) us (p) (5.97) So for an incoming fermion we see we have a factor of u(p). The derivation for the other an incoming antifermion as well as for the outgoing fermions is similar. Now consider the full photon terms: (−i)(gµν − (1 − ξ)pµ pν /p2 ) ν N p2 + i (5.98) where N ν is just the rest of the object which has no µ dependence. Using LSZ reduction we have for an incoming photon, 2 X i 2 h...|...(p, s)i s∗ G̃µ = 2 µ + (no pole at p = 0) p + i s=1 (5.99) where we have used, Z Aµ = X d3 k −ik·x √ as sµ eik·x + as† s∗ µ e (2π)3 2Ek s (5.100) Using the Ward identity (we discuss this identity more later) we know that pµ N ν = 0 (5.101) This means that the (1 − ξ)pµ pν /p2 term in the incoming and outgoing photon contributions go to zero and we simply have, X −igµν ν i N = − s ∗ (p) h...|...(p, s)i p2 + i p2 + i s µ (5.102) In the fermion case we rearrange the numerator of the propagator using a spin sum. We would like to do the same thing here with gµν and write it in terms of µ ’s. To do this 5.5. e+ e− → µ+ µ− 53 we use a trick that we will prove later on which says that as long as gµν is going to be multiplied by an amplitude we can make the replacement, gµν → − X sµ s∗ ν (5.103) s Since the LSZ formula is true for any s, N ν sµ (p) = h...|...(p, s)i (5.104) Hence we can replace an incoming photon by sµ . We summarize all the rules for incoming and outgoing particles below incoming fermion incoming antifermion outgoing fermion outgoing antifermion incoming photon outgoing photon 5.5 Fp p pF p gp gp p→ = us (p) p→ = v̄ s (p) µ p→ µ p→ p→ = ūs (p) p→ = v s (p) = sµ (p) = s∗ µ (p) e+e− → µ+µ− Now that we have all our Feynman rules we can finally do some calculations. As an example lets consider two electrons turning into two muons. The interaction part of the Lagrangian is Lint = −eAν (ψ̄e γ ν ψ̄e + ψ̄µ γ ν ψµ ) (5.105) The leading order connected diagram is e, p2 µ, p3 γ k = p 1 + p2 e, p1 iM s1 ,s2 ,s3 ,s4 = µ, p4 (−i) kα kβ gαβ − (1 − ξ) 2 k 2 + i k s3 × ūµ (p3 )(−ieγ β )v̄µs4 (p4 ) [v̄es2 (p2 )(−ieγ α )use1 (p1 )] (5.106) 54 CHAPTER 5. FERMIONS WITH PATH INTEGRALS (note that here µ, e are not variables just labels so we know what mass each u and v corresponds to. Consider the term kα [v̄(p2 )γ α u(p1 )] = (−ie)v̄(p2 )k/u(p1 ) k2 = (−ie)v̄(p2 )(p/1 + p/2 )u(p1 ) = (−ie) (v̄(p2 )me u(p1 ) + v̄(p2 )(−me )u(p1 )) =0 (5.107) (5.108) (5.109) (5.110) since (p/ − m)u = 0, (p/ + m)v = 0. Thus the amplitude is indendent of the kα kβ contribution and hence independent of gauge parameter ξ. The amplitude simplifies to iMs1 s2 s3 s4 = i e2 [v̄2 γ α u1 ] [ū3 γα u4 ] s (5.111) where s ≡ (p1 + p2 )2 . Typical experiments have polarized beams. In this case we can average over incoming particle spins, and sum over outgoing particle spins. 2 X 1 |Ms1 ,s2 ,s3 ,s4 |2 ≡ |M|2 (5.112) 2 s ,s ,s ,s 1 2 3 4 In our case we have i h i 1 e4 h α β tr ( p − m )γ ( p + m )γ × tr p + m γ ( p − m )γ |M|2 = / / / / e e µ α µ β 2 1 3 4 4 s2 4 1e = (4) pα2 pβ1 + pα1 pβ2 − g αβ p1 · p2 + m2e (4) 2 4s α β α β αβ 2 × p4 p 3 + p3 p4 − g p3 · p4 + mµ (5.113) (5.114) In the high energy regime (E me , mµ ) we have |M|2 = 8e4 ((p1 · p4 )(p2 · pe ) + (p1 · p3 )(p2 · p4 )) s2 (5.115) but, p1 + p2 = p3 + p 4 (5.116) (p1 − p4 )2 = (p3 − p2 ) p1 · p4 ≈ p 3 · p2 (5.117) (5.118) and since we have in the high energy regime. So after switching the Mandlestam variables the amplitude squared takes the form, |M|2 = If we work in the COM frame: 2e4 2 2 t + u s2 (5.119) 5.6. e− µ− → e− µ− 55 e e θ then p1 p2 p3 p4 = (E, 0, 0, E) = (E, 0, 0, −E) = (E, E sin θ, 0, E cos θ) = (E, −E sin θ, 0, −E cos θ) where s = 4E 2 , t = −2E 2 (1 − cos θ), u = −2E 2 (1 + cos θ). We have the simple result |M|2 = e4 (1 + cos2 θ) (5.120) and dσ |M|2 e4 = = (1 + cos2 θ) dΩ 64π 2 s 32πs The functional dependence is as follows, (5.121) # of Events cos θ 5.6 e−µ− → e−µ− We have found the amplitude and cross sections for e+ e− → µ+ µ− . We now consider elastic e− µ− scattering but now in the non-relativistic regime, p1 → p4 → q = p1 − p4 p2 → p3 → 56 CHAPTER 5. FERMIONS WITH PATH INTEGRALS We can do a similar calculation to last time: iM = ie2 (ū4 γ µ u1 )(ū3 γµ u2 ) t (5.122) In the non-relativistic limit we have p1 p2 p3 p4 = (me + O(p2 ), p) = (mµ , 0) = (mµ + O(p2 ), q) = (me + O(p2 ), p0 ) where q = p − p0 . Hence we have t = (p1 − p4 )2 = −q2 + O(p4 ) The Dirac spinor in the non-relativistic limit is s √ ξ s + O(p) u (p) = m ξs 1 0 1 2 where ξ = ,ξ = . The scalars are: 0 1 √ ū4 γ 0 u1 = m(ξ s4 † ξ s1 + ξ s4 † ξ s1 ) = 2mξ s4 † ξ s1 = 2mδs4 s1 i ū4 γ u1 = O(p) (5.123) (5.124) (5.125) (5.126) Thus to leading order we have iM ≈ −ie2 (2me )(2mµ ) δs4 s1 δs3 s2 q2 (5.127) The cross-section is given by, dσ 1 Ee 1 2pCM e = |M|2 2 dΩ 2Ee 2mµ pe 32π ECM (5.128) We work in the limit of a stationary muon (though the cross-section is Lorentz invariant). In this case pcm ≈ pe and Ecm ≈ mµ . We have, e dσ 1 = |M|2 2 2 dΩ 64π mµ (5.129) 4α2 2 m q4 e (5.130) = where the spin averaging is trivial due to the spin delta functions. In non-relativistic quantum mechanics we’d expect the electron to be moving slowly and the muon to be at rest acting as a source of charge through the potential, V (r) = αr . 5.7. COMPTON SCATTERING 57 0 pe → pe → µ In this case the Born approximation tells us that dσ m2 = e2 Ṽ 2 (q) dΩ 4π 0 where q = p − p. Comparing the two expressions we identify, α2 16π 2 q4 4πα ⇒ Ṽ (q) = 2 q Ṽ 2 (q) = We have (5.131) (5.132) (5.133) 1 d3 q −iq·r 4πα e (5.134) (2π)3 q2 Z α = d cos θdqeiqr cos θ (5.135) π Z 1 2α dq sin(qr) (5.136) =− π q α =− (5.137) r where the integral was done by inserting a regulator and taking the regulator to 0 at the end. We see that in the non-relativistic limit QED repreduces classical electromagnetism as expected. Furthermore, this is the first confirmation that the coupilng in the Lagrangian, e, really represents an electric charge. Up to now we just assumed the most natural interpretation and we finally see that this is indeed the case. Z V (r) = 5.7 Compton Scattering Consider e− γ → e− γ. There are two second order vertices: + 1 Where there is no ambiguity with four-vectors we write, r ≡ |r| and q ≡ |q|. 58 CHAPTER 5. FERMIONS WITH PATH INTEGRALS We have the amplitude Ms = sµ (p)Mµ (5.138) To get the cross-section we need to average over spins: X X ν∗ |Ms |2 = sµ s∗ ν M s (5.139) s The claim is that we can make the replacement: X sµ ∗s ν → −gµν (5.140) s To see why this is consider the photon with the momenta, p = E(1, 0, 0, 1). The Ward identity tells us that pµ Mµ = 0. Hence M0 = M3 (5.141) Since we can choose any basis we wish for the polarization lets consider linear polarization, 1 = (0, 1, 0, 0) 2 = (0, 0, 1, 0) Explicitly we can now calculate X sµ s∗ ν s 0 0 = 0 0 0 1 0 0 0 0 1 0 0 0 0 0 (5.142) 0 1 0 0 0 0 1 0 0 0 0 1 (5.143) This is not quite − gµν −1 0 = 0 0 P But s sµ sν ∗ and gµν never appear on their own. Instead they come as as a product of amplitudes. We have, 2 1 2 2 2 3 2 1 2 2 2 0 M + M + M + M = M + M − gµν Mµ Mν∗ = − (5.144) X 2 2 µ ν∗ µ s∗ = M1 + M2 (5.145) ν M M s Hence due to the Ward Identity we can always make the substituion, X sµ sν ∗ → −gµν s (5.146) 5.8. PROOF OF WARD IDENTITY 5.8 59 Proof of Ward Identity We finally prove the Ward identity and then discuss the intuition behind it. The defining equation for the vector potential without choosing a gauge is given by ∂ν ∂ ν Aµ (x) = 0 (5.147) This equation is solved by the an satz, Z Aµ (x) = d4 p µ (p)e−ip·x (2π)4 (5.148) The gauge transformation of the vector potential under which physics is invarient is, A0µ = Aµ + ∂µ α (5.149) R d4 p −ip·x Plugging in the potential as well as the Fourier transform of α = (2π) α̃(p) we get 4e Z Z −ip·x d4 p d4 p 0 (−ipµ )e−ip·x α̃ (5.150) − e = µ (2π)4 µ (2π)4 0 Now multiplying by e−ip ·x and integrating over d4 x we have Z Z d4 p d4 p 0 0 pµ α̃δ(p − p0 ) − δ(p − p ) = −i µ (2π)3 µ (2π)4 0µ (p) = µ (p) − iα̃(p)pµ (5.151) (5.152) This is the effect of a gauge transformation on the polarization vector. Now in general any amplitude with N gauge bosons will take the form, µ1 µ2 ...µN Mµ1 µ2 ...µN (5.153) If a theory is gauge invariant then this amplitude must be invariant under any gauge transformation. Applying equation 5.152 to some polarization vector i we have µ1 µ2 ...µN Mµ1 µ2 ...µN → µ1 µ2 ...µN Mµ1 µ2 ...µN − iα̃ pµi Mµ1 ,...,µN (5.154) In order for the amplitude to be gauge invariant we must have Ward Identity, pµi Mµ1 ,...,µN = 0 which is just the Ward Identity. We now try to explore how this relationship holds. Consider the following p1 p→ p2 (5.155) 60 CHAPTER 5. FERMIONS WITH PATH INTEGRALS where we have p is off-shell since otherwise it would violate energy conservation. The amplitude is, M = ν ū(p1 )γ ν v(p2 )(−ie) (5.156) | {z } Mν ν Then the sum, pν M , is zero since, pν Mν = ū(p1 )p/v(p2 )(−ie) (5.157) = ū(p1 )(p/1 + p/2 )v(p2 )(−ie) (5.158) = (−ie)ū1 v2 (me − me )ū1 v2 =0 (5.159) (5.160) Now if we are considering a more complicated diagram, × + × where the “ ×” represents some combination of external lines. Then the Ward identity means that these contributions must cancel. See Peskin chapter 7.4 for details. [Q 8: Expand on this section] Chapter 6 Loop Diagrams and UV Divergences in QED 6.1 Superficial Divergences The first thing one thinks about when hearing about loop diagrams is estimating how divergent diagrams are. We use what’s known an “naive power-counting” to find the superficial degree of divergence. For D greater then or equal to zero we expected ∼ ΛD and we expect logarithmic divergence if D = 0. If D < R0 then our diagrams are not d4 p superficially divergent. Each loop contributions a factor of (2π) 4 (4 factors of momenta). Fermion propagators have dimension −1 and photon propagators have dimensions −2 hence (Pi the number of i propagators) D = 4L − Pf − 2Pγ (6.1) One can relate this expression to the number of external lines by (Exercise) 3 D = 4 − Nf − Nγ 2 61 (6.2) 62 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED In reality the power counting is not precise and diagrams may or may not diverge independently of the superficial degree of freedom. Consider the D ≥ 0 diagrams in QED: p pg gpg vpg u gpg g g FpF Dpg E Diagram D D=4 D=3 D=2 Expected # of CT’s Name vacuum energy → unphysical zero!(HW) 3 counterterms( vacuum polarization D=1 zero!(HW) D=0 1 counterterm D=1 2 counterterms electron self-energy D=0 1 counterterm vertex correction Our Lagrangian is 1 L = ψ̄(i∂/ − m0 )ψ − Fµν F µν + e0 Aµ ψ̄γ µ ψ (6.3) 4 √ √ We want to introduce renormalized fields ψ = Z2 ψr and Aµ = Z3 Arµ . We then have p 1 r µν L = Z2 ψ̄r (i∂/ − m0 )ψr − Z3 Fµν Fr + e0 Z2 Z3 Arµ ψ̄r γ µ ψr (6.4) 4 We introduce a physical mass and we have δm = m − m0 : p 1 r µν L = Z2 ψ̄r (i∂/ − m)ψr − Z3 Fµν (6.5) Fr + e0 Z2 Z3 Arµ ψ̄r γ µ ψr + δmZ2 ψ̄r ψr 4 We still need to renormalize the coupling constant. We define the new coupling constant as √ e0 Z2 Z3 e= (6.6) Z1 How we define a coupling is not physical since it’s not a number like mass which has a definition on it’s own. We just need to require that it won’t be divergent. This form turns out to be convenient. With this we have 1 r µν L =ψ̄r (i∂/ − m)ψr − Fµν Fr + eArµ ψ̄r γ µ ψr + 4 1 r µν (Z1 − 1)Arµ ψ̄r γ µ ψr + (Z2 − 1)ψ̄r (i∂/ − m)ψr − (Z3 − 1) Fµν Fr + δmZ2 ψ̄r ψr (6.7) 4 We define δ1 ≡ Z1 − 1 δ2 ≡ Z2 − 1 δ3 ≡ Z3 − 1 δm ≡ (Z2 − 1)(m) − δmZ2 (6.8) 6.2. ELECTRON SELF-ENERGY 63 Our counterterms are LCT = δ1 Arµ ψ̄r γ µ ψr + ψ̄r (iδ2 − δm )ψr − δ3 r µν F F 4 µν r (6.9) Note that from here on we drop the r to denote renormalized fields. Naively we expect the number of counter terms to be given by the sum of all the divergences → 3 + 1 + 2 + 1. However we can only possibly write down 4 terms! We know this theory is renormalizable so the superficial degree of divergence must be overestimating the divergence of this theory. Consider our renormalized propagator gpg µ ν q→ q→ = Πµν (q) = A(q 2 )gµν + B(q 2 )qµ qν (6.10) The Ward identity tells us that q µ Πµν = 0 ⇒ Aq ν + Bq 2 q ν = 0 (6.11) since this must be true for all q µ we have A = −Bq 2 e gpg and hence µ ν q→ q→ = (gµν q 2 − q µ q ν )B(q 2 ) (6.12) (6.13) The Ward identity provides a constraint and shows that we only need 1 CT for the propagator. Similarly one can show in the same way that the four photon diagram is in fact finite and does not need a counterterm. So only need the 4 CT’s. 6.2 Electron Self-Energy Z G̃ab (p) = d4 xe−ix·p hΩ|T ψ̄a (x)ψb (0) |Ωi (6.14) where in general we have ψb (y) but we just choose y = 0. Using Feynman diagrams we can write G̃ = F FFyFF FFyFyFF FFyFFFyFF + +| + {z } +... not 1PI i i i = + (−iΣ(p)) p/ − m0 + i p/ − m0 + i p/ − m0 + i (6.15) (6.16) where Σ(p) is a matrix that’s equal to the sum of all the possible diagrams with two external lines and p is the incoming momentum. 64 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED We define 1 particle irreducible (1PI) graphs to be diagrams that you cannot cut into 2 by cutting a single line. We define Σ̂ = sum of 1PI graphs only, = without external lines FyF FyFyF + + ... (6.17) we have (for brevity we initially suppress the +i in the denominator) i i i i i i + −iΣ̂ + −iΣ̂ −iΣ̂ + ... G̃ = p/ − m0 p/ − m0 p/ − m0 p/ − m0 p/ − m0 p/ − m0 (6.18) = i p/ − m0 ∞ X n=0 Σ̂ 1 p/ − m0 n (6.19) i 1 p/ − m0 1 − Σ̂ p /−m0 ! i p/ − m0 + i 1 = p/ − m0 + i p/ − m0 + i − Σ̂(p) = = i (6.20) (6.21) (6.22) p/ − m0 − Σ̂(p) + i where we have used a geometric series for matrices, ∞ X Ak = (1 − A)−1 (6.23) k=0 for any matrix A with eigenvalues less than 1. To better see what this means, we switch to the “proper form” of the propagator (this is straightforward to check by multiplying the p/ − m + i with its inverse), (p/ − m − Σ(p) + i)−1 = p/ + m + Σ(p) p2 − (m + Σ(p))2 + i (6.24) Hence the physical mass is shifted to the point where we still have a pole in the propagator, p2 = (m0 + Σ̂(p))2 (6.25) or m ≡ m0 + Σ̂(p2 = m2 ). Thus Σ̂ represents the change in mass from the “bare” mass calculated to all orders in perturbation theory. We can also write from 6.22 1 G̃−1 = G̃−1 0 − Σ̂(p) i (6.26) so the two-point function contains, apart from the inverse bare propagator, only 1PI contributions. 6.2. ELECTRON SELF-ENERGY 65 Now we have G̃ = hΩ|ψ̄r (x)ψr (y)|Ωi = Z2 hΩ|ψ̄(x)ψ(y)|Ωi iZ2 = p/ − m0 + i (6.27) (6.28) (6.29) where Z2 is the wave function renormalization factor (this is just the old unrenormalized propagator). By comparing this expression with equation 6.22 we can write p/ − m0 + i − p/ + m0 − i (6.30) Σ̂ = − Z2 dΣ̂ = −Z2−1 + 1 dp/ !−1 dΣ̂ ⇒ Z2 = 1 − dp/ (6.31) (6.32) Thus if we know how to calculate Σ̂ we can calculate both mass renormalization and wavefunction renormalization. Note that we have been a little careless above with our notation. Formally we can write Σ̂αβ = −(pµ γ µ − m0 + i)αβ (Z2−1 − 1) (6.33) ∂ Σ̂αβ = −(Z2−1 − 1)(γ µ )αβ ∂pµ (6.34) 1 ∂ Σ̂ = (Z2 − 1)γ µ i ∂pµ (6.35) so or This is what we really mean when we write ∂ . ∂p / We now explicitly calculate Σ̂ to second order in e. Σ̂(p) = p→ FyF `→ p→ (6.36) 4 d ` µ i(/̀ + m0 ) ν −igµν γ 2 γ 2 4 (2π) ` − m0 + i (` − p)2 + i Z d4 ` µ (`α γ α + m0 ) ν gµν = (−ie)2 γ γ 2 (2π)4 `2 − m0 + i (` − p)2 + i = (−ie)2 Z (6.37) (6.38) Now using (see for example [Griffith(2008)]) γ µ γ α γ µ = −2γ α and γ µ γµ = 4 it’s easy to see that Z i(−2/̀ + 4m0 ) d4 ` 2 (6.39) Σ̂(p) = −e 4 2 (2π) (` − m20 + i) ((` − p)2 + i) 66 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED Note that this expression has a superficial degree of divergence of order 1. The true divergence turns out to be of zeroth order as we will see later due to some terms being odd. To solve this integral and incorporate the divergence we introduce an unphysical mass µ that will regulate the integral. Note that this is not a unique way to regulate. We could have used for example dim-reg. Z d4 ` (−2i/̀ + 4m0 ) 2 (6.40) Σ̂(p) → −e 2 4 2 (2π) (` − m0 + i) ((` − p)2 − µ2 + i) The steps are analogous to our discussion in the Fall of scalar loops diagrams. We introduce Feynman parameters (where A = ((` − p)2 − µ2 + i) and B = `2 − m20 + i) Z 1 dx 1 = (6.41) 2 AB 0 (Ax + B(1 − x)) Our expression becomes Z Z (−2i/̀ + 4m0 ) d4 ` 2 (6.42) Σ̂(p) = −e dx (2π)4 (((` − p)2 − µ2 + i) x + (`2 − m20 + i)(1 − x))2 Z Z d4 ` (−2i/̀ + 4m0 ) 2 = −e dx (2π)4 (`2 − 2`px + p2 x2 − p2 x2 + (p2 − µ2 ) x + (−m20 )(1 − x) + i)2 (6.43) Z Z 4 (−2i/̀ + 4m0 ) d` (6.44) = −e2 dx 4 (2π) ((` − px)2 − ∆ + i)2 (6.45) where ∆ ≡ −p2 x(1 − x) − (µ2 − m20 )x + µ2 . Now define k = ` − px: Z Z d4 k (−2i(k/ + p/x) + 4m0 ) 2 Σ̂ = −e dx (2π)4 (k 2 − ∆ + i)2 (6.46) (6.47) Applying a Wick rotation such that k0 → ik0E and k → kE . We now get (dk0 = idk0E and k0 → ik0E ) Z Z 4 d kE (2(ik/E − p/x) + 4m0 ) 2 Σ̂ = −e dx (6.48) (2π)4 (kE2 − ∆ + i)2 The denominator is even. Thus we can get rid of the top contribution from k/E 1 : Z Z 4 d kE −2(p/x) + 4m0 2 Σ̂ = −e dx (6.49) (2π)4 (kE2 − ∆ + i)2 (6.50) 1 this is where the reduction in the divergence of the diagram occurs 6.2. ELECTRON SELF-ENERGY 67 First since our integral is Wick rotated we are no longer near a pole and we can drop the i. Now taking the integral is a 4D spherical space we have (S3 = 2π 2 ) Z Z dkE kE3 −2(p/x) + 4m0 2 Σ̂ = −e dx (6.51) 2 8π 2 (kE2 − ∆) Z Z −e2 kE3 (6.52) = dx (− p x) + 2m dk / 0 E 2 4π 2 (kE2 − ∆) Z 1 −e2 −Λ2 Λ2 − ∆ = (6.53) dx (−p/x) + 2m0 + log 4π 2 2 Λ2 − ∆2 ∆ where in the last step we introduced a cutoff. The integral is easy enough to do in Mathematica. For simplicity we now assume that Λ2 ∆ then we get the somewhat simpler result, 2 Z α Λ (6.54) Σ̂ = − dx p/x − 2m0 log π ∆ Consider the limit that µ → 0. In this case ∆ → m2 (1 − x) and we have to integrate the logarithm from 0 to 1. This is divergent as logs aren’t defined at 0. This causes the electron normalization to diverge. However, the electron normalization itself isn’t physical. No observables will depend on µ, however we introduce it for convenience so all the terms in our calculations will be well defined. The physical mass is shifted by m = m0 + Σ̂(m) (6.55) We can use m or m0 in evaluating the Σ̂ since the correction will be of order α2 . To see this consider the structure of Σ̂: Σ̂(p/) = p/f (p/2 ) + g(p/2 ) (6.56) Setting p/ = m and expanding f, p, m we have: Σ̂(p/ = m) = m0 + δm(1) + ... f (0) + αf (1) + ... + g (0) + αg (1) + ... = 0 (6.57) The zero’th order result is just the propagator evaluated at p/ = m0 : f (0) = p2 1 , − m20 + i g (0) = − p2 m0 − m20 + i (6.58) and trivially satisfies the condition. To first order in α we have, δm(1) f (0) + m0 f (1) + g (1) = 0 (6.59) Inside f (1) and g (1) there are contributions proportional to m and m2 . However, when to this order we drop any additional factors of α which arise from inserting m0 + αδm(1) . Thus we can just use m ≈ m0 to first order. 68 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED We then have the rather simple result, Z α Λ2 Σ̂ ≈ m0 dx (2 − x) log π −m20 x2 (6.60) This integral can be evaluated in Mathematica to be Σ̂ ≈ Λ2 3α log 2 4π m0 (6.61) So we finally have the mass shift, m = m0 + Σ̂(m0 ) = m0 Λ2 3α log 2 1+ 4π m0 (6.62) There are two approaches towards renormalization. One interpretation is that it is just mathematical trickery that is used in order to get finite answers. This point of view is fine and it certainly works however there is a more modern interpretation which says that there is some physics in renormalization. Suppose we are evaluating some loop diagram. Our Feynman rules say to integrate over some loop momentum. If we work in position space we know the relation between momenta and wavelength. Large momenta correspond to short wavelengths. At very high loop momenta it corresponds to fluctuations of the field with short wavelengths. With every theory there is some scale in which we know the theory works. For example for all we know the SM works well until about the energies that we have gotten to so far. However when we do loop integrals we are told to integrate all the way to infinity (all possible virtual particle momenta are allowed). That means that we are extrapolating to energy scales further then we know that the theory works. With this point of view, there probably means that this extrapolation has to fail at some energy. This is analogous to blackbody radiation problem in quantum mechanics. Physicists tried to extrapolate their understanding of blackbody radiation to regimes where they didn’t know it held. The suspicion about these divergences in QFT is that they are there because we are missing some essential pieces of physics that occur at some shorter length scale that we are unable to probe. Putting in a cutoff scale, Λ, is just some crude model for what happens at some short length scale. Another way to model how a theory behaves at small length scales is to switch from field theory to string theory. Depending on the particle high energy theory you must use a different regulator. Different regulators are essentially different toy models for how to model the effects at higher energy scale. The freedom of choice of regulator is a representation of our ignorance of high energy physics. In particular since we know that gravity cannot be explained by QFTs we believe that the highest Λ can be is at the Planck scale (though it is likely much earlier). Hence Λ is some scale that is of the Planck scale or less. Renormalizability tells us that the details of the high energy physics don’t effect the effects at the low energy regime. By definition it means that the dependence of the cutoff can be absorbed into the CT’s and won’t effect physics quantities. The cutoff scale is taken is some sense to be a real physical energy scale. The interpretation of a formula such as 6.62 is an observer at a low energy will see m ≈ m0 however at a large energy there are going to be corrections. 6.3. CLASSICAL RENORMALIZATION AND REGULATORS 6.3 69 Classical Renormalization and Regulators Consider an electron. It has some electric field e Einstein told us that energy is equivalent to mass hence the electric field must change the mass of the particle. This correction is what we calculated using equation 6.62. Classically one can go ahead to calculate this same result. |E|2 δm = d r 8π Z ∞ e 2 4π 2 r dr 2 = 8π 0 r Z ∞ dr ∝ e2 r2 0 →∞ Z 3 (6.63) (6.64) (6.65) (6.66) Classically this energy doesn’t converge!The problem occurred since we extrapolated classical EM to a length scale in which it was no longer valid. Classical EM is a low energy effective theory. While it does a good job at describing the physics at low energies (large length scales) it fails at high energies. Alternatively one can introduce a length cutoff (analogous to the energy cutoff, Λ that we are used to in QFT: 2 Z ∞ δm =∝ e r0 ∝ e2 r0 dr r2 (6.67) (6.68) This quantity is no longer infinite. One could ask why this length scale was required since to our current understanding the electron truly is a point charge. The problem is QM. In QFT the vacuum is not static, but electron positron pairs pop in and out of the vacuum. The vacuum of QED is filled with these pairs: 70 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED + - + + + + + - These pairs can appear as long as they disappear in a timescale as determined by the uncertainty principle. Normally we don’t care about them, since they average to zero (equally likely to be oriented in all direction). However if you have a real electron then they are more likely to orient themselves along the electric field and screen the field. As you approach the electron of order the inverse electron mass these effects become important. The wavelength that such effects will occur is the Compton wavelength λ= 1 ≈ rc me (6.69) is known as the Compton wavelength. We identify r0 with the Compton wavelength and we get δm ∝ e2 me (6.70) This is starting to look very similar to our equation that we derived with QFT except with the Logarithmic divergence. Knowing the higher energy theory of QED told us the exact form of the correction. In essence our QED calculation is just an estimate that will be fixed when more physics is added. One can go ahead and renormalize classical EM if one wanted as well. 1 , m0 ≈ 511keV , Λ ≤ MP lanck ∼ Consider our correction term in equation 6.62. α = 137 19 10 GeV . This makes the correction 3α Λ2 log 2 ≈ 0.2 4π m0 (6.71) So even with having this huge cutoff scale the correction is still not large. Note such an argument doesn’t work using the φ4 theory. In the end we don’t really care since we renormalize our theories anyways. When picking a regulator it’s important to have it respect the symmetry of your problem, since its possible that the high energy theory does respect the theory (of course it is also possible that it won’t break the symmetry but you don’t want to remove the other possibility). As an example recall our derivation of the photon terms. We were not allowed to write down a mass term: 1 2 m Aµ Aµ 2 since we wanted gauge invariance. When finding the photon propagator: q→ q→ ∝ gµν q 2 − q µ q ν Π(q 2 ) gpg µ ν (6.72) (6.73) 6.4. DIMENSIONAL REGULARIZATION 71 we would in general renormalize the mass and create a mass. However experimentally we know the photon is massless to a high precision. We want to somehow ensure that the photon will remain massless at this high energy scale. We want to introduce a regulator that respects gauge invariance. The cutoff is not a gauge invariant cutoff (this is not obvious at this point). The preferred regulator in QED is through dimensional regularization or “dim reg”. 6.4 Dimensional Regularization Using dim reg we now consider the integral we found earlier with a hard cutoff Z γ µ (/̀ + m0 )γµ dd ` 2 Σ̂(p) = e (2π)d (`2 − m20 + i) ((` − p)2 − m20 + i) (6.74) Note that we have changed the dimension of each object in this expression from 4 → d. It’s not clear what the γ matrices even mean in more then 4 dimensions. A discussion in Peskin and Schroeder tells us that γ µ γµ = d, γ µ γ ν γν = −(d − 2)γ ν . These expression are analytic continuations of the expressions of integer d. Z Z 1 dd kE − d2 − 1 xp/ + d2 m0 2 dx Σ̂(p) = 2e (6.75) d(2π)d (kE2 + ∆)2 0 2−d/2 Z 1 1 2e2 d d Γ(2 − d/2) dx m − − 1 x p = (6.76) / 0 (4π)d/2 2 2 ∆ 0 In 4 dimensions this expression is infinite because of the Γ term. However in 4 − dimensions we have Z 1 Z 1 2 e2 Σ̂(p) = 2 dx xp/ − 2m0 − γ + log 4π − log ∆ + dx xp/ − m0 + O() 8π 0 0 (6.77) where γ is the Euler constant and equal to 0.566.... 6.5 Vacuum Polarization Consider G̃µν (q) = = gpg g gcg gcgcg q→ q→ + + + ... qµ qν i = − 2 gµν − 2 q q i qµ qα i qβ qν + − 2 gµα − 2 (iΠαβ ) − 2 gβν − 2 + ... q q q q (6.78) (6.79) (6.80) 72 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED where we use the Landau gauge for the propagator. We define Π= c cgc + + ... (6.81) and Π̂ = X 1PI graphs (6.82) Note that this is completely analogous to Σ and Σ̂ from earlier. Then G̃µν i qµ q ν i qµ qα qβ qν αβ gµν − 2 + − 2 gµα − 2 iΠ̂ − 2 gβν − 2 q q q q q αβ σρ + (...)µα iΠ̂ (...)βσ iΠ̂ (...)ρν + ... (6.83) i =− 2 q The Ward identity says that (see equation 6.13) Π̂αβ qα q β = q gαβ − 2 Π(q 2 ) q 2 (6.84) which implies q µ qα qµ qα qαqβ αβ 2 αβ Π̂ = q gµα − 2 gµα − 2 g − 2 Π(q 2 ) q q q qµ q β 1 2 β β β = q gµ − 2 qµ q + qµ q + 2 Π(q 2 ) q q β qµ q Π(q 2 ) = q 2 gµβ − 2 q (6.85) (6.86) (6.87) and hence G̃µν qµ qν qµ q β qβ qν β gµν − 2 + gµ − 2 gβν − 2 (−i)(i)Π(q 2 ) q q q 2 + (...) (...) (...) −iΠ(q 2 ) + ... (6.88) i =− 2 q We define Aαβ ≡ gαβ − G̃µν qα qβ . q2 With this we can write as o −i n 2 αβ β 2 2 = 2 Aµν + Aµ Aβν Π(q ) + Aµα A Aβν Π(q ) + ... q (6.89) but qµ q β qβ qν qµ qν β gµ − 2 gβν − 2 = gµν − 2 q q q (6.90) 6.5. VACUUM POLARIZATION 73 or equivalently Aµβ Aβν = Aµν . This gives G̃µν −i Π(q 2 ) 2 2 = 2 Aµν + Aµν 2 + Aµν Π(q ) + ... q q −1 −iAµν 2 1 − Π(q )) = q2 −i qµ qν 1 = 2 gµν − 2 q q 1 − Π(q 2 ) (6.91) (6.92) (6.93) (6.94) So we have our old propagator with an extra factor, since the photon propagator is qµ qν −i gµν − 2 (6.95) q2 q Note that here the Π̂ doesn’t come additive to the q 2 value as it did for the massive fields. The pole is still at q 2 = 0 ⇒ mγ = 0. One can trace the reason that this happened to the Ward identity (and hence gauge invariance). The only renormalization comes about in the wavefunction renormalization: q µ qν i (6.96) G̃µν = − 2 gµν − 2 Z3 q q or equivalently in Feynman gauge: G̃µν = − igµν Z3 q2 (6.97) The amplitude we want is, ←` −iΠ̂µν = q→ q→ `+q → µν − iΠ̂ (q) = (−1) (−ie) 2 Z d4 ` Tr γ µ −/̀ + m γ ν /̀ + /q + m (2π)4 (`2 − m2 ) (` + q)2 − m2 (6.98) The most divergent part is (using a hard cutoff) Z d4 ` (a`2 gµν + b`µ `ν ) ∼ gµν Λ2 `4 (6.99) 74 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED However this object doesn’t obey q µ Π̂µν = 0. Hence this is not true at O(Λ2 )! To solve this integral in a gauge invariant way we use dim reg. We can greatly simplify the amount of work that we need to do by identifying that the final answer must be in the form, Π̂µν = (q 2 g µν − q µ q ν )Π(q 2 ) (6.100) Thus we just need to calculate either the q µ q ν or the g µν part and just infer the other part. As we will see it will be easier to calculate the q µ q ν contribution. The denomenator is: Z 1 1 1 (6.101) = dx 2 2 2 2 2 2 ` − m + i (` + q) − m + i (` + qx) − ∆ + i where ∆ ≡ −q 2 x (1 − x) + m2 We now shift the integration variable and conside the trace: Tr [...] → Tr −/q + /̀ + /qx + m γν −/̀ − /qx + m γµ = −Tr /̀ + /q (x − 1) γν /̀ + /qx γµ + m2 Tr [γµ γν ] = −Tr /̀γν /̀γµ − Tr /q (x − 1) γν /qxγµ + m2 Tr [γµ γν ] + linear in ` = 4 2`µ `ν − `2 g µν + 4x (1 − x) 2q µ q ν − q 2 g µν (6.102) (6.103) (6.104) (6.105) (6.106) ∆ only depends on q 2 so there is only one term that has a q µ q ν contribution. We work out this term and infer the value of the rest of the terms. We need the integral, Z i 2 1 d4 ` = (1 + log 4π) − γ (1 − log ∆) (6.107) (2π)4 [`2 − ∆]2 (4π)2 i ∆ 2 = − log −γ (6.108) 16π 2 4π Thus Z e2 2 ∆ Π̂µν = −q q − 2 dx2x (1 − x) − log −γ + (...) g µν 4π 4π Z 2α 2 ∆ Π(q 2 ) = − dx (1 − x) − log −γ π 4π µ ν ⇒ (6.109) (6.110) Recall that earlier we showed that for an electron scattering off a muon in the nonrelativistic limit given by the amplitude went as iM ∼ e2 . q2 The Born approximation tells us that hp|iT |pi ∼ Ṽ (p − p0 ) (6.111) 6.5. VACUUM POLARIZATION 75 We had V = e2 e2 → Ṽ ∼ r |q|2 (6.112) what we really need to do is e2 Z 0 3 (we need to have the renormalized propagator for the photon). This results in iM ∼ |q| 2 √ using instead of the bare charge we have e = e0 Z3 . At higher energies we need to include the following correction as well: ! e20 Z3 1 − Π̂(0) iM ∼ (6.113) |q|2 1 − Π̂(−q2 ) Going through the calculation one can show α α e−2mr + ... V (r) = 1+ √ r 4 π (mr)3/2 (6.114) ++- The interaction gets stronger then the Coulomb expectation as the charges get close. This goes back to why this effect is called vacuum polarization. Imagine the muon: ++- ++- ++- Hence at low energy what you see is the screened potential. This is the effect of having e → e20 Z3 . As you start throwing more energetic electrons toward the muon, then you get the electron very close the muon. You can get rid of some of the screening effect and the stronger the field will be. You’re interaction with the muon will not be determined by e2 but more by something that’s more close to the bare charge. Eventually when you are very close the center of the muon you will be able to measure Z3 . This is the concept behind the running coupling constant. This effect is observed and well tested. 2 76 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED 6.6 Vertex Function Consider the vertex function which represents the following diagram p0 q ≡ p0 − p 0 G̃µ (p, p ) = µ p in the limit of p2 → m2e , p02 → m2e but q has to stay off shell by energy momentum conservation. Now recall that P i(p/0 + m) i 2s=1 us (p0 )ūs (p0 ) i = 02 = (6.115) p/0 − m + i p − m2 + i p02 − m2 + i so we have a P s0 0 us (p0 )ūs (p0 ) ν u (p)ūs (p) 0 G̃µ = 02 Γ (p, p ) 2 Dµν (q) p − m2 + i p − m2 + i P (6.116) The interesting object is Γν . To leading order we have just a single vertex: → Γν = −ieγ ν (in the terminology we will introduce shortly, F1 = 1, F2 = 0). Next to leading order we have the diagram: q− p → -`−q q→ `−p↓ p → `% ν 3 Γ = (−ie) Z /̀ + m (−igρσ) dd ` ρ i(/̀ + /q + m) ν i γ γ 2 γσ d 2 2 2 (2π) (` + q) − m + i ` − m + i (` − p)2 + i (6.117) 6.6. VERTEX FUNCTION 77 We won’t solve this explicitly however we consider the Lorentz structure. The only nontrivial scalar in this problem is q 2 . From the fact it’s a four vector we already know that we will have something of the form, Γν = γ ν A(q 2 ) + (p + p0 )ν B(q 2 ) + (p − p0 )ν C(q 2 ) (6.118) In general B and C can be matrices as well however since mu(p) = p/u(p) we can write the expressions in terms of ordinary numbers. From the Ward identity we know that q µ G̃µ = 0 (6.119) and so, ū(p0 )qν Γν u(p) = 0 (6.120) z }| { 0 2 0 2 ū(p ) /q A(q ) + q · (p + p ) B(q ) + q · (p − p ) C(q ) u(p) = 0 | {z } |{z} (6.121) =p2 −p02 =0 0 2 q 2 6=0 =ū(p)(p / −p /0 )u(p0 )=0 where we have used the fact that q ≡ p0 − p and hence we have C = 0. Now we use the Gordon identity: 0 (p + p)µ iσ µν qν 0 µ 0 ū(p )γ u(p) = ū(p ) + u(p) (6.122) 2m 2m where σ µν = 2i [γ µ , γ ν ]. We quickly prove this identity below. Consider 1 µ ν (γ γ − γ ν γ µ ) pν 2 1 = −pµ − 2γ µ p/ − 2g µν pν 2 µ = −p − γ µ p/ − pµ −pµ + iσ µν pν = −pµ − µ = −γ p/ (6.123) (6.124) (6.125) (6.126) Furthermore, 1 p0µ + iσ µν p0ν = p0µ + p0ν (γ ν γ µ − γ µ γ ν ) 2 1 = p0µ + 2p/0 γ µ − 2p0µ 2 0 µ = p/ γ (6.127) (6.128) (6.129) Subtracting these two equations gives µ (p0 + p) + iσ µν (p0 − p)ν = p/0 γ µ + γ µ p/ 0 µ ūp0 (p + p) + iσ µν (p0 − p)ν up = ūp0 mγ µ up + ūp0 γ µ mup 0 0 µ (p + p)µ 0 µ µν (p − p) ū(p )γ u(p) = −ūp0 up + iσ 2m 2m (6.130) (6.131) (6.132) 78 CHAPTER 6. LOOP DIAGRAMS AND UV DIVERGENCES IN QED Using this identity we see that iσ µν qµ Γ (p, p ) = (−ie) γ F1 (q ) + F2 (q 2 ) 2m 0 ν ν 2 (6.133) where the Fi (q 2 ) are known as form factors. To lowest order we have no loop and F1 (q 2 ) = 1 and F2 (q 2 ) = 0. To second order they are given by [Q 9: Show this] Z 1 e2 Γ(2 − d/2) (2 − )2 F1 = dxdydzδ(x + y + z − 1) + (4π)d/2 0 ∆2−d/2 2 Γ(3 − d/2) 2 2 2 2 q [2(1 − x)(1 − y) − xy] + m 2(1 − 4z + z ) − (1 − z) ∆3−d/2 (6.134) where ∆ = (1 + z)2 m2 − xyq 2 + zµ2 and = 4 − d as well as α F2 = 2π Z 0 1 2m2 z(1 − z) dxdydzδ(x + y + z − 1) 2 ∆ µ →0 (6.135) α where F2 (q 2 → 0) = 2π . Note that the UV divergences are completely hidden inside F1 [Q 10: Is this true to all orders?] 6.6.1 Physics of F2 p0 → Consider the semi classical picture of a fermion in an EM field p→ EM field Lint = eµ ψ̄γ µ ψ (6.136) To leading order we have 0 hp0 , s0 |iT |p, si = ūs (p0 )(−ieγ µ )us (p0 )õ (p − p0 ) and the Feynman rule is, q × → −ieγ µ õ (q) (6.137) 6.6. VERTEX FUNCTION 79 In non relativistic electron scattering we have (for a scalar Aµ = (φ((x)), 0) ⇒ potential, s √ ξ õ = (φ̃(q), 0)). Recall that in the Weyl basis us = 2m ξs hp0 , s0 |iT |p, si = −ie ū(p0 )γ 0 u(p)φ̃(q) = −ie(2m)δss0 φ̃(q) (6.138) If we have a vector potential Aµ = (0, A) with B = ∇ × A then hp0 , s0 |iT |p, si = −ieūs0 (p0 )γ i us (p)Ãi (q) e s0† i s = −i(2m) ξ σ ξ B̃i (q) m 0 σi i [Q 11: I don’t see how we got from A to B.] (recall that γ = ). σi 0 Now in the quantum mechanics Born approxiamtation we have, hp0 , s0 |iT |p, si ∼ Ṽ (q) (6.139) (6.140) (6.141) This equation shows that the potential is given by V = me Ŝ · B = µ̂ · B. From none relativistic QM we know that µ = g 2m S, where g is the Landé g factor. Hence to leading order it was found that g = 2. See Peskin and Schroeder pg 187-188 for more details. Now consider the next to leading order expression iσ µν qν µ = −ieū(p ) γ F1 (0) + F2 (0) u(p)õ (q) 2m 0 × The scalar field is V = eF1 (0) · φ(x). With the vector field one can show that you get the same structure we found before, V ∝ S · B with a different g factor g = 2 + 2F2 (0) = 2 + α π (6.142) This was the first radiative correction that was predicted and confirmed by experiment. Chapter 7 Renormalized Perturbation Theory 7.1 φ4 Theory Recall φ4 theory: 1 1 1 L = (∂µ φ)2 − m20 φ2 − λ0 φ4 2 2 4! (7.1) where G̃(p) = p2 iZ + (no poles) − m2 + i Define new renormalized fields as φ̃r = Z −1/2 φ which implies that G̃r (p) = and we have L= (7.2) i p2 −m2 +i Z Z Z2 (∂µ φr )2 − m20 φ2r − λ0 φ4r 2 2 4! + ... (7.3) Experimentally we want to measure the mass, m and the coupling, λ. λ can be measured from φφ → φφ at p → 0. More precisely we can measure iM4 (s = 4m2 , u = t = 0) = λ (7.4) Notice that when we do this measurement we don’t know about what’s a loop diagram, what’s tree level. This term includes all diagrams. We want to rewrite our Lagrangian in terms of these physical parameters: 1 1 1 1 1 L = (∂µ φr )2 − m2 φ2r − λφ4r + δz (∂µ φ2 )2 − δµ φ2r − δλ φ4r 2 2 4! 2 2 4! where δz = Z − 1, δm = Zm20 − m2 , δλ = Z 2 λ0 − λ. Our new Feynman rules are 80 (7.5) 7.2. QED 81 = i p2 − m2 + i = −iλ × = i(p2 δz − δm) × = −iδλ we impose our renormalization conditions: −1 G̃2r (p2 ) 2 2=0 p =m d 2 2 −1 G̃r (p ) 2 2 = −i dp2 (7.6) (7.7) p =m At first order these conditions are trivially satisfied. At higher orders these conditions give us our values of δz , δm . 7.2 QED The QED Lagrangian in terms of the renormalized parameters is, 1 L = ψ̄r (i∂/ − m)ψr − eψ̄r γ µ ψr Ar,µ − (Fr,µν )2 4 1 + ψ̄r iδ2 ∂/ − δm ψr − eδ1 ψ̄1 γ µ ψr Ar,µ − δ3 (Fr,µν )2 4 (7.8) The original theory was gauge invariant so we better maintain that gauge invariance. The counter-terms must also be gauge invariant. The only way that we happen is if δ1 = δ2 . To see this consider part of QED Lagrangian (the rest is trivialy gauge invariant on its own), ∆L = δ2 ψ̄i∂/ψ + ieδ1 ψ̄1 γ µ ψAµ (7.9) Now consider a gauge shift Aµ → Aµ − 1e ∂µ α(x), ψ → e−iα : δL = −δ2 ψ̄iγ µ i∂µ αψ − iδ1 ψ̄1 γ µ ψ∂µ α = (δ2 − δ1 )ψ̄γ µ ψ∂µ α (7.10) (7.11) 82 CHAPTER 7. RENORMALIZED PERTURBATION THEORY In order for this to be gauge invariant we require δ1 = δ2 . This highly non-trivial result implies that the renormalization of the QED coupling is equal to the renormalization of the fermion wavefunction renormalization! This implies that Z1 = Z2 and the renormalization couplings is simplify given by (see Eq. 6.6), e = e0 p Z3 (7.12) We have the Feynman rules → i p /−m+i → −igµν q 2 +i × → i(p/δ2 − δm ) × −i(g µν q 2 − q µ q ν )δ3 −ieγ µ The renormalization conditions are d dp / d dq 2 −1 =0 p /=m −1 −1 = −i p /=m = −igµν q 2 =0 = −ieγµ The first two conditions provide us with δz , δm , the third conditions gives δ3 and the last condition gives δ1 . We now consider the two point function at 1-loop. We have already derived this expression (the amputated version): = i g µν q 2 − q µ q ν Π(q 2 ) (7.13) gcg 7.2. QED 83 We have gpg g gcg gxg = + + + ... = − igµν 1 ˆ 2) + i 1 − Π(q q2 (7.14) P where Π̂ = 1PI diagrams. Our condition is that Π̂(q 2 = 0) = 0. We have (the amputated version) + = i g µν q 2 − q µ q ν Πloop (q 2 ) + δ3 (7.15) gcg gxg and hence we know that δ3 = −Πloop (q 2 = 0) (calculated last lecture) (7.16) Chapter 8 Running Couplings and Renormalization Group 8.1 “Large Logs” Consider λφ4 theory. The phenomena happens in almost all QFT’s however it is convenient and easier to√work with φ4 theory. We consider 2 → 2 scattering. We will work at a COM energy of s = µ m. We will basically be computing Green’s function like this: s channel + + + u channel + = t channel + ... + The first diagram is trivial and given by λ. Based on dimensional analysis we know that the second diagram is λ2 Λ non-log Γ=c log √ + + δλ (8.1) Λ-ind terms 16π 2 s Λ λ2 δλ is given when Γ = 0. When s = 4m2 ⇒ δλ = −c 16π log m + ... . Hence Γ is given by 2 λ2 Λ Λ non-log log √ − log Γ=c + (8.2) Λ-ind terms 16π 2 m s λ2 m non-log =c log √ + (8.3) Λ-ind terms 16π 2 s (8.4) Going to higher order terms we have higher order powers of these logs: 2 λ2 m λ3 m 0 non-log Γ=c log √ + +c log √ + ... 16π 2 (16π 2 )2 s Λ-ind terms s 84 (8.5) 8.1. “LARGE LOGS” 85 √ Our perturbation theory is not a series in λ but it’s a series in λ log ms . This is an issue since you lose control of your calculation at sufficiently high energies. This issue can be avoided by organizing your perturbation theory in a better way. We define √ λ̃ ≡ i s=µ We impose a different renormalization condition given by δλ : Γ = 0 when s = µ2 ⇒ δλ = λ2 log Λµ and we have −c 16π 2 λ2 Γ=c 16π 2 µ log √ + non-log Λ-ind terms s (8.6) Instead of on shell renormalization condition you can impose your renormalization condition at any scale you want. In the end your results do not depend on the scale that you renormalize however that’s only true for the sum of the series. Since we always truncate our series, the error made by that truncation depends on the scale that you chose for your renormalization scheme. Our λ̃(µ) is called the running coupling. Type of PT Bare PT Renormalized PT (on-shell) Off-shell Renormalized PT Coupling λ0 → “bare coupling” λ → “physical coupling” λ̃(µ) → “running coupling” Notes doesn’t include screening (scat√ tering at s ≈ Λ) includes screening (scattering at √ s ≈ 2m ) √ partial screening 2m < s < Λ o screening The “Renormalization Group evolution curve” is shown below 86 CHAPTER 8. RUNNING COUPLINGS AND RENORMALIZATION GROUP λ̃(µ) λ0 λ µ Λ Now consider the Lagrangian: L= 1 1 1 1 1 1 (∂µ φ)2 − m̃(µ)2 φ2 − λ̃(µ)φ4 + δZ (∂µ φ)2 − δm φ2 − δλ φ4 2 2 4! 2 2 4! (8.7) where as before we have δZ = Z − 1, δm = Zm20 − m̃2 , δλ = Z 2 λ0 − λ̃ Before we had a certain scale and that was m. now our scale is µ instead. Before we go ahead and start calculating. We try to think what our δ ’s will depend on : (8.8) δi = δi m̃, λ̃, Λ, µ We can invert our formulas for δi to get λ̃ = λ̃ (m0 , λ0 , µ, Λ) m̃ = m̃ (m0 , λ0 , µ, Λ) (8.9) (8.10) Z̃ = Z(m0 , λ0 , µ, Λ) (8.11) We fix m0 , λ0 , Λ (UV physics). We define the Beta functions as ∂ λ̃ , β≡µ ∂µ m0 ,λ0 ,Λ fixed ∂ m̃ βm ≡ µ , ∂µ 1 ∂Z γ≡ µ 2Z ∂µ m0 ,λ0 ,Λ fixed where for historical reasons the last expression has a different name (the γ function). Note that if you use a different regulator you would keep that fixed instead of Λ. To 1 loop in φ4 theory one can show that (assuming µ m) β(λ̃) ≡ µ ∂ λ̃ 3λ̃2 = d log µ 16π 2 (8.12) 8.1. “LARGE LOGS” 87 This is called the RG equation (RGE). λ̃(Λ) ≡ λ0 ⇒ λ0 Z Λ dλ̃ 3 = d log µ 2 16π 2 µ=µ λ̃(µ) λ̃ 3 1 1 Λ = − log 2 16π µ λ̃(µ) λ0 Z (8.13) (8.14) 3λ0 Λ 1 + 16π 2 log µ 1 = λ0 λ̃(µ) λ0 λ̃(µ) = 3λ0 Λ 1 + 16π 2 log µ (8.15) (8.16) (8.17) This is the plot we showed earlier. We don’t need to use λ0 we could use the non-relativistic coupling to estimate the coupling at a different scale: λ̃(µ) = λN R ⇒ λ̃(µ) = 3λNλNRR µ . The large log will no 1+ 16π 2 log µ0 longer be in the PT. Note as µ → 0, λ → 0. Hence at arbitrarily low energies the particle is free. These kinds of theories are called IR − f ree theories. It turns out that in practice the running coupling does not run to zero. Instead when the coupling gets close to m, it begins to level off. We will not show this. This turns out to be due to particles having a nonzero mass. Note further that the coupling blows up at log 16π 2 Λ =− µ 3λ0 16π 2 ΛL = µ exp − 3λ(λ0 )e (8.18) (8.19) This is called the “Landau pole”. The coupling becomes large at some scale. At this point perturbation theory no longer makes sense. The theory is strongly coupled. We now develop the formalism required to calculate these β functions. Consider the following via CT z}|{ hΩ|T (φr (x1 )...φr (xn )) |Ωi = G(n) (x , ..., x ; λ̃, m̃, Λ, µ) 1 n r (8.20) we also have (n) hΩ|T (φ(x1 )...φ(xn )) |Ωi = Gb (x1 , ..., xn ; λ0 , m0 , Λ) (8.21) 88 CHAPTER 8. RUNNING COUPLINGS AND RENORMALIZATION GROUP (n) (n) (n) φr = Z −1/2 φ , Gb = Z n/2 Gr . But Gb is clearly independent of µ (the unrenormalized propagator doesn’t depend on what renormalization scale you use). Hence we know that d (n) G =0 dµ b d µ Z n/2 G(n) =0 r dµ ! ∂ λ̃ ∂ ∂ m̃ ∂ ∂ n n/2 ∂Z µ Z n/2 + Z n/2 + Z n/2 + Z G(n) r = 0 ∂µ ∂ λ̃ ∂µ ∂ m̃ ∂µ 2Z ∂µ ∂ ∂ ∂ βλ + βm + + nγ G(n) r = 0 ∂ m̃ ∂µ ∂ λ̃ µ (8.22) (8.23) (8.24) (8.25) (8.26) where in the last step we used the definitions of β and γ functions as well as divided by Z n/2 . This is called the Callan-Symanzik equation. It applies very generally and not just to 4 λφ . One can show that to one loop one doesn’t need to worry about the βm . Conceptually it is the same as the other functions however it is more convenient to ignore this term. In other words we take m̃ → 0. (2) (4) We consider 1-loop calculations: Gr and Gr . G(2) r f p→ = + c fff p→ p→ | 0, fxf p→ + δZ {z by construction p→ + O(λ2 ) (8.27) } Our one loop diagram gives the CT. To first order we have, Z 1 d4 ` 2 Π(p ) = λ = λN Λ2 4 2 (2π) ` + i (8.28) where N is a number. Note that this integral is independent of p. Our renormalization condition is Π(µ2 ) + δZ = 0, which implies that δZ = −Π(µ2 ). To first order we have G(2) r = p2 i + i (8.29) which is independent of λ̃, m̃, an µ. Thus the Callan-Symanzik equation says that in λφ4 the Gamma function is zero to second order. i.e. γ = 0 + O(λ2 ). Recall that our four point function was given by s channel + + + u channel + = t channel + ... + G̃(4) r = = (8.30) 4 Y i=1 i 2 pi + i ! 2 −iλ̃ + −iλ̃ (V (s) + V (t) + V (u)) − iδZ + O(λ̃3 ) (8.31) 8.1. “LARGE LOGS” 89 Consider the second loop: p2 p4 p3 p1 It contribution to the vertex Z 1 V (s) = 2 Z 1 = 2 Z 1 = 2 is dd ` 1 1 (2π)d `2 (` + p)2 p=p1 +p2 Z dd ` 1 dx d 2 (2π) (x(` + 2p` + p2 ) + (1 − x)`2 )2 Z dd ` 1 dx (2π)d (`2 + 2`px + p2 x2 + ∆)2 (8.32) (8.33) (8.34) where ∆ ≡ −p2 x2 + p2 x. Z Z 1 dd ` 1 = dx d 2 (2π) ((` + px)2 + ∆)2 Z Z 1 dd ` 1 = dx 2 (2π)d (`2 + ∆)2 Using the known integral we have Z Γ(2 − d/2) 1 1 dx i = (−∆)d/2−2 d/2 2 0 (4π) Γ(2) We now take d = 4 − : Z 1 Γ() = i dx (−∆)− 2 2 (4π) Γ(2) Z i 1 1 2 2 2 2 = dx − γ + (6γ + π ) 1 − log p (x − x) E E 32π 2 12 Z i 1 2 2 dx − log(p (x − x)) − γE = 32π 2 1 i 2 = + 2 − log(p ) − γE 32π 2 or using the Mandelstam variable s ≡ p1 + p2 = p we can finally write i 1 iM = − log s + 2 − γE 32π 2 (8.35) (8.36) (8.37) (8.38) (8.39) (8.40) (8.41) (8.42) 90 CHAPTER 8. RUNNING COUPLINGS AND RENORMALIZATION GROUP or (the two constant are equivalent...) i 1 =− − log s + γE log 4π 32π 2 (8.43) We set the renormalization condition given by s = µ2 ⇒ (∵ we have massless particles in 2 2 the CM frame) t = µ2 (1 − cos θ), u = µ2 (1 + cos θ). Our renormalization condition gives us (using crossing symmetry) δZ = −iλ̃2 (V (s) + V (t) + V (u)) 3 λ̃2 2 − 3 log µ + 3γE log 4π = 32π 2 We have ∂ ∂ µ +β + nγ G̃(4) r = 0 ∂µ ∂ λ̃ (8.44) (8.45) (8.46) At order λ̃ we have 0 = 0. At O(λ̃2 ) we have µ dependence: µ ∂ (4) ∂ 3iλ̃2 G̃r = µ (−iδλ ) = ∂µ ∂µ 16π 2 and we have (8.47) (4) ∂ G̃r β = −iβ + O(λ̃) (8.48) ∂ λ̃ To order λ̃2 the gamma term still doesn’t contribute since O(γ) = λ̃, O(Gr ) = λ̃2 . Using our equation relating the β functions we have β= 8.2 3λ̃2 16π 2 (8.49) QED β-function We begin by considering the two photon Green’s function Z (2γ) Gµν (p) = d4 xe−ix·p hΩ|T (Aµ (x)Aν (0))|Ωi = g gFFg gxg + = i gµν pµ p ν − 2 p + O(e4 ) convention z}|{ 2 1 + e f (p, ) − δ3 (e, , µ) + p2 1 + i (8.50) (8.51) (8.52) Here we also set me = 0 (or equivalently µ me ). Thus we have βm = 0. The CallanSymanzik (CS) equation is given by (this is exactly the same as before with renaming the variables) ∂ ∂ (8.53) µ +β + 2γA G(2γ) µν = 0 ∂µ ∂e 8.2. QED β-FUNCTION 91 We need to compute δ3 in order to know our µ dependence. pµ pν i gµν − p2 q→ 1 = 2 q 1 − Π̂(q 2 ) gpg (8.54) Our on-shell renormalization condition was Π̂(q 2 = 0) = 0. We now abandon this condition and have the condition Π̂(q 2 = µ2 ) = 0. At one loop we have FF q→ q→ + x = Πloop (q 2 ) − δ3 (8.55) and hence we have δ3 = Πloop (µ2 ). We found earlier that Z 1 −8e2 Γ(2 − d/2) 2 dx(1 − x)x Πloop (q ) = (4π)d/2 0 ∆2−d/2 | {z } (8.56) 1/6 0 z}|{ where ∆ = m2e −x(1 − x)q 2 . Expanding the denominator we have 4 e2 1 2 2 µ-independent Πloop (q ) = − − log µ + constants 3 16π 2 (8.57) where we have used ∆ = elog ∆ = e log ∆ = 1 + log ∆ + ... (8.58) 3 3 Now β = O(e )(we’ll see why soon). If we are consistent to O(e ) we have the CS equation gives: ∂ (−Pµν δ3 ) + 2γA Pµν = 0 ∂µ 1 = i gµν − pµp2pν p2 +i . Hence we have µ where we have defined Pµν (8.59) e2 1 ∂ γA = δ3 = 2 ∂µ 12π 2 (8.60) Hence 1 ∂ e2 hΩ|ψ̄(x)ψ(0)|Ωi ⇒ γψ = µ δ2 = (8.61) 2 ∂µ 16π 2 This function is somewhat abstract and doesn’t have a physical meaning. However we now go on to find the β function which is physical: + hΩ|T ψ̄(x)ψ(y)Aµ (0) |Ωi = + + + x x + + x + x + (8.62) 92 CHAPTER 8. RUNNING COUPLINGS AND RENORMALIZATION GROUP The CS equation is given by ∂ ∂ µ +β + γA + 2γψ G(3) = 0 ∂µ ∂e (8.63) To lowest order we have The counterterms are + }ED x = i ν i 0 0 (−ieγ ) Dµν (q) ≡ (−ie)Fµ (p, p ) p/ p/ + + x G(3) = x (8.64) x 1 = −ieFµ + (ip/δ2 )Fµ + ... p/ = −ie(δ1 − 2δ2 − δ3 )Fµ (8.65) (8.66) so we have ∂ 0 = (−ie)µ (δ1 − 2δ2 − δ3 )Fµ + β(−i)Fµ + (2γψ + γA )(−ieFµ ) ∂µ ∂ β = e −µ (δ1 − 2δ2 − δ3 ) − 2γψ − γA ∂µ 1 ∂ β = eµ −δ1 + δ2 + δ3 ∂µ | {z } 2 (8.67) (8.68) (8.69) =0 where δ1 = δ2 is zero by gauge invariance. Thus we have eµ ∂ e3 β= δ δ3 = 2 ∂µ 12π 2 Recall that α(µ) = α(µ0 ) 1− 2α(µ0 ) 3π log µµ0 The coupling goes as ( ) 1/137 me L (8.70) (8.71) 8.2. QED β-FUNCTION where ΛL = me exp 3π 2αN R 93 mP lanck Chapter 9 Non-Abelian Gauge Theories (“Yang-Mills”) 9.1 Looking Closely at Abelian Gauge Theories QED is an “Abelian Gauge Theory”. You start with a Dirac Lagrangian: L = iψ̄γ µ ∂µ ψ − mψ̄ψ (9.1) This is the only thing you can write down that is renormalizable, Lorentz invariant, and only has a fermion field. Once you impose these requirements you see an accidental “global U (1)” symmetry: ψ → eiα ψ, ψ̄ → e−iα ψ̄ and hence L → L. What if we promote the global symmetry to a local symmetry? Then we have “local U (1)” (this is also known as “gauge U (1)”). ψ → eiα(x) ψ, ψ̄ → e−iα(x) ψ̄ ⇒ ψµ ψ → eiα ψ + i∂µ αeiα ∂µ ψ (9.2) and hence L0 6= L. We can fix this by letting ∂µ → Dµ where the covarient derivative obeys Dµ ψ → eiα Dµ ψ (9.3) This holds if Dµ = ∂µ − ieAµ (9.4) 1 ∂ α. e µ where Aµ → Aµ + This procedure is unique. To turn this into a real field we need to add kinetic terms for A. These terms should be quadratic in ∂ µ Aµ and require gauge invariance. The only way this can be done is by taking Fµν F µν or µναβ F µν F αβ . The second term violates parity so we are left with the first term. We get our Lagrangian and adding in a normalization we have 1 L = iψ̄γ µ Dµ ψ − mψ̄ψ − Fµν F µν 4 94 (9.5) 9.2. SU (2) 95 Here it was easy to find Fµν . However in different gauge theories it would be harder to construct. We now consider a useful way to build such terms. Note that phase factors commute with the covarient derivative (by construction). In other words Dµ (eiα ...) = eiα Dµ (...) (9.6) with that in mind consider the commutator: [Dµ , Dν ] ψ → eiα [Dµ , Dν ] ψ iα (9.7) {(∂µ − ieAµ )(∂ν − ieAν ) − (∂ν − ieAν )(∂µ − ieAµ )} ψ → e [Dµ , Dν ] ψ (9.8) −ie(∂µ Aν − ∂ν Aµ )ψ → eiα [Dµ , Dν ] ψ (9.9) iα −ieFµν ψ → e [Dµ , Dν ] ψ (9.10) (9.11) but Fµν doesn’t transform under gauge symmetry. So we know that Fµν ψ → eiα Fµν ψ. Thus we have −eiα ieFµν ψ = eiα [Dµ , Dν ] ψ (9.12) [Dµ , Dν ] ψ = −ieFµν ψ (9.13) Thus we can finally write 9.2 SU (2) We now generalize U (1) → SU (2). We put two fields with the same mass in a “doublet”: ψ1 (x) Ψ(x) ≡ (9.14) ψ2 (x) L= 2 X iψ̄a γ µ ∂µ ψa − mψ̄a ψ (9.15) a=1 = iΨ̄γ µ ∂µ Ψ − mΨ̄Ψ (9.16) ψ̄1 where Ψ̄ ≡ . This Lagrangian has an accidental global symmetry given by rotation ψ̄2 between the two doublets, Ψ→VΨ (9.17) where V is a 2 × 2 unitary matrix. The possible V matrices form a group known as U (2). We can write V as ! 3 X σk (9.18) V = exp iα0 + i αk 2 k=1 96 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) where 1, σ2k are the generators of U (2) algebra. We can split this symmetry into two parts: U (2) = U (1) × SU (2). SU (2) is defined in terms of its algebra: hσ σ i σk j i , = iijk (9.19) 2 2 2 where ijk are the structure constants of SU (2). We first consider the first identity generator. If we demand a local symmetry out of this generator we would get back QED for two different fields. Now consider the case of det V = 1 (an SU (2) symmetry). The identity operator does not obey this condition since it is not traceless. Thus we only have the Pauli matrices as our generators and we have ! 3 X σk (9.20) V = exp i αk 2 k=1 We promote this symmetry to a local (gauge) SU (2) symmetry. σk V (x) = exp iαk (x) 2 (9.21) where Ψ→VΨ , Ψ̄ → V † Ψ̄ but ∂µ Ψ → V ∂µ Ψ + (∂µ V )Ψ (9.22) and hence L is not invariant under this transformation. We need to introduce a covarient derivative. We define Dµ Ψ → V Dµ Ψ (9.23) We have 12×2 ∂µ − ig õ (9.24) (∂µ − ig õ )Ψ → V ∂µ Ψ + (∂µ V )Ψ − ig Ã0µ V Ψ (9.25) where õ is some matrix. To find our covarient derivative. We enforce the requirement in equation 9.23. In other words we set this equation to V (∂µ − ig õ )Ψ: i (9.26) Ã0 V = V à − (∂µ V ) g i Ã0 = V ÃV † − (∂µ V )V † (9.27) g P For linear order in α we can write V = 1 + i 3k=1 αk σ2k . It is sufficient for us to restrict ourselves to Hermitian, traceless 2 × 2 matrices. In this case we can write õ = 3 X a=1 Aaµ σa 2 (9.28) 9.2. SU (2) 97 With this one can show that our requirement for the covarient derivatives finally says 1 a a abc b c Aaµ → Aa0 Aµ α µ = Aµ + ∂µ α + g (9.29) Note that our vector boson transformation looks identical to the U (1) transformation with an extra part. All of this can be applied equally well to U (1) however since the group is so simple it is fairly trivial. We write our Lagrangian as L = iΨ̄γ µ Dµ Ψ − mΨ̄Ψ (9.30) = LD [ψ1 ] + LD [ψ2 ] + g 3 X Aaµ 2 X a=1 ψ̄i ! σ a 2 i,j ij γ µ ψj (9.31) where LD [ψ] = iψ(∂/ − m)ψ. We can try to guess our Feynman rules. This is a bit premature since we haven’t quantized our gauge fields. However let’s give it a shot. We have the interactions gE µ,a j → −igγ µ σ a 2 ij i 0 1 we have different fermion interaction. 1 0 We have yet to make these gauge fields dynamical. We need to add a kinetic term for these fields. To do this we use the trick we learned from QED: a a a a aσ aσ aσ aσ [Dµ , Dν ] Ψ = ∂µ − igAµ ∂ν − igAν − ∂ν − igAν ∂µ − igAµ Ψ 2 2 2 2 For example for σ1 = (9.32) a b σ a a a b σ = −ig ∂µ Aν − ∂ν Aµ − igAµ Aν , Ψ 2 2 σa Ψ = −ig ∂µ Aaν − ∂ν Aaµ + gabc gAbµ Acν {z } 2 | (9.33) (9.34) a Fµν (9.35) a a a We call this Fµν our field strength. This object is antisymmetric, Fµν = −Fνµ . We define k k σ −ig F̃µν Ψ ≡ [Dµ , Dν ] Ψ or F̃µν = Fµν 2 . We have − ig F̃µν Ψ → −ig F̃µν V Ψ (9.36) F̃µν → V F̃µν V † (9.37) and 98 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) We have two options of terms that conserve parity to add into our Lagrangian F̃µν F̃ µν or F̃µµ F̃ νν . However F̃µν = 0. We’re not done since F̃µν is still a matrix. The simplest thing we can do is take the trace of this (we can take the determinate but that’s a mess): tr F̃µν F̃ µν (9.38) The trace is gauge invariant: tr F̃ F̃ → tr V F̃ V † V F̃ V † † = tr V V F̃ F̃ = tr F̃ F̃ (9.39) (9.40) (9.41) Lastly we need to normalize this term. In terms of the non-matrix fields we have 1 (9.42) L = − tr F̃µν F̃ µν 2 1 k,j δ z 2}| { σk σ i 1 X k j,µν Fµν F tr =− 2 k,j 2 2 (9.43) 3 1 X k k,µν =− F F 4 k=1 µν (9.44) so this normalization gives the same normalization we had in QED. This is the kinetic term of 3 “photons” + cubic and quartic terms in A (due to the extra gijk Ajµ Akν addition). These are self interactions of the gauge field!In QED these vertices did not exist at tree level. However this is a property of non-Abelian gauge fields. We have diagrams such as vg vu u uv and even without any other particles. What’s important to note is that the gijk Ajµ Akν where not optional, they were required by gauge invariance. As an example, consider ψ ψ̄ → V V where V is a gauge boson. This an analogue to e+ e− → γγ. This corresponds to fg fg b photon + crossing + deguv Note that the last diagram is fundamental to non-Abelian gauge theories and is required in order for the Ward Identity to work. 9.3 General Recipe We now discuss gauge theories more generally. To construct the theory you need to 9.3. GENERAL RECIPE 99 • Choose a symmetry group (any Lie group, G). In our example this was SU (2). • Choose a representation r(G). In our example we chose the fundamental representation (∼ , in Young Tableaux notation). • Introduce n ≡ dim (r(G)) fields - “n-tuplet”. In our example we had n = 2ψ fields. These fields are called “matter fields” (as opposed to gauge fields). These n fields must be in the same representation of the Lorentz group. For Yang-Mills theories these have to be spin 0 or spin 1/2. In our example we have spin 1/2. • Consider a global symmetry ψ →Vψ (9.45) where V ∈ r(G). We build the most general Lagrangian that is invariant under this transformation: L = iψ̄γ µ ∂µ ψ − ψ̄ψ (9.46) • Promote the global symmetry to local. In other words we rotate our ψ such that ψ → V (x)ψ with P Ng V (x) = ei i αi (x)Ti (9.47) where Ng is the number of generators and the Ti are generators of the algebra G in the representation r(G). The number of generators is independent of the representation chosen. The number, Ng is equal to the dimension of the ad-joint representation (dim(Adj(G))). Where for the ad-joint representation we have (T i )jk = fijk (9.48) The generators are given in terms of the structure constants by [Ti , Tj ] = fijk Tk , where fijk are the structure constants of G. These Ti ’s are n × n matrices. To promote to a local symmetry we need to require ∂µ → Dµ = ∂µ − ig õ (9.49) where õ always has the same form and given by õ = Ng X Aiµ (x)T i (9.50) i=1 and the T i are the generators in the representation r(G). We introduce one new number that is our gauge coupling. Note that in order to enforce gauge invariance you are forced to have the same coupling for each Aµ . This is the case for simple groups (groups that do not have any generators that commute with all other generators). For non-simple groups we are allowed to have such coupling differences. g z }| { For example you can split the couplings for U (2) through U (2) = U (1) × SU (2). | {z } g0 100 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) We have the gauge field transformation i õ → V õ V † − (∂µ V )V † g (9.51) It’s a simple exercise to show that 1 Aiµ → Aiµ + ∂µ αi + f ijk Ajµ αk g (9.52) Note that there is no dependence on r(G). Further note that you can have any multiple of matter fields. However there are a single set of gauge fields that are fixed by G. One last comment is often people say “gauge fields belong in the adjoint representation, Adj(G)” (The more correct way to say it is transforms as the adjoint representation). For global transformations (α = const) we have Aiµ = Aiµ + f ijk Ajµ αk = Aiµ + (T i )jk Ajµ αk (9.53) (9.54) so Aiµ transforms through adjoint representation. However this is only for a global transformation and is strictly wrong in general due to the ∂α αi term. • The Field Strength Tensor is defined as ig F̃µν ψ [Dµ , Dν ] ψ (9.55) where Dµ are n×n matrices where n is the dimension of the representation. Further F̃µν = Ng X i Fµν Ti (9.56) i=1 i where Fµν = ∂µ Aiν − ∂ν Aiµ + gf ijk Ajµ Akν . This is independent of r(G). The transformation law is given by F̃µν → V F̃µν V † (9.57) • The Gauge Field Lagrangian is given by 1 L = − tr F̃µν F µν 2 assuming that the generators are normalized as tr (T i T j ) = 12 δ ij . (9.58) 9.4. UNIVERSAL COUPLINGS 101 • The general Feynman rules are as follows given below (the first result will be derived soon): k→ −igµν δba gauge depend. = 2 + µ,a term q + i ν,b p→ iδij = p/ − m + i i j ggg fFf Dg j = −igγ µ (T a )ij µ,a i vug vu uv µ,a k& ←q p% ν,b = gf abc (g µν (k − p)ρ + g νρ (p − q)µ + g ρµ (q − k)ν ) ρ,c = −ig 2 f abe f cde (g µρ g νσ − g µσ g νρ + (b ↔ c, ν ↔ ρ) + (b ↔ d, ν ↔ σ)) where i, j = 1, ..., dim(r(G)) and a, b = 1, ..., Ng . We can motivate the cubic and quartic vertices by looking at the kinetic term for the gauge bosons: Ng X a=1 a Fµν F a,µν = X ∂µ Aaν − ∂ν Aaµ + gf abc Abµ Acν ∂ µ Aa,ν − ∂ ν Aa,µ + gf ade Ad,µ Ae,ν a (9.59) 9.4 Universal Couplings An important property of gauge theories is that every particle charge under that the gauge group will have the same coupling constant unless the group is Abelian. We now prove this statement below. Suppose you have some Lie group with abstract generators τa , then the generators obey, [τa , τb ] = ifabc τc (9.60) and a representation of the group with generators Ta obeys the same algebra, [Ta , Tb ] = ifabc Tc (9.61) Now if the theory has two fields ψA , ψB (we consider fermions though one could just as easily discuss bosons) in the same reprsentation of a gauge group then by the definition of a gauge group they must transform the same way: a ψA → eiTa θ ψA θa ψB → eiTa ψB (9.62) (9.63) 102 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) Now the Lagrangian of these fields is of the form, L = ψ̄A DµA γψA + ψ̄B iDµB γ µ ψ + ... (9.64) where the covariant derivatives are, DµA = ∂µ − gA Wµa Ta (9.65) DµB = ∂µ − gB Wµa Ta (9.66) where here we assume different gauge couplings. But the first derivative implies that the Wµa field transforms as, Wµa → Wµ + f abc Wµ,b θc + 1 ∂µ θa gA (9.67) 1 ∂µ θa gB (9.68) and the second derivative implies that Wµa → Wµ + f abc Wµ,b θc + However Wµa are the same fields in either case. Thus we must have gA = gB . In the above discussion we assumed that the fields were in the same representation of the gauge group. This discussion fails if the fields are in different representations. In an Abelian group since there are no commutation relations, every field can be in a different representation of the Lie group. Thus every Abelian field can have different gauge couplings. 9.5 Quantum Chromodynamics Quantum Chromodynamics (QCD) is Yang-Mills theory with an SU (3) gauge group. For SU (N ) the number of generators is Ng = N 2 − 1. In this case since N is 3 we have 8 generators which in turn gives 8 gauge fields Aaµ (a = 1, 2, ..., 8). We call these fields gluons. There is one coupling constant g. They are given by structure constants f abc . We want to write the matter fields that are charged under this gauge group. The fundamental matter fields are the quarks. We only consider QED so we call these spin 1 Dirac fermions (in reality due to the weak interaction we should split these into Weyl 2 fermions). The quarks are qf = (u, d, s, c, b, t) (f = 1, ..., 6) we call this the “flavor” of the quarks. The spin of the particles are the same. The electric charges split into 3 sets of doublets. Their could be a symmetry in between these sets. However it would not exact since they have different masses. We take one of the quarks for simplicity (they all have the same interactions). These transform as the fundamental representation of SU (3). The transformations act on vectors given by U . dim() = N (Young Tableaux notation). U1 U = U2 (9.69) U3 9.6. QUANTIZATION OF YANG MILLS 103 where 1, 2, 3 → r, g, b (“colour”). The Lagrangian is given by 9 L= X iq¯f γ µ Dµ qf − mq¯f qf − f 1 X a a,µν F F 4 a=1 µν (9.70) P where Dµ = ∂µ − ig 8a=1 Aaµ T a and T a are the generators of SU (3) in the fundamental representation, the “Gell-Mann matrices”. 9.6 Quantization of Yang Mills We have Z W0 = " Z !# Ng X 1 DA exp i d4 x − F a F a,µν 4 a=1 µν (9.71) Q g Q3 Q a,µ where DA = N a=1 µ=0 x dAx . We tried this in QED and we ran into a problem. The procedure was to go to Fourier space and we got a Green’s function which we needed to invert and it was not invertible. We traced that problem back to the fact that this integral is redundant (we were dealing with all the gauge fields). We required the Faddeev-Popov procedure. The idea is the same here. The gauge freedom gives 1 α(x) Aaµ −−→ Aaµ + ∂µ αa + f abc Abµ αc g (9.72) There is a huge amount of redundancy in this integral which we need to eliminate. The procedure is as follows. You choose some functional of the gauge field and you choose this to be zero, G [A] = 0. Geometrically: A Int space G[A]=0 Orbits We have a mathematical identity: Z δG [Aα ] α 1 = Dαδ (G [A ]) det δα (9.73) We have Z W0 = Z δG [Aα ] 4 α α DA Dα exp i d x (L [A ]) δ (G [A ]) det δα α (9.74) 104 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) where we shifted our integration measure to Aα . Furthermore, the Lagrangian is by definition gauge invariant so we have L [Aα ] = L [A]. We now relabel our integration variable. Z Z δG [Aα ] 4 W0 = DADα exp i d x (L [A]) δ (G [A]) det (9.75) α δα A →A R Now it’s clear that Dα = N is just a constant. So we can forget about it Z Z δG [Aα ] 4 W0 = DA exp i d x (L [A]) δ (G [A]) det (9.76) α δα A →A For U (1) we had G [A] = ∂µ Aµ − ω(x) and G [Aα ] = ∂µ Aµ + 1e ∂ µ α − ω(x). Clearly δG = const (A independent). δα For non-Abelian gauge theories we have G [A] = Ng Y (∂µ Aa,µ − ω a (x)) (9.77) a=1 and Y 1 µ a a,µ abc b,µ G [A ] = ∂µ A + ∂ α + f A αc − ωa (x) g a (9.78) δG δca = ∂µ ∂ µ + f abc ∂µ Ab,µ δαc g (9.79) α So we have The definition of the first term will become clear soon. In order to make sense expression we go back to the discrete version of our integrals. x → xi , α(x) Aaµ (x) → Aaµ,i . 1 α a α abc b,µ c a,µ G [A (x)] = ∂µ A + ∂ αa + f A α − ω a (x) g a,µ a,µ b,µ µ A − Ai 1 1 a a abc Ai+1 − Ai c → i+1 + α + α − 2α + f α i+1 i−1 i ∆ g ∆2 ∆ αc − αic + f abc Ab,µ i+1 − ωia ∆ of this → αi , (9.80) (9.81) So we have δGa dGai → δαc dαjc 1 = (δca )(δi+1,j + δi−1,j − 2δi,j ) + ... 2 g∆ a,i ≡ Mc,j (9.82) (9.83) (9.84) 9.6. QUANTIZATION OF YANG MILLS 105 The next step was to convert everything in the path integral into a term in the Lagrangian. The delta function becomes a gauge fixing term. In QED we had an extra Z R 4 2 Dωe− d xω /2ξ (9.85) and we have 1 X Lgauge fix. = (∂µ Aa,µ )2 2ξ a (9.86) which gives Z W = Z 1 a a,µν 1 4 a,µ 2 DA exp i d x − Fµν F + (∂µ A ) 4 2ξ (9.87) The last question is how to convert the determinant into a term into the Lagrangian. This is done by something which are known as ghosts. These in some sense behave like ordinary fields. They take part into the Feynman rules but just cannot propagate. Ghosts are essentially just way to exponentiate the determinate term so we will be able i,a Mj,b }| { z to perform a Taylor expansion. We want to rewrite the determinate: det ∂µ ∂ µ δ ab − gf abc ∂µ Ac,µ in an exponential form. Faddeev and Popov showed that you can write det ∂µ ∂ µ δ ab − gf abc ∂µ Ac,µ = Z Z b 4 a µ ab abc c,µ DcDc̄ exp i d xc̄ ∂µ ∂ δ − gf ∂µ A c (9.88) The new ghost fields, c, c̄ are Grassman variables. Recall that for two Grassman variables: Z dθdθ̄eθ̄Aθ = A (9.89) Earlier we showed that, Z ! Y dθi dθ̄i P e i,j θ̄i Ai,j θj = det A (9.90) i So any determinate you wish can be written as a product of integrals over Grassman numbers. The c(x) are “ghost fields”. They are anti-commuting however unlike Dirac fields they don’t have an extra index and are spin-0 particles. Anti-commuting spin-0 fields violate spin-statistics! On the other hand spin-statistics is only for real asymptotic particles. So far we have not introduced a source for these ghost fields and adding this source will violate spin-statistics theorem. In other words these cannot be external particles (there are no S matrix elements for these particles). 106 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) Adding in these ghost fields we have Z Z 1 a a,µν 1 4 a,µ 2 a µ ab W = DADcDc̄ exp i d x − Fµν F + (∂µ A ) + c̄ ∂µ (D ) cb (9.91) 4 2ξ Z Z 1 a a,µν 1 4 a,µ 2 µ ab = DADcDc̄ exp i d x − Fµν F + (∂µ A ) − (∂µ c̄a ) (D ) cb 4 2ξ (9.92) where we have just integrated by parts to which the derivative term. Furthermore Dµ is just the just regular covarient derivative since we have (9.93) ∂µ ∂ µ δ ab − gf abc ∂µ Ac,µ = ∂µ ∂ µ δ ab − gf abc Ac,µ ab c,µ c = ∂µ ∂ µ δ ab − g TAdj A (9.94) The Feynman rules for these ghosts are, k→ µ, a ν, b k→ µ, a = δ ab −i 2 k + i kµ kν gµν − (1 − ξ) 2 k −i 2 k + i ← a ν, b = δ ab p abc µ c, µ = gf p b Notice that ghosts can only show up in pairs. With this in mind and the fact that they cannot show up in external states, they only appear in loop diagrams. Thus in tree level you can just forget about ghosts. 9.7 Unitary and Ghosts The fact that ghosts can only show up in loops is not obviously consistent with the unitarity of the S matrix. This topic is a bit subtle. We will not do many calculations however we will go over the logic and series of statements that describe the situation. The statement of unitarity is the statement that (this is the same as saying that probability is correctly normalized) S †S = 1 (9.95) 9.7. UNITARY AND GHOSTS 107 gn where S is the S matrix. We split S into the part that’s associated with no collision and the interested part which is due to the interactions, S = 1 + iT (see for example [Peskin and Schroeder(1995)] pg. 104). S = 1 + iT (9.96) (1 + iT ) 1 − iT † = 1 1 + i T − T † + T †T = 1 ⇒ −i T − T † = T † T (9.97) Plugging this into 9.95 we have This is further discussed in [Peskin and Schroeder(1995)]. What this says is that (for φ4 ) + ... = ( * ) d3p/E1 d3p/E2 This is known as the optical theorem. Unitarity becomes more intricate for non-Abelian theories. We denote V as the following amplitude: (ψi ψ̄j → Va Vb , two fermions into two gauge bosons). The square of this corresponds to ψi ψ̄ → ψk ψ̄` . + + =Im + + + This is equivalent to Z 3 X µν s1 d k1 d3 k2 Vs1 ,s2 (1, 2 → k1 k2 ) Vs∗1 ,s2 (k1 , k2 → 3, 4) = M µ (k1 )sν2 (k2 )sρ1 ∗ sσ2 ∗ M∗ρσ E1 E2 s s 1 2 (9.98) P Recall that in QED we had 2s=1 sµ (k)s∗ ν (k) = −gµν + Akµ kν where we relied on the Ward identity to simplify our results. This was all relatively simple however things get complicated in non-Abelian gauge theories. It turns out that external gauge bosons this replacement is not P for ν2 or(more ( ( ( allowed (we don’t have ((( µ ( ν → −gµν ). This is because when you have two gauge bosons you can make one boson unphysical. If you make one boson longitudinal then Ward identity for the other boson will not work. What always holds is 3 X s=0 sµ ∗,s ν = −gµν (9.99) 108 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) however this requires summing over all the polarizations. You can work this out and we’ll basically be working this out on the homework. This now sounds like a problem. Since on the left hand side of the unitarity relation we have ’s (we have external gauge bosons) and on the right hand side we don’t have any such terms. The term with ghosts fixes this problem. This is quite neat. The ghosts are in a sense there to preserve unitarity. The formal proof of this is known as the BRST theorem. We now go back to tree level. We just said that at tree level we don’t worry about ghosts. However consider ψi ψ̄j → Va Vb . We have M = Mµν sµ1 (p1 )sν2 (9.100) In QED we would have QED X 2 z}|{ |M| = gµα gνβ Mµν M∗,αβ (9.101) s1 ,s2 However in non-Abelian theories we can’t do that. We have two options 1. We can have explicit ’s. If p = (E, 0, 0, E) then = √12 (0, 1, ±i, 0). You can just P compute s1 ,s2 |Ms1 ,s2 |2 without using the spin sum formula. We will do this on the homework. This always works and you never need to mention ghosts. 2. There is a second way to deal with this that is more but a little bit P3 convenient s ∗,s = −gµν (notice our weird. This is to remember that we still s=0 µ ν P have summing limits). What you can do is use µ ∗ν → −gµν but then you always need to include an extra diagram: 2 In other words our tactic is to add an extra diagram with the external gauge bosons are replaced with external ghosts. 9.8 Renormalized Perturbation Theory This just means that we have counter terms. We start with the Yang-Mills Lagrangian. We assume that the fermion is massless. This makes the calculations for the β functions a lot easier but this is not necessary. 1 a a,µν 1 / i,j ψ j − Fµν L = ψ̄ i iD F + (∂µ Aa,µ )2 − (∂µ c̄a )(Dµ )ab cb + LCT 4 2ξ (9.102) 9.8. RENORMALIZED PERTURBATION THEORY 109 where 1 LCT = δ2 ψ̄i∂/ψ − δ1 g ψ̄γ µ Aaµ T a ψ − δ3 (∂µ Aaν − ∂ν Aaµ )2 + (4 more CT’s) 4 (9.103) In the case of QED we did not have the last 4 terms since ghosts didn’t exist. In this case gauge invariance no longer implies that δ1 = δ2 . This is because now for example (∂µ Aaν − ∂ν Aaµ ) is not even gauge invariant since the gauge invariance is only saved by the ghost CT’s. What we are really interested in is the β function. To do this we need to use the CS equations. We need to compute gave hAAi −−→ γA gave ψ̄ψ −−→ γψ gave ∂e(M ) ψ̄ψA −−→ β = M ∂M The formula we had in QED for the β function was ∂ 1 β = eM −δ1 + δ2 + δ3 ∂M 2 (9.104) If you look back at that procedure. Nothing in it used the Feynman rules of the theory in anyway. We just used the structure of counterterms. In non-Abelian Yang Mills we can do the exact same thing. Just like in QED the three point function will be × + + × + × + + × + + i j This calculation is historically significant and lead to a Nobel prize. The counterterms have the following conditions δ2 : Σ2 (k/ = M ) = 0 We evaluate the following (9.105) 110 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) + k→ k→ × The first diagram gives, Feynman gauge z }| { d (−igµν ) d ` iδ jk ν b µ a iM = (ig)2 γ (T ) δab γ (T ) k` ij /̀ + k/ + i (2π)d k 2 + i Z X dd ` µ 1 1 a a 2 (T T )i` γµ 2 = (ig) γ d /̀ + k/ k (2π) a {z } | {z }| same as QED C2 (rf )δi` Z ig 2 Γ(2 − d/2) = k/C2 (rf )δi` (4π)d/2 (k 2 )2−d/2 (9.106) (9.107) (9.108) This gives (using our renormalization condition for Σ2 ) δ2 = − g 2 Γ(2 − d/2) C2 (rf ) (4π)d/2 (M 2 )2−d/2 (9.109) In this calculation we got lucky. We were able to factor our answer into a QED contribution and a group theory contribution. This will not happen again. We now calculate δ1 . For this we use → igδ1 γ µ (T a )ij Our renormalization condition is to set F1 (q 2 = M 2 ) = 0. We need to calculate = (T b T a T b )ij ΓµQED (q, ...) i j we have the matrix coefficient: T b T b T a + T b T a , T b = C2 (rf )T a + T b f abc T c (9.110) 9.8. RENORMALIZED PERTURBATION THEORY 111 Structure constants are antisymmetric so we can write this as 1 T b T b T a + T b T a , T b = C2 (rf )T a + f abc T b T c + T b T c 2 1 a = C2 (rf )T + f abc T b T c − T b T c 2 1 a = C2 (rf )T + f abc T b , T c 2 1 a = C2 (rf )T + f abc f bce T e 2 1 a f bac f bce T e = C2 (rf )T − | {z } 2 (9.111) (9.112) (9.113) (9.114) (9.115) b ) (T b ) (TAdj ac Adj ce 1 b b = C2 (rf )T a − (TAdj TAdj )ae T e 2 1 a = C2 (rf )T − C2 (Adj)δae T e 2 (9.116) (9.117) (9.118) We now discuss the group structure of another diagram contribution: a j i This has the group theory contribution of 1 abc bce e 1 (T b T c )ij f abc = f abc T b , T c = f f T 2 2 | {z } (9.119) −C2 (Adj)δae The full calculation gives δ1 = −g 2 Γ(2 − d/2) (C2 (rf ) + C2 (Adj)) (4π)d/2 (M 2 )2−d/2 There is one more set of diagrams to calculate which is + + × + (9.120) 112 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) We now calculate the group factors. The fermionic loop has a group theory factor of, (T a )ij (T b )ji = tr T a T b = C(rf )δ ab (9.121) while for the gluon loop we have, f acd f bcd = C2 (Ajd)δab (9.122) A full calculation of the last counterterm gives 2 X Γ(2 − d/2) 5 4 g C2 (Adj) − C(rf ) δ3 = d/2 2 2−d/2 (4π) (M ) 3 3 r (9.123) f 2 We have (M 2 ) = e log M = 1 + log M 2 + ... (and a similar expansion for 4π). Using these expressions we have 1 Γ(2 − d/2) 1 1 2 = + γE − log M − log(4π) + O() (9.124) (4π)d/2 (M 2 )2−d/2 d=4− 16π 2 ∂ Using M ∂M log M 2 = 2, Our expression for the β function is # " g3 4X 11 β(g) = − C2 (Adj) − C(rf ) 16π 2 3 3 f (9.125) We now interpret this result. The first interesting thing to notice is that this is non zero if there are no fermions at all. What’s even more interesting is that β < 0. This is completely different from the φ4 case and the QED case where the β function was positive. This is interesting since we know that β controls how the coupling changes with scale that we are studying the system at. We had the graph in QED of g M 9.8. RENORMALIZED PERTURBATION THEORY 113 The reason this worked in this way was because the β function was positive. However in this case we will have the opposite effect. Instead we have g 1 M This is called an asymptotically free theorem. However at low energies this theory becomes non-perturbative. You cannot take a zero momentum scattering (or equivalently take the quantum mechanics limit). QCD of course has fermions however it doesn’t have enough fermions to change the sign of the β function. In QCD the energy that we have non perturbative results is about 1GeV or below. In QED we had screening: -+ -+ + - + - + -+ + - + In non-Abelian Yang Mills we have “anti-screening” due to gauge fields. To understand where this anti-screening comes from we consider a SU (2) gauge theory. We have 1 a a,µν L = − Fµν F + gAaµ j a,µ + Lmatter 4 (9.126) where j a,µ is the current (it’s easy to show that our earlier form of the Lagrangian is a equivalent to this with the appropriate conserved current) and Fµν = ∂µ Aaν − ∂ν Aaµ + gabc Abµ Acν . We consider the equations of motion for the gauge field, δL δL = ∂ν a δAµ δ∂ν Aaµ (9.127) 114 CHAPTER 9. NON-ABELIAN GAUGE THEORIES (“YANG-MILLS”) Recall that in QED we had Ei = F0i . We can extend this idea to a non-Abelian gauge theories and write, Eia = F0ia . This is often called the “chromo-electric field”. Note this is a bit of a misnomer in this case since we are dealing with SU (2) but people often use this term for all non-Abelian gauge theories. The equations of motion are ∂i Eia = gρa − gabc Abi Eic (9.128) where ρa ≡ j a,0 is called (in analogy with QED) the charge density. This is the analogue to Gauss’s law. Consider a point-like charge, ρa = δ 3 (x)δ a,1 . Note that we needed to specify a direction in the group space for the charge. Notice that Gauss’s law is a non-linear equation. The g 2 QED classical solution is E1 = 4πr = E3 = 0. You can choose a gauge such that 2 r̂, E a a A = 0 and A0 6= 0. If we do this then (since aab = 0) the difficult term disappears. So we can recycle the QED solution. This was a classical computation. However the β function arose from a quantum solution (it came from loop corrections which necessarily imply the use of quantum mechanics). These non-linear terms inevitably come in at the quantum level. In quantum mechanics we can’t just set A = 0. In other words, δA 6= 0. We have -> A2 +1 +1 +3 -1 -> E1 anti-screening! Consider δA2 . The sources are δE3 . So we have ∇ · E3 = gδA2 · E⊥ (9.129) ∇ · δE1 = −igδA2 · δE3 (9.130) So we have This is demonstrated above. 9.9 RG flows (evolution) So far to display our running couplings we have shown plots of g as a function of the renormalization scale. There is an alternative way to represent this. 9.9. RG FLOWS (EVOLUTION) 115 IR free g g( ) Such theories are called IR free. These include λφ4 , QED , Yang-Mills when β > 0. Alternatively we can have UV or asymptotically free theories: g g( ) Asymptotically Free This includes theories such as Yang-Mills with no/little matter (β < 0). The alternative, which we didn’t see in our calculation but is possible is called “fixed-point”. This occurs when β = 0, but λ 6= 0. You have an interacting theory but the coupling doesn’t run. For example * * IR-stable fixed point The theory is “attracted” to some stable coupling. We have β ∼ aλ + bλ2 + ... (9.131) Another thing that’s possible is so called Conformal Field Theories. These have β = 0 for any λ. For this to be the case you require some symmetry to make it this way. This is also known as scale invariance. Chapter 10 Broken Symmetries 10.1 SSB of Discrete Global Symmetries Classical Analysis The symmetries we have discussed so far have been exact. There are two types of symmetry breaking. Explicit symmetry breaking is fairly trivial and less interesting. Spontaneous symmetry breaking (SSB) plays a key role and nature and can be very subtle. A field transformation is φ → Λφ (where φ is a scalar or a fermion, Λ ∈ Rep(G), and G can be a Lie or discrete group. Λ is x-independent, global or a function of x and local. It is a symmetry if L → L + ∂µ J µ (10.1) As an example consider a local discrete symmetry. We take the Lagrangian to be given by 1 L = (∂µ φ)2 − V (φ) (10.2) 2 2 with V (φ) = µ2 φ2 + λ4 φ4 . This theory is invariant under φ → φ0 = −φ. This is a Z2 symmetry. This theory cannot produce an odd number of external legs. This is clear diagrammatically: An odd number of external legs is forbidden due to the Z2 symmetry. This symmetry can be broken in two different ways. 116 10.1. SSB OF DISCRETE GLOBAL SYMMETRIES 117 1. This can be done explicitly by adding the term ∆V = φ3 to the Lagrangian. You can then do perturbation theory in . To order 0 Z2 is a good symmetry. 2. Alternatively this can be broken spontaneously. We denoted the quadratic term 2 as µ2 φ2 . However we don’t really know the sign of µ2 . The way to tell the sign of terms is by the requirement that the Hamiltonian ( 21 (φ̇2 + (∇φ)2 ) + V (φ)) is bounded from below. To be bounded from below we must have λ > 0. However we can have µ2 > 0 or µ2 < 0. In either case we have 2 V 2 >0 V -v <0 v The first case is what we’ve done before. Consider µ2 < 0. Classically our system 2 will settle down in the lowest energy state. δV = 0 ⇒ v ≡ −µ . You don’t know δφ λ which point the system will fall to. If we lived in a universe where this was present and the field had enough time to evolve and settle at a lowest energy state. By measuring this field we could find whether out field will be in either v or −v. So even though the Lagrangian is symmetric about this symmetry, to an observer it will appear the system is not symmetric. Once you specify which configuration your system is in you have spontaneously broken the symmetry. Quantization In the normal case we quantize the field by finding the classical equations of motion: φ + µ2 φ = −λφ3 (10.3) We first solve the system in the case of λ = 0: φ ∼ αk e−ik·x + αk∗ eik·x (10.4) where k 2 = µ2 . We then replace αk → âk , α̂k∗ = â†k . The vacuum is given by âk |0i = 0 for all k. Hence we have h0|φ̂|0i = 0 (10.5) It turns out that this approach doesn’t really work when µ2 < 0. The reason is very simple. If you try to just solve this classically with µ2 < 0 then formally the solution still works but now k 2 < 0 which means that at least something in k must be imaginary. In particular it means that k 0 is imaginary, k 0 = ik̄. Hence the solutions aren’t oscillator by grow in time. i.e.. φ ∝ ekt . Physically this is because we are looking at an unstable system (when λ = 0) since the potential goes to negative infinity. We can’t just throw 118 CHAPTER 10. BROKEN SYMMETRIES away λ because it’s what stabilizes the potential. We cannot take it to be zero. The lesson is that we cannot just ignore this λ in the beginning. The obvious thing to do is instead of ignoring λ which was basically equivalent to expansions around small perturbations around zero, what we need to do is expand about the actual minimum of the theory. This is the same thing you would do in classical mechanics. If you want to find the oscillations of a ball in such a potential well then you should expand about the minima. Around the minima we have (from the Euler Lagrange equations) φ + δV =0 δφ (10.6) φ0 = +v or φ0 = −v. We have δV 1 δ 3 V δ 2 V φ + (φ − φ0 ) + + (φ − φ0 )2 + ... = 0 δφ φ0 δφ2 φ0 2! δφ3 φ0 {z } | {z } | ≡m2 interactions (10.7) It’s easy to show that all the interactions are proportional to λ. We define a new field ξ(x) = φ(x) − φ0 . So we have ξ + m2ξ ξ = 0 (10.8) up to O(λ). Our new field is ξ and it has solutions of ξ ∼ αe−ik·x + α∗ eik·x (10.9) with k 2 = m2ξ where m2ξ = −2µ2 . The quantization is trivial. We do the same replacement we had before ξ ∼ âk e−ik·x + â†k eik·x (10.10) We have the vacuum given by âk |0i = 0 for all k and then h0|x̂|0i = 0. However what is the condition based on our initial field? Recall that φ̂ = φ0 + ξˆ and thus h0|φ̂|0i = φ0 (10.11) This is called the vacuum expectation value (VEV) of the system. This is a huge difference in the physics!If for example the electric field (and hence Aµ ) had a VEV there would be a background field everywhere which would mean if you had a charged particle it would be accelerated in some direction in space. We can now create particles â†k |0i = |ki (10.12) These are massive particles that obey E 2 = k2 + m2ξ . 10.2. SSB OF CONTINUOUS GLOBAL SYMMETRY 119 You can insert the interactions in two different ways. One way is to rewrite the potential in terms of ξ: µ2 2 λ 4 φ + φ 2 4 1 2 2 = mξ ξ + λ3 ξ 3 + λ4 ξ 4 |{z} |{z} 2 V (φ) = ∝λv (10.13) λ Note that we have gained a cubic interaction out of a Lagrangian invariant under Z2 . 10.2 SSB of Continuous Global Symmetry Classical Analysis +iφ2 Consider V (Φ) = µ2 |Φ|2 + λ |Φ|4 where Φ = φ1√ . The symmetry of this Lagrangian is 2 iα 2 Φ → e Φ where α is a constant. If µ > 0 then the VEV is at Φ = 0. If µ2 < 0 then we have V 1 Vacuum Manifold 2 2 2 −µ δV and δV = δΦ . Note that before we had two minima but now ∗ = 0 which gives |Φ| = δΦ λ we have an infinite number of vacua. All the different vacua are known as the vacuum manifold. We can get all the vacuums by applying the transformations on the VEV: r −µ2 1 = eiα Φ0 (10.14) Φα = eiα √ λ 2 This is a very general property of spontaneous breaking. We have spontaneous breaking of U (1). Quantization Our equation of motion are δV =0 δφ1 δV φ2 + =0 δφ2 φ1 + (10.15) (10.16) 120 CHAPTER 10. BROKEN SYMMETRIES Due to the symmetry the physical results will not depend on the vacuum we choose. We choose to expand about φ1 = v and φ2 = 0. The equations of motion give δV v δ 2 V δ 2 V = 0 (10.17) interaction √ φ1 + (φ − ) + + + 1 δφ1 δφ21 (v,0) δφ1 δφ2 (v,0) terms 2 (φ1 ,φ2 )=(v,0) δ 2 V δ 2 V ξ+ φ2 = 0 (10.18) ξ + 2 δφ δφ1 δφ2 1 (v,0) in terms of ξ ≡ φ1 − √v . 2 (v,0) We also have a second equation: δ 2 V δ 2 V φ2 = 0 (10.19) φ2 + ξ+ 2 δφ2 δφ1 (v,0) δφ2 (v,0) 2 V We define a mass matrix: m2ij ≡ δφδi δφ . We can put our equations together: j (v,0) For our example we have V = µ2 2 ξ φ2 +M (φ21 + φ22 ) + λ 4 ξ φ2 =0 (10.20) 2 (φ21 + φ22 ) . We can take the derivatives: δ2V = µ2 + 3λv 2 = −2µ2 2 δφ1 (10.21) δ2V (φ1 ,φ2 )=(v,0) = 2λφ1 φ2 −−−−−−−→ 0 δφ1 δφ2 (10.22) δ2V λ (φ1 ,φ2 )=(v,0) = µ2 + + λφ21 + 3λφ22 −−−−−−−→ 0 2 δφ2 2 (10.23) Lastly Hence not only is the mass matrix diagonal but one of the eigenvalues is zero. In the 2 language of quantum field theory the δφVi become masses. We have two particles with masses m2ξ = −2µ2 and mφ2 = 0. The massless particles are known as Goldstone bosons. This mass also turns out to be included in the quantum corrections. The existence of the particle is easy to understand classically. Consider a mechanical system at the point near (v, 0) (see diagram of potential above). It’s clear that there is one direction along φ1 that there is a restoring force. However we can also move along φ2 along the vacuum manifold and not change the energy and hence there is no restoring force. In the language of oscillator that means you have an oscillator with zero frequency. The fact that the potential is flat was a consequence of the theory. This just follows from the symmetry of the theory. Since the symmetry holds to all orders quantum corrections do not change the mass of the Goldstone boson. 10.3. LINEAR SIGMA MODEL 10.3 121 Linear Sigma Model Now that we know how this works in the U (1) case we will generalize these ideas in what’s known as the linear sigma model. We start with N real fields, φi : L= N X 1 i=1 2 (∂µ φi )2 + V (φi ) (10.24) We consider a potential that is invariant under rotation of the fields about themselves: φi = Ri,j φi . Where R is orthogonal: RT = R−1 . The only way this can occur is if the potential only depends on the magnitude of the vector φ1 φ2 φ = .. . φN We take V (φ) = µ2 φ2 + λφ4 (10.25) !2 = µ2 X i φ2i + λ X φ2i (10.26) i 2 . The manifold is As before we take µ2 < 0. The vacuum manifold is given by φ2 = −µ λ one dimension lower then the number of degrees of freedom in the system SN −1 . We pick a vacuum 0 .. φ= . 0 v We define σ(x) = φN (x) − v and we get 2 cubic and 1 2 −2µ σ + V (φ) = quartic 2 terms (10.27) The key point is that we don’t have any mass terms except for σ so we have N − 1 Goldstone bosons. We could have predicted how many Goldstone bosons will appear by just thinking about the symmetry of the theory. We can express the symmetry of the Lagrangian as R = eiαi Ti where Ti are the generators of O(N ). The dimensionality of the O(N ) group is N (N2−1) . This comes from the condition that Ti ’s need to be antisymmetric. Consider Rφ0 = (1 + iαi Ti + ...)φ0 (10.28) 122 CHAPTER 10. BROKEN SYMMETRIES We have two possibilities, Ti φ0 = 0 Unbroken gen. Ti φ0 6= 0 Broken gen. The Ti matrices take the form Ti = 0 .. . ... 1 −1 0 (10.29) The number of broken generators is equal to the number of Goldstone bosons. This is called Goldstone theorem. We did two examples but you can do any group you want. In nature we don’t think anything like this is happening exactly. But in QCD there are a lot of combinations of quarks. The pions for example are much lighter then the other particles. You can discuss the physics of pions as Goldstone bosons. You can add explicit breaking to give them some mass to correct that they are not massless. In many beyond the SM theories people speculate that different particles are approximate Goldstone bosons. 10.4 More on the Mexican Hat Potential We now go back to the original Mexican Hat potential, V (Φ) = µ2 |Φ|2 + λ |Φ|4 , with +iφ2 . If this were a mechanics problem then it would be clear µ2 = −λv 2 < 0 and Φ ≡ φ1√ 2 that it is more convenient to use fields that obey the symmetry of the problem. In this case this would be a degree of freedom that represents the angle. There is nothing that stops you from doing this in Quantum Field Theory. With this in mind we introduce a non-linear field redefinition 1 Φ = eiπ(x)/v √ (v + σ(x)) 2 (10.30) Our potential becomes µ2 λ (v + σ)2 + (v + σ)4 2 4 1 2 2 = mσ σ + λ3 σ 3 + λ4 σ 4 2 V (Φ) = (10.31) where we drop the unimportant constant term and the linear term disappeared since 10.5. SSB OF GAUGE SYMMETRIES (“THE HIGG’S MECHANISM”) δV δφ 123 = 0. The kinetic term is given by 2 1 σ + ∂µ σ |∂µ Φ| = i∂µ π 1 + 2 v 1 1 1 1 = (∂µ σ)2 + (∂µ π)2 + (∂µ π)2 σ + 2 (∂µ π)2 σ 2 2v |2 {z 2 } |v {z } interactions can. normalization 2 Notice that the interaction terms have couplings with dimensions of inverse mass!These interactions appear non-renormalizable. However that is very naive and power counting is not sufficient to test Renormalizability. This theory is still renormalizable (it is completely equivalent to the original φ4 theory). Further notice that π only enters the Lagrangian in terms of it’s derivative. Our U (1) symmetry acts on π and σ in a very simple way: π → π + α “shift” σ→σ (10.32) (10.33) To have an invariant Lagrangian we can’t have any dependence on π, since π is not invariant. For example if we had a aπ 2 term we could apply a shift and get a(π + α)2 which differs from the first Lagrangian. Hence we can only have (as we see above), L [σ, ∂µ π]. In the original theory renormalizability was clear and here the symmetry is clear. Further we have hΩ|T (π̃(p)...)|Ωi ∝ pµ · (...) (10.34) 10.5 SSB of Gauge Symmetries (“The Higg’s Mechanism”) Abelian Case Today we will consider the Abelian example of SSB. Later we will the more intricate SM SU (2) × U (1) → U (1) mechanism. We have the same potential as before only now we consider a gauge symmetry: 1 L = |Dµ φ|2 − V (Φ) − Fµν F µν 4 (10.35) where V (Φ) = −µ2 |Φ|2 + λ |Φ|4 (we make the minus sign on µ2 explicit such that now µ2 > 0) and Dµ = ∂µ + ieAµ . Our VEV is v2 = and our vacuum is Φα = eiα √v2 and Aµ = 0. µ2 λ (10.36) 124 CHAPTER 10. BROKEN SYMMETRIES We’ll start by solving this problem heuristically and follow up with a more rigorous derivation. We fix Φ to it’s vacuum value, Φ0 = √v2 . This gives 2 v e2 v 2 2 Aµ Aµ LΦ kin = |Dµ Φ| = ieAµ √ = (10.37) 2 | 2 {z } mass term We got a mass term for the gauge field, mA = ev. When we first began talking about QED we rejected these terms because they were not gauge invariant (Aµ → Aµ − 1e ∂µ α). We started with a theory that’s completely gauge invariant by shifting the Φ field, Φ → eiα(x) Φ spontaneously broke gauge invariance. Picking the vacuum. We now attack this problem more carefully. We begin by shifting our field: 1 Φ = eiπ/v √ (v + σ(x)) 2 (10.38) This shift gives 2 1 σ + ∂µ σ + ieAµ (v + σ) ∂µ + π 1 + 2 v 1 1 σ 2 = (∂µ σ)2 + (∂µ π + evAµ )2 1 + 2 2 v This is the same kinetic term as before except for the extra gauge field terms: |Dµ φ|2 = (10.39) 1 ev∂µ πAµ + e2 v 2 Aµ Aµ (10.40) 2 Note that 10.39 is still manifestly gauge invariant since σ → σ, π → π + α and Aµ → Aµ − 1e ∂α . The second mass term we knew would be there based on our earlier heuristic arguments. However there is a new interesting term, ev∂µ πAµ . This term is very strange. This leads to propagators starting as one field and turning into another. This means that we are probably not using a good basis. There is probably more then one way to solve this problem and understand what the physics is. We will use the standard solution. The idea is instead of trying to redefine fields, we have another approach. Recall that in QED Lgauge fix = − 2ξ1 (∂µ Aµ )2 . This allowed us to compute a propagator. This is a remnant of the choice we made that said that G [A] = ∂µ Aµ = 0. This was a good choice but it was not unique. Having said that, with theories with spontaneously broken symmetries we use what’s known as Rξ gauge. Our gauge fixing Lagrangian is 1 Lgauge fix = − (∂µ Aµ − ivξπ)2 2ξ 1 e2 v 2 ξ 2 (∂µ Aµ )2 + evπ(∂µ Aµ ) − π 2ξ 2 1 e2 v 2 ξ 2 = − (∂µ Aµ )2 − ev(∂µ π)Aµ − π 2ξ 2 =− (10.41) 10.5. SSB OF GAUGE SYMMETRIES (“THE HIGG’S MECHANISM”) 125 where we have integrated by parts. The middle term cancels the term we didn’t like ev∂µ πAµ in equation 10.40. However there is a new strange mass term for the π field. Based on physical arguments we there should not be any mass for π. We have three propagators: k→ −i kµ kν µ ν = 2 gµν − 2 (1 − ξ) k − m2A k − ξmA 1 p→ π = 2 p − m2A 1 p→ σ = 2 , m2σ = 2µ2 2 p − mσ ggg hhh hhh Note that we have two poles in the gauge boson propagator. One makes sense since it corresponds to that we have a massive gauge boson. However we have a second gauge dependent pole. This gauge dependent pole is cancelled by the π field. The propagators always come together: deged dehed A π + → gauge independent π cannot be real fields. We cannot have diagrams with external π fields. To understand our system better we introduce Unitary gauge, ξ → ∞. In this limit π drops out of the theory completely. The photon propagator becomes k→ kµ kν −i µ ν gµν − 2 (10.42) → 2 k − m2A mA ggg To understand the particle we consider the case of the particle on shell: ggg µ k→ ν 2 → − k → m2A 3 X −i s (p)∗s ν (p) k 2 − m2A s=1 µ (10.43) We started with the theory of the photon that has two degrees of freedom and a complex scalar field with two degrees of freedom. The overall number of degrees of freedom was 2 + 2 = 4. After the symmetry is broken the picture changes, the gauge boson gets a mass and has three degrees of freedom while the σ is a real scalar field and has 1 degree of freedom. So we still have 3 + 1 = 4 degrees of freedom. The gauge boson “ate” the π field and became massive. Non-Abelian Case Consider a scalar field, Φ, which is charged under some non-Abelian gauge symmetry, G. In other words that Φ → Φ0 = ΛΦ (10.44) 126 CHAPTER 10. BROKEN SYMMETRIES a where Λ† Λ = 1 and Λ = eiαa (x)T (a = 1...Ng ) with Ng = dim (ajd(G)). Our Lagrangian becomes 1 a a,µν F + |Dµ Φ|2 − V (Φ) (10.45) L = − Fµν 4 where Dµ = ∂µ − igAaµ T a . The vacuum state (Φ0 ) is where δV =0 (10.46) δΦi Φ0 a a If Φ0 is a solution then Φα = eiα T Φ0 is also a solution. There are two possibilities. Either we have T b Φ0 6= 0 or T a Φ0 = 0. The first case corresponds to the broken generators while the second corresponds to unbroken generators. We Nb such that b = 1, ..., Nb then the vacuum manifold is an Nb -dimensional space. Each broken generator gives a Goldstone boson. Just as we did last time parametrize our field non-linearly as Φ = eiΠ b T b /v Φ0 [(σ − fields)] (10.47) The gauge transformation gives π b (x) → π b + αb (x), and σ → σ. Expanding about the vacuum, Φ0 gives a term |Dµ Φ|2 → g 2 Nb X Aaµ Ab,µ (T a Φ0 )† (T b Φ0 ) (10.48) a,b=1 Note that the sum is over broken generators since the unbroken ones gives 0. We now examine this term more closely. Claim: T a Φ0 (where a runs over unbroken generators) 2V 2 with zero eigenvalue. Proof: Suppose are eigenvectors of the mass matrix Mi,j ≡ δΦδi δΦ j Φ0 that Φ → eiαa Ta Φ is a symmetry. Then if we consider infinitesimal α such that Φ → Φ + δΦ δΦi = αa (T a Φ) Since V must be invariant under this transformation (by assumption) then δV = δV δΦi = 0 δΦi which implies that αa (T a Φ)i δV =0 δΦi Taking the derivative of both sides gives: αa δ δV δ2V (T a Φ)i + αa (T a Φ)i =0 δΦj δΦi δΦi δΦj 10.5. SSB OF GAUGE SYMMETRIES (“THE HIGG’S MECHANISM”) Plugging in Φ = δV Φ0 δΦ i 127 gives Φ0 δ 2 V (T Φ0 )i δΦi δΦj Φ0 a (10.49) as required. The Mass matrix, M i,j is Hermitian, so eigenvectors can be chosen to be orthonormal: (10.50) (T a Φ0 )† T b Φ0 ∝ v 2 δ ab P which implies that g 2 v 2 b Abµ Aa,µ is a mass term for the gauge bosons. The number of massive gauge bosons is equal to the number of broken generators. We don’t have any Goldstone bosons left (all have been “eaten”). The masses are m2 ∝ |T a Φ0 |2 . As an examplewe consider SU (2). We define Φ is a fundamental “spin-1/2” rep 2 Φ1 resentation, Φ = . The potential is given by V = −µ2 Φ† Φ + λ Φ† Φ where Φ2 2 2 † Φ Φ = |Φ1 | + |Φ2 | . The Lagrangian is invariant under Φ → ΛΦ where Λα = e−iα·σ/2 . The vacuum is given by 0√ Φα = Λα (10.51) v/ 2 2 where v 2 = µλ . We have σ a Φα 6= 0 (10.52) for a = 1, 2, 3 so we have three broken generators and hence 3 Goldstone bosons. Our three gauge bosons are 2 2 σa 0 =v √ (10.53) 2 v/ 2 8 2 2 for all a = 1, 2, 3 which implies that m2A g 4v . This is very similar to the situation in the Standard Model, except in that case we have an extra U (1) symmetry. In the Standard Model we see weak interactions which give masses of mW ∼ 80GeV, mZ = 90GeV and only left handed fermions feel the weak interaction. The Lagrangian is given by L = iψL† ∂µ − igWµa ta σ̄ µ ψL + iψR† ∂µ σ µ ψR (10.54) ψL,1 where ψL = . ψL,2 † The naive guess for the mass terms (the Dirac mass terms) is m ψL,a ψR + ψR† ψL,a . However this is not gauge invariant!We need the Higg’s mechanism to fix this problem. ∗ H We have H = . The vacuum is given by H0 v hH 0 i = √ 2 (10.55) 128 CHAPTER 10. BROKEN SYMMETRIES which implies that mW = gv √ . 2 The Yukawa couplings are given by λv † ψR + ψR† ψL,1 + ... LY uk = λ ψL† HψR + ψR† H † ψL = √ ψL,1 2 |{z} (10.56) mF so but adding the Yukawa couplings to the Higgs field and give it a VEV we immediately get fermion masses. Note that to derive this formula we require the transformation 0 a a H = eiπ t /v √v where σ is the physical Higg’s boson. +σ 2 Bibliography [Griffith(2008)] D.J. Griffith. Introduction to Elementary Particles. Wiley-VCH, 2nd edition, 2008. [Peskin and Schroeder(1995)] M. Peskin and D. Schroeder. An Introduction to Quantum Field Theory. West View Press, 1995. 129
© Copyright 2026 Paperzz