On the Central Limit Theorem for the Prime Divisor Function Author(s): Patrick Billingsley Source: The American Mathematical Monthly, Vol. 76, No. 2 (Feb., 1969), pp. 132-139 Published by: Mathematical Association of America Stable URL: http://www.jstor.org/stable/2317259 Accessed: 01-05-2017 15:22 UTC JSTOR is a not-for-profit service that helps scholars, researchers, and students discover, use, and build upon a wide range of content in a trusted digital archive. We use information technology and tools to increase productivity and facilitate new forms of scholarship. For more information about JSTOR, please contact [email protected]. Your use of the JSTOR archive indicates your acceptance of the Terms & Conditions of Use, available at http://about.jstor.org/terms Mathematical Association of America is collaborating with JSTOR to digitize, preserve and extend access to The American Mathematical Monthly This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms 19691 CENTRAL LIMIT THEOREM FOR PRIME DIVISOR FUNCTION 133 THE HARDY-RAMANUJAN THEOREM. If gm,, goes to infinity, no matter how slowly, then (2) D {m: v(M) - log log m } (log log m) g2 If a set of density 1 is regarded as containing "practically all" integers, and if we take gm to be, say, (log log m)1, so that gm(log log m)' is for large m very small in comparison with log log m, then (2) does say that "practically all" integers m have about log log m prime divisors. Thus an integer in the neighborhood of 108 will usually have about log log 108 - 3 prime divisors, and an integer in the neighborhood of 1070 will usually have about log log 1070 5 prime divisors-remarkably few. (See [6] p. 358, where v(m) is regarded as a measure of the "roundness" of m.) In 1934, Turan [10] gave a greatly simplified proof of the Hardy-Ramnanujan Theorem by an essentially probabilistic method. Further development of probabilistic ideas in number theory led Erdos and Kac [1 and 2] to conjecture and to prove in 1939 a remarkable refinement of (2): THE ERD6S-KAC THEOREM. If x_y, then (3) D' x _("t) - log log M 1 , (3) Djm:x < (log (lgoi) e-u'du. log nt)<Y -\/2gr This sharpening of (2) (we shall show later that (3) does imply (2)) shows how v(m) fluctuates about the central value log log m. For example, for -x = y =.9 the integral above is about .6, which is thus the approximate prop of m for which the ratio in (3) lies in the interval [-.9, +.9]. If m is near 1070, this ratio is approximately (v(m) -5)/51, which lies in [-.9, +.9] if and only if v(m) lies in [5-.9X5i, 5+.9X51]- [3, 7]. Thus something like 60 percent of the integers in the vicinity of 1070 have between 3 and 7 prime divisors. (Since the abnormality of a finite stretch of integers can have no effect on the density (3), there underlies a computation of this kind the premise that the integers near 1070 are not atypical in their divisibility properties.) Erdos and Kac in their original proof of (3) used difficult sieve methods. The probabilistically most natural approach to the result is that of Halberstam [4], which uses the method of moments (an idea first suggested by Kac [7]). The purpose of this paper is to show hiow, by the introduction of further probability to avoid some heavy calculatioils, Halberstam's proof can be rnade more transparent to the student of probability theory. Preliminaries. On the space of positive integers, let Pn be the probability measure that places mass 1/n at each of 1, 2, . , n, so that among the first n positive integers the proportion that are contained in a given set A is just P,(A) and hence (1) is the same thing as (4) D(A) = lim P?(A). n-*c This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms 134 CENTRAL LIMIT THEOREM FOR PRIME DIVISOR FUNCTION [February Thus the Erdo3s-Kac Theorem states that ( V(m - log log m (5) liM Pn M? () 3 < x} = (D) n-- (log log m) I where (6) (x) 1 er udu =- J 'dg and the Hardy-Ramanujain Theorem can be similarly recast. To see why (5) might. be true, observe first that (7) v(m) - E MI m) p where 6p(m) is 1 or 0 according as the prime p divides m or not. For a positive integer a, the number of multiples of a not exceeding n is [n/a], the integral part of n/a, so that (8) Pn{m: al m} Lil which is nearly 1/a for large n. If pi, , pk all i if and only if Ilipi m, so that the inter has Pa-measure n-1 [n/Hipi], wlhich, for lar of their individual Pn-measures. Thus, if m is chosen at random from 1 to n--according to P, that is-and if n is large, then the random variables 5,*,(m) are approximately independent, and there is som that the sum (7) will obey the central limit theorem when properly normalized. It is this sort of statistical plausibility argument that led originally to the surmise that (5) might be true. In the proof of the Erdos-Kac Tlheorem, we shall use from number theory (in addition to the fundamental theorem of arithmetic) only the estimate I (9) E-=loglogx+0(1); see [6; p. 351], for example. From probability theory, we shall use the facts embodied in the following four remarks, proofs of which can be found in Feller's book [31 and elsewhere. REMARK 1. If a random variable Dn converges in probability to 0, which will be true in particular if E{ | } -*0, then a second random variable Un (on the same probability space) has a given limiting distribution (say 4 as defined by (6)) if and only if Un+Dn does [3; p. 247]. If Dn converges in probability to 0 and the distribution of UX converges to 1, then DnUlJ converges in probability to 0 (since P{ Dn,Un| > j is at most P{ DThJ >e/x} +P{ I U,I >x}, i limit superior is at most 2(1 -'b(x)); let x--*o). If An converges in probability This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms 19691 CENTNrA.,L LIMITT LrmOREMNI FORl PRIKMFE DIVISOR FU'NCTION 135 to 1 and Bn to 0, then Un has limiting distribution 4) if and only if A,Un+Bn does (compare each with An U, = Un + (A n-1) U,) . REMARK 2. Since 4) is determined by its moments (10) gr =fJxrd f(x), if distribution functions Fn satisfy ft x dFn(x)->fir for r=1, 2, . . , then F.(x)->4) (x) for each x [3; p. 262], which is the basis of the method of momen REMARK 3. If Fn(x)-4)(x) for each x, and if f .I xI r+EdF,(x) is bou in n1 for some positive E, then f-,,0 xr dF (x)->ur [3; p. 245]. REMARK 4. If U1, U2, * * - are independent, uniformly bounded random variables with mean 0 and finite variances O2, and if ff2 diverges, then the distribution of Zt=l U/(= 2 converges to 4) [3; p. 258], a special case of the central limit theorem. From Remark 1 it follows that, if (5) holds for all x and g,-* oo, then (v(m) -log log m)g;'(log log m)-1, regarded as a random variable under the probability measure Pn, converges in probability to 0 as n-* oo, which implies (2). Thus the Erdds-Kac Theorem contains the Hardy-Ramanujan Theorem. Remark 1, together with the fact that log log m increases very slowly, can also be used to cast (5) in a more convenient form, namely, (11) lim p { v(M) - log logn < } n o-~ co (log log n) The equivalence of (5) and (11) will follow if we show that, for each positive E, (12) lim Pn {r4 | log log mr-log log n > n-+ co (log, log n)* If n <m_n and the inequality in (12) holds, then log log ni < log log n - e(log log n) *, which implies log log n<e-2 log2 2. Therefore, for all n exceeding some no(6), the probability in (12) is not greater than P, { m: m_nl }, which certainly goes to 0. Thus the Erdos-Kac Theorem is equivalent to (11), and this is the form in which we shall prove it. Proof: first part. The heuristic ideas favoring the Erdos-Kac Theorem (see (7)) figure in its proof as well. We shall compare the behavior of the 6,(m that of independent random variables Xp (on some probability space, one variable for each prime p) satisfying 1 1 P P (13) P{X, 11 = -I P{Xp. = 0l = 1 - This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms 136 CENTRAL LIMIT THEOREM FOR PRIME DIVISOR FUNCTION [February The point of the heuristic argument is that the P.-measure of { m: B (M) -1, i=1, . *, k} converges to P{Xi=1, i=1, * *, kl if the pi are distinc Comparing the 3, with the X. indicates also where the norming constants in (11) come from: If m<n, then no p actuially contributing to the sum (7) can exceed n, so the distributions under P. of 1'(m) and of 5 b,(m) coincide; the corresponding sum p5n Xr has mean i<n p-1 and variance "s< p-' (1-p-'), each of order log log n by (9). As the first step in the proof of (1 1), we shall show that it is unaffected if we still further restrict the range of p in (7), replacing v(m) by (14) vn(rn) = Z ap(m) where { a } is a seqtuence so chosen that (15) ,= O(it) for each positive e and 1 (16) - - o(log log I)'. an<p? n P The requirements (15) and (16) are met for examnple by a,- =1/log log n. this sequence goes to infinity slowly enough for (15), but because of (9), quickly enough for (16). For a function f of positive integers, let I n (17) Er{f} = - Ef(m) n m=l denote its expected value computed with respect to Pn. By (8), En {Z } = p? P4m: 6p() = 1 p>an, p>an an<<p:Sn P and it follows by (16) and Remark 1 that (11) holds for all x if and only if (18) lim Pn,{: ?x} nr --+ oo(log log n) =q(x) does. We shall compare (14) with the corresponding partial sum (19) Sn = E X,p P'a, of the independent X, introduced above. By This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms 19691 CENTRAL LIMIT THEOREM FOR PRIME DIVISOR FUNCTION 137 (20) =ex, Z E -J Sn -1 2 1(1 P<?a P p<an" p p which are the mean and variance of Sn, are each log log n+o(log log n)2, SO that (18) is the same thing as ( (rnz) - c,, ' (2 t) lim Pn jIm: _ c - b(x) n -+oo S, Proof: second part. Since they depend only on Remark 1, the reductions thus far (from (3) to (5) to (11) to (18) to (21)) present no technical difficulty, although it is by no means trivial that such steps as the truncation in (14)which figures in the original proof of Erdos and Kac as well as in that of Halberstam-do advance the solution of the problem. It remains to prove (21), which is the hard part. Now (21) will follow by the method of moiments (Remark 2) if we prove that, for r=1, 2, (22) EnG(Vn -Crl,,r converges to Actr (defined by (10)) as n-> o. To pr we shall first show that its difference with (23) El (Sn - C7)/sn} converges to 0 (it is here we make rigorous the lheuristic argunment) and then sliow that (23) itself converges to p,. By the multinomial theorem and the definition (19), E{ S} is the sum r ri ri rr (24) E ' - - E" El{ XPi .. XPU}X V-1 ri! .. r. * where ' extends over those u-tuples (rl, , r,u) of positive integers satisfying ri+ - - +r,,=r and " extends over those u-tuples (pi, * *, p) of primes satisfying Pi< < p <pu?<an. (We interpret a sumi as 0 if its range is empty.) Since Xp assumes only the values 0 and 1, from the independence of the Xp and the fact that the pi are distinct it follows that the summand in (24) is (25) E{XP1 * Xpu By the definition (14), En{z4J is just (24) with the summand replaced by En{ ta'l * 5u }. Since bp(m) assumes only the values 0 and 1, from (8) and the fact that the pi are distinct it follows that this summand is (26) Enla= [1 - P. = E . n -pi . .. p, This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms 138 CENTRAL LIMIT THEOREM FOR PRIME DIVISOR FUNC'i'ION [February But (25) and (26) differ by at most 1/n, and hence EtS } differ by more than the sum (24) with the summand replaced by 1/n. It now follows by the multinomial theorem that 1 anV (27) F{ S} - F" { v _- 1 - <n $n n an inequality valid for r =0, 1, * Now r k El(n- Cn k=O and En { k (Vn - )rI E Cn)r } term for term and applying (27) we see that k I El (Sn - C) - En { (vn - CO) }I I < (r) n r-k = a+ CO) k=O it n in Since (a?n+Cn)r/n-->0, as follows by (15) and th ence between (22) and (23) does go to 0. There remains only the purely probabilistic problem of showing that (2 converges to ,Ur. Although this can be established by the sort of calculation in the last century to prove the central limit theorem, here we shall deduce i from the central limit theorem. By Remark 4, the distribution of (Sn -Cn) converges to (. That (23) converges to A, will therefore follow from Remar if we show that the moments (23) are bounded in n when r is increased to next larger even integer. We shall in fact show that (28) sup I El (Sn - Cn)7/sn} | < X for every r. Put Yp=Xp-1/p. By the multinomial theorem anld the indepen the Yp, (29) E { (Sn )r} = E r! - E{'YEl} * ... El Y. " I U==1 ril r* Pi Pu where ' and E" have the same ranges they have in (24). Since E (29) still holds if we require in E' that each ri exceed 1. Since I Y implies I E{ Yri }I I E { y2 , so that the inner sum in (29) has modulu E,{ Yp} ... Et YP} < [ El EtVpI Sn p M; yan But if rl, - *, ru add to r and each is at least 2, then 2u <r. For n large enough that Sn > 1, (29) now implies This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms 19691 CENTRAL LIMIT THEOREM FOR PRlME DIVISOR FUNCTION 139 r rlt E(Sn - c.)J ?s, EZ' from which (28) follows. This completes the proof of the Erdds-Kac Theorem in the form (11). We showed that replaclng v by P. as defined by (14) has no effect on (11), and we proved the modlified (11) by the method of moments, showing that the moments of v,n (normalized) are near those of the corresponding sum S.n defined by (19) and that the latter moments converge to the p,.. We never did analyze the moments of v itself, although it is not hard to go on and do so (see [4]). It is easy to show that the Erdds-Kac and Hardy-Ramanujan theorems hold also if each prime divisor is counted according to its multiplicity: Let a;(m) be the exponent of p in the prime factorization of m =ll p6v'(m) and define V'(m) St O,(m). For k41, Q(m)-a(m) Ek if and only if pk+l jm, an event which by (8) has P.-measure at most 1/pk+I; hence Er { a a- EtI t { m 8p(m)-bp(m)?k} ?2/p2, which impliesE.{v'-V} =0(1). It follows by Remark I that (11) persists if r(m) is replaced by V(m), and by the same arguments as before we can successively deduce (5), (3), and (2) with 9l(m) in place of (4 The arguments here suffice with little change, as in [4], for the central lim theorem for a completely additive f for which f(p) is bounded and Ef2P diverges. For an introduction to probability methods in number theory, see 81; for a comprehensive account, see [9 This work supported by NSF GP-6562 and Nonr. 2121(23), NR 342-043. References 1. P. Erdds and M. Kac, On the Gausslan law of errors ia the theory of additive functions, Proc. Nat. Acad. Scd. U.S.A., 25 (1939) 206-207. 2. , The Gaussian law of errors in the theory of additive number theoretic functions, Amer. J. Math., 62 (1940) 738-742. 3. William Feller, An Introduction to Probability Theory and Its Applications, vol. I IV Wiley, New York, 1966. 4. H. Halberstam, On the distribution of additive number-theoretic functions, J. London Math. Soc., 30 (1955) 43-53. 5. G. H. Hardy and S. Ramanujan, The normal number of prime factors of a number n, Quarterly J. Math., 48 (1920) 76-92. 6. G. H. Hardy and E. M. Wright, An Introduction to the Theory of Numbers, 4th ed., Clarendon Press, Oxford, 1960. 7. M. Kac, Probability methods in some problems of analysis and number theory, Bull. Amer. Math. Soc., 55 (1949) 641-665. 8 - , Statistical Independence in Probability, Analysis and Number Theory, Carus Math. Monogr. 12. Wiley, New York, 1959. 9. J. Kubilius, Probabilistic Methods in the Theory of Numbers, 2nd ed. (1962). Vilna: Gosudarstv. Izdat. Politi. i Nau6n. Lit. Litovsk. SSR. (English translation 1964. Amer. Math. Soc. Transl. of Math. Monographs, Volume 11.) 10. PaulTur4n,Onatheoremof HardyandRamanujan, J. London Math. Soc.,9 (1934)274-276. This content downloaded from 132.229.172.235 on Mon, 01 May 2017 15:22:40 UTC All use subject to http://about.jstor.org/terms
© Copyright 2026 Paperzz