Introduction To Analytic and Probabilistic Number Theory
Introduction To Analytic and Probabilistic Number Theory
INTRODUCTION TO
ANALYTIC AND PROBABILISTIC
NUMBER THEORY
Already published
1 W.M.L. Holcombe Algebraic automata theory
2 K. Petersen Ergodic theory
3 P.T. Johnstone Stone spaces
4 W.H. Schikhof Ultrametric calculus
5 J.-P. Kahane Some random series of functions, 2nd edition
6 H. Cohn Introduction to the construction of class fields
7 J. Lambek & P.J. Scott Introduction to higher-order categorical logic
8 H. Matsumura Commutative ring theory
9 C.B. Thomas Characteristic classes and the cohomology of finite groups
10 M. Aschbacher Finite group theory
11 J.L. Alperin Local representation theory
12 P. Koosis The logarithmic integral I
13 A. Pietsch Eigenvalues and s-numbers
14 S.J. Patterson An introduction to the theory of the Riemann
zeta-function
15 H.J. Baues Algebraic homotopy
16 V.S. Varadarajan Introduction to harmonic analysis on semisimple
Lie groups
17 W. Dicks & M. Dunwoody Groups acting on graphs
18 L.J. Corwin & F.P. Greenleaf Representations of nilpotent Lie groups
and their applications
19 R. Fritsch & R. Piccinini Cellular structures in topology
20 H Klingen Introductory lectures on Siegel modular forms
21 P. Koosis The logarithmic integral II
22 M.J. Collins Representations and characters of finite groups
24 H. Kunita Stochastic flows and stochastic differential equations
25 P. Wojtaszczyk Banach spaces for analysts
26 J.E. Gilbert & M.A.M. Murray Clifford algebras and Dirac operators
in harmonic analysis
27 A. Frohlich & M.J. Taylor Algebraic number theory
28 K. Goebel & W.A. Kirk Topics in metric fixed point theory
29 J.F. Humphreys Reflection groups and Coxeter groups
30 D.J. Benson Representations and cohomology I
31 D.J. Benson Representations and cohomology II
32 C. Allday & V. Puppe Cohomological methods in transformation groups
33 C. Soule et al Lectures on Arakelov geometry
34 A. Ambrosetti & G. Prodi A primer of nonlinear analysis
35 J. Palis & F. Takens Hyperbolicity and sensitive chaotic dynamics at
homoclinic bifurcations
36 M. Auslander, I. Reiten & S. Smalo Representation theory of Artin algebras
37 Y. Meyer Wavelets and operators
38 C. Weibel An introduction to homological algebra
39 W. Bruns & J. Herzog Cohen-Macaulay rings
40 V. Snaith Explicit Brauer induction
42 E.B. Davies Spectral theory and differential operators
43 J. Diestel, H. Jarchow & A. Tonge Absolutely summing operators
44 P. Mattila Geometry of sets and measures in Euclidean spaces
45 R. Pinsky Positive harmonic functions and diffusion
Introduction to
Analytic and Probabilistic
umber Theory
Gerald Tenenbaum
'rofessor at Universite Henri Poincare-Nancy J
AMBRIDGE
UNIVERSITY PRESS
Published by the Press Syndicate of the University of Cambridge
The Pitt Building, Trumpington Street, Cambridge CB2 1RP
40 West 20th Street, New York, NY 10011-4211, USA
10 Stamford Road, Oakleigh, Victoria 3166, Australia
TAG
A Catherine Jablon,
The following notation and conventions are used systematically in the text.
We use Z, Q, R, C in their usual meaning, and N to denote the set of non-
negative integers.
The letter p, with or without subscript, denotes a prime number.
alb means: a divides b; p' a means: pva but pv +1 t a; alb means: pl a
P+ (n) (resp. P— (n)) denotes the largest (resp. smallest) prime factor of the
integer n> 1. By convention p+(1) =1, P— (1) =- co.
The integer and fractional parts of the real number x are respectively written
[x] and {x}. We write 114 := minnEz X — nl-
When the letter s denotes a complex number, we implicitly define the real
numbers a and T by the relation s =- a + iT
We write x+ := max(x, 0), and we put
e(x) := exp{27rix} (x E R), log + x := max{0, log x} (x > 0).
Furthermore, we denote by log k the k-fold iterated logarithm.
We use interchangeably Landau's notation f = 0(g) and Vinogradov's
f < g to both mean that I fl < CI91 for some positive constant C, which may
be absolute or depend upon various parameters, in which case these may be
indicated in subscript. Moreover, we write f x g to indicate that f < g and
g < f hold simultaneously. We draw the reader's attention to the fact that we
have therefore extended the common use of these symbols to complex-valued
functions.
We denote the number of elements of a finite set A either by card A or
by A.
We list below page numbers where various notations in the body of the text
are introduced.
br (x) 5 SA 272 0-a, 0-c 109
B, , Br() 5 5(n) 25 ak(n) 24
dA 270 ((s) 17 T (n) 24
j (n) 28 ((s, y) 358 T (n, 0) 148
k(n) 54 A(n) 55 co(n) 24
N(T) 152 A(n) 24 (I) (x , y) 59
N (x , y) 116 p(n) 24 X(n), xo(n) 251
Pi (n) 312 VN 271 '0(x) 31
pp 299 - (s) 151 ,Cx; a, q) 253
S(A,P,y) 58 7(x) 9 IF (x , y) 358
vp (n) 13 7(x; a, q) 253 w(n), C2(n) 23
1(n) 27 p(u) 365 Q± 80
This page intentionally left blan
Part I
Elementary methods
This page intentionally left blan
LO
Some tools from real analysis
N -1
A n bn — A n bn ±i
n=1 n=1 n=1
N -1
A n (bn — b n +i) ± ANbN.
n=1
In the setting of the Stieltjes integral, Abel summation takes the innocuous
form of partial integration, and therefore it is sometimes refered to as partial
summation. It constitutes a simple but effective tool for handling arithmetical
sums. The reader can find the essential notions concerning the Stieltjes integral
in Chapter 1 of Widder's book (1946).
Theorem 1. Let fa rd 1 be a sequence of complex numbers. Set
A ( t) = an (t > 0).
n<t
Let b(t) be a continuously differentiable function on the interval [1, x]. Then we
have
E an,b(n,) = A(x)b(x) — f x A(t) b' (t)dt .
I
1<n<x
Proof. Introduce the Stieltjes integral of f with respect to the measure 0].
Then
bb bb b
IL
b
E f(n) — f (t) dt = f f (t) d[t] —
fa f (t) dt = —L f (t) d{t} .
a<n<b
[—M
b lab
AO] a ± {t} df(t) =
f b
{t} df (t).
(3)
L i
b(x)
br (x)dx =- 0
r br-i(x)
(r > 1).
(r > 1),
00
E br (x) —
Yrx= Y eY
71 e-
Y 1
r=0
bo ( x ) = 1 b3 (X) =- X 3 — X 2 ±
One then defines the rth Bernoulli function B r (x) as the periodic function
of period 1 which coincides with br on [0, 1[. Set
B r := Br (0).
Br is the rth Bernoulli number. It is easy to see that B 2r±i = 0 for r> 1. One
has the numerical values
r 0 1 2 4 6 8 10
Br 1 _1 1 _1 1 1 5
2 6 30 42 30 66
b
E
b
f (n) = I f (t) 0] = I f (t) dt - f
a a a
b
f (t) dI3 1 (t) .
a<n<b
6 1.0 Some tools from real analysis
We can then calculate the integral with respect to B2 (t) by a new partial
integration, involving B3(t). Iterating the process, we obtain in this way the
following famous theorem.
Theorem 4 (Euler Maclaurin summation formula). For any integer
-
r=2
r - j]
t -k-l Bk(t)dt (k > 1)
which can be used to calculate - y numerically: one subtracts the above expres-
sion from the expansion for Enrr, 1 1/m and optimises k as a function of n. We
have ry c.--, 0.577215663.
Exercises
712
Wn =-(COS trdt
13
n Wn =- (n — 1)Wn-2 (n2).
§ 1.1 Introduction
Addition and multiplication equip the set of positive natural numbers
{1, 2, 3, ...} with a double structure of Abelian semi-group. The first is as-
sociated with a total order relation, and is generated by the single number 1.
The second, reflecting the partial order of divisibility, has an infinite number of
generators: the prime numbers. Defined since antiquity, this key concept has yet
to deliver up all its secrets and there are plenty of them. The central position
of prime number theory in arithmetic is amply justified by the following result,
the proof of which we sketch, using Euclid's first theorem, in Exercises 1 to 4.
Theorem 1 (Fundamental theorem of arithmetic). Each natural number
> 1 can be decomposed uniquely, up to the order of the factors, as a product
of prime numbers.
Euclid's second theorem asserts the infinity of the set of prime numbers.
It is an immediate consequence of the fundamental theorem of arithmetic: if
Pi = 2, /92 = 3, ... , pn are the n first primes, then the integer
n
N=
For more than 23 centuries, mathematicians have been concerned with provid-
ing quantitative versions of this qualitative relation. One of the aims of this
work is to describe the various methods which they have invented and imple-
mented to achieve this.
The proof given above of Euclid's second theorem is too simple to be inef-
fective. Indeed we have
n
Pn+1 __ 1 ± fi
pj
j=i
10 1.1 Prime numbers
{el + o(1)} x < 71- (x) < {c2 + o(1)} x (x —> cx)),
10 gx log x
with c1 = log(2 1 / 2 3 1 / 3 5 1 / 5 30 -1 / 3° )‘---- 0.92129, and c2 = c1 ;----' 1.10555.
We shall prove by a simple method the following result, which implies a
slightly weaker version of Bertrand's postulate: for each 6 > 0, there exists
some no = no (E) such that each interval In, (2 + E)ni, n > no , contains at least
one prime—for a complete proof of Bertrand's postulate, see Exercise 10.
1.2 Chebyshev's estimates 11
8 log2 n 1 n .
(log 2) n < 7(n) 5_ { log 4+
log n log n I log n
Proof. The upper bound is an easy consequence of the following classical result.
Theorem 4. For n > 1, we have
Hp
p<n
Indeed, accepting this result for the moment, we have for all t, 1 <t < n,
n log 4
7(n) < + t.
— log t
p<n p<n-1
where the last inequality arises from the fact that the coefficient ( 2m+1
m ) equals
(2:117) and thus appears twice in the binomial expansion of (1 + 1) 2171+ 1 . By
the inductive assumption applied to m + 1 < n we then have
11P = IT P
p<n p<m±1 m+1<p<2m+1
H p < 4m+1 4m = 4n,
dn —
P<T
Pv < JJ n = n7r (n) ,
p<n
which is equivalent to the stated inequality. The desired result is now implied
by the following theorem.
Theorem 5 (Nair). For n > 7, we have dn > 2.
Proof. The essential idea introduced by Nair consists in considering the integral
On the other hand /(m, n) is "small". Its actual value may easily be calculated
by noting that for all y, 0 < y <
( Tri
n 11) 1
y m,-., 1- (rn,n)
m=1 J. (1 - x xy) -l dx = -
so that
/(m,,n) = 1/n( m
n 11 ) 1/m ( n ) 7 (1 <m<n).
2n (2n+1 2n
n( n )1 a2n1 a2n+1/ and (n + 1) = (2n + 1) d--2 n+ 1.
n
1.3 p-adic valuation of n! 13
and finally, since ( 2:) is the largest of the (2n+1) binomial coefficients occurring
in the expansion of (1 + 1) 2n, that
d2n-pi > n 4n
We deduce that
2.4n . 22n+1
d2n+1 >
and
d2n+2 > d2n+i > 471+1
which proves the stated inequality d n > 2n for all n > 9. It can be easily
checked that it also holds for n = 7 and n = 8: d7 = 420, d8 = 840.
vp (n!) = ( m) =E 1= E 1.
m<n m<n 1<k<vp (m) k=1 m<n,vp (m)>k
The inner sum equals the number of integers m < n which are divisible by pk •
It therefore has value [n/pk ], which provides the desired expression.
Corollary 6.1. For each prime p, we have
n Ti n
— — 1 < v p (n!) 5_ — + (n >1).
P P P(P — l)
This readily follows from Theorem 6 and the bounds x —1 < [x] < x, valid
for all real x.
14 1.1 Prime numbers
\----, log p _
log x +OM.
p<x P
Moreover, the term OM arising in this formula lies in the open interval
] — 1 — log 4, log 4[.
NB. log 4 ,:-, 1.38629.
Proof. We evaluate the quantity log(n!) in two different ways for n = [x].
On the one hand we have from Corollary 0.2.1 that
logp + E log p
log n! <n \---‘ n
L' P p(p — 1)
p<n
p<n
and
log n! > 11 E log pP
E
log p.
p <n P<ri
By Theorem 4, the last p-sum does not exceed n log 4. We thus infer
log p
n log 4 < n log n — n + (1 + log n) <n log n
P
P<n
00
\---, log p log m
Z---i p(p — 1) m=
m(m — 1)
p<n
00 00
<E rlog2
m(m — 1)
rlog2
2r
= log 4,
r=1 2r -1 < rn <2r r=1
log p
+nlog4 >nlogn—n+1
p<n P
and
E log p > log n + —1 — (1 + log 4) > log x — (1 + log 4).
P n
p<x
1
and
p<x p<x P
Proof. With the given expression for c o , we see that the stated formula holds
with
11
0< 0(x) = 2(x — i)E {
log( 1
p>x PJ
00
1 k< \--, 2(x — 1)
= 2(x — 1) EE
p>x k=2
-
k
P L--1 2p(p — 1)
p>x
<
E (x _1) = x—1
n(n — 1) N — 1
n>x
where N is the smallest integer > x. This gives the required estimate.
Theorem 9. There exists a constant c1 such that, for x > 2, one has that
E _1 . log 2 x ± el + 0 ( log1 x ) •
p<x P
log p
R(t) := E log t = 0(1).
p<t P
fx 1 log p 1 r dt fx dR(t)
E
p<x
p = j2 _ log t {
p<t P J 12 tlogt
+
2_ logt
R(x) R(2—)f x R(t)
= log2 x — log2 2 + ± 2dt
log x log 2 2 t(log t) '
Ho-
p<x
__1=
e — (co +ci)
log x
{1+0( 1 )}.
l og X
This is an immediate corollary of Theorems 8 and 9.
H (i
P)
= e—')(
log x
{1+0( log
1 x)1 (x > 2).
p<x
1 - 0-
((1 + a) := -
n=1
1
((1 + a- = — + 0 (1)
) (a > 0).
a
Moreover, we have
since the product over p equals the sum E ric° 1 En n - 1 - 0- where e n equals 1 if
all the prime factors of n are < x and is 0 otherwise. Letting x tend to infinity,
we obtain the famous formula of Euler
0 ± a) _
H(1 p l_cr ) 1 .
-1 a ) p—i—cr} .
f(a) = log ((1 + a) — -
P
18 1.1 Prime numbers
Since the general term is positive and bounded above by 1/p(p - 1), the series
f (a) converges uniformly for a > 0 and in particular the sum is continuous at
a = 0. Hence
lir% f (o- ) = f (0) = co .
We shall transform the two terms of the sum f (o-). On the one hand
log ((1 + o-) = log (1/a + 0(1)) = log (1/o-) + 0(a) = log (1 e _, ) + 0(a)
1
00 00
- an n-1 + 0(u) = I e't dH (t) + 0(u)
n=1 o
where we have set
H (t) := 1/n.
1<n<t
=i
1
00 00
u'dP(u) = a ] u -1 ' P(u) du
1
00
= a- fo e-crtP (e t ) dt.
We hence obtain
x
7(x) r,-, c
log x
7(x) 71(x)
lim inf <1 < lim sup .
x—.Do x/ log x x-,0,0 x/ log x
71(x)
f := lim inf
x—,00 x/ log x •
For each e> 0, there exists some x o = xo(e) > 2 such that
t
71(t) > (f — e) (t xo(E))•
log t'
x
x thr(t) _ 7(x) 7(x0)
X0
+ IX0
71(t)t -2 dt
i dt
> (f — e) log2 x + 0,(1).
xo t log t —
I
By Theorem 9, it follows that t — E < 1 and hence t < 1, since 6 can be chosen
arbitrarily small.
20 1.1 Prime numbers
Notes
§ 1.2. The proof given here of Theorem 4 was found independently by Eras
and Kalmar in 1939.
The prime number theorem implies that ri p
_< n p < e(i ±') n for all E > 0
and n > n0 (6). It is however useful to have at one's disposal uniform upper
bounds, like those of Theorem 4, valid without any restriction on n. In this
spirit, Hanson showed in 1972 that one has
(n _>__ 2) .
p<n
For numerical bounds on 7(x) and Chebyshev functions (cf. § 3.6) see Rosser
& Schoenfeld (1962, 1975) and Schoenfeld (1976). For example, we have
x ( 1 ) x ( 3
1+ < (x) < 71 + (x ?_ 52).
log x 2 log x log x 2 log x i
The idea exploited by Nair (1982a) for the proof of Theorem 5 is not new:
see Gelfond (1946). Gorshkov showed that the prime number theorem cannot
be obtained by this method with polynomials in one variable and gave nu-
merical bounds for the best estimates that can be achieved in this way (cf.
Gorshov, 1956). These bounds were refined by Aparicio Bernardo (1981). The
generalisation given by Nair (1982b) is genuinely new. It provides very precise
numerical bounds and might in principle lead to a proof of the prime number
theorem.
For more classical proofs of Chebyshev-type bounds for 7r(n), see Exercises
6 to 10.
Exercises
(C) lim inf (dn 1 log n) < 1 < lim sup (dr,/ log n).
n--400 n--400
(d) For each a > 0 there exists a sequence of integers {n i , n2 ,. .}, increasing
in the weak sense, such that pni aj (j --> co).
(e) The set of rational numbers of the form p' 1 p, where p and p' are prime,
is dense in [0, +oo[.
6. Let {a n } 1 be a sequence of non-negative real numbers. Assume that
Prove Shapiro's Tauberian theorem (1950): there exist two positive constants a
and 13 such that ax < A(x) := a < 0x (x > xo).
Give precise values for a and 0. Consider the case when one only assumes
that B(x) = x log x + 0(x). [Hint: Consider A(x) — A(x12) and use the prop-
erties of the function u [u] — 2[u/2].]
7. Define A(n) : = logp, if n = pv , A(n) : = 0 where n is not a prime power.
Show that
(a) E dln A(d) = log n
(b) E d<x A(d) [x/d] = x log x — x + 0 (log x) (x 2).
8. Deduce from the two previous exercises that the function 0(x): = Eri<x A(n)
satisfies
ax <1P(x) < f3x (x > xo)
for two positive constants a and 3 which may be computed.
22 1.1 Prime numbers
11. Show that limx,00 EN/x<p<x 1/p -- log 2. Deduce that, as x —> co, a
positive proportion of integers n < x have their greatest common prime factor
> Vn. Can this statement be made more precise?
12. Evaluate E pq<x llpq, where p and q denote primes. Generalise.
13. Show that from Bertrand's postulate it is possible to deduce simply that
the number II, = Ei<rn<n 11M is not integral for any n > 1. Give another
proof of this result by considering powers of 2.
14. Show that there exist infinitely many primes of the form 4n ± 3. [Hint:
consider the integer N = 4•n! — 1.]
1.2
Arithmetic functions
§ 2.1 Definitions
An arithmetic function is a complex-valued function which is defined on
N* = {1, 2, 3, ...}. Two classes of arithmetic functions play a particularly
important role: additive functions and multiplicative functions. A function is
called additive if it satisfies
§ 2.2 Examples
The following arithmetic functions are classical and define fundamental con-
cepts attached to the multiplicative structure of n:
• The counting functions of the total number of prime factors of n, taken
with or without multiplicity, thus
Ci (n) := v, w(n) :=
Pv lin
24 1.2 Arithmetic functions
• The "number of divisors" function and the function "sum of kth powers
of divisors", traditionally denoted by
crk(n) := E dk (k E C) .
din din
Usually - 7 (n) is referred to as the divisor function and one writes o- 1 (n) = o- (n).
-
The sum and product of two formal Dirichlet series are defined in a natural
way by
oo
(4) D(f; s) D(g; s) = E h(n) n — S ,
n=1
with
f (m)g(k)(mk)'
m=1 k=1 m,k=1
>-: n
oo
= —s y: f (m)g(k).
n=1 km=n
It may be easily checked that the set of formal Dirichlet series equipped
with these two operations has the structure of a commutative ring with unity
given by the series
D(6; s) = 1,
associated with the arithmetic function
(n = 1)
6(n) := { 01
(n > 1).
26 1.2 Arithmetic functions
{ g(1) = My'
Conversely, if f(1) = 0, then (6) is not soluble for n = 1 and f is not invertible.
Theorem 3. The group g of units in the ring A consists of arithmetic functions
f such that f(1) 0.
An element it in A is prime if it is not a unit and if the relation it = U * v
implies that either u or v is a unit. One checks easily that the set of prime
elements in A strictly contains the set of functions f such that f (1) = 0 and
f (p) 0 for some prime number p—cf. Exercises 8 and 9.
Multiplicative functions are units, since they satisfy f(1) = 1 by definition.
The following result will help us to show that they actually form a subgroup
of g.
Theorem 4. A necessary and sufficient condition for a function f in A to be
multiplicative is that its associated formal series D(f; s) be expandable as an
infinite formal product of Eulerian type, that is
00
00 co
pp v )p _v9) ( l ± E Ap ilp ,) = 1
(1
v =1 v=1
00
1(d)1(n/d),
din d
from which
T = 1.*]..
We clearly have
(13) a=1*j
and hence:
a-k(n) = E dk . (1 * i k)(n)
din
,
(v . 0)
(1* t)(pv) = E to)) = { 01 (v > 1)
2=0
=
(14)
In other words,
1 (n = 1)
(15)
(n > 1).
din
j(p
(16) (1 71 ) V)
(i + E Apv)ev) = 1.
v=1
This remark can be very useful when bounds for I f (Pv ) I are needed. One applies
(16) to complex values of in a common disc of convergence of the two series
and gets the desired estimate via Cauchy's integral formula.
Theorem 8 (First Mobius inversion formula). Let f and g be arithmetic
functions. The two following properties are equivalent
Proof. Let us for instance prove the implication (i) = (ii), the converse being
similar. For x > 1 we have
By (15), the inner sum has value 6(k). This yields (ii).
Applying Theorem 9 to the case G(x) EE 1, we obtain
We shall see in Section 3.6 that (18) is equivalent to the prime number theorem.
(19) A := p, * log .
p(d) log d
that is
are important in the analytic theory of prime numbers. From Theorems 1.5
and 1.4 respectively, we deduce the inequalities
(24) 11) (X) ? [X] log 2 (x ? 7),
7(x) = 0(x) ± 0 ( x
(28)
log x (log x) 2 ) .
Proof. Formula (27) follows immediately from (25) and (26). In order to estab-
lish (28) we evaluate 0(x) by partial summation:
x x 7r (t)
0(x) = f log t thr(t) =- 7(x) log x — f t dt.
By Chebyshev's upper bound (Theorem 1.3) the integral is 0(x/ log x), giving
the stated formula.
Corollary 10.1. Let a, 0 be constants such that 0 < a < log 2, 0 > log 4.
For all sufficiently large x, we have
(29) cxx < 0(x) < 0(x) < Ox.
This follows immediately from Theorem 10 and Theorem 1.3.
32 1.2 Arithmetic functions
i<m<n
E 1 . E bt(d) -7-:1-.
m<ri di(m,n) din m<n din
m0(mod d)
(30)
Another proof of this result consists in noting that each rational fraction
with denominator n can be written in the form h/n=a/d, with din, (a, d) = 1.
For any function F of a real variable we can hence write
Applying this identity to the special case F(x) -.- 1, we obtain that
(33) n= co(d).
din
In other words, j = 1 * cio, and the desired result follows from (14).
The inclusion-exclusion principle (cf. Notes) provides an easy third proof
of Theorem 11.
Notes 33
Notes
§ 2.4. For other properties of the ring of arithmetic functions, see Shapiro
(1972).
§ 2.5. The inclusion-exclusion principle. A purely combinatorial version of the
Mobius inversion formula in its basic form 1 *p, = .5 is known as the inclusion-
exclusion principle. It can be stated thus:
Let A be a finite set of cardinality N and P = {(1), ... , (k)} be a family of
numbered properties. For each subset I of P let A(I) be the number of elements
of A which satisfy all of the properties in I. Writing S(A,P) for the number
of elements of A which do not satisfy any property in P, we have
It is easy to prove the identity (34) directly by introducing, for each element
a of A, the number m(a), 0 < m(a) < k, of properties satisfied by a. It is also
possible to argue indirectly as follows. Let 2 = p i < /92 < - • • < Pk be the k
smallest prime numbers, write
and to each a in A associate the integer F(a) = nae(i) Pi where " a E (07,
means that a satisfies property (i). Then we have
If d = II pi , we certainly have
(i)Ei
Exercises
1. Let ((s) be the Riemann zeta function, which we identify with the for-
mal Dirichlet series associated with the arithmetic function 1(n) = 1 (n > 1).
Express in terms of ((s) the Dirichlet series associated with the following arith-
metic functions: p,(n) , (n) 2 , so (n), o- (n) , r (n) , 2`-') (n) , A(n). Find the convolution
relations which correspond to these identities.
2. Show, for all n> 1, that 2' ( n) < r (n)
3. Show, for all n> 1, that 6n 2 /7 2 < a(n)p(n) < n2 .
4. Show that each integer n > 1 can be uniquely decomposed as n = qm 2 ,
where q is squarefree. Let Q(x) denote the number of squarefree integers q not
exceeding x. Show that:
(a) [x] .-- Q(x/m 2 ) (x ? 1)
m<vx
(b) Q(x) = > ,u(d)[x 1 d2 ] (x > 1)
d<vx
(c) Q(x) = 6x/7 2 + 0( \/x) (x > 1).
Generalise these considerations to "k-free" integers, that is, integers such
that pin Pk t n.
in,1
t li(mx)IT(m) < co.
where a determination of a complex square root has been chosen. Show that if
there exist four prime numbers p,q,r,s such that the 16 determinants of the
form
9i(19,q) 9k(q7s)
(i,j,k,f= 1,2)
93 (p, r) t (r 8 )
are all non-zero, then f is a prime element in A.
10. On monotone multiplicative functions. Let f : Z± —> R be a non-
decreasing multiplicative arithmetic function. For integers a > 3, t > 1, put
St := at _ Show that f (R t ) >f ( a )t > f (S t ). Deduce
that for all integers a, b, n exceeding 2 we have f (b)' < (n) < f(a)r± 2 with
r := [log n I log a], s := [log n1 log b]. Show that (a)1/ log a = /(b)1/ log b and
hence that there exists some non-negative constant k such that f (n) = 71k for
all n > 1. [This result was first proved by Eras (1946). The simple proof given
above is due to Moser e.4 Lambek (1953).]
1.3
Average orders
§ 3.1 Introduction
We define an average order of an arithmetic function f to be any elementary
function g of a real variable such that
A given function f may possess several average orders. In general one seeks a
function g which is simply expressible in terms of elementary functions, and
whose asymptotic behaviour is easy to determine. Of course, given the choice
between several candidates, it is preferable to use the one which minimises the
error term in the asymptotic relation (1).
Arithmetic functions often have very erratic behaviour, which is difficult to
analyse, even with the aid of very extensive numerical tables. For example this
is the case for the divisor function T (n), which oscillates between the value 2
(attained at each prime number) and occasional very large values see Chap-
ter 5. The averaging procedure often has a substantial regularising effect by
masking atypical values which occur only rarely. Normally, and in particular
when f takes only positive values, the average order is the easiest non-trivial
piece of information to determine for an arithmetic function.
E
md<x
f(m)g(d) =j
md<x, d<y
f (m)g(d) + >
md<x, d>y
f (m)g(d)
d<y
Expanding the last term, this immediately implies the stated result.
As noted above this result yields more precise information on the average
behaviour of 7(n).
log x + -y 0 (1 / Vx)
The name hyperbola method comes from the following geometric interpreta-
tion of Dirichlet's problem.
In the (u, v)-plane, let us consider the rectangular hyperbola uv = x.
38 1.3 Average orders
Let us write
A (x) := >-:
n<x
T (n) — x(log x + 2-y — 1).
and in 1915 Hardy and Landau proved independently that A(x) is not o(x 1 /4 ).
Let a be the infimum of the set of exponents such that
improving on the estimate of Iwaniec & Mozzochi (1988) a < 7/22. In Chapter 6
we shall show how van der Corput's method (1922) can be applied in order to
prove Voronas result in a simple way.
3.4 Euler's totient function 39
cr(n) = d
din
2
7
(5)
E
n<x
a(n) = 2x2 ± 0(x log x).
Proof. The idea of the proof is very simple: we write the function under consid-
eration as a sum taken over divisors of n, and we interchange summations. We
already met this approach in the previous section and we shall meet it again
in other contexts in this chapter. Thus
= d = 2-
n<x md<x m x
[] ( [-:T] ± 1)
2
X 1
= —21 m2 ± 0 (X V
z__w
m<x m<x
oo
1 72
(6)
E
m=1
-77,7,--i = w•
The best remainder term known to date for the asymptotic formula (5) is
0(x(log x) 2 / 3 ). It is due to Walfisz (1963), p. 99.
(7) co(n) =
md=n
E tt(d) E 771= —
t
(d)
[d] ([d] ±1 )
d<x m<x/d d<x
= —1 2 tt(d)
d<x d<x
(9)
co
E
tt(d) 6
72 •
d2 =
d=1
This implies the stated formula.
The best error term for (8) known at present is 0(x(log x) 2 / 3 (log2 x) 4/ 3 )
cf. Walfisz (1963), p. 144.
For each N > 1, the Farey series of order N, denoted .FN, is the ordered
sequence of rational fractions in lowest terms having denominator not greater
than N. Thus we have
f 0 li_ 10 1 1'1 10 1 1 2 11,
•Fl = 1 i, if, -F2 = 1Th 1 if, 1.1 1 3' 2' 3' 11 7
etc ...
From the definition of co(n), we immediately note that
Theorem 5. Let G(x,y) denote the number of pairs of integers fm,n1 such
that 1 < m < x, 1 < n < y, (m, n) = 1. Then G(x,y) — (6/7 2 )xy as x and y
tend to infinity. More precisely, setting z := min(x, y) we have
6 (log z
G(x,y) = xy{+0 )1 (x,y > 2).
7r z
Proof From the Mobius inversion formula (2.14), we have
ii(d)
= xy d2 + 0((x + y)E)
d<z d<z
= xy{---i
6 +0( 1 + (-1 + —1 ) logz)},
7r z x y
as required.
Formula (10) now follows from Theorem 1.9 and the upper estimate of Cheby-
shev (Theorem 1.3).
The case of the function 12(n) = EPu V is similar, but a little more delicate.
lin
Theorem 7. As x tends to infinity, we have
1
C2 = C1 ± 1.034653.
P
42 1.3 Average orders
Proof We have
-v _ 1
A(x) < x
p v>2 P
p P
( - 1 ) '
I x
( xp -11 - 1p( p — 1) ± ° ClOgg xp)}
p< N/x 2<v<logx/logp p<\/x
1
= ± O(X),
P
§ 3.6 Mean value of the Mobius function and the summatory func-
tions of Chebyshev
We saw in Section 2.6 that von Mangoldt's function
A = pc *log
is closely linked to the characteristic function of the prime numbers and has a
summatory function
1/)(x) = A(n)
n<x
satisfying
0(x) , 7(x) log x (x -> oo).
Therefore it is natural to ask if the average value of p(n) has a simple inter-
pretation in terms of the asymptotic behaviour of Chebyshev's functions 7r(x),
e(x), 0 (x). The following theorem, due to Landau (1909), provides a complete
answer to this question.
3.6 The Mobius function and Chebyshev's summatory functions 43
It remains to show that (i) = (iii). Note first of all that the convolution
relation p * 1 = 6 yields by summation that
= 6(n)
md=n
md
Working out the inner sum by means of Theorem 0.5, we deduce that
m<x m
= m(x) (log x ry) - G(x) + 0(1),
Thus
(1 - A(j)) p(k)
G(x) = 1 = 1 ± I x m(x It) dR(t)
jk<x 1-
3.6 The Mobius function and Chebyshev's summatory functions 45
with
R(t) := [t] - OW (t ? 1),
so that by hypothesis
R(t) = o(t) (t —> co).
By partial summation, it follows that
x x
G(x) =- -1+ I t-2 m(x/ t)R(t) dt - I t -1 R(t) dm(x /t)
J. -
x x
= 0(1) + I o(1 /t) dt - I o(1)I dm(x /01.
The first integral is o(log x). In view of the inequality between Stieltjes measures
Idm(Y)1< d{ E 1/n } ,
n<y
we note that the same holds for the second, from which we deduce (16) and
finally (iii).
with
E(z) := f(k)/k=C+0(1/Vz) (z >1).
k<z
This last relation follows by partial summation from the estimate (13) for a
suitable constant C. Letting x and then y tend to infinity, we obtain as before,
using (iii), that (17) has value -2-y ± o(1), giving the required conclusion.
46 1.3 Average orders
6
(18) Q(x) = 72 x + 0(N/x).
(19) n _ qm 2 , /1 (02 _ 1 .
Q(x) = E p,(d) =
,,(ci ixi_ 6 x i 0 (
1' ) L d2 J 72 m
-2 ± VX),
n<x d2 In d<Vx d> ,,/x
where the last step follows from (9). This yields (18).
Remark. We could have adopted the following slightly different argument. De-
note, for each m < Vx, by A m the set of integers n < x satisfying (19). Then
fn : n < x} is the disjoint union of the A m , and we have
'Anil =
3.7 Squarefree integers 47
from which
E Q(x/m2) [x] (x >1).
m<vx
Write x = y2 , and apply the second Mobius inversion formula (Theorem 2.9)
to the functions F(y) = [y 2 ] and G(y) = Q(y 2 ). Then it follows, as before, that
we have
6
(22) Q(x) = + o(Vx) (x —> oc).
A(n) ott(d) if n = d2 ,
if n is not a square.
By the hyperbola method, we obtain, for 1 < y < x, that
6x
Q(x) = 0(\/(X/y)) o(\/x),
6
2 = H (1— 1/p2 ) = 11(1 — 1/p) (1 + p(p) 2 /p).
One says that t2(71) 2 possesses a mean value, equal to 6/7 2 . It is easy to con-
struct arithmetic functions, even with values in [0, 1], which fail to have a mean
value. For example, with
The following theorem settles the question in the case of multiplicative func-
tions.
Theorem 11. Let f be a multiplicative function with values in [0, 1]. Set
00
M(f) :=1-1(1 —
P
f = fay * fi3y.
f 5_ f y := fay * Oy.
—1 if p < y and v = 1,
hy (If) = { 0
if p> y or v > 2.
Thus, for each fixed y, 13y has a mean value equal to M(f3 y ). Now we can write
.
c ° f (d)aY (d) m(f3y) = H(1 - 19-1 ) E f (Pv)P - v = M (fY)'
E
d=1 d P<Y v=0
50 1.3 Average orders
\ , 1 — f (p) .
/ I p
P
5 >-: (1 —
pviln,P>Y
f(Pv ))
1
± =: xE(y),
P>Y
P P(P — 1) }
say, where E(y) tends to 0 as y tends to infinity. The desired result follows from
this estimate, in view of (25), by making x and then y tend to infinity.
Notes
of the inequality a < 1/3: that of Landau (1912) using the theory of Bessel
functions and that of I.M. Vinogradov (1917), explained in the book of Gel-
fond & Linnik (1965), which also depends on the Euler–Maclaurin formula.
The upper bound a <23/73 of Huxley (1993a) stems from a refinement of the
method of Iwaniec & Mozzochi (1988) which yielded a < 7/22, thus improving
on a result of Kolesnik (1985), a < 139/429. The numerical gain is relatively
slight (139/429,----,- 0.324, 23/73,-----, 0.315), as in the whole history of this prob-
lem, but the ideas involved are important. Kolesnik's method is an extension to
many variables of that of van der Corput (1922) sketched in Chapter 6. That of
Iwaniec & Mozzochi essentially amounts, by a sophisticated process, to bound-
ing an average of double exponential sums. This treatment is a bidimensional
version of that of Bombieri & Iwaniec (1986), which led to a new upper bound
for the Riemann zeta function on the critical line, 1(( ± it) 1 <, t 9156+E. In
this direction, the best result known to date is due to Huxley (1993b), who
obtains, by elaborating upon the same method, the bound
570 ±E .
1( •- + it)1 < 6. t 89 /
The fact that A(x) is not o(x 1 /4 ) follows directly from the quadratic mean
evaluation of Tong (1956):
fo x
A (y) 2 dy = ( (3/2)4 x312 +0 (x(log x
6 20) )5) -
lir. in
sur{ 3 x2 }/x(log 2 x) 0.
72
n<x
§ 3.7. Theorem 10 is due to Landau (1909). The best error term known at
present for the formula (22) is
0,(x8/ 2 ).
§ 3.8. The method of the functions ay , Oy goes back at least to Eras in the
thirties. It has been used in a remarkable manner by Daboussi see in particular
1979, 1984. The case of convergence of the product M(f) can be generalised
as follows: Let f be a complex-valued multiplicative function such that
(26) If * WO <00.
n
Then we have
E f (n) = x{M (f) + o(1)}
n<x
h(d)
(x --- oo)
d dt-di d
E
d=1
h(d)
d = M(f)-
We shall see in Theorem 11.1.2 that assumption (26) is equivalent to
The case of zero mean value in Theorem 11 can be handled by Theorem 111.3.5.
Such an approach actually provides an effective estimate.
Exercises 53
Exercises
4. Show that
E 3w(n) = Cx(log x) 2 + 0(x log X),
n<x
lo gp =
E , 1 log x
p<x I'
- ry + OM
7. Let A(n) := E p-lin 'IP be the function of Alladi & Erdos (1977, 1979).
Assuming the prime number theorem, show that
7T 2 x2
E A(n) - 12 log X
(x --4 ±oo).
n<x
Make the error term in this relation more precise by appealing to a strong form
of the prime number theorem for example, Theorem 11.4.1.
54 1.3 Average orders
E ( Y) n(n)) a = 41P ( 1
n<x 1)
10. Squarefull integers.
Let S := {n _?_1 : pin p2 In}, S(x) :=1S n [1, xil.
(a) By estimating n<x,nES \/(x/n), show that
S(x) — Vx.
(( 3 )
[See Suryanarayana & Sita Rama Chandra (1973) for a more precise result.]
11. Let k(n) := flpin p be the squarefree kernel of an integer n.
(a) Deduce from Theorem 11 that En<x
k(n)In—Cx with
C := II (1 - 1Ip(p +1)).
P
E k(n)
n Cx_ + O(X).
n<x
(c) Show that En<x k(n) = -12-Cx 2 +0(x3/ 2 ). [See also Cohen (1960, 1964).]
Exercises 55
E co (r (n)) = Ax + 0 ( logx x)
and show that
12(7- (n)) = x log2 x + 0(x).
n<x
dt = -1 - -y.
t2
Sieve methods
P= H p.
p<N/x
Then an integer n with Vx <n < x is a prime number if and only if (n, P) = 1.
Thus, we can write
At this stage, if we insert the simple estimate [x/c/] = x/d + 0(1), we obtain
1
7(x) — 71( x) + 1 = x H (1 -
p<Vx P
but Chebyshev's estimates show that the error term is greater than any power
of x.
This calls for two comments. On the one hand the exact formula (1)—called
the sieve formula of Eratosthenes—involves too many terms for reasonable
practical validity. On the other hand, the estimate of the main term, taking
the prime number theorem into account, shows a posteriori that the "error
terms" created by replacing [x/d] by x/d have made a global contribution of
the same order of magnitude as the "main terms". This suggests that, even
suitably adapted, this method will never allow a proof of the prime number
theorem. However we will see that it can provide Chebyshev-type estimates in
a very general context.
4.2 Brun's combinatorial sieve 57
In order to obtain a non-trivial result starting from formula (1) one may
introduce a parameter y, 1 < y < x, and bound 7r(x)-7(y)± 1 by the number
of integers n not exceeding x and having no prime factor < y. With the same
calculations we obtain
1
Ir(x) 5_ x II (1- -)±0(2Y)
P<Y P
_ x {e - ')' + o(1)}
± 0(2Y)
log y
X
< { e—Y ± o(1)}
10g 2 x
for the essentially optimal choice y = log x.
It was with the aim of improving the efficiency of this method that the
Norwegian mathematician Viggo Brun invented the theory of the combinatorial
sieve between 1917 and 1924.
where the last equality is easily obtained by induction on t. This is all we need.
58 1.4 Sieve methods
Corollary 1.1. Let A be a finite set of integers and let P be a set of prime
numbers. Write
Ad := card {a E A: a 0 (mod d)},
P(Y) := 11 13 ,
pEP, p<y
S (A, P , y) := card {a E A: (a, P(y)) = 11.
Then for each integer h > 0 we have
Let us see how this result enables us to improve considerably the upper
bound for 7(x) provided by the sieve of Eratosthenes.
We select in the above corollary A = {n : n < x} and P = P(y) := {4<y p.
Then S(A, P, y) > 7(x) - 7(y) ± 1, from which we obtain
x
dIP(Y)
w(d)<2h
=x d
±u(y+ E
(5) dIP(Y) dIP (V)
1)
w(d) <2h w(d)<2h
1
X H (1 — —) + 0 (Y+ E 1+x
P<Y
P dIP (y) dIP(y)
w(d)<2h w(d)>2h
The second of the three error terms does not exceed y2h since this is an upper
bound for all integers d such that d 1 P(y), co(d) < 2h. The d-sum arising in the
third remainder term is bounded, for each value of the parameter u> 1, by
E u w (d) _2h /d _ u-2h ll(i+ u/p) 5_ exp{ -2h log u ± u >7, 1/p}.
dIP (y) P <Y P<Y
For the optimal choice u = 2h/ Ep<y /9 -1 , we obtain that this quantity is
<„ (log y)y
where v = u log u - u. This follows from Theorem 1.9. We note that when
u > 5, then v > 3. It is easy to show that for sufficiently large y there exists
some u = u(y), 5 < u < 6, such that
h := - 7_1,YI p
-1
P<Y
4.2 Brun's combinatorial sieve 59
is an integer. With this choice of the parameters, we have for sufficiently large x
y2h 5_ exp {6 log y(log
2 y + OM)} < x213
from which we deduce that
From this result one easily deduces a Chebyshev-type upper bound for 7(x),
as well as an evaluation of the order of magnitude of (I)(x, y) for y < x 6 , where
6 is some fixed positive real number. We leave the details to the reader.
log2 x 2.
(10) J(x) < x (
log x )
Corollary 4.1. We have
4.3 Application to prime twins 61
p(d)
(16) Ad = X +0(p(d)) (1t(d) 2 = 1).
d
Inserting this back into (12) and performing a calculation parallel to (5), it
follows that
valid for all R tuples {a i , ... ,ozR} of real numbers, which are 6 well spaced in
- -
where 114 denotes the distance of the real number u to the set of integers. Our
aim in this section is to prove the following optimal result.
Theorem 5 (Montgomery Si Vaughan; Selberg). Under the above con-
ditions the large sieve inequality (19) holds for the choice
However let us observe that the value given in (21) is attained for certain
values of the a i , N and 6. Indeed for each integer R > 1, set ai = j / R
(1 <j < R), so that 6 = 1/R, and when N a 1 (mod R) let us consider the
1
case an := 1 if R n, an := 0 if R t n. One then has
R R 2 = Ri N - 1 )2
EIS(ai)12 = E E 1 +1
j=1 j=1 0<n<N-1
R
rt0(mod R)
= (N — 1 ± R) (1 ± N R 1 ) = (N — 1 ± 6 -1 ) E lax.
0<n<N -1
E Cnry rX n
n,r
5_ DElxn12E1Yr12
n r
(Vx n , Yr E C),
EE CnrYr
2
< D r 1Yr1 2 (Vyr E C).
n r
Proof. Let us show the equivalence of (i) and (ii). That of (ii) and (iii) follows
by interchanging the roles of the indices r and n.
(i) = (ii): We have
2
E
n,r
CnrYrXn = E Yr E
n
CnrXrd 2
r
2
5- Elyr12 E E CnrXn < D E l xn 12 E 1 Yr 1 2
r r n
where the first upper bound stems from the Cauchy-Schwarz inequality.
(ii) = (i): For each r, set L r := En
cnr xn and apply (ii) with yr = L r .
Then
(E 1L9-1 2) 2 < DE1X77,1 2 EI L d 2 ,
r
In the sequel we will systematically use notation (18). The following result
is an immediate application of Lemma 5.1.
Lemma 5.2. Let a r (1 < r < R) be fixed real numbers. The two following
assertions, concerning the sequence of real numbers b n > 0 (n E Z) such that
bn > 0 (M < n < M + N) and the real positive number B, are equivalent:
(0 E
1<r<R
Isccor < B E
M<n<M±N
lanr/b. (Van E C)
2
(ii) E bn E yr e(n ar ) < B 2 (Vy, E C).
1Yr1
M<n<M±N 1<r<R 1<r<R
Proof We use Lemma 5.1 with c, = e(nar)On. Up to replacing a n by anVbn,
expression (i) is equivalent to
Using the equivalence of statements (i) and (iii) in Lemma 5.1, we see that the
above condition can be written
2
E E yr e(na r )Vbn <B 1Yr1 2 (VYr E C),
M<n<M-EN 1<r<R 1<r<R
(22) (N , 6) = B(0).
The remainder of this section is devoted to making explicit a suitable choice
for the sequence {b n } nEz of Fourier coefficients for B(a).
It is natural to try and write b n as the value at n of a function b E (R)
for which the Fourier transform
+00
1)(6) := f b(t)e(-0t) dt
with
A, (0) := -60 — a) { sin(17r9)
66 1.4 Sieve methods
Since 1± 6 ± al < 1 (it is here that we make use of the assumption 6 < ), we
have
A, e L'[ — 6 + a, 6 + a].
to -b(—a) = 0(a). This establishes (23) for la 1 < , and thus for all a, by
periodicity.
We are thus led to look for an integrable function b such that the quantity
+00
(26) B(0) = b(0) = I b(t) dt
we have
with the consequence that conditions (27) are certainly realised for the choice
C = 172 . This proves the inequality
which suffices for most applications. Selberg remarked that the following lemma
allows a better choice for b(t).
4.4 The large sieve—analytic form 67
(sin 71Z \ 2 00 1 1
F(z) :=
( z _ 7- ) 2 (z + n) 2 z2 } .
) I n=0
Then F defines an entire function of z, such that
and
f +00
(28) (F(x) — sgn(x)) dx = 1.
Remark. It plainly follows from (28) that F «; V- (IR). One can however interpret
the upper bound for F(z) as meaning that, in a certain sense, F(0) = 0 for
101 > 1.
Proof. The first two assertions are clear: writing z = x iy, then, assuming for
instance that ly1 > 1, we have lz ± n12 > 1 + (1x1 n)2 (n > 0). In order to
prove the third assertion we first recall Euler's formula
(Sill 71z
71
(sin 7rx 2 {
F(x) = +2
71
Finally
sin irz ) 2
F(0) = lim = 1 > 0 = sgn(0).
ITZ
68 1.4 Sieve methods
= fo co
(F(X) ± F(—X)) dx = 2
r
o
(sin 7rx )2 dx = 1.
/TX
S(a) := an,e(na).
M<77,<M±N
We apply Theorem 5 to the case when the a, are all rational numbers of the
form a, = alq with (a, q) = 1, q < Q. For r s one clearly has
which shows that the a r are Q 2-well spaced. We can thus write
The usefulness of this inequality rests in the observation that one can bound
the inner sum from below by an explicit function of q linked to the number
w(p) of classes modulo p (p dividing q) in which an vanishes identically. More
precisely, let us write, for each prime p,
and put
(We may assume that w(p) <p for all p, since otherwise a n 0.) The foun-
dations of the arithmetic form of the large sieve are set out in the following
result.
Theorem 6. With the previous notation, we have for all q > 1
2
(33) E
M<n<M+N
an g (q) <
1<a<q, (a,q)=1
Is(a/q)1 2 -
Corollary 6.1 (Arithmetic large sieve). For any finite sequence of complex
numbers {an : M <n < M ± N}, we have
(34) E an
2
<
N — 1 ± Q2
L E l anl
2
M<n<M+N M<rt<M±N
with
(35) g(q),
q<Q
Assume (37) is satisfied for q and q' with (q, q') = 1. By the Chinese remainder
theorem we can then write
E Is(e/qq/)12 —1<a<q Et
E i<b<q Is(a/q + b 1 q' )1 2
1<c<qq'
(c,qq')=1 (a,q)=1 (b ,q1)=1
E is(a/q)12gm ? is(0)12g(og(q').
1<a<q, (a,q)=1
Since g is multiplicative, it follows that (36) (and consequently (37)) is true for
qq' . Moreover g (q) = 0 when q is not squarefree, so we may confine ourselves
to establishing (36) when q is prime.
For any prime number p we have
p —1 p —1 p-1
rth (mod p)
Note that by assumption S(p, h) is zero for at least w(p) values of h modulo p.
So, by the Cauchy-Schwarz inequality, we have
p-1 p-1
2
1 ,9 (0)1 2 = S(p,h) < (19 — w (P)) E Is(p, h)12,
h=0 h=0
from which it follows by (38) that
p-1 p-1
Relation (39) is a weakened form of the large sieve inequality, since only the
contribution of those q that are primes is estimated. It is however a very useful
result for applications—cf. Notes. Moreover, it may be extended to congruence
classes for composite moduli. Montgomery (1968) showed that for all squarefree
q one has
q-1 2
,u(d)
gE E d S(q/cl ' h)
= E Is(a/01 2 .
h=0 dig 1<a<q, (a,q)=1
Inserting this into (30) we see that S (q / d, h) is, on average over d dividing q
and h in [0, q — 1], close to (d/q)S(0).
§ 4.6 Applications
By comparison with Brun's method, the large sieve provides a remarkably
effective upper bound for the number J(x) of prime twins not exceeding x.
Theorem 8. As x tends to infinity, we have
with
(41) C := 2 H ( 1_ (p — 1) -2 ).
p>3
This upper bound is thus asymptotically equal to eight times the conjectured
value for J(x)—cf. the Notes for references on improvements.
72 1.4 Sieve methods
h(p) _ 4 w p , ) _ 2( _ 1)1 (P ± 2
( v ?._ 2) .
i p - 2' - 2)
It is easy to check that the series E cicL i h(d)d —° is absolutely convergent for
a> , from which we deduce that
where the sum over m has been evaluated by partial summation from the
estimate
6
2W(m) E =E
1* it2 (m) ,---, y log y.
m<y m<y 7r
with
= H( 1 _ p - 2)-1 ...
H (1 +P(P4 — 2)
2(p + 2)
P2 (P - 2 )( 1 +p')
)
P p>3
1
2 li (1 (p - 1) 2 ).
p>3
Noting that each integer m < Q factorises uniquely in the form m = qdt
with (q, k) = 1, tt(q) 2 = 1, dlq", tlk" , we can write
1 1_ p(q) 2 q k1
E—m—
m<Q
<
q<Q,(q,k)=1
q d qc'')
t q
tikc'G q<Q,(q,k)=1
Notes
§ 4.2. In order to estimate the third error term of (5) we have appealed to
the parametric method, presented in detail in Chapter 0 of the book of Hall
& Tenenbaum (1988). The famous Rankin's method (cf. § 111.5.1) is another
example of this fruitful computational trick, which consists in bounding the
characteristic function of a set of integers by a constant multiple of a multi-
plicative function depending on a parameter to optimise.
The application of Brun's method to an upper bound for 7r(x) is of course a
mere illustration of the basic ideas, and should not be considered as a genuine
result. Not only, as we already observed, does it give a weaker estimate than
Chebyshev's bound, but also one could object, strict() sensu, to a loss of infor-
mation. Indeed, we have appealed to Theorem 1.9, which itself rests on Mertens'
first theorem. But this last result immediately yields a Chebyshev-type upper
bound for 7r(x) since
x log p x
7r(x) - 71(Vx) < < .
log x p log x
Vx<p<x
However it can be easily checked that what is really needed in Brun's treatment
is an asymptotic formula for E p<x 1/p. This is a much weaker assertion than
Theorems 1.9 or 1.7, and reveals precisely the reason why the scope of the
method is so large.
For other results concerning 1(x, y), see Chapter 111.6.
Alladi (1988) explains recent developments of Brun's method relating to
sums of multiplicative functions on certain subsets of N.
The reader interested in the up-to-date theory of the combinatorial sieve can
look at the survey of Diamond & Halberstam (1985), and at the deep articles
of Iwaniec (1980, 1981).
§ 4.4. Although the proof that we give here for the analytic version of the
large sieve appeals (at least for the optimal form of the result) to the theory
of analytic functions of a complex variable and to a deep result of harmonic
analysis, it is possible to consider the large sieve as an essentially elementary
tool. We have seen that the proof of the inequality
72 (N - 1+ (5-1 )
is elementary. One can also look at the proofs, all different, of Mont-
gomery (1971) [6. < N + 26 -1 ], Bombieri (1974) [idem], and Elliott (1980)
[A < N + 6-1].
Notes 75
The function F(z) of Lemma 5.3 was studied by Beurling at the end of the
thirties. For an exhaustive study of the optimisation problem (26)—(27) and
of the numerous applications of various generalisations of this question, see
Graham & Vaaler (1981, 1984), Vaaler (1985).
The large sieve is equally useful to estimate mean values of weighted averages
of Dirichlet characters—cf. § 11.8.1. Let us put
T(x) := anX(n).
M<n<111-EN
2
E* 1T(x)12 < 1<a<q, (a,q)=1
This inequality plays an essential role in the study of L-functions (cf. § 11.8.2)
and the distribution of prime numbers in arithmetic progressions. It is, in
particular, one of the fundamental ingredients of the proof of the Bombieri-
Vinogradov theorem (1965, 1966) cf. Chapter 11.8, Notes.
§ 4.5. Theorem 7 has been used in numerous elegant solutions of arithmetic
problems. In particular it underpins the original proof of Daboussi's theorem
(Daboussi & Delange, 1974). It is also the starting point of a new method of
Hildebrand (1986c, d, e, 1987a) for studying the mean value of multiplicative
functions of modulus at most 1. Hildebrand (1986b) also applies this inequality
to manufacture a new elementary proof of the prime number theorem. A variant
of Theorem 7 has been established by Elliott (1979, lemma 4.7) where only
the congruence classes modulo p are taken into account. Elliott proves the
inequality (see also Theorem 111.3.2)
1 2
2
E
p<N
P S (0, — — S (0) <16N
1<n<N
l an1 *
This result is not directly comparable to (39): summation over p is longer (since
(39) implies an upper bound of the same order only when Q << VN), but here
only a single congruence class for each p is considered.
76 1.4 Sieve methods
§ 4.6. The best upper bound at present for J(x) is due to Wu (1990), improving
on a result of Fouvry 8z Grupp (1986). The factor 8 in Theorem 8 is replaced
by 3.418. The basic result in this problem provides a factor 4. It is due to
Bombieri Sz Davenport (1966) cf. Halberstam & Richert (1974). Besides the
sieve (Selberg's, but the large sieve is also applicable) the proof requires the
Bombieri—Vinogradov theorem.
The inequality (43) is in fact valid uniformly without any error term, i.e.
2y
7r(x + y; t, k) — 7(x; f, k) < (1 < k < y < x).
co(k)log(y/k)
Exercises
(b) Use Brun's "pure" method to show that there is a positive absolute
constant c, such that one has, uniformly for 1 < y < z < exp{ c log x/ log2 x},
(c) Extend this result, using the fundamental lemma of combinatorial sieve
theory.
(d) Show, assuming the prime number theorem, that (46) is not valid uni-
formly for 1 < y < z < Vx.
Exercises 77
s(x) << vx H 0 _
p<x P
p_= 1(mod 4)
(d) Make the preceding calculation more precise by using Dirichlet's theorem
1 . I log x + OM.
E _P 2 2
p<x
731. (mod 4)
(e) Under the same assumptions show that there exists some absolute pos-
itive constant B such that
where Q(B , x) :. fm < x : pm =p > x1 /9. [The elements of this set are
usually referred to as "quasi-primes"--see Halberstam & Richert (1974), §,2.8.
In particular a quasi-prime has only a bounded number of prime factors. More-
over1Q (B , x)1 x 7r(x).]
3. Almost squares.
(a) Let p be an odd prime. Show that the kernel of the endomorphism of
(Z/pZ)* : x i— x2 is {±1} and deduce that the number of quadratic non-
residues modulo p is (p — 1)/2.
(b) By sieving the set A = In : n < x} by the prime numbers < Vx for
suitable classes, show that the number S of integers n < x such that n is a
quadratic residue modulo p for all p< Vx satisfies [Vx] < S < aVx where C
is an absolute constant to be calculated.
4. Majorising the number of representations in Goldbach's problem. Let N be
an even integer and let r(N) denote the number of representations of N in the
form N = p + q where p and q are primes.
(a) Show that, for any multiplicative function f > 0 with support included
in the set of squarefree numbers, one has
r(N) < G
N
il (1 + p2 ) (log N) 2 •
PIN
where 8 is a positive constant. Show that, uniformly for 0 < a < 1/ log 2 N, one
has
E ih(d)d' <log2 N.
d=1
(d) Applying the above result to a suitable function h, establish the following
inequality, which sharpens the result obtained in (b)
N
r(N) 5_ (16 + 0(1))CN
(log N) 2 '
cp(t) := f (n + t)
n Z
6. Integers coprime to q.
For each integer q > 1, set .1Vq (x) := Ifn < x : (n,q) = 11I.
(a) Show, for each fixed q> 1, that
lim Nq (x)Ix = co(q)1q.
(b) Given q, show that each integer n > 1 can be written uniquely in the
form n= hdt, with (h, q) = 1, d q, ,u(d)2 = 1, p t -191 d.
(c) Calculate Epit,pid
Vt.
(d) For q > 1, Q >1 set L(q,Q) :=Ediq,d<Q li(d) 2 IW(d). Show that
tion (47).
1.5
Extremal orders
q>Q1f(q)1 < E.
n = ni n2 n3 with ni := H q (i = 1,2,3).
On, qEQ,
T(nk) = 2 k
and
log nk = (9 (Pk) = log p < 7r(pk) log pk = k log pk.
P<Pk
5.3 The functions w(n) and 12(n) 83
Remark. The problem of the minimal order of r(n) is trivial: we have T(n) > 2
with equality whenever n is prime.
Each of these inequalities is optimal: the first two are equalities if and only if
n is squarefree, and the third whenever n is a power of 2. This allows a rapid
evaluation of the maximal orders for w(n) and Ct(n) the minimal orders being
trivial.
First, (4) and (9) imply
and inequality (8) shows that this upper bound is optimal up to the factor
(1 ± 0(1)), since the nk occurring in (8) are squarefree.
The situation is even simpler for Q(n) since the last inequality in (9) is
attained for an infinitely many integers n.
We can therefore state the following result.
Theorem 3. (i) A maximal order for the function w(n) is log n/ log2 n.
(ii) A maximal order for the function 11(n) is log n/ log 2.
84 1.5 Extremal orders
The upper bound (11) is trivial, the lower bound (12) follows from Theorem 1
applied to f (n) =
We can make these estimates more precise in the following result.
Theorem 4. A maximal order for so(n) is n. A minimal order is
e — Y n/ log2 n,
for all integers q with 71(q) > w(n). By Chebyshev's lower estimate (Theorem
1.3) this condition is realised as soon as
where A is some suitable positive constant. By Theorem 3(i) we have with this
choice
q < log n
5.5 The functions o -k (n), K > 0 85
( 1 e -1' ( 1 i.
(14) (p(n) ? n e--1 {1 + 0 >n 1+0
log q og q)} - log2 n { og2 n ) i
e--Y
co(nk) =
nk
H (1 /3 -1 ) =
log pk
{1 +0 ( 1 )1
log pk
P<pk
e--1 1 \)
= {1 -1-- 0
log2 nk (log2 nk )f'
o(n) =
shows that it suffices to study the extremal orders for positive values of the
parameter—the case rc = 0, corresponding to a o = 7, having already been
considered.
We have
p (11-1-1* 1 vtc 1 ___ p— (11+1*
(15) atc(Pv ) = =P
pic -1
,
so that
say. The product converges for tc > 1, and equals CY log q ± 0(1) when n = 1.
Moreover log q < log2 n ± 0(1). In particular, we can then deduce that
CY log2 n ± 0(1) if ic = 1,
(19) a-, (n)n' <
((k) (1 ± Otc (log' n)) if > 1.
These upper bounds are achieved asymptotically for the sequence {rn,k } defined
by
Notes
§§ 5.1 5.5. The first systematic study of the maximal order of an arithmetic
-
function is due to Ramanujan (1915), who was concerned not only with the
determination of large values of 1- (n), but also with making explicit the struc-
ture of the numbers for which these large values are attained. Today the study
of maximal orders for arithmetic functions has become an active area in the
theory of numbers, with powerful methods and original techniques. An excel-
lent survey, including an exhaustive bibliography, has recently been written by
Nicolas (1988). The reader who is interested in this subject will find other ex-
amples of problems and conjectures in Nicolas (1974/5, 1978, 1983a), Erd6s &
Nicolas (1981a and b, 1989), Erd6s Sz Tenenbaum (1989b) and Eras Sz Sarkozy
(1994).
Questions linked to maximal orders are often extremely non-trivial, par-
ticularly in the extent to which they are concerned with the fine structure of
integers. In general the problem of "large values" induces in a natural way a
study of the distribution of those integers at which they are attained. This
is often very difficult. Even in the well known case of the divisor function
r(n) and highly composite numbers of Ramanujan, the results obtained are
incomplete—cf. Nicolas (1988).
§ 5.5. For the asymptotic behaviour of ((k, q), see Lemma 111.5.9.1.
Exercises
1. What are the extremal orders for the function "sum of digits in base r"?
3. Our aim here is to recover a result of Erdos & Nicolas (1981b) concerning
the extremal orders of the function F(n) = co(n) ± co(n ± 1).
(a) Show that F(n) < 3n/2 (n > 1).
88 1.5 Extremal orders
(b) Let pj denote the jth prime and put Nk := 1 ± n i<j< k pi . Show that
PINk p > Pk and that pl(Nk+1) = p = 2 or p > pk. Show that log Nk Pk X
k log k. Deduce that, as k -> oo, one has yo(Nk) Nk and (p(Nk ± 1) - Nk•
What is the maximal order of F(n) ?
(c) Show that, as n co, one has (p(n) nn p i n, p<io g n( 1 - P -1 )•
(d) Deduce from the previous result that
H H (k --> oo).
1<j<r r<j<k
Show that
(Mk +1)
max (1+ 19(1)) H (1- 1/) (k --> oo).
((Mk) (Mk +1)) -<
Mk (P 1<j<7-
Deduce that the right-hand side of (21) is a minimal order for F(n).
4. (a) Determine the maximal order of the function n log r(n2 ).
(b) Let S := : pls = p2 1,91 be the set of "squarefull" integers. What is
the maximal order of the function s 1-> log r(s) (s E S) ?
(c) Let wi(n) be the number of prime factors p of n such that p 2{n. Show
that
T(n) D(n) A +0(1) 2 (1-A) w 1( n) (n oo)
with A := (log 3)/ log 4 and D(n) := max,,<,, r(m).
5. Let a >1. Set F(n) := Ei<i‹, (n) ((d +1/ di )-1) a , where {dj : 1 < j < r(n)}
denotes the increasing sequence of divisors of n.
{ 1 (if dind v1)
(a) Writing x(n; u, v) := show that
0 (if ]d I n, u < d < v)'
nfv
F(n) = x(n;u,v)(v - u)a -2 (av - u)u -1
F dudv.
that a sequence of real numbers {E -j : 1 < j < k} is admissible for {ni } if for
any j E [2, lc] and each z E 1V(nj_i), Vnii the interval ]z, (1 + Ej )z] contains
at least one divisor of n j . Show that
§ 6.1 Introduction
In the nineteen twenties van der Corput developed a method of estimating
trigonometric sums which has numerous applications in number theory. This
theory is typically relevant to the Dirichlet divisor problem and to the circle
problem, i.e. the evaluation of the number of pairs (m, n) E (Z + ) 2 such that
m2 + n2 < x. It is more generally applicable to counting the number of points
with integral coordinates within a given contour: see Exercise 5.
As early as 1922, van der Corput had obtained that the remainder term in
the Dirichlet problem has order <, x33 loo+E . This/ problem, and others of a
similar nature, have subsequently given rise to a prolific literature. A large part
of it rests on the ideas of van der Corput see also the notes on section 3.2.
In this chapter our aim consists essentially of describing the principle of
the method in its simplest form, which leads to the theorem of Voronoi, to the
estimate 0(x 1 / 3 ) for the remainder in the circle problem and to the exponent
1/6 in the bound for the zeta function on the critical line (cf. Section 11.3.4).
The reader interested in a deeper study of the method is referred to the work of
Titchmarsh (1951), to the original articles of van der Corput (1922-1937), to the
works of Kolesnik (1981, 1985), as well as to the more recent developments of
Bombieri & Iwaniec (1986), Iwaniec & Mozzochi (1988), Huxley & Watt (1988),
Watt (1989), Huxley (1990, 1993a, b), and Huxley & Kolesnik (1991). The
monograph of Graham & Kolesnik (1991) furnishes a thorough introduction to
the modern state of the subject.
We shall make use of the Poisson summation formula, which undoubtedly
constitutes the most natural approach to the study of trigonometric sums.
Define the Fourier transform by
We apply the Poisson formula with the hypotheses appearing in the following
statement.
6.2 Trigonometric integrals 91
(2) cp(t) = f (n ± t)
nEZ
converges for all t and that its sum defines a function of bounded variation on
[0,1] which is continuous at 0. Then we have
(3) Ern
N—> co
1-(v) = E f (n).
MN nEZ
Proof. Jordan's theorem on Fourier series (see, for example, Titchmarsh (1939),
p. 406) states that each periodic function of bounded variation over a period is
the sum of its Fourier series at each point of continuity. This implies (3) since
the Fourier coefficient of order v of co is f (v).
Theorem 3. Let f E C2 ]a,b[ such that I'M has constant sign on ]a, b[. Set
r := inf I f " (t).
a<t<b
Then we have
(5)
fab e(f (t)) dt _< 4V(2/7r).
92 1.6 The method of van der Corput
Proof. Let us suppose, for example, that f"(t) < —r < 0 for a < t < b. Then
PM vanishes at most once on ]a, b[, say at t = c. (If r(t) does not vanish, the
argument is similar, but simpler.) Now we can write
b c-45 f c+45 fb
I := i e(f(t)) dt = f +1 = 11+ 12 + 13,
a a c—S c+S
t
Int) = V ,r(v)dd ? rIt — cl > r6
/1 1+1/3 1 5_ 4/7r6.
and hence we have the stated result, by choosing 5= V(2/7r). It is clear that
this upper bound remains valid if, with this choice of 6, one has either c < a +6
or c> b — 6.
Indeed, formula (6) is invariant on replacing f(t) by f (t) + kt, for any
k E Z. We can equally well restrict ourselves to the case when a and b
are of the form m + with m E Z. Indeed, the error involved is 0(1) on
the left-hand side, and 0( log(0 — a + 2)) on the right. This last estimate
follows easily from a summation interchange, using the trivial upper bound
e(—vt) < min(0 — a + 2,1/ ), where denotes the distance from t to
the set of integers. Finally we suppose that f' is decreasing on ]a, b[. Let us set
with
P(II) = e(f (t) — vt) dt.
a
Taking account of (7), it remains to show that we have
Now
fb d{e(f (t) — vt)} _ (f (0) — /it)) b
27riP(v)= (ft(t)-0
e(f(b))
e
[ Cr(t —v : fa
b e lf (t) vt)d{ f (t) v }
It is plain that the contribution to (9) of the main terms is 0,(1). Moreover,
that of the error term is
«E 0 +1
v(v /3) 2..j ,=(0+1) 7,) + 0+1)
( E v(v1 (4)
vf[0,13+e [ vy/ /-)
1
<E + v—
+2(0+1) E 1og(0 + 2).
1<v<13+1 13+e<v<2 v>20
Then we have
(10) E e (f (n)) - a + 1) A 1 / 2 +
a<n<b
Proof. We can assume that A < 1, since otherwise (10) is trivially satisfied.
With the notation of Theorem 4, the left-hand side of (10) is
where the second upper bound follows from Theorem 3. Now we have
Then
a<n<b
Lemma 6.1. Let f be a real function defined on ]a, b]. For any integer q with
1 < q < b - a, we have
E
a<n<b
e (f(n)) <
2(b - a)
(12)
E e(f(n + r) - f (n))
a<n<b—r
6.3 Trigonometric sums 95
Proof of Lemma. Let F be the function defined by (8) and S := Enez F(n) be
the sum to be estimated. We have trivially
q
s. q-iEEF(n+m).
m=1 nEZ
where the dash indicates that the summation is restricted to integers n with
a<n+m<b for at least one m such that 1 < m < q. Thus the first sum over
n does not exceed b — a + q < 2(b — a). The inner sum equals
q-1
< 2(1) — a)q +2 E(q — r) E F(v)F(v — r)
r=1 vEZ
q-1
5_ 2q{(b — a) + EEF(v+r)F(v) }.
r=1 vEZ
(a < x < b — r) .
96 1.6 The method of van der Corput
This follows immediately from Taylor's theorem (of order 1) for f". Using
Theorem 5 to bound the sum over n in the right-hand side of (12), we have
that
q-1
E e (f (n)) < Lq -1 / 2 ± {Lq -1 E (L(rA)1/2 + (rA)-1/2) } 1/2
a<n<b r=1
Proof. Write N := [Vx]. The hyperbola method cf. formula (3.4) shows
that the left-hand side of (14) can be written as
2 E [ ii ]
d<N d<N
where B, (t) = {t} - denotes the first Bernoulli function. (Recall that {t}
is the fractional part of t.) Using Theorem 0.5 to estimate E d<N 1/d, we can
write the above expression in the form
P(x) - 2R(x)
with
1 1
P(x) := 2x (log N + y+
- ± 0(-N - N 2
N
and
R(x) := E Bi(XI 4
d<N
6.4 Application to the theorem of Voronof 97
E ai sin(27rjt)
00
(16) B(t) =
J=1
with
with
(19) bj =- 2 .2 sin2 .
71
Clearly we have
E ai E
00
R(x;M,T) = sin(27rjx/d)
j=1 M <d<T
(21)
cos(271jx/d)
j=1 M <d<T
Now it follows from Theorem 5 that for each real number y we have
M <d<T Y)
jx 1/2 M3 1/2
< 7+ x
where the sum over j has been estimated taking account of (20). For the optimal
choice J := Mx-1 /3 , we obtain
( M 3 \ 1/2
(23) R(x; M,T) < x 1 I3
x )
Notes 99
This bound is a priori valid only if J > 1, i.e. if M > x 1 /3 . It is clear that it
still holds, trivially, in the complementary case. Set r o := [log N/ log2] — 1. We
have
ro
R(x) = R(x; 2', 2r+ 1 ) R(x, 2'0 + 1 , N)
r=-1
To
E (x 1 / 3 ±2 3r ox _i/ 2) ± X 1/3
r=-1
< rox 1/3 N312 x -1/2 < x1/3
log x.
Notes
Exercises
E [f (n)] =
a
f (t) dt + (f (b) — f (a) + a — + 0 0 113 (b — a) +
a<n<b
[The best estimate known to date for the remainder term is < x23173+6 , due to
Huxley (1993a).]
7. Show that for all A E 111N{O} we have
(b) Integrating by parts fiN e (f (t)) dt, show that (25) does not hold if x (x)
tends to a finite limit.
This page intentionally left blan
Part II
:, E f (n,)z n
00
S(z)
n=1
converges in some neighbourhood of 0, several classical theorems allow us to
connect the analytic behaviour of the sum with the values of the coefficients—
for example, the relation S(n)(0) = n! f (n) or Cauchy's integral formulae.
Similarly, the analytic behaviour of a convergent Dirichlet series
E f (n)n - s
00
(1) F (s) :=
is closely linked with the asymptotic nature of the sequence An). One of the
principal aims of Part II is to develop methods which make this link explicit.
The letter s denotes a complex variable. The real numbers a and T are
implicitly defined by the relation
s = CI ± iT.
Definition. Let f be an arithmetic function. The Dirichlet series associated
with f is the function of the complex variable F(s) defined by (1) for those
points s where the series is convergent.
The properties of formal Dirichlet series studied in Chapter 1.2 suggest that
the concept of a convergent Dirichlet series might have interesting arithmeti-
cal consequences. We begin with a simple but fundamental result, concerning
Dirichlet convolution.
Theorem 1. Let f, g and h be arithmetic functions, with respective Dirichlet
series F, G and H. Let us suppose that
(2) h = f * g.
If F and G converge absolutely at any given point s, then so does H and,
further,
(3) H(s) = F(s)G(s).
106 11.1 Generating functions: Dirichlet series
Proof. If F and G are absolutely convergent at s, then for each x > 1 we have
< f(m)rn — 3.
1 Ig(d)d— s I .
ni X
This implies the absolute convergence of H(s). Relation (3) follows by the
associated formal identity cf. § 1.2.4.
is realised then the Dirichlet series (1) is absolutely convergent, and we have
00
Remarks. (a) If the series (1) is absolutely convergent, the same holds for (4).
So the absolute convergence of (1) is actually equivalent to that of (4).
(b) When F is completely (resp. strongly) multiplicative, the right-hand
side of (5) takes the simpler form
ri /1 f(p),
pS )
-1,
(res. H 0+ psf (P)
_ 1 )).
P P
1.3 Fundamental analytic properties of Dirichlet series 107
Proof. Observe first of all that condition (4) implies the convergence of the
infinite product M := II, (1 + E,Mi 1 f (Pv )13 - " 1) • Then for all x > 1 we can
write
00
This shows that the series F(s) is absolutely convergent. The inequality
00 00
E f(n)n - 8 — H E
n=1 p<x v=0
f (P1 1 )P- /
n>x
If (71) 71,- 8 1
A(t) := an •
Then the Dirichlet series associated with (a n ) can be written in the form
00 co
F(s) := an ' — f e -t s dA(t).
n=1 o-
This integral is called the Laplace-Stieltjes transform of the function A(t). Most
of the fundamental theorems concerning Dirichlet series can be generalised in
the context of this transform, where the Stieltjes integral reveals itself to be a
handy technical tool.
Let V be the class of functions defined on IR and with bounded variation on
every finite interval. Although we are primarily concerned with Dirichlet series,
we consider, whenever possible, the Laplace-Stieltjes transforms of functions
in V with more concern for clarity than generality.
108 11.1 Generating functions: Dirichlet series
Proof. (i). Let 0 E [0, 7/2[. The sector S(0) is the set of complex numbers s
such that
a — ao
— cos 0 •
For any E > 0, we show there exists a real number x o = x o (E, 0) such that,
whenever y > x > xo, we have
The left-hand side of this equality is thus an entire function of s, for which the
kth derivative takes the value
00 n-k fx x
E
n= k
(:_ k)! jo_ ( — tr d A(t)- -0f - (- t) k e - t s d A(t) .
Assertion (iii) follows from the above, by appealing to Weierstrass' theorem
on uniform limits of sequences of analytic functions on compact sets see for
example Cart an (1961), chapter V, theorem 1.
Let us consider an integral F(s) of type (7). By Theorem 3, (i) and (ii),
the set of real parts a of those points s where the integral converges (resp.
converges absolutely) is a half-line with origin a, (resp. cra ). One defines a,
and ac, to be respectively the abscissa of convergence and the abscissa of
absolute convergence of the Stieltjes integral F(s). By convention we allow
these numbers to be ±oo.
It is easy to construct examples of Dirichlet series converging everywhere or
diverging everywhere on the line of convergence a = a,. The following result
shows that the sum of the series may be evaluated on the line of convergence
by analytic continuation, if this exists.
Theorem 4. Suppose that the integral F(s) defined by (7) for a> a, has an
--- (s) for certain points s with real part a = a,. Then we
analytic continuation F
have
+0,0
P(s)
- =f e —t s dA(t)
Jo —
as required.
110 HA Generating functions: Dirichlet series
(12) G(s) := (
1)nn -8 = (2 1-8 - 1)((s)
n=1
we have a, = 0 (by the theorem for alternating series), and, of course, cya = 1.
By analogy with power series one might be tempted to believe that a Dirich-
let series necessarily possesses a singularity on the line of convergence. This is
not the case. The series (12) constitutes in this respect an excellent counter-
example. We shall indeed see in Chapter 3 that the Riemann zeta function can
be continued as a meromorphic function on all of C with only a single, simple,
pole at s = 1. This implies that G(s) may be continued as an entire function.
One can also prove directly that G(s) has a regular analytic continuation for
a> -1 cf. Exercise 2.
The following theorem, usually referred to as Landau's theorem (cf. Notes),
describes a situation in which the line of convergence always contains a singu-
larity.
1.3 Fundamental analytic properties of Dirichlet series 111
1
F( s) = E — tk(a- — s)ce—at dA(t)
°— k=0kl.
00
t(cr—s) e —crt dA(t) = i e — st dA(t).
= 107 e Jo—
Thus the Laplace—Stieltjes integral converges at s, which leads to a contradic-
tion by selecting s < ac . This concludes the proof.
The Phragmen—Landau theorem assumes a particular importance in prac-
tice because it forms the basis of the proofs of most oscillation theorems. Here
we confine ourselves to the following two results which are typical of its use.
Theorem 7. Let A : [1, co[---> IR be measurable and locally bounded. If the
integral
oo
(13) H(s) := I A(t)t — s -1 dt
1
has a finite abscissa of convergence a, and if H(s) has a regular analytic con-
tinuation at the point s = ac , then for each E > 0 we have
(14) A(x) =
112 11.1 Generating functions: Dirichlet series
Proof. Assume, for example, that there exists some constant K = K(E) such
that
A(t) < Kric - '
for all sufficiently large t. By modifying the value of K we can suppose that
this inequality is actually satisfied for all t > 1. So we can write
00 00
H(s)
K
=
8 - a-, + E , 1 (A(t) - Ktac -E )t - s -1 dt = - I B(u)e -su du
0
with
B(u) := Ke (cre - ') u - A(eu) > 0.
The abscissa of convergence of the last integral is still ac , since
I 00
t'c's -l dt
E an, = Q±(Xcr°).
n<x
Proof Write A(t) := En<t an and let a, denote the abscissa of convergence
of the integral H(s) defined by (13). By partial summation, we have, for suffi-
ciently large a,
F(s) = sH(s)
so that H(s) has a continuation which is regular on the half-line [cr o , oo[. We
assume that there exists some constant K such that A(t) ± Kt(70 has constant
sign, say positive, for sufficiently large t. Then there exists some constant C
such that
B(t) := A(t) ± KC° + C > 0 (t > 1).
The integral
00
K C
L(s) := i B(t)t' dt = H(s) + + —,
1 s - 0-0 s
1.3 Fundamental analytic properties of Dirichlet series 113
K A
L(ao + 6) L(so + 6) - (6 ---> 0+).
506171
This implies that IKI > IA/sol. In particular, A(t)+KC° does not have constant
sign for all sufficiently large t if
I KI < 1 A / so
with
m s
C(s):
k=1
f(m) m ± k )
By Theorem 5, this series converges absolutely for a > a, ± 1. For a > 0-1 >
a, ± 1, we then have
00
Thus as a - > oo, G(s) tends to 0, contradicting (15) and hence concluding the
proof.
114 11.1 Generating functions: Dirichlet series
(iii) If the integral (16) converges for s = so with ao < 0, then there exists
some real number a such that
A(x) = a + o(ea°x) (x ---> oo).
Proof. (i). For all x> 0, we have
fo x x
e - st dA(t) =-- A(x)e + s I e- s t A(t) dt.
o
The assumption concerning the rate of increase of A(t) then implies the con-
vergence of the integral (16) for all s such that a > 6. Hence a, < 6.
(ii). By hypothesis,
fo x
(18) B(x) := e'°t dA(t) = F(so ) + o(1) (x ---> cc).
(iii). By Theorem 3(i), we can assert that F(s) converges for s = 0. Setting
a := F(0), we have, with the notation (18),
00 00
a - A(x) = f es° t dB (t) = -e"x B(x) - s o f es° t B(t) dt
x x
00
= so xf cx) {B (x) - B(t)les°' dt = so f o(eac' t ) dt = o(e'°x).
x
Remark. Assertions (ii) and (iii) of Theorem 10 are false for cr o = 0. A counter-
example to the first is provided by the function
0 (0<x<l)
A (x ) = {
1 (x > 1).
The integral (16) then converges for s o = 0, but we do not have A(x) = o(1)
as x tends to infinity. For the second statement, consider
( \ (0 <x<1)
,z1x) = { 02vx ( x > 1) .
Clearly we have
F(i) = 2e - i + t -1 / 2 e -it dt.
1
The integral is well known to be convergent; however A(x) does not tend to a
finite limit as x -> oo .
Theorem 11. Put lc := limsup,c, x -1 logIA(x)l.
(i) If K 0, then we have a-, = k.
(ii) If IC = 0, then either A(x) does not tend to a finite limit as x --> oo and
we have a, = 0, or there exists some real number a such that A(x) ---> a as
x -> Do, and we have
Proof. For each fixed E > 0, we have A(x) <<, e('±')x . Assertion (i) of Theo-
rem 10 then implies that in all cases
(19) a, < K.
Suppose, initially, that Ic > 0. Then the integral F(s) diverges when 0 < a <
k, since otherwise we would have, by Theorem 10(ii), that A(x) = o(e'),
contradicting the definition of lc. Thus a, > ic, and the required equality follows.
116 11.1 Generating functions: Dirichlet series
If n < 0, then A(x) —> 0 as x —> co. Theorem 10(iii) then implies that
for all a > a,. But, by definition of k, no a < K satisfies (20). Hence cr, > K,
so once again we obtain Grc = IC.
Let us now consider the case K = 0. If A(x) does not have a limit as x -4 cc,
the integral F(s) diverges at s = 0, so that a c > 0 and hence the required
conclusion still follows from (19). On the other hand, if A(x) = a ± GPM, we
need to show that o-c = where is the infimum of real numbers a l such that
(22) k(n)
pin
Theorem 10 readily provides some information concerning the distribution func-
tion
N(x,y) := card { n < x : k(n) < y}.
Theorem 12. For any E> 0, and uniformly for 1 < y < x, we have
Proof. The function k(n) is multiplicative. Since the series of positive terms
00
p v=1 P
valid for all m > 0 and x > y > yo (m), is outlined in Exercise 5. (See the Notes
for the best results to date concerning N(x, y).)
The idea of comparing the summatory function of an arithmetic function
with an Euler product underpins Rankin's method (cf. 111.5.1). We show below
how to apply this simple method in order to refine (23).
Proof. If the integer n is counted in N (x , y), then we have trivially, for each E
with 0 < E < 1, that
1 4
5_ (-XTD E
Let us write v := log(x/y). We may assume that v > 2, since otherwise the
conclusion is obvious. It follows that
1 1
N (x , y) < ye" IT 11 + — +
p p(pe — 1) )
fl a; P<Y
P1nP<y
K ,k
< y exp {EV ± — ± 2 –1 }
4 n
E
P<Y I-
with K := Ep 1/(p logp) < 2. Choosing E = V(2/v), and estimating the last
p-sum by Theorem 1.1.9, we obtain the stated result.
118 11.1 Generating functions: Dirichlet series
114 := minx —
nEZ
FT FJh J11+11-
1-1-
o<ii,...,iN<Q h=1 Lc/ Q L
belong, modulo 1, to the same sub-cube. If m and m' are the corresponding
indices, (26) is satisfied with q = — m1D E [D, D-QN ].
Proof of Theorem 14. For any a> 1 and any integer N > 1, we can write
Apply the lemma with Q = 6, D := QN , and an := (1/27r) log n (1 <n < N).
Then there exists some real number 7 - with 6 N < r < 62N such that
Re((s) -a 1
((a)
n< N n> N n> N
(a — 1) -1 {1 — 3N 1— a}
where the last inequality follows from the usual comparison estimates between
series and integrals. For a = 1+4/ log N we have
if N > No . The extra condition 'I > T is satisfied by choosing N> log T/ log 6.
This completes the proof.
The following theorem shows that a Dirichlet series necessarily satisfies cer-
tain bounds in its domain of convergence.
Theorem 15. Let F(s) = ETT ann- s be a Dirichlet series with abscissa of
convergence a,. Let ao > a, and > 0. Then, uniformly for cro < a <a +1,
we have that
71‹ C t
00
(S )1 =
n< N
an 71 - ±
fog N
dA(t)
n< N
f00
±181 j J N
I A(t)le - *-- ac - ') dt.
120 11.1 Generating functions: Dirichlet series
The required upper bound follows from this estimate by choosing N = 1 + [ITI].
The problem of determining whether a given function, analytic in a certain
half-plane, is or is not representable in the form of a Dirichlet series is difficult
in general. Theorem 15 shows that functions satisfying a relation of the type
for some A > 0 play a special role. If F satisfies (29) in a domain 7,, we
say that F has finite order in D. A Dirichlet series has finite order in each
closed half-plane contained in the domain of convergence. If the sum of the
series can be analytically continued, it can happen that the continuation still
has finite order in a larger domain. For example, let us consider the series
G(s) := E TT=1 (-1)nn which furnishes the analytic continuation of ((s) for
the domain a> 0, s 1, in the form
and
F(Cri ± iT) < ITI k1 , F(Cr2 ± iT) < IT1 k2 (IT1 ? 1)
1.6 Order of magnitude in vertical strips 121
where k(c) is the linear function taking the values k 1 and k2 at a l and a2
respectively. We hence deduce that
Remark. The Phragmen—Lindelof theorem in fact implies that, for any E > 0,
the estimate
F(8 ) <6,0-1,0-2 irl k(a)±6
holds in the domain o-1 < a < 0-2, ITI > 1. We shall later have occasion to use
this local uniformity in a.
Theorem 17. For any Dirichlet series F(s), we have
0-2 — a.
(ai a)
0- 2 — al
with strict inequality a(a) < p,(o -i) if p,(a-i) 0 and a- > 0- 1.
Corollary 17.1. If F(s) is a Dirichlet series of finite order for a > ao with
0- 0 < aa, then p,(a-a) = 0.
Proof. This follows immediately from (31) and the last assertion of Theorem 16.
122 11.1 Generating functions: Dirichlet series
Notes
We obtain the result stated above by letting x tend to infinity, and then making
E tend to 0.
Using Cesar° summability for Dirichlet series, it can also be shown that the
equality H(s) = F(s)G(s) remains valid in any domain of convergence common
to the three series F, G, H—cf. Landau (1909), pp. 762, 904, or Hardy & Riesz
(1915), p. 64.
As shown by the example in Exercise 1, it is not in general possible to
deduce the convergence of H(s) = En'_1 (f * g)(n)n - s from that of F(s) and
G(s). By Theorem 5, if F(so) and G(so ) both converge, then, for any E > 0,
F(so + 1 + E) and G(s o + 1 + E) converge absolutely, so H(8) is (absolutely)
convergent for a > cro + 1. With slightly more effort, this result can be made
much more precise (cf. Landau (1909) pp. 759 et seq., or Hardy & Riesz (1915),
p. 67).
Theorem 18 (Stieltjes, 1887). If F(s) and G(s) converge for s = so, then
H(s) = F(8)G(s) converges at s = so +
(33) -1 / 2 =
E f(m)m'/ 2 E
h(n)n
g(d)c1-1 / 2 ± R1 ± R2
n<x m<Vx d<Vx
Ri := g(d)d'1'2 E f (m)m, -1 / 2
d<vx vx<m<x/d
We obtain similarly that R2 = o(1). By hypothesis, the first term of the right-
hand side of (33) tends to F( )G( - ) as x ---+ oo. This completes the proof.
This theorem can have non-trivial applications. Consider for example the
case of F(s) = G(s) = 1 (-1)nn' . By writing each integer n in the form
n = 2vm with 2 t m, we see that
(v = 0)
h(n) = { 7(v(M) 3)7(m)
(v > 1).
Theorem 18 then immediately implies that the series associated to h(n) con-
verges for a> Hence, by Theorem 10, we get
(34) E h(n)
n<x
where
T (x) :=
n<x
we can see, after an elementary calculation, that
Voronors theorem 1.6.7 thus allows us to replace the exponent 1/2 in (34)
by 1/3, and it is natural to conjecture that the abscissa of convergence for
H(s) = F(s)2 is a, = 1/4.
In Exercise 1 we give a simple example of a Dirichlet series F(s) converging
everywhere on the line of convergence a = a, but such that the series F(s)2
diverges everywhere on the same line. If ac (F) and ac (F2 ) are the abscissae of
convergence for F(s) and F(s) 2 respectively, Theorem 18 implies that
00
implies that (37) is equally valid for G(s), although ac (G) = 0. This implies
that
G(s) 4 = c2(1T1 2-4 a ) (0< a < ),
so Theorem 15 shows that a(G4 ) > 1. Hence (36) holds either for F =- G or
for F = G2 .
Actually one can even have equality in (35). Bohr (1910) gave an example
of a Dirichlet series such that a, = 0, aa = 1, and
Theorem 15 readily implies that, for this series, ac (F2 ) > , from which we
have equality by (35).
Notes 125
F(sk) = A n {n — sk — (n + 1) —s k} = E + E (.• •)
n= 77, Nirkn> 7 -;1
: 1-6 k
1
since A 0 when Vrk <n < Tk +6k . The first of the two sums above is clearly
.,‹ E n _ a <<a
n< Vrk
{ skAnn_i_sk +0(s2kn_2_sk ) }
E 1-1-bk
n>7- k
= Sk{Cr Tk
—1 —0-(1±6k) cl / —20" \ 1 1 rl 45k — Cr(1±60) ....,i y1 — Cr(1+61c) •
± Liajk ) -/ cr 'rk ak
126 11.1 Generating functions: Dirichlet series
§ 1.5. The best results known to date about N(x, y) are due to Squalli (1985).
Setting
v := log(x/y) (1 < y
he shows that for each 6 > 0 we have
N(x,y) = yF(v)1+0(i/lo(v+2))
g
(exp{(logx) (112) ±'} < y < x)
and
N(x, y) = yF (v) {1 + ( NAlog2 x/ log x))} exp{ (log x) (3/4) ±6 } <y < x)
where F(v) is the differentiable function defined for v > 0 by
1)
pHH( i
"- ).
m<ev Pim
Squalli also establishes the existence of a sequence of polynomials Qi (j > 1),
with deg Qi < j, such that for each N > 1 we have
Exercises
00
(40) G(s) = (s + 1) 1 t- 8-2 A(t) dt.
1
c° A(n) fn+1
(d) Show that the series E n=1 n t' l- dt converges for each E > 0.
(e) Deduce that (40) defines a regular analytic continuation of G(s) to the
half-plane a > -1.
3. In this exercise assume the following weak consequence of Theorems 3.3 and
3.13: The function ((s) has at least one zero in the closed half-plane a > and
the zeros in the strip 0 < a < 1 are placed symmetrically about the axis a = .
128 11.1 Generating functions: Dirichlet series
(a) Using the properties of the series G(s) := E 77_ 1 (-1)nn — s, show that
((s) is non-zero for real s E ]0, Co[, and is continuable as a meromorphic function
for a > 0, having as sole singularity a simple pole at s = 1.
(b) By considering the series ((8)+(t (s)/((s), show that 0 (x) =
(c) By considering the series 1/((s), show that
Q(x) := p(n) 2 =
n<x
5. (a) Show that the number of integral solutions vi > 0, v2 > 0, ... , vm > 0 of
the inequality
vi < N
1<i<m
(d) Establish the identity p(n) 2 = Ed2 In ti(d) and deduce that, uniformly
for M > 1, p,(M) 2 = 1, y > 1, we have
00
7. Show that, for any fixed real numbers a E 10, 1[, 0 0, the abscissa of
convergence of the Dirichlet series F(s) := EZ-1 e (6n)n_s equals a, = 1 - a.
11.2
Summation formulae
1'+'
(3) A* (X) = — F(s)xss -l ds (x > 0),
271i 1-ico
where the integral is conditionally convergent for x E RNN and converges in
the sense of Cauchy's principal value when x E N.
The proof rests on the following lemma, which amounts to an explicit cal-
culation of the Laplace inversion formula for the function
1 (x > 1)
(4) h(x) = {- (x = 1)
0 (0 < x < 1).
2.1 Perron formulae 131
1 ic±iT ds x'c 1 1\
(i) h(x) xs + (x 1),
27ri - 27rIlog ■T
1 ic±iT ds
27ri L iT s - T + IC •
Provisionally assuming this lemma, let us see how we can deduce for-
mula (3). First suppose that IC > aa. Then the series F(s) is absolutely and
uniformly convergent for a = IC, hence
co
1 f xs )s ds
F(s) ds = an
E(
27ri I S
n=1
K±iT Do
1 Xs
(5) F(s) ds A* (x)
27ri 27r
x k (T1 ± T
1 ') n=1 lioa:(x/n)1 .
i tc±iT K+1.-FiT
F(s)xs 8 -1 ds =- F(s)xss -1 ds + o(1).
fk+1—iT
Proof of Lemma 1.1. Consider first the case when x> 1. Let k be a sufficiently
large integer and let 7?-k denote the rectangle with vertices K - iT', lc + iT,
K — k + iT, lc — k - iT'. By the residue theorem, we may write
1 f ds
xs— = 1 = h(x).
27ri jp s
Now we have the following upper bounds
tc-k+iT
xss -1 ds <
fic±iT - TI log xl '
k—iT '
I tc—k—iT'
tc-k-iv
xss -1 ds <
- Tillogxr
x n—k
f
k—k±iT
es -1 ds
When TI log yl > 1, the estimate (7) follows from Lemma 1.1(i). Otherwise,
we can write
< TI log yI 5_ 1
(rlogy) /s dr-
<
Jo
and consequently, by Lemma 1. 1(u), we see that the left-hand side of (7) is
< yk. This concludes the proof.
Corollary 2.1 (Second effective Perron formula).
Let F(s) := an be a Dirichlet series with finite abscissa of absolute
convergence o-a . Suppose that there exists some real number a > 0 such that
00
1 k±i" ds
(9) E an log (x/n) F(s)xs
27rz
n<x
and
1 r±i' ds
(10) A(t) dt = — F(s)xs+1 8(s )
27ri
Proof. For w > 0 and x E IR+NN, the Perron formula (3) implies
tc±io.
1 ds
E annw
n<x
27rz fc_io.
F(s)x''
(s w)
1 fk +i'
(11) E an (xw — nu, ) — F(s) xs±w ds
27ri K-ioo s(s w)
n<x
and it is clear that this formula remains true also when x is an integer. By
Theorem 1.15, for a- = lc, we have
(12) F (s ) << 1 + 1 71 1--( K — ac m- E,
so both the integral and its derivative with respect to w are absolutely and
uniformly convergent. The conditions for differentiation under the integration
sign are therefore satisfied, and (9) is obtained by taking the derivative at w = 0
of each side of (11). Formula (10) corresponds to the case w = 1 of (11), for
which the left-hand side then equals
fo x
A(t)dt.
C •c < Cf0-
Let 6 be some real number with 0 < 6 < -ao. We apply Theorem 2 with
fixed k > aa ± 1 and x E + N. For each integer n, we have 11og(x/n)1 >
log ((n + D/n) >> 1/n, so that the remainder term in (6) is
00 00
Next, we deform the contour of integration [n - iT, k + ill into the polygonal
line passing through the points - 6 - iT, - 6 ±iT. This alters the integral by the
value of the residue at the pole s = 0, that is to say F(0). We can then write
1
• f F(s)xss -l ds
27rz
from the horizontal segments and from the vertical segment of the new contour.
Since, by assumption, ,u(a) = 0 for a> - 6 for all 6> 0, we have
(Note that we have used here the fact that the bound
F(s)<<T (1 7 1 5 T)
-
is uniform for - 6 < a < lc cf. the remark following the proof of Theo-
rem 11.1.16.)
Substituting in (13) and choosing T := x 6 , E := 6 /2(Ic ± 6), we obtain the
convergence of the series an in the "explicit" form
Theorem 5. Let F(s) :=-- En 00,1 an and G(s) := Enc°_, brim be two
Dirichlet series having respective abscissae of absolute convergence a l and a2 .
For a > al , 3> a-2 , we have
00
(14) urn
1 T
Too 2T fT
F (a ± iT) GO - ir) dr = E
anbn n- a -13 .
n=1
Proof. We have
1 TT oo
x-- 7,
anon x----. ab
mn (sin(Tlog(n/m))) .
2T LT F(a ±iT) G(13 - ir) dr = 2-ana+
13 ±7 ' rric)'n13 T log(n/m) )
n= 1 mon
00
1 f 7'
(15) lim — 1F(s)1 2 dr =
T—>oo 2T _ T
n=1
1 fT
(16) lim -,T, F(s)ns dr = an .
T00 21 _T
The formulae (15) and (16) provide two new instant proofs of the unique-
ness theorem for the representation of a function as a Dirichlet series (Theo-
rem 1.9). The range of validity of relation (15) is, in general, rather difficult to
determine. It can be shown (see Titchmarsh (1939), § 9.5) that the set of points
Notes 137
It is, however, possible that, even when F(s) is not holomorphic for a > cro,
the left-hand side of (15) converges for a = ao . In the case of the zeta function,
Titchmarsh (1951, § 7.2) showed that
1 IT 1
(17) li111 —7, j 1( (8)12 dT = ((20") (CY >
2 )
T —>oo ... 1
Notes
§ 2.2. The Schnee—Landau theorem can be stated thus: If an <, n for any
E > 0 (so that aa < 1) and if F(s) =Enc'cL i an n — s has, for some ao, a regular
continuation in a> ao which satisfies iu(a) < a in this half-plane, then
a, < min
at ± a , ao + a) .
1±a
Exercises
k! f ic-Fico
E an (log -)
x =
27rt
Ti
k
F(s)xss -k-1 ds
K-i00
n<x
+00
00
X 1 k+2°C)
E an ( - ) w (log = F(s)xsiv(r) ds.
n=1
n n 27rz -i00
w(0 (sin(t/2E) 2
)
K -FiT
an < CEek e IF(s)x 8 1 ds1 (lc >
log(x/n)I<e
1)
with C = (sin -2 , and T = 1 E.
4. Let F(s) := E
an be a Dirichlet series such that lim„, F(cy-kir) exists
for at least one value of a > cra . Let A denote this limit.
(a) Show that 1Al 2 = z-d:0-1 2 n-2 '•
(b) Show that A = ai. [Use Corollary 5.2.]
(c) Deduce that F(s) is constant.
11.3
The Riemann zeta function
§ 3.1 Introduction
The zeta function occupies a central role in arithmetic, particularly on ac-
count of its close correlation with prime numbers via Euler's formula (cf. §1.2).
It also has the important property that a wide variety of Dirichlet series may
be expressed in terms of it. In connection with the Perron formulae established
in Chapter 2, the analytic study of the zeta function is hence of major interest.
The evaluation of an integral of the type
r
J—T F(S)X s S -1 dT,
E (--i)nn—sk _ 2 1—sk
n<x n<x/2
n- Sk
n<x
-8 k -
x/2<n<x
n - Sk
obtained by partial summation. We thus see that the stated property holds for
the half-plane a > 0. The same result can also be obtained from the partial
integration formula
00 00 00
((s) = I t' d[t] = I t' dt - i t'd{t}
1- 1 1-
S
= s {t}t - s -l dt,
s- 1 f_'
initially valid for a > 1, which provides an alternative explicit form of the
continuation for a > 0.
The stated result could be derived by successive integrations, starting from
one or other of the preceding formulae—using, for example, in the second case,
the Euler-Maclaurin formula (see Exercise 10). We employ another method,
due to Riemann, which provides additional information useful in establishing
the functional equation.
The starting point is the formula
(1 ) r ( s ) n —s = It o
00
s—i e —nt dt
(0- > 0).
00 f oo 00
F(s)((s) = E ts -l e - nt dt = j t8-1 (et 1) -1 dt.
n =1 0 0
3.2 Analytic continuation 141
/(s) := I zs -1 (ez - 1) -1 dz
cp
is independent of p in ]0, 27[. It is absolutely convergent for each s E C and
uniformly convergent on any compact subset. Thus it defines an entire function
of s. We have
00
(2) /(s) = z8-1 (ez - 1) -1 dz + (e27ris - 1) i t8-1 (et - 1) -1 dt.
izl=P P
This formula, initially valid for a> 1, explicitly gives the analytic continuation
of ((s) to the whole complex plane. When a < 0, the factor r(1 - s) is holo-
morphic, hence ((s) has no singularity other than that already noted at s = 1.
This concludes the proof.
Formula 2 readily gives the value of ((s) at negative integers:
Theorem 2. Letting B, denote the nth Bernoulli number, we have
nBn+1
(4) (( —n) = ( - 1) (n > 0).
n+1
Hence, by (2),
Iz1=P (n + 1)! •
( 5 )
((s) = 23 7 3-1- sin (- 718)17 (1 — 8)0 — 8).
(6) (s 0,1)
(8)
[I (s )r ( s ± ) 21-2.5
,0rF(2s).
3.4 Approximations and bounds in the critical strip 143
Proof. For k > 1, let Rk be the Hankel contour with parameter Pk := (2k + 1)7r.
Then lez - 11 -1 is bounded for z on Hk, and we have
Lk
Let p satisfy 0 < p < 27r. The contour Cp - Rk encircles, in the negative sense,
the poles z = 2n7ri for n = ±1, ±2, ... , ±k. The residue theorem then implies
Using (9), and letting k tend to infinity, we obtain, for each s in the half-plane
a < 0, that
/(s) = (27ri) 8 (e' 8 1)(1 - s). -
Substituting in (3), this gives equation (5) for a- <0. By analytic continuation
it remains valid for all s.
An immediate consequence of Theorems 2 and 3 is the following formula
for ((2n), which can also be obtained as a special case of the Fourier series
expansion for Bernoulli functions — cf. Exercise 1.0.1.
Theorem 4. We have
The asymptotic behaviour of x(s) when a is fixed and 1-7- 1 --> oo can be easily
determined by means of the complex Stirling formula (cf. Titchmarsh (1939),
p. 151), the proof of which is outlined in Exercise 1. We have
(t)
(13) log F(s) = (s - ) log s - s+ log(27r) - f c° B1 dt,
13 t+8
144 11.3 The Riemann zeta function
where B 1 (t) is the first Bernoulli function, and where log s denotes the principal
value of the complex logarithm. From (13), an easy calculation leads to the
asymptotic formula
l _a
(14) 1X(s)ir- 7-r)
1 0 7 1 -4 00.
Thus it follows from (12) that the order of magnitude of ((s) on vertical lines
is completely determined when a < 0. With the notation of §1.6, we deduce in
particular that, for the zeta function,
1
-) = (a < 0)
(15) ii
,( 0
0 (o- ? 1).
The exact value of ,u(a) in the critical strip 0 < a- <1 is a deep problem which
is still unsolved. The most simple hypothesis consistant with (15) is that the
graph of p,(o- ) is formed by two half-lines, viz
This formula is known as Lindelof 's hypothesis. Taking into account the con-
vexity properties of tt(a) (cf. Theorem 1.16), it is equivalent to p,( - ) = 0.
The relations p(0) = and p,(1) = 0 immediately imply by convexity that
p(o-) < (1 - a) for 0 < a < 1. The following result allows us to improve this
estimate.
N1—s CO
(17) ((s) =
n<N
1- s sI IV
{t} t — 8-1 dt.
N 1-.9 00
n-s_ s L t- d,t,.
1 — s iv
t-. d{t}
n<N
(18)
N's
00
= s-f t s -1 {t} dt.
1-s N
Corollary 5.1. Let cro > 0, 0 < 6 < 1. Uniformly for a > a01 x> land
0 < IT1 < ( 1 6)27x, we have
-
X 1-8
(19) c(S) = +0(x-').
1-s
n<x
Proof. For 1'71 < ( 1 - 6)27rx and all y > x, we can write
(20) E Y
n -i, = f t -ir dt + 0(1),
x
x<n<y
applying Theorem 1.6.4 to the function f(t) := -(T/27r) log t, which satisfies
If'(01 <1- 6 for x < t < y. Then, for N > x, we obtain
N N
E n —8 =
x<n<N Ix
y dy +
—.9
i
x
N1-8 _ x i-8
. + 0(x- ').
1-s
Substituting in (17) and letting N ---+ oo, we obtain precisely (19).
Corollary 5.2. We have
(21) ((- + kr) <<1/6 log ( T1 --> co).
Proof. We may assume that r> 0. Theorems 1.6.5 and 1.6.6 applied to f(t) :=
-(T/27r) log t readily give the estimate
En -iT < min (7 1/2 ± a7-112 ) a 112 7 116 ± ay-1/6)
a<n<b
valid uniformly for T > 0, a < b < 2a. Under the additional assumption a << T,
we infer by partial integration that
E
a<n<b
n-i/2-i, << min (0101/2 ± ( a/7) 1/2 , T 1/6 ± a 1/2 T -1/6)
For r < log x/ log 2, let us choose a := 2', b := min(2r+ 1 ,x) and sum all the
corresponding estimates. It follows that
En
n<x
_1/2_ir < T i/6 log T
(X < 'T).
(23) 1((s)1
3 1T1 1-c' (cr -?- a, ITI -?- 1 ).
2a(1 - a)
Remark. Clearly, the explicit bound (23) has no theoretical interest unless a =
a(r) ----> 1. Otherwise, relation (22) provides a superior result.
Proof. By (17), we can write
n<N
Moreover we plainly have
N
E n - a < 1 + i Ca dt <
1
N1 a
1-a
-
1 ± 1 ± 2a +1-7-11
1((s)1 5- N 1-c' { 1 - a
ITI 2aN I
Choosing N = [1-7- 1], we obtain
N'a
I((s)i - a(1 - a) fa 4-a(1 - a) ± (1 4 - a)(1 - a)}.
<
Corollary 7.1. For any positive constant c and any integer k > 0, we have
Proof. Clearly we may suppose that ITI > 3. For some suitable constant c o and
r := co / logIT1, the disc 1z - ,s1 <r is contained in a domain where (24) applies.
The desired bound then follows from Cauchy's formula
((k) ( s ) _ k!
((s + z)
dz
27ri 1 1=r zk -m- •
that ((s) does not vanish for a > 1. We shall show in this section that this
property extends to the closed half-plane a > 1. The following result, simple
but cunning, enables us to perform the limit process.
Theorem 8 (de La Vallee-Poussin). Let F(s) := En=i an n s be a Dirich-
let series with non-negative coefficients and with abscissa of convergence a,.
Then we have
(26) 3F(a) + 4Re F(o- + ir) +Re F(a + 2iT) > 0 (a > as ).
A(n) _ s
F(s) = log ((s) = - log(1 _ p') n
log n
P n>2
shows that these non-trivial zeros are not real. The functional equation and the
fact that ((s) is real for real s imply that they are distributed symmetrically
with respect to the line a = and the real axis T = 0.
Another, particularly elegant proof of Theorem 9 is due to Ingham (1930).
It rests on Ramanujan's identity
s) 2 ((s ± i o) ((s i e)
00
(( —
(28) (
e)1 2 n—s
((2s)
i0
(29) 7 - (n, 0) •
di m
3.6 Lemmas from complex analysis 149
The formula (28) may be easily checked by identifying the factors of the Euler
products on the two sides see also the Notes. Let us consider the abscissa of
convergence a, of (28). We have a, < 1 since 17- (n, 0)1 < r(n). By Theorem
1.3, the sum of the series is holomorphic for a> a,. Since the coefficients are
non-negative, the point s = a, is a singularity. Now if s = 1+ i0 is a zero of
((s), so is s =1-0 (since ((s) = ((s)) and these two zeros compensate for the
double pole of ((s) 2 at s = 1. But ((s) has no singularity other than at s = 1,
hence it follows from the above that (28) is holomorphic on the real axis down
to the first zero of ((2s), that is to say s = —1. We would then have a, = —1,
which is absurd since 17- (n, 0)1 does not tend to 0.
2,
(30) - r I log IF (Rei e ')1 (10 .
n(r) dr = TT-
and sum the equalities thus obtained. In order to establish (30), it therefore
suffices to show that we can allow e to tend to 0. In other words that, for each
j > 1, we have
Since F has only a finite number of zeros on the circle 1,51= r j , it is clear that
it suffices to consider the case when s = rj is the only zero, say of multiplicity
m > 1, of F(s) on this circle. Then
r ,0
logIF(re ie )I = m log 1 — —e - H (r, 0)
Ti
for 1r — rj I < erg , 0 < 0 < 27r, where H(r,0) is a continuous function of r and 0.
Therefore (32) is equivalent to
1— (1 ± E)ei 6
(33) lirn f 7r log dO = 0.
J-7 1—(e)i
By a straightforward calculation it can be checked that, uniformly for 101 < 71,
we have
2
1 — (1+ E)ei 9 E2 ± 4(1 ± e) sin2 1 0
2 = 1 + 0(e).
1—(e)i E 2 ± 4(1 — e) sin2 0
Rn
(34) sup — 1F(n) (0)1 < 2A.
n>1 n!
Corollary 11.1. Under the same assumptions, for any r with 0 < r < R, we
have that
2Ar
(35) M(r) := max IF(s)I < .
R—r
E a sn
00
00
F(Reie ) = E
n=1
aRTh cos(n0 ± On),
3.7 Global distribution of zeros 151
the series being absolutely and uniformly convergent for 0 < 0 < 271. But the
fact that F is holomorphic implies that the function Re F(s) has the mean
value property
/0 27r
Re F (Reie ) dr9 = O.
Fi (s) 4R log M
(36) sup <
1.917- F(s) — (R — 20 2.
Proof. Let G(s) := F(s) 1 1pe z (1 — 8 / p) -1 . Then 1G(s)1 < M for Isl = R,
since each factor of the product over p has modulus at most 1. The function
H(s) := log G(s) satisfies the conditions of Theorem 11, with A = log M, in
the disc Is' < R. For Is' = r < R, we then have
1 rn-11H(n)(0)1
1 1-1'(s)1 c° (n — 1)!
n=1
4R log M
< 2 log M E nrn-i(R/2)-n ,
(R — 20 2.
n=1
The regularity of e for a > follows from that of (, since the pole at s = 1
is compensated by the factor (s — 1). By (38), e(s) is thus an entire function.
We note that e(0) = e(1) = 1.
From Theorem 9, e(s) 0 for a > 1. By (38), it follows that e(s) does not
vanish for a < 0. Hence all the zeros of e(s) lie in the critical strip 0 < a < 1.
In this strip, e(s) and ((s) have the same zeros. The situation is different in
the half-plane a < 0, since there the trivial zeros of ((s) are compensated by
the poles of F(s).
Traditionally, the general notation for a zero of e (i.e. a non-trivial zero of ()
is p = 0 + iry , and we set
N(T) := 1,
p: 0<-y <T
where, here and in the sequel, we adopt the convention that all zeros p are
counted with multiplicity.
In this section, our aim is to establish an asymptotic formula for N(T)
as T ---+ Do. With this in mind, it is clearly worthwhile to have information on
the order of growth of a(s). By Theorem 6, we have
Moreover, integrating by parts the last term in the complex Stirling formula
(13), we obtain
1 00 B2(t) A
(40) log F(s) = (s — ) log s — s -I- log(27r) ± 21 /0 (t ± s)2 ut
12s
where B2 (t) = {t} 2 — {0+ .26- denotes the second Bernoulli function. This implies
that
In the first instance, for all r with N/5 < r < 3, the disc IS < r contains the
rectangle with vertices —2, —2 + i, —1, —1 + i, hence n(r) > N(T + 1) — N(T).
Next, from (39) and (40), we readily deduce the estimate
T T T
(43) N(T) = — log — — — + 0(log T).
27r 27r 27r
Remark. This implies, in particular, that ((s) has infinitely many non-trivial
zeros.
Proof. Without loss of generality, we may assume that T is not the ordinate
of a zero of e(s). Let R, be the positively oriented rectangle with vertices 2 +
iT, 1+ iT. By a well-known formula,
—
1
2N(T) = -277i JRf , ' (s) th _ 1 •5.- rn f
e(s) - as.
JR c ( s )
",(8,)
27i
1 f (s) ds = 1 [ arg
(44) N(T) = ;Tr (s'rn (s)] ,c
he(s) 71
where L is the polygonal line 2,2 + iT, + iT. Now, we have
- '(s) 1 1
(45) ill( s) ± (' ( 8 )
The contribution to (44) of the first two terms is clearly 0(1). That of the third
is exactly
— 11 log 71.
27
154 11.3 The Riem,ann zeta function
1 s) ds 1
fc r(12_ s) 2 = -77-1 1'm log re,- +
T T T
= — log - - — + 0(1).
27 2 271
Thus, it only remains to prove that
1 2-HT 00
A(n) n-2
('(8) ds <00.
((s) log n
n=2
In order to deal with the horizontal segment, we appeal to Corollary 11.2 with
F(s) = ((2 iT + s) 1 ((2 + iT) and R = 4, r = 3/2. The numerator is of the
order of a power of T, and the denominator satisfies
00
Letting Z denote the finite sequence of zeros of ((s) (counted with multiplicity)
in the disc Is 2- ir 5_ we therefore obtain that, for each s in the horizontal
segment iT , 2 ± ,
1
< log T.
pEZ
C%-
ds
-S777, <
12-FiT s-p
P(s) := H ( 1 - s /p) es /P ( s c C)
P
where the general term is, for each s, of the form 1 ± 0(1p1 -2 ). P(s) is therefore
an entire function of s having the same zeros as e(s) and, since P(0) = e(0) = 1,
we can find a holomorphic determination F(s) of log (e(s)/P(s)) taking the
value 0 at s = 0. We shall see that
with J := [Clog k]. By the pigeonhole principle, there exists at least one
interval [k +2j1J,k + (2j +2)IJ]with 0 < j < J, containing no 1 /91. We can
thus choose R = k + (2j + 1)/J.
Consider some R satisfying (51). For 181= R, we have
R
E (2+ log(2CR log R)) +
v, R2
> 1YD.: —Ipl+ R/2<lpl<2R 1 L-w 110 1 2 } '
I PI-LL /2 IP1>2R
where we have used the inequalities
- 1z1 ( 1z I >2)
log1(1 - z)e z 1 - 2 -1- log Ii - z1 (- 5_ 1 .z1 < 2)
{
- 1z1 2 ( lz 1 < D.
Using (42), we deduce from the above lower bound that
1og(1/1P(s)1) << R(logR) 2 .
We omit the details, which are easy. Now, by (41), we also have
log 1"(s)1 << R log R.
This implies (50) and so completes the proof.
We conclude this section with a remark. The bound
N(T +1) - N(T) < log T
sufficed to prove the product formula (47) for (s). Now this formula easily
implies that N(T) -4 oo as T -> oo. Indeed, otherwise, (s) would have the
form
(s) = e" Q( 8 )
where c is a constant and Q(s) is a polynomial in s. The functional equation
(s) = (1 - s) would then imply that c = 0, but Stirling's formula applied
to real, positive s prevents from having polynomial growth. Actually, Titch-
marsh (1951, §9.1) provides a simple argument which enables one to deduce
the estimate
N(T + A) - N(T) >> 1
(for some suitable absolute constant A), merely from the functional equation.
3.9 Zero-free regions 157
Theorem 15. There exists some positive constant c such that ((s) has no
zero in the region of the complex plane defined by the inequality
CO
A(n)n'
has non-negative coefficients. By Theorem 8, it hence follows that, for all a > 1
and all real -y, we have
We shall obtain the stated result by bounding each of the three terms on the
left-hand side from above, for -y the ordinate of a zero p = 13 ± i-y of ((s).
In the first place, we have
('(a) = 1 + 0(1).
((a) a—1
(/(8) , 1 rv(-s + 1)
(54) - 1° 4- s — 1
4- 2r(v1 ± 1)
(GS)
Estimating the term involving F by Stirling's formula (13), and noting that,
for a > 1, the numbers p and (s — p) have positive real part, we have
and also, taking into account the contribution of the zero 0 + i-y,
— Re
((a ± vY) —
7)
('(cr ± i < 0(log hd)
a
1
—0.
Substituting these estimates in (53), we then deduce the existence of a constant
ci such that for 1-YI>_ 2 we have
3 4
> —ci log l'yl
a — 1 a —0
from which we get
1— 0 > 1 — c i (a — 1) log 'y
— (3/(a — 1)) + e l log lryl •
Choosing a =1 +1 /( 2 cilogi'Yi), it follows that
(59) -Re (1(s) < K log T (T > 4, a > 1 - 4c1 log 7-)
((s) -
('(so ± w)
-Re ,, < 2K log T.
(ASO ± W) —
F (w ) := C( (ssoo
( )
('(.90 + w)
(( se ± w)
satisfies the hypotheses of the Borel-Caratheodory theorem in the disc I wl < 477,
with A := 2K log T ± IC (S0)1C(S0) I. Since Is - so I __ 277, Corollary 11.1 implies
that
<4Klogr+3
Since the estimate (56) is immediate from (57), it suffices to establish this
last inequality. To this end, we use (55) in the form
Notes
Qs ) =
n x
—s ± x(s) E
n<y
ns -1 + 0(x - a ± T 1 —cf ycr-1 )
= E K a L b m a+b _ E E na,b
K Lmin din [1c,X]=d
(K,L)=1
= E kaAb = 0-a(n)o-b(n).
tc,Ain
§ 3.7. Theorem 13, conjectured by Riemann in his memoir of 1859, was proved
by von Mangoldt in 1895. Let S(T) = (1/7r) [ arg ((s)] r where r is the polygo-
nal line r joining 2, 2+ iT , ± iT . In his book, Titchmarsh (1951, §9.2) shows
that
T 1
N (T) = — lo ± 7- + S(T) ± 0 (- ) .
27r 27r 27r 8 T
§ 3.9. Note that, in general, an upper bound for ((s) on the line a = 1 provides,
via Corollary 11.2, a corresponding upper bound for -Re ((/ (s)/((s)) in terms
of the abscissae of the zeros, by an argument analogous to that used in the
proof of Theorem 15. Thus we can state that, conceptually, it is equivalent to
bound I ((1 ± i/7- )1 and to determine a zero-free region for ((s). Korobov (1958)
and Vinogradov (1958) established the upper bound
from which follows the best zero-free region known to date, namely
For a detailed proof of (61) and the deduction of (62), see Ivi6 (1985), chapter 6.
162 11.3 The Riemann zeta function
§ 3.10. The zero-free region (62) stated above easily implies improvements on
the estimates in Theorem 16. One obtains
Exercises
(a) Using the Euler—Maclaurin summation formula for 1(t) := log(t ± z),
show that one has for N> 1, z E Clik — ,
N
E log (1 ± Tiz ) =z(1 ± log N) — (z + n log z
n=1
" Bi (t)
log(2)
7rf± o — t ± z dt ± RN (Z)
Bi (t)
log F (z) = (z — 1 ) log z — z ± log(2)
7r) — cx) dt.
0 t ±z
Exercises 163
co ( q ) — 1 \--` x ( m ) x (n )
1, if n m(mod q), (m, q) = 1,
0, otherwise,
where the x-sum is taken over all Dirichlet characters to the modulus q.
(c) For each x, one defines the Dirichlet series
00
L(s,) =>x(n)n.
n=1
i 3a q -1/2
(s, x) := ( 8 ± a )L(s,
2 X), E(X) :=
E x (h)e(h/ q).
(:)
—i(s+a)r h(mod q)
4. Let Tk(n) denote the number of solutions in integers ml, m2, ... , m k > 1 of
the equation n = mim2 • • • mk•
(a) Show that ET7 1
k(m) -s = ((s) k .
(b) Show that, for each k > 1, there exists some 6k > 0, which can be
calculated explicitly, such that
where Pk_i denotes a polynomial of degree k -1. Determine the coefficients for
k = 1, 2, 3. What value for 6k can be obtained assuming Lindelof's hypothesis?
5. Show that, for s = 1+ ir, one has
E p(n)n-s = 2i
--7r riT
((s + w) -1 xww -1 dw + 0(T -1 log x + x -1 ),
n<x K.—iT
uniformly for x > 2, T > 1, and with lc = 1/ log x. Using the calculations in
§3.10, deduce the estimate
6. Use the effective Perron formula and some estimates from §3.10 to show that
7. Let r" (n) be the number of odd divisors of an integer n. Determine the
Dirichlet series associated with T*(n) and use complex integration to deduce
the asymptotic expansion
with ak (z) k -1 E dik bt(d)ziqd, in any simply connected open subset of the
half-plane a > 0 which excludes all values of s such that ks (k = 1,2, ...) is
either a pole or a zero of the zeta function.
9. Show that the following asymptotic formulae are equivalent to Lindelof's
hypothesis:
(1) 0(x) = x
(2) 7r(x) = li(x) ± 0 (xe-c V(log x)) .
Proof. The second formula follows easily from the first by partial summation.
Let us prove (1). Since A(n) _< log rt and 1( / (1+a)/((1 cr)i < 1/o - (a > 0), the
second effective Perron formula (cf. §2.1) allows us to write, for x > 2, T > 2,
(4)
1
27ri
ft-FiT
(s) x 3 ds = x 1
27ri
r ('(s) x 8
((s) s
ds
(( 8 ) 8
Choosing T := exp V(co log x) , we see that the integral over g is of order at
most that of the remainder term of (1) provided c < Vco. Substituting (4) in
(3) then implies the desired result.
I((so) 1 4 c 1i 3 ( log 1 7 1) -1 ,
from which it follows that
We thus obtain
It is remarkable that one can equally well arrive at the prime number theo-
rem (in its weakest form, i.e. 0(x) x) using only the properties of ((s) in the
half-plane a > 1.
Indeed, let us apply Perron's formula (2.10) to the Dirichlet series
00
F(s) = (i (s) ((s) =
((s) (A(n) — 1)n'.
n=1
with
1 +cx) F(1+iT)er
—
27 I 00 (1 ± iy) (2 ± ir)
Thus J(x) is the Fourier transform at the point (— log x) of an integrable func-
tion. By the Riemann—Lebesgue lemma, we then deduce that J(x) = o(1) (as
x oo), from which it we derive that
0(0 dt = x 2 + x 2 E(x)
with E(x) = J(x) + 0(11x) = o(1).
Let us define ri(x) := ma9x<y<i x IE(y)I. For all h, 0 < h < x, we can
write
i 2 n < 1 r
x_, h— it il(x)x`
x-h
1 x±h
f
dt < x + h + -4-x277(x).
h
170 11.4 The prime number theorem and the Riemann hypothesis
( 1 ( 8 )
= 8
is "approximately equal to —
2T, log L -"
(cf. Theorem 3.13), he writes: "indeed,
one encounters between these limits a number of real roots approximately equal
to this, and it is very probable that all the roots are real". However, a note
has been found in Riemann's personal papers, specifying that he has "not yet
completed the proof" of the first point—cf. Riemann (1859) pp. 168-169 and
175.
4.3 The Riemann hypothesis 171
Neither of Riemann's two claims has yet been proved or disproved. In 1914
Hardy established that ((s) has infinitely many zeros on the critical line a =
and in 1942 Selberg showed that a positive proportion of the zeros is definitely
located on the critical line, in other words that, for some c > 0,
(13) No (T) := card {r : 0 < T < T, (-(- + ir) = 0} > eN (T) (T --- oo).
Today explicit values of c are available, following the work of Levinson (1974),
who showed that for T sufficiently large one can take c = 0.342.
The statement that all the roots of (12) are real (that is to say that N o (T) =
N(T)) is known under the name of the Riemann hypothesis. It is one of the most
famous conjectures of mathematics. It has profound implications throughout
analytic number theory. In this section we shall develop two of them.
Theorem 2. The Riemann hypothesis implies that of Lindelof. More precisely,
if all the non-trivial zeros of ((s) have real part equal to , then we have
Proof of Lemma. For R1 <r < R2 the maximum modulus principle applied to
smF(s)n immediately implies that
RTM(Ri )n TE2nM(R2)n
rrnM(r)n < + .
r — R1 R2 — r
Let a be such that RTM(Ri) = R,YM(R2). In the above inequality, let m and
n tend to infinity in such a way that m/n --> a. We then obtain
log(R2 r) log(r/Ri)
(15) log M(r) < / log M(R 1 ) + ' log M(R2).
log(R2/Ri) log(R2/R1)
Proof of Theorem 2. Observe first of all that the Riemann hypothesis implies
that, for sufficiently large 'r, the function
is holomorphic in the disc 1w1 < --. Since it follows from Theorem 3.7 (for
example) that, for all positive 6,
Then, let s := a +iT with a > --, T > ro , and consider parameters al , E,
such that cr1 > 1, 0 < E < min{ai —1, a ---}. Applying the three circles lemma
to log ((al + iT ± W) for
Ri := al — 1 — E, r := al — a, R2 := al — 1 1— E
(so that, on these three circles, the point a l +ir +w passes respectively through
1 + E ± iT, S, ± E ± iT) we obtain
I log ( s
( )
with
log
log(r/Ri) ai — 1 — 6
a := , 2(1 — a + 6) (al —> co)
log(R2/R1) . al — 1 — E)
log (
al — 1 — E i
and
Mi. < sup I log ((a + ir)I <<, 1, M2 < e log T,
cr>1-1-e
where the last estimate follows from (16). Letting a l tend to infinity, we obtain
the stated result.
4.3 The Riemann hypothesis 173
Theorem 3. Let 0 be the lower bound of those real numbers such that
0(x) = x + 0(x).
Then we have
(17) 0 = sup 13
where the supremum is taken over all non-trivial zeros p =j3 + i7 of ((s).
Notes
§ 4.1. The best error term known for the prime number theorem is
then Z(T) certainly vanishes on the interval [T, 2T] for sufficiently large T.
Titchmarsh's choice is
Z (r) := x( ± 2 T) 1/2 ( ( ± 2 T )
where x(s) denotes the "elementary" factor of the functional equation of ((s)
cf. formula (3.11). The equation ((s) = x(s)((1 - s) shows immediately that
x(s)x(1 - s) = 1
((/4)+E: ( a < 1)
X( S ) -1 / 2 C( S ) <6 (3/8)+
(1 < a < i) •
fT2T
(22) Z(r) dr <, T7/ 8 .
17,2T 2T IT2T
I Z MI dr = I 1( (- ± HI d7 > ir) dr
T
By the residue theorem, we can write
2T
i jT ( (- ± iT) dT = f ((S) ds
g
where g is the polygonal line joining ± iT , 2 ± iT , 2 ± 2iT, ± 2iT.
Since ((s) <, T( 1 / 4) +e on 0, the contribution of the horizontal segments is
<6 T(1/4)+E . The contribution of the vertical segment is
oo
i E n -2
n=1
I
T
2T 00
We thus have
fT2T
1Z MI dr >> T,
which establishes (20).
A trivial modification of the argument presented above would give a lower
bound of the type
No (T) >>
(cf. Blanchard (1969), chapter IV.6).
The best numerical constant obtained so far via (a refinement of) the
method introduced by Levinson cf. (13) is due to Conrey (1989) and ex-
ceeds 2/5.
Exercises 177
valid for x> 1, x p'1 , where the sum over p converges in "principal value":
xP XP
E P
_p :, iim
T--+co
17I<T
P
Exercises
1
00
E
n<x
k w(n) = XPk_i (log x) + 0,(x 1-6 k+e)
1
x
(NO dt = ax 2 + 0(x2e-c0rvo0g x)) (x --> co)
with a := G(1) =
(f) Using the monotonicity of (1), show by an argument analogous to that
leading to (10) that one has
(I)(x) = ax ±
(g) Show that
an < ne —c-V(log n) (n > 1).
Exercises 179
3. Show that the Riemann hypothesis implies the convergence of the series
E u(n)n 3 for all s in the half-plane a > 1. [Hint: use Theorem 2 and the
Schnee-Landau theorem 2.41
4. Put r(n,0) := E din d0.' Using Ramanujan's identity (3.28), show that, uni-
formly for x> 2, 101 < 1, 0 0, one has
1 1 fl
w(1) f 17(n,9)1 2 dO 5_ T(n) < 1T(n, OW dO.
-1 - 2.71w(1) L i
V T(n)-----; x(log X) 2 .
ntdx
11.5
The Selberg—Delange method
E A(7-,,)=-
n<x
X ± 0 (Xe —c \ / (1°g x) )
0 o g x))
ii (n ) = 0(x e- c -V
n<x
E
n<x
Tic(n) = xPk-i(logx) + 0 k(X 1-6k )
for a > 1, where G(s, z) satisfies certain conditions to which we will return
later.
Before developing this theory, due essentially to Selberg (1954) and Delange
(1959, 1971), let us take a deeper look at the analytic nature of ((s)z where z
is a fixed complex number.
5.1 Complex powers of ((s) 181
,
1,-1
(w) :_ 1 ri ,
( w_ j) (w E C,
v v!
J=0
(3) v 1 )
(4) rk (n) = E 1.
dld2...dk=n
Of course, T2 = r.
plays a special role. It is defined on any simply connected domain which does
not contain a zero of ((s). We shall always suppose that this domain includes
the real half-line [1, ± CXD [ . We can then choose the principal value of the complex
logarithm, so that
Z(1, z) = 1.
182 11.5 The Selberg—Delange method
where the coefficients yi(z) are entire functions of z satisfying, for all A > 0
and E> 0, the upper bound
1
( 7) «j1,6 (1 ± (Iz1 5_ A).
3!
Proof. All the assertions in the statement of the theorem follow immediately
from the fact that ((s) does not vanish for Is - 11 <1, via Cauchy's formula
1 1 ds
(z) = Z(s; z)
3. 27ri is-11=r (s — 1)i+ 1- •
p= + i 14.13472 . . . .
For the sake of completeness, we give a quick proof of the fact that ((s) 0
for Is - 11 < 1. After integration by parts, formula (3.18) gives
By Theorem 3.15, there exists an absolute positive constant c such that ((s)
does not vanish in the region
In the rest of this chapter we let 7, denote the simply connected domain obtained
by deleting the real segment [1 — c,1] from the region (8). We then have the
analytic continuation
Moreover, the upper bound I log ((s)1 5_ log 2 I7-1 + 0(1) from Theorem 3.16
shows that we have, for each constant A> 0,
op,
--,
0 o
)
Theorem 2 (Hankel's formula). Let 7--t be a Hankel contour. For any com-
plex number z, we have
1 1 i
= ht s — zes ds .
F(z) 27ri
184 11.5 The Selberg—Delange method
Proof. The integral is absolutely and uniformly convergent for each z. It thus
defines an entire function of z which, by the residue theorem, is independent of
r, since the sole singularity of the integrand occurs at s = 0. When Re z < 1,
the integral round the circular part Isl = r of the Hankel contour tends to 0
with r, whereas the integral along the doubled half-line tends to
This proves (11) when Re z < 1, and hence for all z by analytic continuation.
Corollary 2.1. For each X > 1, let 7-i(X) denote the part of the Hankel
contour situated in the half-plane a > -X. Then we have uniformly for z E C
j7i
f (x) s - zes ds = 1
(12) +0(47 1 z 1 r(1 + izi) e - Ax).
F(z)
The difference between the left-hand side of (12) and the integral (11) is there-
fore
< 0-1zle-a do- <
Jx Jo
The change of variable a = 2t gives the stated bound, since 2e" <47.
G(s; z) := F(s)((s)—z
5.3 The main result 185
If F(s) has the property 'P(z; c o , 6, M) and if there exists a sequence of positive
real numbers fb ri l_ i such that
satisfies 'P(w, 03,6, M) for a certain complex number w, we say that F(s) has
type 7 (z , w; c o , 6, M). It is worthwhile to bear in mind that a series with positive
coefficients having property 'P(z; co , 6, M) is trivially of type T(z, z; co , 6, M).
In the domain where G(s, z) is holomorphic we set
ak
(14) G(k) (s; z) :=
ask
and
1
(15) )k (z) E 1 G(h)(1; z)-y3(z)
r(Z — k) h +3=k h!j!
(16) E an = x(logx)z -1 {
n<x k=
Ak (Z)
x)k
0(MRN
(log
(X)) }
with
:= e -ciM og x) (c2N ± 1 Ard-1
R N (x)
log x
186 11.5 The Selberg—Delange method
The positive constants c l , c2 and the implicit constant in the Landau symbol
depend at most on co, .5 and A.
We shall see in the next chapter (§6.2) how one can take advantage of
the uniformity in M in formula (16). Uniformity in N is equally significant.
Consider for example the case when z E Z. Formula (15) then shows that
Ak (z) = 0 whenever k > z. We can hence choose N so as to minimise the error
term in (16). For the value N := [(log x)/ec 2] we obtain
f (pu) = Pv + 0 (Pu-6 ) (v 1)
with .5 > 0.
5.4 Proof of Theorem 3 187
1 T'C+i°° ds
fx0 A(t) dt = - F(s)xs+1
2 7ri jft _ico s(s + 1)
Selecting T = exp NA(c15) log x), for x > xo, it then follows that
t A
iT
K CT
•
-iT
with
(20) (1)(x) := 27
1 i fr lds
s(s±F(s)x± 1) .
Here, and for the rest of the proof, we make the convention that all constants,
explicit or implicit, depend at most on co, S and A.
It remains to study the main term (1)(x) of (19). Clearly it is an infinitely
differentiable function of x on IR+, and in particular we have
(1)/ (x) = i F
(s)xs ds , c1)"(x) = f F (s)xs -1 ds.
27ri r s 27ri r
CO
with
1
9k(z) := 2 ( 4 )G (h) (1; z)-y (z) = 1(z - k)Ak(z).
h±j=k j
In addition, since G(s, z)Z (s; z) is holomorphic and 0(M) in the disc
- 11 < c, the Cauchy formulae imply that
g k(z) M c -k
N+1
G(s; z)Z (s; z gk(z)(s - 1) k + 0 (M - 111 O )
k=0
Therefore
(23)
k=0
gk(z)
1
fr -xs(s 1) k- zds +0(Mc-N R(x)),
with
R(x) :=
Jr I xs(s — i) N+1- 1 I ds1
f1—r
( 1 0.)N+1—aite z x cr -Fr r N+2-3ite z
dri xl
1—c/2
+cxj tN +1—Re z -t
x(log X) Re z— N —2 f e dt + 2-N }
N
A(z)
x(log x)z -1- { +0(EN)}
(log x)k
k=
with
N N
EN := x —c/4 Eigk(z)1(c,k+ 1 ) k
< MX —c/4 4 k! ( 5 \ k
log x clogx)
k=0 k=0
We shall show, by using (19) and (22), that 4)'(x) is a suitable approximation
for A(x). To this end, let us take a parameter h, 0 < h < x12, and apply (19)
for both x and x + h. Subtracting these estimates, we obtain
x-Fh,
A(t) dt = (I). (x + h) — ( I). (x) + 0 (M x 2 e — c3 \ /(11Dg x ) )
(25) fx
while (22) implies that
1
(I) (x ± h) — ( I)(x) = 1143.' (x) ± h2 f (1 — t)(1)" (x + th) dt
(26) o
= h' (x) ± 0 (M h2 (log x) A ) .
5.5 A variant of the main theorem 191
with
x±h
L := 1A(t) - A(x) I dt.
Jx
It is at this stage that we use the hypothesis that the series associated with
0
b n }: -10=1 has property P(w; co, 6, M). Writing B (t) := bn we have
„ t
f
( 28) L< (B (t) - B (x)) dt < B(t) dt - B(t) dt.
x—h
Now, our assumptions on the sequence {b n } 1 imply the existence of an in-
finitely differentiable function 43.1, satisfying (26), such that (25) is valid on
replacing A(t) by B(t) and 41. by (D i . By (28) this implies that
L mx 2_c9 00g x) mh 2( l0gx) A .
I N := 11),(n)I(log3n)1z1+N+2n-i.
fl=
192 11.5 The Selberg-Delange method
( c2N +1)N+1
with
RN(x) := e - "V( 1°gx) ±
log x )
,
(31) T(x) :=
n<x
we obtain
where the -yk(z) are the entire functions from Theorem 1, and where c3, C4 are
positive constants only depending on A.
Next, the convolution formula
an = rz * bz (n) (n = 1, 2, . . .)
enables us to write
A(x) =
mn<x
T(m)b(n) = >7,
n<x
bz (n)Tz (x/n).
The stated result follows from this formula, evaluating Tz (x1n) by (32). At
several points we will have occasion to use the following estimate, valid for
y > 1 and 0 < h < N + 2,
Vx<n<x12 ,Vx<n<x12
log n -1z1-1 n
<A 1(10g X) Re z-1 E lb(n)I(1 log x)
Vx<n<x12
where the final bound follows from (34) with h = 0, y = Substituting these
estimates in (33) it follows that
(35) A(x) = bz (n)Tz (x1n) 0A(2 N HN(z)x(logx) z-N-2 ).
n<Vx
Tz x -yk(z)
— log —
n k k!r(z — k) n
(36) =0
N+1)
± OA (e-c,voogx) (c6N ± 1 )
log x
The desired estimate for A(x) will be obtained by expanding the terms
(log x - log 71)z -k-1 in the above formula by means of the generalised bino-
mial formula, and then substituting in (35).
194 11.5 The Selberg-Delange method
x { E eyk (z) (z - k - 1)
- (log x)z-rn-1 (— log n) h
n
m,=
k! r(z - k) h
h±k=m
>7,
n<Vx
bz (n)Tz (x1n)
log x )
For each h the inner sum over n is equal to
G (h) (1; z) + 0A(2 N HN(z)(logx) h-N-2 ),
using (34). The contribution of the error term to the previous expression is
thus
± OA (IIN (Z)
Notes
§ 5.1. The method of contour integration allows us to handle the case of Dirich-
let series having singularities different in nature from those of (s - 1) - z as
considered here. Delange (1954) provided a formula for a singularity of type
)k
(s - 1) - w ( log s 1
((s) = sf {t}t - s -l dt
s-1
readily gives for j > 1
dt
= ( 1)j I
- {t}(logt)j- .
t2
In particular we have ry i (1) = -y - 1, where -y is Euler's constant.
196 11.5 The Selberg—Delange method
§ 5.3. The precise form obtained by Bateman for the error term in (17) is
0(x exp { - c i2 V(log x log2 x)}) for any constant c 12 < N/2. The method of
the present chapter gives the same result by slightly altering the contour of
integration. Balazard & Smati (1990) have recently shown that this estimate
can be precisely recovered by an elementary method.
It follows in particular from formula (17) that
an := card {m : co (m) = n} < ne —g
cV(lon) (n ?_ 1).
In 1935, Eras showed that there exists a constant (5> 0 such that
an > n6
for infinitely many integers n, and he conjectures that this result remains true
for any .5 < 1. In the same direction, he ventures the hypothesis that the error
term in (17) is
St (x exp { c 1°g x })
log2 x
for a suitable positive constant c.
§ 5.4. The proof which we give here is largely inspired by that of Delange
(1971) in which we have made explicit the dependence of the error term on M
and N.
Formula (29) can yield a simpler approximation for A(x) when the contri-
bution to V (x) from the circular part of the Hankel contour tends to zero with
the radius. For example, under the hypotheses of Theorem 3 and assuming in
addition that Re z < 1 - E, we obtain in this way that
C13
dt
A(x)=
J. X
1-t
a(t, z) — + 0 (xe- ci4Ologx))
tz
where c 13 and c14 are positive constants, depending only on A and e, and where
a(t; z) is the continuous function on [0, c 13 ] defined by
sin
a(t; z) = 7rz Z(1 — t; z)G(1 - t; z).
71
Of course one would be able to recover this result from (16) by choosing
N = N (x) to tend to infinity suitably, writing
1 sin 71-z
= ( i)kr(k + 1 - z)
r(z - k) 71
= sin 7rz ( 1)k x
e—ttk—z Cit ± remainder
71 J0
with X := c13 log x, and interchanging summation and integration in the main
term of (16).
Exercises 197
Exercises
3. Evaluate, uniformly for x > 2, 1 < q < x , the sum E 117- (n).
n<x,(n,q)=1
1<n<x
6. Use the method of the previous exercise to evaluate E i<n<x p,(n) 2 1 c.o (n).
7. Same question for E l<7, <x cp(n) I w (n) .
8. Let F(s) := E ann- s be a Dirichlet series with non-negative coefficients.
Suppose that F(s) may be continued by continuity for a = 1, s 1, and in a
punctured disc 0 < Is - 11 < c. Assume also that the continuation satisfies the
conditions
(b) In the above s-integral, use the change of variable s =1 +11w to show
that there exists a Laurent series L(w) := ao E7-7=1 nW - n , converging for
sufficiently large 1w I, and such that we have
X
with
X2 / 17±icx) dw
/(x) := explw -1 log x + AwIL(w) w2 (R > R0).
27rz R_ico
(c) From now on, choose R := V(A -1 log x). Show that I(x) is infinitely dif-
ferentiable, and that the contribution to r(x) from the domain 171 > (log x) 1 / 3 is
< x exp {2.\/(A log x) — A3/ 2 (logx) 1 / 6 }.
(d) Expanding w -1 log x+Aw as a power series to order 4 for 17-I < (log x) 1 / 3 ,
show that the contribution to _P(x) from the domain 17-1 < (log x) 1 / 3 is
A1/4 e2V(A log x) ( 1
x (log x) 3/4 V4) + \/(log x) f•
1 (n = 1)
10. A theorem of Diamond (1984). Let Fo (n) := 0 and for k> 1
let (n > 1)'
Fk(n) := { II log vi 1
1 i=
vi >1 (j=1,...,k)
Let F(n) denote the arithmetic function F(n) = ka° 0 Fk(n)lk! (n 1).
E F(n) = Kx 0(xe-cOl0g
n<x
x)
(x oo)
E 1 = ck x + 0(xmlo g x)).
n<x, kIT(n)
x (log2 X) k-1
Nk (X) :=- 1 {n, _< x : S2(n) = k} 1 rs (x --4 co )
' log x
-
(k — 1)!
— cf. Landau (1909). The same result holds for the function
(1) E Z w(n)
n<x
w (n) _s . H 0 ± z
Fi ( s; z) := ) (0- > 1).
ps _1
P
where b lz is the multiplicative function for which the values on prime powers
are determined by the identity
oo
1 ± E biz(pv)r = (1 + ez )(1 — )z Gel < 1).
v=1
1 _ e,
In particular we have
(2) biz(p) = 0
and Cauchy's inequality implies that, for IzI < A,
(3 ) Iblz(19P )1 < M*211/2 (11 2)
with
sup (1 ± .'z )(1 —Oz
Izl<A,11/N/2 1—
Relations (2) and (3) show that for a>
00
1 -1
EE Iblz(pv) p - i'' < 2 < cM/ (o- 2 )
p v=1 M >-:
P
13' (Pa. — V2 )
where c is an absolute constant. By Theorem 1.2, we deduce that G 1 (s; z) is
absolutely convergent for a > and that, for o- > 1, we have
G i (s;z) <A 1.
The assumptions of Theorem 5.3 are therefore satisfied, and we can formu-
late the following result.
Theorem 1. For any positive constant A, there exist positive constants
c1 = ci (A) and c 2 = c2(A) such that, uniformly for x > 3, N > 0, Izi < A, we
have
(log x)k
>.-, A(z)
+0A(RN(x))
k=0 1
with
± ( C2N ± 1 .I N I +1
(5) R N ( x ) := e — c i .(log x )
log x )
and
1
(6) Ak(Z) :=
11 (z — k)
>--- h!1j!G i( h) (1; z)-yi (z),
■
h+ j =k
where the -yi (z) are the entire functions defined in Theorem 5.1.
202 11.6 Two arithmetic applications
This implies the absolute convergence of G 2 (s, z) for a > 1 - co(6), and
Theorem 5.3 yields the result stated below.
Theorem 2. For all 6, 0 < 6 < 1, there exist positive constants c1 = ci(6)
and c2 = c2(6) such that, uniformly for x > 3, N > 0, 1z1 < 2 - 6 ,
vk (z)
(8) >z') . x(logx) z-i { ± 0,6(RN(x))
n<x
(logx)k
k=0 }
C k (X) := c (n)
n<x
. X 6 2 j ,k (log 2 x)
(ogX)i
logx{. A k!
3=
1
(12) Q i , k (X) := h (m) (0)X t (0 < j < N, k > 1).
m! f! 3
m -1-i= k —1
If, in addition, we suppose that 'hg(z)1 < B for Izi < A, then, uniformly for
x > 3, 1 < k < A log2 x, we have
(13)
x (log2 x)k-1 f ( k 1 - n (B.(k - 1) log2 X
= Ck(X) Ro(x))} .
log x (k 1)! i h° log2 x) ±
- 1--' A (log2 x) 2 ± k
Proof. First of all we prove formula (11). Since the main term is the coefficient of
Zk in the main term of (10), it suffices to estimate the error term. By Cauchy's
formula this is
xRN(x)
(14) <<A fizi=, (log x) Rezlzl -k-1 Idzi
log x
k) 10 27r
204 11.6 Two arithmetic applications
and
27r 7r/2 1
_2
fo ek cos 0 de < f e k cos 0 do ± 7r 2- ekt dt
7r
o o V(1 — t2) ±
1
e—k(1—t) dt
< 2ek ± 7r < 21-( 2--)ekk --- 1+ 7r.
o
We thus obtain the stated result by substituting in (14).
When k = 1, formula (13) reduces to (11) for N = 0. Suppose therefore
that k > 2, apply (11) with N = 0, and evaluate the main term by Cauchy's
formula. For all r < A, we have
1
Q0,k(X) = 27i. j: ho (z)ez x z -k dz.
z1=r
When k < AX, we can choose r = (k —1)1X. Noting that
1 X''_2X1
(Z r)e zX Z —k dz = r 1)! = 0,
27ri iz i,r (k — 2)! (k —
we see that
Ch,k(X) .h2°(ri) i
e zX z —k dz
7r jizi=r
and
(16) 140 := 11 (Z1 ± 1) II ( 1 -1 p—1 (1 — pliz
p i
.
Theorem 4. Let A > 0. There exist positive constants c 1 = c1 (A) and e2 -
c2(A) such that, uniformly for x> 3, 1 < k < A log2 x, N > 0, we have
xf xN ---. Pj,k(1°g2
(17) 7rk (x) — l og x 1 2.-a (log x) + OA (
(lc'g k2 x)k
RN(x)) }
i=o x)-7 !
x (log2x)k-lfAi k — 1) ( k
(18) 7k(x) = + 0
log x (k — 1)! 1 log2 x (log2 x) 2 ) }
Theorem 5. Let 6 satisfy 0 < 6 < 1. There exist positive constants c 1 = ci (6)
and c2 = c2(6) such that, uniformly for x > 3, 1 < k < (2 — .6) log2 x, N > 0,
we have
we have
with
c :, 1 H (i ± 1 0.378694.
p>2
2))
Proof. Let e = E (A) be a positive parameter to be specified later. First of all,
we apply Cauchy's formula to (8) with N = 0, for the circle Izi = 2 - E . This
gives
x
(22) Nk(x) = v(z)(log x)z -l- z -k dz + 0,(x(2 -Er k (logx) -1,
27ri izi=2 ,
where we have estimated trivially the contribution of the error term. For
k < A log2 x, we have
Thelastingrvy
with
> j5; 62.
71(6) := 2 { (1 + 6) log (1 + 6) - 1 61
This completes the proof.
In Exercise 3 we propose an extension of this result to large values of k, i.e.
such that A log 2 x < k < (log x)/ log 2.
6.2 The average distribution of divisors: the arcsine law 207
It is clear that the sequence {F,} 1 does not converge pointwise on [0, 1].
However, we shall see the sequence of Cesar° means
GN(u) := N-1 > Fn(u)
n<N
is uniformly convergent on [0, 1]. Remarkably, the limit is the distribution func-
tion of a probability law well-known to specialists: the arcsine law, with density
1/7 \/(u(1 — u)). Large and small values have high probability: if D is a random
variable with this distribution law, we have
Prob (D < 0.01 or D > 0.99) ',-,--: 0.128.
This indicates that, on average, an integer has many small (and correspondingly
many large) divisors.
Theorem 7. Uniformly for x > 2, 0 < u < 1, we have
2 1
(23) x- Fri (u) = — arcsin ,\,/u ± 0
7 ( v/ (log
n<x
Let us provisionally accept this theorem and see how it implies formula (23).
The symmetry of the divisors of n about Vn allows us to write
Let S(x,u) denote the left-hand side of (23). Summing the above equality for
11 < X and evaluating the error term by (24) with d = 1, we obtain
1 )
Since we also have (2/7) arcsin Vu, + (2/7) arcsin - \/(1 - u) = 1 (0 < u < 1), we
infer that it suffices to establish (23) for 0 < u <
With u in this range, we can write
1 1
(26) = R(x , u),
x 7 (md)
d<xu m<x/d
with
1 1 1 1
R(x , u) := - V _< E
x L--,
d<xu m<xld,(md)u<d
T(md) x d<xu m<x/d,m<d( 1- u) /u
T( m)
1 v--- ■ -1/2
< — L d(1-u)/u (1 ± log d (1-u) / u )
x
d<xu
1/2
< (1 ± log x l- u) < (log x) -1 / 2 .
h 1 1 )
S (x , u) = — (d) +0 ((3 10
/ 4g)wx( d) )1 + 01
V 7 d<xu dV(log x I d) {g 010 g x)
where we have used the fact that u < 1 to write 1/(log x I d) as 0(1/ log x).
6.2 The average distribution of divisors: the arcsine law 209
et
g (t) := g(n) = hveirt) {1± OG)} (t > 1).
n<et
We have
g(d) h rulog x e —t
dV(logx/d) JO_ V(log x — t) dg(t)
d< u
h i u log x e—t
1
(t) 1 1 dt + 0 (
-V7r NAlog x — t) { 2(10g x — t) /(log x) )
1 fu i'Dgx 1 ± 0(1/(t+ 1))
dt+0( 1
7r Jo Ot(log x — t)) NAlog x) )
1 fu dv ( ) 2 ( 1 1
+ 0 = arcsin +0
71 Jo -V(V(1 — V)) V(log x) 7r V(l og x)) .
00 S
P '
fd (s) = fl + vp ( d ) + 1 = gd(s)fi(s),
p v=0
with
00 00 -v
P —vs \7 S
r'
gd( s ) := H v + i)
Ild v=0
210 11.6 Two arithmetic applications
For each d, gd(s) is a finite product of ratios of series which are each absolutely
convergent for a > 0. For a > 1, the absolute values of the denominators are
bounded from below by an absolute positive constant, indeed
00 co ,-1/0"
E
ii=0
V +1
1 E P
v=1
v
= 1 ± log(1 - p') > 1 ±log(1 - 2 -314 ) > 0.
00
E bcpv)vi =
v=0
This implies the absolute convergence of h(s) for a> . The hypotheses of The-
orem 5.3 are therefore fulfilled for fd (s), with z = and M < (3/4)w(d). From
this, noting that h(1) = h, we deduce the validity of (24), with g(d) = gd(1).
It remains to show (25). To this end we appeal once more to the Selberg-
Delange theorem 5.3 (or 5.5), noting that g is the multiplicative function defined
by
, , , 00 ) --i
P
gv ) := 3=0
E j +13 v ± 1) ( 3= j+ 1
For a > 0, we can write
00 00
E
n=1
g(n)n -s = (-(,)11'2 E /3(n)n —s
n=1
6.2 The average distribution of divisors: the arcsine law 211
Since Ig(pv)1 < 1, the right-hand side is holomorphic for 0 < 1 and we have
In addition
0(p) = g(p) - = 0 (-1 ) .
Bx {1+0( 1 1
(29) E g(n) =
n<x
\/(7 log x) logx)i'
with
00
B := H /3 (pv )p - v .
p v=0
Using (28), we can write the v-sum in this product in the form
. 1 ii 1 \
( Eg(p ,, )p _v) (1 -1/2
P) P)
v=c)
=
ip
00
. - 3• \ -1 (
1
1)_1/2 t co p-7.1--3
L-d
. 3 ± 1) P) v+ j +1
3=0 v=0 3=0
=
00 _. \1
41 j +1) 0
- ( 1\
P)
- 1/2 = { v(p(p 1
)) log ( 1 1 1/19 ) 1 1 .
3=0
We therefore obtain
B= 1/h,
which concludes the proof of (25).
212 11.6 Two arithmetic applications
Notes
§ 6.1. For the study of Nk (x) and irk (x) by induction on k for bounded k, see
Hardy & Wright (1938), § 22.18. Theorems 1-5 are due to Selberg (1954) in
the case N = 0. For N > 0, arbitrary but fixed, they are special cases of results
of Delange (1971).
The asymptotic formula contained in the estimate of Theorem 6 was an-
nounced by Selberg (1954). Using an elementary method, Nicolas (1984)
extended this result by showing that we have, uniformly for x > 3 and
(2 + S) log2 x < k < log x/ log 2,
with y := x/2k, and 77 = To) > 0. Nicolas uses the fact that for these large
values of k, a dominating contribution to Nk(X) arises from integers of the form
2e m with E - > 21 k and Q(m) — 2 log2 y see Exercise 3, where we outline an
analytic proof based on the Selberg—Delange method.
Balazard, Delange & Nicolas (1988) make precise the behaviour of Nk (x)
when k is close to 2 log 2 x: for 1k — 2 log2 x1 < A,‘/(log2 x), we have
where 1 rz e t2/2
43.(z) dt.
:= \/(27) Loo
\ x (log2 X) k-1
Ck(X ) log x (k — 1)!
rhg(r)x+ 0 ( B (k — 1) 2 + log 2 x R o (x
x { ho (r) )) } .
2 log2 (log2 x) 4 k
The problem of the asymptotic behaviour of 71k (x) for k > A log2 x is much
more complicated than that of Nk(x), since no single prime number plays a
special role. It is only recently that the limit of validity of Selberg's formula
has been determined. Hensley showed in 1987 that (31) holds if, and only if,
k = o((log 2 x/ log3 x) 2 ).
where A and B are absolute positive constants. The same article contains nu-
merous other facts about the behaviour of 71k (x) for large values of k, notably
the relation
71 k+1 (X)
,
L
(1 _< k < (101:xx )2 )
7rk (x) k
with
log x \
L := log (
klog(k ± 1)) .
For further information on this subject, in particular concerning "very large"
values of k (k >> log x/ log2 x) see Pomerance (1984). Balazard (1990) has re-
cently shown that, for x sufficiently large, the sequence k i— 7 r k (x) is unimodal.
Theorems 7 and 8 are due to Deshouillers, Dress & Tenenbaum (1979).
As these authors remark, the remainder term OW \Alog x)) in formula (23)
is optimal if uniformity in u is required: for u E [0, (log2)/ log x[ we have
Fn (u) = 1 I T (n) (n _< x), so that
Exercises
n <X
with
1 0 (if m = 2k + 1)
(2k)!
tme -t2 / 2 dt = { 2kk!
V(27) f±c°
_00 (if m = 2k).
2. Give an explicit expression for the remainder term in formula (33) from the
previous exercise.
3. Nicolas' theorem (1984). Throughout this exercise, we write y := xl2k .
Let Nk(X) (resp. NL(x)) denote the number of integers (resp. odd integers) not
exceeding x and such that C2(n) = k.
(a) Show that for all x > 1, k > 1, one has Nk(x) = o Nj(2iy).
(b) Show that for all 6, 0 < 6 < 1, and uniformly for 1z1 < 3 - 6, x > 3,
one has
E
n<x
P (n) = x(logx)z -1 {zh(z) + O6
(log1 x)}
ril(mod 2)
with
h(z) := r( z2 ±z 1) 11(1 -
_ 1) z .
\ p>2 P) P)
Exercises 215
(c) Using the previous result with z = 3, establish the upper bound
(d) Set Y := log 2 3y, and assume that y > 1. Show that
x (1.
± 0(og2 x)
N'i (x) = Qi (log2 x) i
log x 3! log x )
1
with Q3 (X) := h ( m) (0)Xe .
m! f!
in-Fe=j-1
1
(f) Show that-77-7 h (m) (0) < ()m (m 0) and deduce that
Xj -2
Q3 (X) < (D i e8X/3 (j > 3X), Vi (X) << (2 < j < 3X).
(j — 2)!
(g) Deduce from parts (e) and (f) that, for y > 1, j <Y one has ,
2jy
(23 y) = {Q j (Y) + 0( 173±le Y )1
log 3y
with C := 2h(2) =
H+ 1
p(p — 2))
p>2
216 11.6 Two arithmetic applications
xk 1
lim e—x = A00 e- ° / 2 dt.
x,00 k! \/(27) f
k<x-PA N/x
(b) Put cox (t) := e— xx t lr(t ± 1). Show that for x > 1, It — xl < Vx, one
has
1
- (t - x) 2 /2x} (1 + 0(1k/x)).
(Px(t) = /(27rx)
\ exP{
A
Evaluate the Stieltjes integral j cpx (t)d[t] and thus recover the result of the
previous question. -00
(c) Applying the estimates obtained by the Selberg-Delange method for
71k (x) := I tri < x : co (n) = 01, prove the Erdos-Kac theorem:
1 fA
(VA E IR) lim 1- 1{n < x: co (n) _< log2 x+A N (log2 x)}1 = e -t2 / 2 dt.
x -4 00 X V(27) _co
sup an <6.
t ?.1\I N<n<t
218 11.7 Tauberian theorems
M
E
N <n< M
an z n fNMZ t dAN (t) = AN (I) Z M — log z f z t A N (t) dt
N
m
< E.+ 2re f IzI t dt _< E(1 — 2r/loglz1).
N
Noting that
log Izi =- Ilog (1 — 2r cos cp + r 2 ) < —r cos yo + r2 < A- r(2 cos 0 — 6),
4
E ( 1 ± 2 cos 0 — 6) .
This establishes the uniform convergence of the power series f (z) in S(9) and
thereby completes the proof.
Theorem 1 is the prototype of a class of statements called Abelian because
they share with it the following characteristic: they state that, if a sequence
(or a function) is sufficiently regular, then a certain average of its values also
has a regular behaviour. Thus the well-known implication
ELM an = a = hm —
1
n
E
n
am = a
m=0
7.1 Introduction: Abelian/Tauberian theorems duality 219
urn =b lim (1 - z) E bn zn = b.
n—+oo zEs(e)
n=0
The converse of an Abelian theorem is in general false. For example, the
sum of the power series
00
then we have
ca + o(1)
F(a)= (a --> a+).
(a - a)w+1
Proof. Let us write A(t) := En‹e t an, and
+00
G(h) := e -(a+h)t dA(t) > 0).
we can write
ca +co ca
G(h) = (a + e-(a+h)t A(t) dt
hw -ki
By hypothesis there exists a function e(t), with limt , e(t) = 0, such that
the expression between curly brackets may be written as
ca )
G(h) =o (h -, 0+).
lici)+1 ( hw+1
1
(T) E win, =
n<x
0(X)
For each pair of positive real numbers {A, x}, we can write
ffAx Ax
A(Ax) - A(x) = dA(t) = f t -1 d{ta(t)}
x
. Ax _
f t l a(t)dt +a(Ax)-
x
Considering this statement, we can both enlarge and make more precise the
notion of a Tauberian theorem. Given a real-valued function (19(t, s) defined on
IR+ x S where S C C, we define the co-transform of a function A of bounded
variation on any finite interval by
+co
(5) F(s) := oj co(t, s) dA(t)
whenever the integral is convergent. Let us next assume that the follow-
ing Abelian theorem holds for some number so in the closure of S: if
lim t,,, A(t) = f, then the integral (5) converges for all s in S, and we have
We shall show that this relation remains true when P is replaced by the
function x defined on [0,1] by
{ et (0 < t < 1)
X(et) =
0 (t > 1)
and when a tends to 0 in such a way that 1/u avoids the points of discontinuity
of A. Assuming this for the moment, it follows that
X(t) - t
(8) 11-(t) := t(1 t)
(0 < t < 1), H(0) = -1, H(1) = 2.
I
f (t) < H (t) < g(t) (0 < t < 1)
(9) g (t) - f (t) < E (It — 1/el > E.)
g(t) - f (t) < 12 (It - 1/el < e).
224 11.7 Tauberian theorems
(The number 12 serves here only to exceed the value of the discontinuity of H at
t = 11e.) From the Weierstrass approximation theorem there exist polynomials
p and q such that
(10) max
13<t<1
Ip(t) — f(t)I 5_ E, max
0<t<1
lq(t) — g(t)1 < E.
Setting
It is easily deduced from (9) and (10) that the last integral does not exceed
27E. By adjusting the value of E in the preceding calculations, we therefore
obtain the existence of two polynomials P and Q, which satisfy (11) and
Q(t) - P(t)
(12) dt < E
J1 to_ — 0 — -
It is now easy to establish that (7) holds for the function x and when a tends
towards 0+ avoiding the discontinuities of A(1/t). Under these conditions we
indeed have
co
A(1/5) = i X(eat)et dA(t)
o
co rp (121
fc±o)0.
< i Q(e —crt )e't dA(t) = fo cc Q(e -t )e -t tw-1 dt.
o
Similarly
A(i/a) > {c ± o(1)} f
-t dt.
0 c)° P(e -t )e -t e
r pQ (e -t )e -t w-i dt -< r ,
r(w)o-w
Now,
_t ) _t u.,_,
> Xle e t dt ± R
— o
7.3 The theorems of Hard y-Littlewood and Karamata 225
with 00
R := I {Q(e -t ) - P(e -t )le -t t' l dt
o
1 f Q(t) - ) -1
P(014 1 ___ 0( log-1w dt < EM
fo1 t(1 - t) i \ t
by (12), where M := suPo<t<it(1- t){log(1/t)}w -1 is independent of a and E .
By making successively a and then E tend to 0, we obtain the expected formula.
Remark. When ch) = 0, the conclusion of Theorem 5 remains true in the form
A(x) - A(0) ----> c (x --4 oo).
This follows trivially from the partial integration formula
00 00
-
F(a) = A(0) + a I e- at A(t)dt = i e -t {A(t/a)
Jo o
A(0)1 dt.
-
Hypothesis (14) implies that g(a) = 0(1) as o- - > 0+ and, by (13), we have
00 00
from which
00
that is to say
Enan = 0(x) (x --› cc).
n<x
Then we have
(17)
E an, _ {c + o(1)} (log x) , (x -> -Foo).
n r(w ± 1)
n<x
Remark. Condition (15) can be replaced, more generally, by a, > -bn (n > no)
where the br, are the coefficients of a Dirichlet series satisfying both (16) and
(17) for some suitable constant c, but with the same w. For example, a condition
of this type is
(18) an > -K1 (log n)" -1- - K27 (n) (n > 2)
with a := log w/ log 2.
and
(20) F(a) = {c± 0(0(1/a-) -1 )}o -- w (a --> 0+).
Then we have
1 xw
(21) A(x) = {c + 0( (x —> +oo) .
log 0(x) r(w ± 1)
228 11.7 Tauberian theorems
It is important to note right from the outset that the error term in (21)
cannot be improved without additional assumptions, as shown by the following
counter-example, due to Karamata (1952). Consider the increasing function
(22) A(x) = x
± c ± ( lo: x ) •
This follows by simple partial integration
x x t
cos { (log t) 2 } dt = /2 d sin { (log t) 2 }
.1.2 2logt
x x
[ t 1 1 ) sinf(log 0 2 }
sin {(log 0 2 }] dt
2logt 2 1 ( log t i 2 log t
where the last integral has been estimated by a second integration by parts.
On the other hand, we shall see that we have
1
It is clear that F (a) = —
a ± Re J(u) + 0(1), with
Da
J(a) := f exp { — at + i(log 0 2 } dt
1
We estimate J(a) by replacing the integral over [1, +co[ by the complex integral
over the contour formed by the circular arc F := lei ° : 0 < 0 < 7r/41 and the
half-line A := { re2:7r/4 : r > 1}. This transformation is justified for all a > 0 by
the bound
Now the integral over F is clearly 0(1), and the estimate (24) applied with
0 = 7r/4 > shows that the integral over A is absolutely convergent and is
bounded independently of a > 0. This implies (23).
7.4 The remainder term in Karamata's theorem 229
The idea behind the proof of Theorem 9 consists in making explicit the
polynomial approximations for the function x arising in the proof of Karamata's
theorem. Precision is measured in the L'-norm, and it is necessary to control
the size of the coefficients. We define the length f(P) of a polynomial P(x) :=
\---,n
z-arn=o ax m by
f(P) := lam 1 -
0<m<n
We prove the following one-sided approximation result.
Theorem 10. Let f(t) be a function of bounded variation on [0, 1]. There
exist constants A 1 , A2 depending only on f, such that, for any integer n > 1,
there exist two polynomials p and q, of degree at most n, satisfying
p(t) < f(t) < q(t) (0< t <1)
P(e - at )e - at dA(t)
(26) 10
1 f c
-t )e -t tw -1- dt+O(f(P)0(1/u) -1 )}.
1 F(w) r(w) o P(e
We now apply Theorem 10 to the function H defined by (8), and we set
P(t) := t ± t(1 - t)p(t), Q(t) := t ± t(1 - t)q(t). We have P(t) < x(t) < Q(t)
for all t in [0,1] and from (26) it follows that, when 1/u is not a discontinuity
of A, we have
<1 r c
A( 1 /0- ±
)
with
For the choice n := [log //)(1/a)/(2 log A2)] , we obtain the required estimate
(21).
Proof of Theorem 10. Let Y be the Heaviside function defined by
0 (t < 0)
Y(t) = { 1 (t > 0)
We shall show the existence of absolute constants B 1 , B2 such that, for each
n> 1, there exist polynomials R, S of degree at most n satisfying
on [0,1] by
11
P (t ) := f ( 0 ) + f R( t — 0 d.fi (0 — f S (t — 0 df2( 0
0 0
1 f1
q(t) := AO) + fo S(t - 0 dh() - jo R(t - 0 d.f2()
o
1
A 1 = Bi i d(ii+ .f2)(e), A2 = 1+ 2 1/(0)1 + 2 B2 I 0
1
Cl(fl ± f2)().
(28) f(Tr ) = E ( 2m h)
r ) (m (tr ) (eh)
3r .
and set m := [1(r ± 1)], so that x m±i <0 < xm . We then define R(t) as the
unique polynomial of degree n = 2r - 2 satisfying
(v <m±1)
S(x„) = S i (x„ ) = 0 ( v m ± 1).
{ o' (m+ 2 < v <r),
and check that this implies S(t) > Y(t) (-1 < t < 1).
In order to obtain the second of the properties (27), we introduce the poly-
nomials
Tr(x) 2
(29) V(x) := T,(x „) -2 ( ) (1 5._ v < r).
X - xv
We have
1 (i
vv (x 3 ) = { 0 (i = vv )'
)
Vi' (xj ) = 0 (j v).
232 11.7 Tauberian theorems
The polynomial W(x) := S(x) — R(x) V(x) V +1 (x) has degree 71 and
— —
vanishes at all the x, (1 < v < r). There thus exists a polynomial Z(x) of
degree r — 2 such that W(x) = Tr (x) Z (x) . Whence
7r
dx
cos ru•Z(cos u) du = 0
f_i W(x) ,\/(1 — x 2 )
and so
1 S (x) — R(x)
{S (t) — R(t)} dt <f 1 (1 x2) dx
(30) fo
= {Vm, (cos u) + Vm+i(cos u)} du.
Jo
7r ,
Set T := - V77, - , so that x rn = cos r. We have
sin r7
Tr' (COS T) = r =
sin T sin T
Substituting in (29) with v = m yields
, 2
7 2 1" cos ru
Vrn (cos u) du = (sin du
r ) /J0 ( COS U - COS T )
(sin T ) 2 s/71-
in (- r(u — T)) sin (- r(u ± 7))
{
2
du.
sin g (u — T)) sin q(u ± 7))
A similar calculation provides the same estimate for the integral concerning
Vm± i. Substituting in (30), we thus obtain the second property (27).
It remains to bound the lengths of R and S. Consider, for example, the case
of R. We have
m -1
R(x) = M (x)
=0
_2 (UiX Vi Tr (x) \ 2
(31) Mi (X) = xj
\ X — x rri )
M(x) = 1, M(x) = 0.
Ui = 1 ± XiX m — 2x , = Xi3 — X.
with 10,1 < 3'. The desired upper bound for (Ali) then follows easily from the
formula
mi (x)=7-20r (ui x+vi )(x_xm,) H (x xv)2.
1<v<r
v0i,rn
This completes the proof of Theorem 10.
234 11.7 Tauberian theorems
converges for a > a > 0. Suppose that there exist constants c > 0, w > —1,
such that the function
F(s + a) c
(33) G(s) := s ± a ( o - > 0)
s Lo-Fi.
satisfies
T
(34) ri(o-,T) := a'i IG (2a. + ir) — G (cr + iT)I dr = o(1) (a. -->0+)
-T
with
p(x) := inf
T>32(a+1)
IT-1- +77(1/x,T)+ (Tx) - '1 1.
Furthermore, the implicit constant in (35) depends only on a, c and w. An
admissible choice for this constant is
and
+00
(38) - (0 := f co e'' g(x) dx = 0
4 WTI 5_ T).
Then we have
(40) --4(T)i(T) = 0 (T E R)
for any integrable function x having Fourier transform with support contained
in [-1,1]. We choose
1 sin 2it) 2
X(t) = 27r ( Y
)
236 11.7 Tauberian theorems
and the stated inequality will follow by using the fact that x(t) "peaks" at
t = 0. More precisely, we shall appeal to the inequality
4' dt 4
(42) / := I x(t) dt _< — I
7 5 L
= &Tr .
Itl> 5
Let E , 0 < e < 1, be a real fixed number and let 0 = ±1 be so chosen that
119, 11 00 = sup{Og(x)}.
xER
There then exists an xo = xo(E) such that 0 g (x0) ? (1 E)11g11 oc . Applying (41)
—
where we have used the fact that, for all t such that 1t1 < 5, we have
0{g(x 0 ) — g(x0 — t — 50)1 _< sup {g(y) — g(x)} 5_ 10K.
x<y<x+10
We therefore obtain
{(1 — E)(1 - I) - /111g11 00 5_ 10K(1 — /)
and, letting E tend to 0,
10K(1 — /)
1191100 <5_ 1 21-
16K,
as stated.
An application of Theorem 12 on the lines of Ganelius' work would lead
to an effective Tauberian theorem with rather unusual conditions, and which
would not entirely generalise the Ikehara—Ingham theorem. We proceed in a
slightly different manner by noting that Theorem 12 easily implies a result
which simultaneously contains the Berry—Esseen inequality (cf. §7.6) and The-
orem 11.
7.5 Ikehara's theorem 237
2
cE(t):= 7rEt2 sin (Et/2) sin ((2T ± E)t/2)
T)
(ITI
6 (7) = { 1(T + E -1-7- 1) /E (T <1-7- 1 5_ T + E.)
0 (ITI > T + 6).
(43)
1 -
lifiloo - 7r 11/111 - 1 r f T+6
T_e -..4 7- dT
l ( )1 .
It follows that
whose Fourier transform is given, for any positive value of the parameter a, by
the formula
T T
f—T
± c(w ± 1)0-w+1 I —T
max(o-,17- 1) - w -2 dT}
Then the monotonicity and the positivity of A imply that, for x> 0, y > 0,
This inequality is also satisfied when x < 0, since then ga (x) = 0. Applying
Theorem 13 to -ga , with K = (a + a)b400 /T, we deduce, for 0 < a <1,
11g,li cx, < 6o- ' fii(o-,T) ±2c(co ± 2)1 + 16(a+ 1)11g„11 00 /T.
with
Mi (a) := 126(o -, Ta ) ± 2c(w ± 2)}.
Let us set
c e —t ( 1 ____ e —t)tw (t > 0)
B(t) := { I-1 (w ± 1)
o (t 5_ 0).
The second step of the proof consists in a further application of Theorem 13,
this time to the function
C
G(t) := go-(t) — a-- w B(at) = (A(t)e — at tie't (1 — e't )
11(w +1)
C
B' (t) = F(w +1) e —t tw-1 {2te —t — t + w(1 — e —t )}
Y CP + 1) c „,,,)+1
B(x + y) - B(x) 5_ fo Fp ± 1) e -t tw dt <
- F(w + 1) Y '
c(w + 1) x f x+Y
B(x + y) - B(x) 5_ Fp ± 1) e _ x tw dt
with OM := 1 for w > 0, and 9(w) := 0 for -1 < w < 0. In the case w < 0, we
have used the classical Minkowski inequality
w+1
(x + y) w+1 - x w+1 < y (-1 <w < 0).
By an easy calculation, we may deduce that, for x E R, 0 < y < 1/T < 1,
0 < a < 1, we have
a ( a ),..0+1
B(ax + ay) - B(ax) 5_ D{-1-7, + -17,
},
with
C ( 1 ± e i-w ( u) ± i) 0+2) .
D :=
F(w + 1) ‘
with
The conclusion follows by choosing a = 1/x. The implicit constant in (35) does
not exceed eM (1 e -1 ).
-
Ikehara's theorem implies the prime number theorem assuming only that
((s) 0 for a = 1, without requiring an upper bound for ((s) -1 —cf. Exercise 4.
Of course any explicit upper bound for ('(s)/((s) or for ((s) -1 will provide, via
Theorem 11, a corresponding effective version of the prime number theorem
see, in particular, Exercises 5 and 6.
f +00
f()
r) := ei" dF(x).
It is easily verified that H, is integrable and that its Fourier transform has the
value
+00 f HT) g( T) — —
Since 11H1100 5_ 1 and H(-oo) = 0, we easily deduce that we have for any
fixed x
lim H6 (x) = H (x).
6 ->0
Write a := 11G/1100. Clearly we have
and
fx
e
{ - EX 00 e Et H(t) dt} = eEt H(t) dt + H(x) < 2.
dx
(a + 2E)
T (x ET1, 0 < y < )
+ 2E
Applying Theorem 13 to - H, with K := we obtain
T
f(r)
1116(x)1 5_ 16 (a ± 2E) + 6 fT g(T) dr (x E lik).
-T
Notes
00
is holomorphic for I z I < 1, since the series converges uniformly on every compact
subset of the open disc. However we have
so f (z) fails to have a limit when z --4 1. Now the Taylor series of f (z) is
_Ez0. 0.
n n _, E 2-k ak-n ,
k=1 n=1
EE 2-k drict =
1<n<m k=1
for each fixed k > 0. The conclusion of Theorem 5 then holds on replacing x'
by 2 4 '1,(x) cf. Hardy (1949), theorem 108.
- —
Notes 243
§ 7.4. Theorem 10 is due to Freud (1952). See also Freud Sz Ganelius (1957).
The self-contained proof which we give here essentially follows that of Kore-
vaar (1954). Ingham (1965) gives a direct proof of the Karamata-Freud theorem
which does not explicitly appeal to the lemma on the L'-approximation by poly-
nomials, but directly incorporates the use of "peak functions" in Karamata's
proof.
§ 7.5. For developments relating to Bohr's inequality (36), see HOrmander
(1954).
As indicated in Exercise 5, Theorem 11 yields a remainder of order <
x(log x) -2 +E in the prime number theorem. Rieger (1983), also using the
Wiener-Ikehara method, obtained a remainder of order
xexp{ - c(logx) 1 / 25 }.
As Diamond remarked in his review (1988), it is not true stricto sensu that
the Hardy-Littlewood-Karamata theorem cannot lead to a proof of the prime
number theorem. Using a number of analytic properties of the Riemann zeta
function, such as the functional equation, the product formula, the finiteness of
the order, the absence of zeros on a = 1 and a bound of type N(T) < T A for
the number of zeros with ordinate not exceeding T, Littlewood indeed showed
in 1971 that the Hardy-Littlewood-Karamata theorem can be used to obtain
a "quick" proof of the prime number theorem. Since no known proof of the fact
that ((s) 0 for a = 1 furnishes an effective lower bound weaker than
-A
(50) 1((s)I >> (log(3 + r1)) (a > 1),
where co is some real number, and the g i (s) and g(s) are functions holomorphic
at s = a, the number gq (a) being non-zero. Then:
(i) if co is not a negative integer, we have as x —> Do
gq (a)
A(x) := > an --a xa (log x) (log2 xr,
ar(w ± 1)
n<x
Exercises
[This non-trivial estimate is of course much weaker than (1.3.15) but only
necessitates a very fragmentary piece of information concerning the Dirichlet
series ((s) -1 associated with the Mobius function.]
, C(log x) 2
L-e n
n<x
urn (a — an = 0.
0-.1+
n=1
4. Set Z(s) := -s -1- (1 (s)((s) -1 . Show that the assumption ((1 + ir) 0 for
0 implies that, for all fixed T,
1 1
Z(1+ 2o- + iT) - Z(1+ a + iT) 20. ir + di- = 0.
a + ir
Now apply Ikehara's theorem and deduce the prime number theorem.
5. Using the elementary bounds proved in §4.2
( (k) ( 8 )
show that the integral in Exercise 4 is < o - (log T) 1-9 for T > 2, a > 0. Apply
Theorem 11 and deduce the effective estimate
+00
eirx dF (x ) E p).
Finally, let Fn (x) denote the distribution function of (1/ N/n) E';'=1 X ) , where
the X 3 are independent variables with the same law as X.
(a) Show that p> 1.
246 11.7 Tauberian theorems
Tn,
1 1 ,
erY E R, m 0)
(m + 1)!
3,0
and
I log( 1 — + yl IY1 2 (IYI 5_
show that
Sx
(i) (ii) 11 (1 ± (19) K (log x) 5
19(P) rj log x' 19 1
p<x p<x
0(m) Sx 0(m)
0(n) log n (iv) T(x)
log x m
n<x m x m x
Exercises 247
p<exp(1/a) p>exp(1/cr)
00
9(p) \ -1
Deduce that e-76
H (1 -0
( a _, 0.
m=1 p<exp(1/a) P )
(c) Show that one has under hypotheses (i) and (ii)
61- ( e —Y 6
T (x) — iv +1) x(log x) 6-1 (x ---> oo).
11.8
Prime numbers in arithmetic progressions
G
3=1
n := GI = n 3'
-
j=1
( 1) (1 < rj < nj (I 5_ j k )
x(ry)=Hx(7.7r9-
j=1
8.1 Introduction: Dirichlet characters 249
Thus, x is entirely determined by the x(-y i ) (1 < j < k), and the fact that
is of order ni implies that x(-yi ) is an nth root of unity. Conversely, for each
choice
(1 < j < k)
with 1 < hi < ni , the map x : C ---+ C defined, for ry given by (1), by
V -Y) = H C13
j=1
n if -y =1
(2) (-y E G)
0 if -y 1
and
(3) E xor) = { On if x = xo
if x x o
E
-yEG
Proof. The proofs of these two dual relations being similar, we confine ourselves
to establishing the first. Let S denote the left-hand side of (2). If -y = 1, we
have x(ry) = 1 for all x, hence
S =161= n.
It follows from the above description of the character group that there are
<p(q) characters to the modulus q. They can be explicitly defined by decom-
posing (Z/qZ)* as a direct product of cyclic groups. By the Chinese remainder
theorem we have for any q > 1
(4) (Z I qZ)* r■.; (z/p u z) * .
Pv II q
Indeed, if we let qi (1 < j < (q)) denote the distinct prime powers dividing
q, and if, for each j, aj denotes the unique integer (mod q) such that
aj 1 (mod q3 ), ai 0 (mod qi) (i
then for each integer m we can write
m ajmi (mod q)
For each index j, let g be a primitive root modulo 41.3 . Then, for each integer
m with (m, q) = 1, we can uniquely define E(m), ri(m), pi(m) (1 j
satisfying
in (-1) (770 5'7(77) (mod 2"), E(m) = 0 or 1, 0 < n(m) <
m gillj (rn) (modp;I:i), 0< <(p3) (1 < j < k) .
8.1 Introduction: Dirichlet characters 251
(1,
{(m), if n m(mod q), 1 5_ ? q, (m, q) = 1,
(7) x(n) = if (n, q) > 1.
Ii if (n, q) = 1,
(8 ) Xo(n) =
if (n, q) >1.
(10) v ( q )-1
X(n)X l (n) = { 1 if X
1<n<q
0 otherwise.
252 11.8 Prime numbers in arithmetic progressions
In particular L(s, x) 0 when a > 1. For x = xo, the above formula can be
written in the form
(14) max
x>1 x(n) < q.
n< x
By Dirichlet's test, we deduce that the series L(s, x) converges for a > 0, which
actually determines the abscissa of convergence.
With the help of the orthogonality relations, the Euler expansion (12) allows
us to investigate prime numbers in an arithmetic progression. As in the case of
the Riemann zeta function, we proceed via the logarithmic derivative.
Theorem 3. For all integers a, q with (a, q) = 1, we have
00
(15) E A(n)n8 = p (q) 1 x(a) y:
(s, x) (a> 1),
n=1
Teta(rnod q)
where the summation over x is extended to the (p(q) Dirichlet characters to the
modulus q.
Proof. By (12), we have for any x
P p v=1 n=1
The result then follows from (9) with m = a, by interchanging the summations
on the right-hand side of (15).
8.2 L-series. The prime number theorem for arithmetic progressions 253
As in the case of the classical Chebyshev function //)(x) (= '0( x; 1, 1)), an in-
tegration by parts allows us to link the asymptotic behaviours of IP (x; a, q)
and of the counting function of prime numbers in the corresponding arithmetic
progression, i.e.
(17) 7(x, a, q) := 1.
p<x
7;ict(rnod q)
(18)
E 1 , log2 x (x --- oo).
p<x P 'PM
7:)a(rnod q)
(19) L (1, x) 0 (x xo )
by means of the Hardy—Littlewood—Karamata theorem (Theorem 7.8). Indeed,
we deduce from (15) and (19) that, for fixed a and q,
00
E A(n) = 1 —L' (c; r
(20) , x0) ± 0q (1) (a > 1),
na co(q) L
n=1
77,a(mod q)
1 log2 x
(23) E
P<x
p = se) (q) ± Oq (10g3 X) (X 00 ) •
p.a(mod q)
x
(25) 1P(x, a, q) ,---, (x -4 ao).
(P(q)
This result is commonly called the "prime number theorem for arithmetic pro-
gressions". One of the central questions in analytic number theory is to de-
termine effective versions of this asymptotic formula with respect to the three
variables involved.
In order to deduce (25) from (24), it suffices to apply Theorem 7.11 to the
Dirichlet series (15), i.e.
00
-Li ,
By (24), each function — ls X) X Xo, may be continued to a function
L "
holomorphic for a > 1 (that is to say in an open set containing this half-plane),
and it then follows from (13) and (15) that
F(s +1) 1
(26) G(s) := s+1 cams
we have
(lolgo2g Xx) 19 ) .
In particular, the asymptotic formula (25) holds, for any 6 > 0, uniformly in
the domain
A more careful study of the series L(s, x) enables one to obtain a zero-free
region analogous to that for the zeta function. Uniformity in q is crucial and
particularly difficult it incidentally raises an interesting open problem on the
existence of the famous "Siegel zero", of which we give a very brief account
in the Notes. The best results currently known yield the following statement,
which considerably strengthens Theorem 4—cf. Siegel (1936).
Theorem 5 (Siegel-Walfisz). Under the assumptions of Theorem 4, we have
remainder term OA (x(log x) - A) for all A > 0. The details are made precise
in Exercise 4. Here, we have however chosen to apply the effective Ikehara
theorem: on the one hand because it provides, in a very simple way, a domain
of validity for the asymptotic formula (25) which is practically identical to that
obtained by contour integration and, on the other hand, because this will give
us the opportunity for a detailed arithmetic use of a Tauberian theorem.
00
= Y,
n=1
X(n)(log n) k rt- s
< E (1ogn) k n -1 +
n<x Id-c (log t) k t - s dK(t)
+00
Proof We proceed analogously to the proof of Theorem 3.8, using the inequality
which served to estimate 10+i/01 from below. For each x, we define a function
qp(n), with values in [0, 27[, such that
(n) .
x (n) = x ( n) ei
log L(s, x) =
x(p) k
kp ks
E expli(0(p k ) — kr log pn
kp ka
p k=1 ptq k=1
> O.
kpk(7
ptq k=1
(35) 2
X = X0-
In this case the factor IL(o- ±2i,r, x2 )1 becomes infinite as a 1+, 7 ---> 0, which
has the consequence of reducing the effect of (33). The explicit description (6)
shows that x is real if and only if
A'
10 '3 (v < 2)
and Ai = 0 or co (p.1;3 ) (1 < j < k).
1 0, 2 (v > 3)
Since E(m) = 0 when v < 1, we readily obtain that he number r(q) of real
characters modulo q is given by the formula
parameter satisfying 0 < 77 < 1/r. It follows from (31) that we have
for a suitable absolute constant C o . On the other hand, Theorem 6 implies that
Using (38), we infer that there exists some positive constant C 1 such that
2
1 L ( S , X)1 ?- C1773/4L-1 /4 ____ c0i7r C2 {CrC2-1/4 - C0},C -7
Proof. We confine ourselves to the case 1 < a < 1 +1/7q. Indeed, we otherwise
have
1 a-1 1
L(a, x) = ri (1 - x(p)p - a) 1 ? H(1 ---p-a) = ((a) > > .
- a 8q
P P
Consider the arithmetic function
i a (n) :=
dIn
This is a multiplicative function. If a> 1, we can write for each prime number p
1 ___ ( x (p)pi-0-)v±i > 1 ± (_ 1 )v p(1-0-)(v+i)
fa(e) = 1 + pl-a
-
8.3 Lower bounds for IL(s,x)I when a > 1. Proof of Theorem 4. 259
where the inequality follows from the fact that x(p) = ± 1 or 0. In particular
fa (pa') > p2v (1 a) (v _?_ 1).
MP') ?_ 0 ,
Passing to the limit, these inequalities are still valid when a = 1. We then
obtain
fa (n) ?_ 0, fa (n2 ) > n 2(1-a) (n > 1, o- > 1).
F(o-, a) -an
fa (n)e
n=1
>
E n2(1-e_2 >
n=1
ct a-3/2“. ____ 0.)
1 t 2(1-0) e -at 2 dt
Noting that
log(10q 2 )
a a-1 > exp { > 4O'' ,
7 } ,
we then obtain
with
A := V71 • 40 -1 / 14 .
00 00
F(o-, a) = E x (d)di-ae-ornd - y, x (d)di-o- (etc/ - 1)-1
m,d=1 d=1
= x(d)0(d)
d=1
with
1
0(d) := di- a { e ad ____ 1
1
ad 1
260 11.8 Prime numbers in arithmetic progressions
It is easily checked that 0 is increasing on [1, H-oo[. The sum over d hence takes
the value
+00 00
OM dK (t) = - f K (t)6e (t) dt
f_ 1
1 1 1 1 <i
1 19 ( 1 )1= a e , _ 1 < a co ± a) 2 1
we obtain
1 q)> 1
L(cr,x) ? AN/a - a( + T.
4A
for the optimal choice a :=
(q + 14/3) 2-
Theorem 10. There exists an absolute constant co > 0 such that, for x2 = X0/
X X0/ a ? 1 /
Proof. The second of these estimates follows immediately from Theorems 6 and
9. Indeed, a Taylor expansion at order 1 gives for all 7", 171 < 1,
with
T
(46) 77(cr, T) := f 1G (2a + ir) - G (a + ir)1 dr (a >0).
-T
with
1 +
(.5 ± 1) 1 log P
{ — (
1
( s ± 1)((s ± 1) s (s + 1 (x = xo)
z(s, x) := p q Ps
—11(s + I-, x)
(s + 1)L(s + 1 ,X)
From the estimates of §4.2 on ((s), and from Theorems 6, 8, 10 concerning
L(s, X), X Xo, we obtain for a > 0, after a routine calculation,
{Lisl s+ 11 _1, if
A
_2 7- Xo or 171> (log 2q) -1 ,
Z'(s,x) < L 16 171 -2 , if X 2 = xo and c o (log2q) -2 q -l <17- 1< (log 2q) -1 ,
L4q2, if X2 = Xo and 171 < co (log2q) -2 q -1 .
Substituting first in (47) and then in (46), and taking account of (36), we
deduce
2w(q ) q
n(a,T) < a L(q,T) 18 {r(q,T) +
Choosing T = log x, we obtain from (45) the stated formula for 0(x; a, q).
262 11.8 Prime numbers in arithmetic progressions
Notes
The trivial inequality (32) can be considerably improved. Polya and I.M.
Vinogradov discovered independently, in 1918, the inequality
(51) max x(n) < 2-Vq. log q,
x>1
n<x
which is valid for all non-principal characters x mod q for a proof, see Ellison
& Mendes France (1975), theorem 9.7. This inequality is clearly useless for small
values of x, for example x < Vq. Burgess (1962, 1963) studied this question.
He shows, for example, that
(52) Y: X(n) < xq -6(6)
n<x
(x > q 3 / 8+6)
where (5 = S(E) > 0. In addition, (52) is valid for x > q1-14±' if q is cubefree.
See also Hildebrand (1986d).
Our proof of Theorem 9 essentially follows that of Ellison & Mendes France
(1975), chapter 7, Al. Using the Polya—Vinogradov inequality in place of (32),
it can be shown that L(o-,x) -1 < Vq. log q, thus obtaining a corresponding
improvement in Theorem 10. The gain in Theorem 4 is not significant if we
stick to the approach presented here, but the contour integration method does
yield in this way an improvement on the uniformity in the variable q see
Exercise 4.
Paley (1932) showed that the inequality (51) is almost optimal. Indeed, we
have for infinitely many q
max max E x(n) > -Vq• log2 q.
x0x0 i<x<q n<x
Montgomery Sz Vaughan (1977) have shown, under the generalised Riemann
hypothesis, that the right-hand side of (51) may be replaced by 0(Vq. log2 q).
It is easily seen that the left-hand side of (51) is ?_ c-Vq for some absolute
positive constant c and any non-principal primitive character x to the modu-
lus q. This follows from the classical fact that the modulus of the Gaussian sum
T(X) := Y: X(n)e(n1q)
1<n<q
is equal to Vq see e.g. Davenport (1980), p. 66. Indeed we have
q
T (x) = —(27i/q) I K(x)e(x1q)dx
o
with K(x) := Ei<n<x X(n), so the required lower bound follows with c = 1/27.
If we use the relation K(q — x) = —x(-1)K(x—), we obtain that c = 1 is
admissible. For further improvements on c, see Sarkozy (1977a), Sokolovskii
(1979). On the other hand, Montgomery Sz Vaughan (1979) have shown that,
for any E > 0, we have max x IK(x)1 <, Vq for all but at most E(q) non-
principal characters to the modulus q.
264 11.8 Prime numbers in arithmetic progressions
Exercises
1. (a) Let p > 2. For each d > 1, let 0(d) denote the number of elements in
(Z/pZ)* which are exactly of order d. Show that 11)(1) = p - 1.
Edi(p—i)
(b) Let y be an element of order d in (Z/pZ)*. Show that the equation
X d — 1 —= 0 (mod p) has exactly d solutions, which can be made explicit. Deduce
that each element of order d is one of the co(d) generators of the group generated
by Y.
(c) Show that 0(d) < co (d) for all d, and hence that this inequality is an
equality if cil (p - 1). Deduce that (Z/pZ)* is cyclic.
2. Let p> 2, and g be a primitive root modulo p.
(a) Show that the order of g (modp2 ) is either p - 1 or p(p - 1). Show that,
in the first case, g ± p does not have order p - 1. Deduce that (Z/p 2 Z)* is cyclic.
(b) Let v> 2 and g be a primitive root mod p 2 . Show that gP -1 = 1 ± hp
with (h,p) = 1. Deduce that
(p(pv) m p
1 f(mod v+i)
9
E 13-8
pa(mod q) .
= (P ( q )—
x
x (a) log L(s , x) ± h(s)
c1 L — 9 (x2 xo)
1 — a 5_ eir-8 (r ± IT 1 -1 ) -1 (X 2 = X01 1TIQ > C2)
{
Deduce the existence of an absolute constant c > 0 and of a quantity 6(q) >>
q-1 (log q) -1° such that one has, uniformly for q < exp(logx)v io ,
2w(o
'0(x, a , q) = ;q) ± 0 (x exp{ c(logx) 1 / 1° } ± x 1-6( g) ).
co(q)
Show by the same method, but now employing the Polya-Vinogradov in-
equality, that, for each E > 0, one has /P (x; a, q) --, x I co(q) uniformly for
q < (log x) 2- '.
5. Set
00
E /T
T 1L(1 + Jr, x)14 dr « (qT)1±6.
x(mod q)
that the number N3 (x) of those integers n not exceding x and all of whose
prime factors are of the form 4m ± 3 satisfies
-V2Ax
N3 (X) r•-,
7r \/(logx)
Give a remainder form of this result. Generalise. [See Landau (1909), vol. 2,
pp. 641-669, and Wirsing (1956).]
This page intentionally left blan
Part III
Probabilistic methods
This page intentionally left blan
Densities
P(zt, n ziP, ) . ( 1 - —
al ) ( 1 — —b1 )
P({m}) ,_p( n 4 ). H( 1
P
m<p<n m<p<n
E An = 1,
n=1
with 0 < An < 1 for all n. For any integer sequence A C Z+, we then have
P(A) = Aa
aEA
and we can appreciate the discrepancy between such a model and intuition by
noting that the probability of a sequence virtually depends only on its initial
terms. Indeed, for each E > 0 there exists N = N, such that
P({1,2,...,N}) > 1— E.
We thus find ourselves in a typical situation, where the only theory at our
disposal irrevocably invalidates two very natural "theorems", viz.
(i) P(aZ+) = 1/a (a = 1, 2,...),
(ii) P(A) = P(B) MAO < cc),
where AAB denotes the symmetric difference of A and B. The choice (stan-
dard throughout the history of mathematics) of preferring intuitive theorems
to established theories having been made, it is not hard to circumvent these
difficulties. Introducing a divergent series of non-negative terms
00
E An =
n=1
00,
as x ---> oo. It certainly follows that any finite sequence has zero density, but
the concept thus introduced is not a measure on Z+:
(iii) sequences possessing a density do not form a a-algebra,
(iv) density is not countably additive.
The simplest choice consists in taking An = 1 for all n > 1. We thus obtain
the notion of natural density or asymptotic density, both terms being commonly
used. When it exists, the natural density of A is given by the formula
One denotes by upper (resp. lower) natural (or asymptotic) density the quantity
dA (resp. d A) obtained by replacing the symbol lim by lim sup (resp. lim inf)
in (2).
Before proceeding further we make four simple observations.
(a) Any arithmetic progression n a(mod q) has natural density, equal
to 1/q. This is immediate, since in this case we have
<x :E
{a — = [x/q] + 0(1).
The intuitive property (i) stated above is hence verified.
(b) A necessary and sufficient condition for the increasing sequence
<a2 < ... to be of natural density a, 0 < a < 1, is that
(c) There exist integer sequences failing to have natural density. Con-
sider, for example, the sequence A of integers n with leading digit equal to 1
in the expansion to base 10. We have
00
This explains why natural density echoes intuitive criteria: the density of a
sequence is the limit of its frequency in the first N integers.
272 111.1 Densities
0<k<log x/ log 10
1.3 Analytic density 273
=( E An(a)/ E Am(a)) + 9,
n<N,nEA ni<IV
with 101 < 1. By choosing E = e(a) --4 0 as a ----> cro , we hence recover a definition
analogous to that of Section 1, but where the function An also depends on x.
In certain instances, it may be established by a Tauberian argument that there
exists an equivalent procedure which is strictly of type (1). Even then, such a
framework can be useful: when the series E An (a) are well-chosen, considera-
tion of (8) rather than (1) can notably simplify the calculations, or favour the
use of certain analytic techniques. This stems from replacing truncation by a
smoother procedure.
A fundamental example is obtained from the choice S =O., + oo[, o-0 = 1
and An (a) = n (n > 1). We then get
-a .
(9) P, (A) =
neA
The presence of the Dirichlet series associated to the characteristic function
of A opens the possibility of using all the analytic and algebraic properties of
Dirichlet series for the computation of density—especially those connected to
the convolution product.
274 111.1 Densities
—a
(10) (a - )
nEA
but it is often convenient to keep the factor ((a) -1 when Euler products are
involved.
The expected link between analytic density and a density of type (1) is
made explicit in the following result.
Theorem 3. Let A be an integer sequence. Then A has analytic density if
and only if A has logarithmic density. In this case the two densities are equal.
Proof. Let us retain the notation L(x) = E a<x
1/a, where summation is re-
stricted to elements a E A. By partial summation, we have
+00 +00
h o-
I
e — ht dL(et ) = h2 j
1
L(x)x -l-h dx = 6 + o(1) (h --4 0+).
Notes
§§ 1.1 1.3. Many other types of density of an integer sequence have been
-
A + B := fa +b:aEA, bEB},
A fairly simple proof of (13) may be found in Halberstam & Roth (1983),
Chapter 1.4.
(c) The divisor density of Hall (1978). When this exists, it is the unique
number DA satisfying
where the symbol "pp" (for the French presque partout, i.e. almost everywhere)
indicates that the asymptotic formula thus described occurs as n Do inside
some suitable sequence of natural density 1. This definition is quite different
from those of §§1.1-1.3. For example, for any a, 3 in [0,1] one can find a
sequence A such that dA = a, DA = 0 (Hall, 1978). For other properties
one can consult Hall (1981), Tenenbaum (1982), Dupain, Hall & Tenenbaum
(1982), Hall & Tenenbaum (1986). [See also Exercises 3.9 and 3.10.]
§ 1.3. A deeper study of the properties of arithmetic functions with respect to
the laws P(A) has been undertaken by Nanopoulos (1975, 1977, 1982).
Exercises
We recall that P+ (n) denotes the largest prime factor of an integer n, with
the convention that P+ (1) = 1. For y > 2, A C Z+, we set
1. For n> 1, write ny := max{d : din, 1)± (d) < y}. Let A C Z+ be a sequence
of integers satisfying, for some y > 2, A = In : fl y E A y } . Show that for s E C,
a> 1, we have
E n' = ((s) LI (1 -P -8) E a—S .
nEA P<Y aEA y
dA = H (1_
P<Y aGA y
my A := 1-1 (1 — p
-1 )
P<Y aEA y
1 1 ) -1
-
n
= H (1 - - Imx .M(A) - my M(A)}.
nE.A4(A)N.M(A y ) p<x
P± (n)<x
(c) By applying the result of Exercise 4 show that 5.A4(A) < m.A4(A).
(d) Show that m.A4(A) = 5.A4(A) = A(A) = d.A4(A), in other words: Any
set of multiples has logarithmic density, equal to its lower natural density, and
also to its multiplicative and sequential densities. [A set of multiples does not
necessarily have a natural density, cf. Exercise 3.8.]
6. The Davenport-Erdos theorem (proof of 1937).
Let A C Z. For j > 1, let 9 (n) denote the set-theoretic characteristic
function of M := .A4(A(i ) )N.A4(A( -1 )) and write e 3 (n) := E Oi (n).
We consider the Dirichlet series
00
1
—
1
H (1 _P1_)-1.
P+ (a)<y P+(b)<y P<Y
(c) Let (Py . A --4 A y be the map defined by so y (a) = 7A(a y )a/ay . Show
that if coy (a) . coy (0!), then a7B(a /y ) = a'7rB(a y ). Deduce that (10 y is injective
and establish that A(x) < Ay (x) for x > 1, y > 2.
280 111.1 Densities
1, —1 1
dA > (
P+ (b)<y
b) (1 — E0.
P+ (b)> y
f 1
dA =
bes
where the right-hand side is interpreted as zero when the series diverges.
[This result is due to Saffari (1976) and Erd5s, Saffari & Vaughan (1979). The
proof indicated here is essentially that of Daboussi (1979).]
111.2
Limiting distributions
of arithmetic functions
{n : 1 < n < N}
equipped with the uniform law UN. In this context, an arithmetic function may
be viewed as a sequence of random variables
fN = (f, vN )
taking the values f (n), 1 < n < N, with probability 1/N. We intend to inves-
tigate from this perspective the classical probabilistic notion of a distribution
function.
A distribution function (d.f.) is a non-decreasing function F : IR —> [0,1]
which is right-continuous and satisfies F(—cc) = 0, F(co) = 1.
The set D(F) of discontinuity points of F is thus at most countable and only
contains discontinuities of the first kind. We denote by C(F) the complement
of V(F), that is to say the set of continuity points of F. A point of increase for
F is a real number z such that F(z ± e) — F(z — e) > 0 for any 6 > 0. Each
discontinuity point is a point of increase but the converse does not hold.
Write D(F) = {z,} 1 and sv := F(z) — F(zv — ). The function
(31)( z) =
increases exclusively by jumps and is constant in any closed interval not con-
taining some z„. It is called a step-function. The quantity s, is the saltus (or
jump) of F at zu . If D(F) is not empty then (I) is, up to a constant multiple,
a distribution function. Such d.f. is said to be purely discrete or atomic. It is
immediately checked that F — (I). is continuous.
282 111.2 Limiting distributions of arithmetic functions
The d.f. F is called improper (or is said to be the d.f. of an improper law)
if it equals a one-point step-function, say
F(z) =
{
< zo)
(z zo)•
A simple example of a continuous d.f. is a function of the type
F(z) = I h(t) dt
F = a1 F1 a2 F2 ± a3 F3
with a l , a2, a3 > 0, ai + a2 + a3 = 1, and where the Fi are d.f.'s such that
F1 is absolutely continuous, F2 is purely singular and F3 is atomic.
A sequence {FT,} 1 of d.f.'s is said to converge weakly to a function F if
we have
( 1 )
Ern Fn (z) = F(z) (z e C(F)).
n -+00
First let us show that FN(z), as defined by (2), converges weakly to F. For
z E C(F), we have
By (iii) the first term of this upper bound equals F(z ±E,ri) +o(1) as N --4 Do.
The other two may be estimated using (i) and (ii). Letting N tend to infinity
and then 77 tend to 0, we obtain
The last inequality results from the facts that F (z' , 71) is a non-decreasing
function of z' of and that z E C (F). Similarly we obtain
lim inf FN (Z) > 11111SUp F (2" — 6(71) , T1) = F (z) .
N-400
Thus FN converges weakly to F, and we may normalise this function so that
it becomes right-continuous.
It remains to show that F ( — oo) = 0, F(oo) = 1. Since F (z) = lim FN(z)
for z E C(F), we clearly have 0 < F < 1. Let 6 > 0. Choose z in C(F) so that
z > max{ f (a) : a < T (E)} +E. Then 1(n) > z implies either that a,(n) > T(E),
or that 11(n) — f (a,(n))I > 6 . By (i) and (ii) the corresponding density 1— F(z)
tends to 0 as n ---› 0+. This implies that F(oo) = 1. We obtain F( — oo) = 0
analogously.
In the probabilistic study of an arithmetic function, a natural normalisation
is obtained by introducing the expectation and variance of f relative to vN,
viz.
and
(4)
vN( f) = DN( f ) 2 := Ic : { z — E N (f)} 2 dFN(z)
1
{f (n) — EN(f)} 2 .
N
1<n<N
This suggests a different approach to the problem of the distribution of
values of an arithmetic function. Instead of studying the asymptotic behaviour
of FN (z) , we consider that of
converges weakly towards a d.f. H(z) and if the convergence of the expectations
f z dB-N(z) is dominated in Lebesgue's sense, then we have
co
EN(f)= AN ± { f-00 z dH (z) + o(1)} BN (N -- co).
valid for z and z ± h in C(F). The proof of (7) is analogous to that of the
classical Fourier inversion formula, and presents no new difficulty. If F and G
have the same c.f., it follows from (7) that F(z + h) - F(z) = G(z + h) - G(z)
for almost all z, h. Letting h tend to -co, we obtain that F(z) = G(z) al-
most everywhere. Since F and G are non-decreasing and right-continuous, this
implies that F = G.
The famous continuity theorem of Paul Levy links the weak convergence of
the d.f.'s to the pointwise convergence of c.f.'s.
Theorem 3 (Continuity theorem, Levy, 1925). Let {Fn } 77_ 1 be a se-
quence of cl.f.'s and {co n l n"=1 the corresponding sequence of their c.f.'s. Then
Fn converges weakly to a d.f. F if and only if so n, converges pointwise on IR to a
function sp which is continuous at 0. In addition, in this case, so is the c.f. of F,
and the convergence of co n to co is uniform on any compact subset.
This classical result is established in detail in most books on probability—
see for example Cramer (1970), Feller (1971), Loeve (1963), Lukacs (1970). Here
we limit ourselves to indicating the main steps. The first point is an identity
which follows easily from (7).
286 111.2 Limiting distributions of arithmetic functions
Lemma 3.1. Let F be a d.f. and co its c.f. For z E T , h> 0, we have
fz±h z 1 f' (sin(T/2)V e _ iyz/hco(T) dr.
f
1
Proof of lemma. It suffices to apply (7) to the d.f. F o (z) := (1/h) F(t)dt.
1h
The c.f. of Fo (z) is co(r)(1 - e -irh )/irh, as shown by partial integration. An
obvious change of variable then yields the result.
Proof of the theorem. The necessity of the condition is easy. If Fn converges
weakly to F we obtain, by a standard argument, that for every E > 0, there
exists a real number X = X(e) such that
When the limit law F(z) is absolutely continuous with bounded density, say
1 T ( 7 ) — V7 (7 ) dr
(9) sup l Fn (z) — F(z)1 < sup Ih(t)1 +
zEIR / tEIR f n •
When f is additive, the function n e irf(n) is, for each fixed T, a multiplicative
function with modulus 1. The problem of the existence of a limit law for f is
thus equivalent to that of the existence of the mean value of a multiplicative
function with values in the unit disc. We shall see in Chapter 4 how to exploit
this duality.
288 111.2 Limiting distributions of arithmetic functions
Notes
§ 2.1. For further details concerning the theory of d.f.'s see Feller (1970,1971),
Loeve (1963), or Lukacs (1970).
Theorem 2 is identical to Lemma A2 of Hall Sz Tenenbaum (1988).
The normalisation (5) for d.f.'s of arithmetic functions has an obvious theo-
retic interest. In practice, it is often preferable to keep a greater flexibility and
to ask the more general question of determining those functions AN, BN for
which the sequence of d.f.'s
converges weakly to a d.f. H(z). This is the point of view adopted by Elliott
(1979). Today his book is considered to be the indisputed reference for proba-
bilistic number theory.
§ 2.2. Our proof of Levy's theorem essentially follows that of Cramer (1970).
The theory of c.f.'s is particularly suitable for the treatment of convolutions
of d.f.'s. The convolution product of two d.f.'s F and G is the d.f. H defined by
If p(r), -y (r) are the c.f.'s for F, G respectively, then n (r) = spr)-y(r) is the
c.f. of H.
Pars eval's formula reads
00
is also in L i (R).
Notes 289
Let Cb(R) denote the space of all continuous and bounded functions on R.
Equation (11) remains true in the form
TA- 00 1 A
(12) g (z) dF (z) = lim — fIT, (1 _ I )g'(y)(p(r) dr
. I -oo A ---+oo 27 j _ A \ A
whenever g E L 2 (R) n Cb(R). This can be quickly established using the proper-
ties of the Fejer kernel
with Fourier transform f(7 -) = (1- I T I /A) ± . Indeed, under the above assump-
tions for g, it is classically known that g),:= g * wA converges towards g both
pointwise and in quadratic mean. Furthermore I I gA 1100 <11g1100• Now -ix' =
lies in L i (R) as product of an element of L 2 (R) by a continuous function with
compact support. Applying (11) to gA , and evaluating the limit of the left-hand
side by Lebesgue's theorem of dominated convergence, we obtain (12).
Choosing in (12), for each fixed real y, g(z) = sin{T(z - y)}/{T(z - y)}, so
that -4(7-) = 7re -n-Y /T (IT! < T), -4(r) = 0 (17- 1> T), we may write
1 i
(15) lim
T --400 2T _T
00
(b) (Jessen & Wintner, 1935) If each Fn is atomic, then F is a pure law, i.e.
F is either atomic, or continuous and purely singular, or absolutely continuous.
The equivalence of conditions (i), (ii) and (iii) is an easy consequence of the
continuity theorem. Put
If the product (16) converges weakly to F with c.f. co, then co is continuous and
co(0) = 1, hence co does not vanish in a suitable neighbourhood of the origin,
say IT' < 6. Paul Levy's theorem then immediately implies (ii). The implication
(ii)(iii) is a consequence of the general inequality
(18) 1— co(T) (1 —
which holds for any c.f. c/o and follows, by integration with respect to dF(z),
from the elementary inequality 1— cos(zr) > 1-4 (1— cos(2zr)). Applying (18) to
c,om,n , we see that lim m, n , 00 (p rn,n (T) = 1 for IT! < 26 and finally, by iteration,
for all T. We deduce (iii) from this by the continuity theorem. It remains to
show that (iii) implies (i). By a new application of the continuity theorem, we
Notes 291
derive from (iii) that cp,,,,(r) 1 uniformly on every compact subset. This
implies that the product fr_ i coi (r) converges uniformly on every compact
subset, its limit being ipso facto continuous at the origin. A last application of
Theorem 3 yields the required conclusion.
A sufficient condition for the existence of an absolutely continuous limiting
distribution is given by Babu (1992)—see Exercise 7 for a special case.
A complete proof of statements (a) and (b) of Theorem 5 may be found in
Elliott (1979), lemma 1.22. The proof of criterion (17) rests upon the fruitful
notion of concentration function, introduced by Paul Levy in 1937. We give
below a brief survey of some basic results and inform the reader that a deeper
account on this topic may be found in the book of Hengartner & Theodorescu
(1973). The concentration function Q F of a d.f. F is defined on IR+ by the
formula
F (t) := sup (F(z + f) — F(z)).
zER
We further define the concentration Q(F) := QF (1) and observe that, for all
> 0, we have QF (f) = Q(t), with Ft(z) := F(tz). It is easy to see that F is
continuous if, and only if, QF(f) ----> 0 as f —> 0+, so the concentration function
may be considered as a measure of the distance from F to the set of continuous
d.f.'s. This notion fits particularly well with the study of convolutions of d.f.'s
(or, what is the same, of sums of random variables), and indeed we have
for any pair (F, G) of d.f.'s. Inequality (19) immediately follows from the defi-
nition of a convolution product:
Q(F * G) = sup
zER. I fz<x±y<z±1
dF(x) dG(y)
Prior to proving (20), let us see how this enables us to establish assertion
(a) of Theorem 5.
Assume first that E°° . (1 — cri ) < oc. We may suppose without loss of gen-
erality that (xi 0 for all j and hence that 1-17_1 crj > 0. For each j > 1, we
choose a real number u i such that crj = Fj (uj )— Fj (uj —). The series Er_i ui is
necessarily convergent: otherwise, there exists a number > 0 and two integer
sequences mk, nk tending to infinity such that I E
mk <nk u3 • > for all k,
whence
H Cr <
____ fizi>6 dFink,nk (Z),
ink<i<nk
3
and this yields the desired contradiction, for the left-hand side approaches 1
as k cc whereas the right-hand side tends to 0. Setting u := U we ,
J -0,0 iz-1/2
dF(v)dz >
+00
-cc
(F(z ) — F (z — )) 2 dz.
Notes 293
1 T sin(712)
F(z ) — F(z — ) = — lim cio(r) e—i " dr
27i T—oo I T T/2
n
C2(F) < K2 f H koi (T)I dr
j=1
(22) 1
_< K2 exp (1 - () 2 )}d
—1 1<j<n
Put / := [ and
+00
pi := 1
111Ni dGi (z) = dFi (y) I dFi (x) > 1 — Q(FJ) > O.
-co
f Gj(z)/pj if z /,
113(z) )/p if z E /.
We have dGi > pj dHj and insert this bound in (23). Put T := Ei<j<npi,
ai := TIpi , so that i<j<n l l ai = 1. By Holder's inequality with exponents
-
ai , we deduce from (22) and from the lower bound derived from (23) by intro-
ducing dHj that
n j1 +00 1/ai
Q(F) K2 H exp — aipisf oo (1 — co (rz)) dHj (z)} dy
j=1 —1
I
J-00
6—T(1-00s(yz))/2
(z)
fl -Foo 1 1/a 3
e -T(1-008(rz)) / 2 d y)
Q (F) < K 2 H dHi (z)
_1
=1
To bound the inner integral, we may assume that IzI > since / has zero
measure for dHi(z). It follows that
2 /I —T(1—c0sv)/2 dv
e —T(1—c08(T d- 7z)/2
- --j0 e
< 2 ( 1 ± Izi) re —T(1—cos v)/2 dv
- Izi
_Tv2/7r2
19
_< (4 ± f+00 e dv < .
7 0 — -VT
This yields the upper bound (20) when T > 1. Since the result is trivial other-
wise, this completes the proof of Theorem 6.
Exercises
P
Pin, P> Y
(d) Show that f(n) has a limiting d.f. [Deduce from (b) and (c) that the
conditions (i) and (ii) of Theorem 2 are satisfied.]
2. A theorem of Schoenberg (1928). For each T E R, determine an arith-
ir
metic function A, satisfying (o(n) 1 n) = Edln A(d).
r Show that the series
11)(T) := ET 1
AT(d)/d is absolutely convergent, and thus recover the result of
Exercise 1. Show that the limit law F is given by
F(ez) = lim Fp1 * Fp2 * • • • * Fpn (Z)
n-4.co
where pj denotes the jth prime number and dFp, is a linear combination of
two Dirac measures. By applying Theorem 5(a), deduce that F is continuous.
[This property can also be established by directly appealing to criterion (15) of
the Notes—cf. the end of the proof of Theorem 4.1.] Show that F is pure by
using Theorem 5(b).
3. Reconsider Exercises 1 and 2 but with c f(n) 1 n (where o- (n) := Edin d)
instead of co(n)In. Generalise.
4. A sufficient condition for singularity (Erdos, 1939). Denote by 13±(n) the
largest prime factor of an integer n, with the convention that P±(1) = 1.
Throughout the exercise assume as given a non-decreasing, integer-valued func-
tion R(y) satisfying
(0 E P 11pR(p) < oo,
(ii) for sufficiently large y, there exist R(y) integers m1 < n/2 < • < 771,R
such that
(a) P± (ma) 5- y, ii(rna)2 = 1 ( 1 5- j 5- R(Y))
(0) E 1/v (m2 )_>c log y,
1<j<R(y)
where c is an absolute positive constant.
296 111.2 Limiting distributions of arithmetic functions
(c) Let e > 0, and y = y(E) be sufficiently large. Set a(n) := flp<y,pin p.
Show that, for a > 1, u(a) 2 = 1, 13+ (a) < y, we have
d{n : a(n) = a} = co(a) —I- 11(1 — p -1- )
P<Y
and that Theorem 2 can be applied to provide a second proof of the existence
of the d.f. of f.
(d) The number y(E) being chosen to be sufficiently large, write
I := U (f On ,i) ± [ — R(Y) -2 , R(Y) -2 1) •
l< j < R(y)
Show that clIn : f (n) E II >> 1. Conclude.
5. Show that, for all 6 > 0, the function R(y) = y'5 satisfies the hypotheses of
Exercise 4. Deduce that the d.f.'s of cp (n) I n and a (n) I n are purely singular.
[Assume the results of Exercises 2 and 3.]
6. Let F(z) be a d.f. with c.f. yo(r). Show that if so E L2 (R), then F is absolutely
continuous. [One can appeal to Plancherel's theorem and to Parseval's formula
(11) with an arbitrary function g of class C2 with compact support.]
7. A result of Saffari (1979).
Let f be the strictly additive function defined by f (p) = (log p), where
a > 0 is given.
(a) Show that f possesses a d.f. F, and express its c.f. c,o(T).
(b) By calculating kw(r) 2 show that
8. Let h(n) be the number of prime factors p of n such that n 113 < p < n 1 /2 .
Show that h(n) has a distribution function.
1
Fn (z) := 1.
r(n)
din, f (d)<z
Show that A(n) << 7(n)/ \/(1 + w(n)), and establish that this upper bound is
optimal up to the value of the implicit constant.
10. Chains of divisors. For each integer n, we say that a set {d1, ... , dh} of
divisors of n is an n-chain if
(i) 12(d 1 ) + 12(dh ) = C2(n),
(ii) di ±i /di is a prime number for 1 < j < h.
(a) Let n = nip', with p t m. Show that if {d 1 , ... , dh } is an m-chain,
then Cr := {d r , drp, . . . , drpv+1-,,dr+ipv+i-r,...,dhpv+1-1
, is an n-chain for
all r, 1 < r < R := min(h, v + 1). Show that the C r induce a partition of
fdip 4 : 1 < j < h, 0 < p, < vl. Deduce, by induction on k = w(n), that for
each n the set of all divisors of n is a disjoint union of n-chains. We denote by
-y(n) the maximal number of these n-chains.
(b) Show that each n-chain contains exactly one divisor d such that
C2(d) = [12(n)/2].
(c) Show that the function S2 fulfills the condition of Exercise 9 with c = .
Deduce an inequality for -y(n).
[This result, as well as that of question (a) of the next exercise, is due to
de Bruijn, van Tengbergen eY Kruyswijk 0949- 54]
11. Primitive sequences. We recall the notation -y(n) from the preceding ex-
ercise. An integer sequence A is said to be primitive if none of its elements
divides any other. For each integer n, we put T (n, A) := Edin, dEA 1.
(a) Show that 7(n, A) < ry(n) (n > 1) for any primitive sequence A.
298 111.2 Limiting distributions of arithmetic functions
1 log x
(24) sup a<
A primitive a<x, aEA V(log2 x)
§ 3.1 Definition
The concept of normal order of an arithmetic function corresponds, in prob-
abilistic number theory, to that of almost sure equality of random variables in
probability theory. More precisely, we say that an arithmetic function f has
normal order g if g is an arithmetic function such that, for any e> 0, we have
In the course of this and the following chapters we shall see that numerous
arithmetic functions, with apparently chaotic variations, do indeed possess a
normal order. Their behaviour is thus satisfactorily described "almost every-
where", and the collection of information of this type contributes to elaborating
our current model of a "normal" number i.e. a number which is "random", in
a sense suited to natural density. That such a concept lends itself to quantitative
assertions, and can ultimately serve as the basis of a complete mathematical
theory, is by no means an insignificant attraction of the probabilistic point of
view on the theory of numbers.
f= f (Pi' )epv*
pv <N
3.2 The Turdn-Kubilius inequality 301
= ( 1 — P -1 )P-v ± 0(N -1 ).
For q p, we can further write
N 1 F N
E N (GI-, . gfti) - N
1 ([ pi, ie. ] [pv el [p
+1 v q12-1-1 i ± Lp v+i g ,,,+11)
( 1 p -i )p -, ( i _ q -i) q - A ± o ( N -1)
= EN(p)EN(q) ± 0(N -1 ).
(8) V(TIN) =
p<N
“(13)
p<N
E((?) ) 5_ >' Y
4
p<N v=1
00
I
f(P11 19-V
)2 1
where the first inequality is almost optimal, since it easily follows from the
Cauchy-Schwarz inequality that E(( p ) 2 < E((p2 )/p for each p, from which
V((p) > (1 - 1/p)E((p2 ). The Turan-Kubilius inequality states in essence that
the bound (8) which follows, by a well-known probabilistic theorem, from the
independence of the (19-remains valid for 17N ( f) up to an absolute multiplica-
tive constant.
302 111.3 Normal order
with
1/2 ) 1/2
6(x := 4x 11 ( E pu e)
E ± 4x-1- ( qmp - v )
pi i go<x
POq qi- <x
By standard partial integration, the details of which we omit, the prime number
theorem readily yields the estimate
say. We have
f(pv)f(e)
n<xp'lln, e IIn
Ap ,. , ) 2 x _i E 1+ E f(p,, )N1t )x -1 1.
pv <x n<x io' qi-' <x
13 lin
`j POq
The first inner sum does not exceed xp - v, and the second is at most equal to
xp - v q - I' (1 - 13 -1 )(1 - q - 1 ) ± 2. Therefore
= f(pv) f (Pv)x -1 E
n<x Pv Iln pv:x n<x, Pvlin
from which, inserting the lower bound xp - v (1 23 -1 ) - 1 for the inner sum, we
deduce that
(13) Mi > A(x)- f (pv).
pv <x
Substituting (12) and (13) in (11), we obtain that the left-hand side does
not exceed
B(x) 2 + 2x -1 E f (pi') f (V') 2A(x)x -1 E f (pv).
<x, 2*q pv <x
and
1/2
f (Pv) E
A(x) v/2
/2 p 5_ (B(x) 2 . B(x)2 E pv) .
<x pv <x pv <x
then we have
(15) C(x) = +0(1/V(logx)).
This result gives a remarkable measure of the discrepancy between probabilistic
number theory and the probability theory model: the fact that C(x) has a finite
limit exhibits the influence of the (partial) independence of the Gv the fact
that this limit is not equal to 1 underlines the limitations of this influence.
304 111.3 Normal order
r<N
so that, with the notation (14), the Turan—Kubilius inequality can be written as
2
EE CnrYr < NC(N) 1Yr1 2
n<N r<N r<N
Since the y r are arbitrary complex numbers, we can apply the duality prin-
ciple of Lemma 1.4.5.1. It follows that
2
E E cnrxn Nom E 2
1A(N) — A(n)1 =
E gp ilp _v (i p_, )
n<pv<N
) 1/2
< (
N
E
/N<pv<N
P-11
pv<N
If (Pv )1 219—v
for all E> 0. However, the left-hand side does not exceed
(21) EN(
f (71E)A—(N
Ar " ) < EB A( N
( N) ) 2 = o(1),
so that (19) is certainly satisfied. We thus obtain the remarkable result that the
number of prime factors of an integer n, counted with or without multiplicity, is
normally asymptotic to log2 n. It is this statement which is usually referred to
as the Hardy—Ramanujan theorem (1917). The following quantitative version
is due to Turan (1934).
(23) Rn _< N: lw(n) — log2 NI > "(N) 010g2 N)}1 < N/(N) 2 .
The original proof of Hardy & Ramanujan rested on an upper bound, uni-
form for N and k> 1, for the local laws vN{n : w(n) = k}. This implied rather
involved considerations—cf. Exercise 1. It was with the intention of simplifying
the argument that Turan proved in 1934 the initial form of Theorem 1.
In view of the inequality 2w(n) < 7- (n) < 2C/(n) , which is valid for any integer
n> 1, we immediately deduce from Theorem 4 that
n )log 2+0(1)
(24) r(n) = (log PP
for any function (n) --> oo. Stricto sensu, these relations ought to be inter-
preted as saying that log r(n) has a normal order. However (24) and (25) are
often referred to by the rather loose statement that r(n) has normal order
Th log 2 .
(log)
For the first time we meet here a common situation in probabilistic number
theory: the mean value (here log n) does not reflect the normal behaviour (here
(log)i log 2 )•
\ . This phenomenon can be explained by the fact that the sum
E r(n)
n<x
3.4 The Hardy-Ramanujan theorem and other applications 307
is dominated by a small number of abnormal integers, for which -7- (n) is large.
We shall see later (Exercise 11(b)) that these are precisely those integers n such
that w(n) ,---, 2 log2 n. In fact, for each E> 0,
E<x
fl
T (n) -, x log x -, (n) (x -4 oo).
1
(26) co (n; a, q) = cp(q) log2 n + 0 (e(n),\Alog2 n)) PP.
we have
Note the disparity between the "normal" value (log n) 1°g k and the mean value
(log n) k-1 /(k - 1)! of Tk(n) — cf. Theorem 11.5.3.
(c) For k > 1, let 6k(n) be the number of representations of n in the form
n = [mi, ... , rnk] with m 1 > 1, . .. , mk > 1. Then
(29) ok(n) = (log n) 1°g (2k -1) exp {0 Wn)V(log 2 n)) 1 pp.
308 111.3 Normal order
Here again the normal order is quite far from the mean value: using the convo-
lution formula Sk * 1 = T k , it may be easily deduced from Theorem 11.5.3 that
we have
with
00
Ck :=
1
H
(2k — 2)! -A-pi-
(1 - p-1
21c
E(ii + i)kp-ii.
v= 0
Proof. Let 8(x) be the left-hand side of (31) and L(x) := E n<x f (n)/n. We
have trivially
S(x) log x = E f (n) log --xn- -I- E f(n) E log p -I- E f (n) E log p'
n<x n<x Min n<x v>2, p`i Iln
(say).
3.5 Effective mean value estimates for multiplicative functions 309
where we have used (32) and the fact that L is non-decreasing. This completes
the proof.
In practice Theorem 5 is often used in the following form.
Corollary 5.1. Let A 1 , A2 be constants such that A1 > 0, 0 < A2 < 2. For any
multiplicative arithmetic function f satisfying
we have uniformly
00
Remark. Here we have not sought to optimise the numerical constants appear-
ing in (35).
Proof. It is clear that (33) implies conditions (i) and (ii) of Theorem 5. By
Theorem 1.1.4 we have A < A 1 log 4. In addition,
B _< A 1 E log V
(A2) 11-1
< 2)02
log p
P P v=2 P (p - A2) 2
P
oo
2)0 2 log 2 +4A log n
<
— (2 — A2 ) 2 n — 2) 2 •
n=
310 111.3 Normal order
f (if )p - v ,
n p<x v=0
n<x
Using Theorem 1.1.7 in the form F(t) := E p<t (logp)/p _< log(4t) (t > 2) and
partial summation, we readily get
log 4 - F(y)
E 1P < 1 ± ± log2 x - log 2 y (x > y ? 2),
log y
y<p<x
from which it follows, taking y = 300, that K(x) <2.87 < K(2) for x > 300.
This completes the proof.
The following result, which is an immediate consequence of Corollary 5.1,
will help to establish a strong form of Theorem 4. For t > 0 we write
Theorem 6. Let yo > 0. Uniformly for 0 < y < yo , x > t > 2, we have
(37) E yo., (n,t) ‹ x(log t)Y -1
n<x
Under the additional hypothesis yo <2, the same estimate holds for C2(n, t).
Proof. For each fixed t the function ni--+ yw(n't) is multiplicative and satisfies
(33) with A 1 = 1 + yo, A2 = 1. This condition is still realised for y °( ") , with
A1 = 1 + yo, A2 = max(1, yo ). The bound (34) is then
Theorem 7. We have, uniformly for x > t > 3 and 0 < e < \/(log2 t),
(38) 1ln _< x : lw(n, t) - log 2 tl > e N/(log 2 t)} I < xe-e /3 .
Furthermore, if < ( log2 0 1/6 we can replace e- e / 3 by e-e/2 . The same
assertions hold for 12(n, t).
When t = x, we have co (n , t) = co (n) for all n < x. In its range of validity,
the bound (38) hence constitutes a remarkable improvement of (23).
Proof. Let 6 := / \/(log 2 t), so that 0 < 6 < 1, and let x(n) denote the
characteristic function of the set of those integers n such that
with
y-1
(41) Q(y) := y log y - y + 1 = f log(1 + v) dv (y > 0).
o
For 0 < 6 < 1, we have Q(1 ± 6) > 1-62 , and Q(1 ± 6) = 62 + 0(63 ). This
implies the stated bounds. The proof is unchanged in the case of C2(n, t).
Proof. Let e be an arbitrarily large real number. Plainly, e(n) > e pp. It is
hence sufficient to show that the upper density of those integers n which do
not satisfy (43) with e(n) replaced by e tends to 0 as e oo. To this end, we
introduce the check-points
t3 := exp exp j (j > log2 e),
and note that, since log 2 t i — log2 t3 < 1 and since co (n, t) is a non-decreasing
function of t, we have for sufficiently large e
(n, t) log 2 t
— w (n , t) j
—
Notes
with
P 1ln,
f(m) := E f (13 v )p, (n) = . f (PP) if P
Li: pv <N ( 0 otherwise.
The random variables fp are not independent. If they were, the variance
VN (f) would be equal to
B( f)2 := E VN (f p ).
p<N
which satisfies
The inequality on the right is obvious; we obtain the one on the left by con-
sidering separately the cases pv+ 1 < N and Np -1 < pi' < N, applying
314 111.3 Normal order
the respective lower bounds 19' (1 — 3/2p) and p'(1 — 1/p). Relation (45)
follows easily from (47) and the Cauchy—Schwarz inequality
We actually have
and
the bound (49) follows from (50) by appealing to the probabilistic theorem on
the variance of a sum of independent random variables. Our proof of Theorem
1, which essentially follows that of Elliott (1979), is of direct type. The following
remarkable result of Ruzsa provides an indirect approach, of which (50) is an
immediate consequence.
Notes 315
§ 3.3. Elliott (1979), chapter 4, gives several variants of the inequality (18) of
Theorem 2. For instance we have the following result.
Theorem 12 (Elliott, 1979). For any sequence fan : 1 < n < x} and all
x > 2, we have
2 36
E P E
p<N/x n<x, pin
an — an
log x
)
n<x
§ 3.4. It was with the aim of proving Theorem 4 that, in his 1934 article,
Turan stated the inequality of Theorem 1 for the function w(n) without giving
a precise value to the constant. He soon generalised the result to the case
when 0 < f(p) <C (1936). The method was later systematically exploited by
Kubilius (1956, 1964).
§ 3.5. Theorem 5 appears in this form in Hall & Tenenbaum (1988), theo-
rem 01. It may be seen as a simplified, and slightly weaker, version of a result
of Halberstam & Richert, itself generalising a theorem of Hall (1974) cf. Ex-
ercise 13.
Theorem 13 (Halberstam–Richert, 1979). Let f be a multiplicative non-
negative function, which for some suitable constant lc > 0 satisfies
(55) 1 )i Kx
Ef(n)_{14_0( log x I J log X n
n x
n<x
The implicit constant depends at most on the constants implied in (i) and (ii).
Analogous results for upper and lower bounds are given by Hildebrand
(1984a, 1987b).
The constant n of (55) is optimal, as shown by the choice
§ 3.6. The more precise version of Theorem 9, announced by Erdos (1946) and
proved in detail in the book of Hall & Tenenbaum (1988), chapter 1, is the
following.
Notes 317
Theorem 14 (Eras, 1946). Let 6 > 0, and let (n) a function tending to oo
sufficiently slowly with n. We have
log2 pi (n) - j
(56) sup <1+ pp.
(7-t)co(n) 0 2i log2 i)
Moreover, if 1 E is replaced by 1 - E on the right-hand side, the density of
integers n satisfying (56) is zero.
The general idea in the proof of this result consists in showing that the law
of the iterated logarithm is asymptotically applicable to the arithmetic function
w(n, t) = X()
p<t
Theorem 16 (Hall & Tenenbaum, 1988). Let e > 0, and let (n) a func-
tion tending sufficiently slowly to co with n. We have
Ii1/ log 21 .
(60) di (n) ‘--z-2, exp
This agrees well with the normal order (log n) kg 2 of r(n), but can lead to
false conclusions if considered as an asymptotic formula with an excessively
local interpretation. Thus an old conjecture of Erdos, established by Maier Sz
Tenenbaum (1984), states that
d+1(n)
(61) min —*1 PP.
ii<T(n) di (n)
This implies that the above minimum is achieved as j --+ oo, which contradicts
(60) if we define the symbol,---z, with too precise a meaning.
Another model which also clashes with an excessively strict interpretation of
(60) as a description of the structure of the set of divisors of a normal number is
that of a fractal object with dimension log 2. Such a model may be shown to be
consistent with some rather strong arithmetical results concerning arithmetic
functions closely linked to the local structure of the set of divisors; see Mendes
France Sz Tenenbaum (1993).
Theorems 14-16 do not provide any information on the sequences of pi (n) or
di (n) for small values of j. In 1979, ErdOs introduced the densities Aj (d), Ai (p),
of the sequences of integers n satisfying di (n) = d or pi (n) = p, respectively.
An asymptotic study of these densities was undertaken by ErdOs Sz Tenenbaum
(1989a). In particular, Ai (d) > 0 if and only if r(d) < j < d.
Exercises 319
Exercises
and
e —x Xk 013)e—Q(13)x
E
k> 13x
k! < (0 — 1) V(27rx)
with Q(y) := y logy y ± 1. [See Norton, 1976, and, for more precise bounds,
—
1978.]
(b) Consider the function 7k (x) := Ifn < x : w(n) k} . Show that for
k > 1, x > 0,
(k + 1)7k±i(x) 5_ E
7rk (X/Pli )
pv<x
.
Deduce the existence of two absolute constants co, cl, such that
co x (log2 x ± ci ) k-1
71k(x) < (x > 3, k > 1).
log x (k — 1)!
(c) Recover in this way Theorem 7 for the function w(n) = c.o(n, x).
(d) Adapt the method to handle the case of the function C2(n).
[For generalizations and lower bounds, see Norton (1976, 1979, 1982), Balazard
(1987), and the references given in Exercise 4.5.]
2. Let A(n) be the additive arithmetic function defined by A(pv) = vp--cf.
Alladi & Erdos (1977, 1979).
(a) Show that
7122 {
V A(n) = 1 +0 ( 1 ".
ntxd
12logx log x ) I
(b) Show that for all n> 1 we have P+ (n) — 1 < A(n) < It(n)P+ (n).
(c) Show that the sequence of integers n such that P+ (n) > N/n has natural
density equal to log 2.
(d) Let x(n) be the characteristic function of the set A of integers n such
that P+(n) < 2/ 5 . Show that x(n) > 1 — pin, p>n 2/5 1, and deduce that A
has positive lower density.
(e) Show that A(n) does not have a monotonic normal order.
320 111.3 Normal order
3. Let g(n) be the strongly additive function such that g(p11 ) = logp.
(a) Show, with the notation of §2, that B(x) --, A(x)/ /2.
(b) Show that Eri<x ( log n — g(n)) << x.
(c) Deduce that g has a normal order, not accessible by the Turcin-Kubilius
inequality.
4. Let 6> 1, g6(n) := Ep1n (logp) 6 . Show that
with a n := log P± (n) I log n. Show that for each a, < a < 1, the sequence of
integers 11 such that a n, > a has a natural density, and compute this density.
Deduce that g6 does not have a monotonic normal order. [This also holds when
0 < 6 <1, see Exercise 5.41
5. For y > 2, let x(n, y) denote the characteristic function of the set of integers
n such that 13+ (n) < y and let W(x, y) be its summatory function.
(a) Show that for each a> 0, we have
_ c2 log x 1
41(x, y) < ci x exp {
log y I
C2(n, t) — log2 t
(a) Show that, for each E > 0, one has sup < 1+E, pp.
c(n)<t<n L(t)
(b) Deduce that
(c) Let 7(n; t, u) :=1{d : On, d> t, PH- (d) <u}. Show by using the result
of the previous exercise that one has
log t
E r (n; t , u) < x log u exp {_ c2 log u
}(2 < t, u< x).
n<x
P<Y P+(b)<y
By choosing z suitably, deduce that dB y' < K / (log y) -6 , where K' is ab-
solute, and thus E y < K0 (logy) -6 with K0 = K ± K'. [In fact one has
(log y) —6 e -co V(l0g2 y log3 y) < Ey < (log y) -6 (log2 y) -1 / 2 . See Hall & Tenen-
baum (1988), chapter 2.]
322 111.3 Normal order
(a) Show that x(d) = (1/q) e 2"ii ( w (d)- a ) /q (d > 1). Deduce the exis-
tence of dA and indeed that dA = 1/q.
(b) Show that r(n, A) = (1/q) IIIj e-21rija/q flPv lin (1 ± Ve27rii1q).
With the notation m := ['pun p, establish that (C)
(d) Show that the arithmetic function n w(m) is additive and has normal
order log 2 n.
(e) Deduce that DA = 1/q.
10. Another result on divisor density. Let 6 be a fixed real number, 0 < 6 < .
Set
A(6) := {m : m 3, co(m) + 6) log2 m}.
(a) Show that dA(6) = 0.
(b) Show that for each 6, 0 < 6 < 1, one has
and deduce that Rd : din, log 2 d> (1 — 6) log2 nli r-I- (n) pp.
Exercises 323
(c) Show that E din Y w(d) —< (1+y) ) (y > 0, n > 1) and use this inequality
to show that, for all 77, 0 < 77 < , one has
Rd : dln,w(d) q- +77)5-2(n)}1 , r(n)
(d) Show
Show that DA(S) = 1.
11. Method of vanishing moments. Show, for A- < z < , that
Sharpen the result by choosing a = a(x) ---> log 2—, 0 = 0(x) —> log 2+.
(b) Establish that the upper bound En<x
r(n)yw(n) < x(logx) 2Y -1 holds
uniformly for x > 2, 0 < y < yo. Deduce that, for 0 < E < 1,
(c) Establish, by making suitable use of Theorem 2.2, that f (n) has a lim-
iting di. [Ercl5s & Hall actually prove that this d.f is continuous.]
13. A theorem of Hall (1974). Let f be a multiplicative function with 0 < f < 1.
Set 8(x) := E n<x f(n).
(a) Let k (n) := ft*, p. Show that
and deduce that the left-hand side does not exceed xE m‹x f(m)Im+ 0(x).
324 111.3 Normal order
(b) Let N (x , y) be the number of integers n < x such that k(n) < y. Show
that for all y, 1 < y < x, one has
E f (n) log (x I k(n)) < N(x,y) log x + S(x) log(x/y).
n<x
where the implied constant is absolute. [One may choose y = x/ log3 x in (b).]
14. Let f be an arithmetic function with a non-decreasing normal order. Show
that, for all E> 0 , we have
1{(m,n) : 1 _< m _< n _< x, f (n) _< (1 - e)f (m)}1= o(x 2 ).
by proceeding as follows. For the upper bound, use Theorem 5. For the lower
bound, first show that A( n) ->'> dd'In 2—S2(dd ) , where the asterisk indicates
-- 7L.--1 *
that summation is restricted to those pairs of divisors fd, d'I with (d, d') = 1
and d < d' < 2d. The inner sum may then be estimated by the Selberg-Delange
method. [The lower estimate in (62) is due to Hall & Tenenbaum (1982) and
the upper estimate to Hooley (1979). Tenenbaum (1985) has shown that the
exponent 4/71 - 1 may be replaced by o(1); see also Hall & Tenenbaum (1988),
chapter 7.]
I11.4
for a real additive function f(n) to have a limiting distribution is that the
following three series converge simultaneously for at least one value of the
positive real number R:
1 E f (P) 2
(a) E P, (b) .
, (,)
P P
If (P)I> R If (P)I R If (p)I<R
When these conditions are satisfied, the characteristic function of the limit law
is given by the convergent product
00
(1) e iT f (p v ) p — v .
P v=0
The limit law is necessarily pure. It is continuous if, and only if,
1
(2) E —cc'
P
f(p)0
Remarks. (i) Should the three series above converge for some value of R> 0,
they do so for all values of R > 0. Therefore, in practice, there is no loss of
generality in taking R = 1.
(ii) We emphasise that the existence of a limiting distribution for f (n) does
not depend on the values f (pv) with v ? 2.
326 111.4 Distribution of additive and mean values of multiplicative functions
The original proof of this result, due solely to Erdos for sufficiency, is direct,
and rests upon delicate estimates for the frequencies v x {n : f (n) < z} . We shall
give here a simpler proof, due to Delange (1961), which depends in an essential
way on the continuity theorem of Paul Levy (Theorem 2.4). Delange establishes
the following fundamental result, which has an intrinsic interest, and of which
Theorem 1 is an easy consequence.
Theorem 2 (Delange). Let g be a multiplicative function with values in the
unit disc.
(i) If g possesses a non-zero mean value
M(g) := lim
X-#00 x
1 >- - -,- g(n)
n<x
then we have:
(a) the series Ep (1 g(p))/p converges,
—
(b) there exists some integer v > 1 such that g(r) —1.
(ii) If condition (a) above is satisfied, then g has a mean value, given by the
formula
00
M(g) = TT (1 — g ( p i/ ) p - V .
( 3 )
)
v=0
We shall give the proof of this result in the next section. For the moment,
let us show how Theorem 1 may be deduced from it—apart from the assertions
concerning the purity and continuity of the limit law.
The following simple lemma will be useful.
Lemma 2.1. Let lu n l n"=1 and {v,-,} 1 be two complex sequences such that
00
Then the infinite product fr=i (l+un +vn ) converges if, and only if, the series
Ec)c) u
n=1 n converges. In this case we have
CO 00
Proof. If luni ± Ivnl > , then lun1 2 ±Ivnl > lun1 2 — luni ± > 1. By (4), this
implies that the set E of integers n for which lu n 1 ± Ivn 1 > has at most 4H
elements.
4.1 The Erclas-Wintner theorem 327
<2
m<n<M
(17in1 2 + Ivn 12) E
m<n<M
(21unr + ivno.
roilE
This shows that the convergence of the infinite product and the convergence of
the series E
un are equivalent. Moreover, it follows from this calculation that
00
H (1 ± Un ± vrt )
n=1
oo \ 1/2
E lunl (4HE lux) 2H.
nEE n=1
Suppose initially that f has a limit law. By the continuity theorem (Theo-
rem 2.4), it follows that 9.7.- has a mean value for each T . Write co (T) := M (9 ,r ) .
Then cio is the characteristic function of the limit law, and in particular cp(0 ) = 1,
and (,,o is continuous at T = O. Thus there exists some T> 0 such that Iso(-7-)1 >
for 1-7- 1 < T. We shall prove the convergence of the three series of Theorem 1
when R = 21T.
328 111.4 Distribution of additive and mean values of multiplicative functions
For 1-7- 1 < T, assertion (i) of Delange's theorem implies the convergence of
the series
(8) ,
P
P
(11)
E 1 _ cos (7- f (p)) 1 (1 7 1 11) -
P
13
Given the inequality 1 — cos 0 > 202 /712 (101 < 7), it follows that
(12)
P
lf(p)I2/T
Averaging (11) over [0, T], we further obtain that
1sin (T f (p)) 1 1,
E —{1 Tf(p)
P
P
(14)
P
If(P)IFI
This completes the proof that the conditions of Theorem 1 are necessary.
4.1 The Erelds—Wintner theorem 329
Conversely, let us assume that the three series in the statement are conver-
gent for some positive real number R.
The trivial bound 11 — eirf (P) 1 < 2 then implies, for each T> 0, the uniform
convergence of the series
1 — eiTf (P)
( IT I 11) .
P
1 — ei Tf (P)
E
if(p)i,R P
* F
P
where Fp is the atomic distribution function given by
(15)
1
E
-= 00
P
f(P)00
is necessary and sufficient for the continuity of the limit law.
330 111.4 Distribution of additive and mean values of multiplicative functions
Let us show first of all that the condition is necessary. If the series (15) con-
verges, then the sequence A of squarefree integers n such that pin = f (p) = 0
has positive density. This follows, for instance, from Theorem 1.3.11, since
the characteristic function of A is multiplicative. (We obtain incidentally that
dA = 67r -2 nf (p)00 (1 ± 14) -1 , but this explicit value is not needed here.)
Since f (n) = 0 for all n in A, the limit law is not continuous at the origin.
In order to prove the sufficiency of condition (15), we shall establish the
continuity of the limit law in the equivalent form (cf. Chapter 2, Notes)
1 ET 1
(16) lim dr = lim —2 I 1 (P(TY)1 2 ClY = 13 •
IC0(T)12
T—*Do 2T T—oo —1
Let N be a sufficiently large integer, and let al <a2 < • • • < aj denote the
distinct non-zero values of f (p), p < N. We write
1 1
E •.= E _ (1 < < SN E Ei
p<N, f(p)=a3 1<j<J P<N) f(P)#3
By Holder's inequality, we have for each integer h > 1 and any real number y,
IYI 1,
2h < oh-1
Ei cos(ajTy)) —N Ei cos 2h (ai Ty),
<j<J <j<J
from which
1 2h dy < sN
—1
E ei cos(ajTy)) 2h-i E f cos2h (aTy) dy.
1<j<J 1<j<J
As T ---> cc, the last integral tends to
1
7r/2 COS 2h w dw < —
41J
where the bound follows from a classical evaluation of Wallis integrals—cf.
Exercise I.0.3(3). For each fixed N, and sufficiently large T, we can thus assert
that the measure of the set of those y in [-1, 1] such that
E Ej cos(ai Ty) > (1 -
1<j<J
(17)
E 1 - Re g(p) < 00,
P
P
we have
00
1
(18) ) g(pv)p - v +0(1) (x --4 00 ).
n<x p<x V=
Hy (x) := I hy( rn )I + x
Tit
m <x m>x
/
< vx cx) Ihy (m)1 < Vx
v•
- H v + 0P)2
p<y
1
\
).
m=1 V
We easily deduce that gy has a mean value, viz.
00
x 1
E gy (n) = hy (m)[— ] =x
m,
E
— h y (m) + 0 (Hy (x))
m,
n<x m<x m=1
= X {M (g y ) ± OM} ,
with
00
M(g) := H (1 — g(Pv)P-v.
P<Y
332 111.4 Distribution of additive and mean values of multiplicative functions
the argument being chosen to lie in ] — 71, 7]. Setting A(x) := > < 6(p)/p, we
use Cauchy's criterion to show that the quantity
1
S(x,y,z) := — gz(n)e( z) _ E gy(n)e _iA (y )
x II
n<x n<x
H g(p „ )
1
y<p<z
From the bound lu1 ... u rn — 1 1 jm=1 lui - 11, valid for all complex numbers
ui , ,U m in the unit disc, we obtain
1 1 le ilA(z)_A(y)_e y ,z(n), _
S(x,y,z) < ,x piln (1 _ r o,v ) )
nE _
x z-J
n<x
y<p<z
where 0y , z is the additive function defined by O Thz (pi := 0(pil if y < p < z,
and Oy , z (pv) := 0 otherwise.
We obtain an upper bound for the second sum over n by using the inequality
.t
leiu — 1 1= f ei dt1 <u (u E R). After interchanging summations in the
first sum and applying the Cauchy-Schwarz inequality in the second, we can
write
1 — r(p) ± 2 1
S(x, y, z) < E p(p — 1)
y<p<z y<p<z
} 1/2
S(x, y, z) 5_ >7,
y<p<z
1 — r(p)
P
± 0 (1 ±
Y \ y<p<z P
0 (p)2 ± 1
Y
0 (p) 2
<
y<p<z P
Indeed, either 1 6 ()I > - '7T, and the inequality is satisfied since Re g (p) < 0, or
1 0 (P)I < - 71, and we have
with
1/2
77(y) := 1/y+ (E (1 _ Re g (p)) 1 p) = o(1) (y --- 00).
P>Y
This shows that Cauchy's criterion is satisfied for (19), and hence there exists
an M such that
Choosing z = x in (21) and noting that g(n) = g (n) for n < x, it follows
that
g ( n ) e -iA(x) = _
1 gy r)e - iA(Y) +
X
(
° Hy *
n<x n<x
as required.
334 111.4 Distribution of additive and mean values of multiplicative functions
l im H ( 1 — g (p L) p - V CI M( g ) .
o- -> 1+
v=0
Noting that the general term of this product has modulus < 1 and that
00
1 - p - ( cr -1 ) (a 1) log p
—p ) -
Cr
Eg(pv)(p - v <<
v =0 (p - 1) (1 - p - a )
we can write
00
p<exp{lAcr-1)} V =0
by Lemma 2.1.
This implies the convergence of the series Ep (1-Re g(p))/p. By Theorem 3,
it then follows that the product
00
converges with value M (g) . Its general term is 1 - (1 - g (p)) / p + 0 (1/ p2 ) and,
by a further application of Lemma 2.1, we then obtain the convergence of the
series E (1 - g(p))/p. Finally, since M (g) 0, no term of the infinite product
(23) vanishes, and in particular g(2 11 ) -1 must hold for some v > 1.
Let us now prove assertion (ii) of Theorem 2. If the series (a) converges,
then, by Lemma 2.1, so does the product (23). The conclusion hence follows
immediately from Theorem 3.
4.3 Haldsz' theorem 335
E1 - Re(g(p)p - ir)
(24)
If there exists no real number T such that the series (24) converges, then we
have
The first assertion of this theorem follows easily from Theorem 3. We indeed
deduce from the convergence of (24) that
00
1 g(n)
(27) ( 1 - /9-1 ) Eg(pv)p—v(l±"- ) + 0(1) (x ;'oc).
nir
n<x p<x v=0
Moreover, formula (22) applied to g(n)n -iT allows us to replace the product
on the right-hand side by
myeiAT(x) G(1)
A(x) := E 9T(pvp
p<x
with 0,- (p) := arg fg(p)p -iT1 E — 71, 4 By (20), the series -4) 2 /P must Ep 9
converge, and so, by applying the Cauchy-Schwarz inequality, we obtain
1loog xy))
A T (x) - AT (y) = log
(
336 111.4 Distribution of additive and mean values of multiplicative functions
L T (u) -, L7-(v)
00
(29) lirn
1
-
x---.00 x
E g(n) = II (1 - )
1.=
g(P v)P - v
n<x P
1 - Re(g(P)13-2 )
P
P
1
— = 00
P
I cos(7- log p) I<
Remark. Wirsing's theorem lies at least as deep as the prime number theorem,
since the function g(n) = p,(n) is within its range of application.
4.3 Haldsz' theorem 337
For the "divergent" case of Halasz' theorem, we shall give an effective es-
timate due to Montgomery (1978b). The argument is essentially the same as
that employed by Halasz in his 1971 article, partially devoted to making his
1968 results effective, but Montgomery also introduces certain refinements and
numerous simplifications.
Let us write
00
Let us first show how the second part of Halasz' theorem may be deduced
from this statement. We need the following lemma.
Lemma 4.1. Let fv(7 -)1 1 be a sequence of continuous functions such that,
for each 7- with 1-7-1 < 1, con (T) increases to co. Then the convergence is uniform.
Proof We argue by contradiction. If the conclusion fails, then inf1 y 1 <1 (pn (T)
does not tend to infinity with n, and there exist some constant A and some
subsequence with indices Inj17' 1 , ni --- oo, such that inf1<1 con, (T) < A for
j > 1. Since the con, are continuous, for each j there exists a Tj E [-1, 1]
such that co n, (Ti ) = infi T i <i con, (7). By extracting some new subsequence if
necessary, we may assume that Tj ---* To as j ---* oo. Now let n be some arbitrary
fixed integer. For sufficiently large j, we have
since co n (rj) ---> co n (T0 ). It follows that con (To) < 2A, contradicting the hypoth-
esis that con (70) ---> oo as n --> oo.
We can now complete the proof of Halasz' theorem. By Lemma 2.1, we have
( — iT g (p V g (p V — 1)pir
1-90)p
F(s)«cr) -1- =
p '18
V=
(32)
— (g(p)p —i l
< exp{ —
338 111.4 Distribution of additive and mean values of multiplicative functions
and, under the assumption that the series (24) diverges for all T, Lemma 4.1
implies that the right-hand side converges to 0 uniformly in T on each compact
subset, as a ---> 1+. We deduce that H(a) = o(1/a) as a -4 0+, from which,
by substituting in (31), we get that G(x) = o(x) as x ---> Do.
For the proof of Theorem 6, we need two auxiliary results. The first is a
general inequality, also due to Montgomery (1971), p. 158, concerning quadratic
means of almost-periodic functions. We state it in the context of Dirichlet series.
fT T
(33) L TI A(s)1 2 dr 5._ 3 I IB(s)I 2 dT.
-T
Proof Consider the function x(r) := max(0, 1 - ITI IT) with Fourier transform
00 T (sin(tT/2)) 2
"(t) = I ei7- t x(r) dr =
tT/2 ) •
00 00
am an (n i,-
L .
X(T - To)IA(s)1 2 dr = V .rnn.
mtd
t=i. (
, m ) ° 54 log r1-7 )
)
00
nl
<
m
n-)
log- = i x(r)IB(s)1 2 dr.
i( 00
h(T)
from which
T oo T
111(02 dT < 3 ] X(T)I-B(S)12
oo dr < 3] IB(s)I 2 dr.
f-T -T
CO
with
00
D(s) := E g(2')2 -1 , Fi(s) := exp E g(p)p',
v=i p>2
Proof. We have
00
J(s) = exp{—g(p)p — s}Y , g(pv)p's
p>2 v=0
A typical factor of this product has value 1 + 0(19-2 (7 ), and so J(s) is certainly
holomorphic for a- > and uniformly bounded for a > In particular, this
implies the required upper bounds for J(s) and f(s) in (35). Furthermore,
since
CXD
E g(p v )p_vs 1
<1 (p > 2, o- > 1)
p —1 — 2
2
log J(s)1 5_
p>2
For this, we first observe that, putting B y := arg g(r) with 1 9 1, 1 < 7r, we have
00
1 1 +D( 1 +a - iT 1 > 1-
) — v I COS( 9 v — TV log2) > (1 —Icos(0 1 —3- log 2))•
v =1
340 111.4 Distribution of additive and mean values of multiplicative functions
Let IA) be one of the possibly two closest integers to (01 - 7)/ log 2, so that
r ko-Of 1c0±-12-
gP)
( f 1
log Fi (s) di- = p- ir dr < p < 1.
fie° - If p>2 Pa L 0 -12
p log
P
log2 x
(37) G ( x ) < loxg x f dt + x
x 1Gt(2t)1 log x
Choose y = x/ log x, and in the last integral use (39) to evaluate K(t). We
obtain
rix-y
x±y
I K MI dt <
fx+y
E log pvl G (t 1 13/1 dt + xy
x-y pi., <t
x--1-y
< lOgpv I 1G (t 1 pv)1 dt + xy
pv <2x x-y
(x+Y)/P v
. if log if lx_yvpii 1 G MI dt + xy
pv <2x
x
= I1
IG(t)I pv log pv dt + xy
(x-y)/t<pv <(x+y)/t
<x fx I G(t)1
log if dt ± xy.
ji. t
( x-y)/t<pv<(x+ y ) / t
y log(2x/t)
By the Brun—Titchmarsh theorem (1.4.9), the inner sum is <<
t log(y/t)
whenever y > 2t. For t < yl log x = xl log2 x, this upper bound is < y/t, and
the corresponding contribution to the last integral is
x dt
IG(t)1 t2 •
<y i1
For y/ log x < t < x, we bound 1G(t)1 trivially by t. The corresponding contri-
bution is
(x+Y)/P' log pv
<< log pv Ix_ yv pi, dt < y < y log2 x.
Pli
pv <2 log2 X
x IG(t)log t 2
(40)
I t2
dt < H
( log x )
log x (x > 2).
f x dt } 1 / 2
fi x 1Kt(2t)1 dt < { fi x liCt(3t)12 dt
1 T
342 111.4 Distribution of additive and mean values of multiplicative functions
(41)
J 11((t)12 dt
°° t 3 ± 2a <
H(a)2
a
(a > 0).
Indeed the desired upper bound follows from the choice a = 2/ log x. The
relation
F' (s)
K(eu)e - e -i" du =
f IK(t)1 2 d
t3+2a t
f IK(eu)12e-2u(1-1-ce) du = 1 r
F' (1 + a + ir) 2
1±a±ir
dT
27r -00
k +Jf F' 2
(43) max 1F(s)1 2 (1 ± a + iy) dr.
cs=1-1-ce
IT-kl<i
By (35), the last term is uniformly bounded. It thus contributes 0(1) to the
integral in (43). In addition, we have
2 2 2
(1-Fa-kik-kir)
(1-1-a+ir) dr =
f_ 2
dr
<3 f 1
2
2
z (1±a+ir)
2
dr <
—7
2 dr
a2 + 72
1
<—•
4.3 Haldsz' theorem 343
Furthermore
00
1
= 1+(- D(s))v (a > 1).
1+ D(s) v= 1
As a Dirichlet series, this expression has coefficients which do not exceed in
absolute value those of
00 00
1 + E ( E 2—msy1 = 2
28 - 2 Is - 1 1
s —1 ‹
1
v=1 m=1
Given that Di (s) < 1 uniformly for a > 1, we hence deduce from Lemma 6.1
that
1
i k+ i DI (1 + a + ir) 2 2 dr
dr <
jk _ 1 + D ( 1 + a ± iT) fl2 1 1+ D(1 ± a ± ik ± ir)12
1 121-1-ce-Fir 1 2
1
<
-1 2 1-±a±iT - 2 -
It follows from the previous calculations that the integral in (43) is 0(1/a)
uniformly in k. Substituting in (43), this gives (41) and hence (40).
We can now complete the argument. Since H(a) >> 1, the second term of
the bound (37) is of the required order of magnitude. In order to evaluate the
vL
first, we use (40) in the form
y
1G(t)i
t2
dt < H
(102g y ) (Y -?- e).
It follows that
2 ix 2
I G(t)I d t < fx IG(t) I dy TY 1 G(t) I dt
IX
2 t2 e2 t2
ft ty log
clY ydt < Jo y log y j.vv t 2
< TX 2 H ( 2 ) dy 1
H(a)
da.
je2 log y ) y log y = iv log x a
This concludes the proof of Theorem 6.
Corollary 6.3. Let g be a multiplicative function of modulus < 1. For x > 2,
T> 2, set
- Re(g(p)p-ir) := e -m(x,T)/2 ± T-1/2 .
m(x,T) := min , R(x, T)
ITIT p<x P
Then we have
(44) E g(n) < x R(x,T).
n<x
Remark. It can be shown that R(x, T) = m(x,T)e - m(x ,T )+T-1 / 2 is admissible:
see Exercise 6.
344 111.4 Distribution of additive and mean values of multiplicative functions
Proof. We may suppose that T > 0. We use the prime number theorem in the
form R(t) := 7(0 - li(t) < t exp - 2 \/(log t)}. By partial summation, we can
write the left-hand side of (48) as
< V( h)
f: (h(t) - h)dt
and using the second mean value theorem, it follows that the second term in
the above sum is < V(h)/(r log w). The third is
< e -V(logw)
f T logw+27r 00
< (1 ± IT)V(h)e — V(1°g",
T log w k=0
since the sum over k is < (1 ± 17- 1). This implies (48).
Theorem 7 (Hall & Tenenbaum, 1991). Let coo be the unique solution on
10, [ of the equation sin wo +(ir -coo) cos coo = 71. Put K = cos (pc, 0.32867.
Then, uniformly for all real multiplicative functions g with -1 < g < 1 and all
x > 2, we have
Remark. Hall & Tenenbaum in fact show that the quoted constant K is optimal.
Proof Let h(0) := I cos(9) - K. We shall first show that, uniformly for 0 <
a < 1, T E R, we have
We may plainly assume T > 0. For 0 < T < a, relation (50) follows imme-
diately by summation over p from the estimate
The inequality
Re (g(p)pr) = g(p) ( cos( T log p) – K) + K g(p)
5_ Icos(T log p) – K + K g(p) = her log p) + K g(p)
yields, after dividing by p, summing over p < exp(1/a) and using (50) and
(54), that
(55) Re E g(p) < (1 – K A) log(l/a) + 0( log2 (ITI + 3)).
pGexp(1/a)
We now embark on the last phase of the proof. If F (s) denotes the Dirichlet
series appearing in (30), then for s = 1 + a+ iT we have that
(56) F (s) < exp {Re
g(p)p_8} < exp {Re E
p<exp(1/a)
from which
2 —S 2 i
aA < a e log x.
Substituting in (57), we obtain
H(a) < ce 2K—l e —KS (10g X) 2K
,
and we need to estimate the average over the first N integers of a multiplicative
function which depends on N. The uniformity required for such a purpose does
not follow from Theorems 2, 3 or 4, and the application of Theorem 6 depends
on sometimes delicate estimates for the function H(a)—cf. the Notes to §4.3
for references to the literature.
348 111.4 Distribution of additive and mean values of multiplicative functions
1
(y) := TY e—"" dt
(27) J —
Theorem 8 (Erciiis & Kac, 1939; Renyi & Turan, 1958). We have uni-
forrnly for N > 2, y E
1
(60) vN{n : w(n) < log2 N y \/(log 2 N)} =
Proof. Let FN(y) denote the left-hand side of (60), and let coN('r) be the cor-
responding characteristic function, viz.
iT
log2 N)}.
VN(7) := —1\7 exP V(log 2 N) (w(n) -
n< N
(61) 1
E eitw (n) = A(eit ) ( log M eit ± 0((logN)c 0 s t-2 )
n< N
e im)/
i(nT _ (1 ± 0(7/T)) exp { irT - 7 2 + 0 (73 /T)} + 1
( log
N)
n<N
from which
d7
sup1FN(Y) 4) (y)1 < T f (pN(T) - e ' 2 / 2
YER 171
We split the integral into three parts I , 12, 13 corresponding to the respective
ranges of integration
TV3 < 171 < T, 1/logN <171 < TV 3 and 0 < 171 < 1/log N
<f e
V/3
_ 27.2 /7r2 dT
Y T'
00 T1/3
UT 1
12 < 00 ( -I- 2 e ' 2 / 2 dT + 1 I —< -
T log N I/ lo g N T
and
1/ log N
< I 1/ log N
< —.
T
Note that the error term in formula (60) is optimal. Indeed, we deduce from
Theorem 11.6.4 for k := [log2 Al
1 (log2 N)k -1 , 1
(65) v N {n : w(n) =- k} ,
log N (k — I)! 027 log2 N)•
Now, if R(N) denotes the supremum over y E IR of the error term in (60), we
obtain, expressing (65) as a difference of two frequencies of type (60), that
Notes
where 6, co are arbitrary parameters such that 0 < 6 < 1, 0 < co < 7r. Hall
& Tenenbaum (1991) give, for each pair (6,0, the best constant K(6, co) > 0
such that
Re g(p) } . -
For more general statements and the techniques necessary to prove them, see
Elliott (1980), chapter 19, where the estimates of Halasz (1971) are described
and refined. Elliott proves the two following results; the first corresponds to
Theorem 7, the second to an effective version of Delange's theorem (Theorem 2).
Theorem 9. Let g be a completely multiplicative function, which for some
suitable constant A> 0, satisfies g(p) = 0 or A < Ig(p) 1 < 2 - A for all p. Let Op
be the argument of g(p) when g(p) 0. Suppose that there exist real numbers
0o, 6 , 1001 < 7r, 6 > 0, such that I eieP - e i0° 1 > 6 (g(p) 0). Then we have
with h(n) := f (n) - clog n (n > 1). In this case, we can choose
1
w(r)
H / ■
W I9 T )e
—irh(p)/p
1 + icy , -1-1
ih(p)i>i ih(p)i<1
§ 4.4. It is clear that the Erdos-Kac theorem can also be deduced from es-
timates of the frequencies vN {n : w(n) = k} by summing over appropriate
values of k, as suggested in Exercise 11.6.4. These estimates have been obtained
in Chapter 11.6, using the same basic tool as the one used here: the Selberg-
Delange method.
The original proof of Era's & Kac (1939) did not give an explicit error term.
LeVeque conjectured the bound 0 ((log2 N) -112 ) in 1949, and showed that
0( log3 N(log2 N) -114 ) was admissible. This result was improved by Kubilius
in 1956, and LeVeque's conjecture was finally established by Renyi Sz Turin in
1958, employing the same method as in this chapter. By using the estimates
Exercises 353
for En<x zw(n) for complex z (cf. Theorem 11.6.1), Delange (1959) obtained a
more precise result: we have
where a,-,--' 0.40516 and where g(t) denotes the fractional part of t.
Exercises
1. Show that the three types of distribution function for an additive function
can actually occur. [Hint: see Exercises 2.4-2.7.]
2. On the distribution of multiplicative functions.
Throughout this exercise, g(n) denotes a real multiplicative function.
(a) Show (using for example Theorem 1.3.11) that we have
1
vN{n : g(n) 5_ z} = — V ti(a') 1.
N L-- ■
a<N a'<Nla m<N/aa' , g(a)h(m)<z
354 111.4 Distribution of additive and mean values of multiplicative functions
Deduce that if h has a limit distribution function H, then g has a limit distri-
bution function G, given almost everywhere by
If(d)v(d)I 2 E u(n)
n<xld
1
B2(x):= E fvy(d)fvy(cr) -
X
uy (n)e(an(d -
1<d,d' <x, dOd' n<x I d' , n<x / d
Exercises 355
(e) Show that limx , A(x) =11 p<y (1 -11p). Show that B i (x) < 1. Using
the fact that fv y (d) is non-zero for at-most a finite number of integers d, show
that B 2 (x) 0 as x cc.
(f) Letting y ---> cc, prove Daboussi's theorem: Every multiplicative function
of modulus <1 has property (D).
[The proof described here, though different from Daboussi's original proof
(see Daboussi Delange, 1982), is also due to Daboussi (1989).]
4. An estimate of Montgomery 4 Vaughan (1994).
Let M (n, x) din, d<x P(d) •
(a) Using Lemma 7.1, show that uniformly for 0 <a < 1, T 0, one has
E cos(, log p)
pl±ce
< 2 log2 (3 +171) + 0(1),
-
p >exp( 1/1T 0
(67) max
n>1
IM(n, x)1 « x( log x) (1/7r)-1 (x > 2).
(d) Prove that the left-hand side of (67) is >> x/ log x. For x > 2, eval-
uate m ( nx,t ) dt /t2±i with nx := flp<x, co. log p<0 p, and deduce that the
exponent 1/7 in (67) is sharp.
[By the method of Hall 4 Tenenbaum (1991), it can actually be shown that
<< in (67) may be replaced by
5. A theorem of Haldsz (1971).
Let E be a set of prime numbers. Write E(x) := pE E l I P and consider
the arithmetic function ci (n; E) := E pv IIn v
Show, using Theorem 9, that if 6> POE, E6 < 1z1< 2 - 6, then one has, for
some constant c 1 > 0 depending on 6 only,
E
n<x
Z C1(n;E) < X exp{(1z1- 1 - ci(z1 - Rez))E(x)}.
356 111.4 Distribution of additive and mean values of multiplicative functions
(b) Show, using Theorem 10, that if lz - 11 < then one has, for absolute
constants c2 > 0 and c 3 > 0, that
1 e (z —1) E (x)
X
n<x
<< 1 z ____ 1 1 e (Rez — 1)E ( x) ± e ( lz1— 1)E (x ) { e — c 2 / 1 z — 1 1 ± (lo g x ) - c3 }.
(c) Deduce from (a) and (b), by employing Cauchy's formula in a manner
similar to that in the proof of Theorem 11.6.3, that one has uniformly for
E(x) > 2, 6E(x) < m < (2 -
I m - E(x)I )1
E
n<x,12(n;E)=m
1= x E(x)m e -E( x ) {1+ 0(
m!
1
V(E(x))
+
E(x) l i •
so that, with the notation of Corollary 6.3, m(x, T) = log2 x - p,* (x,T) ± 0(1).
(a) Show that, for any T E [0, 7], one has
7+1/ log x
= 1.0g X p,(X, 9) a ± OM.
fr-1/ log x
00
00
(c) Let f be a real additive function which satisfies the following properties
as x --> Do:
(i) D x
( )2 •
f (P) 2 /P --+ co, (ii) f (P)/P = o(D(x)) ,
p<x p<x
(iii) If (P)1/ p = 0(D (x)) , (iv) max If (P)1 = 0(D(X)).
p<x
y<p<x
For T E IR, put gx (n, T) := expfir f (n) 1 D(x)}. Show that for fixed T and all
p < x we have
Deduce the existence of 1im x , 00 x -1 En < x gx (11, T) and find its value. Show
that for all z E JR we have
1 rz
lim x -1 1{n < x: f (n) _< zD(x)}I = e-t2/2dt.
x-+00 V(27) j_ oo
IH.5
Integers free of large prime factors.
The saddle-point method
The associated Dirichlet series is the leading factor of the Euler product for
the Riemann zeta function, i.e.
CO
(3 ) ((s, y) := H (1 — p - s) 1 X(n,Y)n —s -
P<Y n=1
The quantity
log x
(5) u := (x > y > 2)
log y
plays a pivotal role in the study of the asymptotic behaviour of xlf(x, y). In the
course of this chapter we make systematic use of the notation (5).
Theorem 1. We have
(6) W(x,y) < xe - u/ 2 (x > y > 2).
Proof. We may assume that y > 11: otherwise Alf(x, y) < kli(x, 7) < (log x) 4 ,
whereas the upper bound in (6) has order a power of x. This being the case,
we have for all a > 0 that
n \ce
(7) 7 i) X(n,y).
n<x X
For the choice a := 2/(3 log y), the multiplicative function n i- n'x(n,y)
satisfies the conditions of Corollary 3.5.1. We obtain
a log p)}
E ex(n, y) < x H (1 ± Pc' <
1 ) _ x exp {0(E < x.
n<x P<Y
P P<Y
P
Substituting in (7) and noting that 1 log x > lu for y > 11, the result follows.
The above proof is clearly not optimal. A more thorough study of Rankin's
bound (4) enabled de Bruijn (1966) to obtain the following result, which pro-
vides an asymptotic formula for log ‘11(x, y) as y ---> oo. We set
log x
Z := log g(1 + y ) ± y lo (1 + l°g x )
log y log x log y y
(8)
= u f0
i
log (1 ± y ) dv.
v log x
Theorem 2 (de Bruijn). We have, uniformly for x > y > 2,
1
(9) log klf(x, y) = Z{1 ± 0 ( log y
±log2
1 2x) I .
Proof. We may plainly assume x to be sufficiently large. Denote by
coy(a) : \--, logp
(10)
L-- ■ pc. - 1
P<Y
the quantity --(i(cr, y)/((o, y) (0 < a < 1). The optimal value of the parameter
a in (4) is the unique solution a = a(x, y) to the equation
(py (a) = log x.
360 111.5 Integers free of large prime factors. The saddle-point method
The first step of the proof consists in finding an explicit approximation for
a(x, y). To this end, we appeal to the prime number theorem so as to evaluate
soy (a) uniformly for y > 2, 0 < a < 1. We obtain
y 1 - y -0-- (7) { ( 1 i.
(11) Soy(a) = yo- - 1 1 +0
1-a logy ) I
We omit the proof, somewhat technical, of this estimate, but indicate the main
steps of the reasoning in Exercise 1. We can deduce from (11) the estimate
Y Y Cloogg2 yy)},
In g X — ^ - — (1 (3) (ya l ) {1+ 0
yP I
from which we deduce ya = yi3 {1 + 0(131og 2 y)}, so that (12) also holds in this
case.
The upper bound implied in Theorem 2 will follow by choosing a = 0 in (4).
We note that
The second error term is < y 113 . The integral over a equals
2/3 log y 2/3 CI log y
I0 yo . l du + 0(13 yo. 1 du)
dv << y
Jo logy v (1 — 0) log y •
We have thus established, in all cases, the required upper bound implicit
in (9). For the corresponding lower bound, we consider the smallest integer v
such that v > u and write z := x 1 / v, k := 7(z). Then ‘11(x, y) > ‘11(x, z) and
it is clear that each k-tuple (v 1 , , vk) of integers > 0 such that E ik <V
k v
provides at least one integer counted in z), namely fl i=i pi a (where p i
denotes the ith prime number). It follows that
7)
y) > (k ±v)
v )'
from which, applying Stirling's formula,
log kli(x,y) > (k v) log(k + v) — k log k — v log v +0(1 +logmin(k, V)) .
The main term of this lower bound can be rewritten as fo log(1 + v/t) dt.
We hence see that the error implied in replacing k by z/ log z is
z , log x
log (1
+ log (1 +
(log z) 2 k (log z) 2 z)•
We can thus write
(17) log klf(x,y) W(z){1 +0( )}
1og1 z
with
log x log ( 1 z log lox )
W(z) Z log (1 +
log z log x log z z
It may be easily checked that
W(t) 1 log a ± log x\
i
W(t) t log t ± log t t )
This implies, after a small computation, that we have, for Vy < t < y2 ,
log x
(y 5_ (log x) 4 )
1 tlogt
/47'(t) < log x log2 x
((log x) 4 <y < x) .
t(log t) 2
Bearing in mind that (log y)/ log z = 1 + 0(1/u), it follows in a few lines that
(18)
z k -r-r
k
< Nk( z ) <
cz k!
ai)
\k k
-1--r 1
II—•
j
k
i=1
For the choice k := 71- (y), z := log x, ai; := logpi (where pi denotes the ith
prime number), we have Nk(Z) = 41(x, y) and E ik ai = 0(y) < y. This hence
immediately yields the following result.
Corollary 3.1 (Ennola, 1969). Uniformly for 2 < y < \/(log x log2 x), we
have that
w(x,y) = 7(10 Tr (logx)r 2 \
(19) 1 + 0(
H logp) 1 log xy logy )
7;1 <y
from which
k-i k-1
1 -Fr 1
(z vak) k-1 <Nk(z)< -Fr 1 (w _ vao k-1 1
11
(k-1)! 2=1
. ai - (k-1)! ill
,lai 0<v<Z ak
0<v<z/ak
z ± \----, ik-1 _.
with the notation w := L-d =1 u'i .
Let n := [z/ad. The Euler-Maclaurin formula of order 0 (cf. Theorem 1.0.4)
implies that
with B 1 (t) := {t} - . In absolute value, the last integral is bounded above by
Therefore
1 {z k (z - nak )k Zk
_ ± (Z rtak)k-1
ak k kak'
0<v<n
since (z - nak )lka k <11k < 1. This gives the desired lower bound.
In order to obtain the upper bound, we again apply the Euler-Maclaurin
formula, but with z replaced by w. We obtain
TI
>7/
0<v<n
(w - vak) k-1 < (w - tak) k-1 dt +wk-1
which is (20).
In practice, this equation is often used to investigate the difference
klf(x, z) — T(x,y).
Corollary 4.1 (Buchstab's identity). For x > 1, z > y > 0, we have
(21) klf(x,y) = kli(x,z)— 41(x1p,p).
y<p<z
x(1 — log 2) —
x
— (1 — log ( 1°g(x/P) )) ,-- x p(u),
P log p
y<73-\/x
with u dv
p(u) := 1 — log u + f log(v — 1)
V
(2 < u < 3).
2
It is clear that this procedure yields, for any 6> 0, the asymptotic formula
(22) 41(x, y) , x p(u) (x <y
where p is defined by the initial condition p(u) = 1 (0 < u < 1) and the smooth
version of equation (21), namely
u
(23) p(u) = p(k) — f p(v — 1) dv (k <U < k+ 1).
k V
366 111.5 Integers free of large prime factors. The saddle-point method
This function p was discovered by Dickman in 1930, and today bears his
name. It is continuous at u = 1, differentiable for u > 1, and satisfies the
difference-differential equation
Proof. Part (i) follows immediately from (24) and the initial conditions for p.
Indeed the two sides of (i) have the same derivative for u > 1 and the same
value at u = 1.
Let us prove (ii). Let T := illf{71 : p(u) = 0}. If 'T is finite, then T > 1 since
p is continuous and satisfies p(u) = 1 for 0 < u < 1. By (i), we can then write
,
0 = r p(r) = f p(v)dv.
T-1
The continuity of p then implies, by the very definition of r, that the right-hand
side is strictly positive. This shows that T is not finite and (ii) follows.
Part (iii) follows immediately from (ii) and the functional equation (24).
We obtain (iv) by induction on k := [u]. The property holds for k = 0, since
p(u) = 1 (0 < u < 1). If k > 1, we deduce from (i), (ii), (iii) and the inductive
assumption that
i ru 1
P(u) = I p(v) dv < P(u - 1) < 1
u u-i u ur(u) r(u ± 1) •
We shall see (Corollary 9.3) that the asymptotic relation (22) remains true
in a very large (x, y)-domain. A first attempt at an inductive use of Buchstab's
identity is the object of the following theorem, which represents partial success
in this direction.
5.3 Functional equations 367
is finite and uniformly bounded. Let us hence assume that Ak (y) < cc, and
consider X, Y such that Y > y y2 < x- < yk+1 (k > z). Applying (21) for
(X, Y) with z = VX, and writing U := (log X)/ log Y, we obtain
±
X { 2
A(XV
,X)
E (X / p, p) log Y
log Y U p log p
Y<p< N/X
with
L(v) := -
E _ =log(v/u)+0(e-v(0gY)).
<p<X 1 /v
Property (iii) of Theorem 5 shows that p is non-decreasing. By partial sum-
mation, we hence immediately obtain that
log(X/p)
1< < U - 1 < k.
logp
For these values of p, we can thus bound A(X/p,p) from above by Lk(y) - 1.
Substituting in (26) it then follows that
f2 y<p‹,/x log Y
1+ 1A(X,Y)1 <1+ C o eA V (1°g + (Ak(Y) )t +
- 1
plogp
by the prime number theorem. Taking the supremum over X, Y such that
Y > y, k < U < k ± 1 and iterating, it follows that
y) )2log2 y < 1,
AP log 2 M(0 < (1 ± C1 e-i(bog
x(f p u - v)d(V1/Yv) (x Z)
(27) A(x,y) =
A(x+, y) (x E Z± ).
and
lz A (X dt
(29) A(x,y) = A(x,z) - (y < z).
i t ' t ) log t
5.3 Functional equations 369
Thus A(x, y) satisfies the same initial condition as klf(x, y) and obeys a
functional equation which is the smooth version of Buchstab's identity.
Employing a method analogous to that of Theorem 6, de Bruijn obtained
the formula
We shall see in Section 5 that the main term in (30) has order x p(u) when
y> (log x) 2 . From the upper bound (iv) of Theorem 5, we hence see that (30)
can only yield an asymptotic formula for 41(x, y) in the domain
Equation (33) has two advantages over (21). On the one hand, for fixed y,
it takes the form of an equation in one variable and is hence easier to iterate.
On the other hand, by expressing 41(x, y) as an average of itself, it also cre-
ates, by iteration, a powerful regulating effect. This latter feature was already
present, but dampened, in the case of Buchstab's identity. The cause of this
phenomenon can be traced back to the very source of the method, which con-
sists in calculating T(x, y) starting with klf(x, z) and deleting positive terms. As
one might foresee, iterating Buchstab's identity results in alternating signs
and hence potential cancellations which have to be regarded as unexploited
gains in the error terms.
Hildebrand established the validity of the formula
(log+
(u 1) 1
(34) kli(x,y) = x p(u) {1 ± 0
logy i I
S:= E log n
n<x, P+(n)<y
S = E A(d) = E A(d)111(x1d, y) .
n<x, P+ (n)<y din d<x, P+(d)<y
with the initial condition p(u) = 1 (0 < u < 1). It is convenient to extend p(u)
to all of IR in such a way that (36) is satisfied everywhere. We hence let
Thus defined, p(u) is right-continuous and has a single singularity, of the first
kind, at u = 0. It is immediate that the derivative of order k, p( k )(u), is de-
fined on RN{O, 1, ... ,k} and also has discontinuities of the first kind at the
exceptional points u = j (0 < j < k). We extend it to IR by right-continuity.
Our method for studying the Dickman function rests on the computation
of the Laplace transform
(40) .9-16(s) =
with
00 e -s-t
(41) dt.
j(s) := /0 s±t
The integral J(s) is evidently continuable as a holomorphic function on
CN] - 00, 0]. The following lemma, of crucial use here, gives supplementary
information about this continuation. We set
s et -1
(42) /(s) := i dt (s E C).
Jo t
Lemma 7.1. For s e CN] - Do, 0], we have
(43) I( —s) + As) ± -y ± log s = 0
where ry denotes Euler's constant.
Proof. The classical formula -y = - F1 (1) immediately yields, after an integration
by parts,
- 1 et 1 ' t dt
10 dt ± f e .
Without altering the value of this expression, we can replace the real path
[-1, -oc[ in the second integral by the polygonal line made up of the segment
[-1, -8] and the half-line {-s - t :t > 0}. It follows that
i et _ 1 v s et _ 1 : dt DO
e
-s-t
7= I dt ± L i t dt ± ± dt.
—
t f t 0 s±t
Rearranging the first two integrals, we obtain (43).
This result enables us to complete the calculation of ;3(s).
372 111.5 Integers free of large prime factors. The saddle-point method
(44) R s) _
Proof. By (40) and (43), we can write, when s is not real and negative,
from which
Rs) = C 0'.
Now, on the one hand, it follows from (39) that
lim 816(s) = C.
.9 -+00
converges for all u 0 cf. for example Widder (1946), theorem 11.7.3 or 11.7.5.
In accordance with the principles of the saddle-point method see de Bruijn
(1970), chapter 5 it can be expected that, by choosing a to be a zero of the
derivative of the integrand, the integral (45) as a whole will be dominated by
the contribution from a small neighbourhood of the real point a. This will
then allow the determination of an asymptotic formula for p(u) by utilising the
Taylor expansion of 16(s)eu 8 in the neighbourhood of s = a.
The explicit formula (44) immediately yields the value of a. It is necessary
to choose a = -e(u) where e = e(u) is the unique real non-zero root of the
equation
Proof. We trivially have 1 < e(u) < log u. This simple first estimate gives the
result by iteration of (46). We thus have
and
Proof. Note, first of all, that (49) follows immediately from (44) and (43) in the
form
(50) s(s) =
Indeed it suffices to apply the trivial bound J(s) < e - '17- 1 -1 with a = -e(u).
In order to establish (48), we introduce the quantity
1 (1 - cos(hr))
H(r) := /(e) - Re/(-s) = f e" dh.
o h
22 fi
71 2 0
-
When 1-7- 1 > 7r, we note that we can assume that u is sufficiently large since the
conclusion is otherwise trivial. Taking (47) into account, we may hence suppose
that
\ U71 2 2u u
H( T) ? 2(7 2 + e 2 ) 7 2 ± e2 •
72 + e2 >
equ) 1
(51) p(u) = exp {-y - ue + /(e)}{1 + 0(- )}.
27 u
Remarks. On differentiating (46) with respect to u, we immediately obtain
e/ (u) = e/ (1 + u(e - 1)). In particular, we have ei(u) -, 1/u as u -> cx). It is
also useful to note that the main term in (51) can be transformed by means of
the identity
1
K(u) := -i--7- i 66 ;--(S)eus dr
0
where s = - - (u) + iT . As a first step, we shall show that the difference
1
(53) p(u) - K(u) = I ii(s)eus ds
27ri ITI>6
To this end, we use Lemma 8.2. The contribution to the integral (53) from
the range S < 1 7 1 < 7 is
00 - ,g+/(0 foo
< e—g±/(e) i e —T2 u/27r2 c h_ < e -t dt
e
6 VU kg(u+1) U3/ 2
where we have appealed to the second mean value formula in order to handle
the term involving 1/-7- . Since I() -, u, this bound is also of the required order
of magnitude.
It remains to evaluate K(u). For this, we consider the Taylor expansion of
13(s) in the neighbourhood of T = 0. Noting that we have for Re s = -e, k > 1,
fo l
l i(k)(s)1 = hk—l e hs dh r(e) = u ,
it follows that
<
That of the main term is evaluated by extending the integral to infinity, and
bounding above the contribution from the range IT' > 6. Noting that
(55) r(e) =- 1/0u) = u - (u - 1)/e,
we obtain
f6
e 2 dT = 27r T1
/"(e)
Substituting this estimate in (54) and taking account of (55), we finally
obtain the stated formula (51).
Corollary 8.3. For any integer k > 0 and any real number u0 > 1, we have
(56) p ) (u) = (-1 ) k e (U) k P (U) { 1+ 0(1/11)} (u > uo).
Proof. Differentiating the functional equation (36) k times, we obtain
(57) up(k +1) (u) = - p (k) (U — 1) — kp (k) (u) (u > 1).
By induction on k, this immediately implies that
(58) (-1) k p(k) (u) >0 (u > 1).
This gives (56) when u is bounded.
Relation (55) shows that
(59) e'(u) ,,, 1/u, e"(u) r- -1/u 2 (u --- Do).
We deduce from this that, for sufficiently large u,
Proof. When u — 1 <v < u, we have p(u — v) = 1. In this case the bound (62)
follows from (51), since /(e) u. When 0 < v < u — 1, we deduce from (51)
and (52) that
p(u — v) < v
exp o e(u — t)dt}.
p(u) u—v f
By (59), we have e(u — t) < e(u) — ct/u for some suitable absolute positive
constant c. The required result then follows from the elementary inequality
2
CV U
(63) —> log( + 0(1) (0 < v < u — 1).
2u (u — v
1 a-Fic° ds
(64) (x, y) = 2 _, ((s , y)x s (x Z ±)
R cx—icc
We shall see that the saddle-point method, employed in the previous section
to evaluate Dickman's function, also works for the integral (64).
The optimal choice for a is the (unique) solution a(x , y) to the transcen-
dental equation
log p
(65) = log x.
pa — 1
PY
As we have seen in the course of the proof of Theorem 2, a(x, y) can equally well
be regarded as the optimal value of the parameter in Rankin's method and
formula (12) provides an asymptotic formula for a(x , y) as y co. However,
the implicit nature of a(x , y) leads one to expect a certain lack of flexibility in
the asymptotic formula resulting from such a treatment. The main aim of this
section is to establish that, without loss of precision, a(x , y) can be replaced by
an explicit approximation in a suitable (x, y)-range. The result is an extension
of both the formulae (30) and (34) due to de Bruijn and Hildebrand respectively.
The explicit approximation for a(x , y) is suggested in a natural way by the
following lemma. We recall the notation
log x
u := L E (y) := exp (log y) (315)—E ,
log y
which we use throughout this section.
378 111.5 Integers free of large prime factors. The saddle-point method
Lemma 9.1. Let 6 > 0. There exists some yo = yo(E) such that, under the
conditions
we have uniformly
(67) ((s,y) = ((s)(s -1) log y 16((s - 1) log 0{1 + 0( LE1(y) )}.
Proof. Observe first of all that, under the above hypotheses, we have
A(n)
(68) + 0(y 1 ').
ns
ri<y
For a < 1, the effective Perron formula (Corollary 11.2.2.1) enables us to write
the main term of (68) in the form
1 fft± iT (/ Yw 1—cr 2
dw ± 0( Y 1° g y)
-
2i L T ((s w)
(69)
log T
- 71 = 1 >1-
logy
that is, for a suitable choice of y o (6), the translated segment remains within
Vinogradov's zero-free region for ((s) cf. Chapter 11.3, Notes. Thus the seg-
ment of integration has crossed exactly two poles of the integrand, w = 0 and
w = 1 - s. By the residue theorem, the integral of (69) has the value
i-s 1 (1
-
where 14) denotes the polygonal line joining the points n ±iT, passing through
-77 ± iT, and traced out clockwise. When w E W, we have—cf. Chapter 11.3,
Notes—that
- (s + w) < log T < log y.
(
Hence the contribution from the vertical segment of W to the integral is
T log y
dt < y l 'LE (y) -2 .
When a> 1, an integration by parts shows, in view of (68), that this relation
continues to be valid up to multiplying the error term by a: we omit the de-
tails, which are standard. By integrating (71) thus modified on the half-line
{s + t : t > 0}, it follows that
c(S , y) (( s ) e Thi((.91) log y) li ± 0 (L E (y) —1 ) 1
with the notation (41) for J(s). The stated result then follows from (50).
The fact that the inverse Laplace integral for p(s) can be evaluated on the
line a = -e(u) and the occurrence of this same function in the approximation
of ((s, y) are two strong heuristic reasons for choosing the abscissa a in the
Perron integral (64) in such a way that (a - 1) logy = ----(u), that is a = a l:) ,
with
(u)
(72) ao := 1 -
log y
It is actually easy to verify, using estimates for
log p
4 oy (a) :=
P<Y
p°- — 1
from the prime number theorem cf. Exercise 1 , that a o is an excellent ap-
proximation of a(x , y) in a large (x, y)-range. We have
1
(73) a(x , y) = ao + 0 1 )
( LE (y) ± log x log y
uniformly for x > xo (E), (log x) 1 ±E < y < x.
380 111.5 Integers free of large prime factors. The saddle-point method
Under this assumption, Lemma 9.1 hence suggests that the quantity
ds
(74) 1 f a°±' (( s )( s _ 1) log y 16((s - 1) log y)xs
27ri L o _ ice s
X f -(u)d-i°° s us ds.
(75) ;6(s)((1 + S )
e
S
00
(76) 41 + S ) e - st d( [Y1
yt ).
s + log y log y = J 00
The convolution theorem then shows that the factor of x in (75) is the inverse
Laplace integral for
Prior to embarking on the proof, let us show how this theorem implies the
asymptotic formula (34) of Hildebrand.
5.5 Approximations to T (x, y) by the saddle-point method 381
we have
(log(u + 1))i .
(79) A(x, y) = xp(u){1 + 0
log y ) f
(81)1
u i v. ( e y(u))v
pi (u — v)y'
dv<
log(u + 1)p(u) dv
o
p(u) log(u + 1)
<
logy '
and yields the desired conclusion.
Corollary 9.3 (Hildebrand). Let E > 0. Uniformly under the condition
we have
(82)
log y )
Proof. When x > xo (E), this follows immediately from (77) and (79). Since
p(u) > 0 for u > 1, the conclusion is trivial for 2 < x < xo(E).
Of course, the information contained in (77) is much more precise than that
in formula (82). In particular, an asymptotic expansion for klf(x,y) in terms of
powers of log(u + 1)/ log y can be deduced from it cf. the Notes.
The first step in the proof of Theorem 9 consists in truncating the Perron
integral in such a way that Lemma 9.1 can be used for the integrand. We state
the result below.
382 111.5 Integers free of large prime factors. The saddle-point method
Lemma 9.4. Let 6 > 0. Set T := L E/2 (y). For x, y in (H,), we have
1oo+iT2
(83) 11/(x, y) = 27rz. ((s y) - ds + 0 CP(u)
fao- iT2
. 7 L E (Y)
Proof. We appeal to Theorem 11.2.2. Let R denote the error term in (83); then
we have
n —a° x"((ao 7 Y)
RT 7
E
P+(n)<y
1+ T2 1 log(x/n)1
with RT := xlf(x + xIT,y)— ‘11(x — xIT,y). When u is not too large, we use the
trivial bound
RT < x IT .
Otherwise we proceed as in Exercise 11.2.3, by introducing the weight function
We then have
fao-FiT
RT < E (_
x)
a0
w( logn =
1
27ri 00-iT
((s, y)x s 1-v(r) ds.
P+(n)<y
(84) R<
x"((cto7Y) • {—
+ mm
x
x" max I ((cto y)
T T<ITI<T
e(u)
(85) x"((a o , y) x e-u“u)((1 )(—e(u))/5(—e(u)) < x log yVu p(u).
log y
The first term on the right-hand side of (84) is hence acceptable. The same
holds for the second if u < 2 log L E (y). Indeed we then have
1 u -2u p(u)
T < ,VT L(y)•
5.5 Approximations to 111(x, y) by the saddle-point method 383
When u > 2 log L E (y), we can use Lemma 9.1 (with E instead of 6) and
Lemma 8.2. We have for N/T < 171 < T, s = co iT,
((s, < (s)(s — 1) log y16((s — 1) log y) < ((s) << log T,
since Irllog y > > 1 ± ue(u). By Theorem 8, we deduce from the above
that the second term on the right-hand side of (84) is
where we have used the fact that 1() u as u co. This completes the proof
of Lemma 9.4.
We are now in a position to complete the proof of Theorem 9. The first step
consists in replacing ((s, y) by its smooth approximation in the integral of (83).
Applying Lemma 9.1 with E instead of E, we see that the error introduced by
this process is
T2 dr
x`"((cto, Y) fo
la° + iTi
with T = L,/ 2 (y). From (85), we get that this bound is of the required order
of magnitude.
By carrying out the change of variables (8— 1) logy 8, we can rewrite the
new main term in the form
—(u)H-i,T 2 log y
X 8 8
P := i(s)((1 + et" ds
27ri f“u) _ iT2 log y log y s ± log y
—(u)-1-i,T 2 log y
X
A y (S) ds
271i k u)_iT2 log y
holds uniformly for u > 0, y > 2. This implies that the inverse Laplace integral
1 f -( u )-F ic)° -
(88) Ay (U) = Ay (s)e" ds
27ri j_ (u) _ i0,9
converges whenever (u) < log y, yu Z+, and therefore certainly in the
domain (He ) excluding pairs (x, y) such that yu E Z+. When x = yu E Z+, the
integral (88) converges in principal value to
1
(Ay (U) ± Ay OH) = Ay (u) + -2—
x.
and it follows from the usual estimates for the zeta function that
1
(91) (1
( + ) <1.91-1/2.
s+logy logy
In order to estimate the last integral, we use the approximation of ((s) by the
partial sum of the series in the form of Corollary 11.3.5.1, viz.
It follows that
00
ds
((s)xs— =
,
E
n=1 TI?max(n72)
x s ds
(+—+0, (xce°).
-n s T
Loo
ITI>T 2
The last term is clearly acceptable. Formula (11.2.7) allows us to estimate the
general term of the sum over n by
7 x \ c4) 1
< — n ) 1+ (n ± T 2 )1 10 g(x/n)l •
xc'c' xp(u)
3/2 + T2 + T •
This shows that the error term in (89) is of the required order, and thus com-
pletes the proof of Theorem 9.
When condition (He ) is not fulfilled, the saddle-point method is still effi-
cient, provided that one chooses the theoretical abscissa of integration a =
a(x, y) defined by (65). This route was followed by Hildebrand and the author
(1986), who obtain the following result, valid for x > y > 2. We write
( logx
(93) 44(0 ) = 1+ )logxlogy{1 +0( 1 ± 1 )}.
Y log(u + 1) logy
log(u ± 1) u 1
(94) T(x,y) = xp(u)exp {0( ±
log y L e (y) ± —)}.
u
386 111.5 Integers free of large prime factors. The saddle-point method
We shall not prove this result here, and restrict ourselves to some comments.
First of all observe that estimate (12) for a and formula (93) imply that
(95) logy < a \/(27(p'y (a)) ,--, log (1 ± Y ) 2 u(1 ± 1°gx ) < .\/ Y .
log x y logy
This allows us to compare Rankin's upper bound with the true order of mag-
nitude of ‘11(x, y). Although it never gives the exact order, Rankin's method is
remarkably efficient: when, for instance, y = log x, every estimate available in
the literature imply an error factor >> exp { y1+0(l) }.
Next, we note that (94) is almost equivalent to Corollary 9.3: it needs only
a slight strengthening of Theorem 6 to obtain the result.
Finally, we draw the reader's attention to the interest of a formula like
(92), depending on an implicit parameter such as a(x , y). As we have already
seen, inserting an explicit approximation for a yields a genuine asymptotic
formula. Moreover, there exists an application of another type, resting on the
fact that small variations in a(x, y) are relatively easy to study. This allows an
investigation of the local behaviour of klf(x, y), even in regions where the global
behaviour is not completely understood. The following result, characteristic
of the method, can be obtained by an immediate extension of Theorem 3 of
Hildebrand & Tenenbaum (1986).
Theorem 11. Uniformly for x > y > 2, c > 1 and with t := (log c)/ log y, we
have
log y)) 1
(96) klf(cx,y) = ‘11(x, y)e (x'Y ) {1 ± 0((t 2 + 1)(—i ±
provided that y < (log x)' - ' . Hence in this region there does not exist a con-
tinuous function asymptotic to kli(x, y). Hildebrand (1986f) showed that a con-
tinuous approximation cannot be "too precise" in the range y < (log x) 2 '.
The following theorem gives a uniform one-sided estimate for the local be-
haviour of ilf(x, y). It is an simple consequence of (92).
Notes 387
± logy)} .
(98) xlf(cx,y) < c'W(x,y){1 ±0(--
ul
Notes
§ 5.2. Ennola (1969) also gives a more complicated asymptotic formula when
y < (log x) 3R.
Proof. We may assume that u < y 2 . Otherwise Theorem 8 easily shows that
the right-hand side of (99) does not exceed 1.
When u < y 2 , we can suppose that y is sufficiently large, by modifying C3
if need be. Then let us fix y > y o and set
where
A(d) ( log d
(0 < 9 < 1).
>-: d logy )
d<y°
e ulog(u ± 1) -0 .
(101) So = log y f p(u — v) dv + 0 (p(u){1 +
o L E (Y) f )
with
1
1 log(u ± 1)
r(u) := 1 2 p(u v) dv,
up(u) fo u log y ± L(y)
.
Since p is decreasing, we have that r(u) < . Similarly, since 6 is also decreasing
we have
(-1 )3( )
p(u — v) = 3 (u)v3
( 1) 3 + 1
(103) rmi (V ± 771 — U)i
j=o j
u—v<rn<j !
m<u
( - 1) k+1 iv
— w) k p (k+1) (u ____ dw .
k!
Exercises 391
p () (u) p(k+1)(u)
(104) A(x, y) = +0 x
(log y)j (log )k+ 1-
j=0
uniformly under the conditions
u— j
) > 1og2 y
(105) x > 2, (log x) 1 +' <y < x, min
o<j<k,2 <uk +1— j — logy'
where the aj are the Taylor coefficients at s = 0 for s((s 1)/(s ± 1). The last
of the conditions (105) is indeed necessary cf. Saias (1989).
Although Theorem 9 was obtained in a thesis supervised by the author, the
proof given here is somewhat simpler than the original one.
Hildebrand (1984a) showed that extending the validity range of the formula
(x, y) = x p(u) {1 + 0 ( log (u + 1)/ log y)
to y > (log x) 2 +6 is equivalent to the Riemann hypothesis.
The method of proof of Lemma 9.4 provides an estimate for the number of
integers without large prime factor in short intervals. See Lemma 6.7.3, and
Theorem 4 of Hildebrand Sz Tenenbaum (1986).
A more complete account on the saddle-point method and its arithmetic
applications may be found in the author's survey (1988).
Exercises
(b) Let 6> 0. Show that, uniformly for a > 8, one has
TY dt 1 +0(1/LE(Y)) r
1 2 t -1 1 — y —cr I
dt+0(1).
tu
Show that the conclusion continues to hold for a > 3 log 2 2y/ log y, replacing
L E (y) by log y if necessary.
[One can use the relation (ta - 1) -1 = t - cr + 0(t-2 °(1 - 2 -0 ) -l ) (t > 2).]
(c) Suppose now that 0 < a < min (2/3, (3 log 2 2y)/ log y). Establish the
relations
Y dt {1+0(y-1/6)} f Y dt Y dt Y dt
= {1+0(logy)}f
y 1/6 vy t cr .
N/v t°- ' t(1- - 1
Show that
TY ( 1 1 dt
J y \1 — t —cr 1— y —cr ) t (y cr — 1) log y •
log p 1+ 0(E) IY dt
1 - y - cr °(1)
P<Y
with E <6 1/L(y) if a > 6, and E < 1/ log y for all a > 0. Show that
(1- y') log y <<y 1 - 1 for 0 < a < 1 and deduce formula (11) of § 5.1.
2. An arithmetical proof of the formula (0) = e-Y .
(a) Deduce from Theorem 1 that, for x > y > 2, one has
1
log y.
n>x, P+ (n)<y
(b) Deduce from Theorem 6 that, for x > y > 2, one has
1
- = log y p(v) dv + 0(u).
n
n<x, P± (n)<y
(c) Using Mertens' formula, show that we have for u > 1, y > 2,
e—u/2)
p(v) dv = + 0(
log y
4. Let 6, 0 < 6 < 1, be some fixed real number. Set g(n) := Epin (logp) 6
(a) Show that E n<x g6(n) (x/6)(logx) 6 (x oo).
(b) Let N(x; A) := < x : go (n) < (A/6)(logx) 8 }1 (A> 1). Show that
N(x, A) > {(A -1)/A+o(1)}x (x oo).
(c) For n> 1, write a (log P+ (n)) / log n. Show that a n has a distri-
bution function, and determine it.
(d) Show that, for all n> 1, one has go (n) > (an ) s-1 (log n) 6-1 gi (n). Using
Exercise 3.3 and the previous question, show that, for each A > 0, there exists
a quantity c(A, 6) > 0 such that N(x, A) < {1 - c(A,6) + o(1)}x (x co).
(e) Show that g6 fails to have a non-decreasing normal order.
394 111.5 Integers free of large prime factors. The saddle-point method
For which values of z is this bound more precise than that of Theorem 3 ?
6. Integers for which the product of the small prime factors is large.
(a) Using the large sieve (Corollary 1.4.6.1) show that, for x > y > 2, one
has
4)(x,y):= I In x: P— (n) > y} I << loxg y .
Deduce that, uniformly for x> z > y > 2, v := (log z)/ log y, one has
(c) Using Theorems 2 and 10, show that, for each E> 0 and uniformly for
x > y > 2, one has
(x, y) < xu u ± E.
Deduce that, for any E > 0, and uniformly for x > z > y > 2 with v :=
(log z)I log y, one has
e(x, y, z) <, xv — v ± xz —l +E .
7. Squarefree integers without large prime factors. Using Theorems 9 and 11,
as well as the estimate (12), show that, uniformly for x > y > 2, one has
E
n<x, P+(n)<y
il(n) 2 = { 1 1((/3 1) ± °(1 )} 41 (x , Y)
with 01 := max(1, 20(x, y)) where [3(x, y) is defined by (13). [For a more precise
result, see Ivie & Tenenbaum (1986), Nai'mi (1988).]
111.6
Integers free of small prime factors
§ 6.1 Introduction
Here we undertake a study dual to that of the previous chapter, namely the
evaluation of the quantity
As is the case with 41(x, y), this function occurs constantly in analytic and
probabilistic number theory. In particular, it is of fundamental use in sieve
problems.
We keep to the systematic notation
log x
U :=
log y
introduced in Chapter 5. We have seen (Theorem 1.4.2) that Brun's pure sieve
provides the formula
x
(1)
4)(x,y )= (2< y < x 1/10 log 2 x)
while the fundamental lemma of the combinatorial sieve (Theorem 1.4.3) implies
the following estimate, valid uniformly for x > y > 2,
x
(2) D(x , Y) =
( (1 , y) {1 ± 0 (u -u/ 2 )} ± 0(k1f(x,y)).
The appearance of klf(x, y) as error term in the evaluation of (I) (x, y) is not
surprising. The characteristic function ri(n, y) of the set of integers n such that
P- (n) > y is indeed multiplicative, and, by the Mobius inversion formula, it
can be written as
The main term of (2) corresponds to the quantity obtained from (4) by ignoring
the square brackets and extending the summation to infinity. It is clear that
the error implied in this process involves kli(x, y).
In order to obtain, by this approach, our first result, we will use the following
technical estimate concerning the saddle-point a(x, y), defined in (5.65) as the
unique solution of the equation
v, log p
(5) / 1
= log x.
P<Y 1-
y' - 11
—
(1 - a) log y -----( u *
(7)
Proof Formula (5.11) enables us to evaluate the left-hand side of (5). We obtain
y i-a 1 (
u(1 - y') . 1 +0 C ogl y ))•
(8)
(1 - a) logy
implies that
1 - y')----. min (1, y/ log x)
As in Chapter 5, let (u) denote the unique real non-zero root of the equation
ee = 1 ± '//, for u > 0, u 1, and define - (1) = 0. The estimates (5.47) and
(5.59) for e(u) and e l (u) allow us to deduce immediately from Lemma 1.1 that
log (7log(Tt + 1)) ± 0(1)
(10) a=1 (x > y > 2).
log y
Theorem 1. Uniformly for x > y > 2, we have that
x
(11) (13.(x,y) 0(F(x,y)).
=';(1,y)+
Remark. This estimate is only non-trivial for
y < xe log 3 x/ log 2 x 1
where c = c(x) ---> 1. Outside such a range, formula (11) is weaker than the
sieve upper bound (cf. for example Corollary 1.4.6.1)
X
(x > y > 2).
log y
Proof. As we just observed, we can assume that u is sufficiently large, so that
(9) and (10) imply
(12) (1 - a) log y > co > O.
By (4), the error term in (11) is
with
M(x, y) :=
d<x, P+ (d)<y
Bounding trivially 1M(t, y)1 by klf(t, y), we obtain, by Theorem 5.12, a result
slightly inferior to the required estimate. We have indeed
00 00
x f xlf(t, y)—
dt < x i-a,p( x, y )Itcr-2 dt < T(x y)
x t2 x 1-a
However, a result of the author (1990), also proved by the saddle-point method,
states that
exp{-ciu/ log 2 (u + 1)}
(13) M(x, y) < kli(x,y)( ± exp { - (l og y )( 3/ 2)_})
logy
uniformly for x > y > 2. Thus we can certainly divide the above upper bound
for the t-integral by logy, and the required result then follows from (12).
398 111.6 Integers free of small prime factors
(14) 4)(x, y) = 1 ±
y<p<x v>1
Remark. As in Chapter 5, this equation is normally used in the Buchstab form
which is (14).
By using the trivial upper bound x I p' of .1)(x I pv , p) for v > 2, we deduce
from (15) the approximate equation
If we had replaced the inequality P- (n) > y in the definition of (1)(x, y) by the
weaker P- (n) > y, then the approximate functional equation (16) would, in
fact, hold without an error term. Here, however, we continue to adhere to the
definition as previously stated: this is more convenient when decompositions of
integers into products with terms defined according to the size of their prime
factors are involved.
As in Section 5.3, the functional equation may be used in order to estimate
the sieve function. One argues by induction on [u] starting from the trivial case
u < 1. A consideration of the first two steps indicates the general form of the
approximation.
When Vx < y < x, the inner sum of (14) is identically equal to 1 (since
P- (1) = cc !) and we have (x, y) = 7(x) — 7r(y) +1. Substituting in (16) with
z = Vx we deduce that, for x 1 /3 <y
1
(17) H(v) :, E _ , log(v/2) + 0 (e -voog y))
x l/v <p<Vx P
where the estimate follows, in a standard way, from the prime number theorem.
The sum over p in the last evaluation of 43.(x, y) has value
Xfx v ri x 1 + log(u - 1) 1 + 0 ( x \
log x j2_
2 v - 1 "11-(v) logy{ u (log x) 2 ) '
xco(u) - y
(18) (x, y) = + x2 ) (x 1/3 < y < x).
logy 0( log y /
When y < x/ log x, the second main term -y/ log y is absorbed by the error
term. If x 1 / 4 < y < x 113 , we can use (18) to evaluate 41)(x1p,p) in (16). For as
long as (18) remains satisfied, we can iterate this procedure; the function w(u)
is then defined inductively on intervals of length 1 by the relation arising from
(16) and (18)
xw(u) x x W ilogx 1\ , x u
, + {1 + i w(v - 1) dv},
log y log x p log p log p ) log x 2
y<p<Vx
that is
u-i
(19) uw(u) = 1 + i w(v) dv (u > 2).
1
The iterative method for the evaluation of 43.(x, y) presented above leads in
a natural way to the following result, which is analogous to Theorem 5.6 for
kli(x, y). However, in spite of the similarity of method, there is a remarkable
discrepancy in the quality of the approximations obtained: while Theorem 5.6
is only of use in a restricted domain, here we obtain a genuine asymptotic
formula with remainder, uniform for y —> co, y < x/2.
Theorem 3. Uniformly for x > y > 2, we have
Proof. The result being trivial for bounded y, we suppose that y > y o , where
yo is a sufficiently large constant. In addition, we may also assume that u> 3
since we have already established the result for 1 < u < 3.
Let A(x, y) be the function implicitly defined by the formula
x r , 1
(24) 4) (x, Y) = log y tw (u) ± log y I •
x log x \ OpAk
(25) 4)(x, y) = (I)(x, N/x) + Tw( 1) ± 1 +On
plogp 1 logp log p Y
with Op = Op(x) E [-1, 1]. We estimate cI)(x, Vx) by the prime number theo-
rem. Furthermore, partial summation enables us to write, provided that yo is
sufficiently large,
(26)
1 2 ± 0(e — V(10gY ) ) 3
<
p(log p) 2 = (log y) 2 — 4(1og y) 2 .
P >Y
6.2 Functional equations 401
\--, 1 co (log x 0 _ 1 ju
v w(v - 1) dH(v)
L-i p log p log p ) log x 2_
y<p< -Vx
where H(v) is defined by (17). It follows from (20) and (22) that the function
v 1-4 v w(v - 1) is continuous for v 2, has a discontinuity of the first kind at
v = 2, and is differentiable for v 2, 3, with uniformly bounded derivative.
Using (17), we can then write
U u
12 v w(v - 1 ) dH(v) = 12
:
w(v - 1) dv + i2— vw(v - 1) df0(e --010gY) )1
u
= UW(U) — 1 + f2— 0(e - V(10") CifVUO — 1)1
V(log)
y) .
= 2/ WM — 1 ± 0 (ue -
k II (x, y) < xu u .
Comparing (11) and (23) and estimating ((1, y) by Mertens' formula, we readily
obtain (27).
It is not hard to improve (27) by working directly with the difference-
differential equation (20) provided the existence and value of the limit of w(u) at
infinity are known a priori. We make precise the result which can be obtained.
Define w i (u) by right-continuity at u = 1 and u = 2.
402 111.6 Integers free of small prime factors
Proof For 1 <u < 2, we have cil(u) = —1/u 2 and p(u) = 1 log u. An ele- —
mentary calculus computation then shows that ici(u)1 < P(u). This inequality
is also true for u = 2, since ci(2) = w"(2+) = 1/4. Writing
we hence have T> 2. Now, since co(u) is continuous for u> 1, we deduce from
(20) that, for u> 2,
and hence lui(7)1 < p(r), which is a contradiction. Therefore T is not finite,
and (28) follows.
Since p(u) is rapidly decreasing as u —> oo, we may write for u> 1
oo 00
w(u) — e —Y = — i w i (t) dt << I p(t)dt.
u u
(t) p(u)
(31) I° p(t) dt << fu°° —p/ dt <<
log(t ± 1) log(u ± 1) •
6.3 Buchstab 's function 403
The term in the square brackets is zero. Denoting by 6 1 the Dirac measure at
u = 1, we deduce from (20) and (21) the following equality of measures
d e-s
— 1:3(s) = - — (1 + C.D(s))
ds s
from which it follows that 1 +C3(s) = CeJ(s) for some suitable constant C and
00
J(s) := o
f s ± t dt.
C
(33) 1+(:)8=.
sp(s)
since
81
' 6(S) = fo e° p(u1s)e - u du --4 p(0) = 1 (s --> oo).
Now, it clearly follows from (22) that ED(s) —> 0 as s —> oo. We hence have
C = 1, and we can state the following result.
404 111.6 Integers free of small prime factors
Now it follows from (20) and (22) that uw' (u) << 1. For a suitable function
6(u) —> 0 and all h, 0 < h < u, we obtain, on the one hand, that
f u+h
w(t) dt = e —Y + 0 (E(u)-
u
h)
h u
and, on the other hand, that
1 fu±h
h u
(A)(t) dt = w(u) + f
0
h u
h
(- dt = w(u) + 0 (-
u
11 ).
w(u) — e
p ( U )H( u )_a (u > 0).
( 36 ) /I <
Proof. We can suppose that u > 2. First observe that the second stated upper
bound easily implies the first, since
00
w(u) - e -Y = - w / (t)dt.
The conclusion then follows by using the fact that H(u)_a is decreasing and
appealing to the estimate (31).
Let h(s) be the Laplace transform of w' (u). As we already noted, we have
1 k+i00
(37) w'(u) =- — h(s)eus ds (u > 2)
27ri k _ ico
where the second term arises from the discontinuity of w(u) at u = 1. Taking
account of (34), it follows that
Recall that
(40) e' = 1 ± su
(41) zA(z) = w
with
A(z) := (1 + ue)( ez z 1 ) u, w := -i7u -2.
Since e(u) -, log u (u ---+ oo)—cf. Lemma 5.8.1—it follows that there exists
some absolute positive constant u o such that
Since A(z) does not vanish for 1z1 < 27/ log(u + 1), u > uo, this implies, by
Rouche's theorem, that (41) has in this disc a unique solution. Lagrange's
theorem—see for example Whittaker and Watson (1927), §7.32 actually states
that this solution is an analytic function of w, and provides an explicit formula
for the Taylor expansion. We make no use of this last piece of information in
this proof.
We could evaluate the integral (37) by choosing precisely K = -- (11,) — Re z.
This would lead to a slightly sharper result—see the Notes. However, here
we content ourselves working with the simpler choice ic = -e(u), which is
heuristically justified by the fact that (41) has a solution approaching 0 as
u ---> oo. This will enable us to use the estimates obtained in Chapter 5 for 13(s)
on the line a =
For the remainder of this proof, we hence let n = -e(u). We first consider
the contribution to (37) from the domain 17 - 1 > e. From the estimate
J(s)
= e : s (1 si) ± o(eIrls3) (r 13)7
it follows using (38) and (35) that, for s = -e(u) + ir, 1-7- 1 > e, we have
e—s 1 e -2s
h(S) = 5{1 + s (1 (1 is ) 2 + 0 ( ler; ) 1 - 5 - e -s
S) ± 2s 2
e —s e -2s
= + +0
iT 2iT ( T3 ) .
We can estimate the contribution of the main terms to the integral by the sec-
ond mean value theorem; estimating that of the remainder in a straightforward
manner, we obtain that the contribution to (37) from the domain 1-7 - 1 > e is
In order to evaluate the contribution from the rest, we use h(s) in the form
(39). We shall show that, for some suitable positive constant a, we have
(42) ;6W -1 <;6( - )-H(u) -2a (a = 1 7 1 5- e).
From this upper bound, it follows that
by Theorem 5.8.
It remains to establish (42), that is (setting T := u/ log 2 (u + 1))
(43) Ref /( + iT)+/(e)} > 2aT + 0(1) (I'm < ee).
The left-hand side is
dv
{ev(1 +cos(Tv)) _2} v
1
2
JO
_
(1 — cos(rv)) dv +
V t et' (1 + cos(rv)) - 77,2 dv
4y 2 dv [ev
= -2 sin v + + Re ( ev(-Fir) )] 1 - log 4
JO v e±iT 1
2
± ± cos A +0(' /u)
0(100 ± T2)) ± 7 2) j
(47) YE := exp { (log y) (3 / 2)- '}, E(x, y) := H (u) - c L E (y) -1 + IT,- ' ,
00
(48) icy (u) := i cv(u - v)y - v dv, W(x, y) := xit y (u) e('''( 1-17 .
o
The letter c, with or without index, stands for some absolute positive constant.
The main purpose of this section consists in establishing the following result.
Theorem 7. Let e > 0. For x > y > 2, we have
Furthermore, it is easy to see that the approximation Z for log ilf(x,y) from
Theorem 5.2 satisfies
y y x log
Z := u log (1
+ )+ log (1 + ) > ii log 4
log x/ log y y
and also that, when u > 2,
X
(51) W ( X,
Y) c(1 , y) < log y
The second bound in (49) follows from this estimate and from Theorem 1.
Proof of Lemma 7.1. A simple integration by parts enables us to write
where the second term on the right-hand side arises from the discontinuity of
w(u) at u = 1.
Taking into consideration the bound for w(u) - e - -Y in Theorem 6, we see
that the proof of (50) reduces to showing that, for x > y > 2, we have
(53) fo CC
( 56 ) ( x, y) =
x
(( 1 ,y)
+
1 r o-Fi" ( ( s ) xs
27ri Lo -ico ((s, Os
ds.
Lemma 5.9.1 and Theorem 5 enable us, for 17 - 1 < Le (y), to approximate the
integrand of (56) by
xeuz (1 + a')(z))
(z := (s - 1) logy).
1 + z/ log y
6.4 Approximations to (1)(x, y) by the saddle-point method 411
Ignoring for the time being the influence of the domain 1-7 - 1 > Le (y) on the
integral in (56), we obtain the heuristic approximation to 4)(x, y)
(57) Wi (X,
X X f - (U) -FiCX)
1 + (s) eus
ds.
Y) := ((1 , y) ±
27ri L“u)-2;00 s+logy
x xe — ")'
x f k±i" 1 -1-1D(s)
(58) Wi (x, y) = +— e s ds.
C(1, y) logy 27rz k _ ico s + log y
1 e -Y
(59) Wi (x, y) =1 + xt
C(1, y) logy ±
Now we observe that, in the domain (He ), Lemma 7.1 implies that
1 e -Y /,y
Wi(x,y) - W(x , Y) = 1 ± xf (1- Ay (u)e log y)
(60) ((1,y) log y i
< xp(u)H (u) - cl L E (y) -1 < ill (x, y)E (x , y).
This difference is of the same order as the error term stated in (49). We can
therefore regard W(x, y) as the natural approximation for cD(x, y) arising from
the saddle-point method.
It remains to make rigorous the argument sketched above. On several oc-
casions we shall appeal to the following estimate, which is a consequence of
Theorem 5.8, Lemma 5.9.1 and Corollary 5.9.3. We have, uniformly in (He ),
i ((s)xs
(62) ds < klf(x, y)E(x, y),
Ja=a0 (( 8, WS
ITI>L
412 111.6 Integers free of small prime factors
((s)x'
(63) ds < klf (x , y) E (x , y),
Os(s,
1 + C,)(s) eus d s
(64) x f < xif(x, y)E(x, 0-
s ± log y
cr=—
1 7 1>L
from which
with
A(s) :=- E A(n)
ns
.
n<y
The error term in (68) takes into account the contribution of the non-prime
integers n in A(s). The 0-estimate is obtained as in Lemma 5.9.1.
6.4 Approximations to (I)(x, y) by the saddle-point method 413
_ -1 r iT (' Yw dw + 0( log
A(s) - (s w)—
411 w y a°
log y
< (log T) 2 y 1- a° expt exP - (log y) 6/ 2 1
(log T)(2/3)+0.0 < Y i
and it is checked without difficulty that the same bound also holds for the
contributions from the horizontal segments. We thus obtain
y l—s
A(s) = 1 8 ± 0 (y l- a° exp - ( log y)2}) .
„i-ct o 1 - a())
X log y > " (1 -I- 0 (y 1 '° exp - ( log y) 6/2})
1 -a0 1-s
and therefore
e “u) 72 uT2
x > , )2 +72) 0(exp fe(u) (logy)
y)'/ 2 1) >>
e(u)( (i - (10 (1 - (10 2 +7-2 '
T2 1 1
(69) (1 _ ce0 )2 >
7-2 - ( u )2 + 1 >> log 2 y -
Pla0 (1 +p-ao) 2
< exp { 1 -1- cos(T log p)
4pao
414 111.6 Integers free of small prime factors
If 1 7 1 > 1/ logy, the lower bound established for X is also valid for V: it
suffices to consider A(a 0 )+Re A(a o +i-r). Taking (69) into account, we certainly
have (66) in this case. For 17 - 1 < 1/ log y, we can write
(70) kli(x + x/z, y) — Ilf(x, y) < 41(x, y)/(u) log yll/z + e —c3u 1.
1/ sintzy
. w(t)e 't dt = 1 (1 —
1.6 (r) := fcc 7 )±
W(t) := tZ ' 2z
,
« E ( --_ ) a° W10g
\n/
(
\
----)
n
—
27 °±2i2ziz (GS, y)xs frier) ds
1 i ao_
=f
P+(n)<Y
1
< x`"((cto, y) + xa° max 1C(cto ± kr, 01-
i<iri<2z
( ( s ) xs , T1_041
ds < e°((ao, OH(u) c2
fcr= ((s, Osa0 ( 1 — ao) 2 < 41(x ' °E(x ' Y)
L<ITK
6.4 Approximations to '1(x, y) by the saddle-point method 415
for some suitable choice of the constant c appearing in the definition of E (x , y),
formula (47). Thus it remains to show that the integral
R :=
f
Cr= ckO
ITI>T
((s)xs ds
((s, y)s
also has the same order of magnitude as that of the previous upper bound.
The crude form of the approximate functional equation of the zeta function
(Corollary 11.3.5.1) enables us to write, for s = ao ir,
d xa °((cto, Y)
xa°((ao, y) fT ,r i+ao ilf(x,y)E(x,y),
VT
where we have again appealed to (61).
The first effective Perron formula, in the form of the bound (11.2.7), enables
us to write, uniformly for z > 0,
fcr= a0
I T I> T
The contribution to R from the main term in (73) can therefore be estimated
as follows:
00
x )8 ds
(74)
n=1 P+(m)<y I CI=Cto
TI oo
?lnaX(T,71,)
mn) s
(X/Trin) °
<E E n=1 P+(m)<Y
1+(T + n)1 log(x/rrin)1
The contribution to (74) arising from pairs (m, n) such that I log(x/mn)I > 1
is
co
1 xla °((ao,Y)
< (x , y)E(x, y).
ne, o(T+n) < N/T
n=1
416 111.6 Integers free of small prime factors
For the remaining pairs (m, n), with I 1og(x/mn)1 < 1, it is useful to bear in
mind that T < x whenever x and hence y is sufficiently large. Let Si, 82 denote
the respective contributions corresponding to the cases m < xIT,m> xIT.
On the one hand
1
Si <
1 + In - xtml
m<x/T, P ± (7n)Y x / enl<nex /rn
where the penultimate inequality is just Rankin's upper bound for W(x/T, y).
When (m, n) is counted in S2, we certainly have n < eT. From this we
obtain that
1
S2 <
E E
nGeT x/en<mGex /n
1+ Tlxlmn - 11
P± (m)<y
< klf(xln,y)E(x,y) 2
provided that u > (log y) 3 ( 1- ')/ 5 . In the contrary case, we appeal to the trivial
bound
<xl(nVT) < (x1n)u -2u T -1 I3 <T(xln,y)E(x,y) 2 .
Hence, in any case, we have that
n<eT n<eT
Proof of (64). Formula (35) for 1 +.-o(s) and the trivial bound J(s) < e-r1-1
imply that
1 + ci)(s) . 1 +0( 1 ±use(u) ) (1 7 1> L).
This shows that the integral in (64) is conditionally convergent. The second
mean value theorem allows us to write
x x p(u) ( H t u) - co ± K _i) .
(75)
(I) (x'Y) ((1,y) < logy
0( { _
(76) 43.(x, y) = (xco(u) - y) 0, y) + log2 y H(u) C6 ± Ye-1 }).
Proof. The error term in (76) is compatible with the first upper bound of (49).
It thus suffices to show that (76) is valid with 41)(x, y) replaced by W (x, y). By
(52), we have
xe Do
W(x,y) = (xco(u) - y) (( e:,y) ((i ty) 1 c,./(u - v)y' dv.
f oo ( e i±“u))v
dv < 11(uraP(u)
Jo y logy
since in (HE ), we have, for example, that e 1 (u) < Vy whenever x and hence
y is sufficiently large. This implies (76).
418 111.6 Integers free of small prime factors
Proof. The result is (amply) implied by (76) when (x, y) is in (H6 ). In the
contrary case, we deduce from (52) and (53) that
e-Y xe - u/ 3
W(x, y) - (xw(u) - y) ((1,y) « 41 (x, Y) < (l og y)2
where the last estimate follows from Theorem 5.1. This yields the stated result.
Notes
1 ) ± 0 ( log2 u
(80) 0'(u) = 2 7 (1 ±
log u ) (log u) 2 )
1 n log2 n )
log n + `-' ( (log n) 2 ) •
His proof rests essentially on the saddle-point method with a "good" choice of
the abscissa of integration, such as was described in the proof of Theorem 6.
Notes 419
Dk-Fi
+ co (k+1) (u - v)y - v dv
(log y)k
This is clear when (x, y) belongs to (H,). In the contrary case, the error term
0(kli(x, y)) of Theorem 7 is certainly
x exp { - L,(y)}.
It can be shown that Lemma 7.2 holds for x > y > log x, subject to replacing
al) by the actual solution ce(x, y) to equation (5)—cf. Hildebrand & Tenenbaum
(1986), lemma 8, and Tenenbaum (1990), corollary to lemma 1.
420 111.6 Integers free of small prime factors
Exercises
[x] = 1i(xln,y)+41(x,y)—W(xly,y).
n<x I y , P+ (n)<y
(b) Using Theorems 5.6 and 6.3, deduce from (a) the convolution identity
z<l)v,Pli lin
E
v. k.
(b) By using the results of Exercise 3 (a) Sz (b), show that, for 0 < A < 1,
u > 1,
aeo = 00(A, u — 1) — 00(A, u — A).
u au
(c) Show the existence of some absolute constant A such that
aeo < AAp(u) (3 < u < 4)
2 k! \ ' — A ,
A/ 1\k
(b) Ok (A, u) = -17. logj
--)-- {1 ± 0(p(u — k)) 1.
6. Show, uniformly under the conditions y > z(1 + 11 LE(z)), zy1+1 < X,
k < log(1/A)L e (z), that
Wk(x, y, z) = x9 k (A, u){1 + 0(1/logz)}.
sK,,,
(d) Calculate the Laplace transform R A (s) := foc"° e-u 0 u)du and de-
duce, by part (b), that
00 8
1
-fo(A, s) .
=1o e - "00(A, u) du = - exp { - f e' c.v }.
S As v
(e) Recover the above result by using the definition of 6 00 (A, u) given in
Exercise 2.
1 (u > 0)
8. Let Y(u) denote the Heaviside function Y(u) =
0 (u < 0) • {
(-1)k (n *k
* Y)(A,u)
00(A, 71) =
k!
k=0
with
dvi dvk
Fo (A,u) = 1, Fk(A,u) =
Vi Vk
'[A,
E vi< u
(b) Recover the differential equation 4(a).
9. Let VA(s) := fr e - usp(u/A) du. Show, for s 0, that
E 0 k (A,u)e k =
00
1
-1j Fk(A,u)(e - 1) k
k=0 k=0
K. Alladi,
1982. The Turan—Kubilius inequality for integers without large prime factor, J. reine
angew. Math. 335, 180-196.
1988. Probabilistic Number Theory and Brun's sieve, in: C. Goldstein (ed.), Semi-
naire de Theorie des Nombres, Paris 1986-87, Prog. Math. 75 (Birkhauser),
1-26.
K. Alladi & P. Erdos,
1977. On an additive arithmetic function, Pacific J. Math. 71, no. 2, 275-294.
1979. On the asymptotic behavior of large prime factors of integers, Pacific J. Math.
82, no. 2, 295-315.
E. Aparicio Bernardo,
1981. Sobre unas sistemas de numeros algebraicos de D. S. Gorshkov y sus aplica-
clones al calculo, Revista Matematica Hispano-Americana 41, 3-17.
R. Ayoub,
1963. An introduction to the analytic theory of numbers, AMS Math. Surveys 10
(Providence).
G.J. Babu,
1973. Some results on the distribution of additive arithmetic functions, II, Acta
Arith. 23, 315-328.
1992. Smoothness of the distributions of arithmetic functions, in: F. Schweiger &
E. Manstavieius (eds.), Analytic and probabilistic methods in number theory,
New Trends in Probab. and Statist. 2, 191-199, VSP/TEV.
C.G. Bachet, sieur de Meziriac,
1624. Problemes plaisans et delectables qui se font par les nom,bres, second edition;
first edition: 1612.
M. Balazard,
1987. Sur la repartition des valeurs de certaines fonctions arithmetiques, These,
Universite de Limoges.
1990. Unimodalite de la distribution du nombre de diviseurs premiers d'un entier,
Ann. Inst. Fourier (Grenoble), 40, no. 2, 255-270.
M. Balazard, H. Delange & J.-L. Nicolas,
1988. Sur le nombre de facteurs premiers des entiers, C. R. Acad. Sci. Paris, 306,
Serie I, 511-514.
M. Balazard & A. Smati,
1990. Elementary proof of a theorem of Bateman, in: B. Berndt, H. Diamond, H.
Halberstam & A. Hildebrand (eds.), Analytic Number Theory (Urbana, 1989),
Prog. Math. 85 (Birkhauser), 41-46.
P. Bateman,
1972. The distribution of values of the Euler function, Acta Arith. 21, 329-345.
F. Behrend,
1935. On sequences of numbers not divisible one by another, J. London Math. Soc.
10, 42-44.
Bibliography 425
A.C. Berry,
1941. The accuracy of the Gaussian approximation to the sum of independent vari-
ates, Trans. Amer. Math. Soc. 49, 122-136.
A.S. Besicovitch,
1934. On the density of certain sequences, Math. Annalen 110, 336-341.
N.H. Bingham, C.M. Goldie & J.L. Teugels,
1987. Regular variation, Cambridge University Press.
A. Blanchard,
1969. Initiation a la theorie analytique des nombres premiers, Dunod, Paris.
H. Bohr,
1910. Bidrag til de Dirichlet'ske Raekkers Theori, Thesis, Copenhagen (= Collected
Mathematical Works III, Si).
1935. Ein allgemeiner Satz iiber die Integration eines trigonometrischen Polynoms,
Prace Matem. Fiz., 273-288 (= Collected Mathematical Works II, C 36).
E. Bombieri,
1965. On the large sieve, Mathematika 12, 201-225.
1974. Le grand crible dans la theorie analytique des nombres, Asterisque 18 (Societe
mathematique de France).
E. Bombieri & H. Davenport,
1966. Small differences between prime numbers, Proc. Roy. Soc. Ser. A 293, 1-18.
E. Bombieri & H. Iwaniec,
1986. On the order of (“- + it), Ann. Sc. Norm. Sup. Pisa Cl. Sc. (4) 13, 449-472.
J.D. Bovey,
1977. On the size of prime factors of integers, Acta Arith. 33, 65-80.
N.G. de Bruijn,
1950. On the number of uncancelled elements in the sieve of Eratosthenes, Proc.
Kon. Ned. Akad. Wetensch. 53, 803-812.
1951a. The asymptotic behaviour of a function occurring in the theory of primes, J.
Indian Math. Soc. (N.S.) 15, 25-32.
1951b. On the number of positive integers < x and free of prime factors > y, Nederl.
Akad. Wetensch. Proc. Ser. A 54, 50-60.
1966. On the number of positive integers < x and free of prime factors > y, II,
Nederl. Akad. Wetensch. Proc. Ser. A 69, 239-247 = Indag. Math. 28, 239-
247.
1970. Asymptotic methods in Analysis, North Holland (Amsterdam), third edition;
new printing: Dover (New York), 1981.
N.G. de Bruijn, C. van Ebbenhorst Tengbergen & D. Kruyswijk,
1949-51. On the set of divisors of a number, Nieuw Arch. f. Wisk. Ser. II 23, 191-
193.
V. Brun,
1917. Sur les nombres premiers de la forme ap + b, Arkiv. for Math. og Naturvid. B
34, no. 14,9 pp.
1919a. Le crible d'Eratosthene et le theoreme de Goldbach, C. R. Acad. Sci. Paris
168, 544-546.
426 Bibliography
H. Daboussi,
1974. See: Daboussi Sz Delange (1974).
1979. On the density of direct factors of the set of positive integers, J. London Math.
Soc. (2) 19, 21-24.
1981. On the limiting distribution of non-negative additive functions, Compositio
Math. 43, 101-105.
1982. La methode de convolution, Theorie Elementaire et Analytique des Nombres
(J. Coquet ed.), Journees SMF—CNRS de Valenciennes, Dep. Math. Univ.
Valenciennes, 26-29.
1984. Sur le theoreme des nombres premiers, C. R. Acad. Sc. Paris 298, Serie I,
no. 8, 161-164.
1989. On a convolution method, in: E. Aparicio, C. Calderon & J.C. Peral (eds.),
Congreso de Teoria de los Nameros (Universitad del Pais Vasco), 110-137.
H. Daboussi & H. Delange,
1974. Quelques proprietes des fonctions multiplicatives de module au plus egal a 1,
C. R. Acad. Sci. Paris Ser. A 278, 657-660.
1982. On multiplicative arithmetical functions whose modulus does not exceed one,
J. London Math. Soc. (2) 26, 245-264.
1985. On a class of multiplicative functions, Acta Sci. Math. 49, 143-149.
H. Davenport,
1980. Multiplicative Number Theory (second edition), Springer, New York, Heidel-
berg, Berlin.
H. Davenport & P. ErdOs,
1937. On sequences of positive integers, Acta Arith. 2, 147-151.
1951. On sequences of positive integers, J. Indian Math. Soc. 15, 19-24.
H. Davenport & H. Halberstam,
1966. The values of a trigonometric polynomial at well spaced points, Mathematika
13, 91-96. Corrigendum and Addendum, Mathematika 14 (1967), 229-232.
H. Delange,
1954. Generalisation du Theoreme de Ikehara, Ann. Sci. Ec. Norm. Sup. (3) 71,
fasc. 3, 213-242.
1959. Sur des formules dues a Atle Selberg, Bull. Sc. Math. 2° serie 83, 101-111.
1961. Sur les fonctions arithmetiques multiplicatives, Ann. Scient. Ec. Norm. Sup.,
3° serie, 78, 273-304.
1971. Sur des formules de Atle Selberg, Acta Arith. 19, 105-146.
1982. Theorie probabiliste des nombres, in: Journees Arithmetiques, Metz 1981,
Asterisque 94, 31-42.
J.-M. Deshouillers, F. Dress & G. Tenenbaum,
1979. Lois de repartition des diviseurs, 1, Acta Arith. 23, 273-285.
H.G. Diamond,
1982. Elementary methods in the study of the distribution of prime numbers, Bul-
letin (N.S.) of the Amer. Math. Soc. 7, 553-589.
1984. A number theoretic series of I. Kasara, Pacific J. Math. 111, no. 2, 283-285.
1988. Review no. 88a: 40006, Mathematical Reviews.
428 Bibliography
W.J. Feller,
1970. An introduction to probability theory and its applications, vol. 1, John Wiley
(1st ed. 1950).
1971. An introduction to probability theory and its applications, vol. 2, John Wiley
(1st ed. 1966).
E. Fouvry Sz F. Grupp,
1986. On the switching principle in sieve theory, J. reine angew. Math. 370, 101-
125.
E. Fouvry Sz G. Tenenbaum,
1991. Entiers sans grand facteur premier en progressions arithmetiques, Proc. Lon-
don Math. Soc. (3) 63, 449-494.
G. Freud,
1952. Restglied eines Tauberschen Satzes I, Acta Math. Acad. Sci. Hung. 2, 299-308.
1953. Restglied eines Tauberschen Satzes II, Acta Math. Acad. Sci. Hung. 3, 299-
307.
1954. Restglied eines Tauberschen Satzes III, Acta Math. Acad. Sci. Hung. 5, 275-
289
G. Freud & T. Ganelius,
1957. Some remarks on one sided approximation, Math. Scand. 5, 276-284.
J.B. Friedlander & A. Granville,
1989. Limitations to the equidistribution of primes I, Ann. Math. 129, 363-382.
J.B. Friedlander, A. Granville, A. Hildebrand Sz H. Maier,
1991. Oscillation theorems for primes in arithmetic progressions and for sifting func-
tions, J. Amer. Math. Soc. 4, 25-86.
J. Galambos,
1970. Distribution of arithmetical functions. A survey, Ann. Inst. H. Poincare Sect.
B (N.S.) 6, 281-305.
1971. On the distribution of strongly multiplicative function, Bull. London Math.
Soc. 3, 307-312.
1976. The sequences of prime divisors of integers, Acta Arith. 31, 213-218.
J. Galambos & P. Sziisz,
1986. On the distribution of multiplicative arithmetical functions, Acta Arith. 47,
57-62.
P.X. Gallagher,
1967. The large sieve, Mathematika 14, 14-20.
T. Ganelius,
1971. Tauberian Remainder Theorems, Springer Lecture Notes 232.
A. Gelfond,
1946. Comments to the papers 'On the determination of the number of prime num-
bers not exceeding a given quantity' and 'On prime numbers' (in Russian),
in: Collected works of P. L. Chebyshev, vol. 1, Moscow-Leningrad, 285-288.
A. Gelfond Si Y. Linnik,
1965. Methodes elementaires dans la theorie analytique des nom,bres, Gauthier-
Villars, Paris.
Bibliography 431
L. S. Gorshkov,
1956. On the deviation of polynomials with rational integer coefficients from zero on
the interval [0, 1] (in Russian), in: Proceedings of the 3rd All-union congress
of Soviet mathematicians, vol. 3, 5-7, Moscow.
S.W. Graham,
1981a. The distribution of squarefree numbers, J. London Math. Soc. (2) 24, 54-64.
1981b. On Linnik's constant, Acta Arith. 39, 163-179.
S.W. Graham & G. Kolesnik,
1991. Van der Corput's method of exponential sums, London Math. Soc. Lecture
Notes 126, Cambridge University Press.
S.W. Graham Si J.D. Vaaler,
1981. A class of extremal functions for the Fourier transform, Trans. Amer. Math.
Soc. 265, 283-302.
1984. Extremal functions for the Fourier transform and the large sieve, in G. Halasz
(ed.) Topics in classical number theory, Colloq. Budapest 1981, vol. I, Colloq.
Math. Soc. Janos Bolyai 34, 599-615.
E. Grosswald,
1982. Oscillation theorems, Conference on the theory of arithmetic functions,
Springer Lecture Notes 251, 141-168.
G. Halasz,
1968. tber die Mittelwerte multiplikativer zahlentheoretischer Funktionen, Acta
Math. Acad. Sci. Hung. 19, 365-403.
1971. On the distribution of additive and the mean values of multiplicative arith-
metic functions, Studia Scient. Math. Hung. 6, 211-233.
1972. Remarks to my paper 'On the distribution of additive and the mean values
of multiplicative arithmetic functions', Acta Math, Acad. Scient. Hung. 23,
425-432.
H. Halberstam 8./ H.-E. Richert,
1974. Sieve Methods, Academic Press, London, New York, San Francisco.
1979. On a result of R.R. Hall, J. Number Theory (1) 11, 76-89.
H. Halberstam & K.F. Roth,
1966. Sequences, Oxford; second edition, Springer, 1983.
R.R. Hall,
1974. Halving an estimate obtained from Selberg's upper bound method, Acta Arith.
25, 347-351.
1978. A new definition of the density of an integer sequence, J. Austral. Math. Soc.
Ser. A 26, 487-500.
1981. The divisor density of integer sequences, J. London Math. Soc. (2) 24, 41-53.
R.R. Hall & G. Tenenbaum,
1982. On the average and normal orders of Hooley's A-function, J. London Math.
Soc. (2) 25, 392-406.
1986. Les ensembles de multiples et la densite divisorielle, J. Number Theory 22,
308-333.
1988. Divisors, Cambridge Tracts in Mathematics 90, Cambridge.
1991. Efffective mean value estimates for complex multiplicative functions, Math.
Proc. Camb. Phil. Soc. 110, 337-351.
432 Bibliography
D. Hanson,
1972. On the product of primes, Canad. Math. Bull. 15, 33-37.
G.H. Hardy,
1914. Sur les zeros de la fonction ((s) de Riemann, C. R. Acad. Sci. Paris 158,
1012-1014.
1916a. On Dirichlet's divisor problem, Proc. London Math. Soc. 15, (2) 1-25.
1916b. The average orders of the functions P(x) and A(x), Proc. London Math. Soc.
15 (2), 192-213.
1949. Divergent Series, Oxford at the Clarendon Press.
G.H. Hardy & J.E. Littlewood,
1913. Contributions to the arithmetic theory of series, Proc. London Math. Soc. (2)
11, 411-478.
1921. The zeros of Riemann's zeta function on the critical line, Math. Z. 10, 283—
317.
1922. Some problems of partitio numerorum III: On the expression of a number as
a sum of primes, Acta Math. 44, 1-70.
1923. The approximate functional equation in the theory of the zeta function, with
applications to the divisor problems of Dirichlet and Piltz, Proc. London Math.
Soc. (2), 21, 39-74.
1929. The approximate functional equations for ((s) and ((s) 2 , Proc. London Math.
Soc. (2) 29, 81-97.
G.H. Hardy & S. Ramanujan,
1917. The normal number of prime factors of a number n, Quart. J. Math. 48,
76-92.
G.H. Hardy & M. Riesz,
1915. The general theory of Dirichlet series, Cambridge Tracts in Mathematics and
Mathematical Physics no. 18, Cambridge University Press; 2nd impression:
1952.
G.H. Hardy & E.M. Wright,
1938. An introduction to the theory of numbers, Oxford (5th ed. 1979).
D.R. Heath-Brown,
1992. Zero-free regions for Dirichlet L-functions, and the least prime in an arithmetic
progression, Proc. London Math. Soc. (3) 64, no. 2,265-338.
W. Hengartner & R. Theodorescu,
1973. Concentration functions, Academic Press, New York, London.
D. Hensley,
1986. The convolution powers of the Dickman function, J. London Math. Soc. (2)
33, 395-406,
1987. The distribution of round numbers, Proc. London Math. Soc. (3) 54, 412-444.
E. Heppner,
1974. Uber die Iteration von Teilerfunktionen, J. reine angew. Math. 265, 176-182.
A. Hildebrand,
1983. An asymptotic formula for the variance of an additive function, Math. Z. 183,
145-170.
1984a. Integers free of large prime factors and the Riemann hypothesis, Mathematika
31, 258-271.
Bibliography 433
1987. Fluctuations in the mean of Euler's phi function, Proc. Indian Acad. Sci.
Math. Sci. 97, nos. 1-3,239-245.
H.L. Montgomery Si R.C. Vaughan,
1973. On the large sieve, Mathem,atika 20, 119-134.
1974. Hilbert's inequality, J. London Math. Soc. (2) 8, 73-81.
1977. Exponential sums with multiplicative coefficients, Invent. Math. 43, 69-82.
1979. Mean values of character sums, Can. J. Math. 31, 476-487.
1981. The distribution of squarefree numbers, in: H. Halberstam & C. Hooley (eds.),
Recent Progress in Analytic Number Theory, Academic Press; vol. 1,247-256.
1994. Mean values of multiplicative functions, preprint.
L. Moser Si J. Lambek,
1953. On monotone multiplicative functions, Proc. Amer. Math. Soc. 4, 544-545.
M. Nakni,
1988. Les entiers sans facteur carre < x dont les facteurs premiers sont < y, Groupe
de travail en theorie analytique des nombres 1986-87, 69-76, Publ. Math. Orsay
88-01, Univ. Paris XI, Orsay.
M. Nair,
1982a. On Chebyshev-type inequalities for primes, Amer. Math. Monthly, 89, no. 2,
126-129.
1982b. A new method in elementary prime number theory, J. London Math. Soc.
(2) 25, 385-391.
P. Nanopoulos,
1975. Lois de Dirichlet sur N* et pseudo-probabilites, C. R. Acad. Sci. Paris 280,
Serie A, 1543-1546.
1977. Lois zeta et fonctions arithmetiques additives: loi faible des grand nombres,
C. R. Acad. Sci. Paris 285, Serie A, 875-877.
1982. Lois zeta et fonctions arithmetiques additives: convergence vers une loi nor-
male, C. R. Acad. Sci. Paris, Serie I, 295, 159-161.
D.J. Newman,
1980. Simple analytic proof of the prime number theorem, Amer. Math. Monthly
87, no. 9,693-696.
J.-L. Nicolas,
1974/75. Grandes valeurs des fonctions arithmetiques, Sem. de Theorie des Nombres
(Delange-Pisot-Poitou) 16-ieme annee, no. G20.
1978. Sur les entiers N pour lesquels il y a beaucoup de groupes abeliens d'ordre N,
Ann. Inst. Fourier 28, 1-16.
1983a. Petites valeurs de la fonction d'Euler, J. Number Theory 17, 375-388.
1983b. Autour de formules dues a A. Selberg, Pub. Math. Orsay 83-04, Colloque
Hubert Delange, 122-134.
1984. Sur la distribution des nombres entiers ayant une quantite fixee de nombres
premiers, Acta Arith. 44, 191-200.
1988. On highly composite numbers, in: G. E. Andrews, R. A. Askey, B. C. Berndt,
K. G. Ramanathan & R. A. Rankin (eds.), Ramanujan Revisited, Academic
Press, 215-244.
438 Bibliography
K.K. Norton,
1971. Numbers with small prime factors and the least kth power non-residue, Mem.
Amer. Math. Soc. no. 106.
1976. On the number of restricted prime factors of an integer I, Illinois J. Math.
20, 681-705.
1978. Estimates for partial sums of the exponential series, J. of Math. and Applica-
tions 63, 265-296.
1979. On the number of restricted prime factors of an integer II, Acta Math. 143,
9-38.
1982. On the number of restricted prime factors of an integer III, Enseign. Math.,
II Ser., 28, 31-52.
E.V. Novoselov,
1964. A new method in probabilistic number theory (in Russian), iv). Akad. Nauk
SSSR, Ser. Mat., 28, 307-364.
A. Oppenheim,
1926. On an arithmetic function, J. London Math. Soc. 1, 205-211.
1927. On an arithmetic function, II, J. London Math. Soc. 2, 123-130.
R.E. A. C . Paley,
1932. A theorem on characters, J. London Math. Soc. 7, 28-32.
E. Phillips,
1933. The zeta function of Riemann: further developments of van der Corput's
method, Quart. J. Math. (Oxford) 4, 209-225.
J. Pintz,
1984. On the remainder term of the prime number formula and the zeros of Rie-
mann's zeta function, in: H. Jager (ed.), Number Theory, Noordwijkerhout
1983, Springer Lecture Notes 1068,186-197.
G. Polya,
1919. Verschiedene Bemerkungen zur Zahlentheorie, Jahresber. Deutsch. Math.-Ver.
28, 31-40.
C. Pomerance,
1984. On the distribution of round numbers, in: K. Alladi (ed.), Number theory,
Proc. 4th Matsci. Conf. Ootacamund, India, 1984, Springer Lecture Notes
1122,173-200.
K. Prachar,
1958. Uber die kleinste quadratfrei Zahl einer arithmetischen Reihe, Monatsch.
Math. 62, 173-176.
S. Ramanuj an,
1915. Highly composite numbers, Proc. London Math. Soc. (2) 14, 347-409.
R. Rankin,
1938. The difference between consecutive prime numbers, J. London Math. Soc. 13,
242-247.
A. Renyi,
1950. On the large sieve of Ju. V. Linnik, Compositio Math. 8, 68-75.
1955. On the density of certain sequences of integers, Publ. Inst. Math. (Belgrade)
8, 157-162.
1965. A new proof of a theorem of Delange, Publ. Math. Debrecen 12, 323-329.
Bibliography 439
A. Renyi Si P. Turan,
1958. On a theorem of Erdos—Kac, Acta Arith. 4, 71-84.
G.J. Rieger,
1972. Uber einige arithmetische Summen, Manuscripta Math. 7, 23-34.
1983. On Wiener's method in prime number theory, Abstracts Amer. Math. Soc. 4,
no. 5, Abstract 802-10-86, p. 144; II, ibid., no. 5, Abstract 83 T-10-345, p. 392.
B. Riemann,
1859. UTher die Anzahl der Primzahlen unter einer gegebenen GrOsse, Monatsber-
ichte der Berliner Akademie, 671-680; see: CEuvres de Riemann, Albert Blan-
chard, Paris 1968,165-176.
B.A. Rogozin,
1961. An estimate for concentration functions, Theor. Probab. Appl. 6, 94-97.
J.B. Rosser Si L. Shoenfeld,
1962. Approximate formulas for some functions of prime numbers, Illinois J. Math.
6, 64-94.
1975. Sharper bounds for the Chebyshev functions 0(x) and 0(x), Math. Comp. 29,
243-269.
K.F. Roth,
1965. On the large sieves of Linnik and Renyi, Mathematika 12, 1-9.
W. Rudin,
1970. Real and Complex Analysis, McGraw-Hill.
I.Z. Ruzsa,
1982. Effective results in probabilistic number theory, in: J. Coquet (ed.), Theorie
elem,entaire et anal ytique des nombres, Dept. Math. Univ. Valenciennes, 107—
130.
1983. On the variance of additive functions, Studies in Pure Mathematics, Mem. of
P. Turan, 577-586.
1984. Generalized moments of additive functions, J. Number Theory 18, 27-33.
B. Saffari,
1976. On the asymptotic density of sets of integers, J. London Math. Soc. (2) 13,
475-485.
1979. Com,portement a l'infini de la transform,ee de Fourier de 'if pour f (fortement)
additive telle que f (p) = (log p)a , a > 0 fixe, private communication.
E. Saias,
1989. Sur le nombre des entiers sans grand facteur premier, J. Number Theory 32,
no. 1, 78-99.
A. Sarkozy,
1977a. Some remarks concerning irregularities of distribution of sequences of integers
in arithmetic progressions, IV, Acta Math. Acad. Sci. Hung. 30, 155-162.
1977b. Remarks on a paper of G. Halasz, Periodica Math. Hung. 8, 135-150.
L.G. Sathe,
1953. On a problem of Hardy and Ramanujan on the distribution of integers having
a given number of prime factors, J. Indian Math. Soc. 17, 63-141.
1954. (same title), J. Indian Math. Soc. 18, 27-81.
440 Bibliography
L.G. Schnirelmann,
1930. Uber additive Eigenschaften von Zahlen, Annals Polyt. Inst. Novocherkassk.
14, 3-28; Math. Annalen 107 (1933), 649-690.
I.J. Schoenberg,
1928. tber die asymptotische Verteilung reeller Zahlen mod 1, Math. Z. 28, 171-200.
1936. On asymptotic distributions of arithmetical functions, Trans. Amer. Math.
Soc. 39, 315-330.
L. Schoenfeld,
1976. Sharper bounds for the Chebyshev functions 0(x) and 7,b(x), II, Math. Comp.
30, 337-360.
A. Selberg,
1942. On the zeros of Riemann's zeta-function on the critical line, Skr. Norske Vid.
Akad. Oslo, no. 10.
1949. An elementary proof of the prime number theorem, Ann. Math. 50, 305-313.
1954. Note on the paper by L.G. Sathe, J. Indian Math. Soc. 18, 83-87.
H.N. Shapiro,
1950. On the number of primes less than or equal to x, Proc. Amer. Math. Soc. 1,
346-348.
1959. Tauberian theorems and elementary prime number theory, Comm. Pure Appl.
Math. 12, 579-610.
1972. On the convolution ring of arithmetic functions, Comm. Pure Appl. Math. 25,
287-336.
C.L. Siegel,
1936. tber die Classenzahl quadratischer Zahlkorper, Acta Arith. 1, 83-86.
1966. Gesammelte Abhandlungen, Springer-Verlag, vol. 1,406-409.
V. Sitaramaiah Si M.V. Subbarao,
1993. The maximal order of certain arithmetic functions, Indian J. Pure Appl. Math.
24(6), 347-355.
H. Smida,
1991. Sur les puissances de convolution de la fonction de Dickman, Acta Arith. 59,
123-143.
A. Smith,
1980. On Shapiro's Tauberian theorem, Carleton Math. Ser. 170, 4p.
A.V. Sokolovskii,
1979. Lower bounds in the "large sieve", Zap. Nau 6n.. Sem. Leningrad. Otdel. Mat.
Inst. Steklov. (LOMI) 91, 125-133, 181-182; English translation: J. Soviet.
Math. 17 (1981), 2166-2173.
E. Sperner,
1928. Ein Satz iiber Untermengen einer endlichen Menge, Math. Z. 27, 544-548.
H. Squalli,
1985. Sur la repartition du noyau d'un entier, These 3-ieme cycle, Univ. Nancy I.
C.M. Stein,
1984. On the Turcin—Kubilius inequality, Technical Report no. 220, Stanford Univer-
sity.
Bibliography 441
T.J. Stieltjes,
1887. Note sur la mutliplication de deux series, Nouvelles Annales de Mathemati-
ques, ser. 3,6, 210-215.
D. Suryanarayana & R.R. Sita Rama Chandra,
1973. The distribution of squarefull integers, Arkiv. Math. 11, 195-201.
P. Sziisz,
1974. Remark to a theorem of P. Erdos, Acta Arith. 26, 97-100.
A. Tauber,
1897. Ein Satz aus der Theorie der unendlichen Reihen, Monatshefte fur Mathematik
und Physik, 3, 273-277.
G. Tenenbaum,
1982. Sur la densite divisorielle d'une suite d'entiers, J. Number Theory 15, no. 3,
331-346.
1985. Sur la concentration moyenne des diviseurs, Comment. Math. Helvetici 60,
411-428.
1987. Sur un probleme extremal en Arithmetique, Ann. Inst. Fourier (Grenoble)
37, 2, 1-18.
1988. La methode du col en theorie analytique des nombres, in: C. Goldstein (ed.),
Sem,inaire de Theorie des Nombres, Paris 1986-87, Prog. Math. 75 (Birk-
hauser), 411-441.
1990. Sur un probleme d'ErdOs et Alladi, in: C. Goldstein (ed.), Seminaire de Theo-
re des Nombres, Paris 1988-89, Prog. Math. 91 (Birkhauser), 221-239.
E.C. Titchmarsh,
1939. The theory of functions, Oxford University Press (second edition, new printing
in 1979).
1951. The theory of the Riem,ann zeta-function, Oxford University Press.
E.C. Titchmarsh & D.R. Heath-Brown,
1986. The theory of the Riemann zeta-function, Oxford.
K.C. Tong,
1956. On divisor problems II, III, Acta Math. Sinica 6, 139-152; 515-541.
P. Turan,
1934. On a theorem of Hardy and Ramanujan, J. London Math. Soc. 9, 274-276.
1936. Uber einige Verallgemeinerungen eines Satzes von Hardy und Ramanujan, J.
London Math. Soc. 11, 125-133.
J.D. Vaaler,
1985. Some extremal functions in Fourier Analysis, Bull. (N.S.) of the Amer. Math.
Soc. 12, 183-216.
G. Valiron,
1955. Theorie des fonctions (second edition), Masson, Paris.
R.C. Vaughan,
1980. An elementary method in prime number theory, Acta Arith. 37, 111-115.
A.I. Vinogradov,
1965. On the density hypothesis for Dirichlet L-functions, Izv. Akad. Nauk SSSR,
Ser. Math. 29, 903-934.
1966. Correction to the paper of A.I. Vinogradov 'On the density hypothesis for
Dirichlet L-functions' , Izv. Akad. Nauk SSSR, Ser. Math. 30, 719-720.
442 Bibliography
I.M. Vinogradov,
1954. The method of trigonometric sums in the theory of numbers, Interscience.
1958. A new estimate for ((1 + it) (in Russian), /zv. Akad. Nauk SSSR, Ser. Math.
22, 161-164.
G. VoronoI,
1903. Sur un probleme de calcul des fonctions asymptotiques, J. reine angew. Math.
126, 241-282.
M. Vose,
1984. Integers with consecutive divisors in small ratio, J. Number Theory 19, 233—
238
A. Walfisz,
1963. Weylsche Exponentialsum,men in der Neueren Zahlentheorie, VEB Deutscher
Verlag, Berlin.
R. Warlimont,
1969. On squarefree numbers in arithmetic progressions, Monatsh. Math. 73, 433—
448.
1980. Squarefree numbers in arithmetic progressions, J. London Math. Soc. (2) 22,
no. 1, 21-24.
N. Watt
1989. Exponential sums and the Riemann zeta function II, J. London Math. Soc.
(2) 39, 385-404.
H. Weyl
1916. Uber die Gleichverteilung von Zahlen mod. Eins, Math. Ann. 77, 313-404.
1921. Zur Abschatzung von ((1 + ti), Math. Z. 10, 88-101.
E.T. Whittaker 8.i G.N. Watson,
1927. A course of modern analysis (4th edition), Cambridge University Press (new
printing: 1986).
D.V. Widder,
1946. The Laplace transform, Princeton University Press, Princeton, New Jersey.
1971. An introduction to transform, theory, Academic Press, New York and London.
E. Wirsing,
1956. UTher die Zahlen, deren Primteiler einer gegeben Menge angehoren, Arch. der
Math. 7, no. 4, 263-272.
1967. Das asymptotische Verhalten von Summen iiber multiplikative Funktionen II,
Acta Math. Acad. Sci. Hung. 18, 411-467.
J. Wu,
1990. Sur la suite des nombres premiers jumeaux, Acta Arith. 55, 365-394.
Index