2 Bertrand’s postulate
For any positive natural number, there is a prime which is greater than it, but no more than twice as large.
We will estimate the size of the binomial coefficient \(\binom {2n}{n}\) carefully enough to see that if it didn’t have any prime factors in the range \(n {\lt} p \le 2n\), then it would be “too small.” Our argument is in five steps.
We first prove Bertrand’s postulate for \(n \le 511\). For this one does not need to check 511 cases: it suffices (this is “Landau’s trick”) to check that
\[ 2,3,5,7,13,23,43,83,163,317,521 \]is a sequence of prime numbers, where each is smaller than twice the previous one. Hence every interval \(\{ y : n {\lt} y \le 2n\} \), with \(n \le 511\), contains one of these 11 primes.
Next we prove that
\begin{equation} \label{eq:prod_prime_le} \prod _{p \le x} p \le 4^{x-1} \quad \text{for all real } x \ge 2, \end{equation}1where our notation — here and in the following — is meant to imply that the product is taken over all prime numbers \(p \le x\). The proof that we present for this fact uses induction on the number of these primes. It is not from Erdős’ original paper, but it is also due to Erdős, and it is a true Book Proof. First we note that if \(q\) is the largest prime with \(q \le x\), then
\[ \prod _{p \le x} p = \prod _{p \le q} p \quad \text{and} \quad 4^{q-1} \le 4^{x-1}. \]Thus it suffices to check ?? for the case where \(x=q\) is a prime number. For \(q=2\) we get “\(2 \le 4\),” so we proceed to consider odd primes \(q = 2m + 1\). (Here we may assume, by induction, that ?? is valid for all integers \(x\) in the set \(\{ 2, 3, \dots , 2m\} \).) For \(q = 2m + 1\) we split the product and compute
\[ \prod _{p \le 2m + 1} p = \prod _{p \le m + 1} p \cdot \prod _{m + 1 {\lt} p \le 2m + 1} p \le 4^m \binom {2m + 1}{m} \le 4^m 2^{2m} = 4^{2m}. \]All the pieces of this “one-line computation” are easy to see. In fact,
\[ \prod _{p \le m + 1} p \le 4^m \]holds by induction. The inequality
\[ \prod _{m + 1 {\lt} p \le 2m + 1} p \le \binom {2m + 1}{m} \]follows from the observation that \(\binom {2m + 1}{m} = \frac{(2m + 1)!}{m! (m + 1)!}\) is an integer, where the primes that we consider all are factors of the numerator \((2m + 1)!\), but not of the denominator \(m! (m + 1)!\). Finally
\[ \binom {2m + 1}{m} \le 2^{2m} \]holds since
\[ \binom {2m + 1}{m} \quad \text{and} \quad \binom {2m + 1}{m + 1} \]are two (equal!) summands that appear in
\[ \sum _{k = 0}^{2m + 1} \binom {2m + 1}{k} = 2^{2m + 1}. \]From Legendre’s theorem we get that \(\binom {2n}{n} = \frac{(2n)!}{n! n!}\) contains the prime factor \(p\) exactly
\[ \sum _{k \geq 1} \left( \left\lfloor \frac{2n}{p^k} \right\rfloor - 2 \left\lfloor \frac{n}{p^k} \right\rfloor \right) \]times. Here each summand is at most 1, since it satisfies
\[ \left\lfloor \frac{2n}{p^k} \right\rfloor - 2 \left\lfloor \frac{n}{p^k} \right\rfloor {\lt} \frac{2n}{p^k} - 2 \left( \frac{n}{p^k} - 1 \right) = 2, \]and it is an integer. Furthermore the summands vanish whenever \(p^k {\gt} 2n\). Thus \(\binom {2n}{n}\) contains \(p\) exactly
\[ \sum _{k \geq 1} \left( \left\lfloor \frac{2n}{p^k} \right\rfloor - 2 \left\lfloor \frac{n}{p^k} \right\rfloor \right) \le \max \{ r : p^r \le 2n\} \]times. Hence the largest power of \(p\) that divides \(\binom {2n}{n}\) is not larger than \(2n\). In particular, primes \(p {\gt} \sqrt{2n}\) appear at most once in \(\binom {2n}{n}\).
Furthermore — and this, according to Erdős, is the key fact for his proof — primes \(p\) that satisfy \(\frac{2}{3} n {\lt} p {\lt} n\) do not divide \(\binom {2n}{n}\) at all! Indeed, \(3p {\gt} 2n\) implies (for \(n \geq 3\), and hence \(p \geq 3\)) that \(p\) and \(2p\) are the only multiples of \(p\) that appear as factors in the numerator of \(\frac{(2n)!}{n! n!}\), while we get two \(p\)-factors in the denominator.
Now we are ready to estimate \(\binom {2n}{n}\), benefitting from a suggestion by Raimund Seidel, which nicely improves Erdős’ original argument. For \(n \ge 3\), using an estimate for the lower bound, we get
\[ \frac{4^n}{2n} \le \binom {2n}{n} \le \prod _{p \le \sqrt{2n}} 2n \cdot \prod _{\sqrt{2n} {\lt} p \le \frac{2}{3}n} p \cdot \prod _{n {\lt} p \le 2n} p. \]Now, there are no more than \(\sqrt{2n}\) primes in the first factor; hence using (1) for the second factor and letting \(P(n)\) denote the number of primes between \(n\) and \(2n\) we get
\[ \frac{4^n}{2n} {\lt} ((2n)^{\sqrt{2n}}) \cdot (4^{\frac{2}{3}n}) \cdot (2n)^{P(n)}, \]that is,
\[ 4^{\frac{n}{3}} {\lt} (2n)^{\sqrt{2n}+1+P(n)}. \quad \text{(2)} \]Taking the logarithm to base 2, the last inequality is turned into
\[ P(n) {\gt} \frac{2n}{3 \log _2(2n)} - (\sqrt{2n} + 1). \quad \text{(3)} \]It remains to verify that the right-hand side of (3) is positive for \(n\) large enough. We show that this is the case for \(n = 2^9 = 512\) (actually, it holds for \(n = 468\) onward). By writing \(2n - 1 = (\sqrt{2n} - 1)(\sqrt{2n} + 1)\) and cancelling the \((\sqrt{2n} + 1)\)-factor it suffices to show
\[ \sqrt{2n}-1 {\gt} 3 \log _2(2n) \quad \text{for } n \ge 2^9. \quad \text{(4)} \]For \(n = 2^9\), (4) becomes \(31 {\gt} 30\), and comparing the derivatives \((\sqrt{x}-1)' = \frac{1}{2} \frac{1}{\sqrt{x}}\) and \((3 \log _2 x)' = \frac{3}{\log 2} \frac{1}{x}\) we see that \(\sqrt{x} - 1\) grows faster than \(3 \log _2 x\) for \(x {\gt} (\frac{6}{\log 2})^2 \approx 75\) and thus certainly for \(x \ge 2^{10} = 1024.\) □
2.1 Appendix: Some estimates
For all \(n \in \mathbb {N}\)
There is a very simple-but-effective method of estimating sums by integrals. For estimating the harmonic numbers
we draw the figure and derive from it
by comparing the area below the graph of \(f(t) = \frac{1}{t} (1 \le t \le n)\) with the area of the dark shaded rectangles, and
by comparing with the area of the large rectangles (including the lightly shaded parts). Taken together, this yields
For all \(n \in \mathbb {N}\)
The same method applied to
yields
where the integral is easily computed:
Thus we get a lower estimate on \(n!\)
and at the same time an upper estimate