Loading [MathJax]/extensions/TeX/mathchoice.js

Chebyshev's Mémoire sur les nombres premiers

 Chebyshev stamp
P. L. Chebyshev (1821-1894)

Mémoire sur les nombres premiers[1] is Chebyshev's great work of 1850, where he used ingenious arguments to find the order of π(x), the number of prime numbers less than or equal to x, and proved Bertrand's postulate, that there is always at least one prime number strictly between n and 2n2 for every integer n>3.[2] To this end, he introduced:
θ(x)=pxlogpψ(x)=θ(x)+θ(x1/2)+θ(x1/3)+,
where the first sum is taken over all primes less than or equal to the positive real number x. Note that the second sum has a finite number of terms, since for a given x,x1/n is eventually less than 2 and for such n,θ(x1/n)=0.

These functions seem mysterious and unmotivated but are the key to the puzzle.[3] If the object is to find the number of primes between α and β, where α<β, then θ(β)θ(α) is the sum of the logarithms of the primes p such that α<pβ and therefore:
(π(β)π(α))log(α)θ(β)θ(α)(π(β)π(α))log(β)θ(β)θ(α)logβπ(β)π(α)θ(β)θ(α)logα.
So finding bounds for θ(x) will lead to bounds for π(x). If β=2α2 in (1), for example, Bertrand's postulate follows if the left side of (1) is greater than zero. And α=2 will lead to bounds on π(β).

The strategy is:

• Use Stirling's formula to bound ψ(x).

• Use the bounds on ψ(x) to bound θ(x).

• Use the bounds on θ(x) to bound π(x).

Legendre's Formula

Legendre's formula[4] gives an expression for the highest power of a given prime dividing n! for any positive integer n. For a positive real number x, denote the greatest integer less than or equal to x by x; and for prime p, let νp(x) be the highest power of p dividing x. Then Legendre says:
νp(n!)=i=1npi.
Proof. The sum is actually finite since eventually pi>n, and from that point on the terms are all zero. Let n be a positive integer and assume p=2 for concreteness. Then there are exactly n/2 even numbers less than or equal to n, which account for that many factors of 2. But there are generally more — namely, the multiples of four, each of which provides another factor of 2, and there are n/22 of them contributing more factors of 2. Then there are the multiples of 8, which contribute n/23 more factors of 2. Continuing in this fashion accounts exactly for all the factors of 2 dividing n!. The argument is identical for any prime p, proving the theorem. QED.

Consider n=10 and p=2 to illustrate the effect, where 10!=3,628,800=2834527:
10!=2345678910102=5 multiples of 2 in 10!10!25=1325374951022=2 multiples of 4 in 10!10!27=1315372951023=1 multiple of 8 in 10!10!28=1315371951024=0 multiples of 16 in 10!
ν2(10!)=102+1022+1023=5+2+1=8.

Bringing in T(x):=log(x!)

The relevance of T(x) is not an entire mystery considering that Stirling's formula, an approximation for n!, is about to be brought in. The plot thickens with:

Theorem. T(x):=log(x!)=m=1ψ(xm).

Proof. Here too the sum is finite. Consider:
m=1k=1θ((xm)1/k)={θ(x)+θ(x1/2)+θ(x1/3)+θ(x1/4)+θ(x2)+θ((x2)1/2)+θ((x2)1/3)+θ((x2)1/4)+θ(x3)+θ((x3)1/2)+θ((x3)1/3)+θ((x3)1/4)+θ(x4)+θ((x4)1/2)+θ((x4)1/3)+θ((x4)1/4)+
Every entry in this table is of the form logp for some prime p, and T(x) is also the sum of logarithms of primes, so the task is to show that there are the same number of copies of logp in the table as there are in T(x), where p is any prime. logp is in column 1, row m exactly when px/m, that is, when mx/p. There are exactly x/p values of m for which this holds. In the same way, column k has x/pk values of m for which logp appears in the table. Summing over the columns gives the expression in Legendre's formula for the number of copies of logp in T(x). It follows that T(x) equals the double sum. Considering that the mth row of the table is ψ(x/m), this shows that:
T(x)=m=1ψ(xm).QED.

Leveraging Stirling's Formula

One form of Stirling's formula is:
2πn (ne)n<n!<2πn(ne)ne112nlog(2πn (ne)n)<logn!<log(2πn(ne)ne112n)log2πn+log(ne)n<logn!<log2πn+log(ne)n+loge112n12log2π+12logn+nlognn<logn!<12log2π+12logn+nlognn+112n
Replacing n with n+1 in the first inequality in (3) and subtracting log(n+1) from both sides results in:
12log2π+log(n+1)(n+12)(n+1)<logn!12log2π+(n+1){log(n+1)1}12log(n+1)<logn!.
n+1 can be replaced by anything less than n+1 on the left side of this last inequality considering that the function f(y)=y(logy1)12logy is increasing for y2. So putting n=x and considering that x<x+1 for any real x1:
12log2π+x(logx1)12logx<logn!=logx!=T(x).
The only possible issue is for 1x2, and all good in this range since the left side is negative there. The second inequality in (3) is only strengthened when replacing 112n by 112 and n by x, since 1x<x and the function g(y)=(y+12)logyy is increasing for y1. So:
T(x)<12log2π+x(logx1)+12logx+112.
(4) and (5) bound T(x) below and above by simple expressions involving logx.

S(x)=T(x)+T(x30)T(x2)T(x3)T(x5)

Next define S(x) as in the subhead immediately above and consider this sum, expanding each T() across a row by applying (2):
S(x)={ψ(x)+ψ(x2)+ψ(x3)+ψ(x4)++ψ(x30)+ψ(x230)+ψ(x330)+ψ(x430)+ψ(x2)ψ(x22)ψ(x32)ψ(x42)ψ(x3)ψ(x23)ψ(x33)ψ(x43)ψ(x5)ψ(x25)ψ(x35)ψ(x45)
Each of the terms in the expanded sum is of the form ψ(x/n), so:
S(x)=anψ(xn),
where an is an integer (possibly negative). There are three cases:

1) an=1 if n is relatively prime to each of 2,3, and 5. This is true because then ψ(x/n) appears in the first row and only the first row, since every denominator in lower rows is a multiple of at least one of 2,3, or 5.

2) an=0 if n is divisible by only one of 2,3, or 5. Suppose, for example, that n is divisible by 2 but not by 3 or 5. Then ψ(x/n) appears in the first row and the third row, but not the others, because the denominators in the other rows are divisible by at least one of 3 or 5. The two instances have different signs, so an=0. The same obtains if n is divisible only by 3 or only by 5.

3) an=1 in all other cases. One possibility here is that n is divisible by just two of 2,3, or 5. Suppose n is divisible by 2 and 3 but not by 5, for example. In that case, ψ(x/n) appears in the first, third, and fourth rows, but not the second or fifth. The first row contributes +ψ(x/n), while each of the third and fourth rows contribute ψ(x/n) so an=1 when the three are combined. The same effect prevails if n is divisible by any two of 2,3, or 5, but not the third. Another possibility is that n is divisible by 2,3, and 5. In this case, ψ(x/n) appears in each row and an=1 after cancellation.

It follows from these rules that the an repeat in cycles of 30. For 1n30, the an are as follows:
n123456789101112131415case122223122313123an100001100111101
n161718192021222324252627282930case213132213222213an011110011000011,
so S(x) can be written out:
S(x)=ψ(x)ψ(x6)+ψ(x7)ψ(x10)+ψ(x11)ψ(x12)+ψ(x13)+.
Note the alternating signs, which continue throughout. Since ψ(x) is non-negative and non-decreasing, this implies that:
ψ(x)ψ(x6)S(x)ψ(x),i.e., ψ(x)ψ(x6)T(x)+T(x30)T(x2)T(x3)T(x5)ψ(x).
Now (4) and (5) above can be brought in to replace each T(x/k) with expressions involving logx. The positive and negative terms will have to be handled differently:
T(x)+T(x30)>log2π+3130xlogx130xlog303130xlogx+12log30,T(x)+T(x30)<log2π+3130xlogx130xlog303130x+logx12log30+16,T(x2)+T(x3)+T(x5)>32log2π+3130xlogxBx3130x32logx+12log30,T(x2)+T(x3)+T(x5)<32log2π+3130xlogxBx3130x+32logx12log30+14,
where B=12log2+13log3+15log5. The first and last of these inequalities gives a lower bound for S(x)=T(x)+T(x/30)T(x/2) T(x/3)T(x/5) and the second and third give an upper bound for S(x):
x(B130log30)52logx+12log450π14<S(x),S(x)<x(B130log30)+52logx12log1800π+16.
The constant A=B130log30=12log2+13log3+15log5130log30=0.92129202293 is an important fixture in this development and appears in both lower and upper bounds, leading to the reframing:
Ax52logx+12log450π14<S(x)<Ax+52logx12log1800π+16,
which can be replaced by the weaker inequality
Ax52logx1<S(x)<Ax+52logx, for x1.

Bounding ψ(x)

Combining (6) and (7) leads to:
Ax52logx1<ψ(x),ψ(x)ψ(x6)<Ax+52logx.
The second inequality can be worked into an upper bound for ψ(x) with a telescoping strategy. Replacing x by x/6,x/62, and so on, results in:
ψ(x)ψ(x6)<Ax+52logxψ(x6)ψ(x62)<A(x6)+52log(x6)ψ(x62)ψ(x63)<A(x62)+52log(x62)ψ(x6n)ψ(x6n+1)<A(x6n)+52log(x6n)
Choosing n to be the smallest integer such that ψ(x/6n+1)=0 and adding these inequalities:
ψ(x)<Axnk=0(x6k)+52nk=0log(x6k)ψ(x)<Ax65+52nk=0log(x6k)
ψ(x/6n+1)=0 when x/6n+1<2. To get a sense of this, suppose x/6n+1=1, that is, n+1= logx/log6. Then the sum in (10) reduces after a series of calculations to:
54log6log2x+54logx.
This isn't going to be exact, if for no other reason than that n is an integer, but it leads to an upper bound by defining:
f(x)=65Ax+54log6log2x+54logx.
Note that:
f(x)f(x6)=65Ax+54log6log2x+54logx(65A(x6)+54log6log2(x6)+54log(x6))=Ax+54log6(log2xlog2(x6))+54(logxlog(x6))=Ax+54log6(logxlog(x6))(logx+log(x6))+54(logxlog(x6))=Ax+54log6(log6)(logx+log(x6))+54(log6)=Ax+54(2logxlog6)+54log6=Ax+52logx.
Combining this with (9) leads to:
ψ(x)ψ(x6)<f(x)f(x6)ie. ψ(x)f(x)<ψ(x6)f(x6).
Let m be the greatest integer such that x/6m1. Then 16x/6m+1<1, so ψ(x/6m+1)=0. Apply (11) to x,x/6,x/62,x/6m:
ψ(x)f(x)<ψ(x6)f(x6)ψ(x6)f(x6)<ψ(x62)f(x62)ψ(x62)f(x62)<ψ(x63)f(x63)ψ(x6m)f(x6m)<ψ(x6m+1)f(x6m+1)

 Chebyshev's f(u)

Chaining these together:
ψ(x)f(x)<ψ(x6m+1)f(x6m+1)<1,
considering that f(u)>1 for u>0. That is:
ψ(x)<f(x)+1=65Ax+54log6log2x+54logx+1.
Combining this with (8) leads to lower and upper bounds on ψ(x):
Ax52logx1<ψ(x)<65Ax+54log6log2x+54logx+1.

Bounding θ(x)

Now bounding θ(x) won't be too hard. Start with:
ψ(x)=θ(x)+θ(x1/2)+θ(x1/3)+θ(x1/4)+ψ(x1/2)=θ(x1/2)+θ(x1/4)+θ(x1/6)+θ(x1/8)+ψ(x)ψ(x1/2)=θ(x)+θ(x1/3)+θ(x1/5)+θ(x1/7)+
All the terms after θ(x) on the last line are non-negative, so:
θ(x)ψ(x)ψ(x1/2).
Similarly:
ψ(x)2ψ(x1/2)=θ(x){θ(x1/2)θ(x1/3)}{θ(x1/4)θ(x1/5)}+
Here each difference in braces is non-negative, so θ(x) is followed by a series of negative terms, implying:
ψ(x)2ψ(x1/2)θ(x).
Employing the lower bound on ψ(x1/2) and the upper bound on ψ(x):
ψ(x1/2)<A(x1/2)+52log(x1/2)+1=A(x1/2)+54logx+1θ(x)ψ(x)ψ(x1/2)<65AxAx1/2+54log6log2x+52logx+2.
And using the upper bound on ψ(x1/2) and the lower bound on ψ(x):
2ψ(x1/2)>265Ax1/2254log6log2(x1/2)254log(x1/2)21=125Ax1/258log6log2x54logx2θ(x)ψ(x)2ψ(x1/2)>Ax125Ax1/258log6log2x154logx3.
In summary:
θ(x)<θ1(x):=65AxAx1/2+54log6log2x+52logx+2,θ(x)>θ2(x):=Ax125Ax1/258log6log2x154logx3.

Proving Bertrand's Postulate

These bounds on θ(x) can be used to show the existence of prime numbers. Suppose 1<α<β. Then by (12) and (13):
θ(β)θ(α)>θ2(β)θ1(α),θ(β)θ(α)<θ1(β)θ2(α).
Now:
θ(β)θ(α)=α<pβlogplogβα<pβ1=logβ(π(β)π(α)).
A similar result obtains for the upper bound, so:
θ(β)θ(α)logβ<π(β)π(α)<θ(β)θ(α)logα,θ2(β)θ1(α)logβ<π(β)π(α)<θ1(β)θ2(α)logα.
So if k is an integer less than (θ2(β)θ1(α))/logβ, then there are more than k primes between α and β. That is:
klogβ<θ2(β)θ1(α) there are more than k primes p such that α<pβ.
(14) is the key and let's take a moment to reflect on what a remarkable statement it is. If k=0, it is only necessary to prove that θ2(β)θ1(α), an expression involving αs and βs and their square roots and logarithms, is positive in order to show that there is at least one prime number between α and β. Continuing:
θ2(β)θ1(α)=Aβ125Aβ1/258log6log2β154logβ365Aα+Aα1/254log6log2α52logα2=A(β65α)A(125β1/2α1/2)58log6(log2β+2log2α)54(3logβ+2logα)5
This can be turned into an inequality by leaving out the positive term Aα1/2. Also the log2α and logα can be replaced by log2β, and logβ respectively, since those terms are negative and α<β:
θ2(β)θ1(α)>A(β65α)125Aβ1/2158log6log2β254logβ5.
Therefore, in order to prove that there is a prime p with α<pβ, it is sufficient to show that the right side of (15) is positive:
0<A(β65α)125Aβ1/2158log6log2β254logβ5,ie. α<56β2β1/22516Alog6log2β12524Alogβ256A.
(16) holds when β is sufficiently larger than α. In fact:

Bertrand's Postulate: There is at least one prime number p such that α<p<2α2 for every integer α>3.

Proof. Let β=2α3 in (16) and rearrange to get:
α>32α3+7532Alog6log2(2α3)+12516Alog(2α3)+(154+256A).

 Chebyshev function for Bertrand's postulate

α eventually outstrips all of those slower growing functions of α on the right side of this inequality, so it holds whenever α is large enough. Turning the inequality into an equality and solving for α results in α=156.44, so Bertrand's postulate holds for values of α greater than that. It holds for smaller values of α by directly checking. QED.

A much sharper result is available from (16), for if β>65α, the same calculation as in the proof of Bertrand's postulate leads to the existence of at least one prime between α and β inclusive for all α such that:
α>Pα+Qlog2α+Rlogα+S,
where P,Q,R, and S are positive constants, and this inequality holds for sufficiently large α, whatever the constants. The threshold might be higher than 3 though as it is for Bertrand's postulate. It's true that there is at least one prime strictly between α and 54α for sufficiently large α, for example, but it's not true for smaller values of α. There is no prime strictly between α=23 and 54α=2834, so the threshold is at least 23.

Lower Bound for π(x)

Since 1800 or so, Legendre and Gauss had noticed through numerical work that π(x)x/logx and Chebyshev himself had proven that if π(x)/(x/logx) tends to a limit, then that limit is 1.[5] The goal here is to prove that there are positive constants C1 and C2 such that:
C1xlogx<π(x)<C2xlogx
for all x or perhaps all x greater than some threshold x0. The two statements are actually synonymous because if the second is true, then other constants can be produced accounting for the finite number of exceptions as well as all large x.

Chebyshev doesn't prove (17), though it follows from his analysis. For the lower bound, recall from (13) that:
θ(x)>θ2(x):=AxCx1/2Dlog2xElogxF,
where C,D,E,F are positive constants. Also:
θ(x)=pxlogplogxpx1=logxπ(x).θ2(x)logx<θ(x)logx<π(x)AxCx1/2Dlog2xElogxFlogx<π(x).Axlogxxlogx(Cx1/2x+Dlog2xx+Elogxx+Fx)<π(x).
Each summand in the parenthesized expression goes to zero as x goes to infinity, so the same is true of the sum. It follows that for every ϵ>0, there is an x0 such that:
(Aϵ)xlogx<π(x) for x>x0.
Considering that A=12log2+13log3+15log5130log30=0.92129202293, this shows that π(x) is not too much less than x/logx, relatively speaking.

Upper Bound for π(x)

The upper bound is harder to prove. Since x/logx is strictly increasing for x3 and π(x) is a non-decreasing step function changing value only at some integers (primes!), it suffices to prove the upper bound for integers. Start with this, for any integer n2:
π(n)=θ(2)θ(1)log2+θ(3)θ(2)log3+θ(4)θ(3)log4++θ(n)θ(n1)logn.
The equation holds because θ(k)θ(k1) is logk if k is prime and 0 if not, so the fraction is 1 if k is prime and 0 otherwise. The upshot is that the sum just counts the primes less than or equal to n. Rearranging (19) results in:
π(n)=θ(1)log2+(1log21log3)θ(2)++(1logn1log(n+1))θ(n)+θ(n)log(n+1)<θ2(1)log2+(1log21log3)θ1(2)++(1logn1log(n+1))θ1(n)+θ1(n)log(n+1)=θ1(1)log2θ2(1)log2+θ1(2)θ1(1)log2+θ1(3)θ1(2)log3++θ1(n)θ1(n1)logn=θ1(1)log2θ2(1)log2+nk=2θ1(k)θ1(k1)logk=K+nk=2θ1(k)θ1(k1)logk=K+nk=21logk(65AA(k1/2(k1)1/2)+G(log2klog2(k1))+H(logklog(k1))),(20)
where K,G and H are positive constants. There are four terms in the long parenthesized sum within the summation sign in (20), and I'll address them one by one as separate sums:

 Area under Li(x)

1)S1=nk=2(1logk65A)=65Ank=21logk.

To evaluate this sum, consider that:
nk=31logk<n2dtlogtnk=21logk<1log2+n2dtlogt=1log2+ Li(n),
where  Li(n) is defined as the indicated definite integral. It is a fact (proof here) that:
 Li(x)x/logx1 as xnk=21logkn/logn<1log2+ Li(n)n/logn1 as nS1<(65A+ϵ1)nlogn for large n for every ϵ1>0.
2)S2=nk=2(A(k1/2(k1)1/2)logk).

S2 can be left out when seeking an upper bound since it's negative.

3)S3=nk=2(G(log2klog2(k1))logk).

A larger sum results from replacing the logk in the denominator by log2, so:
S3<Glog2nk=2(log2klog2(k1)).
The sum telescopes and collapses to log2n, so:
S3<Glog2log2n=nlogn(lognnGlog2log2n)=nlogn(Glog2log3nn)S3<ϵ3nlogn for large n for every ϵ3>0,
considering that log3n/n0 as n.

4)S4=nk=2(H(logklog(k1))logk).

The analysis is identical to that for S3, so:
S4<ϵ4nlogn for large n for every ϵ4>0.
The leading K is not an obstacle because:
K<ϵ0nlogn for large n for every ϵ0>0.
With these substitutions, (20) becomes:
π(n)<K+S1+S3+S4<ϵ0nlogn+(65A+ϵ1)nlogn+ϵ3nlogn+ϵ4nlogn<(65A+ϵ0+ϵ1+ϵ3+ϵ4)nlogn=(65A+ϵ)nlogn for large n for every ϵ>0.
Combining (18) and (21) and keeping in mind that the upper bound can be converted from integers to real numbers:
(Aϵ)xlogx<π(x)<(65A+ϵ)xlogx for large x for every ϵ>0,
where A=0.92129202293 and 65A=1.10555042752.

Some Numerical Results

Here are π(x) and its proposed upper and lower bounds for powers of 2 up to 223, "proposed" since they've only been proven to bound π(x) beyond some unspecified threshold and up to a tolerance of ϵ.

x x/logx Ax/logx π(x) 65Ax/logx
2 2.9 2.7 1 3.2
4 2.9 2.7 2 3.2
8 3.8 3.5 4 4.3
16 5.8 5.3 6 6.4
32 9.2 8.5 11 10.2
64 15.4 14.2 18 17.0
128 26.4 24.3 31 29.2
256 46.2 42.5 54 51.0
512 82.1 75.6 97 90.7
1024 147.7 136.1 172 163.3
2048 268.6 247.5 309 297.0
4096 492.4 453.7 564 544.4
8192 909.1 837.6 1028 1005.1
16,384 1688.4 1555.5 1900 1866.6
32,768 3151.6 2903.6 3512 3484.3
65,536 5909.3 5444.2 6542 6533.0
131,072 11,123.3 10,247.9 12,251 12,297.4
262,144 21,010.8 19,357.1 23,000 23,228.5
524,288 39,809.9 36,676.5 43,390 44,011.8
1,048,576 75,638.8 69,685.4 82,025 83,622.5
2,097,152 144,073.8 132,734.1 155,611 159,280.9
4,194,304 275,050.1 253,401.4 295,947 304,081.7
8,388,608 526,182.7 484,768.0 564,163 581,721.6

This table suggests that Ax/logx<π(x) except for a few small values of x and indeed this is the case for all x>10. The proposed upper bound runs closer to π(x) but slightly below it until x=131,072 in the chart. In fact the threshold is x=96,097, above which π(x)<65Ax/logx. π(x) continues to follow the upper bound closely until x=1,000,000 or so.

Simpler Approaches

Although complicated, Chebyshev's approach uses elementary tools and provides constants close to 1. Much simpler methods are available if the goal is simply to provide positive constants C1 and C2 in (17). For example, Tom Apostol uses the binomial coefficient \binom{2n}{n} to show that[6]:
\frac{1}{6} {{n} \over {\log{n}}} < \pi(n) < 6 {{n} \over {\log{n}}} \text{ for all integers } n > 2.
It does make sense that \binom{2n}{n} = \frac{(2n)!}{n! \cdot n!} figures into these matters, since \binom{2n}{n} contains all primes between n and 2n in its factorization. That's because every such prime divides (2n)! but does not divide n!. For example:
\binom{20}{10} = \frac{(20)!}{10! \cdot 10!} = 2^2 \cdot 11 \cdot 13 \cdot 17 \cdot 19.
Paul Pollack proves (17) very much in Chebyshev's spirit[7], without specifying the constants however. He too use what amounts to \binom{2n}{n}, considering that:
\begin{align} \log{\binom{2n}{n}} &= \log{\frac{(2n)!}{n! \cdot n!}}\\[1em] &=\log{(2n)!} - 2\log{n!}\\[1em] &=T(2n) - 2T(n), \end{align}
where T(\cdot) is Chebyshev's function defined above. Instead of S(x), Pollack uses Q(x) = T(x) - 2T(x/2) and works it into a proof of Bertrand's postulate as well.

Chebyshev's Accomplishment

A quick glance at Apostol or Pollock makes it abundantly clear that the details of Chebyshev's approach have long since been abandoned. Its spirit lives on however, as do \theta(x) and \psi(x). Who would have thought that these analytical processes involving logarithms and continuous real functions would be the key to understanding the highly unruly distribution of prime numbers? This is part of a large subject today, analytic number theory — applying analysis to the study of integers — and Chebyshev was an early founder. (17) is generally taken as the first step to:

Prime Number Theorem: \displaystyle \lim_{x \rightarrow \infty} {{\pi(x)}\over{x/\log{x}}} = 1.

The Prime Number Theorem, one of the gems of modern mathematics, was proven independently by Hadamard and de la Vallée Poussin in 1896 using complex number theory and then in 1949 by Selberg and Erdős using "elementary" methods (no complex number theory). Chebyshev showed the way.

Mike Bertrand

March 14, 2023


^ 1. "Mémoire sur les nombres premiers", by P. L. Chebyshev, Journal de mathématiques pures et appliquées 1re série, tome 17 (1852), pp. 366-390. The paper was originally read to the St. Petersburg Academy of Sciences in September 1850. It also appears in the Oeuvres here, but care is in order with that edition, which includes at least one set of egregious typos in the important expressions (11) on p. 70. About half of the Mémoire was translated into English by J. D. Tamarkin and is included in A Source Book in Mathematics, Vol 1, by David Eugene Smith, McGraw-Hill (1929). There are some less grievous typos, like putting the inequality sign the wrong way.

^ 2. "Mémoire sur le nombre de valeurs que peut prendre une fonction quand on y permute les lettres qu'elle renferme" (Memoir on the number of values that a function can take when the letters it contains are permuted), by Joseph Bertrand, Journal De L'Ecole Royale Polytechnique, Tome I (1845), pp. 123-140. See p. 129 for the postulate.

^ 3. Some nineteenth century works helpfully reprise the Mémoire up to Bertrand's postulate. See Theory of Numbers, Part I, by G. B. Mathews, Deighton, Bell and Co. (1892), pp. 278-287. Joseph Serret closely follows Chebyshev in his influential Cours d'algèbre supérieure, quatrième éd., Gauthier-Villars (1879), pp. 225-239. This textbook was first published in 1849 and much updated over the years.

^ 4. See Théorie des nombres, troisième éd., by Adrien-Marie Legendre, Didot Frères (1830). His formula is on p. 10. He gives the example:
\begin{align} \displaystyle \nu _{7}(10,000!) &= \left\lfloor {\frac {10,000}{7}}\right\rfloor + \left\lfloor {\frac {10,000}{7^{2}}}\right\rfloor + \left\lfloor {\frac {10,000}{7^{3}}}\right\rfloor + \left\lfloor {\frac {10,000}{7^{4}}}\right\rfloor + \left\lfloor {\frac {10,000}{7^{5}}}\right\rfloor\\[1em] &= 1428 + 204 + 29 + 4 + 0\\[1em] &= 1665. \end{align}
^ 5. See "Sur la totalité des nombres premiers inférieurs à une limite donnée", by P. L. Chebyshev, Journal de mathématiques pures et appliquées 1re série, tome 17 (1852), pp. 341-365. The paper was originally read to the St. Petersburg Academy of Sciences in May 1848. There is an English translation in the source book cited in footnote 1.

^ 6. Introduction to Analytic Number Theory, by Tom Apostol, Springer (1976), ISBN 0-387-90163-9, p. 82ff.

^ 7. Not Always Buried Deep: A Second Course in Analytic Number Theory, by Paul Pollack, American Mathematical Society (2009), ISBN 978-8218-4880-7, pp. 89-95.