Baker's theorem

In transcendental number theory, a mathematical discipline, Baker's theorem gives a lower bound for the absolute value of linear combinations of logarithms of algebraic numbers. Nearly fifteen years earlier, Alexander Gelfond had considered the problem with only integer coefficients to be of "extraordinarily great significance".^[1] The result, proved by Alan Baker (1966, 1967a, 1967b), subsumed many earlier results in transcendental number theory. Baker used this to prove the transcendence of many numbers, to derive effective bounds for the solutions of some Diophantine equations, and to solve the class number problem of finding all imaginary quadratic fields with class number 1.

History

To simplify notation, let $\mathbb {L}$ be the set of logarithms to the base e of nonzero algebraic numbers, that is $\mathbb {L} =\left\{\lambda \in \mathbb {C} :\ e^{\lambda }\in {\overline {\mathbb {Q} }}\right\},$ where $\mathbb {C}$ denotes the set of complex numbers and ${\overline {\mathbb {Q} }}$ denotes the algebraic numbers (the algebraic closure of the rational numbers $\mathbb {Q}$ ). Using this notation, several results in transcendental number theory become much easier to state. For example the Hermite–Lindemann theorem becomes the statement that any nonzero element of $\mathbb {L}$ is transcendental.

In 1934, Alexander Gelfond and Theodor Schneider independently proved the Gelfond–Schneider theorem. This result is usually stated as: if $a$ is algebraic and not equal to 0 or 1, and if $b$ is algebraic and irrational, then $a^{b}$ is transcendental. The exponential function is multi-valued for complex exponents, and this applies to all of its values, which in most cases constitute infinitely many numbers. Equivalently, though, it says that if $\lambda _{1},\lambda _{2}\in \mathbb {L}$ are linearly independent over the rational numbers, then they are linearly independent over the algebraic numbers. So if $\lambda _{1},\lambda _{2}\in \mathbb {L}$ and $\lambda _{2}$ is not zero, then the quotient $\lambda _{1}/\lambda _{2}$ is either a rational number or transcendental. It cannot be an algebraic irrational number like ${\sqrt {2}}$ .

Although proving this result of "rational linear independence implies algebraic linear independence" for two elements of $\mathbb {L}$ was sufficient for his and Schneider's result, Gelfond felt that it was crucial to extend this result to arbitrarily many elements of $\mathbb {L} .$ Indeed, from Gel'fond (1960, p. 177):

...one may assume ... that the most pressing problem in the theory of transcendental numbers is the investigation of the measures of transcendence of finite sets of logarithms of algebraic numbers.

This problem was solved fourteen years later by Alan Baker and has since had numerous applications not only to transcendence theory but in algebraic number theory and the study of Diophantine equations as well. Baker received the Fields medal in 1970 for both this work and his applications of it to Diophantine equations.

Statement

With the above notation, Baker's theorem is a nonhomogeneous generalization of the Gelfond–Schneider theorem. Specifically it states:

Baker's Theorem — If $\lambda _{1},\ldots ,\lambda _{n}\in \mathbb {L}$ are linearly independent over the rational numbers, then for any algebraic numbers $\beta _{0},\ldots ,\beta _{n},$ not all zero, we have $\left|\beta _{0}+\beta _{1}\lambda _{1}+\cdots +\beta _{n}\lambda _{n}\right|>H^{-C}$ where H is the maximum of the heights of $\beta _{i}$ and C is an effectively computable number depending on n, $\lambda _{i}$ and the maximum d of the degrees of $\beta _{i}.$ (If β₀ is nonzero then the assumption that $\lambda _{i}$ are linearly independent can be dropped.) In particular this number is nonzero, so 1 and $\lambda _{i}$ are linearly independent over the algebraic numbers.

Just as the Gelfond–Schneider theorem is equivalent to the statement about the transcendence of numbers of the form a^b, so too Baker's theorem implies the transcendence of numbers of the form

a_{1}^{b_{1}}\cdots a_{n}^{b_{n}},

where the b_i are all algebraic, irrational, and 1, b₁, ..., b_n are linearly independent over the rationals, and the a_i are all algebraic and not 0 or 1.

Baker (1977) also gave several versions with explicit constants. For example, if $\exp(\lambda _{j})=\alpha _{j}$ has height at most $A_{j}\geq 4$ and all the numbers $\beta _{j}$ have height at most $B\geq 4$ then the linear form

\Lambda =\beta _{0}+\beta _{1}\lambda _{1}+\cdots +\beta _{n}\lambda _{n}

is either 0 or satisfies

\log |\Lambda |>(16nd)^{200n}\Omega \left(\log \Omega -\log \log A_{n}\right)(\log B+\log \Omega )

where

\Omega =\log A_{1}\log A_{2}\cdots \log A_{n}

and the field generated by $\alpha _{i}$ and $\beta _{i}$ over the rationals has degree at most d. In the special case when β₀ = 0 and all the $\beta _{j}$ are rational integers, the rightmost term log Ω can be deleted.

An explicit result by Baker and Wüstholz for a linear form Λ with integer coefficients yields a lower bound of the form

\log |\Lambda |>-Ch(\alpha _{1})h(\alpha _{2})\cdots h(\alpha _{n})\log \left(\max \left\{|\beta _{1}|,\ldots ,|\beta _{n}|\right\}\right),

where

C=18(n+1)!n^{n+1}(32d)^{n+2}\log(2nd),

and d is the degree of the number field generated by the $\alpha _{i}.$

Baker's method

Baker's proof of his theorem is an extension of the argument given by Gel'fond (1960, chapter III, section 4). The main ideas of the proof are illustrated by the proof of the following qualitative version of the theorem of Baker (1966) described by Serre (1971):

If the numbers

2\pi i,\log a_{1},\ldots ,\log a_{n}

are linearly independent over the rational numbers, for nonzero algebraic numbers

a_{1},\ldots ,a_{n},

then they are linearly independent over the algebraic numbers.

The precise quantitative version of Baker's theory can be proved by replacing the conditions that things are zero by conditions that things are sufficiently small throughout the proof.

The main idea of Baker's proof is to construct an auxiliary function $\Phi (z_{1},\ldots ,z_{n-1})$ of several variables that vanishes to high order at many points of the form $z_{1}=\cdots =z_{n-1}=l,$ then repeatedly show that it vanishes to lower order at even more points of this form. Finally the fact that it vanishes (to order 1) at enough points of this form implies using Vandermonde determinants that there is a multiplicative relation between the numbers a_i.

Construction of the auxiliary function

Assume there is a relation

\beta _{1}\log \alpha _{1}+\cdots +\beta _{n-1}\log \alpha _{n-1}=\log \alpha _{n}

for algebraic numbers α₁, ..., α_n, β₁, ..., β_n−1. The function Φ is of the form

\Phi (z_{1},\ldots ,z_{n-1})=\sum _{\lambda _{1}=0}^{L}\cdots \sum _{\lambda _{n}=0}^{L}p(\lambda _{1},\ldots ,\lambda _{n})\alpha _{1}^{(\lambda _{1}+\lambda _{n}\beta _{1})z_{1}}\cdots \alpha _{n-1}^{(\lambda _{n-1}+\lambda _{n}\beta _{n-1})z_{n-1}}

The integer coefficients p are chosen so that they are not all zero and Φ and its derivatives of order at most some constant M vanish at $z_{1}=\cdots =z_{n-1}=l,$ for integers $l$ with $0\leq l\leq h$ for some constant h. This is possible because these conditions are homogeneous linear equations in the coefficients p, which have a non-zero solution provided the number of unknown variables p is larger than the number of equations. The linear relation between the logs of the α's is needed to cut down the number of linear equations that have to be satisfied. Moreover, using Siegel's lemma, the sizes of the coefficients p can be chosen to be not too large. The constants L, h, and M have to be carefully adjusted so that the next part of the proof works, and are subject to some constraints, which are roughly:

L must be somewhat smaller than M to make the argument about extra zeros below work.
A small power of h must be larger than L to make the final step of the proof work.
Lⁿ must be larger than about Mⁿ⁻¹h in order that it is possible to solve for the coefficients p.

The constraints can be satisfied by taking h to be sufficiently large, M to be some fixed power of h, and L to be a slightly smaller power of h. Baker took M to be about h² and L to be about h^2−1/2n.

The linear relation between the logarithms of the α's is used to reduce L slightly; roughly speaking, without it the condition Lⁿ must be larger than about Mⁿ⁻¹h would become Lⁿ must be larger than about Mⁿh, which is incompatible with the condition that L is somewhat smaller than M.

Zeros of the auxiliary function

The next step is to show that Φ vanishes to slightly smaller order at many more points of the form $z_{1}=\cdots =z_{n-1}=l$ for integers l. This idea was Baker's key innovation: previous work on this problem involved trying to increase the number of derivatives that vanish while keeping the number of points fixed, which does not seem to work in the multivariable case. This is done by combining two ideas; First one shows that the derivatives at these points are quite small, by using the fact that many derivatives of Φ vanish at many nearby points. Then one shows that derivatives of Φ at this point are given by algebraic integers times known constants. If an algebraic integer has all its conjugates bounded by a known constant, then it cannot be too small unless it is zero, because the product of all conjugates of a nonzero algebraic integer is at least 1 in absolute value. Combining these two ideas implies that Φ vanishes to slightly smaller order at many more points $z_{1}=\cdots =z_{n-1}=l.$ This part of the argument requires that Φ does not increase too rapidly; the growth of Φ depends on the size of L, so requires a bound on the size of L, which turns out to be roughly that L must be somewhat smaller than M. More precisely, Baker showed that since Φ vanishes to order M at h consecutive integers, it also vanishes to order M/2 at h^1+1/8n consecutive integers 1, 2, 3, .... Repeating this argument J times shows that Φ vanishes to order M/2^J at h^1+J/8n points, provided that h is sufficiently large and L is somewhat smaller than M/2^J.

One then takes J large enough that:

h^{1+{\frac {J}{8n}}}>(L+1)^{n}.

(J larger than about 16n will do if h² > L) so that:

\forall l\in \left\{1,2,\ldots ,(L+1)^{n}\right\}:\qquad \Phi (l,\ldots ,l)=0.

Completion of the proof

By definition $\Phi (l,\ldots ,l)=0$ can be written as:

\sum _{\lambda _{1}=0}^{L}\cdots \sum _{\lambda _{n}=0}^{L}p(\lambda _{1},\ldots ,\lambda _{n})\alpha _{1}^{\lambda _{1}l}\cdots \alpha _{n}^{\lambda _{n}l}=0.

Therefore as l varies we have a system of (L + 1)ⁿ homogeneous linear equations in the (L + 1)ⁿ unknowns which by assumption has a non-zero solution, which in turn implies the determinant of the matrix of coefficients must vanish. However this matrix is a Vandermonde matrix and the formula for the determinant of such a matrix forces an equality between two of the values:

\alpha _{1}^{\lambda _{1}}\cdots \alpha _{n}^{\lambda _{n}}

so $\alpha _{1},\ldots ,\alpha _{n}$ are multiplicatively dependent. Taking logs shows that $2\pi i,\log \alpha _{1},\ldots ,\log \alpha _{n}$ are linearly dependent over the rationals.

Extensions and generalizations

Baker (1966) in fact gave a quantitative version of the theorem, giving effective lower bounds for the linear form in logarithms. This is done by a similar argument, except statements about something being zero are replaced by statements giving a small upper bound for it, and so on.

Baker (1967a) showed how to eliminate the assumption about 2πi in the theorem. This requires a modification of the final step of the proof. One shows that many derivatives of the function $\phi (z)=\Phi (z,\ldots ,z)$ vanish at z = 0, by an argument similar to the one above. But these equations for the first (L+1)ⁿ derivatives again give a homogeneous set of linear equations for the coefficients p, so the determinant is zero, and is again a Vandermonde determinant, this time for the numbers λ₁ log α₁ + ⋯ + λ_n log α_n. So two of these expressions must be the same which shows that log α₁,...,log α_n are linearly dependent over the rationals.

Baker (1967b) gave an inhomogeneous version of the theorem, showing that

\beta _{0}+\beta _{1}\log \alpha _{1}+\cdots +\beta _{n}\log \alpha _{n}

is nonzero for nonzero algebraic numbers β₀, ..., β_n, α₁, ..., α_n, and moreover giving an effective lower bound for it. The proof is similar to the homogeneous case: one can assume that

\beta _{0}+\beta _{1}\log \alpha _{1}+\cdots +\beta _{n-1}\log \alpha _{n-1}=\log \alpha _{n}

and one inserts an extra variable z₀ into Φ as follows:

\Phi (z_{0},\ldots ,z_{n-1})=\sum _{\lambda _{0}=0}^{L}\cdots \sum _{\lambda _{n}=0}^{L}p(\lambda _{0},\ldots ,\lambda _{n})z_{0}^{\lambda _{0}}e^{\lambda _{n}\beta _{0}z_{0}}\alpha _{1}^{(\lambda _{1}+\lambda _{n}\beta _{1})z_{1}}\cdots \alpha _{n-1}^{(\lambda _{n-1}+\lambda _{n}\beta _{n-1})z_{n-1}}

Corollaries

As mentioned above, the theorem includes numerous earlier transcendence results concerning the exponential function, such as the Hermite–Lindemann theorem and Gelfond–Schneider theorem. It is not quite as encompassing as the still unproven Schanuel's conjecture, and does not imply the six exponentials theorem nor, clearly, the still open four exponentials conjecture.

The main reason Gelfond desired an extension of his result was not just for a slew of new transcendental numbers. In 1935 he used the tools he had developed to prove the Gelfond–Schneider theorem to derive a lower bound for the quantity

|\beta _{1}\lambda _{1}+\beta _{2}\lambda _{2}|

where β₁ and β₂ are algebraic and λ₁ and λ₂ are in $\mathbb {L}$ .^[2] Baker's proof gave lower bounds for quantities like the above but with arbitrarily many terms, and he could use these bounds to develop effective means of tackling Diophantine equations and to solve Gauss' class number problem.

Extensions

Baker's theorem grants us the linear independence over the algebraic numbers of logarithms of algebraic numbers. This is weaker than proving their algebraic independence. So far no progress has been made on this problem at all. It has been conjectured^[3] that if λ₁, ..., λ_n are elements of $\mathbb {L}$ that are linearly independent over the rational numbers, then they are algebraically independent too. This is a special case of Schanuel's conjecture, but so far it remains to be proved that there even exist two algebraic numbers whose logarithms are algebraically independent. Indeed, Baker's theorem rules out linear relations between logarithms of algebraic numbers unless there are trivial reasons for them; the next most simple case, that of ruling out homogeneous quadratic relations, is the still open four exponentials conjecture.

Similarly, extending the result to algebraic independence but in the p-adic setting, and using the p-adic logarithm function, remains an open problem. It is known that proving algebraic independence of linearly independent p-adic logarithms of algebraic p-adic numbers would prove Leopoldt's conjecture on the p-adic ranks of units of a number field.

Notes

^ See the final paragraph of Gel'fond (1960).
^ See Gel'fond (1960) and Sprindžuk (1993) for details.
^ Waldschmidt (2000), conjecture 1.15.

References

Baker, Alan (1966), "Linear forms in the logarithms of algebraic numbers. I", Mathematika, 13 (2): 204–216, doi:10.1112/S0025579300003971, ISSN 0025-5793, MR 0220680
Baker, Alan (1967a), "Linear forms in the logarithms of algebraic numbers. II", Mathematika, 14: 102–107, doi:10.1112/S0025579300008068, ISSN 0025-5793, MR 0220680
Baker, Alan (1967b), "Linear forms in the logarithms of algebraic numbers. III", Mathematika, 14 (2): 220–228, doi:10.1112/S0025579300003843, ISSN 0025-5793, MR 0220680
Baker, Alan (1990), Transcendental number theory, Cambridge Mathematical Library (2nd ed.), Cambridge University Press, ISBN 978-0-521-39791-9, MR 0422171
Baker, Alan (1977), "The theory of linear forms in logarithms", Transcendence theory: advances and applications (Proc. Conf., Univ. Cambridge, Cambridge, 1976), Boston, MA: Academic Press, pp. 1–27, ISBN 978-0-12-074350-6, MR 0498417
Baker, A.; Wüstholz, G. (1993), "Logarithmic forms and group varieties", Journal für die reine und angewandte Mathematik, 1993 (442): 19–62, doi:10.1515/crll.1993.442.19, MR 1234835, S2CID 118335888.
Baker, Alan; Wüstholz, G. (2007), Logarithmic forms and Diophantine geometry, New Mathematical Monographs, vol. 9, Cambridge University Press, ISBN 978-0-521-88268-2, MR 2382891
Gel'fond, A. O. (1960) [1952], Transcendental and algebraic numbers, Dover Phoenix editions, New York: Dover Publications, ISBN 978-0-486-49526-2, MR 0057921
Serre, Jean-Pierre (1971) [1969], "Travaux de Baker (Exposé 368)", Séminaire Bourbaki. Vol. 1969/70: Exposés 364--381, Lecture Notes in Mathematics, vol. 180, Berlin, New York: Springer-Verlag, pp. 73–86
Sprindžuk, Vladimir G. (1993), Classical Diophantine equations, Lecture Notes in Mathematics, vol. 1559, Berlin, New York: Springer-Verlag, doi:10.1007/BFb0073786, ISBN 978-3-540-57359-3, MR 1288309
Waldschmidt, Michel (2000), Diophantine approximation on linear algebraic groups, Grundlehren der Mathematischen Wissenschaften, vol. 326, Berlin, New York: Springer-Verlag, doi:10.1007/978-3-662-11569-5, ISBN 978-3-540-66785-8, MR 1756786

[1] See the final paragraph of Gel'fond (1960).

[2] See Gel'fond (1960) and Sprindžuk (1993) for details.

[FOOTNOTEWaldschmidt2000conjecture_1.15-3] Waldschmidt (2000), conjecture 1.15.

[1]

[2]

[3]