Convergence proof techniques

Convergence proof techniques are canonical patterns of mathematical proofs that sequences or functions converge to a finite limit when the argument tends to infinity.

There are many types of sequences and modes of convergence, and different proof techniques may be more appropriate than others for proving each type of convergence of each type of sequence. Below are some of the more common and typical examples. This article is intended as an introduction aimed to help practitioners explore appropriate techniques. The links below give details of necessary conditions and generalizations to more abstract settings. Proof techniques for the convergence of series, a particular type of sequences corresponding to sums of many terms, are covered in the article on convergence tests.

Convergence in Rn

edit

It is common to want to prove convergence of a sequence   or function  , where   and   refer to the natural numbers and the real numbers, respectively, and convergence is with respect to the Euclidean norm,  .

Useful approaches for this are as follows.

First principles

edit

The analytic definition of convergence of   to a limit   is that[1] for all   there exists a   such for all  ,  . The most direct proof technique from this definition is to find such a   and prove the required inequality. If the value of   is not known in advance, the techniques below may be useful.

Contraction mappings

edit

In many cases, the function whose convergence is of interest has the form   for some transformation  . For example,   could map   to   for some conformable matrix  , so that  , a matrix generalization of the geometric progression. Alternatively,   may be an elementwise operation, such as replacing each element of   by the square root of its magnitude.

In such cases, if the problem satisfies the conditions of Banach fixed-point theorem (the domain is a non-empty complete metric space) then it is sufficient to prove convergence to prove that   is a contraction mapping to prove that it has a fixed point. This requires that  for some constant   which is fixed for all   and  . The composition of two contraction mappings is a contraction mapping, so if  , then it is sufficient to show that   and   are both contraction mappings.

Example

edit

Famous examples of applications of this approach include

  • If   has the form   for some matrices   and  , then   converges to   if the magnitudes of all eigenvalues of   are less than 1[citation needed].

Non-expansion mappings

edit

If both above inequalities in the definition of a contraction mapping are weakened from "strictly less than" to "less than or equal to", the mapping is a non-expansion mapping. It is not sufficient to prove convergence to prove that   is a non-expansion mapping. For example,   is a non-expansion mapping, but the sequence   does not converge for any  . However, the composition of a contraction mapping and a non-expansion mapping (or vice versa) is a contraction mapping.

Contraction mappings on limited domains

edit

If   is not a contraction mapping on its entire domain, but it is on its codomain (the image of the domain), that is also sufficient for convergence. This also applies for decompositions. For example, consider  . The function   is not a contraction mapping, but it is on the restricted domain  , which is the codomain of   for real arguments. Since   is a non-expansion mapping, this implies   is a contraction mapping.

Convergent subsequences

edit

Every bounded sequence in   has a convergent subsequence, by the Bolzano–Weierstrass theorem. If these subsequences all have the same limit, then the original sequence also converges to that limit. If it can be shown that all of the subsequences of   must have the same limit, such as by showing that there is a unique fixed point of the transformation   and that there are no invariant sets of   that contain no fixed points of  , then the initial sequence must also converge to that limit.

Monotonicity (Lyapunov functions)

edit

Every bounded monotonic sequence in   converges to a limit.

This fact can be used directly and can also be used to prove the convergence of sequences that are not monotonic using techniques and theorems named for Aleksandr Lyapunov. In these cases, one defines a function   such that   is monotonic in   and thus   converges. If   satisfies the conditions to be a Lyapunov function then Lyapunov's theorem implies that   is also convergent. Lyapunov's theorem is normally stated for ordinary differential equations, but it can also be applied to sequences of iterates by replacing derivatives with discrete differences.

The basic requirements on   to be a Lyapunov function are that

  1.   for all   and  
  2.   for   (discrete case) or   for   (continuous case)
  3.   is "radially unbounded", i.e., that   for any sequence with  .

In many cases a quadratic Lyapunov function of the form   can be found, although more complex forms are also common, for instance entropies in the study of convergence of probability distributions.

For delay differential equations, a similar approach applies with Lyapunov functions replaced by Lyapunov functionals also called Lyapunov-Krasovskii functionals.

If the inequality in the condition 2 is weak, LaSalle's invariance principle may be used.

Convergence of sequences of functions

edit

To consider the convergence of sequences of functions,[2] it is necessary to define a distance between functions to replace the Euclidean norm. These often include

  • Convergence in the norm (strong convergence) -- a function norm, such as   is defined, and convergence occurs if  . For this case, all of the above techniques can be applied with this function norm.
  • Pointwise convergence -- convergence occurs if for each  ,  . For this case, the above techniques can be applied for each point   with the norm appropriate for  .
  • uniform convergence -- In pointwise convergence, some (open) regions can converge arbitrarily slowly. With uniform convergence, there is a fixed convergence rate such that all points converge at least that fast. Formally,   where   is the domain of each  .

See also

Convergence of random variables

edit

Random variables[3] are more complicated than simple elements of  . (Formally, a random variable is a mapping   from an event space   to a value space  . The value space may be  , such as the roll of a dice, and such a random variable is often spoken of informally as being in  , but convergence of sequence of random variables corresponds to convergence of the sequence of functions, or the distributions, rather than the sequence of values.)

There are multiple types of convergence, depending on how the distance between functions is measured.

Each has its own proof techniques, which are beyond the current scope of this article.

See also

Topological convergence

edit

For all of the above techniques, some form the basic analytic definition of convergence above applies. However, topology has its own definitions of convergence. For example, in a non-Hausdorff space, it is possible for a sequence to converge to multiple different limits.

References

edit
  1. ^ Ross, Kenneth. Elementary Analysis: The Theory of Calculus. Springer.
  2. ^ Haase, Markus. Functional Analysis: An Elementary Introduction. American Mathematics Society.
  3. ^ Billingsley, Patrick (1995). Probability and Measure. John Wesley.