Central limit theorem for directional statistics

In probability theory, the central limit theorem states conditions under which the average of a sufficiently large number of independent random variables, each with finite mean and variance, will be approximately normally distributed.[1]

Directional statistics is the subdiscipline of statistics that deals with directions (unit vectors in Rn), axes (lines through the origin in Rn) or rotations in Rn. The means and variances of directional quantities are all finite, so that the central limit theorem may be applied to the particular case of directional statistics.[2]

This article will deal only with unit vectors in 2-dimensional space (R2) but the method described can be extended to the general case.

The central limit theorem

edit

A sample of angles   are measured, and since they are indefinite to within a factor of  , the complex definite quantity   is used as the random variate. The probability distribution from which the sample is drawn may be characterized by its moments, which may be expressed in Cartesian and polar form:

 

It follows that:

 
 
 
 

Sample moments for N trials are:

 

where

 
 
 
 

The vector [ ] may be used as a representation of the sample mean   and may be taken as a 2-dimensional random variate.[2] The bivariate central limit theorem states that the joint probability distribution for   and   in the limit of a large number of samples is given by:

 

where   is the bivariate normal distribution and   is the covariance matrix for the circular distribution:

 
 
 
 

Note that the bivariate normal distribution is defined over the entire plane, while the mean is confined to be in the unit ball (on or inside the unit circle). This means that the integral of the limiting (bivariate normal) distribution over the unit ball will not be equal to unity, but rather approach unity as N approaches infinity.

It is desired to state the limiting bivariate distribution in terms of the moments of the distribution.

Covariance matrix in terms of moments

edit

Using multiple angle trigonometric identities[2]

 
 

It follows that:

 
 
 

The covariance matrix is now expressed in terms of the moments of the circular distribution.

The central limit theorem may also be expressed in terms of the polar components of the mean. If   is the probability of finding the mean in area element  , then that probability may also be written  .

References

edit
  1. ^ Rice, John A. (1995). Mathematical Statistics and Data Analysis (2nd ed.). Duxbury Press.
  2. ^ a b c Jammalamadaka, S. Rao; SenGupta, A. (2001). Topics in circular statistics. New Jersey: World Scientific. ISBN 978-981-02-3778-3. Retrieved 2011-05-15.