Draft:Bayes Space (statistics)


Bayes space is a function space defined as an equivalence class of measures with the same null-sets. Two measures are defined to be equivalent if they are proportional. The basic ideas of Bayes spaces have their roots in Compositional Data Analysis and the Aitchison geometry[1]. Applications are mainly in statistics, specifically functional data analysis of density functions[2][3].

The basic structure of the Bayes space is that of a vector space, with addition and multiplication being defined by perturbation and powering [4]. The space is formed over a -finite reference/base measure, denoted or depending on whether it is infinite or finite. Densities are considered as Radon-Nikodym derivatives of the measures with same null-sets as the base measure, and are equivalent if they are proportional. In case of finite base measures, Hilbert space structure can be achieved by defining a centered log-ratio on the measures, mapping them to a subset of consisting of funtions integrating to 0[5].

Definitions and main results

edit

Consider a finite base measure   (not necessarily a probability measure) on a domain  . This may be a uniform distribution on a bounded interval, or it can be a Radon-Nikodym derivative of the Lebesgue measure (the Gaussian distribution, for example). If we take two densities   with respect to  , they are said to be B-equivalent if there exists a   s.t  , denoted   (the convention   is used in cases where a measure is infinite). It can be shown that   is an equivalence relation. The Bayes space   is defined as the quotient space of all measures with the same null-sets in   as   under the equivalence relation  .

The first challenge to analysing density functions is that   is not linear space under ordinary addition and multiplication since the ordinary difference between two densities would not be non-negative everywhere. Like in the Aitchison geometry for finite dimensional data, perturbation and powering is defined for densities:

Perturbation

 

Powering

 

where   are densities in   and   is some real number. It can be shown using the properties of multiplication and powering of real numbers that   forms a vector space over the real numbers.

The definition of Bayes space does not strictly require a finite reference measure  . If Bayes space is defined over an infinite reference measure  , it must be  -finite (like the Lebesgue measure). The finite reference measure is, however, necessary for adding Hilbert space structure to a subset of  . Consider the subspace

 . For  , this is a linear subspace and isometrically isomorphic to the Hilbert space   via the centered log-ratio (clr) transformation  . The subspace of log-square integrable functions is termed the Bayes Hilbert space. It can be shown that the clr-transformation is a linear isomorphism between the two spaces. Defining an inner product on   as the inner product of the clr-transformations will provide the Hilbert space structure for  , obtaining the centered log-ratio as a linear isometry.

Multivariate densities

edit

The measure   does not have to be univariate (1 dimensional), but can also be defined as a product measure on cartesian products, characterising bivariate (2 dimensional) or multivariate densities. The geometric structure of Hilbert spaces can be used to decompose multivariate densities into orthogonal independent and interaction parts[6][7], using the concept of "clr-marginals". This decomposition has relations to copula theory[7]. The geometry in   defines norms on densities that can be used to quantify "relative simplicial deviance," which is measure of how much of a bivariate distribution can be explained by assuming independence[6].

See also

edit

References

edit
  1. ^ Egozcue, Juan José. "Hilbert space of probability density functions based on Aitchison geometry". Acta Mathematica Sinica: 1175–1183.
  2. ^ Hron, Karel (2015). "Simplicial principal component analysis for density functions in Bayes spaces". Computational Statistics & Data Analysis – via Elsevie Science Direct.
  3. ^ Talská, Renáta. "Compositional Scalar-on-Function Regression with Application to Sediment Particle Size Distributions". International Association for Mathematical Geosciences.
  4. ^ van den Boogart, Karl Gerald (2010). "Bayes linear spaces". SORT: statistics and operations research transactions. 34: 201–222.
  5. ^ van den Boogart, Karl Gerald (2014). "Bayes Hilbert Spaces". Australian & New Zealand Journal of Statistics: 171–194.
  6. ^ a b Hron, Karel. "Bivariate densities in Bayes spaces: orthogonal decomposition and spline representation". Stat papers. 64: 1629–1667 – via Springer Nature.
  7. ^ a b Škorňa, Stanislav. "Approximation of bivariate densities with compositional splines". arXiv.