First Mondave summer school

Color

edit
 

Vacuum polarization

edit

History

edit

Vacuum polarization was first discussed in papers by P. A. M. Dirac[1] and W. Heisenberg[2] in 1934. Effects of vacuum polarization were calculated to first order in the coupling constant by R. Serber[3] and E. A. Uehling[4] in 1935.[5]

Berestetskii, V. B.; Lifshitz, E. M.; Pitaevskii, L. (1980). "Section 114". Quantum Electrodynamics. Course of Theoretical Physics. Vol. 4 (2nd ed.). Butterworth-Heinemann. ISBN 978-0750633710.

Brown, Douglas H.; Worstell, William A (1996). "The Lowest Order Hadronic Contribution to the Muon g − 2 Value with Systematic Error Correlations". Physical Review D. 54 (5): 3237–3249. arXiv:hep-ph/9607319. Bibcode:1996PhRvD..54.3237B. doi:10.1103/PhysRevD.54.3237. PMID 10020994. S2CID 37689024.

Dirac, P. A. M. (1934). "Discussion of the infinite distribution of electrons in the theory of the positron". Cambridge Phil. Soc. 30 (2). Cambridge Philosophical Sociery: 150–163. doi:10.1017/S030500410001656X. S2CID 117314876. {{cite journal}}: Unknown parameter |subscription= ignored (|url-access= suggested) (help)

Gell-Mann, M.; Low, F. E. (1954). "Quantum Electrodynamics at Small Distances". Phys. Rev. 95 (5). APS: 1300–1312. Bibcode:1954PhRv...95.1300G. doi:10.1103/PhysRev.95.1300. {{cite journal}}: Unknown parameter |subscription= ignored (|url-access= suggested) (help)

Heisenberg, W. (1934). "Bemerkungen zur Diracschen Theorie des Positrons". Zeitschrift für Physik (in German). 90 (3–4). Deutsche Physikalische Gesellschaft: 209–231. doi:10.1007/BF01333516. ISSN 0044-3328. S2CID 186232913. {{cite journal}}: Unknown parameter |subscription= ignored (|url-access= suggested) (help)

Levine, I.; et al. (TOPAZ collaboration) (1997). "Measurement of the Electromagnetic Coupling at Large Momentum Transfer". Physical Review Letters. 78 (3): 424–427. Bibcode:1997PhRvL..78..424L. doi:10.1103/PhysRevLett.78.424. Measurement of the Electromagnetic Coupling at Large Momentum Transfer

Serber, R. (1935). "Linear Modifications in the Maxwell Field Equations". Phys. Rev. 48 (1). APS: 49–54. doi:10.1103/PhysRev.48.49. {{cite journal}}: Unknown parameter |subscription= ignored (|url-access= suggested) (help) Linear Modifications in the Maxwell Field Equations

Uehling, E. A. (1935). "Polarization Effects in the Positron Theory". Phys. Rev. 48 (1). APS: 55–63. doi:10.1103/PhysRev.48.55. {{cite journal}}: Unknown parameter |subscription= ignored (|url-access= suggested) (help) Polarization Effects in the Positron Theory

Representation theory of the Lorentz group

edit

The Lorentz group is a symmetry group of the unbound Kepler problem. This is manifested by the Laplace–Runge–Lenz vector D together with the angular momentum vector L. The components of these vectors form a Lie algebra under the Poisson bracket. In the bound problem, this Lie algebra is so(4), but in the unbound problem, the Lie algebra is so(3, 1).[6] Thus the connected component SO(3, 1)+ is isomorphic to the group of canonical transformations for this problem.[7]

Symmetry groups

edit
 
A 90° clockwise rotation of a square around its center.
 
The sphere, a maximally symmetric object.

The mathematical notion of a group and the notion of symmetry in both mathematics and physics are intimately related. A group has the simple property that if one element of a group is multiplied by another, the result is another element of the group. The same can, mutatis mutandis, be said of symmetries. Apply one symmetry operation (physically or by changing coordinate system), and then another one. The result is that of applying a single symmetry operation. Group theory is thus the mathematical language in which symmetries of nature are expressed.[8] Symmetries may relate to very concrete symmetries of physical objects, like the symmetries of a square. One then speaks of the symmetry group associated with the object, in this case the finite group Dihedral group D2.

In the case of a rectangle, the symmetry group is a subgroup of D2 called the Klein four-group K4. Only one rotation, and two reflections in the plane, will make the transformed rectangle look exactly like it did before the symmetry operation.

Other objects possess higher symmetry. The sphere is an extreme example. It possesses full rotational symmetry and reflection symmetry. Rotate or reflect a ball with any kind of rotation or reflection about any plane through the origin, and it will look exactly the same as before the symmetry operation. In this case of the symmetries of a sphere, the matrix group is the orthogonal group of three dimensions. These are 3 × 3 matrices.

Multiplication table

edit
Klein four-group Cayley table
* E P T R
E E P T R
P P E R T
T T R E P
R R T P E

The basic feature of every finite group is its multiplication table, also called Cayley table, that records the result of multiplying any two elements. A representation of a group can be thought of new set of elements, finite-dimensional or infinite-dimensional matrices, giving the same multiplication table after mapping the old elements to the new elements in a one-to-one fashion.[nb 1]

The multiplication table of the Klein four-group is displayed on the right. The elements are the identity E, vertical reflection P, the horizontal reflection T, and a 180 degree rotation R.

The same holds true in the case of an infinite group like the rotation group SO(3) or the Lorentz group. The multiplication table is just harder to visualize in the case of a group of uncountable size (same size as the set of reals). One way to do this is to wellorder the elements of the group with an ordinal number ρ being the order type. The "infinite Cayley table" is then indexed by two ordinals 0 ≤ α, βρ written on Cantor normal form.

Representations

edit

A central fact is that the symmetry groups can be represented by matrices.[nb 2] In the case of K4 for the rectangle, one matrix representation is composed of the four 2 × 2 matrices:

 

The matrices satisfy by direct calculation the multiplication table in the Cayley table above, and hence represent K2. The Klein four-group emerges as the zeroth homotopy group π0 or component group of the Lorentz group.

Symmetry of space and time

edit

Space itself possesses symmetry. It looks the same no matter how one rotates it, and the resulting rotational symmetry is referred to as isotropy of space. In the present case it is common to use passive rotations, meaning that the observer[nb 3] rotates himself. Mathematically, the active operation of a rotation is performed by multiplying position vectors by a rotation matrix. A passive rotation is accomplished by rotating only the basis vectors of the coordinate system. (The coordinate system can be thought of as being fixed in the rotated observer. The observer is physically rotated.) In this way, every point in space obtains new coordinates as if it was somehow physically rigidly rotated.

The Lorentz group contains all rotation matrices, extended to four dimensions with zeros in the first row and the first column except for the upper left element which is one, as elements.

There are, in addition, matrices that effect Lorentz boosts. These can be thought of, in the passive view, as (instantly!) giving the coordinate system (and with it the observer) a velocity in a chosen direction. A representative rotation and a representative boost are given by

 

where the rotation is θ radians about the z-axis and the boost is with rapidity ς in the x-direction.

Finally, two special transformations are used to invert the coordinate system in space, space inversion, and in time, time reversal. In the first case, the space coordinate axes are reversed. The latter is reversal of the time direction. This is thought of, in the passive view, as having the observer set his clock at minus what it shows and then have the clock's hands move counterclockwise. Physical time progresses forward. The identity E and inversions P and T, together with their product PT have matrices

 

Direct calculation confirms that the new E, P, T and R = PT represent the Klein four-group.

Mathematically the Lorentz group is defined as the set of transformations preserving the bilinear form

 

where the left-hand side is the Minkowski inner product of two events in spacetime, and the right-hand side is the spacetime interval, see classical group for mathematical detail.

Infinite-dimensional unitary representations

edit

There are several features of the general infinite-dimensional theory of representations of non-compact semisimple Lie groups that differs profoundly from that of compact or finite groups.

  1. There are irreducible unitary representations that do not occur in the regular representation.[9] The complementary series is of this sort.
  2. The regular representation may have (with respect to parameters) both discrete and continuous parts.

There are agreeable features:[10]

  1. There is an intimate connection between representations of the Lie group and the Lie algebra. This enables the study of general representations, not necessarily unitary, from a purely algebraic point of viw.
  2. The notion of characters to the non-compact case, making it possible to express a Plancherel formula.

These results surfaced already in the earliest investigations, referenced below.

Equivalence of representations

edit

A G-map or an intertwining map of representations   of a Lie group   and vector spaces   is a linear map   satisfying  . Two representations are isomorphic or equivalent if there exists bijective G-map (an isomorphism or equivalence). If   and   is an equivalence, then   is a change-of basis matrix.

Lie group representations

edit

The Lie correspondence may be employed for obtaining group representations of the connected component of the G. This is effected by taking the matrix exponential of the matrices of the Lie algebra representation. A subtlety arises if G is not simply connected. This results in projective representations or n-valued representations that are actually representations of the universal covering group.

The Lie correspondence gives results only for the connected component of the groups, and thus the other components of the full group are treated separately representatives for matrices representing these components, one for each component. These form the zeroth homotopy group.

The Lie correspondence

edit
 
Sophus Lie, the originator of Lie theory. The theory of manifolds was not discovered in Lie's time, so he worked locally with subsets of   The structure would today be called a local group.

The Lie correspondence for linear groups and Lie algebras is stated for reference. If G denotes a linear Lie group (i.e. a group of matrices)[nb 4] and   a linear Lie algebra (again a set of matrices),[nb 5] let   denote the group generated by   the image of the Lie algebra under the exponential mapping (which is the matrix exponential in this case),[nb 6] and let Lie(G) denote the Lie algebra of G (interpreted as the set of matrices X such that eitXG for all  ). The Lie correspondence reads in modern language, here specialized to linear Lie groups, as follows:

There is a one-to-one correspondence between connected linear Lie groups and linear Lie algebras given by   with   or, equivalently   expressed as   respectively  [11] Lie

Lie algebra representations from group representations

edit

Using the above theorem it is always possible to pass from a representation of a Lie group G to a representation of its Lie algebra   If Π : G → GL(V) is a group representation for some vector space V, then its pushforward (differential) at the identity, or Lie map,   is a Lie algebra representation. It is explicitly computed using[nb 7]

  (G6)

Not all Lie algebra representations arise this way because their corresponding group representations may not exist as proper representations, i.e. they are projective, see below.

Group representations from Lie algebra representations

edit
 
Here V is a finite-dimensional vector space, GL(V) is the set of all invertible linear transformations on V and   is its Lie algebra. The maps π and Π are Lie algebra and group representations respectively, and exp is the exponential mapping. The diagram commutes only up to a sign if Π is projective.

If   for some vector space V is a representation, a representation Π of the connected component of G is tentatively defined by setting

  (GH2)

It can be shown that simple connectedness of G is a sufficient condition for (G2) to yield a representation, but it is not a necessary condition.

  • The simply connected case is the statement of the theorem If   is a Lie algebra homomorphism and G is simply connected, then there is a unique Lie group homomorphism Π : GH satisfying the first line in (G2).[12]
  • If G is not simply connected, then there is a unique representation Πc of the universal covering group Gc of G satisfying the same equation as the first equation in (G2). It is a consequence of the above theorem.
    • If the kernel of the covering map is included in the kernel of Πc, then the representation of Πc descends to a unique representation of G. This is essentially a consequence of a variant of the first isomorphism theorem.
    • If the kernel of the covering map is not included in the kernel of Πc, then a (non-unique) projective representation of G results.

All representations have the following properties:

  • Near the identity i.e. for X in a small enough open neighborhood, (G2) yields, by the Baker–Campbell–Hausdorff formula and that exp is one-to-one on that neighborhood, a unique local homomorphism.[13]
  • Representatives of elements g far off the identity are defined by selecting a path from the identity to g, partitioning it finely enough so that the above property can be used. The result using (G2) can then only depend on the homotopy class (in the standard representation of G) of the path used in the (attempted) definition of Π.[14][15] In turn, this depends only on which X in the Lie algebra is used to represent an element g in the standard representation (and is used in (G2).
    • In the simply connected case, there is only one homotopy class, and (G2) is unambiguous even far off the identity.
    • In the non-simply connected case, there are card π1 homotopy classes, and (G2) is to a certain extent ambiguous far off the identity.

A pictorial view of how the universal covering group contains all such homotopy classes, and a technical definition of it (as a set and as a group) is given in geometric view.

For example, when this is specialized to the doubly connected SO(3, 1)+, the universal covering group is  , and whether its corresponding representation Πc is faithful decides whether Π is projective.

Talk:Electromagnetic tensor

edit

This is a statement most would say is wrong. There can, on the face of things, be at most four since the entries are derived from the four-potential. Then there are the Maxwell equations, and gauge freedom, reducing the degrees of freedom to only two. It is true that there are field configurations satisfying any given value (respecting anti-symmetry) at an event. But the above needs to be clarified in the article.

Physics stack exchange

edit

Irreducibility is not an absolute demand in the classification. For instance, photons of helicity   are both considered to be the same particle. These helicities correspond each to a different one-dimensional representation of the two-dimensional Euclidean group, denoted   or  , which is the "little group" that leaves a light-like vector invariant. These induce, in turn, different irreducible unitary representations of the Poincaré group.

The one-dimensionality of these representations is also one way to explain why helicity invariant under the connected component of the Lorentz group, and does not mix like spin  -component of ordinary spin does. Moreover, a would-be zero mass helicity   particle does not exist since the corresponding representation of   is the trivial representation.

However, helicity changes sign under space inversion, and since space inversion is a symmetry of the theory (QED), photons of helicity   are both considered to be the same particle.

The representation theory of   is detailed in Wu-Ki Tung, Group Theory in Physics (1985). It can be pursued by recycling the same method that leads to the unitary representation of the Poincare group, the method of induced representations (the Mackey machine).

Answer

edit

According to [Wigner's classification](https://en.wikipedia.org/wiki/Wigner%27s_classification), the task in the case of a lightlike four-vector is to find all irreducible unitary representations of the "little group" that leaves this vector invariant, $\mathrm{ISO}(2)$. As it happens in this case, $\mathrm{ISO}(2)$ itself is amenable to the method of induced representations (i. e. the little group approach). The invariant subgroup is in this case isomorphic to the group of translations in two dimensions. (These translations have nothing to do with translations in spacetime, nor do the corresponding momentum operators correspond to those of the Poincare algebra.)

Now, given a vector in the plane, which is the little group (subgroup of the factor group $\mathrm{SO}(2)$) preserving standard vectors in the plane? It is, in the case of the null-vector all of $\mathrm{SO}(2)$. The unitary representations of $\mathrm{SO}(2)$ are all one-dimensional, labeled by an integer $m$ and act by multiplication by a phase factor. The null-vector is an eigenvector corresponding to eigenvalue zero of the momentum operators of $\mathrm{ISO}(2)$.

For a non-zero vector in the plane the short little group is the trivial group. The corresponding irreducible unitary representation is infinite-dimensional. This follows, as is outlined in the cited passage, from the commutation relations. Again, this vector is an eigenvector corresponding to eigenvalue zero of the momentum operators of $\mathrm{ISO}(2)$. These vectors do get "rotated" by the action of $\mathrm{SO}(2)$.

Another way to see that these are infinite-dimensional is to use another basis in which the $\mathrm{SO}(2)$ rotation operator is diagonal and to construct "raising" and "lowering" operators from the momentum operators that shift the eigenvalue of the rotation operator. For a given eigenvalue of $P^2$ (the Casimir operator of $\mathrm{ISO}(2)$), $m$ assumes all values $0, \pm1, \pm2,\ldots$.

This exhausts the irreducible unitary representations. The infinite-dimensional ones do not correspond to known particles, since there is no known quantum number (like spin z-component) that could vary in these representation spaces. Only the one-dimensional cases remains. (When including space-inversion, these become two-dimensional in the case of photons, but that is another matter.)

Affine space

edit

Affine combinations and barycenter

edit

Collections {a1, ..., an} and  , being n points in an affine space   and n elements of the ground field of the associated vector space   respectively, are considered below.

If   then for any two points o and o' ,

 

Thus this sum is independent of the choice of the origin, and for the resulting unambiguous vector in   the notation is   In particular, when   the   occuring in the definition of the subtraction of points is retrieved.


For     is defined, given  , by

 

Then

 


where the second equality is by the first defining equality. Hence this quantity is also independent from the choice of the origin. Therefore, when the condition holds, the notation is  

The point   is called the barycenter (or center of mass) of the   for the weights   The quantity   is also called an affine combination of the   with coefficients  

When letting the   (hence  ) vary, subject only to the condition, an affine subspace results. With the additional condition  , a simplex is obtained.

Representation theory of the Poincaré group

edit

The Poincaré group is the full symmetry group of Albert Einstein's special relativity underlying all of foundational physics.

Induced representations

edit

In order to expose the idea behind representations induced by representations of subgroups, it is advantageous to examine the finite-dimensional framework developed by Frobenius.

If a Lie group has an abelian invariant subgroup, and if this subgroup is unitarily represented on a space, then there is a subgroup of the factor group of the semidirect product that leaves this subspace invariant.

Little groups of the Poincare group

edit

Timelike four-vectors: Rotation group SO(3, 1)

edit

Lightlike four-vectors: Euclidean group E(2)

edit

The little group leaving a timelike vector invariant is the two-dimensional Euclidean group. This group is a semidirect product

 

where   is the translation group in two dimensions. Since   is abelian, the same analysis as pertain to the Poincaré group can be applied here.   acts on the Euclidean plane  .

Representation theory from the method of induced representations
edit
  • For each   there is a one-dimensional unitary representation of the full   It is labeled by  , where the first coordinate refers to the eigenvalue of the Casimir operator  , and the second coordinate is a further label referring to the eigenvalue of the Casimir operator   of the little group  . The action of the Lie algebra is given by
 
At the group level,
 
is obtained.
  • For each   there is an infinite-dimensional unitary representation of the full   It is labeled by  , the square root of eigenvalue of the Casimir operator. The action of the Lie algebra is given by
 
At the group level,
 

To derive these results, the standard representation on   is examined for subgroups leaving invariant a vector  . There are only two cases. Either   in which case the little group is  , or   in which case the little group is the trivial group  

Little groups of Euclidean group E(2)
edit
= Nonzero vector: The one-element group =
edit

Here the labeling of states   is introduced. By definition, the operators   act by

 

At the group level this is

 

where  

The Lie algebra   of the little group   is trivial,   and   has only one irreducible unitary representation, the trivial one.

To deduce the action of the full group  , the action of   is examined by examining the effect of   on rotated states. To facilitate notation, write   as  

 

or

 

On infinitesimal form, this is

 


Since this is deduced from the postulated behavior of   and   on a single vector, and the result are eigenvalues different from the postulated ones for the single vector, the only reasonable conclusion is that   is a new eigenvector of   and   orthogonal to  . Evidently,

 

Since elements are orthogonal matrices, the norm of   is the same as the norm of   Thus the eigenvalue of the Casimir operator remains the same, and an infinite-dimensional unitary representation of   is characterized by this eigenvalue.

= Zero vector: Rotation group SO(2) =
edit

Here the labeling is   is introduced. The first two zeros refer to the eigenvalues of the  . They act by definition according to

 

It remains to work out how the little group acts. If the representation is to be irreducible, it must be one-dimensional, since only one-dimensional irreducible representations of   exist. In these representations, labeled by   the generator   of   acts by

 

This suggests the labeling   for the basis vector.

At the group level, one obtains

 

As it happens, the actions of the abelian subgroup   and of the little group   described so far exhausts the action of all of  .

Notes
edit
  1. ^ Dirac 1934
  2. ^ Heisenberg 1934
  3. ^ Serber 1935
  4. ^ Uehling 1935
  5. ^ Gell-Mann & Low 1954
  6. ^ Goldstein 1980, Chapter 9.
  7. ^ Goldstein 1980, p. 321
  8. ^ Tung 1985, Preface. harvnb error: multiple targets (4×): CITEREFTung1985 (help)
  9. ^ Varadarajan 1989, First two items in list p. 67.
  10. ^ Varadarajan 1989, Items three and four in list p. 67.
  11. ^ Rossmann 2002 harvnb error: multiple targets (2×): CITEREFRossmann2002 (help) Theorem 1, Paragraph 2.5.
  12. ^ Hall 2015, Theorem 5.6.
  13. ^ Hall 2015, Proposition 5.9.
  14. ^ Weinberg 2002, Appendix B. harvnb error: multiple targets (2×): CITEREFWeinberg2002 (help)
  15. ^ Hall 2015, Step 3 in proof of theorem 5.10.
References
edit

Spacelike four-vectors: Generalized Lorentz group SO(2, 1)

edit

Zero four-vectors: Lorentz group SO(3, 1)

edit

Euclidean group E(2)

edit

This group is a semidirect product

 

where   is the translation group in two dimensions. Since   is abelian, the same analysis as pertain to the Poincaré group can be applied here.   acts on the Euclidean plane  .

Notes

edit

References

edit
  • Rossmann, Wulf (2002), Lie Groups - An Introduction Through Linear Groups, Oxford Graduate Texts in Mathematics, Oxford Science Publications, ISBN 0-19-859683-9

Representation theory of the Euclidean group E(2)

edit

The Euclidean group E(2) in two dimensions is the group of isometries of the Euclidean plane. It is also denoted ISO(2) provided reflections are excluded. The I stands for inhomogeneous, referring to the translational part, and SO stands for special orthogonal, referring to the rotational part. Its elements are called rigid Euclidean motions. When reflections are included, the group is sometimes denoted E+(2) (but rarely IO(2)). The elements are then rigid motions.

Group multiplication rule

edit

A rigid motion can be written as

 

where the vector is first rotated in the plane and then a translation is added. The group has a faithful three-dimensional representation. The idea is to embed 2 as the affine plane z = 1 in 3.[1] Then x ∈ ℝ2 is represented by (x, 1) ∈ ℝ3, and

 

The group multiplication rule[2]

 

and the inverse operation

 

are read off directly from the multiplication law.

Representation theory from the method of induced representations

edit
  • For each   there is a one-dimensional unitary representation of the full   It is labeled by  , where the first coordinate refers to the eigenvalue of the Casimir operator  , and the second coordinate is a further label referring to the eigenvalue of the Casimir operator   of the little group  . The action of the Lie algebra is given by
 
At the group level,
 
is obtained.
  • For each   there is an infinite-dimensional unitary representation of the full   It is labeled by  , the square root of eigenvalue of the Casimir operator. The action of the Lie algebra is given by
 
At the group level,
 

To derive these results, the standard representation on   is examined for subgroups leaving invariant a vector  . There are only two cases. Either   in which case the little group is  , or   in which case the little group is the trivial group  

Little groups of Euclidean group E(2)

edit
Nonzero vector: The one-element group
edit

Here the labeling of states   is introduced. By definition, the operators   act by

 

At the group level this is

 

where  

The Lie algebra   of the little group   is trivial,   and   has only one irreducible unitary representation, the trivial one.

To deduce the action of the full group  , the action of   is examined by examining the effect of   on rotated states. To facilitate notation, write   as  

 

or

 

On infinitesimal form, this is

 


Since this is deduced from the postulated behavior of   and   on a single vector, and the result are eigenvalues different from the postulated ones for the single vector, the only reasonable conclusion is that   is a new eigenvector of   and   orthogonal to  . Evidently,

 

Since elements are orthogonal matrices, the norm of   is the same as the norm of   Thus the eigenvalue of the Casimir operator remains the same, and an infinite-dimensional unitary representation of   is characterized by this eigenvalue.

Zero vector: Rotation group SO(2)
edit

Here the labeling is   is introduced. The first two zeros refer to the eigenvalues of the  . They act by definition according to

 

It remains to work out how the little group acts. If the representation is to be irreducible, it must be one-dimensional, since only one-dimensional irreducible representations of   exist. In these representations, labeled by   the generator   of   acts by

 

This suggests the labeling   for the basis vector.

At the group level, one obtains

 

As it happens, the actions of the abelian subgroup   and of the little group   described so far exhausts the action of all of  .

Remarks

edit
  1. ^ It isn't actually required that the mapping is one-to-one. It is merely required that the mapping is a group homomorphism, i. e. Π(gh) = Π(g)Π(h) into some GL(V) the general linear group of some vector space V. (The vector space V is allowed to be infinite-dimensional, e.g a Hilbert space H, in which case one speaks of B(H), linear operators on H instead of GL(V).
  2. ^ This is true of all groups encountered in the present context. There are examples of non-compact Lie groups not allowing a matrix representation.
  3. ^ It is tacitly always assumed that each inertial frame has a dedicated Lorentz observer, I. e. someone who has, in principle, a complete record (i. e. coordinates!) of every event as observed in that frame.
  4. ^ Not all groups have matrix representations. For example, the universal covering group of the linear Lie group SL(2, R) has none. See Hall (2015, Proposition 5.16.) A quotient of a matrix Lie group need not be linear. This is e.g. the case for the quotient of the Heisenberg group by a discrete subgroup of its center. See Hall (2015, Section 4.8.)

    However, if G is a compact Lie group, it is representable as a matrix Lie group. This is a consequence of the Peter–Weyl theorem. See Rossmann (2002, Section 6.2.) harvtxt error: multiple targets (2×): CITEREFRossmann2002 (help)

  5. ^ It's a rather deep fact that all finite-dimensional Lie algebras have matrix representations. This is the content of Ado's theorem. See Hall (2015, Section 5.10.)
  6. ^ The exponential mapping need not be onto and the image is in those cases not a group. Therefore one takes all finite products of elements in the image in order to obtain a group, which necessarily must be closed under multiplication.
  7. ^ Hall 2003, Equation 2.16.

Notes

edit
  1. ^ Rossmann 2002, Example 5, section 2.1. harvnb error: multiple targets (2×): CITEREFRossmann2002 (help)
  2. ^ Tung 1985 harvnb error: multiple targets (4×): CITEREFTung1985 (help)

References

edit

Tautological one-form

edit

The tautological one-form is the unique horizontal one-form that "cancels" a pullback. That is, let

 

be any 1-form on   and consider it as a map from   to  , that is

 

since one-form are sections.

Let   denote the operation of pulling back by  . Then   elaluated at  , denoted  , is a linear map from the cotangent space of the range   at  , i. e.,

 

in all generality. Now since a one-form is a section, one has

 
 ,

which can be most easily understood in terms of coordinates:

 

So, by the commutation between the pull-back and the exterior derivative,

 .

Weyl spinor

edit

In theoretical physics a Weyl spinor is a set of two complex quantities that transform under Lorentz transformations under the   or   representation of the Lorentz group. The former are called left-handed and the latter right-handed. In the simplest description, a Weyl spinor is a column matrix with two complex entries. Weyl spinors may also be though of sitting in a Clifford algebra, in which case they are represented by complex matrices. The connection between the two descriptions is that the complex matrix can be expanded in a certain basis with two elements. The coefficients in this basis then constitute the corresponding column vector.

Weyl spinors are related to the Dirac spinor   via

 

In this expression,

Minkowski diagram

edit

The term Minkowski diagram is used in both a generic and particular sense. In general, a Minkowski diagram is a graphic depiction of a portion of Minkowski space, usually where space has been curtailed to a single dimension. These two-dimensional diagrams portray worldlines as curves in a plane that correspond to motion along the spatial axis. The vertical axis is usually temporal, and the units of measurement are taken such that the light cone at an event consists of the lines of slope plus or minus one through that event.[1] The horizontal lines corresponds to the usual notion of simultaneous events, for a stationary observer at the origin.

A particular Minkowski diagram illustrates the result of a Lorentz transformation. The Lorentz transformation relates two inertial frames of reference, where an observer stationary at the event (0, 0) makes a change of velocity along the x-axis. The new time axis of the observer forms an angle α with the previous time axis, with α < π/4. See the first figure on the right. In the new frame of reference the simultaneous events lie parallel to a line inclined by α to the previous lines of simultaneity. This is the new x-axis. Both the original set of axes and the primes set of axes have the property that they are orthogonal with respect to the Minkowski inner product or relativistic dot product.

Basics

edit
 
Minkowski diagram with resting frame (x, t), moving frame (x′, t′), light cone, and hyperbolas indicating the surfaces of transitivity of the orthochronous Lorentz transformations. In 4-dimensional spacetime they are hyperboloids of two sheets inside the light cone and one-sheeted hyperboloids outside the light-cone together with the past and future light cones themselves and, separately, the origin.

The starting point is a subset of the Poincaré transformations relating the inertial frames of special relativity. A special inertial frame is chosen by a suitable spacetime translation and a suitable rotation in space such that the origin of the coordinate system (0, 0, 0, 0) is located appropriately, e.g. one imagines a Lorentz observer stationary in space there, and the coordinate axes are directed appropriately, e.g. such that dynamical changes of, for example, a moving particle, or the motion of the origin of another inertial system effects one coordinate only, typically taken to be the x-coordinate. According to this setup, the remaining freedom in choice of inertial coordinates are Lorentz boosts.

In order to derive the properties of the diagram, one starts with knowledge of the properties of Lorentz transformations. Let x = (x0, x1, x2, x3) and y = (y0, y1, y2, y3) be the coordinates of any two events in spacetime. Lorentz transformations leave the bilinear form

 

invariant. This means in particular that if x = (x0, x1, x2, x3) and x′ = (x0, x1, x2, x3) are the coordinates of one event described in different inertial systems related by a Lorentz transformation. that

 

The invariance was used justify the third equality sign.


At this time it is convenient to assume for the coordinates of the events of interest the simple expression x = (ct, x, y, z) = (ct, x, 0, 0) ≡ (ct, x), with reference to the appropriate choice of origin and orientation of the coordinate system described in the first paragraph.

Now draw a 2-dimensional diagram with ct labeling the vertical axis and x labeling the horizontal axis. Each point in this diagram represents a point in spacetime with coordinates (ct, x, 0, 0) where ct and x can be read off from the diagram using orthogonal projection onto the vertical axis and orthogonal projection onto the horizontal axis respectively.


The above expression simply becomes

 

where the coordinates (ct, x) and (ct′, x′) are related by a Lorentz boost in the x-direction. Considering a fixed event inside the light cone, i.e. a timelike event (ct2 > x2), and writing

 

one may solve for ct and write

 

This is the equation of a hyperbola of one sheet (for either the positive or the negative solution). This says that an event E with coordinates (ctE, xE will be described in any coordinates (ctE, xE}} related by a Lorentz boost in the x-direction to the original coordinates such that when plotted in the original coordinates, the point E expressed in the new cordinates lies on the hyperbola. That is

 

To recapitulate, a single timelike event can be marked by two points in a Minkowski diagram provided the two points are joined by one of the two branches of the hyperbolas of two sheets. Likewise, single spacelike event can be marked by two points in a Minkowski diagram provided the two points are joined by one of the hyperbolas of one sheet defined by

 

Minkowski space

edit

Affine Minkowski space

edit

The following may be taken as a postulate:

Spacetime of special relativity is a four-dimensional affine space M.[2]

Affine spaces and vector spaces are similar, with one significant difference being that affine spaces have no distinguished point, whereas vector spaces do, the origin. An n-dimensional affine space A has a set of bijections Φα: A → ℝn with the property that the composite maps

 

are bijective affine transformations, i. e. L is bijective linear transformation and a is a vector in n. In this situation n is called the standard vector space of the affine space A. The set of charts Φα, called an affine atlas is a smooth atlas on A and can be extended to a maximal atlas, giving A the structure of a smooth manifold. The maps καβ are called affine coordinate transformations.

The notion of translation in A is defined as a map

 

such that

 

This is independent of α. For every (x, y) ∈ A × A define

 

if

 

for some translation ta, a ∈ ℝn. This is similar to the concept of parallel transport in a vector space, allowing for vectors to be moved around freely. Equivalence holds if and only if

 

for some translation tb, b ∈ ℝn. The relation is an equivalence relation, and an equivalence class in A/≈ is characterized by the vector b ∈ ℝn. A vector space structure results on A/≈ with origin the class with b = 0. The ordered pairs should be thought of as displacement vectors (attached at some point) rather than true vectors (that can be freely moved around).

Space and Time

edit

In his seminal paper Minkowski & 1908–1909 termed Space and Time, Minkowski notes the symmetries associated with Newtonian mechanics with space and time seen as separate entities. Minkowski keeps in the beginning a distinction between "change in position" (invariance under spatial translation and invariance under spatial rotation), and "change in motion" (invariance under Galilean boosts). He also notes invariance under translations in time separately. In modern terms, Newtonian mechanics is invariant under Galilean transformations. This is the topic of Galilean relativity.

He then introduces the world manifold consisting of the totality of world-points x, y, z, t. This is Minkowski space as a set (so far stripped of further structure) and the elements are today called events. Minkowski's terminology lives on in the higher dimensional analogues world line and world sheet. The motivation:

Subjects of our perception are always places and times connected. No one has observed a place except at a particular time, or has observed a time except at a particular place.

— Minkowski, Space and Time

Minkowski assumes for the sake of discussion that at every world-point something exists:

In order not to allow any yawning gap to exist, we shall suppose that at every place and time, something perceptible exists. In order not to say either matter or electricity, we shall simply use the word substance for this something. We direct our attention to the substantial point located at world-point x, y, z, t, ...

— Minkowski, Space and Time

He deduces the concept of world-line as the spacetime history of such a substantial point, parametrized by (−∞ < t < ∞). This would today probably be referred to as a possible world line of a massive point particle with positive mass.

 
Graphical construction of a Lorentz boost in the x-direction.

For simplicity, assumption is made that the world is endowed with an origin and that one has four initially orthogonal axes. But then ,by considering Galilean boosts in the plane as transformations preserving the form of Newtonian mechanics, the time axis, initially assumed orthogonal to the space axes, is free to point in any upward direction. In order to establish a connection, Minkowski considers (this is motivated only later) the upper sheet of the hyperbola c2t2x2y2z2 = 1 with c being an adjustable parameter.

I.e., in his words, he asks what the requirement of orthogonality in space has to do with this perfect freedom of the time-axis towards the upper half. This question is hard to understand, since the answer in special relativity is the non-freedom of the time axis is connected to behavior under boosts.

Élie Cartan 1923–24 considered this question in the context of a spacetime manifold with Newtonian gravitation (like Minkowski, though Minkowski does not mention gravitation specifically), but using the tools of differential geometry. It turns out Newtonian spacetime is curved![3]. Using this fact, and considering the most general transformations of spacetime preserving the canonical equations are the Galilean transformations, but these are allowed to depend on time. One consequence is the existence of special coordinates (in the same sense that Cartesian coordinates are special for Lorentz frames),

 

where g3 is the Euclidean metric on space This should be compared to

 

of Minkowski space. Thus there is a Riemannian metric on space, but this does not allow for extension to all of spacetime in the Newtonian setting. This corresponds to the "perfect freedom" of the time-axis in the language of Minkowski.

With the aid of a figure, an early Minkowski diagram, new values x and t are constructed so that again c2t′2x′2y2z2 = 1. The connection between the parameter c and the diagram is such that in the limit c → ∞ the hyperbola flattens out and approaches the x-axis. The group of transformations for c finite (with translations in spacetime and space rotations also included) is denoted Gc and that for infinite c G. It is, argues Minkowski, in this context reasonable to identify C with the full symmetry group of Newtonian mechanics (the Galilean group). Minkowski notes that a mathematician very well could have asked if some natural phenomena could be invariant under C with c finie. He adds that it would have been an extraordinarily triumph for pure mathematics.

The justification given for these transformations is the, by then known, invariance of the laws of electrodynamics under Gc with c being the speed of light, in other words, Gc is the Poincare group.

Axiom:

The substance existing at any world point can always be conceived to be at rest, if time and space are interpreted suitably.

— Minkowski, Space and Time

Corollary:

The axiom means, that in a world-point the expression

 

shall always be positive or what is equivalent to the same thing, every velocity v should always be smaller than c.

— Minkowski, Space and Time

It is important to recall Minkowski's consideration of substantial world-points only, for else the conclusions are false. In modern terminology, the speed of a massive particle is less than the speed of light in any intertial frame of reference, and for every massive particle, there is an instantaneous rest frame whether the particle is accelerating or not. The axiom, in the form of the restatement here, is sometimes referred to as the third postulate of relativity.[4].

Suppose now that mechanics obeys Galilean relativity (symmetry group Gc), but electromagnetic theory obeys special relativity (symmetry group Gc). The concept of a rigid body no sense in special relativity.[5]

Now if we have an optics with Gc, and on the other hand if there were rigid bodies, it is easy to see that one t-direction is preferred by the two hyperboloidal shells belonging to the groups G}, and Gc, which would have got the further consequence, that by means of suitable rigid instruments in the laboratory, we can perceive a change in natural phenomena, in case of different orientations with regard to the direction of progressive motion of the earth.

— Minkowski, Space and Time

This is ruled out by experiment. The null result result of the Michelson–Morley experiment had previously prompted researchers to postulate that rigid bodies suffers a length contraction in the direction of motion relative to the ether, the Lorentz–FitzGerald contraction hypothesis.

According to Lorentz every body in motion, shall suffer a contraction in the direction of its motion, namely at velocity v in the ratio

 

This hypothesis sounds rather fantastical. For the contraction is not to be thought of as a consequence of resistances in the ether, but purely as a gift from above, as a condition accompanying the state of motion.

— Minkowski, Space and Time

Minkowski shows, using his diagram, that the exact same length contraction follows from the transformations of coordinates under Gc.

 
Light cone in 2D space plus a time dimension.

He then proceeds to consider an arbitrary event O with coordinates (0, 0, 0, 0 and defines the cone

 

The part with ct < 0 is called the fore-cone, and the part with ct > 0 the aft-cone. In modern terminology, one has the light cone, the past light cone and the future light cone. The regions enclosed by the cones are called the for-side (of O) and the aft-side, today absolute past and absolute future respectively. Time-like vectors (those sitting inside the light cone) and space-like vectors are defined. It is noted that any time-like vectors can be used to define a new time-axis. Likewise, any space-like vector S can be used to define axes such that the events at O and S occur at either the same time or event O either preceding or following S in time.

Minkowski defines two vectors (ct, x, y, z) and (ct1, x1, y1, z1) as normal if

 

where one vector is in a unit hyperboloid inside the light cone, and the other is the unit hyperboloid od space-like vectors, by which is meant they are orthonormal.

Minkowski goes on to define proper time:

Let us now fix our attention upon the world-line of a substantial point running through the world-point P(x, y, z, t); then as we follow the progress of the line, the quantity

 

corresponds to the time-like vector-element dx, dy, dz, dt.

The integral

 

of this sum, taken over the world-line from any fixed initial P0 to any variable endpoint P, may be called the "proper-time" of the substantial point in P.

— Minkowski, Space and Time

Two things may be noted: This applies to the "substantial world points" of Minkowski – massive point particles of positive mass. The definition of proper time is fully Lorentz invariant, since the quantity under he square root is (by definition of O(1, 3) or (O(3, 1))). Niether Parity (physics) nor time reversal, which are Lorentz transformations, has any effect on proper time. Thus it is a Lorentz scalar, defined along a time-like curve in spacetime.

It is clear that proper time can be used as a parameter of the curve a massive particle traces out in spacetime, i.e. t = t(τ), x = x(τ), y = y(τ), z = z(τ). By differentiation w r t τ one obtains

 

The first equation says that the four-velocity is constant. The second equation says that the particles four-acceleration is always orthogonal to its four-velocity.

Outer measure

edit

An outer measure on a set   is a function   from the power set   of   to the non-negative extended reals  ,   with the property that for   and any countable set  [6]

  (D)

Remarks

  • It is understood that countable set means either finite set or countably infinite.
  • It is understood that there is a unique summation operator defined on all functions on all sets with range  . In fact, the summation operator is defined more generally, roughly speaking, it is defined whenever   does not occur when the formal definition is applied to a function with range all of the extended reals.[7]
  • The condition   can be paraphrased as   is a countable cover of  [8]

Basic properties

edit
  • The empty collection   is a countable cover of  ,[9]
  (D3)
where and the last equality is by definition of the summation operator.
  • If   then   covers  , hence the property of monotonicity,[10]
  (D3)
  • Let   Then   covers  , whence
  (D3)
which is the property of countable subadditivity.

The properties (D1)(D3), here deduced, are sometimes taken as the defining properties for any function   to be an outer measure.[11] The property (D) is then a derived property.

Lorentz transformation

edit

Let an object be at rest at the origin a Lorentz frame (unprimed) and let the particle velocity be   in another frame (primed). The coordinates of the object in the two frames are related through

 

Taking differentials,

 

Since the object is at rest in the unprimed frame,  , thus

 

Divide the upper equation by the lower to obtain

 

so

  (D1.2)

The Lorentz group is characterized by

 

or in component form

 

where

 

is read off from the bilienar form, see O(p, q). The matrix   corresponds to the matrix   with   and   in the linked article. Put   in these relations. Then

  (D1.2)

Equations (D1.1) and (D1.2) can be solved to yield

 

For the standard configuration, where   of the particle is taken in the negative x-direction (the particle is at rest in the unprimed frame and moves) and no rotation is involved, this means

 

The   entry was found using the condition  .

From this one reads off

 

Wave function

edit

Using the Dirac bra-ket machinery, it is relatively simple to derive the Schrödinger equation in various representations. In abstract notation, it reads

 

In order to obtain a coordinate representation of it, one may use a resolution of the identity,

 

where Id is the identity operator on Hilbert space. The kets |x form, since the position operator is Hermitian, a complete set by the spectral theorem. Multiply the abstract Schrödinger equation from the left with the bra corresponding to |x, and insert a resolution of the identity in the right spot,

 

or

 

For a free particle one has in the coordinate representation

 

which may be taken as a basic postulate, see canonical quantization.

Wigner's theorem

edit
 
E.P. Wigner (1992-1995) first proved the theorem bearing his name. It was a key step towards the modern classification scheme of particle types, according to which particle types are partly characterized by which representation of the Lorentz group under which it transforms. The Lorentz group is a symmetry group of every relativistic quantum field theory.

Wigner's theorem, proved by Eugene Wigner in 1931,[12] is a cornerstone of the mathematical formulation of quantum mechanics. The theorem specifies how physical symmetries such as rotations, translations, and CPT are represented on the Hilbert space of states.

According to the theorem, any symmetry transformation of ray space is represented by a linear and unitary or antilinear and antiunitary transformation of Hilbert space. The representation of a symmetry group on Hilbert space is either an ordinary representation or a projective representation.

Invariance principles derived from symmetry transformations play an important role in physics and, as Wigner noted in [Houtappel et al., 1965, they serve as a guide in the search for new laws of nature and as tools for obtaining properties of the solutions of equations provided by the laws of nature. Moreover, representations of symmetry groups, in particular the Lorentz group, though not sufficient to replace the quantum mechanical equations entirely, can replace them to a large extent. One can say that representations can replace the equation of motion, it cannot replace, however, connections holding between operators at one instant of time.[13] It may be mentioned, finally, that these developments apply not only in quantutm mechanics, but also to all linear theories, e.g., the Maxwell equations in empty space. The only difference is that there is no arbitrary factor in the description and the o, can be omitted in (3a) and one is led to real representations instead of representations up to a factor.

History

edit

In his book in 1931 book,[14] Wigner postulated that the transition probability between two states has an invariant physical sense. This assumption led him to consider transformations of the states of a physical system which preserve the transition probability associated to any pair of states. He proved that any such transformation, called a symmetry transformation, is induced either by a unitary or by an antiunitary operator on the Hilbert space associated to the physical system. This is the result that is known as Wigner's theorem.

From the mathematical point of view, the proof given by Wigner was incomplete, and, according to some, not rigorous, if not incorrect. More than thirty years after the publication of Wigner's book, the first complete proof of his theorem were given in Lomont & Mendelson (1963) and in Uhlhorn (1963).[15] The alleged incorrectness or non-rigor in Wigner's proof is contested in Bargmann (1894),[nb 1] where Wigner's original proof is spelled in detail. According to Bargmann, Uhlhorn's result is interesting though, because the premises from which the theorem is proved are different, and it highlights the connection between Wigner's theorem and the first fundamental theorem of projective geometry.[16]

In the last two decades, L. Molnár has renewed the subject, especially with regard to its mathematical aspects, and several new proofs and generalizations have been published in recent years. See Outline of proof of Wigner's theorem below.

Rays and ray space

edit

It is a postulate of quantum mechanics that vectors in Hilbert space that are scalar nonzero multiples of each other represent the same pure state. A ray is a set[17][18]

 

and a ray whose vectors have unit norm is called a unit ray. There is a one-to-one correspondence between physical pure states and unit rays.[nb 2] The space of all rays is called ray space. Ray space is sometimes awkward to work with. It is, for instance, not a vector space with well-defined linear combinations of rays. But a transformation of a physical system is a transformation of states, hence mathematically a transformation of ray space. In quantum mechanics, a transformation of a physical system gives rise to a bijective ray transformation T of ray space,[18]

 

Symmetry transformations

edit

Intuitively, a symmetry transformation is a change in which "nothing happens"[19] or a change of our view[20] that does not change the outcomes of possible experiments. For example, translating a system in a homogeneous environment should have no effect on the system. Likewise for rotating a system in an isotropic environment. This becomes even clearer when one considers the mathematically equivalent passive transformations, i.e. simply changes of coordinates and let the system be. Usually, the domain and range Hilbert spaces are the same. An exception would be (in a non-relativistic theory) the Hilbert space of electron states that is subjected to a charge conjugation transformation. In this case the electron states are mapped to the Hilbert space of positron states and vice versa. To make this precise, introduce the ray product,

 

where ⟨Ψ,Φ⟩ is the Hilbert space inner product. A ray transformation is called a symmetry transformation if[21]

 

In particular, unit rays are taken to unit rays. The significance of this definition is that transition probabilities are preserved. In particular the Born rule, another postulate of quantum mechanics, will predict the same probabilities in the transformed and untransformed systems,

 

It is clear from the definitions that this is independent of the representatives of the rays chosen.

Symmetry groups

edit

Some facts about symmetry transformations that can be verified using the definition:

  • The product of symmetry two transformations, i.e. two symmetry transformations applied in concession, is a symmetry transformation.
  • Any symmetry transformation has an inverse.
  • The identity transformation is a symmetry transformation.
  • Multiplication of symmetry transformations is associative.

The set of symmetry transformations thus forms a group, the symmetry group of the system. Some important frequently occurring subgroups in the symmetry group of a system are realizations of

These groups are also referred to as symmetry groups of the system.

Statement of Wigner's theorem

edit

A linear transformation U of Hilbert space is unitary[nb 3] if

 

and an antilinear transformation is antiunitary if

 

Given a unitary transformation U of Hilbert space, define

 

This is a symmetry transformation since

 

In the same way an antilinear antiunitary transformation of Hilbert space induces a symmetry transformation. One says that a transformation U of Hilbert space is compatible with the transformation T of ray space if for all Ψ,

 

Transformations of Hilbert space by either a unitary linear transformation or an antiunitary antilinear operator are obviously then compatible with the transformations or ray space they induce per above. Wigner's theorem states a converse of this:[22]

Wigner's theorem (1931): If H and K are Hilbert spaces and if
 
is a symmetry transformation, then there exists a transformation V:HK which is compatible with T and such that V is either unitary and linear or antiunitary and antilinear if dim H ≥ 2. If dim H = 1 there exists a unitary transformation U:HK and an antiunitary transformation A:HK, both compatible with T.

Proofs can be found in Wigner (1931, 1959), Bargmann (1964) and Weinberg (2002) harvtxt error: multiple targets (2×): CITEREFWeinberg2002 (help).

Representations and projective representations

edit

A transformation compatible with a symmetry transformation is not unique. One has the following.

Theorem: If U and V are two additive transformations of H onto K, both compatible with the ray transformation T with dim H ≥ 2, then
 

If G is a symmetry group in this latter sense, and if f, g, hG with fg = h, then

 

where the T are ray transformations. From the last theorem, one has for the compatible representatives U,

 

where ω(f, g) is a phase factor. The function ω is called a 2-cocycle or Schur multiplier. A map U:G → GL(V) satisfying the above relation for some vector space V is called a projective representation or a ray representation. If ω(f, g) = 1, then it is called a representation. One should note that the terminology differs between mathematics and physics. In the linked article, term projective representation has a slightly different meaning, but the term as presented here enters as an ingredient and the mathematics per se is the same. Applying the last relation (several times) to the product fgh and appealing to the known associativity of multiplication of operators on H, one finds

 

Upon redefinition of the phases,

 

which is allowed by last theorem, one finds[23][24]

 

where the hatted quantities are defined by

 

This freedom of choice of phases can be used to simplify the phase factors. In the case of the Lorentz group and its subgroup the rotation group SO(3), phases can be chosen such that ω(g, h) = ± 1. For their respective universal covering groups SL(2,C) and Spin(3), it is even possible to have ω(g, h) = 1. The study of of redefinition of phases involves group cohomology. Two functions related as the hatted and unhatted versions of ω above are said to be cohomologous. They belong to the same second cohomology class, i.e. they are represented by the same element in H2(G), the second cohomology group of G. If a an element of H2(G) contains the trivial function ω = 0, then it is said to be trivial.[24] The topic can be studied at the level of Lie algebras and Lie algebra cohomology as well.[25][26]

Outline of proof of Wigner's theorem

edit

Like for many foundational theorems, Wigner's theorem has been proved in a variety of ways. Approaches include a direct manipulation in Hilbert space,[27][28] an algebraic approach allowing for generalization,[29], a complex analysis approach,[30], and variants.[31] It has also been generalized in different directions. Generalizations of Wigner's theorem to indefinite inner product spaces,[32] Hilbert modules,[33] type I1 factors,[34], Banach spaces,[35] quaternionic Hilbert spaces,[36][37] etc. exist.[38] The premises of the theorem have been altered, preservation of orthogonality (a weaker assumption than Wigner's assumption of preserved inner product for all vectors) suffices,[39] and the requirement of bijectivity of the ray transformation and separability of the Hilbert space (the latter is equivalent to dropping the requirement of a countable basis) have been dropped.[40]

Preliminaries

edit

Among all proofs, Bargmann's is probably the most accessible. Knowledge of basics of inner product spaces and very basic theory of complex numbers suffice for comprehension. Wigner adopts definition (D1) for ray space.

Real multiplication of rays
edit

The ray product is as before and, in addition, multiplication of rays by real numbers is defined by

 

This multiplication satisfies

 

Every ray may then be expressed as the product of a real number and a unit ray.

Symmetry transformations
edit

Symmetry transformations are defined on all unit rays (but initially on unit rays only). They are demanded to preserve the ray product. The facts that symmetry transformations are one-to-one and onto follow at once. For unit rays e1, e2,

 

for representatives e1, e2 of e1, e2. The second to last equality is equality in the Cauchy-Schwartz inequality, which holds only if e1 is a multiple of e2, necessarily here of modulus 1. Hence e1 = e2 and T is injective. Then T has a left inverse T−1, a symmetry transformation required to preserve the ray product as well. Let {{{1}}} be a maximal orthonormal set in H. Then To is an orthonormal set. It is complete, for if not, then there is a nonzero f in the range orthogonal to all e'α TeαH'. Under the left inverse, T−1f then has zero ray product with every eα contradicting the completeness of o. Thus To is complete and T is surjective.

Conclusion of theorem
edit

The theorem guarantees the existence of an operator U with the following properties:

  (4)

and

  (5)

for χ being either the identity or complex conjugation. Only property (4) is used in the construction, together with consistency with previous definitions. The properties in (5) will follow.

Independence of unitary-antiunitary character of U on realization
edit

The proof amounts to constructing an operator with these properties. The unitarity or antiunitarity of the operator the theorem asserts the existence of is determined by T. To see this, define a map from R × R × R to ,

 

which is, using properties of the inner product on H, seen to be independent of representatives. Now

 

where (5a) and properties of χ are used. This can hold only if U is unitary when χ is the identity and antiunitary when χ is complex conjugation. This means that the character of U in this respect does not depend on any arbitrary choice in the proof and can be determined without construction of U.

Extension of T
edit

Before construction of U the unit ray map T is extended from unit rays to all of ray space by

 

One has

  (7)

the first two following from the definitions, and the third from the second.

Lemma 1
edit

Suppose now that U has been constructed in a subspace (at least one-dimensional). Then, for a in this subspace,

  (8)

Since Ua ∈ Ta by (4), the first equality follows from (7c). Likewise, U() = U(a|λ|eiarg λ) ∈ Taeiarg λ|λ| = Ta|λ|, where the phase factor was absorbed into a. Since U(a) ∈Ta, the second assertion is true provided χa satisfies the two assertions in (8b). Equation (8c) is evident from (4) and (7b).

Bargmann's proof

edit
Expansion coefficients in the range
edit

Consider an orthonormal set,

 

with m finite, and let Let

 

Then if a' is any element in a' = Ta,

Key result

 

(9)

Proof:

 

The second to last equality, while not entirely obvious, follows by comparison with the far left and the first term in the last row. Orthonormality of the fρ and the fact that T preserves absolute values of inner products have been used. Hence (9) holds and α'ρ = (f'ρ, a') since

 

Construction of U

edit

Fix a unit ray e with Te = e'. Fix ee, e' ∈ e' and define

Definition

 

(A)

This is arbitrary up to a factor of eiφ, φ ∈ ℝ. This is the only arbitrary choice made in the proof, in accord with that the final complete operator is unique up to the same factor.

Now decompose H = P ⊕ span e. For any aH,

 

First consider a for which α = 0 or 1. Set a = e + z and define f = z||z||, so that f is a unit vector, and let a, f and z be the corresponding rays. Accordingly, z = f||z||, ||f|| = 1. For any a' ∈ a' = Ta and f' ∈ f' = Tf,

 

according to (9). But then

 

as well. Now pick instead b' ∈ a' and g' ∈ f' and apply the same procedure;

 

But, necessarily, b' = a'e and g' = f'e so

 

whence b'' = a''. This means that a' contains a unique vector of the form e' + f'β' with |β'| = |z|. Set

 

and define

Definition

 

(B)

Since Vz = f'β' ∈ Tf||z|| = Tz it is permissible to set

Definition

 

(C)

Analysis of the map V

edit

Let w, xP and let definition (C) apply to both. From (C) and (8c) follows

Key result

 

(11)

and from (B) and (8c) follows

 

or, which is the same thing after expansion of both sides,

 

But

 

whence

Key result

 

(12)

The second to last step uses (11) and the last step is just expansion of the RHS of the equation above. Also,

Key result

 

(12a)

if (w,x) is real. An imaginary part in (Vw,Vx) would violate (11).

Real part of the function χ

edit

Now fix two vectors y, zP and define f1 = y/|y| and a second orthogonal unit vector f2 such that

:  (13)

and let L = span {f1, f2}. By the same procedure as above, f'ρ = VfρTfρ, and f'1 and f'2 are orthogonal. By (8),

  (14)

Equation (8a) gives

 

With α = 1,

Key result

 

(15a)

and with β real, using (12a),

  (15b)

Coefficients α in terms of the function χ

edit

Now set

 

Then, by (9),

 

Put γρ = α*ρ−1. Then

 

hence by (12a), (8a) and orthonormality,

 

and thus

Key result

 

Independence of χ

edit

Next set

 

which can hold only if

 

whence

Key result

 

(16)

Imaginary part of the function χ

edit

Then set β = i. By (8b), |χ1(i)| = 1, and by (8b), Reχ1(i) = 0. Hence χ1(i) = ±iηi. By equation (15),

 

Combination with (15a) yields

 

Whether χ is complex conjugation or the identity, it satisfies

Observations

 

(O)

The right properties of V

edit

Finally, let w, xL with

 .

One has

Key result

 

(18)

Parts (18a) and (18b) follow from (16) and observations (Oa) and (Ob) respectively. Part (18c) is a consequence of (13) and observation (Oc).

Construction of U completed

edit

Lastly, consider general vectors a = + z, zP for α ≠ 0, 1. Define b = e + zα. Then a = and Ta = (Tb)|α|. UbTb is defined as before. But then Ubχ(α) ∈ Ta and one may define

  (D)

This definition coincides with the previous ones in case α = 0, 1. Using (18) and (O) one confirms that U satisfies all advertized properties in (5),

 
 
 

Dirac's expansors

edit

To make concrete what a representation space of a unitary infinite-dimensional representation of the Lorentz group may look like, one may look at the Dirac (1945) publication. In this paper first an infinite-dimensional unitary representation of O(3) is constructed and then extended to an infinite-dimensional unitary representation of O(3, 1). The O(3) representation is not irreducible, it couldn't be because all its irreducible unitary representations are finite-dimensional. On the contrary, the O(3, 1) representation is not reducible to finite-dimensional unitary representations because its irreducible unitary representations are all infinite-dimensional. While Dirac's proposed expansors found little relevance in physics, they were of considerable mathematical import. Its analysis of infinite unitary representations was later developed further by Harish-Chandra and Bargmann.[41]

O(3)

edit

Begin by considering power series

 

in the real variable ξ1 with real coefficients. Let V1 be the inner product space of sequences (ar) such that

 

converges. Then

 

is an inner product on this space and

 

defines the norm. Define similar vector spaces V2 and V3 and define

 

Also, define product operations

 

By using bases,

 

one may write

 

with

 


The elements of the "triple indexed sequence" (Arst) may then be though of as coefficients in a power series

 

the inner product being defined by

 

Now apply an element of O(3) to the triple ξ ≡ (ξ1, ξ2, ξ3), ξ' = Rξ, R ∈ O(3). Then

 

with A'rst (rst fixed) some linear combination of Arst (as rst varies).


It turns out that

 

implying that the space V is a representation space of a unitary representation of O(3). Consider first the transformation

 

Plug this into the expression for P,

 

so

 

The squared norm is thus

 

the last step by virtue of changing summation variable rr − 1 in the second term and ss + 1 in the third.


The space V is clearly infinite-dimensional, but by considering homogeneous polynomials of a fixed degree, say u, then the space is finite. In this case the Arst can be seen as to equal certain components of a symmetrical tensor of rank u. To see this, write

 

Then there are

 

terms with r indices equal to 1, s indices equal to 2 and t indices equal to 3. These terms are identical since the ξi commute, any part not symmetric in all indices would drop out, and one may write

 

Due to the invariance of P, the latter expression for it implies that the quantity A must transform as a tensor of rank u because each ξi transforms as a vector.

The total space is clearly an infinite sum of such finite-dimensional spaces, so that the infinite-dimensional representation of O(3) decomposes as a direct sum of finite-dimensional unitary representations that can, in turn, be broken down to irreducibles.

Generalizations

edit

By considering more 3-touples of variables, say ξ ≡ (ξ1, ξ2, ξ3), η ≡ (η1, η2, η3) and ζ ≡ (ζ1, ζ2, ζ3), one obtains more unitary representations. However, by restricting these to homogeneous polynomials of degree u, the resulting tensor of rank u is not symmetric.

O(3, 1)

edit

Now consider a decending power series in a real variable ξ0,

 

Go through the same steps as in the O(3) case and define V0 with the inner product

 

In the same fashion as above, define

 

with inner product

 

The usage of the Gamma function is motivated by that Γ(n + 1) = n! for n ∈ ℕ and anticipating occurrences of some positive powers of ξ0. When accepting nonzero coefficients for negative values of n, the inner product is, strictly speaking, a pseudo-inner product space. By going through the same procedure as in the O(3) case, one finds that if ξ ≡ (ξ0, ξ1, ξ2, ξ3), ξ' = Λξ, Λ ∈ O(3, 1). Then

 

with A'nrst (nrst fixed) some linear combination of Anrst (as nrst varies). It turns out that

 

so that one obtains a unitary representation of the Lorentz group with representation space V. The fact that this representation is unitary implies[42] that it is not a direct sum of finite-dimensional unitary representations, since no such exist.

The coefficients

 

(required to have finite norm) were termed expansors by Dirac. The name derives from the connection with binomial expansions (actually multinomial) under Lorentz transformations. If the power series associated to A is homogeneous of fixed degree, they are termed homogeneous expansors. The analogy with the 3-dimensional case suggests that these can be regarded as symmetrical tensors in spacetime with the suffix 0 occuring a negative number of times. By contrast to the 3-dimensional case, homogeneous expansors are not finite.

Details

edit

In detail, apply a Lorentz transformation to the ξμ,

 

This forces

 

where p is a finite polynomial in the ξμ, and

 

The latter may be expressed as a series with ascending powers in ξ'1, ξ'2, ξ'3 and descending powers in ξ'0. For a brief proof put

 

Make re-substitutions and the result follows.

While there may be positive powers of ξ'0 in ξ0n − 1ξ1rξ2sξ3t (originating from p), these contributions will be killed off by the gamma function when calculating the norm.

Consider first

 

Substitution yields

 

This yields

 

and

 

since the second and third terms cancel when substituting nn + 1 in the former and rr − 1 in the latter.

Generalizations

edit

By considering more 4-touples of variables, say ξ ≡ (ξ0, ξ1, ξ2, ξ3), η ≡ (η0, η1, η2, η3) and ζ ≡ (ζ0, ζ1, ζ2, ζ3), one obtains more unitary representations. The introduction of the gamma function opens up for another generalization. Instead of considering n ∈ ℕ, one my consider n ∈ { n0, n0 + 1, …} for n0 any real integer. If n0 > −1 then the norm is still positive definite. This is the case if the expression for Q is homogeneous of negative degree. If Q is homogeneous of positive degree, there will be a finite number of negative terms in the expression for the norm. Dirac calls such representations nearly unitary.

Convergence

edit

Suppose Q is homogeneous of degree u − 1. Then write

 

and

 

Now apply,

 

this is the Cauchy–Schwartz inequality in sequence space with the inner product

 

Fix n, r, s, t, subject to the condition above. Put

 

and Cauch–Schwartz inequality yields

 

One finds

 

and

 

Now sum over the allowable values for r, s, t for a fixed n, indexed by S,

 

and employ the multinomial theorem to find

 

Suppose now that

 

converges. Then the terms are bounded,

 

It follows that

 

so if

 

then Q will converge whenever it is homogeneous and ||(Anrst)|| converges.

Special expansors

edit

Expansors being coefficients of

 

go over into themselves under Lorentz transformations. This is obvious from the the definition of the Lorentz transformation,

 

To see what they look like, write

 

the last step by virtue of the geometric series. Now consider the case l = 1. Then

 

the last manipulation using the multinomial expansion. The coefficients are then found by matching coefficients.

If f is a homogeneous polynomial in ξ0, ξ1, ξ2, ξ3, then

 

transforms like f, i.e. like a tensor of order equal to the degree of f. This expansor is infinite. Write

 

where the g are homogeneous polynomials in ξ1, ξ2, ξ3 of the degree given by the index. It is not entirely obvious that this can be done. To see that it can, create the g's by considering f term by term. For each term, begin by constructing the g of lowest degree. This construction will yield residual terms that go into the g's of the next higher degree. For example

 

so that in this case

 

Looking at the contributions, one finds

 

To prove this, extract ξ02 from the denominator and use the geometric series. Then use

 

which, in turn, is proved by induction on l. The observation here is that, for m large, the m'th term of gu - 2 is of order ml smaller than the corresponding term for gu. For convergence issues then, consideration of gu and gu − 1 suffices. For this purpose, write

 

the existence motivated by the same reasoning as that for the g's. These each contribute

 

The squared length of expansors whose components are the coefficients of the above expressions are series on the form

 

For large m, the ratio of one term and the next in the series is 1 + u/m, implying that the series diverges. The corresponding expansor is thus infinite. Moreover, expansors of this form are orthogonal meaning that the total expansor corresponding to f diverges, hence is of infinite length.

Transformation of variables

edit

Expansors can be put into correspondence with another set of functions. For this purpose, introduce a set of operators on ξ-functions,

 

Now consider the ξ-function ξ0−1. It vanishes when operated on by

 

and effectively (it is killed off by the gamma function in the norm) also when multiplied by

 

Put this ξ-function in association with an x-function that is killed off by the right hand sides. From the theory of differential equations, this is uniquely (up to the constant)

 

Multiply this equation by

 

on the left and the operator equal to it on the right. This yields

 

Thus a general expansor can be identified with both a ξ-function and an x-functions and

 

The chief virtue of this identification is that the inner product of two expansors become

 
Proof
edit
 

Apply this m times. If m > m', then the prefactor of the integral on the LHS will yield zero. Using the standard integral for the Gaussian, the result becomes

 

This result applied to two expansors in the guise of x functions is

 

The conclusion is that the set of all x-functions may be seen as a unitary representation of the Lorentz group. The transformation rule is determined by that of the expansors. Naturally, the xμ may be interpreted as spacetime coordinates.

Physical applications

edit

Let the xμ denote the coordinates of a four-dimensional harmonic oscillator. Then pμ = ixμ are interpreted as momentum operators and the energy of the operator can be taken as

 

The state of the oscillator characterized by (n, r, s, t) is then represented by the function Fnrst; in the ξ-representation it is represented by the expansor ξn − 1ξrξsξt.

Sandbox

edit

ƒ̂ , , f, f, f, ĝ, ĥ

S-matrix

edit

Consider

 

Use

 

on both sides. Then

 

or

 

But Weinberg gets

 

or

 

Obviously, use has been made of

 

We have

 

or

 

Thus

 

and

 

i.e.

 

Asymptotic operators

edit

[43] [44] Associated to the in and out states are the in and out field operators (fields) or asymptotic fields, Φi, Φo. To the full interacting theory there corresponds the interacting field Φ. In the context of axiomatic field theory, it is called the interpolating field. Again attention is focused on a neutral Klein-Gordon field in order to keep the notation as clean as possible. It is postulated to satisfy the same equal time commutation relations (ETCR) as the free field.[44]

 

This is assuming that there are no gradient couplings modifying the canonically conjugate field π. In contrast to the non-interacting case, these commutation relations cannot be extended to unequal times.[44] The commutator between the field operator and the momentum operator retains the free field behavior,[43]

 

with formal solution

  (19)

Like for the interacting states and the asymptotic states, there are connections between the interacting field and the asymptotic fields. The field equation for the interacting field is given by

  (20)

where m is the physical mass and the source term in general depends on Φ. One term in the source is, again assuming no derivative couplings, partly given by[44]

 

with

 

the free and interacting parts of the Lagrangian density. There is an additional term that accounts for the use of the physical mass m in the Klein-Gordon equation instead of the bare mass, m0 appearing in the Lagrangian density,[44]

 

The field equations are formally solved by using Green's function techniques,

  (21)

These are the Yang–Feldman equations. Here Δ is the Pauli–Jordan function and Θ is the Heaviside step function. The in and out fields solve the homogeneous equation, while ΔR, A solve[45]

 

One might naïvely expect from these equations that (under typical scattering conditions)

  (asymtotic conditions 1)

in analogy with the states. This will hold if j(x) = 0, x0 > |T| for some finite T. But this is not the case since self-interaction cannot be "turned off". A slightly less naïve guess is to assume

  (asymtotic conditions 2)

where Z is a field renormalization constant. Its interpretation is that it gives the probability of Φ creating a one-particle state when applied to the vacuum,[44]

 

This probability is assumed to be less than one since the interacting field can create many-particle states. But this is inconsistent as speculated so far without more qualification of what is meant. Consider the string of equalities

  (11)

These equations imply Z = 1, and nothing has changed. Implicit in asymtotic conditions 2 and the equation above is that Φ converges in norm to ZΦi. Convergence in norm to Φ holds, for a sequence of operators, if

 

This requirement is too strong, and one can instead demand

  (asymtotic conditions 3)

corresponding to demanding weak operator convergence or convergence in WOT, the weak operator topology. There is also an intermediate topology, the SOT, strong operator topology, where ΦvΦiv is demanded for all v.[46] While asymtotic conditions 2 imply (11), assymtotic conditions 3 do not, and this opens up the possibility for a rigorous description of the asymptotic behavior. Next problem: The respective RHS of asymtotic conditions 3 have time dependence, see (9), valid for the asymptotic fields as well, so the limits aren't well-defined. This can be remedied by considering smeared field operators, analogous to wave packets. The scalar product in the Hilbert space of the scalar theory is given by

 

Use this to write

 

for

 

or, since a is time-independent,

  (12)

where the uα satisfy

 

and constitute a complete set of localized wave functions. They can be expressed as

 

where gα is a smooth function. See the previous section for a definition of . The completeness can be expressed as

 

and

 

Equivalently, write

  (13)

it can be shown that Φαi, o are time-independent.[44] The interpretation is that Φαi, o applied to the vacuum operates on wave packages described by gα as opposed to plane waves (associated to the creation and annihilation operators) or distinctly localized particles (associated to the field operator). The interpretation for the interacting operators is harder, since these are not known in detail. The asymptotic condition is now written

  (asymptotic conditions 4)

.

If one uses, instead of wave packets uα uses plane waves up, then

 
 

the last equality following from the usual expansion of free fields in terms of creation and annihilation operators. The interpretation of ap is more intricate due to the interaction. The asymptotic conditions 4 become

  (asymptotic conditions 4*)

which are weaker (follow from, but does not imply) asymptotic conditions 4. By postulating asymptotic conditions 4*, a weak version of (21) can be derived,[44]

  (25)

Crossing the Tsirelson bound

edit

Crossing the Tsirelson bound amounts to, at least, leaving the confines of ordinary quantum mechanics. Crossing has been achieved (non-local-boxes or PR-boxes), but only purely mathematically, with no directly supporting physical principles, but still with the no-signalling condition imposed by special relativity satisfied.[47] By incorporating the, physically speculative, idea of supersymmetry, that has received much attention the past decades as perhaps the most promising candidate for physics beyond the standard model, it is possible that the bound is crossed. But this comes with a price. One set of conditions that suffices in a minimal supersymmetric extension of qubits for breaking the bound, as outlined below, is the following:

  • Supersymmetry
  • A certain Grassmann-valued transition probability between states.[48] The chosen definition of transition probability reduces to the usual Born rule when the states have no Grassmann-component, and the the transition probabilities add up to 1.
  • A particular choice of map from Grassmann-valued probabilities to real probabilities. This map must respect the UOSp(1|2) symmetry of below, just like the usual transition probabilities respect the U(2) symmetry, but otherwise this choice has no counterpart in ordinary quantum mechanics.

In one "simple" case, the two-dimensional complex Hilbert space whose normalized elements are qubits, with symmetry group U(2), is replaced by a (2|1)-dimensional super Hilbert space with the orthosymplectic group UOSp(1|2) as symmetry group, inhabited by superqubits.[49] In this scenario, Lie algebras are replaced with Lie superalgebras, the Bloch sphere with the super-sphere S2|2 = UOSp(1|2)/U(0|1) and so on.

The space of superqubits, here initially thought of as a Z2-graded vector space over the complex numbers, is spanned by the kets

 ,

where the first two are of even grade, and the last one is odd. The space of linear transformations, denoted End(Kp, q) on any super vector space is (in a basis) just the set of (p × q)-matrices with entries from the ground field K, here K = ℂ. These decompose uniquely as a sum of a grade-preserving and a grade-reversing transformation.

The (real) orthosymplectic Lie superalgebra osp(p|q) is defined as

 

Here H is a matrix representing a bilinear form with both a symmetric and antisymmetric part. This is in almost complete analogy with how the Lie algebras of the classical groups are defined, but with two exceptions. There is the alternating sign depending on whether the transformations are even (grade-preserving) or odd, and the supertranspose is used as opposed to the ordinary Hermitian adjoint.

In the current application, the notion of a super vector space is generalized to a supermodule over a Grassmann algebra, so that the coefficients of a general vector expressed in the basis kets of above are Grassmann numbers rather than ordinary complex numbers. Likewise, the linear transformations are generalized, so that the entries in the representative matrices are Grassmann numbers.

The superqubit is not itself a candidate for foundational supersymmetric physics beyond the standard model because the space it lives in is not a representation space of the super-Poincaré group. But the superqubit and the Lie superalgebra finds supersymmetric applications in the field of condensed matter physics.[50]

The CHSH inequality can be rephrased as a limit on the probability of winning a certain non-local game, the CHSH game,[51]. The CHSH game is played as follows. The participants are Alice and Bob, who are competing against a referee. Alice and Bob are allowed to cooperate in the sense that they can agree upon a strategy before the game is played. But, they are not allowed to communicate during the game (the no-signaling condition). The referee sends to each of Alice and Bob a number, 0 or 1, denoted i and j respectively, with a probability of 14 for each combination. Alice and Bob each sends a number, a and b respectively, in return to the referee. The winning conditions for Alice and Bob are shown in the table below on the left.

ij a + b (mod 2)
00 0
01 0
10 0
11 1

On the face of it, one optimal strategy for Alice and Bob is to send 0 in return, regardless of what they receive. The only way they can lose is if the referee sends i = j = 1. Thus the maximized winning probability is 0.75, and this is the CHSH bound in this case. But if Alice and Bob share a so-called non-local resource, they can improve their winning chances. One such resource is an entangled state, such as a 2-qubit, e.g Ψ = (uu) + (dd), where u and d may (for instance) be interpreted as a spin up and spin down of the components of the spin of a spin 12 particle with respect to some direction. (This is only a factor in a complete physical 2-particle state, but it is all that is needed for the discussion.) By using an agreed upon measurement strategy and utilizing the results of the measurements on their respective "ends" of the qubit, Alice and Bob can improve their chances of winning in the game, up to, and including, a limit, the Tsirelson bound, which in this case is cos2 π8 ≈ 0.8536. Loosely speaking, the reason that this is possible is that the measurement results are correlated due to the entanglement.

Now suppose Alice an Bob share a non-local resource in the form of a entangled state of a 2-superqubit. The amplitudes in super Hilbert space are Grassmann numbers. Results will therefore depend on how real probabilities are extracted. A Grassmann supernumber z may always be written as

 ,

where the first term is a complex number and the second term is a sum over the Grassmann algebra generators with complex coefficients. The terms are referred to as the body and the soul. For a start, a counterpart of the Born rule must be chosen. A reasonable, but not canonical, choice is

 .

The hash operator denotes graded involution, similar to complex conjugation. This choice reduces to the usual Born rule when there is no soul. Secondly, a map from the Grassmann-valued transition probabilities to the real numbers must be chosen. In Borsten, Brádler & Duf (2014), the authors consider three plausible ways of doing this. The first alternative considered is to simply ignore the "soul". The resulting probabilities are those of ordinary quantum mechanics, and then an optimal strategy for Alice and Bob reaches the Tsirelson bound. The second case considered is a trigonometric map. In this case too, the Tsirelson bound can be reached. The third case, the Modified Rogers case, as they call it, exceeds the bound. All states used in the games involve probabilities lying between 0 and 1, but the third interpretation of Grassmann transition probabilities in terms of real numbers permits changes of basis inducing negative transition probabilities. The authors write in the conclusion,

Violating Tsirelson's bound was always destined to involve paying a price and it remains to be seen whether the existence of negative transition probabilities is too high price to pay (even though we did not invoke them in the CHSH game). In fact, one might ask whether it is this feature alone, with or without supersymmetry, that is responsible for exceeding the bound.

They finally present arguments in the direction that existence of negative transition probabilities alone is not sufficient for violation of Tsirelson's bound.

The general case

edit

The formula will be proved by finding a general expression for

 

where C(t) is a curve in the Lie algebra. A Lie group is endowed with a smooth Lie algebra valued form ω such that for each point gG ωg:TGg → TGe is a linear map. It is defined as ω = Lg−1, the pushforward of left multiplication by g−1. It satisfies

 

the Maurer-Cartan equations. See Vector valued differential form for the definitions of exterior derivative and wedge product for vector valued forms. In addition, there is a Lie bracket for Lie algebra valued forms. In terms of this, the Maurer–Cartan's equations read

 

In case G is a matrix Lie group, then the Maurer–Cartan form cam be expressed as

 

The Lie algebra valued forms also pull back under smooth maps. Thus if ω denotes the pulled back ω under some map g:ℝ2G, it too will satisfy the Maurer–Cartan's equations. Given coordinates (s, t) on 2 one may write

 

The inverse of g on the right hand side is the group inverse. Since the pullback commutes with exterior derivatives and commutators, the Maurer–Cartan equations read in these coordinates

 

Let

 

where C(t) is a C1 path in the Lie algebra. Then

 

so that the Maurer–Cartan equations are

  (10)

The desired general expression is thus β(1, t). For fixed t, make an ansatz for β as a formal power series in s, with coefficients in the Lie algebra,

 

and one finds

 

This leads to

 

Define

 

and the general expression for β becomes

 

or

 

Proof

edit

Here g is to be considered as a matrix of maps (g)ij, 0 ≤ i, jn of k variables, where k is the dimension of the group. In exponential coordinates, the k-tuples of variables (Xi) can be thought of as elements of the Lie algebra with basis Xi, 1 ≤ ik. Then

 

where

 

In this expression,

 

Gamma matrices

edit

Tong footnote.[52]

Weinberg footnote.[53]


For an explicit statement of the disputed sentence, please see Tong (2007, p. 93) harvtxt error: multiple targets (2×): CITEREFTong2007 (help). I consider David Tong a reliable reference.

About dependencies in odd spacetime dimension:
In spaces or spacetimes with with odd dimensionality, the totally antisymmetric tensors of rank n can be linearly related by the conditions

 

where ε is the totally antisymmetric symbol and the left hand side is taken as the identity matrix for r = 0. Under these conditions there are only 2d−1 independent tensors, requiring γ-matrices of dimensionality at least 2(d−1)/2.[Note 1]

So, yes, we can have dependencies. There is nothing in the definition of a Clifford algebra ruling it out. I consider Steven Weinberg a reliable reference.

About the representations in general:
D even

  • γa = γa (a=1, ...,D) are 2D/2×2D/2 matrices, which are unique modulo a similarity transformation.
  • All Γa are linearly independent.
  • All 2D/2×2D/2 matrices can be decomposed into γa1a1...an (0 ≤ nD).[Note 2]

D odd

  • γa = γa (a=1, ...,D) are 2(D−1)/2×2(D−1)/2 matrices, which are not unique; there are two representations.
  • Not all Γa are linearly independent.
  • All 2(D−1)/2×2(D−1)/2 matrices can be decomposed into γa1a1...an (0 ≤ n ≤ (D−1)/2).[Note 3]

This wraps it up. I don't know personally about the authors reliability, the reference is new to me, but someone I trust ensures me that it is good enough. You may also want to see Higher-dimensional gamma matrices that tells exactly the same story.

  1. ^ Weinberg 2002, p. 216 harvnb error: multiple targets (2×): CITEREFWeinberg2002 (help).
  2. ^ de Wit & Smith 1986, p. 680 harvnb error: multiple targets (2×): CITEREFde_WitSmith1986 (help)
  3. ^ de Wit & Smith 1986, p. 680 harvnb error: multiple targets (2×): CITEREFde_WitSmith1986 (help)

Witt algebra and Polynomial loop algebra combined

edit

Let L be a polynomial loop group with simple compact Lie group G and Lie algebra g, and let W be the Witt group. Then, if Σ : G → GL(V) is a faithful representation of G, actions of both L and W may be defined and subsequently combined. The set on which the action is to be defined is

 

the set of smooth functions from the circle to V. G acts on F by

 

This extends in a natural way to an action of L on F, since elements l of L are maps from S1 into G,

 

The Witt group acts by

 

Both of these last two constructs yield representations (and not merely group actions). Now consider the ordered pair (w, l) and define its action on F to be

 

Here, all action takes place in End F, the linear operators on F. In order to save bandwidth and the eyes of the reader, it is necessary to identify elements of L and W with their images in End F. Examination of (w1, l1)(w2, l2)f(z) suggests the multiplication law

 
Motivation for group multiplication law

By this definition

 

Then

 

Now

 

hence

 

With this replacement,

 

In other words,

 

and the tentative group multiplication law

 

suggests itself. It might be noted that l1w2L.

This is indeed a group operation, with inverse operation

 
Proof of group properties

For the inverse,

 

For associativity,

 

and

 

By using (a·b) ∘ c = ac · bc in the first equations last expression, one finds that these are indeed equal.

By now, it is established that the pairs (Π(w), Π(l)) do constitute a group with the above multiplication rule. The multiplication rule makes no mention of F, the representations are faithful, and hence one may define the operations directly on pairs (w), l} ∈ E = W × L and forget about F. L is normal in E,

 

and W and L intersect in (eW, eL). Thus E is the semidirect product[nb 4]

 

The Lie algebra then is the set wl. The Lie bracket between elements of l and w follow from the explicit expressions

 

from which it follows

 

Conformal group

edit
 

Together,

 

Relativistic string

edit

The coordinates X of a string,

 

take on their values in generalized Minkowski space of dimension  . For each   they comprise a  -vector and reduce to a 4-vector for  . Define the coordinate partial derivatives

 

Define also the differential forms

 

and

 

as well as

 

Then

 

Metric

edit

The metric on the world sheet of a string can be obtained in several ways. One "intuitive" way is to simply compute the line element. Thus, consider an infinitesimal displacement

 

Then the line element is

 

Static gauge

edit

One obvious possibility is to put   in some Lorentz frame. Then

 

for fixed   comprises a string in space and a two-dimensional surface in spacetime called the world sheet for non-fixed  . This is called the static gauge. The tangent vector of a path traced out by a point particle is timelike or lightlike since it travels with a speed less than or equal to the speed of light. Thus require in the neighborhood of each of its points there exists an infinitesimal displacement along the surface which points in the time-like or light-like direction.

One may impose the constraints that the tangent vectors should be orthogonal,

 

and normalized,,

 

at any point  . One consequence of these conditions is that the ends of the string move at the speed of light. Another is the fact that only the motion of the string perpendicular to itself is dynamically significant.

Remarks

edit
  1. ^ Bargmann 1964: "In recent years several papers have appeared in which a proof of Wigner's theorem is presented (Uhlhorn (1963) Lamont & Mendelson (1963)) To this writer most of these proofs seem unsatisfactory in one significant aspect: They obscure the quite elementary nature of Wigner's theorem (Wigner (1959)). In addition, some authors state - or imply - the view that it is desirable, if not necessary, to depart from Wigner's construction in order to arrive at a simple or rigorous demonstration of his theorem. This writer, on the contrary, has always felt that Wigner's construction provides an excellent basis for an elementary and straightforward proof."
  2. ^ Here the possibility of superselection rules is ignored. It may be the case that a system cannot be prepared in specific states. For instance, superposition of states with different spin is generally believed impossible. Likewise, states being superpositions of states with different charge are considered impossible. Minor complications due to those issues are treated in Bogoliubov, Logunov & Todorov (1975)
  3. ^ A unitary operator is automatically linear, Bäurle & de Kerf (1999, p. 342). Likewise an antiunitary transformation is necessarily antilinear.
  4. ^ Here, the normal term is placed on the right to comply with the other definitions. This is (once pointed out) allowed.

Notes

edit
  1. ^ Mermin (1968) Chapter 17
  2. ^ Bäuerle & de Kerf 1999, Postulate 17.3.2.
  3. ^ Misner, Thorne & Wheeler 1973
  4. ^ Sard 1970
  5. ^ Landau & Lifshitz 2002
  6. ^ Federer 1996, Section 2.2.1
  7. ^ Federer 1996, Definition section 2.1.
  8. ^ Vestrup
  9. ^ Federer 1996, Section 2.1.
  10. ^ Federer 1996, Section 2.1.
  11. ^ Vestrup 2007, Section 3.2.
  12. ^ Wigner 1931, pp. 251–254 (in German),
    Wigner 1959, pp. 233–236 (English translation).
  13. ^ Wigner 1939
  14. ^ Wigner 1931
  15. ^ Chevalier 2007
  16. ^ Chevalier 2007
  17. ^ Weinberh 2002, p. 49
  18. ^ a b Bäuerle & de Kerf 1999, p. 341
  19. ^ de Kerf & Bäuerle 1999
  20. ^ Weinberg 2992, p. 50
  21. ^ de Kerf & Van Groesen 1999, p. 342
  22. ^ de Kerf & Van Groesen 1999, p. 343
  23. ^ de Kerf & Van Groesen 1999, p. 346
  24. ^ a b Weinberg 2002, p. 82 harvnb error: multiple targets (2×): CITEREFWeinberg2002 (help)
  25. ^ Bäurle & de Kerf 1999, pp. 347–349
  26. ^ Weinberg 2002, Section 2.7. harvnb error: multiple targets (2×): CITEREFWeinberg2002 (help)
  27. ^ Wigner 1931
  28. ^ Bargmann 1964
  29. ^ Molnár 1999a
  30. ^ Mouchet 2013
  31. ^ Györy 2004
  32. ^ Molnár 2000a
  33. ^ Molnár 1999b
  34. ^ Molnár 2000c
  35. ^ Molnár 2000c
  36. ^ Bargmann 1964
  37. ^ Sharma & Almeida 1990 harvnb error: multiple targets (2×): CITEREFSharmaAlmeida1990 (help)
  38. ^ An overview is given in Chevalier 2007.
  39. ^ Uhlhorn 1963
  40. ^ Gehér 2014
  41. ^ In 1945 Harish-Chandra came to see Dirac in Cambridge. He became convinced that he was not suitable for theoretical physics. Harish-Chandra had found an error in a proof by Dirac in his work on the Lorentz group. Dirac said "I am not interested in proofs but only interested in what nature does." Harish-Chandra later wrote "This remark confirmed my growing conviction that I did not have the mysterious sixth sense which one needs in order to succeed in physics and I soon decided to move over to mathematics." Dirac did however suggest the topic of his thesis, the classification of the irreducible infinite-dimensional representations of the Lorentz group.Dalitz & Peierls 1986
  42. ^ See Wigner's paper or proof outline above
  43. ^ a b Weinberg 2002 harvnb error: multiple targets (2×): CITEREFWeinberg2002 (help) Section 2.2.
  44. ^ a b c d e f g h Greiner & Reinhardt 1995 Chapter 9.
  45. ^ Greiner & Reinhardt 1995 This follows from properties of the Pauli-Jordan function. See &para:¶ 4.4-4.6 for a detailed account of all invariant functions in use for the scalar theory.
  46. ^ For definitions of the norm and the topology induced by the norm, as well as the SOT and the WOT, see Conway 1990 harvnb error: multiple targets (2×): CITEREFConway1990 (help).
  47. ^ Borsten, Brádler & Duf 2014 They formulate it as "without violating other established principles of physics, in particular, the no-signaling condition imposed by special relativity".
  48. ^ Borsten, Brádler & Duf 2014.
  49. ^ Borsten et al. 2010
  50. ^ Borsten, Brádler & Duf 2014 See section 2 for examples and references.
  51. ^ Clauser et al. 1969
  52. ^ The reason for the notation γ5 is because that set of matrices (ΓA) = (γμ, 5) with A = (0, 1, 2, 3, 4) satisfy the five-dimensional Clifford algebra {ΓA, ΓB} = 2ηAB. Tong 2007, p. 93 harvnb error: multiple targets (2×): CITEREFTong2007 (help).
  53. ^ The notation is particularly appropriate because the set {γ0, γ1, γ2, γ3, γ5} provides a Clifford algebra in five spacetime dimensions. Weinberg 2002, p. 218 harvnb error: multiple targets (2×): CITEREFWeinberg2002 (help).

Mathmatics

edit

Differential geometry

edit
  • Lee, J. M. (1997). Riemannian Manifolds – An Introduction to Curvature. Springer Graduate Texts in Mathematics. Vol. 176. New York · Berlin · Heidelberg: Springer Verlag. ISBN 978-0-387-98322-6.
  • Wolf, Joseph A. (2010). Spaces of Constant Curvature (6th ed.). Providence, Rhode Island: AMS Chelsea Publishing. ISBN 978-0821852828.

Linear algebra

edit

Functional analysis

edit

Topological vector spaces

edit
  • Schaefer, H. H. (1999). Topological vector spaces. Graduate texts in mathematics. Vol. 3. H. H. Wolff coauth. (2nd ed.). New York: Springer Verlag. ISBN 978-0387987262.

Group Theory

edit
  • Pontryagin, L. (1987) [1939]. Gamkrelidze, R. V. (ed.). Topological Groups. Classics of Soviet Mathematics. Vol. 2 (3rd ed.). CRC Press. ASIN B010WFQHA4.
  • Rossmann, Wulf (2002). Lie Groups - An Introduction Through Linear Groups. Oxford Graduate Texts in Mathematics. Oxford Science Publications. ISBN 0-19-859683-9.

Representation theory

edit

Lie algebra

edit

Harmonic analysis

edit

Fix 76/75

Measure theory

edit

Measure and integration

edit
  • Vestrup, E.M. (2003). The theory of measures and integration. Wiley series in probability and statistics. John Wiley & sons. ISBN 978-0471249771.

Qunantization

edit

Supersymmetry

edit

References

edit

Physics

edit

Classical electromagnetism

edit

String theory

edit

ADS/CFT correspondence

edit

Statistical physics and thermodynamics

edit

Mechanics

edit

Quantum field theory

edit
  • Aitchison, I. J. R; Hey, A. J. G. (2004). Brewer, D. F. (ed.). Non-Abelian Gauge Theories: QCD and the Electroweak Theory. Gauge Theories in particle physics: A practical introduction. Vol. II (3d ed.). Bristol·Philadelphia: Institute of Physics Publishing. ISBN 0-7503-0950-4.

Relativity

edit
  • Minkowski, Hermann (1908–1909), "Raum und Zeit"  [Space and Time], Physikalische Zeitschrift, 10: 75–88 {{citation}}: Check date values in: |date= (help) Various English translations on Wikisource: Space and Time

History

edit
  • Bradley, James (1727–1728). "A Letter from the Reverend Mr. James Bradley Savilian Professor of Astronomy at Oxford, and F.R.S. to Dr.Edmond Halley Astronom. Reg. &c. Giving an Account of a New Discovered Motion of the Fix'd Stars". Phil. Trans. R. Soc. 35 (406): 637–661. doi:10.1098/rstl.1727.0064. S2CID 186214147.

General

edit

Quantum mechanics

edit
  • Shankar, R. (1994). Principles of Quantum Mechanics (2nd ed.). ISBN 0306447908.

Main

edit

Other

edit

Last

edit

Inönü–Wigner contraction

edit

Gamma matrices

edit

History and biographies

edit

See also

edit

Supersymmetry literature for the perplexed.