From the Poincaré and Galilei Algebras to the Spacetime Metric
Overview¶
We start from the abstract Lie algebras of the Poincaré and (bare) Galilei groups, defined purely by their commutation relations, and derive the invariant metric on the spacetime they act on. We do everything at the level of the algebra, using nothing but commutators and the Leibniz rule.
Conventions: 3+1 dimensions, no factors of in the structure constants, signature , units . For Galilei we use the bare algebra (no central charge), which is sufficient for the spacetime metric.
The companion document Lie Groups I starts from explicit matrix generators and solves the same invariance condition. Here we derive everything from the commutation relations.
Strategy¶
The translation subspace as a representation¶
Let with , the 4-dimensional subspace of translation generators inside the full Lie algebra.
The homogeneous generators act on the whole algebra by the adjoint action
A direct inspection of the Poincaré or Galilei commutation relations (given in Parts I and II below) shows that for every generator and every ,
so the adjoint action restricts to a linear representation of the homogeneous algebra on .
Group invariance ⇒ Leibniz condition¶
Exponentiate. The group element acts on by conjugation:
A symmetric bilinear form on is invariant under the group action if
Differentiate at . Using and the bilinearity of ,
Invariance forces this derivative to vanish:
for every generator and every . This is the Leibniz condition: invariance is the statement that acts as a derivation on with zero output.
It is a necessary condition. It is also sufficient: the infinitesimal Leibniz condition (at ) implies the finite group invariance for all . To see this, define
Differentiating at an arbitrary :
By closure of under the adjoint action, , so the Leibniz condition (which holds for every pair of vectors in , not just the basis) gives identically. Hence is constant, , for all . So the infinitesimal Leibniz condition is equivalent to finite group invariance along each one-parameter subgroup, and composing these recovers invariance under every element of the identity component of the group.
Writing with , the Leibniz condition becomes a finite system of linear equations on the 10 components of .
Spacetime as a homogeneous space¶
So far we have treated the translation subspace as if its basis carried a preferred geometric interpretation: generates translation along the -th spacetime direction, and the parameters in are spacetime coordinates. This step needs justification, because the Lie algebra by itself does not announce which of its elements are “translations” and which are not.
The proper framework is that of a Klein geometry. Given a Lie group and a closed subgroup , the coset space
is a smooth manifold on which acts transitively, with playing the role of the stabilizer of a chosen basepoint. At the algebra level this corresponds to a vector-space decomposition
where is the Lie subalgebra of (the isotropy at the basepoint) and is a complementary subspace identified with the tangent space . The pair is what defines the geometry; the Lie algebra alone is not enough.
Reductive and symmetric Klein geometries¶
The vector-space splitting needs a word of qualification, because the rest of this document silently relies on a non-trivial property of it.
A Klein geometry is called reductive if the splitting can be chosen so that is preserved by the adjoint action of — equivalently, at the algebra level,
Note that is not required to be a Lie subalgebra; only that for maps back into itself.
If, in addition,
the geometry is called symmetric. Every symmetric Klein geometry is reductive; the converse is false.
Why reductivity matters here. Each step of our construction needs to act on separately from the rest of :
The Leibniz invariance condition asks for an -invariant bilinear form on . This presupposes that for , , i.e. exactly .
The Maurer–Cartan splitting into vielbein and spin connection, used to translate the algebraic metric on into a metric in coordinates (see the derivation in Part IV), needs an -equivariant projection , which exists iff the geometry is reductive.
Without reductivity, “invariant form on ” is not even well-posed, because every choice of complement to in would lose pieces of back into under . The whole machinery of this document is specifically a recipe for reductive Klein geometries.
Are there non-reductive Klein geometries? Yes, and they describe important geometries — they are simply not the ones where a metric is the primary invariant. The canonical examples are the parabolic geometries, where is a parabolic subgroup of a semisimple :
| Geometry | Primary invariant on | ||
|---|---|---|---|
| Conformal | parabolic | conformal class (no preferred metric) | |
| Projective | affine stabiliser of a line | projective class of connections | |
| CR | parabolic | complex tangent distribution + Levi form |
In each case the parabolic has a Levi decomposition with unipotent, and the nilpotent pieces in prevent any choice of complement from being -invariant. The right invariants on are no longer first-order tensors on (such as a metric) but live in filtrations of and capture geometric structure one or more derivative orders higher. This is the world of Tanaka–Morimoto–Čap–Slovák parabolic geometry, the modern home for conformal and projective geometry.
All six examples in this document. Poincaré, Galilei, , , , and — together with their close cousins (de Sitter, anti-de Sitter, hyperbolic space) — are all not just reductive but symmetric. That is why each one admits a canonical metric (or pair of metrics) determined by an algebraic invariance condition, and why the construction works so cleanly. The bracket dichotomy
— flat vs. constant curvature, as summarised in the table after Part IV — is the symmetric-space dichotomy. The cases where it fails (e.g., neither inclusion holds, or in any complement) take us out of symmetric Klein geometry and, in the most interesting non-reductive case, into parabolic geometry.
Many familiar spaces are constructed this way: the sphere as , Euclidean space as , hyperbolic space as , and — the case at hand — Minkowski spacetime as and Galilean spacetime as . In each case the Lie group encodes the symmetries, and the chosen subgroup encodes “what fixes the origin”.
For the Poincaré algebra, the choice of subalgebra is essentially canonical. The Levi decomposition reads
with the radical being the unique maximal abelian ideal. Both and the Levi complement are determined up to conjugation by the algebraic structure alone. So Minkowski spacetime is intrinsically attached to the Poincaré algebra.
For the bare Galilei algebra, the situation is genuinely ambiguous. The maximal abelian ideal turns out to be — six-dimensional, since and one verifies that bracketing this subspace with any generator stays inside it. The standard “spacetime translations” form a strictly smaller four-dimensional abelian ideal, and indeed there are two natural Klein pairs:
, — giving the four-dimensional Galilean spacetime;
, — giving the six-dimensional phase space (position × velocity).
Both are valid homogeneous spaces of the Galilei group. The Lie algebra alone does not single one of them out; the physical identification of spacetime as the space of events fixes the choice — that is, rotations and boosts are the transformations that leave a chosen event in place.
To see this more explicitly, note that the choice of Klein pair is fixed operationally, by specifying what kind of object the points of the homogeneous space are meant to be. An event in physics is a localised occurrence: a flash at a place at a moment. Two observers at different positions or with different velocities may label it by different coordinates, but it is the same physical thing. Asking which of the ten Galilei transformations does NOT move an event gives a clean answer:
| Transformation | Effect on an event | Fixes event? |
|---|---|---|
| (spatial translation by ) | the flash is now at a different place | ✗ |
| (time translation by ) | the flash is now at a different time | ✗ |
| (rotation around the event) | the flash stays where and when it is | ✓ |
| (boost by ) | only the observer’s velocity changes; the flash is unchanged | ✓ |
So the transformations that fix an event are exactly and those that move events are exactly . By the orbit–stabilizer theorem, the space of events is
with acting on it transitively.
Repeating the same procedure with a different physical primitive gives a different homogeneous space. Take inertial worldlines as the primitive (a worldline is a constant-velocity trajectory ):
| Transformation | Effect on an inertial worldline | Fixes worldline? |
|---|---|---|
| (rotation around the time axis) | the basepoint worldline (the time axis) is invariant | ✓ |
| (time translation) | slides along the worldline; the worldline as a set is unchanged | ✓ |
| (boost) | changes the velocity — different worldline | ✗ |
| (spatial translation) | shifts to a parallel worldline | ✗ |
The stabilizer of a worldline is thus and the resulting homogeneous space is the 6-dimensional space of inertial worldlines, parametrised by . This is the underlying manifold of classical Galilean phase space, equivalently .
One can take this further. Choosing as primitive “an event together with a velocity vector at it” gives a 7-dimensional homogeneous space , with coordinates — Souriau’s evolution space of classical mechanics. Choosing the empty stabilizer recovers the full 10-dimensional group itself. Every choice of physical primitive thus gives a homogeneous space; the Lie algebra accommodates them all, and physics picks one by saying what kind of object the points are.
Two remarks are in order. First, the Klein construction returns only the smooth manifold — not the symplectic structure that makes the 6D space a Hamiltonian phase space. The symplectic form involves the mass (since ), which is not present in the bare Galilei algebra; it enters via the Bargmann central extension, where one of the commutators is modified to with a central generator. So bare Galilei gives the 6D manifold; Bargmann gives the symplectic structure. More generally, phase spaces in physics arise as coadjoint orbits of (centrally extended) symmetry groups — the Kirillov–Kostant–Souriau picture — in which the Klein-pair construction supplies the manifold and the central extension supplies the symplectic geometry.
Second, among all these candidates events are special because they are the most local and operational primitives: point-like, observable in principle by a single localised detection, and requiring no derived notion of velocity, mass, frame, or trajectory. Worldlines, phase-space points, and evolution-space points are all built out of events (equivalence classes of events under time translation, of worldlines under further equivalences, and so on). This operational primacy of events is what makes spacetime the natural arena for the laws of physics, and the other homogeneous spaces the natural arenas for derived constructions such as Hamiltonian dynamics or Lagrangian mechanics.
In the Poincaré case this whole subtlety disappears. The only abelian ideal is the spacetime translations , so there is no analog of the 6-dimensional abelian ideal and no competing Klein pair. The “events as primitive” choice is canonical because it is the only choice. The Galilei algebra is ambiguous precisely because — the same single algebraic difference that destroys non-degeneracy of the metric is what permits two distinct Klein pairs to exist in the first place.
The reason the translation parameters can be used directly as global coordinates on spacetime — in both the Poincaré and the standard Galilei case — is that in each case is an abelian ideal. Two consequences follow:
Because is abelian, the exponential map is a Lie-group isomorphism: is the additive group , with coordinates .
Because is an ideal complementary to , every coset has a unique representative in , giving a global diffeomorphism .
This is what makes Minkowski and Galilean spacetimes flat affine spaces — a special feature of the algebras at hand. For a non-abelian or non-ideal — for instance — the exponential map is not a global bijection and the resulting homogeneous space is curved. The further generalization to spaces that are not even homogeneous is the framework of Cartan geometry, where the homogeneous model varies smoothly from point to point; this is how curved spacetimes of general relativity are described, modelled point-wise on flat Minkowski space.
Finally, with the Klein-geometry picture in place, the connection between the abstract bilinear form derived below and the spacetime metric is straightforward. The translation parameters transform under the homogeneous generators in exactly the same way as the basis , since the conjugation at the group level differentiates to the same adjoint action at the algebra level. Hence the matrix that solves the Leibniz invariance condition on is also the matrix of the invariant bilinear form on the spacetime coordinates — i.e., the metric.
Part I: The Poincaré algebra → Minkowski metric¶
Commutation relations¶
The Poincaré algebra has ten generators with
We use and label accordingly: , , .
Imposing invariance¶
Apply the Leibniz condition with and the indicated indices.
:
:
So
, both spatial:
which is automatic from .
Rotations impose no further constraint: the diagonal isotropic form already commutes with them.
Result¶
Setting :
It is non-degenerate, , so the inverse is also invariant and is the same matrix.
Part II: The bare Galilei algebra → degenerate metrics¶
Commutation relations¶
The bare Galilei algebra has the same generators with two changes from Poincaré (boxed):
The two differences — and — are what change Minkowski into the Galilean structure.
Temporal metric (bilinear form on )¶
Apply the Leibniz condition with :
:
:
The spatial block is killed.
: (automatic).
Only survives. Rotations leave it untouched. Setting :
This is the spacetime interval : absolute time.
Spatial metric (symmetric 2-tensor on )¶
The temporal metric is degenerate (), so it has no inverse. But the Galilei algebra admits a second, independent invariant: a symmetric contravariant 2-tensor
The Leibniz condition for is derived exactly as before. The group element acts on on both factors:
Invariance for all , differentiated at , gives
for every generator . This is the Leibniz condition on a contravariant tensor — note that the same rule applies, but now contracted against rather than evaluating on .
Reading off coefficients of the basis with :
For Galilei , the only nonzero commutator is , i.e. . The invariance condition reduces to
Choosing , free : . Choosing , free : . Together: the entire time row and column of vanish.
The remaining spatial block is rotation-invariant only if proportional to . Setting the proportionality to 1:
This is the Euclidean 3-metric on each slice of simultaneity.
Why two metrics for Galilei?¶
The two invariants live in different spaces:
Both are degenerate, so neither is the inverse of the other:
For Poincaré the corresponding and forms are both non-degenerate Minkowski — and they are mutual inverses, so the distinction collapses to a single metric.
That no non-degenerate exists for Galilei is visible directly from the calculation: the boost condition forces the entire spatial block to vanish, leaving .
Part III: Euclidean spaces — and ¶
The Poincaré and Galilei algebras are only two members of a much larger family. The Lie algebras of Euclidean isometries — and the Lie algebras of spherical isometries — admit the same Klein-pair + Leibniz-invariance analysis, and produce the standard Euclidean and spherical metrics. We work them out in 2 and 3 dimensions to show how generic the construction is.
Two-dimensional Euclidean space ¶
The Euclidean algebra has three generators , with
Klein-pair candidates. The proper subalgebras of , up to conjugacy, are:
: is the whole 3-dim group manifold .
: 1-dim, generated by a rotation. Gives .
for : 1-dim, generated by a translation. All such are conjugate under rotation. Gives .
: 2-dim abelian (and an ideal). Gives , the circle of orientations.
Of these, the choice that gives “the standard of points” is . The operational reason is the orbit–stabilizer analysis of which transformations fix a point: translations move points to other points, while a rotation centered on the point itself leaves the point fixed. So , , and
Metric derivation. Define the symmetric bilinear form on . Three independent components: . The Leibniz condition is imposed for every ; here only :
Plugging in the brackets:
: .
: .
Hence for an overall positive scale (an arbitrary unit of length squared). Choosing :
This is the standard Euclidean metric on . Because is an abelian ideal, the exponential map gives a global diffeomorphism , and on becomes on .
Translating to polar coordinates. Polar coordinates make the same point that was sharper on : the coordinate form of the metric depends on the chart, while the underlying bilinear form does not. We run the five-step Maurer–Cartan procedure used for , now on instead of . The output is the familiar polar-coordinate metric of flat — and the connection comes out flat by direct calculation, confirming that the space is flat despite the coordinate-dependent metric components.
Step 1: Coordinate-defining section. Pick the basepoint at the origin and define polar coordinates algebraically by the section
translates the basepoint by along the 1-direction (giving point ); then rotates by around the origin (giving ). These are standard polar coordinates by construction. The section is regular on ; at the -coordinate is ill-defined, exactly as for the polar chart on .
Step 2: Maurer–Cartan form. Compute .
gives, after multiplying on the left by ,
(since ). Next, gives
The Lie-algebra calculation: ; the next bracket closes the series. Hence
Step 3: Read off the vielbein. With and , split :
Reading off vielbein components :
equivalently the coframe
Step 4: Killing vector fields. Using , i.e. (-coefficient of ) and (-coefficient) :
: (computed above), so , , giving as expected.
: (rotation by in the -plane, since acts on as a 90° rotation), giving
: similarly , giving
These are the standard polar-coordinate expressions for the Cartesian translation Killing fields .
Step 5: The metric in coordinates. The bridge from form on to tensor field on is identical to Step 5a of the case: . Expanding,
Choosing the conventional unit :
This is the flat Euclidean metric of , written in polar coordinates. The coordinate components do depend on , even though the space is flat — the is the squared length of at radius , in exact analogy with the of .
Why this is flat — verified at the algebra level. The Cartan curvature 2-form of the connection on the model space is
Here , so (since is a constant element of the algebra), and trivially. Hence
i.e., zero sectional curvature everywhere. By contrast, on one gets , whose exterior derivative is non-zero — precisely the constant positive curvature .
So the polar-coordinate exhibits the moral lesson sharply: the coordinate components of the metric vary, but the curvature (read off the connection 1-form, an algebraic Lie-algebra object) vanishes. The bracket-level distinction is doing exactly the work that makes flat space flat.
Cartesian derivation: ¶
The derivation above was the traditional one — it used a section with the rotation generator inside (as a “fiber direction”) and a translation generator radially. The same machinery accepts another natural choice: skip the rotation entirely and use only the translations,
Geometrically, this is a single translation by the vector from the basepoint. Because , this is the same as in either order — the two translations commute, so no Baker–Campbell–Hausdorff correction arises. Let us run the four-step algorithm with this section.
Step 1 — structure constants¶
The Klein pair is unchanged: , . The non-zero brackets are
The crucial novelty is the – bracket: . We will see exactly what this triviality does to the algorithm.
Step 2 — Maurer–Cartan form¶
We need . Use the standard affine representation of :
The Lie-algebra element has in this representation (both translation matrices are strictly upper-triangular and their product vanishes). So the exponential series terminates after one term:
This is just the affine matrix for translation by — as expected. The inverse:
The bottom row of is zero, so the matrix product leaves the upper-right column untouched:
Decomposed into the - and -parts :
The vielbein is the identity matrix: . The spin connection vanishes identically. Both consequences trace back to — the BCH formula contributes no nonlinear correction, and the rotation generator never appears in .
Step 3 — invariance equation¶
The invariance equation is the same one we solved in the polar derivation. With a rotation in the -plane, the symmetric matrix satisfying is
A single positive scale parameter — the same answer as before. The invariance condition is a property of the Klein pair and does not depend on which section was chosen.
Step 4 — assemble the metric¶
with , :
Setting for the standard normalization, we recover the Cartesian metric of in the most direct possible way. The intrinsic geometry is identical to the polar result — it is the same flat metric in different coordinates — but the coordinate components are now constants instead of being -dependent.
Why this works globally¶
Three structural features make Cartesian coordinates better-behaved than polar in this example:
Single-step exponential. is well-defined for all , with no coordinate singularity (compare polar: is a singularity).
Identity vielbein. everywhere. No point is excluded from the chart.
Vanishing spin connection. globally on the chart, so no curvature term is even present.
All three are consequences of one algebraic fact: . The translations commute, so the BCH formula reduces to the linear sum, no -generator can appear in , and the exponential map is a global diffeomorphism. This is the algebraic signature of flatness with a globally trivial chart. We will see in the next subsection that the same condition is also what makes the position vector globally well-defined on .
Killing vector fields¶
The three Killing fields generated by acting on the section. Using with vielbein the identity:
produces .
produces .
produces .
These are exactly the three Killing fields familiar from elementary geometry — two translations and a rotation around the origin.
Christoffel symbols, frame vectors, and the position vector¶
Once the metric is in hand, everything familiar from elementary vector calculus in polar coordinates can be read off it. We give the short version here for completeness, then turn to the harder question of how to compute the position vector without going through Cartesian coordinates — a question for which the Klein-geometry section does exactly the right thing.
Curvilinear basis vectors¶
The coordinate basis vectors are and . Their lengths and inner products come straight from the metric:
So is already a unit vector, while has length . The unit (orthonormal) frame vectors are obtained by normalizing:
This is exactly the orthonormal frame produced by the Klein construction: the dual of the coframe satisfies , giving
How came from the Maurer–Cartan form¶
Briefly recapping the polar derivation above so the rest of this section is self-contained. We chose the section , computed
and split it via , . The -coefficients are the vielbein components — i.e., the coefficient of in :
Writing ,
Christoffel symbols¶
For , (others zero), with inverse , , the formula
has only one non-zero metric derivative, . Direct substitution gives two non-zero connection coefficients:
All others vanish. (Check: ; .) These reproduce the familiar coordinate derivatives of polar unit vectors:
and the Riemann tensor is identically zero (since has only one non-zero term and the second-derivative combinations cancel), consistent with the algebraic flatness computed earlier.
The position vector — derived without Cartesian coordinates¶
Now the structural question: in flat space, there is a distinguished object — the position vector — that points from the origin to the current point. Its existence depends on the flatness of the space (translations form an abelian Lie algebra , and the exponential map is a global diffeomorphism). One can also see this from coordinates: in the position vector is naturally , which uses the global Cartesian chart. The challenge is to compute at a given polar coordinate point from the algebra alone, without parametrizing through Cartesian.
The Klein-geometry section gives the answer directly. The key step is to rewrite as a translation times a rotation:
where is a Lie-algebra translation and is the residual rotation at the destination point. Once we have , the position vector is just identified with a vector in : it is the translation that moves the origin to the current point.
The decomposition. Start from our section and insert the identity factor between the two exponentials:
using the identity (conjugation acts on the exponent by the adjoint). The bracket structure on — , — gives, by the same series we used in Step 4 of the polar derivation,
Therefore the section factorizes as
and we read off
The vector is the position vector — its components in the algebraic basis of are exactly . So Cartesian coordinates have re-emerged as a purely algebraic byproduct: they are the components of the position vector in the algebra basis of . We never had to invoke an independent Cartesian chart; the Klein structure delivers it automatically when is abelian.
Expressing at the current point. The above lives in , naturally an algebra element. To view it as a tangent vector field on — as an arrow attached to the current point — push it through the translation Killing fields, whose polar-coordinate expressions we have from Step 4 of the polar derivation:
Substituting:
The component cancels exactly by the Pythagorean identity. So in polar coordinates,
The familiar elementary result is recovered, but the derivation used only the algebra (the bracket relations and the section), not Cartesian coordinates. Two observations close the section:
Why polar coordinates make the position vector look so simple. In the local orthonormal frame , at the point , the position vector has frame components — one non-zero entry. This is because the section first translates along and then rotates: the rotation aligns with the radial direction by construction, so the entire displacement shows up in the slot.
Why the position vector exists only in flat space. The decomposition relied on the fact that, in , the translation stays a single translation under conjugation by the rotation — there is no “tilting term” added. Equivalently, sends to itself as an honest linear map without picking up -pieces, and so the exponential map is a diffeomorphism. On the second condition fails: , so does not exhaust the group, the exponential map is only local, and there is no globally well-defined “position vector to the basepoint.” The position vector is the algebraic signature of flatness.
Three-dimensional Euclidean space ¶
The algebra has six generators () with
Klein-pair candidates. The conjugacy classes of subalgebras include:
: gives , the standard .
: an abelian ideal, gives , the space of rotations .
: 1-dim, gives .
: 2-dim, the “screw axis” subgroup; gives .
And several others.
The standard comes from by the same orbit–stabilizer argument: a point is moved by translations and fixed by rotations centered on it. So and
Metric derivation. Set , a symmetric matrix (6 unknowns). The Leibniz condition with is
Picking and running through the off-diagonal/diagonal pairings as in the 2D case forces and ; then gives . Hence
the standard 3D Euclidean metric. By the same reasoning as in 2D, the abelian-ideal structure of makes flat with global coordinates , and becomes .
All Klein-pair candidates for ¶
The four candidates listed above are illustrative, not exhaustive. Here we enumerate the complete classification of proper subalgebras up to -conjugacy, compute the metric in each reductive case, and show with equations that the orbit– stabilizer argument singles out as the unique “space of points”.
Notion of conjugacy. We classify subalgebras up to the adjoint action of itself: two subalgebras are equivalent if one is obtained from the other by rotating axes (conjugation by ) and/or shifting the origin (conjugation by translation). This freedom lets us always orient a distinguished axis along and place rotation centers at the origin.
Full classification. Working through bracket closure, the -conjugacy classes of proper non-trivial subalgebras are:
| Reductive? | family on | Non-deg? | Geometric meaning | |||
|---|---|---|---|---|---|---|
| 0 | yes (trivial) | 21 params (left-inv) | yes | 6 | group manifold | |
| 1 | yes | 7 | no | 5 | frames mod -translation | |
| 1 | yes | 5 | yes | 5 | (point + direction) | |
| 1 | , | yes | 3 | no | 5 | frames mod screw of pitch |
| 2 | no | — | — | 4 | (no reductive metric) | |
| 2 | yes | 2 | no | 4 | space of oriented lines in | |
| 3 | yes | 1 | yes | 3 | , space of points ★ | |
| 3 | no | — | — | 3 | as coset space (no Leibniz metric) | |
| 3 | yes | 1 | no | 3 | space of oriented 2-planes () | |
| 3 | screw-, | no | — | — | 3 | (no reductive metric) |
| 4 | no | — | — | 2 | (directions in ) |
There are no 5-dim proper subalgebras: any 5-dim would require either with (but has no 2-dim subalgebra), or a Lie-section with image disjoint from except along one direction — which one can check has no solution.
Reductivity. Each case is checked by computing for the natural complement . The four non-reductive cases fail because some bracket lands back in :
: .
: .
screw- with : any candidate complement fails the closure constraint (worked out below).
: for lies inside .
For these, the Leibniz algorithm does not produce a -invariant metric on — although the coset space itself still makes sense as a manifold.
Metric for each reductive case. Solving for each on the chosen (these are short computations, easily verified by a SymPy script analogous to the one above; results stated in basis order):
, : . 1 parameter, non-degenerate. The Euclidean metric.
, : invariance under (90° rotation in both and planes) leaves 5 free parameters: , , , , . Non-degenerate for generic (with giving the most natural choice).
, : invariance under both and gives 2 parameters , , but . Degenerate (translation block vanishes).
, : 1 parameter . Degenerate on .
, : 7-parameter family, but the entire -block is forced to zero by . Degenerate.
with : 3 parameters; the extra contribution gives the constraint , so the translation block dies for any . Degenerate.
: Leibniz is vacuous; any inner product on the 6-dim defines a left-invariant Riemannian metric on . Requiring full bi-invariance (ad-invariance under all of , in particular under translations) gives the constraint — so the only bi-invariant form on is degenerate. Thus admits many left-invariant Riemannian metrics, but no bi-invariant Riemannian one.
The non-degeneracy pattern. Three of the eight reductive cases give a non-degenerate metric: , , and . These are exactly the cases with (no translations in ).
This is not coincidence. Take any and any with non-zero rotation part. Applying the Leibniz condition with , , for any translation :
so . As ranges over rotations in , the bracket spans the translation directions in , forcing for all translations in . The translation block dies. So:
A non-degenerate -invariant Riemannian metric on exists only when contains no translations.
This narrows the candidates to , , and .
The stabilizer: what it is, and what each candidate fixes¶
So far has been treated as a piece of algebra. Each has a vivid geometric meaning, however: it is the stabilizer of some configuration in , and the homogeneous space is the space of all configurations of that type. Once we make this precise we can compute, for each candidate above, what is “fixing”, and then argue unambiguously why is the physically correct choice.
Definition of the stabilizer¶
Suppose a Lie group acts on a set (whose elements we call configurations — they could be points, lines, planes, frames, ...). For any , the stabilizer subgroup of is
i.e., all group elements that leave untouched. It is automatically a closed subgroup of . Differentiating at the identity gives the stabilizer subalgebra
i.e., all infinitesimal generators whose flow leaves fixed instantaneously.
The orbit–stabilizer correspondence¶
If the -action is transitive on the orbit of , the smooth bijection
makes into “the space of all configurations of type ”:
if is a point, then is the space of all points;
if is an oriented line, then is the space of oriented lines;
if is an oriented plane, is the space of oriented planes;
if is a frame (point + 3 orthonormal vectors), is the space of frames.
So choosing a Klein pair is equivalent to choosing what geometric object the homogeneous space is made of. The different reductive subalgebras of are not “competing answers to one question”; they answer different questions about .
How to read the stabilizer from the algebra¶
For acting on , the standard infinitesimal generators act on a point by
For a general element , the infinitesimal motion of is
Whether fixes a given configuration is then a linear check:
fixes a point iff ;
fixes a line as a set iff and (translation along, rotation about);
fixes a plane as a set iff translations lie in the plane and rotations are about ;
fixes a direction (an “unattached arrow”) iff , i.e., ;
fixes a frame (point + 3 orthonormal vectors at it) iff and , i.e., .
Using the conjugation freedom (orient the distinguished axis along , place rotation centers at the origin) we can read off the maximal fixed by each candidate at a glance.
Stabilizer of each candidate¶
For each Klein-pair candidate we list the richest (highest-content) configuration such that every fixes . The resulting is then “the space of all -type configurations”:
| What each does to | Maximal fixed by | = “space of...” | ||
|---|---|---|---|---|
| 0 | nothing | a full orthonormal frame (point + 3 directions) | frames (6-dim) | |
| 1 | translates along | a horizontal foliation ; no point fixed | frames mod -shift (5-dim) | |
| 1 | rotates about -axis through origin | origin + direction (a “pointed direction”) | (point, direction)-pairs (5-dim) | |
| 1 | , | screws along with pitch | the -axis as an unparameterized line | screw axes of pitch (5-dim) |
| 2 | translates in -plane | the horizontal foliation; no point fixed | (non-reductive — no metric) | |
| 2 | rotation + translation along | the -axis as an oriented line | oriented lines in (4-dim) | |
| 3 | all rotations about origin | the origin (a single point) | points of ★ | |
| 3 | all translations | nothing is fixed | (non-reductive — no metric) | |
| 3 | rotation about + translation in | the plane (as oriented set) | oriented planes (3-dim) | |
| 3 | screw-, | screwy version of | a “twisted plane” with helical pitch | (non-reductive — no metric) |
| 4 | rotation about + all translations | only the direction (no point, no line) | directions (2-dim) |
A few rows deserve a sanity check:
fixes the origin. Every , while no nonzero translation fixes the origin. So . ✓
fixes “origin + direction”. fixes the origin (rotation about origin) and fixes (rotation axis), but not any other direction. ✓
fixes the plane as a set, not pointwise. shift the plane along itself; rotates it within itself. But move every point of the plane to a different point of the plane — so no single point is fixed, only the plane as a whole. ✓
fixes nothing. For any configuration , at least one translation moves it. Translations act freely on every reasonable geometric structure on . So for every “nothing” — meaning if we want there is no associated configuration in at all. (The resulting space is a space of rotations, not a space “inside ”.)
Why we want : stabilizing a point¶
Every row of the table above defines a perfectly self-consistent Klein geometry. They differ in what kind of object the homogeneous space is made of. So the question “which is right?” only has an answer if we say what we want to be a space of. The physical answer is unambiguous: is the space of points — places where a particle can sit, where a field can take a value, where an event can occur. This singles out exactly one row:
We want to stabilize a single point and nothing more.
Among the candidates, is the unique one fitting this description, for the following reasons.
The space of points is logically prior. Lines, planes, frames, directions, etc., are all constructed from points (a line is a 1-parameter family of points; a plane is a 2-parameter family of points; a frame is a point with extra data). One cannot conversely reconstruct points from, say, the space of planes without extra information. Among our candidates, only has “points” as its elements; the others have higher-order objects.
“Stabilize a point” is precise. It means: at our chosen base point, every transformation in leaves that point exactly where it is. Translations move points; rotations about the point do not. So for “point” must consist of rotations about that point only. In 3D that’s exactly . (Any sub-algebra of would stabilize the point plus a preferred axis, contradicting “and nothing more”.)
Stabilizing a plane gives a different space. If — translations in the plane and rotations in the plane — then is the space of oriented planes in , not the space of points. This is a perfectly real and useful space (used in the theory of integral geometry, foliations, Radon transforms), but it is not what physicists call “physical space”. A plane is a 2-parameter family of points; the moduli space of all such planes is 3-dimensional but topologically , not .
Stabilizing a line gives the space of oriented lines. Choosing stabilizes the -axis. The resulting is the 4-manifold of oriented lines in (, roughly). Again interesting (this is the space of “rays of light” or worldline projections), but again not “points”.
Stabilizing a frame gives the bundle of frames. Choosing stabilizes the entire orthonormal frame at the origin. The space itself is the 6-dim frame bundle of , a useful object (it appears in Cartan geometry as the total space of the principal -bundle), but each “point” of it carries 3 extra direction labels — more information than a position.
is the largest stabilizer that fixes only the point. This is the “maximality” criterion: enlarge as much as possible while keeping the fixed configuration to a single point. Smaller subalgebras of would fix the point plus some extra structure (a preferred axis for , for example), so they fail “and nothing more”. Larger subalgebras (e.g., or ) include translations and therefore fail to fix any point. So in this precise sense is the “stabilizer of a single point” in .
Isotropy at the point. Stabilizing a point with the full (and not less) automatically guarantees that, at that point, all directions are equivalent — the weak isotropy principle discussed below. Stabilizing a plane or line instead would break isotropy by singling out a preferred normal or axis.
The slogan, summarizing all of the above:
as a “space of points where particles can sit” is the Klein quotient , because is the maximal subalgebra of that pins down a single point and nothing more.
Singling out . Among these three non-degenerate candidates:
gives the 6-dim group manifold — the bundle of orthonormal frames, not the underlying space of points.
gives a 5-dim space parameterizing (point, unit vector) pairs — point plus attached direction, not just a point.
gives the 3-dim space of points . ★
The third is picked out by the orbit–stabilizer correspondence:
The space of points is the homogeneous space whose elements are moved simply-transitively by translations alone, with rotations playing the role of the stabilizer at each point.
Concretely, fix the origin . An element (rotation followed by translation ) acts on as
The stabilizer of is
At the Lie-algebra level: rotations centered at the origin fix (), while translations move it (). So the stabilizer subalgebra is
and translations act simply transitively on points (any two points related by a unique translation). The orbit–stabilizer theorem gives
The Leibniz invariance condition with then uniquely (up to scale) selects , and the flat coordinates on give the global .
What translations alone give vs. what rotations add. A natural question is: translations by themselves already act transitively on — any point reaches any other by a unique translation, and the stabilizer is trivial. So why is the full Euclidean group needed at all? What do the rotations contribute?
Concretely, take just the abelian translation group with Klein pair : and . Topologically this still gives — the manifold is the same. But the Leibniz invariance condition
becomes vacuous when , so every symmetric matrix is allowed — a 6-parameter family of translation-invariant constant metrics. Concrete examples all satisfying translation-invariance:
| geometry | |
|---|---|
| standard flat Euclidean | |
| (2+1)-Minkowski-like indefinite form | |
| anisotropic / axis-scaled | |
| any symmetric | some constant-metric “geometry” |
Translation-invariance alone forces the metric to be constant across , but says nothing about its shape. Adding to imposes the additional Leibniz constraints , which collapse the 6-parameter family to a 1-parameter family — unique up to overall scale.
So rotations contribute not transitivity but shape-fixing:
Isotropy — all directions become equivalent (no axis is preferred).
Uniqueness of metric (up to scale) — is the only the rotations preserve.
Orthonormal frames — at each point, rotations distinguish orthonormal bases from skewed ones.
In Erlangen-program language: alone gives a flat affine space (parallelism and straight lines, but no canonical distances or angles); enlarging to promotes it to a flat Euclidean space (distances and angles too). Equivalently, the isometry group of is the full , not just : is the isometry group of every constant-metric flat geometry on , whereas is the one that singles out the round metric.
The weak isotropy principle. What is the minimal physical input that forces the rotation algebra ? Not “preservation of a positive-definite metric” (which presupposes that a metric exists), nor “preservation of length” (which presupposes a notion of length), nor any quantitative geometric structure. The minimal input is a single qualitative statement:
Weak isotropy principle. At each point of space, all directions are physically equivalent: for any two unit vectors in the tangent space, there exists a (closed, bounded) symmetry transformation taking to .
That is: the stabilizer at each point integrates to a closed Lie subgroup that acts transitively on the unit sphere of directions. No assumption about distances, angles, metric, or any quantitative structure is made — only that all directions are “the same”.
Inevitability of . From weak isotropy alone — just “directions are transitive” — the algebra emerges inevitably, by a structural theorem (Borel; Montgomery–Samelson; Onishchik, 1940s–60s):
Every compact connected Lie group acting transitively on a sphere () contains as a closed subgroup.
At the Lie-algebra level: every algebra implementing weak isotropy in dimension contains as a subalgebra, with the commutation relations baked in.
The minimal compact transitive groups in low dimensions:
| ambient | sphere | minimal compact transitive group | algebra | contains ? |
|---|---|---|---|---|
| ✓ (it is ) | ||||
| ✓ | ||||
| ✓ () | ||||
| ✓ () | ||||
| (14-dim, exceptional!) | ✓ (root inside) | |||
| ✓ |
The structural reason: any compact connected non-abelian Lie algebra contains as a subalgebra — one copy for each root in its root system. Transitive action on a sphere of dimension forces non-abelianness (abelian groups have torus orbits, not spherical orbits). So every isotropy algebra in dimension must contain .
Even in dimensions where the full rotation algebra is not required for isotropy — e.g., where (8-dim) is already sufficient, or where the exceptional (14-dim) is — the subalgebra is always present. It is the irreducible algebraic content of “isotropy”.
Everything else is a derived consequence, not an additional assumption. Once is forced, the Leibniz invariance equation with the rotation generators acting on automatically gives:
A unique (up to scale) invariant Riemannian metric on (the calculation done earlier).
Hence a notion of length .
Hence a notion of angle .
Hence the existence of orthonormal frames at each point.
Hence (combined with translation-invariance and flatness from ) the standard global metric $ds^2 = (dx^1)^2 + (dx^2)^2
(dx^3)^2\mathbb{R}^3$.
None of these structures were assumed up front. They are all forced by the single weak-isotropy principle (giving ) plus translation invariance (giving the flat-manifold structure). This is a striking conceptual economy of the Klein-geometric viewpoint: a single qualitative statement — “all directions are equivalent” — generates the entire quantitative scaffolding of Euclidean geometry.
The four 3-dim quotients side-by-side. Each 3-dim Klein quotient corresponds to “things modulo a different stabilizer”:
| Stabilizes... | What parameterizes | Metric | |
|---|---|---|---|
| a point (origin) | points of | flat Euclidean ★ | |
| (no point; identifies all of ) | orientations | non-reductive (bi-invariant from Killing form) | |
| an oriented 2-plane through origin | oriented planes | reductive but degenerate | |
| screw-, | a “twisted plane” with helical pitch | (no clean reductive description) | non-reductive |
The same Lie algebra thus admits several geometrically distinct 3-dim homogeneous spaces. The orbit–stabilizer interpretation uniquely identifies as the space of points, and the Leibniz invariance condition then fixes its metric to be the standard flat one.
The structural pattern is exactly the one we saw for Minkowski: is a semidirect product with an abelian ideal of translations, acts transitively on the space, and the Leibniz invariance condition with respect to fixes the metric uniquely (up to overall scale). The only signature-distinguishing input is the bracket : with acting on by the defining representation, the invariant form is positive-definite Euclidean; with acting on by its defining representation, the invariant form is Minkowski.
Can a single Lie algebra have more than one “point-stabilizing” Klein geometry?¶
In the case above the answer was particularly clean: exactly one reductive subalgebra () fixes a single point and nothing more, so the “space of points” Klein geometry was unique. But this uniqueness is a feature of the semidirect-product structure of , where translations are forced into and rotations are the only candidates for .
For a simple Lie algebra like the situation is genuinely richer: the same algebra can admit several inequivalent reductive point-stabilizers, each producing a different Klein geometry that is still a “space of points”. They differ not in what kind of object the space is made of (all are spaces of points) but in the signature of the resulting metric.
Smallest example: ¶
This is the lowest-dimensional simple Lie algebra where the phenomenon appears. Take generators with brackets
Concretely, generates the compact “rotation” (closed orbit ) and generate non-compact “boosts” (open orbit ). To get a 2-dimensional Klein geometry we need . The 1-dim subalgebras come in three conjugacy classes (by the Iwasawa / Cartan-classification of one-parameter subgroups of ):
| class | representative | type | topology |
|---|---|---|---|
| elliptic | compact rotation | ||
| hyperbolic | non-compact boost | ||
| parabolic | null / unipotent |
For the parabolic class one finds does not close inside for any natural complement, so the Leibniz machinery does not yield a metric (it gives a projective line as ). The first two are both reductive, both give a 2-dim “space of points”, and they yield different signatures.
Case 1: compact stabilizer . Complement . The adjoint action of on in the basis is
Imposing Leibniz invariance for a generic symmetric gives the linear system
so
Riemannian signature . The resulting space is the hyperbolic plane with its negative-curvature Riemannian metric.
Case 2: non-compact stabilizer . Complement . The adjoint action of on in basis :
Imposing Leibniz invariance gives
so
Lorentzian signature . The resulting space is 2-dimensional anti-de Sitter space , a Lorentzian spacetime of constant negative curvature.
(Both calculations are verified by a one-line SymPy script analogous to those for and .)
What just happened, physically¶
The same abstract Lie algebra , with two different choices of “what stabilizes a point”, gave two genuinely different physical spaces:
| stabilizer | topology | signature | curvature | character | |
|---|---|---|---|---|---|
| , compact | Riemannian space | ||||
| , non-compact | Lorentzian spacetime |
Both stabilize a single point — at the base point of either coset space, the stabilizer fixes that point only. The difference is in the kind of “rotation” that does the stabilizing:
A compact rotation (the generated by ) sweeps the tangent plane around in a closed orbit, treating all directions through the point as equivalent up to a continuous rotation. The invariant metric is positive-definite.
A non-compact boost (the generated by ) hyperbolically rescales two transverse directions in opposite ways. The invariant form must be preserved by these “boosts”, which forces it to take the indefinite quadratic-form shape .
The key structural fact is:
The signature of the invariant metric on is determined by whether is conjugate to a compact (orthogonal-like) or non-compact (Lorentz-like) family of transformations.
So which one-parameter subgroup we call “the rotations of space” is what distinguishes a space from a spacetime. The Lie algebra alone does not know.
Higher-dimensional examples¶
The same phenomenon happens for every higher-rank , producing the standard zoo of constant-curvature pseudo-Riemannian geometries:
| Lie algebra | (point-stabilizer) | signature | name | |
|---|---|---|---|---|
| -sphere | ||||
| hyperbolic space | ||||
| de Sitter | ||||
| anti-de Sitter | ||||
| ? | (other quotients) | mixed | ... |
For example (the 3+1 Lorentz algebra), familiar as the homogeneous part of the Poincaré algebra, has two natural reductive point-stabilizers:
: gives , 3-dim hyperbolic space (Riemannian, ). Used in cosmology as the spatial slice of a hyperbolic Friedmann universe.
: gives , 3-dim de Sitter space (Lorentzian, ).
Two completely different physical spaces, both stabilizing a single point, both built from the same Lorentz algebra.
Physical meaning of multiple point-stabilizers¶
This phenomenon is the signature ambiguity of Klein geometry. The Lie algebra encodes only the abstract algebraic structure of the symmetries; it does not, by itself, distinguish “rotations of a Euclidean space” from “boosts of a spacetime”. The choice of stabilizer subalgebra is precisely the additional information that says which generators we want to be “rotation-like” (closed orbits, compact, positive-definite invariant) and which we want to be “boost-like” (open orbits, non-compact, indefinite invariant).
Three concrete ways this matters in physics:
Wick rotation. The relationship between Minkowski space (Lorentzian) and 4-dim Euclidean space (Riemannian) — central to Euclidean quantum field theory — is exactly an exchange of compact-vs-non-compact stabilizers in a complexified version of the Poincaré algebra. Choosing as point-stabilizer of a 5-dim parent gives Minkowski; choosing gives 4D-Euclidean. Same abstract complex algebra, different real forms of the stabilizer.
dS / AdS / Minkowski as deformations. The -dim constant-curvature spacetimes — de Sitter, anti-de Sitter, and Minkowski — are all realized as Klein quotients with and , but with the algebra of chosen differently in each. The cosmological constant of the resulting spacetime is read off from whether closes into with a sign (), a sign (), or vanishes (Minkowski).
Signature change in quantum gravity. Models that allow the universe to “tunnel” between Riemannian and Lorentzian phases (Hartle–Hawking, etc.) make sense precisely because the underlying Lie-algebraic structure has both as Klein quotients of the same complex algebra.
When is the point-stabilizing geometry unique?¶
Putting all of the above together:
For a semidirect-product algebra (Euclidean, Poincaré, Galilei) the point-stabilizer is essentially forced to be — the translations have to go into in order to act transitively, and what is left is the rotation/boost piece. Uniqueness as in our analysis.
For a simple algebra (no nontrivial ideals — like , , etc.) there is generally a family of point-stabilizers indexed by which real form of which subalgebra one chooses. This produces the zoo of constant-curvature pseudo-Riemannian geometries.
Concretely: the number of inequivalent point-stabilizers is the number of -conjugacy classes of “symmetric pair” subalgebras of the right codimension. For a classical of dimension (with ), the pairs with exhaust them, giving all the constant-curvature pseudo-Riemannian geometries of dimension .
In short: the “unique answer” was a special feature of the semidirect-product structure. For simple Lie algebras, the same algebra is the symmetry algebra of a whole family of homogeneous spaces of different metric signature. The choice of point-stabilizer is then a genuine extra physical input (Riemannian space? Lorentzian spacetime? what cosmological constant?) — not a mathematical accident.
Worked example: point-stabilizers of (the Lorentz algebra)¶
As a concrete and physically important illustration, let’s enumerate all point-stabilizing Klein geometries arising from — the homogeneous Lorentz algebra in 3+1 dimensions. This is the same algebra that appears in special relativity as the rotation/boost subalgebra of the Poincaré algebra, and we’ll find that it is simultaneously the isometry algebra of three distinct 3-dimensional homogeneous spaces.
Set-up¶
The six generators are rotations and boosts , with brackets
To get a 3-dim Klein quotient we need . Up to -conjugacy, the relevant 3-dim Lie subalgebras of are:
| basis | type | as a Lie algebra | |
|---|---|---|---|
| compact | rotations | ||
| split | one rotation + two boosts in the transverse plane | ||
| parabolic | rotation + two commuting “null rotations” |
The first two are the little groups of timelike and spacelike unit vectors in respectively; the third is the little group of a null (lightlike) vector — familiar from particle physics as the Wigner little group for massless particles.
Case 1: — compact stabilizer¶
Complement . Bracket check:
: . Reductive. ✓
: . Symmetric pair. ✓
Imposing Leibniz invariance with acting on by the standard rotation representation forces :
Signature — Riemannian. The sign of the bracket is , which gives negative sectional curvature. The resulting space is the 3-dimensional hyperbolic space
i.e., the upper sheet of the unit timelike hyperboloid in , with its negative-curvature Riemannian metric.
Case 2: — non-compact stabilizer¶
Complement . Bracket check (verified by the SymPy enumeration):
closes inside . Reductive. ✓
, , , all in . Symmetric pair. ✓
Solving the Leibniz equations (three invariance equations, one per generator of ) for a generic symmetric on yields
Signature , or equivalently after overall sign — Lorentzian. Why the indefinite signature? Because acts on by mixing the compact rotation generators with the non-compact boost via the boost generators ; the only invariant quadratic form distinguishes “compact-type” (directions ) from “non-compact-type” (direction ) with opposite signs.
The resulting space is the 3-dimensional de Sitter space
i.e., the unit spacelike hyperboloid in , with its constant-positive-curvature Lorentzian metric.
Case 3: — parabolic stabilizer¶
This is the little group of a null vector. Take the standard null vector in ; one finds (and SymPy verifies) the stabilizer is the 3-dim algebra
with structure
That is, — a “rotation” together with two commuting “null rotations” and . Crucially, this Klein pair is non-reductive: by the same argument we made for , any complement to ends up mixing with via the null-rotation generators. The Leibniz equations admit no non-degenerate invariant metric on .
Geometrically, is the future null cone minus the origin in — a 3-dim cone of null vectors that inherits only a degenerate “Carrollian” structure (a vector-line is distinguished at each point, but no full metric). This is the same degenerate situation we saw with , where the would-be “space of oriented planes” had a degenerate metric.
(Projectivizing the null cone — that is, enlarging to the 4-dim parabolic subalgebra — gives the celestial sphere , the asymptotic boundary of Minkowski space. This is a 2-dim space, important for asymptotic symmetries and the holographic / BMS / AdS/CFT story, but it is not a 3-dim space of points so we set it aside here.)
Summary table¶
The complete enumeration of point-stabilizing Klein geometries for :
| metric signature | curvature | character | |||
|---|---|---|---|---|---|
| 3 | Riemannian space | ||||
| 3 | Lorentzian spacetime | ||||
| null cone | 3 | degenerate | — | Carrollian (no metric) |
So has two reductive point-stabilizers giving non-degenerate metrics: one Riemannian, one Lorentzian. (Plus a parabolic “null” stabilizer giving a degenerate-metric space, which is the non-reductive analogue of the situation.)
The picture is exactly the lower-dimensional story amplified by one dimension. Compact stabilizer → Riemannian; non-compact stabilizer → Lorentzian. The Lie algebra itself is “agnostic” about which geometry it is the symmetry algebra of.
Where actually appears in physics¶
In ordinary special relativity, does not act on Minkowski space as a homogeneous group — the homogeneous group of is the Poincaré algebra , of which is the point-stabilizer (the rotation/boost subalgebra fixing the origin). So plays the role of , not of , in the standard relativistic setting.
But also plays the role of the symmetry algebra in two further physically important situations:
as spatial slice of an open FRW universe: in cosmology, a spatially-hyperbolic Friedmann–Lemaître–Robertson–Walker universe has as its constant-time spatial slice, and is the isometry algebra of that slice (acting through case 1 above).
as a 3-dim de Sitter spacetime: in lower-dim quantum-gravity models, the 3-dim de Sitter universe is a Lorentzian spacetime of positive cosmological constant; its isometry algebra is acting through case 2 above.
The same abstract algebra plays three roles in physics — Lorentz stabilizer of Minkowski, isometry of hyperbolic 3-space, isometry of 3-dim de Sitter — distinguished by what we want to be and what we want to be in the Klein pair. The choice of point-stabilizer within is what selects between the Riemannian and Lorentzian geometries.
Compare with in general¶
The pattern continues for all real forms of :
has one reductive point-stabilizer () → only , Riemannian, . (The only other 3-dim subalgebra is in the left factor of , which gives the bi-invariant 3-sphere as a group manifold — topologically the same, but with a slightly different geometric interpretation.) The point: a compact simple Lie algebra has only compact subalgebras, so only Riemannian quotients are available.
: two reductive point-stabilizers as above (, ).
: at least two reductive point-stabilizers yielding (Riemannian, , with ) and (Lorentzian, , with a different embedding).
→ only.
→ (Riemannian) and (Lorentzian).
→ (Lorentzian, ) and a Riemannian geometry of signature .
The general rule: for with , the reductive point-stabilizers are conjugate to with , , , — giving exactly the constant-curvature pseudo-Riemannian geometries of dimension accessible from that Lie algebra.
So having two point-stabilizers (giving and ) is exactly the next step up from having two point-stabilizers (giving and ). Both illustrate the general phenomenon: simple Lie algebras of indefinite signature admit multiple inequivalent “spaces of points” as Klein quotients, distinguished by the compactness or non-compactness of the stabilizer.
Worked example: point-stabilizers of the Poincaré algebra¶
What happens when we put translations back in and ask the analogous question for the full Poincaré algebra (dim 10)? Now the natural “space of points” should be 4-dim Minkowski spacetime , just as was for . The question is whether the point-stabilizer giving this Minkowski geometry is unique, or whether (like taken alone) Poincaré admits several inequivalent “point-stabilizing” Klein geometries.
The answer turns out to be exactly the same as for : the semidirect-product structure forces uniqueness. Minkowski is the unique non-degenerate 4-dim point-space. The reason is the same non-degeneracy filter we derived earlier.
The non-degeneracy filter applies, unchanged¶
The Poincaré brackets relevant for the argument are
Suppose contains some translation and the complement contains some Lorentz generator (a rotation or boost). Acting on , the Leibniz invariance under reads
But for and is again a non-zero translation in (the boost rotates into another translation direction, etc.). So as ranges over the Lorentz part of , sweeps out the translation block of , forcing for all translations . The translation block dies — exactly as it did for .
A non-degenerate -invariant metric on requires that contain no translations: .
This is verified directly by SymPy: e.g., taking (rotations + time translation, dim 4) and complement (dim 6), the Leibniz equations give a degenerate in block form
i.e., the boost block has but the entire spatial- translation block is zero. The metric on has rank 3, not 6.
Among Lorentz subalgebras, only itself has dimension 6¶
For a 4-dim we need . The non-degeneracy filter restricts , which is itself 6-dim. So is the unique point-stabilizer giving a non-degenerate 4-dim spacetime:
Verifying the invariance with SymPy: the Leibniz system $\eta,\mathrm{ad}_X
(\mathrm{ad}X)^T \eta = 0X \in \mathfrak{so}(3,1)\mathfrak{m} = \mathfrak{t}{3,1}$ has the unique (up to scale) solution
with translation block index ordering . Minkowski signature , exactly as expected.
Orbit–stabilizer interpretation¶
Concretely, fix the spacetime origin . The Poincaré action takes , so the orbit of is all of (transitive). The stabilizer of is
At the Lie algebra level: Lorentz transformations through the origin fix (any satisfies ), while translations move to . So exactly. This is the relativistic analogue of “rotations fix the origin of ” — except now “rotations” means Lorentz transformations, which include both compact rotations and non-compact boosts.
Other Klein quotients of Poincaré (lower-dim stabilizers, higher-dim spaces)¶
Although is the unique point-stabilizer, the Poincaré algebra admits many other Klein quotients corresponding to spaces of (event + extra structure). The key recurring fact: such an is a subalgebra of , hence corresponds to one of the cases we already enumerated for :
| What is fixed in Minkowski | as a fiber bundle over | |||
|---|---|---|---|---|
| 6 | 4 | a single event (origin) | base only: Minkowski ★ | |
| 3 | 7 | event + future-pointing 4-velocity | — unit timelike tangent bundle | |
| 3 | 7 | event + unit spacelike direction | — unit spacelike tangent bundle | |
| (parabolic) | 3 | 7 | event + null direction | — projective null-cone bundle |
| 1 | 9 | event + 4-velocity + spatial axis | (point, time-axis, spatial-axis) — almost a frame | |
| 0 | 10 | full Lorentz frame (point + 4 orthonormal vectors) | the entire Poincaré group as a frame bundle |
Each row reuses one of the point-stabilizers from the previous subsection — but now translated up by adding 4 translation directions to . So Poincaré “inherits” the multiplicity from its Lorentz subalgebra: choosing different sub-stabilizers of gives different “tower” Klein geometries over Minkowski with different fibre geometries (, , , frames). These are familiar from physics:
is the mass shell times spacetime — the phase space of a single massive particle in special relativity.
is the bundle of light rays; its projectivization (modding out the affine null direction) is the space of unoriented light rays through Minkowski, basic in optics and twistor theory.
is the bundle of unit-spacelike vectors — the configuration space of a “tachyonic” mode if one wanted such a thing.
So Poincaré has a unique “space of events” Klein quotient, but a family of richer Klein quotients indexed by what extra structure one attaches to each event.
Higher-dim stabilizers (containing translations) give degenerate quotients¶
For completeness: if contains some translations, the non-degeneracy filter fails. The two physically most natural examples:
(rotations + spatial translations, dim 6). This is the little group of a future- pointing timelike vector in the affine action. The Klein quotient is 4-dim, parameterized by — three boost rapidities and one time offset. Geometrically, this is the space of inertial worldlines starting at the origin, or equivalently the Galilean limit boundary of Minkowski as . The Leibniz equations admit only a degenerate metric (Galilei has two invariants, the temporal and spatial metrics, as derived earlier in Part II).
is not a subalgebra because . There is no “stabilize a spatial hyperplane ” subalgebra of — boosts necessarily mix time-translations with space-translations. To stabilize a hyperplane one must drop the boosts, recovering Galilei.
The non-existence of a “spacelike hyperplane” Klein quotient of Poincaré is the algebraic content of the relativity of simultaneity: there is no Poincaré-invariant notion of “the space at time ”.
Structural comparison¶
The whole story so far is captured by a single table:
| algebra | structure | # of reductive “point-stabilizers” | resulting spaces |
|---|---|---|---|
| semidirect: | unique: | Euclidean only | |
| semidirect: | unique: | Minkowski only | |
| simple | two: , | Riemannian, Lorentzian | |
| simple | two: , | Riemannian, Lorentzian | |
| (conformal) | simple | several | compactified Minkowski, , , , ... |
The structural punchline:
Semidirect-product (kinematical) algebras have a unique point- stabilizing Klein geometry — the metric is forced. Simple Lie algebras have a family of point-stabilizing geometries — Riemannian and pseudo-Riemannian constant-curvature variants of various signatures.
Physically, this matches the role of these algebras in physics:
Newtonian / Euclidean physics uses , which produces uniquely. No metric ambiguity.
Special relativity uses , which produces Minkowski uniquely. No metric ambiguity.
Cosmology / quantum gravity / AdS-CFT, when working with constant- curvature spacetimes, use . These come in pairs (Riemannian and Lorentzian for each curvature sign), and the choice of point-stabilizer is part of the model.
In particular, the uniqueness of Minkowski as the Klein quotient of the Poincaré algebra explains why “special relativity is special”: the algebra has only one geometric interpretation, and the metric signature is forced — not assumed — by the algebraic structure alone.
Worked example: point-stabilizers of the full 10-parameter Galilei algebra¶
Continuing the parallel with Poincaré, we now examine the full 10-parameter Galilei algebra , with the homogeneous Galilei algebra (rotations + Galilean boosts) and the 4-dim abelian ideal of time + spatial translations.
Generators and brackets¶
Ten generators: (3 rotations), (3 Galilean boosts, generators of ), (time translation), (3 spatial translations). The brackets of the bare Galilei algebra (no Bargmann central extension) are
The key qualitative differences from Poincaré:
Galilean boosts commute with each other: , not the of Lorentz. This is the algebraic signature of .
Boosts commute with spatial translations: , where Poincaré had . (Reinstating the central extension would give , the Bargmann algebra of non-relativistic quantum mechanics.)
Boosts produce spatial translations from time: . (This is the algebraic backbone of “moving frames pick up a position offset ”.)
Non-degeneracy filter (still applies)¶
The exact same Leibniz argument as for and Poincaré shows: any containing a translation kills the corresponding block of the metric on . So a non-degenerate -invariant metric (if it existed) would force .
For a 4-dim (the natural “Galilean spacetime”) we need , and is exactly 6-dim. So is the unique candidate, just as for Poincaré.
What the metric looks like¶
Take and . The bracket check:
, ✓
, ✓
So reductive. Solving the Leibniz invariance with SymPy (or by hand — rotations give standard so(3) constraints, the boost gives with taking and the spatial ) yields
This is rank 1: only the temporal block survives, and the entire spatial block is forced to zero by boost-invariance. There is no non-degenerate Galilei-invariant metric on .
What the degenerate metric means physically¶
The Galilei algebra forbids a single non-degenerate 4-dim metric. Instead it admits two compatible invariant tensors, derived in Part II of this document:
A temporal 1-form (the rank-1 covariant tensor above), measuring time intervals between events.
A spatial inverse metric on the cotangent bundle, restricted to spatial covectors (those annihilating ), measuring spatial distances within a simultaneity slice.
Together form the Newton–Cartan structure on Galilean spacetime. The reason both must exist independently is precisely that boost-invariance forces the spatial-covariant block to vanish: boosts “tilt” simultaneity slices into each other’s, so no Galilei-invariant 4-dim metric on the tangent bundle can distinguish “simultaneous” from “non-simultaneous” pairs of points.
Point-stabilizer uniqueness for Galilei¶
Despite the metric being degenerate, the point-stabilizer remains unique by exactly the same semidirect-product argument:
| Quantity | Galilei value |
|---|---|
| Algebra | |
| Total dim | 10 |
| Translation ideal | 4-dim () |
| Homogeneous part | 6-dim () |
| Unique point-stabilizer | , dim 6 |
| Quotient | Galilean spacetime (4-dim) |
| Invariant metric | degenerate rank-1 temporal + spatial inverse |
Galilean spacetime is therefore unique as a Klein quotient of the Galilei algebra — exactly as Minkowski is unique for Poincaré and is unique for . The semidirect-product structure with abelian translation ideal forces the answer in all three cases. What distinguishes Galilei from Poincaré is not the choice of — it’s the bracket between the stabilizer and translations:
Poincaré: AND (boosts couple time and space symmetrically) → non-degenerate of signature .
Galilei: but (boosts couple time to space, but not back) → degenerate with rank 1.
The “missing bracket” in Galilei is the algebraic content of the limit, and it is exactly what makes the resulting spacetime metric degenerate.
Other Klein quotients of Galilei¶
As with Poincaré, Galilei admits many other Klein quotients of higher dimension corresponding to attaching extra structure to each event:
| Geometric interpretation | |||
|---|---|---|---|
| 6 | 4 | Galilean spacetime ★ | |
| 3 | 7 | event + 3-velocity (Galilean phase space) | |
| (abelian) | 3 | 7 | event + spatial axis-orientation |
| 2 | 8 | event + axis direction (rotational + boost about ) | |
| 0 | 10 | bare Galilei group as frame bundle | |
| 6 | 4 | (non-reductive — has translations) |
The first row is the standard Galilean spacetime. The second row, with , gives 4-dim base + 3-dim Galilean boosts as fibre — the Galilean phase space , the configuration space of a single non-relativistic particle (event + 3-velocity).
Worked example: point-stabilizers of (compact case)¶
For contrast with the indefinite-signature and , let’s examine the compact algebra . Its action is the symmetry of the round 3-sphere . How many inequivalent point-stabilizers does it admit?
Set-up: ¶
The Lie algebra is dim 6 and famously not simple: it splits as a direct sum of two commuting copies of ,
Concretely, writing generators of as antisymmetric 4×4 matrices (), one can form two commuting triples and , satisfying
This decomposition is the compact analogue of and is not present for (which is simple as a real Lie algebra, even though over the complexification).
The 3-dim subalgebras up to -conjugacy¶
For a 3-dim Klein quotient ( of dim 3, hence “-like”) we need . Up to -conjugacy, the only 3-dim subalgebras are isomorphic to (this is the only compact simple 3-dim Lie algebra). Their embeddings in are parameterized by Lie-algebra homomorphisms . Up to conjugacy these are:
| label | embedding | properties |
|---|---|---|
| left factor | , an ideal of | |
| right factor | , an ideal of | |
| diagonal | , not an ideal |
Under the outer automorphism of (swapping the two factors), the left and right embeddings are equivalent. So there are two conjugacy classes: factor (ideal) and diagonal (non-ideal symmetric pair).
Case 1: — the round ¶
Use the change of basis (diagonal) and (anti-diagonal). One computes
So with and :
: acts on as the standard 3-dim rotation. Reductive. ✓
. Symmetric pair. ✓
Leibniz invariance with acting on as standard rotation forces .
Signature — Riemannian. The bracket (positive sign) gives positive sectional curvature. The result is the round 3-sphere:
Case 2: — non-effective!¶
Here is one of the two ideals of . With :
(the factors commute). Reductive, yes; but the isotropy action is trivial.
. NOT a symmetric pair (the bracket lands in , not ).
The Leibniz invariance condition is vacuous — any symmetric on is automatically -invariant. SymPy confirms: a 6-parameter family of metrics.
But there’s a catch — this Klein pair is not effective. Because is a non-trivial ideal of , the action of on has a non-trivial kernel: the factor of acts trivially on every coset. Effectively, only the factor acts, and it does so simply- transitively (= as left translation on the group manifold ).
So the “Klein geometry” is really just the group manifold with the half-redundant - labelling: the left part doesn’t do anything. The 6-parameter family of allowed metrics is the family of left-invariant metrics on the group manifold — including the round metric (the bi-invariant one, when both factors are scaled equally) but also the Berger spheres (anisotropic left-invariant metrics).
In the standard Klein-geometry framework one requires Klein pairs to be effective — i.e., contains no non-trivial ideal of . Excluding the two factor embeddings on this basis leaves only as a legitimate point- stabilizer, giving the unique round .
Summary table for ¶
| dim | effective? | metric | ||
|---|---|---|---|---|
| 3 | yes (symmetric pair) | round, unique up to scale ★ | ||
| (or ) | 3 | no (ideal) | as group | 6-param family of left-invariant metrics |
| 2 | yes | oriented 2-Grassmannian of | 4-dim symmetric space | |
| 1 | yes | 5-dim Stiefel-like space | various | |
| 0 | yes | frame bundle | 6-dim, left-invariant |
Among effective 3-dim Klein pairs there is exactly one point- stabilizer for , giving the unique round 3-sphere. The non-effective factor-embedding case is more naturally viewed as the group manifold , which is the same topological but is no longer a symmetric space.
Why only one (vs. two for )?¶
The key contrast:
is compact — all its subalgebras are compact — so all its invariant metrics are positive-definite. There is no analogue of “the boost subalgebra” that could yield a Lorentzian quotient.
Whereas , being of indefinite signature, contains both a compact subalgebra and a non-compact subalgebra, giving Riemannian and Lorentzian respectively. The same dimensions, but compact yields one Riemannian space; indefinite yields a Riemannian + Lorentzian pair.
The general pattern, then, for the orthogonal real forms:
| type | # of effective point-stabilizers giving non-degen 3-dim quotient | spaces | |
|---|---|---|---|
| compact | 1 | (Riemannian, ) | |
| indefinite | 2 | , | |
| indefinite | 2 | , | |
| same | same |
Worked example: the Euclidean “Poincaré” algebra ¶
We now combine the previous two examples: take as the homogeneous algebra and adjoin 4 translation generators with the standard Poincaré-like commutation relations. The result is the Euclidean Poincaré algebra
This is a perfectly well-defined 10-parameter kinematical algebra — the same dimension as (Poincaré) and the Galilei algebra. The question is: what spacetime geometry does it produce, and why isn’t it the physically realized one?
Brackets¶
Write the 10 generators as (3 spatial rotations), (3 “Euclidean boosts”, i.e., rotations in the planes), (time translation), (3 spatial translations), with and . From and with Euclidean , the brackets are
The two boxed signs are the algebraic difference between the three 10-parameter kinematical algebras:
| algebra | ||
|---|---|---|
| Lorentz (Poincaré) | (non-compact boosts) | |
| Galilei | 0 (commuting boosts) | 0 (decoupled) |
| Euclidean | (compact boosts!) |
The Euclidean version differs from Lorentz by a single sign in — exactly the Wick-rotation flip .
Point-stabilizer and metric¶
The non-degeneracy filter forces . We need , and is exactly 6-dim. So just as in Poincaré and Galilei,
Solving Leibniz invariance for the symmetric metric on (SymPy verified) gives
Signature — fully Riemannian 4-space. The Klein quotient is
i.e., flat 4-dim Euclidean space with the round metric . This is the Wick-rotated Minkowski space of Euclidean QFT.
Physical problem #1: velocity addition can give zero or negative velocity¶
In Lorentz, a boost acts on as the Lorentz “rotation” , with rapidity . In the Euclidean case it acts as a literal circular rotation:
For a particle worldline , the boosted worldline has velocity
Identifying the frame velocity of the boost as , the Euclidean velocity addition law is
Compare:
| group | addition law | comments |
|---|---|---|
| Galilei | linear, unbounded | |
| Lorentz | stays in | |
| Euclidean | singularities and sign flips |
The Euclidean law has three pathologies:
Pole at : adding two velocities of magnitude 1 gives infinite velocity. There’s a vertical-asymptote in the addition law.
Sign reversal for : adding two positive velocities gives a negative result. Example: gives .
Periodic return to rest: repeated boosting cycles through . The boost subgroup is , which is compact: a “boost by angle ” returns the velocity to zero.
The geometric picture is clean: where is the rotation angle in the plane. As runs around the circle:
| description | ||
|---|---|---|
| 0 | 0 | rest |
| 1 | “diagonal” worldline | |
| worldline becomes horizontal in | ||
| crosses over: velocity flips sign | ||
| -1 | ||
| 0 | back to rest! |
Two boosts add their angles: . So composing a boost of with another of gives , i.e., — the moving frame is at rest with respect to the original. Two non-zero boosts compose to no boost.
Physical problem #2: no causal structure¶
The “boost” rotation by exactly swaps time and space axes:
So time and space are interchangeable by a symmetry of the algebra. There is:
No invariant notion of “future” vs. “past”: no light cones, no causal ordering.
No distinction between “particle” worldlines and “spatial” curves: any smooth curve can be rotated into any other.
No fixed mass shell — only with , which is a 3-sphere in momentum space (compact!), and the “energy” is bounded () rather than the unbounded Lorentzian hyperboloid.
This is incompatible with what we actually observe: the universe has a preferred causal structure (events have unambiguous past/future relations, signals do not propagate faster than , energy is bounded below but not above).
Why it “works for small speeds”¶
Expand the Euclidean addition law for small :
Compare with the other two:
To first order in , all three agree with Galilei: . The differences appear only at :
Lorentz: correction has negative sign → velocities saturate at .
Euclidean: correction has positive sign → velocities accelerate past any bound, eventually overflow into the pole, and reverse.
Galilei: no correction at all → strict additivity.
This is why all three kinematical algebras agree with everyday experience. To distinguish them experimentally one must reach speeds where the cubic correction is detectable. The Michelson–Morley experiment and many later observations rule in favor of Lorentz (negative sign, ). The Euclidean version has never been observed because the cubic correction would have to enhance velocity addition rather than suppressing it.
Connection to Wick rotation and Euclidean QFT¶
Although is not the kinematical algebra of physical spacetime, it plays a crucial role in quantum field theory. The Wick rotation analytically continues Lorentzian quantities to Euclidean ones, replacing with and converting
turning the oscillatory path integral into a well-defined statistical- mechanics partition function. The two algebras share the same complexification
so analytic continuation is meaningful and many calculations are easier to perform in the Euclidean version. But the physical algebra is the Lorentzian one, recovered by rotating back.
Summary¶
| Property | |
|---|---|
| Algebra | , dim 10, semidirect |
| Point-stabilizer | , unique (semidirect rigidity) |
| Klein quotient | with metric , signature |
| Boost subgroup | , compact |
| Velocity addition | — pathological |
| Causal structure | none (time and space interchangeable) |
| Small-velocity limit | matches Galilei and Lorentz to first order in |
| Physical role | Wick rotation in QFT; not a physical kinematical group |
So joins our table as a fifth 10-parameter kinematical algebra with a unique point-stabilizer, giving Euclidean . The semidirect-product rigidity holds — but the metric it forces, while non-degenerate and a perfectly good Riemannian metric, has the wrong signature to describe spacetime. The signature of the metric is not a free choice: it is dictated by the bracket , and Lorentz vs. Euclidean vs. Galilei are the three algebraically distinct options.
Updated structural table¶
Combining all examples done in this document so far:
| algebra | structure | # eff. pt-stabilizers | resulting “spaces of points” |
|---|---|---|---|
| semidirect, abelian translations | 1 | (Euclidean) | |
| semidirect, abelian translations | 1 | (Euclidean) | |
| semidirect, abelian translations | 1 | Galilean spacetime (degenerate metric) | |
| semidirect, abelian translations | 1 | Minkowski , signature | |
| semidirect, abelian translations | 1 | Euclidean , signature — pathological as kinematical | |
| simple, indefinite | 2 | , | |
| simple, compact | 1 | ||
| semisimple (= sum of two ideals), compact | 1 (effective) | ||
| simple, indefinite | 2 | , | |
| (conformal) | simple, indefinite | several | comp. Minkowski, , , ... |
Three rules emerging:
Semidirect-product algebras with abelian translation ideal always have a unique point-stabilizer, namely itself. The metric on is determined by the action of on — and may be Lorentzian (Poincaré), Riemannian (Euclidean , Galilei spatial part), or degenerate (Galilei full metric). Uniqueness of the quotient does not mean uniqueness of the physical interpretation: is mathematically fine but physically wrong; the signature of the metric alone selects which kinematical algebra is realized.
Simple compact algebras , , etc. have a unique effective point-stabilizer, yielding the corresponding sphere or other compact Riemannian symmetric space.
Simple indefinite-signature algebras with have multiple point-stabilizers, yielding Riemannian / Lorentzian pairs (or larger families) of constant-curvature pseudo- Riemannian symmetric spaces.
The semidirect-product cases are the “kinematical” algebras of physics (Galilei, Newton–Hooke, Carroll, Poincaré, , de Sitter, Bargmann), each yielding a unique spacetime geometry. Of these, only Poincaré, Galilei, and (with caveats) Carroll / Newton–Hooke describe physically realized regimes; is ruled out empirically by the velocity-addition pathologies derived above. The simple indefinite cases are typically the conformal extensions or the cosmological-constant modifications of these, where multiple geometric interpretations coexist.
The converse question: which algebras give as a quotient?¶
The point-stabilizer enumerations above answered a question of the form “given an algebra , what spaces are its Klein quotients?” The natural converse is: “given a space — say — what algebras have it as a Klein quotient?”
We have spent the entire R³ part of this document working with one answer: . Is this the only way? Up to choice of preserved structure, no — there is in fact a hierarchy of Klein realizations of (the underlying space) , with sitting in the middle.
The hierarchy¶
Order the realizations by what they preserve:
| invariant structure | metric? | ||
|---|---|---|---|
| straight lines, parallels | affine — no metric, no angles | ||
| angles, infinitesimal circles | conformal class of Euclidean | ||
| distances and angles | Euclidean, unique up to scale ★ | ||
| translations only | any constant symmetric tensor (no constraint) |
All four have the same 3-dim underlying smooth manifold . What differs is the invariant tensors the symmetry group preserves:
Affine geometry : the stabilizer is all of (9 parameters), preserving the origin. The invariants are parallel transport and straightness — but distances and angles transform arbitrarily. No invariant metric exists. This is the natural setting for, e.g., classical mechanics in absolute time with no preferred metric (just an affine connection).
Conformal geometry : enlarging the isotropy by dilations and special conformal transformations adds 4 generators to , giving a 7-dim stabilizer . The Euclidean metric is preserved up to conformal rescaling. Acting on the conformal compactification , this geometry is also called Möbius geometry.
Euclidean geometry : the metric is rigid (up to global scale). This is our standard setting.
Translation geometry : no rotation stabilizer, no isotropy condition. Every constant symmetric tensor is an invariant — 6 free parameters — so the Euclidean is one allowed choice but no longer forced.
Each row higher up is “more general” — a bigger symmetry group means fewer invariants and a coarser notion of geometry. Each row lower down is “more rigid” — smaller symmetry means more allowed metrics.
Klein’s Erlanger Programm on a single space¶
This is precisely Felix Klein’s 1872 Erlanger Programm in action: a geometry is determined by its symmetry group, not just by its underlying set. The same point-set supports several inequivalent geometries — affine, conformal, Euclidean, ... — each presented as a quotient for a different Lie algebra.
The Erlangen hierarchy has a clean algebraic structure: each “smaller” geometry’s Lie algebra embeds into the “larger” one:
with strict inclusions giving strictly weaker invariants. The realization is singled out as the minimal algebra preserving the Euclidean metric — i.e., the unique isometry algebra of flat Euclidean 3-space.
Same topology, different geometry: Heisenberg and friends¶
The bottom row of the table above is the “trivial” stabilizer with acting on itself by translation. But we could also take with a non-abelian 3-dim Lie group:
| what looks like | |
|---|---|
| (abelian) | flat 3-space with any left-invariant metric |
| Heisenberg group () | topologically; only sub-Riemannian metric |
| solvable groups | topologically; left-invariant Riemannian, non-flat |
Topologically these are all , but as Riemannian manifolds they are different. The Heisenberg group with its Carnot–Carathéodory metric is the standard example of a non-Riemannian sub-Riemannian space — a 2-dim “horizontal distribution” within the 3-dim tangent space, with the third direction reachable only through brackets. This is not the Euclidean we use as physical space.
So “” is ambiguous unless one specifies the geometric structure. The Klein-pair disambiguates it as flat Euclidean 3-space.
Flat limits: as a contraction of and ¶
A different and important sense in which other algebras “give ” is by Inönü–Wigner contraction. The simple compact algebra and the simple indefinite algebra both contract to in the limit where the curvature scale :
Concretely, rescale the “translation-like” generators where is the appropriate boost/rotation generator of or . Then
The bracket vanishes in the limit, so the curved-space algebra becomes . Geometrically, and “flatten out” to as their radius is sent to infinity. So is the unique representative of the one- parameter family of constant-curvature isotropic 3-spaces — each member of the family () is a distinct Klein quotient, but they are connected by smooth deformation of the Lie bracket.
as a subspace of higher-dim Klein quotients¶
A final, distinct way in which shows up: as a subspace of 4-dim spacetime quotients.
Galilean spacetime : each simultaneity slice is a copy of Euclidean , foliating the spacetime.
Minkowski : each spacelike hyperplane is a copy of with the induced positive-definite metric. (Choice of which hyperplane is not Poincaré-invariant — different inertial frames slice the spacetime differently.)
Newton–Cartan, Carroll: similar story with different boost actions.
In each case is an -orbit inside the larger 4-dim Klein quotient, not the full quotient. The bigger algebra acts on the spacetime as a whole; restricting to the spatial slice gives back .
Algebraically: in Galilei or Poincaré, the subalgebra is not closed (because leaks into the translation block). So one cannot directly write — the would-be stabilizer is not a subalgebra. The spatial appears only as a foliation leaf, not a Klein quotient of the full kinematical group.
Summary¶
For “ as a 3-dim manifold with full rotational and translational symmetry”, the answer is unique:
— this is the unique reductive, effective, fully isotropic, flat Klein quotient with a non-degenerate Riemannian metric. Other Klein pairs that produce “” do so by:
Preserving less structure — affine , conformal , or pure translation . Same underlying set, weaker invariants.
Preserving a different structure — Heisenberg gives the same topology but sub-Riemannian (not Euclidean) geometry.
In the contraction limit — or contract to as the curvature radius diverges.
As a subspace — appearing as a slice or orbit inside a higher-dim Klein quotient (Galilei, Minkowski, ...).
Choosing as “the” algebra of is therefore a choice of how much symmetry to demand: full Euclidean isotropy fixes both the algebra and the metric uniquely. The Klein program’s deeper message is that the pair , not the underlying set, is what defines the geometry.
Part IV: Spheres — and ¶
The Euclidean cases above are flat because the translation generators satisfy . Replacing the Euclidean algebra by changes exactly this bracket: now , the homogeneous space acquires constant positive curvature, and one obtains the round sphere . The metric on is computed by the same Leibniz condition, and turns out to have the same form ; the curvature lives in the bracket , not in the metric formula itself. This is the canonical example of how the Klein/Cartan framework separates the shape of the metric (controlled by ) from the curvature of the space (controlled by ).
One subtlety should be flagged in advance, because the spherical case makes it impossible to ignore: the metric the Leibniz condition produces on is the metric in an orthonormal frame — equivalently, the metric at the basepoint in the basis given by the Lie-algebra generators. It is not the metric in some chosen coordinate chart. For the flat cases above, the two agree, because is an abelian ideal and exponentiation gives global Cartesian coordinates in which the orthonormal frame coincides with the coordinate basis. For , this is no longer true: no single coordinate chart is orthonormal everywhere, so the coordinate form of the metric differs from . We treat this carefully in below.
The 2-sphere ¶
The algebra has three generators with
Klein-pair candidates. The proper subalgebras of are:
: is 3-dimensional (the group manifold , or for the double cover).
Any 1-dim subspace for a unit vector — all such are conjugate under . Gives .
No 2-dim subalgebras exist (any pair has generically outside ).
So the only nontrivial 2-dimensional Klein geometry of is for some axis , and by conjugacy all choices are equivalent. Picking to fix a “north pole,” we take
Operational interpretation. generates rotations around the -axis. A point at the north pole is fixed by these rotations and moved by the other two generators (which tilt the pole toward the - or -axis). So “the points of the sphere” form the homogeneous space with this stabilizer.
Reductive check. We need , so that the Leibniz condition makes sense as a condition on . Compute:
What about ? Compute . So : this is the hallmark of a symmetric space — and it is what produces the curvature, as discussed below.
Metric derivation. Set for , three unknowns. The Leibniz condition with :
: .
: .
So on , exactly as for the 2D plane. This result needs careful unpacking, because read naively it appears to say that the metric on is flat — which is wrong.
What this answer is, and what it is not. The Lie-algebra basis for is a basis of one tangent space — namely , the tangent space at the basepoint (north pole). The derivation gives the components of the metric in that basis on that one tangent space. By -equivariance, the same components reappear at every other point if we use the basis obtained by transporting around via the group action: that is an orthonormal frame , in which the metric is globally.
This does not contradict being curved. By a standard theorem, every Riemannian manifold admits an orthonormal frame at every point; in that frame the metric components are always . The curvature does not live in the values of the metric in such a frame — it lives in how the frame must twist as one moves from point to point, encoded in the connection 1-form . This is the same fact that underlies the equivalence principle in general relativity: at any point one can choose coordinates in which and , even on a strongly curved spacetime.
Translating to spherical coordinates. To recover the familiar form we must express the -invariant metric in the coordinate basis rather than the Lie-algebra-derived frame . The bridge is the vielbein (here a zweibein) — a soldering form that gives the components of the coframe in coordinates. Crucially, the entire bridge can be obtained from the abstract Lie algebra alone, without any embedding of in . We carry that derivation out now.
Step 1: Define coordinates by a coset section. A point of is an equivalence class . To turn this into a coordinate chart we pick a smooth map that selects one representative in each class — a coset section. The natural section adapted to the basepoint (north pole) is the algebraic prescription
This is purely algebraic: tilts the basepoint by an angle in the -plane (giving colatitude ), then rotates by around the chosen axis (giving longitude ). No embedding in has been used — only one-parameter subgroups generated by Lie algebra elements. The coordinates are by definition the parameters of this section.
Step 2: Compute the Maurer–Cartan form of the section. For any smooth map , the pullback of the Maurer–Cartan form is
with . Using the Baker–Campbell–Hausdorff identity and, more usefully, the identity on a product of exponentials, one computes:
(Sketch: , so . For , we get , which by acting on — using , — gives .)
Step 3: Read off the vielbein. Split into its -part (the soldering form / vielbein) and its -part (the spin connection). With and :
Writing with for gives the vielbein components
equivalently the coframe (up to an orientation flip of that absorbs the sign of )
This is the orthonormal coframe in its bare (scale-free) form, derived now from the Maurer–Cartan calculation rather than asserted. The overall scale from the Leibniz derivation will be reinstated when we assemble the metric in Step 5.
Step 4: Killing vector fields. The vector field on generated by is the unique field whose tangent vector at matches the infinitesimal left-action of on the section. In coordinates,
i.e. one expresses in the basis , keeps the -components, and inverts the vielbein to read off . The adjoint action is a pure Lie-algebra calculation:
and similarly for . The -projection -coefficient , -coefficient ) and the vielbein inversion (, ) then yield
These are the Killing vector fields of , derived from the bracket relations of and the choice of section, with no use of any embedding.
Step 5: The metric in coordinates. We now turn the bilinear form on the single vector space into a (0,2)-tensor field on the manifold , and evaluate it explicitly in .
5a. From form on to tensor field on . At each point the vielbein, viewed at , is a linear isomorphism , — it sends a tangent vector to its components in the orthonormal frame. Pulling the bilinear form back along this isomorphism gives a bilinear form on :
As a (0,2)-tensor field on this reads
The choice is -equivariant by construction: the section threads the basepoint frame consistently across the manifold, and the single bilinear form is used at every point in that frame. So is automatically the unique -invariant metric extending from the basepoint.
5b. Expand the index sum. With and diagonal, only the two diagonal terms survive:
Writing for the symmetric tensor square (equivalently, in the line-element notation),
5c. Substitute the coframe. From Step 3, and (in the bare, -free normalisation read off the Maurer–Cartan form). Then
Adding the two and multiplying by the overall scale :
5d. Identify the scale with the squared radius. Define . This is just a relabelling of the single free parameter — the algebra fixed the ratio of metric scale to curvature scale, so is determined by the choice of curvature unit. The metric becomes
This is the round metric of — and it is the same metric as in the frame: the two formulas express the same bilinear form in different bases. The is not a feature of the metric itself; it is the squared length of the coordinate vector , a Killing field whose magnitude varies from 0 at the poles to at the equator.
Algebraic content of the construction. Steps 1–5 use only:
the bracket relations ,
the Klein-pair choice , ,
and the Leibniz-derived metric on .
The same procedure (coset section ⇒ Maurer–Cartan form ⇒ vielbein and Killing fields ⇒ coordinate metric via ) applies verbatim to any reductive Klein geometry, and is the standard algorithm by which the abstract algebraic data is turned into explicit coordinate geometry.
Flat space in curvilinear coordinates. Lest this seem like a feature of curvature, the same phenomenon appears in flat if one uses polar coordinates: even though the space is flat. The orthonormal coframe , still gives . The genuine algebraic distinction “flat vs. curved” is not whether depends on the coordinates; it is whether one can choose global coordinates in which is constant. For ( abelian, exponential map a global diffeomorphism) yes; for (, exponential map only local) no. This is exactly the bracket-level distinction discussed next.
Where the curvature comes from. The fact that this is not the flat 2-plane is encoded in the bracket . In Cartan-geometric language, the Maurer–Cartan equation on the model space decomposes via into a torsion equation on the soldering form and a curvature equation on the connection. The -part of contains , which here is non-zero and is precisely what gives its constant positive sectional curvature . For the Euclidean case , the same equation gives zero curvature — flat .
A streamlined derivation following the general algorithm¶
The construction above proceeds in two phases: first an abstract bilinear form on via Leibniz invariance, then a separate coordinate-and-vielbein passage. The general recipe in §“Mechanizing the algorithm” merges these into one mechanical procedure with exactly four steps. We run it here on , spelling out every operation and defining each new object in the language of vielbein/tetrad differential geometry — the same language used for general relativity. No new technology is needed beyond what physicists already use to write in a non-coordinate frame.
Set-up¶
We want a metric on that is invariant under . The Lie algebra has basis with . Pick the basepoint at the north pole. The rotations that fix are those around the -axis, generated by . So
The two-dimensional subspace plays the role of “tangent space at the basepoint” — its two basis vectors are the two infinitesimal directions in which the north pole can be moved.
Choose polar coordinates . We need a way to write every point of as the image of under some rotation. The natural choice: first rotate the pole down by angle around the -axis (using ), then around the -axis by (using ). Symbolically:
This map is called a section. It is just a coordinate parametrization of the sphere by rotations: at every point we have chosen a specific rotation that takes the pole to that point. (Different points in map to the same point in if they differ by an extra -rotation, which is why the choice is not unique. The section picks one representative per point.)
Step 1 — structure constants¶
Read them off the brackets:
In the notation , the only non-zero ’s are the cyclic permutations . We shall need only two of them in what follows: and .
Step 2 — Maurer–Cartan form¶
This is the central new object. Let us define it carefully, then compute it for our section, then interpret the result.
Definition and physical meaning¶
For a path in a Lie group , the Maurer–Cartan 1-form is
What does this mean? Imagine moving from coordinates to . The group element changes from to , where . Multiplying by from the left,
This is the infinitesimal group element needed to step from to . Because the difference is infinitesimal, this element is small and lies in the Lie algebra . So is a 1-form on the coordinate space, valued in the Lie algebra.
The cleanest physics analogy is the angular velocity of a rigid body. If is the orientation of a rigid body as a function of time, the body’s angular velocity tensor is
This is the same construction, with replacing and one parameter instead of . The angular velocity tells us “how the body is rotating at time ” expressed as an element of the rotation algebra. Likewise tells us “how the section is varying at point ” expressed as a Lie-algebra element.
Why we need this for the metric¶
In standard tetrad/vielbein differential geometry the metric on a manifold is written
where is a constant flat metric (Euclidean or Minkowski) and is the vielbein — a frame field that, at each point, gives an orthonormal basis of . The components of any tensor in the vielbein basis are also constant under parallel transport in a metric-compatible torsion-free connection, provided the spin connection is chosen correctly.
The remarkable fact on a homogeneous space is that the vielbein and the spin connection come for free from the section : they are exactly the two pieces into which splits under . Concretely, write
where runs over a basis of and over a basis of . Then:
The -components are the vielbein components. Reason: (the generators of are exactly the infinitesimal moves of the basepoint), so encodes how the basepoint shifts as we vary — i.e., a frame field with values in .
The -components are the spin connection components. Reason: generates rotations of the local frame; its components encode how the frame rotates as we move along . (This is the same connection that appears in the covariant derivative of spinors in GR.)
Both pieces are simply read off from one matrix product: .
Computing for ¶
We need and with .
For : differentiating commutes with the leftmost factor since it does not depend on , giving
Multiplying by on the left, the inner factors cancel, leaving
where the last equality holds because commutes with the exponentials of itself.
For : now the derivative hits the leftmost factor,
Multiplying by on the left,
This is the adjoint action of the group element applied to the algebra element , written . The adjoint is just matrix conjugation by a group element; its purpose here is to express “the element , as seen from a rotated frame.” We compute it by the standard expansion
With and , each nested bracket maps to itself (a closed 2-d subspace under because of the cyclic algebra). Specifically:
So nested brackets cycle through with period 4. Summing the series with the sign-alternating coefficient :
(Sanity check: at we get unchanged. At we get , which is correct since rotating the -axis by around the -axis brings it to the negative -axis.)
Putting both partial derivatives together, the Maurer–Cartan form on in polar coordinates is
Reading off the vielbein and connection¶
Split into its -part (coefficients of ) and -part (coefficient of ):
We choose to label the -basis as in the coframe indices, so is the -coefficient and the -coefficient.
The minus sign in is just a frame orientation convention (from in the expansion); it disappears when we square in Step 4. The pair is an orthonormal coframe, the standard tetrad object of GR. The 1-form is the spin connection — a single -valued (i.e., -valued) 1-form, which in matrix terms is a single skew rotation generator multiplied by .
Step 3 — invariance equation¶
Why this step is needed¶
We have a frame field at every point of . To turn it into a metric we need to specify a constant inner product on the algebraic tangent space : the full metric is then .
But on cannot be just any symmetric matrix. The basepoint is fixed by , the rotations around the -axis. Any such rotation acts on the tangent space at the basepoint — that is, on — by a 2-d rotation; physically, the rotation acts on the directions in which we can move the pole. For the metric to be the same at every point of (an -invariant metric), we need it at the basepoint to be invariant under this -action on :
This is just saying “the dot product of two tangent vectors does not change if we rotate them both by the same -rotation.”
The adjoint action of on is matrix conjugation: , with the result still in because the Klein pair is reductive (). Restricting to , is an honest matrix acting on the column vector of components in the basis.
Linearizing the invariance¶
Take for , differentiate at , and use the chain rule. With ,
This is the infinitesimal invariance condition — the Leibniz rule used throughout the document. The map on is called (“the infinitesimal adjoint”); as a matrix, , with structure constants from Step 1.
In matrix notation, the condition reads
Solving it for ¶
The single generator of is . Its bracket action on :
In the basis , is therefore the matrix of a 90° rotation,
Let (symmetric). Compute :
So and . The unique (up to scale) invariant inner product on is
This says: and are orthogonal and have the same length, in the only frame-independent way available. Geometrically obvious — the two algebraic “directions to move the pole” are equivalent up to the rotation that fixes the pole. Algebraically, this fact is now derived rather than postulated.
Step 4 — assemble the metric¶
Compute with :
Identifying the overall scale with the squared radius, :
The round metric of the sphere of radius , as expected.
Summary of new ingredients¶
The whole construction needed only three abstract concepts beyond the GR vielbein formalism:
Section . A choice of representative rotation per point of . The same role as a coordinate chart, but expressed via group elements rather than coordinates only. Selecting a section is literally choosing a parametrization.
Maurer–Cartan form . An -valued 1-form. By construction it tells us which infinitesimal rotation takes to . Splitting it via produces the vielbein and the spin connection — the two standard tetrad-formalism objects of GR — automatically.
Adjoint action and . Conjugation in the group () and its infinitesimal version (). Their role is twofold: they appear inside the computation of (because moving the derivative through produces conjugations), and they express the symmetry condition that the metric must satisfy on .
Everything else is matrix algebra. No general-position differential topology, no global existence theorems, no abstract bundle constructions were invoked. The recipe is local, explicit, and mechanical — exactly the kind of computation a physicist would write out using only the tools of GR. The same four steps applied to any other reductive Klein pair with section yield the corresponding metric.
Cartesian-like derivation: ¶
The streamlined derivation above used a product section, , in which an -generator () handles the azimuthal direction. The natural analogue of the Cartesian section on — built entirely from -generators in a single exponential — is
Geometrically, this is a single rotation: about the axis in the equatorial plane, by angle . The coordinates are geodesic normal coordinates at the north pole — exponentiating an -vector of length moves the basepoint along the geodesic of arclength in direction where .
This is precisely the recipe of the previous section transplanted onto . Let us run the four-step algorithm and see what comes out — and exactly where the chart breaks down.
Step 1 — structure constants¶
Unchanged from the polar derivation: , with Klein pair , .
The crucial bracket is now , not zero. Already at this stage we can predict — by comparison with the Cartesian case — that the BCH formula will not collapse and that the spin connection will not vanish.
Step 2 — Maurer–Cartan form¶
The exponential with no longer truncates after one term. In the standard representation of ,
which is the skew-symmetric matrix representing infinitesimal rotation about the axis . Direct computation shows with , so the exponential series collapses to the Rodrigues formula:
This is a rotation by angle about the axis — a well-known elementary fact.
The Maurer–Cartan form is (using
since ). Carrying out the
matrix differentiation and projecting onto the basis
gives (computed with klein_geometry.py)
These are not pretty in coordinates. But the vielbein determinant simplifies dramatically:
This is the key formula. It is nonzero on the open disc , vanishes first at , becomes negative for , vanishes again at , and so on. The chart is non-singular only inside the disc of radius centered at the origin in space.
Cleaning up by switching to geodesic-polar coordinates. Set , , so is the geodesic distance from the basepoint and is the angle around it. Then and . Substituting and simplifying:
In these coordinates, the vielbein determinant is — the singularity at the origin came purely from the conversion , the usual artifact of polar coordinates at the origin. The remaining zero at is the genuine failure of the section.
Step 3 — invariance equation¶
Unchanged: is a rotation in the -plane, and the invariance equation gives with . Set to match the standard convention.
Step 4 — assemble the metric¶
in geodesic-polar coordinates:
The cross-terms cancel (the pieces have opposite signs) and the diagonal terms collapse by :
Setting : . This is the round metric on the unit sphere — the same intrinsic geometry as from the streamlined polar derivation. The geodesic-polar coordinates are essentially relabelled colatitude/longitude: (on a unit sphere, colatitude is the geodesic distance from the pole), and (longitude is the angle around the pole). The two sections produce identical metrics up to this relabelling.
What “fails” — and what it teaches us¶
The Cartesian-like section on does not fail algebraically. The four-step algorithm runs to completion and produces the correct, -invariant, constant-curvature round metric. What is different from Cartesian is in the columns below:
| Property | Cartesian | Cartesian-like |
|---|---|---|
| Section | ||
| BCH series | Terminates () | Doesn’t terminate (Rodrigues) |
| Vielbein in section coords | Identity | -modulated |
| Spin connection | 0 identically | |
| Vielbein determinant | 1 everywhere | , vanishes at |
| Global chart? | Yes — covers all of | No — covers |
| Curvature | 0 |
The non-trivial entries on the right are not flaws in the algorithm — they are the algebra honestly reporting the geometry of the sphere:
The non-vanishing is the canonical Cartan connection for the round metric. Its exterior derivative , times , encodes the constant positive Gaussian curvature .
The vanishing of at is the algebra reporting the injectivity radius of the exponential map. Geometrically: a tangent vector at the north pole of length traces out a half-great-circle and reaches the south pole; tangent vectors of length in all directions converge to the same antipodal point. The chart cannot distinguish them, so the Jacobian collapses.
Beyond the exponential map is no longer injective at all; tangent vectors of length produce the same group element as those of length . The Cartesian-like chart cannot be extended to all of no matter how cleverly it is stitched.
So no algebraic failure. The only failure is topological: is compact and cannot be covered by a single -valued chart. The algebra-level signal of this — visible already before any matrix arithmetic — is that the bracket lives in , not in . That is precisely the curvature of the canonical connection, and it is also precisely what prevents the exponential map from being a global diffeomorphism.
The price of insisting on a single-exponential, Cartesian-like section on is therefore: (i) an incomplete chart (missing one point), and (ii) metric components that — by coincidence of on the unit sphere — happen to match the usual spherical form anyway. The product section trades these for a chart that is also incomplete (the poles are singular there too) but where the metric components are immediately recognizable from elementary spherical geometry. Both sections describe the same Riemannian manifold; only the coordinate dress is different.
The other Klein pair: as a 3-manifold¶
The enumeration of proper subalgebras of at the start of this section listed two non-trivial Klein-pair choices: for some axis (giving ), and (giving the whole group manifold). The first occupied the rest of the section; we now run the algorithm with the second choice for completeness.
The setup is
The space is literally the group manifold , and its identity depends on which integration of we take:
| Group | Topology | |
|---|---|---|
| (simply connected cover) |
Both are 3-dimensional and carry constant positive sectional curvature; they differ by an antipodal identification. The Lie algebra alone does not distinguish them — only the global group does. Below we work with (the adjoint representation we have used throughout); would yield exactly the same local formulas with a doubled -range.
Step 1 — structure constants¶
Unchanged: .
Step 2 — Maurer–Cartan form¶
Take the geodesic-normal section
a single exponential parameterized by the whole . Geometrically, is a rotation by angle about the axis . The Rodrigues formula gives
Express in spherical coordinates: with . Then covers all of (rotations of any axis up to angle ; at a rotation by about equals one about , the antipodal identification of ). The coordinates are three-dimensional geodesic-polar coordinates centered at the identity.
A direct computation of (using the
identity
truncated by , or
equivalently via direct matrix algebra; verified by
klein_geometry.py) gives a
vielbein
whose first column is just (so at in the radial direction points along — the radial geodesic), and whose orthogonal columns combine and factors. The remaining entries are not crucial; what matters is the resulting metric, computed below.
Step 3 — invariance equation¶
Here is the new twist. The invariance equation must hold for every , but has no non-zero elements. The equation is therefore vacuous: any symmetric matrix is allowed.
This is six parameters of freedom in — more than the single-parameter family we got on . The reason: with no -symmetry to enforce, the algorithm produces a generic left-invariant metric on , not a distinguished bi-invariant one.
To single out a “natural” choice we impose one further requirement: invariance under the adjoint action of the full group — i.e., must be -invariant for every , not just for . This is the bi-invariance condition. For a compact simple Lie algebra such as , the unique (up to scale) bi-invariant inner product is (the negative of) the Killing form:
Taking with for a Riemannian (positive-definite) metric makes the resulting geometry the standard bi-invariant metric on the group.
Step 4 — assemble the metric¶
With , computing in coordinates:
This is exactly the round metric on the 3-sphere of radius 2 restricted to the closed ball , with the antipodal identification at . To see this, substitute :
which is 4 times the standard unit-radius metric in geodesic-polar coordinates, with . So we have covered a closed hemisphere of unit ; the equatorial at (i.e., ) is identified antipodally, giving .
Rescaling would produce the round unit-radius instead. Lifting to extends the -range to (with identified with the identity rotation) and recovers the full unit . The intrinsic Riemannian geometry — constant positive sectional curvature — is the same throughout this family of normalizations.
Connection to the later derivation¶
The 3-sphere also appears later in this document as a different Klein quotient: , built from the algebra . That construction is genuinely independent of the present one — it uses a bigger algebra ( has six generators, not three), with four-dimensional minus a three-dimensional stabilizer. The fact that both constructions land on the same manifold is because and the diagonal -action realizes as a coset. We do not pursue this isomorphism here, but it is the algebraic reason the two constructions agree on the geometry while differing on group-theoretic ancillary data.
Why is not a Klein quotient of , even though ¶
A natural question arises from comparing the classification above with the Euclidean algebra of Part III. The rotation generators are literally the same — as a subalgebra — and the resulting space sits inside as a unit sphere. So why doesn’t alone produce ? The Lie algebra is a subset of the bigger one; why aren’t the spaces?
The short answer is that the Klein construction outputs the space whose symmetries the algebra encodes. Rotations alone are not enough symmetry to move every point of to every other point — they only move points around within spheres of constant radius. The translations in are exactly what allows us to jump from one sphere to another.
Let us spell this out from three perspectives.
1. Group action: rotations alone are not transitive¶
A homogeneous space is one on which the group acts transitively: any point can be moved to any other point by some group element. acts on by rotations centered at the origin, and these preserve the radial coordinate . So:
The orbit of a point under is the sphere of the same radius.
Two points with different radii cannot be related by any rotation.
acts transitively on each sphere separately, but not on as a whole.
So is not a homogeneous space of . It is foliated by -orbits — spheres of varying radii (plus the fixed origin) — and each leaf is an . To make itself homogeneous, we must enlarge the group by adding transformations that move between the orbits. The minimal such enlargement is to add translations: any two points are connected by a unique translation. The result is the Euclidean group , with Lie algebra .
2. Dimension count: what Klein quotients of are available¶
The Klein construction outputs spaces of dimension , where runs over the proper subalgebras of . For (dimension 3) the options are exactly:
| Space | ||
|---|---|---|
| 0 | 3 | Group manifold (or ) |
| 1 | 2 | |
| 2 | — | (no 2-dim subalgebra exists) |
| 3 | 0 | Point |
The 3-dimensional output is the group manifold — and the group manifold of is compact ( or , both finite volume). It cannot be . The compactness is an algebraic feature, visible already on the Killing form: is negative definite, which by Cartan’s criterion is the signature of a compact semisimple Lie algebra. There is no way to integrate to a non-compact Lie group.
By contrast, has dimension 6, the right amount to fit as . The translation subalgebra is itself a non- compact abelian Lie algebra (the additive group ), and this non-compactness is what gives its non-compact character.
3. The “stacking ’s” picture is exactly right¶
The intuition — is foliated by ’s of all radii — captures the geometric content precisely. Three statements make this rigorous:
(a) Foliation by -orbits. is fibered by -orbits via
and the origin is the unique -fixed point. Each fiber is a copy of . So is (topologically) a cone over : the trivial bundle with the -fiber at zero collapsed to a point.
(b) knows about both layers, about only one. Inside (as a vector space), rotations move points within a sphere and translations move them across spheres. The semidirect bracket encodes that the ’s transform as a vector under rotations — precisely what is needed to make the foliation above -equivariant.
(c) Algebraic inclusion vs. space inclusion go in opposite directions. This is the part that can feel counterintuitive. Embedding gives fewer symmetries, not more, and produces a smaller (lower- dimensional) space:
sees only the angular directions — its homogeneous quotient is , two-dimensional.
adds the radial direction via translations — its homogeneous quotient is , three-dimensional.
The geometric inclusion corresponds to the pair of algebra inclusions and (stabilizer of a point on as subgroup of the rotation stabilizer of the origin in ). Together these make the embedding -equivariant.
4. The slogan¶
The general lesson is that the same algebra can describe the symmetries of geometrically different objects, and different algebras can produce the same object via different routes:
produces both and — same algebra, different Klein pairs, different manifolds.
arises both as (= group manifold of with ) and as (a different algebra, a different ). Same manifold, different algebraic routes.
The Klein pair , not the algebra alone, is what determines the space. So the right comparison between and is not “same algebra, different spaces” — it’s different Klein pairs whose spaces are geometrically nested:
| Klein pair | Space | Role of |
|---|---|---|
| The stabilizer of the origin | ||
| The full symmetry group |
Climbing from the lower row to the upper row corresponds to embedding the unit sphere into Euclidean space: . The same generators play different roles — symmetries of the smaller space (on ) become stabilizers of a basepoint in the bigger space (in ). This is the algebraic signature of “thinking of as the unit sphere of ”.
The 3-sphere ¶
The algebra has six generators. Writing them as () — three “spatial” rotations and three “tilts” of the fourth axis toward each spatial axis — the brackets are
These match Poincaré except for the sign of , which here is instead of — a positive sign, since the fourth axis enters with Euclidean (not Lorentzian) signature.
Klein-pair candidates. via , , with . The natural subalgebras include:
: is itself, six-dimensional.
The “diagonal” : gives . This is our choice — .
or : each also has . These exhibit as the group manifold of with its bi-invariant metric — the same Riemannian manifold, realised via the group-manifold construction.
1-dim subalgebras (e.g. a single or ): give 5-dim homogeneous spaces, related to the Hopf fibration.
For “the standard 3-sphere of points” the diagonal choice is the one that matches the orbit–stabilizer analysis: rotations among the three spatial axes fix the north pole on the fourth axis, while tilt the pole tangentially in three independent directions. So and
Reductive check. ✓. And , so again is a symmetric space.
Metric derivation. Set , symmetric . The Leibniz condition with is
which is exactly the same equation as for (with playing the role of ). The unique solution up to scale is
and the resulting -invariant metric on is the round metric of radius (with proportional to ). As in the case, this is the metric in the orthonormal frame at the basepoint (and globally, by -equivariance). In hyperspherical coordinates the same metric reads
with the coordinate-dependent factors and arising — exactly as for — from the coordinate basis vectors having varying length, not from any change in the underlying bilinear form. The soldering form is , , , and .
The bracket — the one that distinguishes from — does not enter the metric derivation at all, because the right-hand side lies in , not in . It determines instead the curvature of the space: constant positive sectional curvature .
Same algebra-to-metric machinery, different curvature¶
Comparing the four cases , , , side by side, the picture is now uniform. In every case the metric on comes out as
and the only input to this derivation is the bracket — which has the same structure ( acting on its defining -dim representation) in all four cases. What changes from one space to another is not the metric formula but the bracket :
| Space | Algebra | Curvature | |||
|---|---|---|---|---|---|
| 0 | 0 (flat) | ||||
| 0 | 0 (flat) | ||||
The hyperbolic space would appear with and with the opposite sign, giving constant negative curvature — and Minkowski space sits in the same family with an abelian ideal but instead of , producing the signature flip .
The single algebraic distinction “ vs. ” thus separates the flat model spaces (Minkowski, Galilean, Euclidean) from the curved constant-curvature ones (spheres, hyperbolic, de Sitter, anti-de Sitter). The metric on , in every case, is fixed by the same Leibniz invariance condition.
Summary¶
In one algebraic principle — that be invariant under the adjoint action of —
the spacetime metric is completely determined by the commutation relations:
| Algebra | Invariants on | Curvature | |||
|---|---|---|---|---|---|
| Poincaré | 0 | (Minkowski) | 0 | ||
| Galilei (bare) | 0 | , (degenerate pair) | 0 | ||
| 0 | (Euclidean) | 0 | |||
| 0 | (Euclidean) | 0 | |||
The pattern is uniform across all rows: the Leibniz invariance condition on , given the action , fixes the metric uniquely (up to overall scale). The Poincaré/Galilei split comes from the bracket (yielding versus 0), which controls the signature / non-degeneracy of the metric. The flat/curved split comes from the bracket (vanishing versus landing in ), which controls the curvature of the homogeneous space.
What the algebra determines (and what it doesn’t)¶
Looking back across the six examples, the Leibniz condition is doing something quite specific — and worth stating in its sharpest form.
For an isotropy representation , the Leibniz condition is the statement “find the -invariant symmetric bilinear forms on .” This is a Schur’s-lemma exercise, and the structure of the answer is dictated entirely by the decomposition of under .
When is irreducible — Poincaré, all , all , all , de Sitter, anti-de Sitter — Schur’s lemma forces the real vector space of invariant symmetric forms to be one-dimensional. The answer is then:
a discrete invariant — the signature of the form, fixed by which real form of the isotropy algebra ( for some ) is acting;
plus one continuous degree of freedom — an overall positive scalar (the unit of length squared).
So in this case the entire derivation produces the signature plus one scale. The signature is forced by the algebra; the scale is free.
When is reducible — Galilei, Carrollian, and other limiting algebras — the invariant forms decompose summand by summand. Each irreducible piece contributes its own independent scale, and isomorphic pieces can contribute additional off-diagonal mixing parameters. For Galilei, splits as under , the two summands are non-isomorphic, and we get exactly the two independent forms — the temporal and the spatial — each with its own scale (the unit of time and the unit of length are algebraically independent in non-relativistic physics).
What is not derived. The numerical values of the scales. The algebra cannot know whether the unit is meters or light-years; in the irreducible case there is nothing else free, and in the reducible case the freedom is exactly one positive number per irreducible summand (plus any off-diagonal mixing among isomorphic summands).
Scale ratios are fixed by the algebra. In curved cases like , the bracket has a definite normalisation. Once we pick the metric scale , the Cartan structure equation forces the sectional curvature to be : metric and curvature scales are not independent. The algebra fixes the ratio of (length scale)² to (curvature scale)⁻¹; only the absolute scale is free.
The same applies on the Lorentzian side. Poincaré has a single scale, so the speed of light is the only dimensionful constant relating space and time, and the algebra lets us set without loss. Galilei has two independent scales, so has no algebraic meaning — it is undefined, not merely unset.
The clean statement. The algebra determines:
the signature of the metric (a discrete invariant, fixed by the real form of the isotropy algebra);
the dimensional structure — how many independent scales appear, i.e., how many irreducible pieces has under ;
all relations among metric scales and curvature scale.
It does not determine the absolute numerical value of any scale. This is exactly what one would want from a derivation that starts only with abstract commutation relations: the algebra cannot know about units, but it determines everything invariant under the choice of units.
Mechanizing the algorithm¶
By this point the same steps have been performed by hand for six examples: Minkowski, Galilean, , , , . The procedure is mechanical. Given a faithful matrix representation of and a section , every step reduces to symbolic linear algebra. We collect the algorithm in one place here, then describe a companion SymPy script that runs it end-to-end.
The algorithm¶
Inputs.
A faithful matrix representation — i.e., a list of matrices , , forming a basis of .
A partition of the basis indices into “stabilizer” indices and “complement” indices — corresponding to the splitting that defines the Klein pair.
A coordinate chart on and a section realizing it.
Crank.
Step 1 — Structure constants. Compute the commutators in the matrix algebra, and decompose them on the basis:
Each decomposition is one linear system in unknowns (the coefficients ).
Step 2 — Maurer–Cartan form. For each coordinate , compute
and split into - and -parts: the vielbein 1-form components are for , and the spin-connection components are for .
Step 3 — Invariance equations. For each , build the matrix from the structure constants (its entry, with , is ). Solve the linear system
for the symmetric matrix . The solution space is finite-dimensional (an irreducible -rep on gives a 1-d family; reducible gives one scale per irreducible piece).
Step 4 — Metric in coordinates. For any choice of in the invariant family,
Step 5 (optional) — Curvature. Compute the Cartan curvature 2-form
It vanishes iff the model space is flat (e.g. , Minkowski, Galilean); on it is proportional to the volume form with coefficient .
Step 6 (optional) — Killing vector fields. For regarded as an infinitesimal generator on , the induced vector field is
where is the inverse vielbein and the subscript projects onto the -indices.
Inputs that must be supplied¶
Three pieces cannot be deduced from the bare Lie algebra:
Subalgebra . Different stabilizer choices give different homogeneous spaces (Galilean spacetime vs. phase space, for instance; cf. Part II).
Section . This selects coordinates. Different sections give different charts (Cartesian vs. polar vs. spherical) — the intrinsic geometry is unchanged.
Scale parameters in . Step 3 produces a family; physical units fix the numerical values.
Everything else — the vielbein, the connection, the curvature, the Killing fields, the relations among scales — is determined by Steps 1–6 above.
How much freedom is there in choosing the section?¶
Of the three supplied inputs above, the section is the one that is purely a matter of coordinates — what parametrization is used, not what space is described. So in practice the natural follow-up question is: how much freedom is there? Is there always a Cartesian-like choice that covers the whole space, like in the polar section above? For , the spherical coordinates worked, but did we get lucky — or could we have predicted in advance which choices would and would not give a global chart?
The freedom is large, the obstructions are well-understood, and “luck” plays no role: the bracket in the algebra tells us in advance whether a global Cartesian-style section exists.
What counts as a section¶
Formally, a section over an open subset is any smooth map such that projecting back to gives the identity: . In plain terms, picks one specific group element representing the coset of . Any such is good for running the four-step algorithm, with one technical requirement: the resulting vielbein must be non-singular on , i.e., . Otherwise the algorithm cannot read off coordinates.
In practice, two natural families show up:
Exponential normal section. , where runs over a basis of only. Single exponential of an algebra element.
Product (nested) section. , where each is some chosen generator from either or . Each factor exponentiates a single generator.
The product form is more general: we can include -generators in the section (as we did with for spherical, for polar). This is fine — the action of an -generator on the basepoint produces a non-trivial point in whenever it does not commute with everything in , so -factors carry geometric information too. They typically produce angular coordinates (longitude, azimuth) that pair naturally with radial -coordinates.
Example A: Cartesian on ¶
For with , the exponential normal section is
Because , the Baker–Campbell–Hausdorff series collapses: is simply the pure translation by . The product version equals the same group element, for the same reason. The Maurer–Cartan form is then
so the vielbein is the identity matrix, the spin connection vanishes, and the metric is . This is global: the section is defined on all of , and the vielbein is non-singular everywhere. Cartesian coordinates cover the whole space.
Example B: “Cartesian-like” on ¶
The natural analogue for with is
i.e., a single rotation by some angle around an axis in the -plane. This is a valid section, and in fact it gives geodesic normal coordinates at the north pole — the radial parameter is the geodesic distance from the pole. Re-parametrising , (so is the geodesic distance and is the angle around the basepoint), direct computation gives
and the metric is
after using . This is the same round metric we derived earlier with spherical coordinates — just with now interpreted as colatitude and as longitude. The intrinsic geometry is the same; only the coordinate labels differ.
The crucial point about coverage, though: this Cartesian-like section does not cover the whole sphere. The vielbein determinant in the coordinates above is
which vanishes at (the basepoint, a coordinate singularity of the same kind as the origin in plane polars) and at (the antipodal south pole, a genuine wraparound — all values of collapse to the same group element there). The exponential wraps the open disc onto minus the south pole, and beyond the map is no longer injective. The exponential normal section is a chart on an open dense subset, not on all of .
Algebraic foreknowledge: when does a global chart exist?¶
Here is the predictive content. From the bracket alone:
(abelian ). The exponential map is a covering, and for simply-connected it is a global diffeomorphism. So an exponential normal section is good globally. This is the Cartesian situation. The same algebra-level signature also explains why the position vector is globally well-defined here (cf. the R² subsection on the position vector): both are consequences of abelian .
(non-abelian ). The exponential map is only a local diffeomorphism. In fact, is precisely the curvature of the canonical connection on in the abstract Cartan-geometric sense. For , is the bracket that encodes the constant positive curvature . From this single bracket we know in advance — without doing any computation — that no globally non-singular section can exist.
So no luck was involved. We knew on entering the calculation that no chart would cover the whole sphere. The spherical product section and the geodesic normal section are two different local charts; both miss measure-zero subsets (the poles, or the antipodal point and the basepoint), and they patch together via overlap maps in the standard atlas-of-charts way. The Klein construction is happy with any one of them.
Pragmatic guidance¶
If is abelian, prefer the exponential normal section. It is global, the vielbein is the identity, the spin connection vanishes, and the metric in the resulting coordinates is the flat -metric — exactly Cartesian. Cf. Example A.
If is non-abelian, use whichever local section makes the symmetries of the problem manifest. The product section, with -generators handling angular directions, is the standard recipe — and it is what produces polar / spherical coordinates throughout this document.
Whatever section is chosen, check . Where it vanishes is a coordinate singularity, not an obstruction to the algorithm — just switch charts there.
In short, the section is a coordinate chart in the usual sense, and the algebra tells us in advance whether a global chart exists. The algorithm itself is indifferent: it produces the metric in whichever coordinates the section delivers.
The SymPy implementation¶
The script klein_geometry.py in this repository
implements the algorithm as a KleinGeometry class with methods
.structure_constants() Step 1
.maurer_cartan() Step 2 → (ω_components, e, ω_h)
.invariant_forms() Step 3 → (η symbolic, free scales)
.metric(scale_subs=...) Step 4
.curvature() Step 5
.killing_field(ξ) Step 6The four canonical examples from this document are reproduced at the bottom of the script: Minkowski , Galilean , in polar coordinates, and the round . Running it produces the algebra, the vielbein and connection, the dimension of the invariant-form space, the explicit metric in the chosen chart, the Cartan curvature, and the Killing fields — for each example.
Self-test output from running python klein_geometry.py:
[R^2 polar]
[OK] metric = dr^2 + r^2 dphi^2
[OK] curvature is flat
[OK] J = ∂_φ
[OK] P^1 = cos(φ) ∂_r − (sin(φ)/r) ∂_φ
[S^2]
[OK] metric = dθ^2 + sin^2θ dφ^2
[OK] curvature = −sin(θ) (so K = 1/R^2 > 0)
[Minkowski (1+1)]
[OK] Minkowski signature is (−,+) or (+,−)
[OK] Minkowski curvature flat
[Galilei (1+1)]
[OK] Galilei: 1-d family of invariant (0,2) forms
[OK] Galilei (0,2) form is degenerate clock dt^2A few caveats worth noting:
Only reductive Klein geometries are handled; the splitting with is built into Step 3. Parabolic (non-reductive) geometries need additional filtration data.
Only (0,2)-tensors are enumerated. The Galilean cometric , a (2,0)-tensor, is a separate invariant the routine does not look for.
Local chart only. The section is a choice of coordinates; output is local to the chart’s domain.
So the abstract narrative — “from commutators alone, by turning the Maurer–Cartan crank, the metric falls out” — really is an algorithm. The Lie algebra is the input; the metric, connection, curvature, and isometry generators are the output. The script is a small worked proof that the construction mechanizes.
Comparison with Lie Groups I¶
The companion document postulates the boost generators as explicit matrices acting on and then solves for the invariant metric. The present derivation does not need any matrix representation: the Leibniz invariance condition is applied directly to the commutators of the abstract algebra. The final metrics agree.
Outlook: from Klein to Cartan — the connection to general relativity¶
The Klein-geometry picture used above produces a single rigid model space in which acts globally and the curvature is zero by construction. Curved spacetimes of general relativity cannot be obtained this way, because they have no transitive symmetry group. The required generalisation — “Klein geometry made local” — is Cartan geometry.
A Cartan geometry modelled on the Klein pair consists of a principal -bundle over a manifold of dimension , together with a Cartan connection — a -valued 1-form that
is a linear isomorphism at every ,
restricts on vertical (fibre) vectors to the Maurer–Cartan form of , and
is -equivariant under the right action.
The first condition is the key one: identifies each tangent space with the full model algebra , not just . So the model is present at every point of — but only the isotropy subgroup acts globally on the fibres. The curvature
measures the failure of the Klein model to fit exactly: when the Cartan geometry is locally isomorphic to . Cartan’s intuitive picture is that of rolling the model space along without slipping; the connection encodes how the model tilts as it rolls, and the curvature is the holonomy of a small loop.
For the Klein model the splitting decomposes the Cartan connection into two fields,
which are exactly the spin connection and the vierbein of the tetrad (or first-order) formulation of general relativity. The curvature splits in the same way:
| Cartan-geometry object | Name in GR |
|---|---|
| -part of | spin connection |
| -part of | vierbein |
| -part of | Riemann curvature |
| -part of | torsion |
Cartan’s structure equations and are nothing but the components of . This is the Einstein–Cartan formulation of GR; imposing determines uniquely from (the Levi-Civita connection) and recovers ordinary Einstein gravity.
The connection back to the present derivation is now direct. The -part of the Cartan connection — the vierbein — identifies each tangent space with the model translation subspace . The bilinear form we derived above as the unique Lorentz-invariant form on is then automatically a Lorentz-invariant form on every , and the spacetime metric is
The metric of a curved spacetime is, in this sense, “the Minkowski metric of the Klein model, transported from point to point by the vierbein.” The signature, the Lorentz fibre symmetry, and the structure of the tangent space are all inherited from the algebraic derivation of this note; what Cartan geometry adds is the prescription for varying the model smoothly across .
The conceptual chain is therefore
Why the Klein pair is not ad hoc — the Erlangen Programme¶
The construction may at first look like an arbitrary technical device. It is not. It is the precise mathematical articulation of Felix Klein’s Erlangen Programme (1872), which proposed that
a geometry is the invariant theory of a group action on a space.
A geometry, in this view, is a pair with acting on , and a property is “geometric” exactly when it is preserved by every element of . This single idea unified the previously disparate Euclidean, projective, affine, hyperbolic, and inversive geometries into a common framework — each is characterised by its symmetry group acting on an appropriate space.
Once one accepts the Erlangen point of view, the Klein pair is not chosen but forced by an elementary theorem. Three steps:
Homogeneity — every point is to be on the same footing as every other (no preferred origin). This is the principle of relativity in its purest form: for spacetime, “no preferred event”; for space, “no preferred location.”
Pick a basepoint and consider its stabilizer .
Orbit–stabilizer theorem — the map is a -equivariant bijection .
So any homogeneous space of a Lie group is automatically of the form . The Klein pair is then the most economical encoding of “(symmetry group) + (operational notion of a point).” In physics, comes from a relativity principle and comes from “what fixes a chosen event.” In the Poincaré case this gives Minkowski space; in the Galilei case, Galilean spacetime.
Of course the Erlangen viewpoint is not the only way to define a space. Pure manifold theory ( = topological space smooth atlas) imposes no symmetry from the outset; metric-space, synthetic-axiomatic, algebraic, sheaf-theoretic, topos-theoretic, and noncommutative formulations all provide alternative foundations. What is special about the Klein–Cartan viewpoint is that it is the foundation that puts symmetry first, which is exactly what physical theories do.
Scope of the framework¶
Within its proper domain — smooth manifolds with local Lie-group symmetry — Cartan geometry is remarkably comprehensive. The same construction specialises to:
| Klein model | Resulting geometry |
|---|---|
| Riemannian | |
| Poincaré Lorentz | Pseudo-Riemannian / GR |
| Galilei | Newton–Cartan |
| Conformal group stab. | Conformal |
| stab. | Projective |
| Affine | |
| stab. | Contact / projective contact |
Three-dimensional Euclidean space is the case . The Cartan connection consists of an orthonormal frame field and a rotation connection ; on flat both torsion and curvature vanish, while on a curved Riemannian 3-manifold the same data describes the geometry. Curvilinear coordinates are simply a change of frame: the vierbein relates the orthonormal frame to a coordinate frame, and Christoffel symbols arise as the components of the connection in a coordinate basis. The whole tensor calculus used in GR — covariant derivatives, Christoffels, Riemann tensor — sits inside Cartan geometry as a particular choice of frame.
A vast modern generalisation of this list is the theory of parabolic geometries of Čap and Slovák, which treats all with a parabolic subgroup uniformly.
Where Cartan is not enough¶
Cartan geometry requires three things: a smooth manifold, a homogeneous local model , and a connection soldered to the tangent bundle. Step outside any of these and one needs a different framework:
Yang–Mills gauge theory. Principal -bundles in which is an internal gauge group — most notably electromagnetism (, the simplest Yang–Mills theory), colour (), and electroweak (). The connection here is not soldered to , so it is a principal connection, not a Cartan connection. The technical distinction is sharp: a Cartan connection must be a pointwise linear isomorphism , whose -part (the vierbein) identifies with the model translation space at every point. An electromagnetic connection has no such piece — is one-dimensional and internal, and the bundle direction sits “above” spacetime rather than “tangent” to it. So while and the vierbein-plus-spin-connection are both principal-bundle connections with curvature, only the latter is a Cartan connection. Schematically, Cartan geometry is “tangential” gauge theory; Yang–Mills is “internal” gauge theory.
Symplectic and Poisson geometry on non-homogeneous phase spaces — the setting of most of classical mechanics on cotangent bundles of curved configuration spaces.
Singular and stratified spaces: orbifolds, algebraic varieties with singularities, conifold geometries.
Quantum theory. Hilbert spaces and operator algebras are not Cartan geometries. The classical background spacetime of a QFT can be Cartan, but the quantum sector itself is not.
Noncommutative geometry (Connes): replace by a noncommutative algebra; there is no underlying manifold.
Higher gauge theory, gerbes, -bundles: anomalies, the -field of string theory, higher-form gauge fields.
Generalised geometry (Hitchin), double field theory, T-duality: the tangent bundle is replaced by or by a doubled spacetime, which is outside the Cartan paradigm.
Discrete / lattice geometries as used in lattice gauge theory and numerical relativity.
So Cartan geometry covers essentially all classical relativistic field theory built on a spacetime manifold — including all of GR and its geometric extensions (Einstein–Cartan, teleparallel, Poincaré gauge theory, MacDowell–Mansouri). Where it falls short is exactly where one leaves smooth manifolds with local Lie-group symmetry behind.
There is one fascinating qualification to the statement that electromagnetism is not a Cartan geometry: the Kaluza–Klein observation. On a 5-manifold , a rotation of the extra circle is a translation in the geometry of . The gauge potential then becomes a piece of the 5-dimensional metric,
so that in the higher-dimensional Cartan geometry (model ) electromagnetism is part of the geometry — its gauge group has been promoted from “internal” to “spacetime” by adding a dimension. Analogous constructions exist for non-abelian Yang–Mills at the cost of extra dimensions, but with strong constraints on the higher-dimensional metric. This is the line pursued by Kaluza, Klein, Einstein, Pauli, and many others, and never quite produced a fully viable unification. The takeaway is that the boundary “Cartan vs. internal gauge theory” is dimension-dependent: a gauge field that is internal in dimensions can become geometrical in .
Klein, Cartan, and Yang–Mills compared¶
All three frameworks use Lie groups, but they package them very differently — and only the first two construct the underlying space out of the group. Yang–Mills theory does not construct spacetime at all. It takes a manifold as given (typically Minkowski, or a curved Lorentzian manifold which by itself may be a Cartan geometry — the gravitational sector) and adds on top of it the following data:
A principal -bundle with a Lie group (, , , …). Each fibre is an “internal” copy of the gauge group, not related to . The total space has dimension .
A principal connection satisfying for the vertical vector generated by , and the -equivariance . Compared with the Cartan case, only the vertical condition is imposed — there is no demand that be a linear isomorphism with all of , because is not soldered to .
A curvature , the field strength.
Associated bundles for each representation ; sections of are the matter fields (quarks, leptons, …).
The crucial conceptual point is that the geometric object is the pair , not a space . The Lie group encodes a local symmetry that acts pointwise on the fibres of , not on the base . Spacetime is given separately, and the bundle structure is layered on top of it. The three frameworks line up as follows:
| Klein | Cartan | Yang–Mills | |
|---|---|---|---|
| Role of | defines the space | local model of the manifold | internal gauge symmetry |
| Where acts | on the space itself, transitively | on tangent spaces (soldered to ) | on internal fibres of (not soldered) |
| What is “space”? | , locally modelled on | (given), plus the bundle | |
| Connection 1-form | (Maurer–Cartan of on itself) | Cartan: | Principal: only the vertical part of is fixed |
| Curvature | 0 identically | model failure torsion | field strength |
| Typical example | Minkowski as | GR as a Cartan geometry on the same model | EM, QCD, electroweak |
In one sentence: a Klein geometry is a Lie-group quotient; a Cartan geometry is a manifold modelled on a Lie-group quotient, with the model tangentially soldered at every point; a Yang–Mills theory is a manifold equipped with an unrelated Lie-group fibre bundle and a non-soldered connection on it. All three use Lie groups, but only the first two build the space itself out of them. Yang–Mills uses Lie groups to enrich a pre-existing space with internal symmetry.
There is one further wrinkle worth knowing about. In gauge theory one often studies the moduli space of connections (connections modulo gauge transformations), which becomes a genuine “space” of its own — typically infinite-dimensional and not homogeneous. Donaldson invariants, Seiberg–Witten theory, and similar 4-manifold invariants are about the geometry of this space. But that is one level removed from the principal-bundle setup itself.
Synthesis: Cartan for physical space, other geometries on top¶
Stepping back from the individual items in the “Where Cartan is not enough” list, one notices that almost every entry is not really an alternative to Cartan geometry but an additional structure layered on top of a Cartan-modelled physical spacetime. They are not different descriptions of physical space; they are different kinds of space attached to it.
| Theory | Physical spacetime | Extra structure on top | Geometry of the extra structure |
|---|---|---|---|
| Yang–Mills (EM, QCD, …) | Cartan (, possibly curved) | Principal -bundle | Principal connection |
| Classical mechanics | Cartan ( or curved ) | Phase space | Symplectic |
| Quantum mechanics / QFT | Cartan ( or curved) | Hilbert space of states | Linear / Hilbert |
| Statistical mechanics | Cartan ( or ) | Phase space with a measure | Measure-theoretic |
| Higher gauge theory, gerbes | Cartan | Higher principal bundle or gerbe | Higher-categorical |
| Lattice gauge theory | Cartan lattice | Bundle data on | Discrete |
In every row, the physical spacetime is still a smooth Cartan geometry — Minkowski or its curved GR generalisation — and supplies the metric, the causal structure, and the gravitational dynamics. The additional layer encodes whatever the theory is about: internal symmetry directions (gauge theories), configurations and states (mechanics), quantum amplitudes (Hilbert spaces), or probability distributions (statistical mechanics). These additional structures need their own appropriate geometry, and there is no reason to expect them to be Cartan — they are not spacetime.
This is particularly clear in classical mechanics, where the architecture is the three-layer structure
The physical space is , the Klein geometry . The configuration space — points of for a particle on a sphere, for a rigid body, for point particles — is typically itself a Cartan/Riemannian manifold, with the kinetic energy supplying a metric. The phase space , where Hamilton’s equations live, is symplectic and not in general homogeneous, so it is not a Cartan geometry. Yet none of this is in conflict with the Cartan framework: each layer is the natural geometric setting for the data it carries.
A small list of frameworks does propose genuinely modifying physical spacetime itself rather than adding structure on top — noncommutative geometry, generalised geometry, double field theory, higher-categorical spacetimes, and the various Planck-scale or string-theoretic notions of emergent or noncommutative spacetime. Those are the cases where the Cartan picture of physical space itself is being challenged. For all standard physics, however, the working rule is:
Physical spacetime is a Cartan geometry; everything else is additional structure layered on top of it, and the geometry of those layers is not Cartan.
This is the reason the Lie-algebra-to-metric derivation of this note — followed by its Cartan extension to curved spacetimes — sits at the foundation of essentially all relativistic field theory: it is the foundation of spacetime itself, on which the rest of physics is built.
Closing remark¶
The same Cartan construction applied to the Galilei algebra yields Newton–Cartan geometry, the geometric formulation of Newtonian gravity, in which the degenerate temporal and spatial metrics derived in Part II play the role of the two background tensors. A standard reference is Sharpe, Differential Geometry: Cartan’s Generalization of Klein’s Erlangen Program.