Determinant

January 15, 2025
Multiplicative Property

In this section we assume the usual definition of a determinant, from which one can derive the following properties that we will need:

$$\det \mathrm{diag}(d_1, d_2, \dots, d_n) = d_1 d_2 \dots d_n\,,\tag{1}$$

$$\det AB = \det A \det B\,,\tag{2}$$

$$\det \lambda A = \lambda^n \det A\,.\tag{3}$$
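These three properties are easy to verify numerically (a minimal sketch, assuming NumPy; the matrix size and random seed are arbitrary choices):

```python
import numpy as np

rng = np.random.default_rng(0)
n = 4
A = rng.standard_normal((n, n))
B = rng.standard_normal((n, n))
d = rng.standard_normal(n)
lam = 1.7

# (1) determinant of a diagonal matrix is the product of its entries
assert np.isclose(np.linalg.det(np.diag(d)), np.prod(d))
# (2) multiplicative property
assert np.isclose(np.linalg.det(A @ B), np.linalg.det(A) * np.linalg.det(B))
# (3) scaling an n x n matrix scales the determinant by lambda^n
assert np.isclose(np.linalg.det(lam * A), lam**n * np.linalg.det(A))
```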
Now our task is to find all functions $f(A)$ from matrices to real numbers that satisfy the multiplicative property:

$$f(AB) = f(A)f(B)\,.\tag{4}$$

We will use (4) repeatedly in the following calculation. First we decompose the matrix $A=HU$ using a polar decomposition, where $H$ is Hermitian and $U$ is unitary. Both $H$ and $U$ are individually diagonalizable (but not simultaneously diagonalizable, since $A$ is not in general diagonalizable).
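The polar decomposition itself is available in SciPy; `scipy.linalg.polar` with `side='left'` returns the factors in the $A=HU$ order used here (a minimal sketch):

```python
import numpy as np
from scipy.linalg import polar

rng = np.random.default_rng(1)
A = rng.standard_normal((3, 3))

# side='left' gives A = H @ U; SciPy returns the factors as (U, H)
U, H = polar(A, side='left')

assert np.allclose(A, H @ U)
assert np.allclose(H, H.conj().T)              # H is Hermitian
assert np.allclose(U @ U.conj().T, np.eye(3))  # U is unitary
# det A = det H * det U, which is used later in the derivation
assert np.isclose(np.linalg.det(A), np.linalg.det(H) * np.linalg.det(U))
```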
For any matrix $C$ that is diagonalizable, we can write $C = P D P^{-1}$, where $D$ is diagonal and $P$ is invertible. Then:

$$f(C) = f(P D P^{-1}) = f(P) f(D) f(P^{-1}) = f(P) f(P^{-1}) f(D) = f(P P^{-1} D) = f(D)\,.$$

That means that if a matrix is diagonalizable, then $f$ applied to that matrix is equal to $f$ applied to the corresponding diagonal matrix.
Below we will use matrices $P_i$ that are identity matrices with the 1st and $i$-th rows swapped. Now we can compute, using the fact that $H$ and $U$ are similar to diagonal matrices $D_H$ and $D_U$, and their product is also a diagonal matrix $D = D_H D_U$:
$$\begin{aligned}
f(A) &= f(HU) = f(H)f(U) = f(D_H)f(D_U) = f(D_H D_U) = f(D) \\
&= f(\mathrm{diag}(d_1,1,1,\dots))\, f(\mathrm{diag}(1,d_2,1,\dots)) \cdots f(\mathrm{diag}(1,\dots,1,d_n)) \\
&= f(P_1\,\mathrm{diag}(d_1,1,1,\dots)\,P_1^{-1})\, f(P_2\,\mathrm{diag}(d_2,1,1,\dots)\,P_2^{-1}) \cdots f(P_n\,\mathrm{diag}(d_n,1,1,\dots)\,P_n^{-1}) \\
&= f(\mathrm{diag}(d_1,1,1,\dots)\,\mathrm{diag}(d_2,1,1,\dots) \cdots \mathrm{diag}(d_n,1,1,\dots)) \\
&= f(\mathrm{diag}(d_1 d_2 \dots d_n,1,1,\dots)) \\
&= f(\mathrm{diag}(\det D,1,1,\dots)) \\
&= g(\det D) = g(\det D_H D_U) = g(\det D_H \det D_U) = g(\det H \det U) \\
&= g(\det HU) = g(\det A)\,,
\end{aligned}$$

where we defined a function $g(x)$ using

$$g(x) = f(\mathrm{diag}(x,1,1,\dots))\,.$$

Let's compute the functional equation for $g(x)$:
$$\begin{aligned}
g(xy) &= f(\mathrm{diag}(xy,1,1,\dots)) \\
&= f(\mathrm{diag}(x,1,1,\dots)\,\mathrm{diag}(y,1,1,\dots)) \\
&= f(\mathrm{diag}(x,1,1,\dots))\, f(\mathrm{diag}(y,1,1,\dots)) \\
&= g(x)\, g(y)\,.
\end{aligned}$$

In the above computations we have only used (4) and no other assumptions.
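The key trick above, rewriting $\mathrm{diag}(1,\dots,1,d_i,1,\dots)$ as $P_i\,\mathrm{diag}(d_i,1,\dots,1)\,P_i^{-1}$, can be checked numerically (a minimal sketch, assuming NumPy; the size $n$ and the values are arbitrary):

```python
import numpy as np

n, i, d = 4, 2, 3.5        # swap the 1st row with the i-th (0-based index i)
P = np.eye(n)
P[[0, i]] = P[[i, 0]]      # permutation matrix swapping rows 0 and i

lhs = np.diag([1.0 if k != i else d for k in range(n)])      # diag(1, ..., d, ..., 1)
rhs = P @ np.diag([d] + [1.0] * (n - 1)) @ np.linalg.inv(P)  # P diag(d, 1, ...) P^{-1}
assert np.allclose(lhs, rhs)
```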
Let's summarize our results. From (4) it follows that:

$$f(A) = g(\det A)\,,\tag{21}$$

$$g(xy) = g(x)g(y)\,,\tag{22}$$

for real numbers $x$, $y$.
The equation (22) can be solved in several different ways.

Approach 1: Assuming that $f(A)$ (and thus $g(x)$) is measurable, we show below that the only solutions are $g(x)=0$, $g(x)=|\mathrm{sign}(x)|$, $g(x)=|x|^s$ and $g(x)=\mathrm{sign}(x)|x|^s$. Consequently the only nonzero solutions to (4) are powers of the determinant (optionally multiplied by the signum function). If $f(A)$ is not measurable, then we also get highly pathological solutions (see the section below). The above is thus the most general solution.
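Each of these candidate solutions can be checked against (22) numerically (a minimal sketch, assuming NumPy; the exponent $s$ and the number of samples are arbitrary):

```python
import numpy as np

rng = np.random.default_rng(2)
s = 0.7
solutions = [
    lambda x: 0.0 * x,                      # g(x) = 0
    lambda x: np.abs(np.sign(x)),           # g(x) = |sign(x)|
    lambda x: np.abs(x) ** s,               # g(x) = |x|^s
    lambda x: np.sign(x) * np.abs(x) ** s,  # g(x) = sign(x) |x|^s
]
for g in solutions:
    for _ in range(100):
        x, y = rng.standard_normal(2)  # nonzero with probability 1
        assert np.isclose(g(x * y), g(x) * g(y))
```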
Approach 2: Instead of assuming measurability, we can instead assume homogeneity:

$$f(\lambda A) = \lambda^n f(A)\,,\tag{23}$$

where $n$ is the (integer) dimension of the matrix $A$. Using (21) and (22) we get:

$$\begin{aligned}
f(\lambda A) &= g(\det \lambda A) \\
&= g(\lambda^n \det A) \\
&= g(\lambda^n)\, g(\det A) \\
&= g(\lambda^n)\, f(A) \\
&= g(\lambda)^n f(A) \\
&= f(g(\lambda) A)\,.
\end{aligned}$$

Comparing the LHS, which by (23) equals $\lambda^n f(A)$, with the RHS we can see that:

$$g(\lambda) = \lambda\,,$$

and from (21) we get:
$$f(A) = \det A\,.\tag{27}$$

Summary: Assuming equations (4) and (23) we obtained (27) as the only nonzero solution. That means that we can define a determinant using (4) and (23) and no other assumptions.

Alternatively, if we only assume (4) and that $f(A)$ is measurable, then the only nonzero solutions are powers of the determinant (optionally multiplied by the signum function).
Cauchy's Additive Functional Equation

Cauchy's functional equation (Wikipedia (2025)) usually means the additive equation of this form:

$$f(x+y) = f(x) + f(y)\,.\tag{28}$$

There are other related equations, such as Cauchy's multiplicative functional equation in the next section.
For a positive integer $n$ we get:

$$f(nx) = f(\underbrace{x+x+\dots+x}_{n}) = \underbrace{f(x)+f(x)+\dots+f(x)}_{n} = nf(x)\,.$$

By substituting $y=nx$ we get $f(y)=nf(y/n)$, from which $f(y/n)=(1/n)f(y)$. Using these two relations for multiplication and division by an integer we obtain for any positive rational number $r = {p\over q}$, where $p$, $q$ are positive integers:

$$f(rx) = f\left({p \over q}x\right) = pf\left({x\over q}\right) = {p\over q}f(x) = r f(x)\,.$$

We also have to prove this for $p \le 0$, which we can do using $f(0)=f(0)+f(0)$, which implies $f(0)=0$, and using $0=f(0)=f(x-x)=f(x)+f(-x)$, from which it follows that $f(-x)=-f(x)$. Setting $x=1$ we get for all rational $q$:
$$f(q) = q f(1) = c q\,,$$

where $c=f(1)$ is a real constant. If we assume that $f(x)$ is continuous, then this implies that $f(x)=c x$ for all real $x$, since the rational numbers are dense in the real numbers. It turns out that one can relax the continuity requirement and only require that $f(x)$ is measurable. Since $f(x)=c x$ is linear, we get: the only measurable solutions of (28) are linear functions through the origin, $f(x)=cx$.

A linear function $f(x)=ax+b$ can be substituted into (28) and one obtains $b=0$. So this theorem is a simple way to remember the solutions to Cauchy's additive functional equation.
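To see the remark about $f(x)=ax+b$ concretely, here is an exact-arithmetic check (a minimal sketch, assuming only the Python standard library; `check_additive` is a hypothetical helper written just for this illustration):

```python
from fractions import Fraction

# f(x) = a*x + b satisfies f(x+y) = f(x) + f(y) only when b = 0:
# f(x+y) - f(x) - f(y) = a(x+y) + b - (ax+b) - (ay+b) = -b.
def check_additive(a, b, samples):
    f = lambda x: a * x + b
    return all(f(x + y) == f(x) + f(y) for x in samples for y in samples)

samples = [Fraction(p, q) for p in range(-3, 4) for q in range(1, 4)]
assert check_additive(Fraction(5, 2), Fraction(0), samples)      # b = 0: additive
assert not check_additive(Fraction(5, 2), Fraction(1), samples)  # b != 0: fails
```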
To prove this theorem, we introduce $g(x)=f(x)-f(1)x$ and prove that $g(x)=0$, from which it follows that $f(x)=f(1)x=c x$. We have:

$$g(x+y)=f(x+y)-f(1)(x+y)=f(x)-f(1)x + f(y)-f(1)y = g(x)+g(y)\,,$$

so $g(x)$ is also additive. For rational $q$ we have:

$$g(q)=f(q)-f(1)q=q f(1) - f(1)q = 0\,.$$

Also we get:

$$g(x+q)=g(x)+g(q)=g(x)\,,$$

so $g(x)$ is periodic with period $q$, for every rational $q$. It can be shown from this periodicity and from $g(x)$ being measurable that $g(x)$ is essentially bounded. Now we can compute the following integral, which exists and is finite:

$$\int_0^1 g(x)\, dx = \int_0^1 g(x+y)\, dx = \int_0^1 g(x)\, dx + \int_0^1 g(y)\, dx = \int_0^1 g(x)\, dx + g(y)\,,$$

so $g(y)=0$ for all real $y$. Here we have used the dominated convergence theorem for $g(x)=g(x+q_n)$, where $q_n$ is a sequence of rationals converging to the real $y$.
Cauchy's Multiplicative Functional Equation

Cauchy's multiplicative functional equation is:

$$f(xy) = f(x)f(y)\,.$$

There are only 3 cases that can happen:

$$f(x) = 0\,.$$

$$f(x) = 1\,.$$

$$f(x) = \begin{cases}
h(x) & \text{if } x > 0\,,\\
0 & \text{if } x = 0\,,\\
\pm h(|x|) & \text{if } x < 0\,.
\end{cases}$$
In the last case the function $h(x)$ is positive ($h(x) > 0$) and defined only for $x>0$. For $x < 0$ there are two options, corresponding to an even or an odd $f(x)$. We can equivalently write the two cases explicitly as follows:

Even:

$$f(x) = \begin{cases}
0 & \text{if } x = 0\,,\\
h(|x|) & \text{if } x \ne 0\,.
\end{cases}$$

Odd:

$$f(x) = \begin{cases}
0 & \text{if } x = 0\,,\\
\mathrm{sign}(x)\, h(|x|) & \text{if } x \ne 0\,.
\end{cases}$$
As a special case for $h(x)=1$ we get:

Even:

$$f(x) = |\mathrm{sign}(x)| = \begin{cases}
0 & \text{if } x = 0\,,\\
1 & \text{if } x \ne 0\,.
\end{cases}$$

Odd:

$$f(x) = \mathrm{sign}(x) = \begin{cases}
0 & \text{if } x = 0\,,\\
1 & \text{if } x > 0\,,\\
-1 & \text{if } x < 0\,.
\end{cases}$$
We can then use the substitution $g(x)=\log f(e^x)$ to convert the multiplicative equation into the additive equation from the previous section:

$$\begin{aligned}
g(x+y) &= \log f(e^{x+y}) = \log f(e^x e^y) = \log(f(e^x) f(e^y)) \\
&= \log f(e^x) + \log f(e^y) = g(x) + g(y)\,.
\end{aligned}$$

We find the solution using the previous section, and then we compute $h(x)$ using $h(x)=\exp(g(\log x))$, which satisfies:
$$\begin{aligned}
h(xy) &= \exp(g(\log(xy))) = \exp(g(\log x + \log y)) = \exp(g(\log x) + g(\log y)) \\
&= \exp(g(\log x))\exp(g(\log y)) = h(x)h(y)\,.
\end{aligned}$$

For measurable $g(x)$ the solution to the additive equation is $g(x) = cx$, so we assume $f(x)$ is measurable (which implies $g(x)$ is measurable) and we get $h(x)=\exp(c\log x)=\exp(\log(x^c))=x^c$, and the four solutions are:
$$f(x) = 0\,.$$

$$f(x) = 1\,.$$

$$f(x) = \begin{cases}
0 & \text{if } x = 0\,,\\
h(|x|) & \text{if } x \ne 0\,,
\end{cases}
= \begin{cases}
0 & \text{if } x = 0\,,\\
|x|^c & \text{if } x \ne 0\,.
\end{cases}$$

$$f(x) = \begin{cases}
0 & \text{if } x = 0\,,\\
\mathrm{sign}(x)\, h(|x|) & \text{if } x \ne 0\,,
\end{cases}
= \begin{cases}
0 & \text{if } x = 0\,,\\
\mathrm{sign}(x)\, |x|^c & \text{if } x \ne 0\,.
\end{cases}$$
The solutions can be equivalently written as:

$$f(x)=0\,.$$

$$f(x) = |\mathrm{sign}(x)| = \begin{cases}
0 & \text{if } x = 0\,,\\
1 & \text{if } x \ne 0\,.
\end{cases}$$

$f(x)=|x|^c$, with $f(0)=0$ (needed for $c < 0$).

$f(x)=\mathrm{sign}(x)|x|^c$, with $f(0)=0$ (needed for $c < 0$).

The solution $f(x)=1$ is already included in $|x|^c$ for $c=0$. If $c$ is an integer, then the last two solutions can always be written (for both even and odd integers) as $f(x)=x^c$ and $f(x)=\mathrm{sign}(x)\, x^c$. Some examples are: $0$, $1$, $\mathrm{sign}(x)$, $|\mathrm{sign}(x)|$, $x$, $|x|$, $x^2$, $\mathrm{sign}(x)\, x^2$, $x^3$, $|x|^3$, ${1\over x}$, ${1\over |x|}$, ${1\over x^2}$, ${\mathrm{sign}(x)\over x^2}={|x|\over x^3}$, ${1\over x^3}$, $\sqrt{|x|}$, ${1\over\sqrt{|x|^3}}$, etc.
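These closed forms are easy to test against the multiplicative equation, including the stitched value $f(0)=0$ (a minimal sketch, assuming NumPy; the exponent $c$ is arbitrary):

```python
import numpy as np

rng = np.random.default_rng(3)
c = 1.5

def f_even(x):  # f(x) = |x|^c with f(0) = 0
    return np.where(x == 0, 0.0, np.abs(x) ** c)

def f_odd(x):   # f(x) = sign(x) |x|^c with f(0) = 0
    return np.sign(x) * f_even(x)

for f in (f_even, f_odd):
    for _ in range(100):
        x, y = rng.standard_normal(2)  # includes negative arguments
        assert np.isclose(f(x * y), f(x) * f(y))
    assert f(0.0) == 0.0               # the stitched value at zero
```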
The function $h(x)$ is only defined for positive $x$, and that is all we need. But if we are curious how the formula evaluates for $x \le 0$, we get the following. For $c>0$ and $x=0^+$ we get:

$$h(x)=\exp(c \log 0^+)=\exp(-c\,\infty)=0\,.$$

For $x<0$ we get:
$$\begin{aligned}
h(x) &= \exp(c (\log |x| + i \arg x)) = \exp(c (\log(-x) + i \pi)) \\
&= \exp(\log((-x)^c))\exp(i c \pi) = (-x)^c \exp(i c \pi)\,.
\end{aligned}$$

For an even integer $c$ we have $\exp(ic\pi)=1$ and $h(x)=(-x)^c=x^c$, for an odd integer $c$ we have $\exp(ic\pi)=-1$ and $h(x)=-(-x)^c=x^c$, and for non-integer $c$ we get a complex number. For example, for $c=1$ (which is an odd integer) we get:

$$h(x) = (-x) (-1) = x\,.$$

Due to the discontinuity at $x=0$, we can stitch solutions there and extend $h(x)$ as an even or odd function.
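This evaluation on the negative axis can be reproduced with complex arithmetic (a minimal sketch, assuming only the Python standard library; `cmath.log` uses the principal branch, with $\arg x = \pi$ for negative real $x$):

```python
import cmath

x = -2.0
for c in (1, 2, 3):  # odd and even integer exponents
    h = cmath.exp(c * cmath.log(x))  # exp(c (log|x| + i*pi))
    expected = (-x) ** c * cmath.exp(1j * c * cmath.pi)
    assert cmath.isclose(h, expected)
    assert cmath.isclose(h, x ** c)  # h(x) = x^c for any integer c

# non-integer c gives a genuinely complex number:
print(cmath.exp(0.5 * cmath.log(x)))  # approx 1.4142j, i.e. sqrt(2) i
```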
Geometric Definition

The way to define a determinant using area/volume from geometry is as follows:

$$V(\mathbf{e}_1,\mathbf{e}_2) = 1\,,$$

$$V(\mathbf{u},\mathbf{u}) = 0\,,$$

and for $c > 0$:

$$V(c\mathbf{u}, \mathbf{w}) = c V(\mathbf{u}, \mathbf{w}) = V(\mathbf{u}, c\mathbf{w})\,,$$

$$V(\mathbf{u}+\mathbf{v}, \mathbf{w}) = V(\mathbf{u}, \mathbf{w}) + V(\mathbf{v}, \mathbf{w})\,,$$

$$V(\mathbf{w}, \mathbf{u}+\mathbf{v}) = V(\mathbf{w}, \mathbf{u}) + V(\mathbf{w}, \mathbf{v})\,.$$

All these formulas can be "derived" from intuitive properties of areas in 2D.
Another equivalent set of requirements is:

$$V(\mathbf{e}_1,\mathbf{e}_2) = 1\,,$$

$$V(c\mathbf{u}, \mathbf{w}) = c V(\mathbf{u}, \mathbf{w}) = V(\mathbf{u}, c\mathbf{w})\,,$$

$$V(\mathbf{u}+\mathbf{v}, \mathbf{v}) = V(\mathbf{u}, \mathbf{v}) = V(\mathbf{u}, \mathbf{u}+\mathbf{v})\,.$$

Let's use the first set of requirements below. We get:
$$0 = V(\mathbf{v}-\mathbf{v}, \mathbf{w}) = V(\mathbf{v}, \mathbf{w}) + V(-\mathbf{v}, \mathbf{w})\,,$$

from which it follows:

$$V(-\mathbf{v}, \mathbf{w}) = -V(\mathbf{v}, \mathbf{w})\,.$$

Now we derive antisymmetry:

$$0 = V(\mathbf{u}+\mathbf{v}, \mathbf{u}+\mathbf{v})
= V(\mathbf{u},\mathbf{u}) + V(\mathbf{u},\mathbf{v}) + V(\mathbf{v},\mathbf{u}) + V(\mathbf{v},\mathbf{v})
= V(\mathbf{u},\mathbf{v}) + V(\mathbf{v},\mathbf{u})\,,$$

and we get:

$$V(\mathbf{v},\mathbf{u}) = -V(\mathbf{u},\mathbf{v})\,.$$

Now we derive a formula for $V$:
$$\begin{aligned}
V(\mathbf{u},\mathbf{v}) &= V(u_1\mathbf{e}_1+u_2\mathbf{e}_2,\; v_1\mathbf{e}_1+v_2\mathbf{e}_2) \\
&= u_1v_2V(\mathbf{e}_1,\mathbf{e}_2) + u_2v_1V(\mathbf{e}_2,\mathbf{e}_1) \\
&= (u_1v_2-u_2v_1)V(\mathbf{e}_1,\mathbf{e}_2) \\
&= u_1v_2-u_2v_1\,.
\end{aligned}$$

In the first step the terms $u_1v_1 V(\mathbf{e}_1,\mathbf{e}_1)$ and $u_2v_2 V(\mathbf{e}_2,\mathbf{e}_2)$ vanish because $V(\mathbf{u},\mathbf{u})=0$. This proves both existence and uniqueness.
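The derived formula $V(\mathbf{u},\mathbf{v}) = u_1 v_2 - u_2 v_1$ and the axioms it came from can be checked numerically (a minimal sketch, assuming NumPy):

```python
import numpy as np

def V(u, v):
    # V(u, v) = u1*v2 - u2*v1, the signed area of the parallelogram (u, v)
    return u[0] * v[1] - u[1] * v[0]

e1, e2 = np.array([1.0, 0.0]), np.array([0.0, 1.0])
rng = np.random.default_rng(4)
u, v, w = rng.standard_normal((3, 2))
c = 2.5

assert V(e1, e2) == 1.0                            # normalization
assert V(u, u) == 0.0                              # degenerate parallelogram
assert np.isclose(V(c * u, w), c * V(u, w))        # scaling
assert np.isclose(V(u + v, w), V(u, w) + V(v, w))  # additivity
assert np.isclose(V(v, u), -V(u, v))               # antisymmetry
```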
Coordinate Definition

The determinant of a 2x2 matrix $(\mathbf{A})_{ij}=a_{ij}$ is defined as:

$$\det \mathbf{A} = \epsilon^{ij} a_{1i}a_{2j} = {1\over 2!}\epsilon^{ij}\epsilon^{kl} a_{ki}a_{lj}$$

and for a 3x3 matrix $(\mathbf{A})_{ij}=a_{ij}$:

$$\det \mathbf{A} = \epsilon^{ijk} a_{1i}a_{2j}a_{3k} = {1\over 3!} \epsilon^{ijk}\epsilon^{lmn} a_{li}a_{mj}a_{nk}$$

And so on in any dimension. The symbol $\epsilon^{ijk}$ is the Levi-Civita symbol. All the properties of determinants are encoded in the Levi-Civita symbol, so we need to understand it well; the rest are just the usual tensor manipulations.
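These index formulas translate directly into `numpy.einsum` (a minimal sketch, assuming NumPy; the Levi-Civita symbols are constructed by hand):

```python
import numpy as np

# Levi-Civita symbols in 2D and 3D
eps2 = np.array([[0, 1], [-1, 0]], dtype=float)
eps3 = np.zeros((3, 3, 3))
for i, j, k in [(0, 1, 2), (1, 2, 0), (2, 0, 1)]:
    eps3[i, j, k] = 1.0   # even permutations
    eps3[i, k, j] = -1.0  # odd permutations

rng = np.random.default_rng(5)
A2 = rng.standard_normal((2, 2))
A3 = rng.standard_normal((3, 3))

# det A = eps^{ij} a_{1i} a_{2j}
det2 = np.einsum("ij,i,j->", eps2, A2[0], A2[1])
# det A = eps^{ijk} a_{1i} a_{2j} a_{3k}
det3 = np.einsum("ijk,i,j,k->", eps3, A3[0], A3[1], A3[2])

assert np.isclose(det2, np.linalg.det(A2))
assert np.isclose(det3, np.linalg.det(A3))
```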
For the 2x2 matrix we can compute the second formula on the RHS as follows:

$$\begin{aligned}
\det \mathbf{A} &= {1\over 2!}\epsilon^{ij}\epsilon^{kl} a_{ki}a_{lj}
= {1\over 2!}\det\begin{pmatrix} \delta^{ik} & \delta^{il} \\
\delta^{jk} & \delta^{jl} \end{pmatrix} a_{ki}a_{lj} \\
&= {1\over 2!}( \delta^{ik}\delta^{jl} - \delta^{il} \delta^{jk}) a_{ki}a_{lj}
= {1\over 2!}( a_{ii}a_{jj} - a_{ji}a_{ij}) \\
&= {1\over 2!}\left( (a_{11}+a_{22})(a_{11}+a_{22}) - (a_{11}a_{11}+a_{12}a_{21}+a_{21}a_{12}+a_{22}a_{22})\right) \\
&= {1\over 2!}( 2a_{11}a_{22} - 2a_{21}a_{12})
= a_{11}a_{22} - a_{21}a_{12}\,.
\end{aligned}$$

The first formula on the RHS is just:

$$\det \mathbf{A} = \epsilon^{ij} a_{1i}a_{2j} = a_{11}a_{22}-a_{12}a_{21}\,.$$

Both results are equivalent. It works the same way in any dimension; the factorial factor corrects for the overcounting.
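The factorial-normalized version can be checked the same way (a minimal sketch, assuming NumPy; only the 2x2 case is shown):

```python
import numpy as np
from math import factorial

eps2 = np.array([[0, 1], [-1, 0]], dtype=float)
rng = np.random.default_rng(6)
A = rng.standard_normal((2, 2))

# det A = (1/2!) eps^{ij} eps^{kl} a_{ki} a_{lj}
det_full = np.einsum("ij,kl,ki,lj->", eps2, eps2, A, A) / factorial(2)
assert np.isclose(det_full, np.linalg.det(A))
```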
Let us now derive the multiplicative formula for determinants:

$$\det\mathbf{AB} = \det\mathbf{A}\det\mathbf{B}\,.$$

Let's show it for 2x2 matrices. The LHS is:

$$\det\mathbf{AB} = \epsilon^{ij} (\mathbf{AB})_{1i}(\mathbf{AB})_{2j} = \epsilon^{ij} (a_1{}^k b_{ki}) (a_2{}^l b_{lj})\,.$$

The RHS is:
$$\begin{aligned}
\det\mathbf{A}\det\mathbf{B} &= \epsilon^{ij} a_{1i}a_{2j}\, \epsilon^{kl} b_{1k}b_{2l} \\
&= (\delta^{ik}\delta^{jl} - \delta^{il}\delta^{jk})\, a_{1i}a_{2j}\, b_{1k}b_{2l} \\
&= a_{1i}a_{2j}\, b_1{}^i b_2{}^j - a_{1i}a_{2j}\, b_1{}^j b_2{}^i \\
&= a_{1i}a_{2j}\, (b_1{}^i b_2{}^j - b_1{}^j b_2{}^i) \\
&= \epsilon^{kl}a_{1i}a_{2j}\, b_k{}^i b_l{}^j \\
&= \epsilon^{kl}a_{1i}\, b_k{}^i\, a_{2j}\, b_l{}^j \\
&= \epsilon^{ij}a_{1k}\, b_i{}^k\, a_{2l}\, b_j{}^l \\
&= \epsilon^{ij}a_1{}^k\, b_{ik}\, a_2{}^l\, b_{jl} \\
&= \det\mathbf{AB}^T\,.
\end{aligned}$$

Now we need to prove $\det\mathbf{A}^T = \det\mathbf{A}$. We can do that using the factorial formula by relabeling indices as follows:

$$\det\mathbf{A}^T
= {1\over 2!}\epsilon^{ij}\epsilon^{kl} a_{ik}a_{jl}
= {1\over 2!}\epsilon^{kl}\epsilon^{ij} a_{ki}a_{lj}
= {1\over 2!}\epsilon^{ij}\epsilon^{kl} a_{ki}a_{lj}
= \det\mathbf{A}\,.$$

We have thus shown:

$$\det\mathbf{AB} = \det\mathbf{A}\det\mathbf{B}^T = \det\mathbf{A}\det\mathbf{B}\,.$$

Alternative Proof

First we need the following identity:
$$\det({\mathbf A})\, \epsilon_{ij}
= \epsilon^{kl} a_{1k}a_{2l}\, \epsilon_{ij}
= \det\begin{pmatrix}
\delta_i{}^{k} & \delta_i{}^{l} \\
\delta_j{}^{k} & \delta_j{}^{l} \end{pmatrix} a_{1k}a_{2l}
= \det\begin{pmatrix}
a_{1i} & a_{2i} \\
a_{1j} & a_{2j} \end{pmatrix}
= \epsilon^{kl} a_{ki} a_{lj}\,.$$

The final step above follows from using the definition of a determinant as follows:
$$\det(\mathbf{X})
= \det\begin{pmatrix}
a_{1i} & a_{2i} \\
a_{1j} & a_{2j} \end{pmatrix}
= \epsilon^{kl}\, \mathbf{X}_{1k}\, \mathbf{X}_{2l}\,.$$

Now $\mathbf{X}_{11}=a_{1i}$ and $\mathbf{X}_{12}=a_{2i}$, so $\mathbf{X}_{1k}=a_{ki}$. Similarly $\mathbf{X}_{2l}=a_{lj}$, so we get:

$$=\epsilon^{kl} a_{ki} a_{lj}\,.$$

Alternatively, one can simply expand the determinant as follows:
$$\det\begin{pmatrix}
a_{1i} & a_{2i} \\
a_{1j} & a_{2j} \end{pmatrix}
= a_{1i} a_{2j} - a_{2i} a_{1j}
= \epsilon^{kl} a_{ki} a_{lj}\,.$$

Contracting both sides with $\epsilon^{ij}$ we get:
$$\det({\mathbf A})\, \epsilon_{ij}\, \epsilon^{ij} = \epsilon^{ij}\epsilon^{kl} a_{ki} a_{lj}\,,$$

$$\det({\mathbf A})\, 2! = \epsilon^{ij}\epsilon^{kl} a_{ki} a_{lj}\,,$$

$$\det({\mathbf A}) = {1\over2!}\epsilon^{ij}\epsilon^{kl} a_{ki} a_{lj}\,.$$

Note that $\epsilon^{kl} a_{ki} a_{lj} = \epsilon^{kl} a_{ik} a_{jl}$ due to the transposition property derived using the last equation in the previous section:

$$\det({\mathbf A})\, \epsilon_{ij}
= \epsilon^{kl} a_{ki} a_{lj}
= \det({\mathbf A}^T)\, \epsilon_{ij}
= \epsilon^{kl} a_{ik} a_{jl}\,.$$

Now we can compute:
$$\begin{aligned}
\det(\mathbf{AB}) &= \epsilon^{ij} (\mathbf{AB})_{1i} (\mathbf{AB})_{2j}
= \epsilon^{ij} (a_1{}^{k}b_{ki}) (a_2{}^{l}b_{lj})
= a_1{}^{k} a_2{}^{l} (\epsilon^{ij} b_{ki} b_{lj}) \\
&= a_1{}^{k} a_2{}^{l} (\epsilon^{ij} b_{ik} b_{jl})
= a_1{}^{k} a_2{}^{l} \det(\mathbf{B})\, \epsilon_{kl}
= (\epsilon^{kl} a_{1k} a_{2l}) \det(\mathbf{B})
= \det(\mathbf{A}) \det(\mathbf{B})\,.
\end{aligned}$$
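The identity $\det(\mathbf{A})\,\epsilon_{ij} = \epsilon^{kl} a_{ki} a_{lj}$ that drives this proof, and the resulting multiplicative property, can both be verified numerically (a minimal sketch, assuming NumPy):

```python
import numpy as np

eps = np.array([[0, 1], [-1, 0]], dtype=float)
rng = np.random.default_rng(7)
A, B = rng.standard_normal((2, 2, 2))

# det(A) eps_{ij} = eps^{kl} a_{ki} a_{lj}, checked as a full 2x2 tensor equation
lhs = np.linalg.det(A) * eps
rhs = np.einsum("kl,ki,lj->ij", eps, A, A)
assert np.allclose(lhs, rhs)

# ... and the multiplicative property follows:
assert np.isclose(np.linalg.det(A @ B), np.linalg.det(A) * np.linalg.det(B))
```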