Schur product theorem - Misplaced Pages

This is the current revision of this page, as edited by Volunteer Marek (talk | contribs) at 19:09, 23 November 2024. The present address (URL) is a permanent link to this version.

Revision as of 19:09, 23 November 2024 by Volunteer Marek (talk | contribs)(diff) ← Previous revision | Latest revision (diff) | Newer revision → (diff)

In mathematics, particularly in linear algebra, the Schur product theorem states that the Hadamard product of two positive definite matrices is also a positive definite matrix. The result is named after Issai Schur (Schur 1911, p. 14, Theorem VII) (note that Schur signed as J. Schur in Journal für die reine und angewandte Mathematik.)

The converse of the theorem holds in the following sense: if $M$ is a symmetric matrix and the Hadamard product $M\circ N$ is positive definite for all positive definite matrices $N$ , then $M$ itself is positive definite.

Proof

Proof using the trace formula

For any matrices $M$ and $N$ , the Hadamard product $M\circ N$ considered as a bilinear form acts on vectors $a,b$ as

a^{*}(M\circ N)b=\operatorname {tr} \left(M^{\textsf {T}}\operatorname {diag} \left(a^{*}\right)N\operatorname {diag} (b)\right)

where $\operatorname {tr}$ is the matrix trace and $\operatorname {diag} (a)$ is the diagonal matrix having as diagonal entries the elements of $a$ .

Suppose $M$ and $N$ are positive definite, and so Hermitian. We can consider their square-roots $M^{\frac {1}{2}}$ and $N^{\frac {1}{2}}$ , which are also Hermitian, and write

\operatorname {tr} \left(M^{\textsf {T}}\operatorname {diag} \left(a^{*}\right)N\operatorname {diag} (b)\right)=\operatorname {tr} \left({\overline {M}}^{\frac {1}{2}}{\overline {M}}^{\frac {1}{2}}\operatorname {diag} \left(a^{*}\right)N^{\frac {1}{2}}N^{\frac {1}{2}}\operatorname {diag} (b)\right)=\operatorname {tr} \left({\overline {M}}^{\frac {1}{2}}\operatorname {diag} \left(a^{*}\right)N^{\frac {1}{2}}N^{\frac {1}{2}}\operatorname {diag} (b){\overline {M}}^{\frac {1}{2}}\right)

Then, for $a=b$ , this is written as $\operatorname {tr} \left(A^{*}A\right)$ for $A=N^{\frac {1}{2}}\operatorname {diag} (a){\overline {M}}^{\frac {1}{2}}$ and thus is strictly positive for $A\neq 0$ , which occurs if and only if $a\neq 0$ . This shows that $(M\circ N)$ is a positive definite matrix.

Proof using Gaussian integration

Case of M = N

Let $X$ be an $n$ -dimensional centered Gaussian random variable with covariance $\langle X_{i}X_{j}\rangle =M_{ij}$ . Then the covariance matrix of $X_{i}^{2}$ and $X_{j}^{2}$ is

\operatorname {Cov} \left(X_{i}^{2},X_{j}^{2}\right)=\left\langle X_{i}^{2}X_{j}^{2}\right\rangle -\left\langle X_{i}^{2}\right\rangle \left\langle X_{j}^{2}\right\rangle

Using Wick's theorem to develop $\left\langle X_{i}^{2}X_{j}^{2}\right\rangle =2\left\langle X_{i}X_{j}\right\rangle ^{2}+\left\langle X_{i}^{2}\right\rangle \left\langle X_{j}^{2}\right\rangle$ we have

\operatorname {Cov} \left(X_{i}^{2},X_{j}^{2}\right)=2\left\langle X_{i}X_{j}\right\rangle ^{2}=2M_{ij}^{2}

Since a covariance matrix is positive definite, this proves that the matrix with elements $M_{ij}^{2}$ is a positive definite matrix.

General case

Let $X$ and $Y$ be $n$ -dimensional centered Gaussian random variables with covariances $\left\langle X_{i}X_{j}\right\rangle =M_{ij}$ , $\left\langle Y_{i}Y_{j}\right\rangle =N_{ij}$ and independent from each other so that we have

\left\langle X_{i}Y_{j}\right\rangle =0

for any

i,j

Then the covariance matrix of $X_{i}Y_{i}$ and $X_{j}Y_{j}$ is

\operatorname {Cov} \left(X_{i}Y_{i},X_{j}Y_{j}\right)=\left\langle X_{i}Y_{i}X_{j}Y_{j}\right\rangle -\left\langle X_{i}Y_{i}\right\rangle \left\langle X_{j}Y_{j}\right\rangle

Using Wick's theorem to develop

\left\langle X_{i}Y_{i}X_{j}Y_{j}\right\rangle =\left\langle X_{i}X_{j}\right\rangle \left\langle Y_{i}Y_{j}\right\rangle +\left\langle X_{i}Y_{i}\right\rangle \left\langle X_{j}Y_{j}\right\rangle +\left\langle X_{i}Y_{j}\right\rangle \left\langle X_{j}Y_{i}\right\rangle

and also using the independence of $X$ and $Y$ , we have

\operatorname {Cov} \left(X_{i}Y_{i},X_{j}Y_{j}\right)=\left\langle X_{i}X_{j}\right\rangle \left\langle Y_{i}Y_{j}\right\rangle =M_{ij}N_{ij}

Since a covariance matrix is positive definite, this proves that the matrix with elements $M_{ij}N_{ij}$ is a positive definite matrix.

Proof using eigendecomposition

Proof of positive semidefiniteness

Let $M=\sum \mu _{i}m_{i}m_{i}^{\textsf {T}}$ and $N=\sum \nu _{i}n_{i}n_{i}^{\textsf {T}}$ . Then

M\circ N=\sum _{ij}\mu _{i}\nu _{j}\left(m_{i}m_{i}^{\textsf {T}}\right)\circ \left(n_{j}n_{j}^{\textsf {T}}\right)=\sum _{ij}\mu _{i}\nu _{j}\left(m_{i}\circ n_{j}\right)\left(m_{i}\circ n_{j}\right)^{\textsf {T}}

Each $\left(m_{i}\circ n_{j}\right)\left(m_{i}\circ n_{j}\right)^{\textsf {T}}$ is positive semidefinite (but, except in the 1-dimensional case, not positive definite, since they are rank 1 matrices). Also, $\mu _{i}\nu _{j}>0$ thus the sum $M\circ N$ is also positive semidefinite.

Proof of definiteness

To show that the result is positive definite requires even further proof. We shall show that for any vector $a\neq 0$ , we have $a^{\textsf {T}}(M\circ N)a>0$ . Continuing as above, each $a^{\textsf {T}}\left(m_{i}\circ n_{j}\right)\left(m_{i}\circ n_{j}\right)^{\textsf {T}}a\geq 0$ , so it remains to show that there exist $i$ and $j$ for which corresponding term above is nonzero. For this we observe that

a^{\textsf {T}}(m_{i}\circ n_{j})(m_{i}\circ n_{j})^{\textsf {T}}a=\left(\sum _{k}m_{i,k}n_{j,k}a_{k}\right)^{2}

Since $N$ is positive definite, there is a $j$ for which $n_{j}\circ a\neq 0$ (since otherwise $n_{j}^{\textsf {T}}a=\sum _{k}(n_{j}\circ a)_{k}=0$ for all $j$ ), and likewise since $M$ is positive definite there exists an $i$ for which $\sum _{k}m_{i,k}(n_{j}\circ a)_{k}=m_{i}^{\textsf {T}}(n_{j}\circ a)\neq 0.$ However, this last sum is just $\sum _{k}m_{i,k}n_{j,k}a_{k}$ . Thus its square is positive. This completes the proof.

References

Schur, J. (1911). "Bemerkungen zur Theorie der beschränkten Bilinearformen mit unendlich vielen Veränderlichen". Journal für die reine und angewandte Mathematik. 1911 (140): 1–28. doi:10.1515/crll.1911.140.1. S2CID 120411177.
Zhang, Fuzhen, ed. (2005). The Schur Complement and Its Applications. Numerical Methods and Algorithms. Vol. 4. doi:10.1007/b105056. ISBN 0-387-24271-6., page 9, Ch. 0.6 Publication under J. Schur
Ledermann, W. (1983). "Issai Schur and His School in Berlin". Bulletin of the London Mathematical Society. 15 (2): 97–106. doi:10.1112/blms/15.2.97.

External links

Bemerkungen zur Theorie der beschränkten Bilinearformen mit unendlich vielen Veränderlichen at EUDML

Categories: