Revision as of 01:49, 4 February 2019 edit89.107.5.192 (talk) parantheses← Previous edit | Revision as of 01:52, 4 February 2019 edit undo89.107.5.192 (talk) TransposeNext edit → | ||
Line 6: | Line 6: | ||
For any matrices <math>M</math> and <math>N</math>, the Hadamard product <math>M \circ N</math> considered as a bilinear form acts on vectors <math>a, b</math> as | For any matrices <math>M</math> and <math>N</math>, the Hadamard product <math>M \circ N</math> considered as a bilinear form acts on vectors <math>a, b</math> as | ||
: <math>a^* (M \circ N) b = \operatorname{tr}\left(M^T \operatorname{diag}\left(a^*\right) N \operatorname{diag}(b)\right)</math> | : <math>a^* (M \circ N) b = \operatorname{tr}\left(M^\textsf{T} \operatorname{diag}\left(a^*\right) N \operatorname{diag}(b)\right)</math> | ||
where <math>\operatorname{tr}</math> is the matrix ] and <math>\operatorname{diag}(a)</math> is the ] having as diagonal entries the elements of <math>a</math>. | where <math>\operatorname{tr}</math> is the matrix ] and <math>\operatorname{diag}(a)</math> is the ] having as diagonal entries the elements of <math>a</math>. | ||
Line 12: | Line 12: | ||
Suppose <math>M</math> and <math>N</math> are positive definite, and so ]. We can consider their square-roots <math>M^\frac{1}{2}</math> and <math>N^\frac{1}{2}</math>, which are also Hermitian, and write | Suppose <math>M</math> and <math>N</math> are positive definite, and so ]. We can consider their square-roots <math>M^\frac{1}{2}</math> and <math>N^\frac{1}{2}</math>, which are also Hermitian, and write | ||
: <math> | : <math> | ||
\operatorname{tr}\left(M^T \operatorname{diag}\left(a^*\right) N \operatorname{diag}(b)\right) = | \operatorname{tr}\left(M^\textsf{T} \operatorname{diag}\left(a^*\right) N \operatorname{diag}(b)\right) = | ||
\operatorname{tr}\left(\overline{M}^\frac{1}{2} \overline{M}^\frac{1}{2} \operatorname{diag}\left(a^*\right) N^\frac{1}{2} N^\frac{1}{2} \operatorname{diag}(b)\right) = | \operatorname{tr}\left(\overline{M}^\frac{1}{2} \overline{M}^\frac{1}{2} \operatorname{diag}\left(a^*\right) N^\frac{1}{2} N^\frac{1}{2} \operatorname{diag}(b)\right) = | ||
\operatorname{tr}\left(\overline{M}^\frac{1}{2} \operatorname{diag}\left(a^*\right) N^\frac{1}{2} N^\frac{1}{2} \operatorname{diag}(b) \overline{M}^\frac{1}{2}\right) | \operatorname{tr}\left(\overline{M}^\frac{1}{2} \operatorname{diag}\left(a^*\right) N^\frac{1}{2} N^\frac{1}{2} \operatorname{diag}(b) \overline{M}^\frac{1}{2}\right) | ||
Line 53: | Line 53: | ||
==== Proof of positive semidefiniteness ==== | ==== Proof of positive semidefiniteness ==== | ||
Let <math>M = \sum \mu_i m_i m_i^T</math> and <math>N = \sum \nu_i n_i n_i^T</math>. Then | Let <math>M = \sum \mu_i m_i m_i^\textsf{T}</math> and <math>N = \sum \nu_i n_i n_i^\textsf{T}</math>. Then | ||
: <math>M \circ N = \sum_{ij} \mu_i \nu_j \left(m_i m_i^T\right) \circ \left(n_j n_j^T\right) = \sum_{ij} \mu_i \nu_j \left(m_i \circ n_j\right) \left(m_i \circ n_j\right)^T</math> | : <math>M \circ N = \sum_{ij} \mu_i \nu_j \left(m_i m_i^\textsf{T}\right) \circ \left(n_j n_j^\textsf{T}\right) = \sum_{ij} \mu_i \nu_j \left(m_i \circ n_j\right) \left(m_i \circ n_j\right)^\textsf{T}</math> | ||
Each <math>\left(m_i \circ n_j\right) \left(m_i \circ n_j\right)^T</math> is positive semidefinite (but, except in the 1-dimensional case, not positive definite, since they are ] 1 matrices). Also, <math>\mu_i \nu_j > 0</math> thus the sum <math>M \circ N</math> is also positive semidefinite. | Each <math>\left(m_i \circ n_j\right) \left(m_i \circ n_j\right)^\textsf{T}</math> is positive semidefinite (but, except in the 1-dimensional case, not positive definite, since they are ] 1 matrices). Also, <math>\mu_i \nu_j > 0</math> thus the sum <math>M \circ N</math> is also positive semidefinite. | ||
==== Proof of definiteness ==== | ==== Proof of definiteness ==== | ||
To show that the result is positive definite requires further proof. We shall show that for any vector <math>a \neq 0</math>, we have <math>a^T (M \circ N) a > 0</math>. Continuing as above, each <math>a^T \left(m_i \circ n_j\right) \left(m_i \circ n_j\right)^T a \ge 0</math>, so it remains to show that there exist <math>i</math> and <math>j</math> for which the inequality is strict. For this we observe that | To show that the result is positive definite requires further proof. We shall show that for any vector <math>a \neq 0</math>, we have <math>a^\textsf{T} (M \circ N) a > 0</math>. Continuing as above, each <math>a^\textsf{T} \left(m_i \circ n_j\right) \left(m_i \circ n_j\right)^\textsf{T} a \ge 0</math>, so it remains to show that there exist <math>i</math> and <math>j</math> for which the inequality is strict. For this we observe that | ||
: <math>a^T (m_i \circ n_j) (m_i \circ n_j)^T a = \left(\sum_k m_{i,k} n_{j,k} a_k\right)^2</math> | : <math>a^\textsf{T} (m_i \circ n_j) (m_i \circ n_j)^\textsf{T} a = \left(\sum_k m_{i,k} n_{j,k} a_k\right)^2</math> | ||
Since <math>N</math> is positive definite, there is a <math>j</math> for which <math>n_{j,k} a_k</math> is not 0 for all <math>k</math>, and then, since <math>M</math> is positive definite, there is an <math>i</math> for which <math>m_{i,k} n_{j,k} a_k</math> is not 0 for all <math>k</math>. Then for this <math>i</math> and <math>j</math> we have <math>\left(\sum_k m_{i,k} n_{j,k} a_k\right)^2 > 0</math>. This completes the proof. | Since <math>N</math> is positive definite, there is a <math>j</math> for which <math>n_{j,k} a_k</math> is not 0 for all <math>k</math>, and then, since <math>M</math> is positive definite, there is an <math>i</math> for which <math>m_{i,k} n_{j,k} a_k</math> is not 0 for all <math>k</math>. Then for this <math>i</math> and <math>j</math> we have <math>\left(\sum_k m_{i,k} n_{j,k} a_k\right)^2 > 0</math>. This completes the proof. |
Revision as of 01:52, 4 February 2019
In mathematics, particularly in linear algebra, the Schur product theorem states that the Hadamard product of two positive definite matrices is also a positive definite matrix. The result is named after Issai Schur (Schur 1911, p. 14, Theorem VII) (note that Schur signed as J. Schur in Journal für die reine und angewandte Mathematik.)
Proof
Proof using the trace formula
For any matrices and , the Hadamard product considered as a bilinear form acts on vectors as
where is the matrix trace and is the diagonal matrix having as diagonal entries the elements of .
Suppose and are positive definite, and so Hermitian. We can consider their square-roots and , which are also Hermitian, and write
Then, for , this is written as for and thus is strictly positive for , which occurs if and only if . This shows that is a positive definite matrix.
Proof using Gaussian integration
Case of M = N
Let be an -dimensional centered Gaussian random variable with covariance . Then the covariance matrix of and is
Using Wick's theorem to develop we have
Since a covariance matrix is positive definite, this proves that the matrix with elements is a positive definite matrix.
General case
Let and be -dimensional centered Gaussian random variables with covariances , and independent from each other so that we have
- for any
Then the covariance matrix of and is
Using Wick's theorem to develop
and also using the independence of and , we have
Since a covariance matrix is positive definite, this proves that the matrix with elements is a positive definite matrix.
Proof using eigendecomposition
Proof of positive semidefiniteness
Let and . Then
Each is positive semidefinite (but, except in the 1-dimensional case, not positive definite, since they are rank 1 matrices). Also, thus the sum is also positive semidefinite.
Proof of definiteness
To show that the result is positive definite requires further proof. We shall show that for any vector , we have . Continuing as above, each , so it remains to show that there exist and for which the inequality is strict. For this we observe that
Since is positive definite, there is a for which is not 0 for all , and then, since is positive definite, there is an for which is not 0 for all . Then for this and we have . This completes the proof.
References
- "Bemerkungen zur Theorie der beschränkten Bilinearformen mit unendlich vielen Veränderlichen". Journal für die reine und angewandte Mathematik (Crelle's Journal). 1911 (140): 1–28. 1911. doi:10.1515/crll.1911.140.1.
- Zhang, Fuzhen, ed. (2005). "The Schur Complement and Its Applications". Numerical Methods and Algorithms. 4. doi:10.1007/b105056. ISBN 0-387-24271-6.
{{cite journal}}
: Cite journal requires|journal=
(help), page 9, Ch. 0.6 Publication under J. Schur - Ledermann, W. (1983). "Issai Schur and His School in Berlin". Bulletin of the London Mathematical Society. 15 (2): 97–106. doi:10.1112/blms/15.2.97.
External links
- Bemerkungen zur Theorie der beschränkten Bilinearformen mit unendlich vielen Veränderlichen at EUDML