Misplaced Pages

Majorization

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
(Redirected from Majorized) Preorder on vectors of real numbers This article is about a specific ordering on real vectors. For ordering in general, see Partially ordered set.

In mathematics, majorization is a preorder on vectors of real numbers. For two such vectors, x ,   y R n {\displaystyle \mathbf {x} ,\ \mathbf {y} \in \mathbb {R} ^{n}} , we say that x {\displaystyle \mathbf {x} } weakly majorizes (or dominates) y {\displaystyle \mathbf {y} } from below, commonly denoted x w y , {\displaystyle \mathbf {x} \succ _{w}\mathbf {y} ,} when

i = 1 k x i i = 1 k y i {\displaystyle \sum _{i=1}^{k}x_{i}^{\downarrow }\geq \sum _{i=1}^{k}y_{i}^{\downarrow }} for all k = 1 , , n {\displaystyle k=1,\,\dots ,\,n} ,

where x i {\displaystyle x_{i}^{\downarrow }} denotes i {\displaystyle i} largest entry of x {\displaystyle x} . If x , y {\displaystyle \mathbf {x} ,\mathbf {y} } further satisfy i = 1 n x i = i = 1 n y i {\displaystyle \sum _{i=1}^{n}x_{i}=\sum _{i=1}^{n}y_{i}} , we say that x {\displaystyle \mathbf {x} } majorizes (or dominates) y {\displaystyle \mathbf {y} } , commonly denoted x y {\displaystyle \mathbf {x} \succ \mathbf {y} } . Majorization is a partial order for vectors whose entries are non-decreasing, but only a preorder for general vectors, since majorization is agnostic to the ordering of the entries in vectors, e.g., the statement ( 1 , 2 ) ( 0 , 3 ) {\displaystyle (1,2)\prec (0,3)} is simply equivalent to ( 2 , 1 ) ( 3 , 0 ) {\displaystyle (2,1)\prec (3,0)} .

Majorizing also sometimes refers to entrywise ordering, e.g. the real-valued function f majorizes the real-valued function g when f ( x ) g ( x ) {\displaystyle f(x)\geq g(x)} for all x {\displaystyle x} in the domain, or other technical definitions, such as majorizing measures in probability theory.

Equivalent conditions

Geometric definition

Figure 1. 2D majorization example

For x ,   y R n , {\displaystyle \mathbf {x} ,\ \mathbf {y} \in \mathbb {R} ^{n},} we have x y {\displaystyle \mathbf {x} \prec \mathbf {y} } if and only if x {\displaystyle \mathbf {x} } is in the convex hull of all vectors obtained by permuting the coordinates of y {\displaystyle \mathbf {y} } . This is equivalent to saying that x = D y {\displaystyle \mathbf {x} =\mathbf {D} \mathbf {y} } for some doubly stochastic matrix D {\displaystyle \mathbf {D} } . In particular, x {\displaystyle \mathbf {x} } can be written as a convex combination of n {\displaystyle n} permutations of y {\displaystyle \mathbf {y} } .

Figure 1 displays the convex hull in 2D for the vector y = ( 3 , 1 ) {\displaystyle \mathbf {y} =(3,\,1)} . Notice that the center of the convex hull, which is an interval in this case, is the vector x = ( 2 , 2 ) {\displaystyle \mathbf {x} =(2,\,2)} . This is the "smallest" vector satisfying x y {\displaystyle \mathbf {x} \prec \mathbf {y} } for this given vector y {\displaystyle \mathbf {y} } . Figure 2 shows the convex hull in 3D. The center of the convex hull, which is a 2D polygon in this case, is the "smallest" vector x {\displaystyle \mathbf {x} } satisfying x y {\displaystyle \mathbf {x} \prec \mathbf {y} } for this given vector y {\displaystyle \mathbf {y} } .

Figure 2. 3D Majorization Example

Other definitions

Each of the following statements is true if and only if x y {\displaystyle \mathbf {x} \succ \mathbf {y} } .

  • From x {\displaystyle \mathbf {x} } we can produce y {\displaystyle \mathbf {y} } by a finite sequence of "Robin Hood operations" where we replace two elements x i {\displaystyle x_{i}} and x j < x i {\displaystyle x_{j}<x_{i}} with x i ε {\displaystyle x_{i}-\varepsilon } and x j + ε {\displaystyle x_{j}+\varepsilon } , respectively, for some ε ( 0 , x i x j ) {\displaystyle \varepsilon \in (0,x_{i}-x_{j})} .
  • For every convex function h : R R {\displaystyle h:\mathbb {R} \to \mathbb {R} } , i = 1 d h ( x i ) i = 1 d h ( y i ) {\displaystyle \sum _{i=1}^{d}h(x_{i})\geq \sum _{i=1}^{d}h(y_{i})} .
    • In fact, a special case suffices: i x i = i y i {\displaystyle \sum _{i}{x_{i}}=\sum _{i}{y_{i}}} and, for every t, i = 1 d max ( 0 , x i t ) i = 1 d max ( 0 , y i t ) {\displaystyle \sum _{i=1}^{d}\max(0,x_{i}-t)\geq \sum _{i=1}^{d}\max(0,y_{i}-t)} .
  • For every t R {\displaystyle t\in \mathbb {R} } , j = 1 d | x j t | j = 1 d | y j t | {\displaystyle \sum _{j=1}^{d}|x_{j}-t|\geq \sum _{j=1}^{d}|y_{j}-t|} .

Examples

Among non-negative vectors with three components, ( 1 , 0 , 0 ) {\displaystyle (1,0,0)} and permutations of it majorize all other vectors ( p 1 , p 2 , p 3 ) {\displaystyle (p_{1},p_{2},p_{3})} such that p 1 + p 2 + p 3 = 1 {\displaystyle p_{1}+p_{2}+p_{3}=1} . For example, ( 1 , 0 , 0 ) ( 1 / 2 , 0 , 1 / 2 ) {\displaystyle (1,0,0)\succ (1/2,0,1/2)} . Similarly, ( 1 / 3 , 1 / 3 , 1 / 3 ) {\displaystyle (1/3,1/3,1/3)} is majorized by all other such vectors, so ( 1 / 2 , 0 , 1 / 2 ) ( 1 / 3 , 1 / 3 , 1 / 3 ) {\displaystyle (1/2,0,1/2)\succ (1/3,1/3,1/3)} .

This behavior extends to general-length probability vectors: the singleton vector majorizes all other probability vectors, and the uniform distribution is majorized by all probability vectors.

Schur convexity

Main article: Schur-convex function

A function f : R n R {\displaystyle f:\mathbb {R} ^{n}\to \mathbb {R} } is said to be Schur convex when x y {\displaystyle \mathbf {x} \succ \mathbf {y} } implies f ( x ) f ( y ) {\displaystyle f(\mathbf {x} )\geq f(\mathbf {y} )} . Hence, Schur-convex functions translate the ordering of vectors to a standard ordering in R {\displaystyle \mathbb {R} } . Similarly, f ( x ) {\displaystyle f(\mathbf {x} )} is Schur concave when x y {\displaystyle \mathbf {x} \succ \mathbf {y} } implies f ( x ) f ( y ) . {\displaystyle f(\mathbf {x} )\leq f(\mathbf {y} ).}

An example of a Schur-convex function is the max function, max ( x ) = x 1 {\displaystyle \max(\mathbf {x} )=x_{1}^{\downarrow }} . Schur convex functions are necessarily symmetric that the entries of it argument can be switched without modifying the value of the function. Therefore, linear functions, which are convex, are not Schur-convex unless they are symmetric. If a function is symmetric and convex, then it is Schur-convex.

Generalizations

Majorization can be generalized to the Lorenz ordering, a partial order on distribution functions. For example, a wealth distribution is Lorenz-greater than another if its Lorenz curve lies below the other. As such, a Lorenz-greater wealth distribution has a higher Gini coefficient, and has more income disparity.

The majorization preorder can be naturally extended to density matrices in the context of quantum information. In particular, ρ ρ {\displaystyle \rho \succ \rho '} exactly when s p e c [ ρ ] s p e c [ ρ ] {\displaystyle \mathrm {spec} \succ \mathrm {spec} } (where s p e c {\displaystyle \mathrm {spec} } denotes the state's spectrum).

Similarly, one can say a Hermitian operator, H {\displaystyle \mathbf {H} } , majorizes another, M {\displaystyle \mathbf {M} } , if the set of eigenvalues of H {\displaystyle \mathbf {H} } majorizes that of M {\displaystyle \mathbf {M} } .

See also

Notes

  1. Talagrand, Michel (1996-07-01). "Majorizing measures: the generic chaining". The Annals of Probability. 24 (3). doi:10.1214/aop/1065725175. ISSN 0091-1798.
  2. ^ Barry C. Arnold. "Majorization and the Lorenz Order: A Brief Introduction". Springer-Verlag Lecture Notes in Statistics, vol. 43, 1987.
  3. Xingzhi, Zhan (2003). "The sharp Rado theorem for majorizations". The American Mathematical Monthly. 110 (2): 152–153. doi:10.2307/3647776. JSTOR 3647776.
  4. July 3, 2005 post by fleeting_guest on "The Karamata Inequality" thread, AoPS community forums. Archived 11 November 2020.
  5. ^ Nielsen, Michael A.; Chuang, Isaac L. (2010). Quantum Computation and Quantum Information (2nd ed.). Cambridge: Cambridge University Press. ISBN 978-1-107-00217-3. OCLC 844974180.
  6. Marshall, Albert W. (2011). "14, 15". Inequalities : theory of majorization and its applications. Ingram Olkin, Barry C. Arnold (2nd ed.). New York: Springer Science+Business Media, LLC. ISBN 978-0-387-68276-1. OCLC 694574026.
  7. Wehrl, Alfred (1 April 1978). "General properties of entropy". Reviews of Modern Physics. 50 (2): 221–260. Bibcode:1978RvMP...50..221W. doi:10.1103/RevModPhys.50.221.

References

External links

Software

Categories: