Misplaced Pages

Samuelson's inequality

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Concept in statistics

In statistics, Samuelson's inequality, named after the economist Paul Samuelson, also called the Laguerre–Samuelson inequality, after the mathematician Edmond Laguerre, states that every one of any collection x1, ..., xn, is within √n − 1 uncorrected sample standard deviations of their sample mean.

Statement of the inequality

If we let

x ¯ = x 1 + + x n n {\displaystyle {\overline {x}}={\frac {x_{1}+\cdots +x_{n}}{n}}}

be the sample mean and

s = 1 n i = 1 n ( x i x ¯ ) 2 {\displaystyle s={\sqrt {{\frac {1}{n}}\sum _{i=1}^{n}(x_{i}-{\overline {x}})^{2}}}}

be the standard deviation of the sample, then

x ¯ s n 1 x j x ¯ + s n 1 for  j = 1 , , n . {\displaystyle {\overline {x}}-s{\sqrt {n-1}}\leq x_{j}\leq {\overline {x}}+s{\sqrt {n-1}}\qquad {\text{for }}j=1,\dots ,n.}

Equality holds on the left (or right) for x j {\displaystyle x_{j}} if and only if all the n − 1 x i {\displaystyle x_{i}} s other than x j {\displaystyle x_{j}} are equal to each other and greater (smaller) than x j . {\displaystyle x_{j}.}

If you instead define s = 1 n 1 i = 1 n ( x i x ¯ ) 2 {\displaystyle s={\sqrt {{\frac {1}{n-1}}\sum _{i=1}^{n}(x_{i}-{\overline {x}})^{2}}}} then the inequality x ¯ s n 1 x j x ¯ + s n 1 {\displaystyle {\overline {x}}-s{\sqrt {n-1}}\leq x_{j}\leq {\overline {x}}+s{\sqrt {n-1}}} still applies and can be slightly tightened to x ¯ s n 1 n x j x ¯ + s n 1 n . {\displaystyle {\overline {x}}-s{\tfrac {n-1}{\sqrt {n}}}\leq x_{j}\leq {\overline {x}}+s{\tfrac {n-1}{\sqrt {n}}}.}

Comparison to Chebyshev's inequality

Main article: Chebyshev's inequality § Samuelson's inequality

Chebyshev's inequality locates a certain fraction of the data within certain bounds, while Samuelson's inequality locates all the data points within certain bounds.

The bounds given by Chebyshev's inequality are unaffected by the number of data points, while for Samuelson's inequality the bounds loosen as the sample size increases. Thus for large enough data sets, Chebyshev's inequality is more useful.

Applications

This section needs expansion. You can help by adding to it. (July 2017)

Samuelson’s inequality has several applications in statistics and mathematics. It is useful in the studentization of residuals which shows a rationale for why this process should be done externally to better understand the spread of residuals in regression analysis.

In matrix theory, Samuelson’s inequality is used to locate the eigenvalues of certain matrices and tensors.

Furthermore, generalizations of this inequality apply to complex data and random variables in a probability space.

Relationship to polynomials

Samuelson was not the first to describe this relationship: the first was probably Laguerre in 1880 while investigating the roots (zeros) of polynomials.

Consider a polynomial with all roots real:

a 0 x n + a 1 x n 1 + + a n 1 x + a n = 0 {\displaystyle a_{0}x^{n}+a_{1}x^{n-1}+\cdots +a_{n-1}x+a_{n}=0}

Without loss of generality let a 0 = 1 {\displaystyle a_{0}=1} and let

t 1 = x i {\displaystyle t_{1}=\sum x_{i}} and t 2 = x i 2 {\displaystyle t_{2}=\sum x_{i}^{2}}

Then

a 1 = x i = t 1 {\displaystyle a_{1}=-\sum x_{i}=-t_{1}}

and

a 2 = x i x j = t 1 2 t 2 2  where  i < j {\displaystyle a_{2}=\sum x_{i}x_{j}={\frac {t_{1}^{2}-t_{2}}{2}}\qquad {\text{ where }}i<j}

In terms of the coefficients

t 2 = a 1 2 2 a 2 {\displaystyle t_{2}=a_{1}^{2}-2a_{2}}

Laguerre showed that the roots of this polynomial were bounded by

a 1 / n ± b n 1 {\displaystyle -a_{1}/n\pm b{\sqrt {n-1}}}

where

b = n t 2 t 1 2 n = n a 1 2 a 1 2 2 n a 2 n {\displaystyle b={\frac {\sqrt {nt_{2}-t_{1}^{2}}}{n}}={\frac {\sqrt {na_{1}^{2}-a_{1}^{2}-2na_{2}}}{n}}}

Inspection shows that a 1 n {\displaystyle -{\tfrac {a_{1}}{n}}} is the mean of the roots and that b is the standard deviation of the roots.

Laguerre failed to notice this relationship with the means and standard deviations of the roots, being more interested in the bounds themselves. This relationship permits a rapid estimate of the bounds of the roots and may be of use in their location.

When the coefficients a 1 {\displaystyle a_{1}} and a 2 {\displaystyle a_{2}} are both zero no information can be obtained about the location of the roots, because not all roots are real (as can be seen from Descartes' rule of signs) unless the constant term is also zero.

References

  1. Samuelson, Paul (1968). "How Deviant Can You Be?". Journal of the American Statistical Association. 63 (324): 1522–1525. doi:10.2307/2285901. JSTOR 2285901.
  2. ^ Jensen, Shane Tyler (1999). The Laguerre–Samuelson Inequality with Extensions and Applications in Statistics and Matrix Theory (PDF) (MSc). Department of Mathematics and Statistics, McGill University.
  3. Jensen, Shane T.; Styan, George P. H. (1999). "Some Comments and a Bibliography on the Laguerre-Samuelson Inequality with Extensions and Applications in Statistics and Matrix Theory". Analytic and Geometric Inequalities and Applications. pp. 151–181. doi:10.1007/978-94-011-4577-0_10. ISBN 978-94-010-5938-1.
  4. Barnett, Neil S.; Dragomir, Sever Silvestru (2008). Advances in Inequalities from Probability Theory and Statistics. Nova Publishers. p. 164. ISBN 978-1-60021-943-6.
  5. JIN, HONGWEI; BEN´ITEZ, JULIO (2017). "Some generalizations and probability versions of Samuelson's inequality" (PDF). Mathematical Inequalities & Applications: 1–12. doi:10.7153/mia-20-01. Retrieved 4 September 2024.
  6. Demuynck, Thomas; Hjertstrand, Per (2019). "Samuelson's Approach to Revealed Preference Theory: Some Recent Advances" (PDF). Paul Samuelson. Remaking Economics: Eminent Post-War Economists. pp. 193–227. doi:10.1057/978-1-137-56812-0_9. ISBN 978-1-137-56811-3.
  7. Laguerre E. (1880) Mémoire pour obtenir par approximation les racines d'une équation algébrique qui a toutes les racines réelles. Nouv Ann Math 2 série, 19, 161–172, 193–202
Category: