Vieta's formulas

Article snapshot taken from Wikipedia under the Creative Commons Attribution-ShareAlike license.
Relating coefficients and roots of a polynomial. For a method for computing π, see Viète's formula.

In mathematics, Vieta's formulas relate the coefficients of a polynomial to sums and products of its roots. They are named after François Viète (more commonly referred to by the Latinised form of his name, "Franciscus Vieta").

Basic formulas

Any general polynomial of degree n
$$P(x)=a_{n}x^{n}+a_{n-1}x^{n-1}+\cdots +a_{1}x+a_{0}$$
(with the coefficients being real or complex numbers and $a_{n}\neq 0$) has n (not necessarily distinct) complex roots $r_{1},r_{2},\dots ,r_{n}$ by the fundamental theorem of algebra. Vieta's formulas relate the polynomial coefficients to signed sums of products of the roots $r_{1},r_{2},\dots ,r_{n}$ as follows:

$$\begin{cases}
r_{1}+r_{2}+\dots +r_{n-1}+r_{n}=-\dfrac{a_{n-1}}{a_{n}}\\
(r_{1}r_{2}+r_{1}r_{3}+\cdots +r_{1}r_{n})+(r_{2}r_{3}+r_{2}r_{4}+\cdots +r_{2}r_{n})+\cdots +r_{n-1}r_{n}=\dfrac{a_{n-2}}{a_{n}}\\
\quad \vdots \\
r_{1}r_{2}\cdots r_{n}=(-1)^{n}\dfrac{a_{0}}{a_{n}}.
\end{cases}\qquad (*)$$

Vieta's formulas can equivalently be written as
$$\sum _{1\leq i_{1}<i_{2}<\cdots <i_{k}\leq n}\left(\prod _{j=1}^{k}r_{i_{j}}\right)=(-1)^{k}\frac{a_{n-k}}{a_{n}}$$
for k = 1, 2, ..., n (the indices $i_{k}$ are sorted in increasing order to ensure each product of k roots is used exactly once).

The left-hand sides of Vieta's formulas are the elementary symmetric polynomials of the roots.
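The identity above can be checked numerically. The following is a minimal Python sketch, not part of the original article: the helper names `elementary_symmetric` and `vieta_rhs` and the sample polynomial are illustrative assumptions. It computes each elementary symmetric polynomial of the roots directly and compares it with $(-1)^{k}a_{n-k}/a_{n}$.

```python
from itertools import combinations
from math import prod


def elementary_symmetric(roots, k):
    """Sum of the products of every k-element subset of the roots."""
    return sum(prod(subset) for subset in combinations(roots, k))


def vieta_rhs(coeffs, k):
    """(-1)^k * a_{n-k} / a_n, where coeffs = [a_n, a_{n-1}, ..., a_0]."""
    return (-1) ** k * coeffs[k] / coeffs[0]


# Illustrative example (an assumption, not from the article):
# P(x) = 2(x - 1)(x - 2)(x - 3) = 2x^3 - 12x^2 + 22x - 12
roots = [1, 2, 3]
coeffs = [2, -12, 22, -12]

for k in range(1, len(roots) + 1):
    # Left-hand side and right-hand side of Vieta's formula for this k
    print(k, elementary_symmetric(roots, k), vieta_rhs(coeffs, k))
```

For this example both sides agree for k = 1, 2, 3 (the values are 6, 11 and 6).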

Vieta's system (*) can be solved by Newton's method through an explicit simple iterative formula, the Durand-Kerner method.
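As a rough illustration of that iteration, here is a minimal Python sketch of the Durand-Kerner update. It is a sketch under stated assumptions rather than a reference implementation: the function name `durand_kerner`, the starting points (powers of 0.4 + 0.9i, a commonly used choice), the in-place update order, the stopping rule and the sample cubic are all choices made for this example.

```python
def durand_kerner(coeffs, max_iter=500, tol=1e-12):
    """Approximate all roots of a polynomial given as [a_n, ..., a_0]."""
    n = len(coeffs) - 1
    monic = [c / coeffs[0] for c in coeffs]  # divide through by a_n

    def p(x):
        # Horner evaluation of the monic polynomial
        value = 0
        for c in monic:
            value = value * x + c
        return value

    # Assumed starting points: distinct, non-real powers of 0.4 + 0.9i
    roots = [(0.4 + 0.9j) ** i for i in range(n)]
    for _ in range(max_iter):
        shift = 0.0
        for i in range(n):
            # Product of the differences to all other current approximations
            denom = 1
            for j in range(n):
                if j != i:
                    denom *= roots[i] - roots[j]
            new = roots[i] - p(roots[i]) / denom
            shift = max(shift, abs(new - roots[i]))
            roots[i] = new  # updated value is used immediately for later i
        if shift < tol:
            break
    return roots


# Illustrative example (an assumption): P(x) = x^3 - 3x^2 + 3x - 5
roots = durand_kerner([1, -3, 3, -5])
print(roots)
print(sum(roots))  # should be close to -a_2/a_3 = 3 by Vieta's formulas
```

The computed approximations can be sanity-checked against (*): their sum should be close to 3 and their product close to 5 for this cubic.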

Generalization to rings

Vieta's formulas are frequently used with polynomials with coefficients in any integral domain R. Then, the quotients $a_{i}/a_{n}$ belong to the field of fractions of R (and possibly are in R itself if $a_{n}$ happens to be invertible in R) and the roots $r_{i}$ are taken in an algebraically closed extension. Typically, R is the ring of the integers, the field of fractions is the field of the rational numbers and the algebraically closed field is the field of the complex numbers.

Vieta's formulas are then useful because they provide relations between the roots without having to compute them.

For polynomials over a commutative ring that is not an integral domain, Vieta's formulas are only valid when $a_{n}$ is not a zero-divisor and $P(x)$ factors as $a_{n}(x-r_{1})(x-r_{2})\dots (x-r_{n})$. For example, in the ring of the integers modulo 8, the quadratic polynomial $P(x)=x^{2}-1$ has four roots: 1, 3, 5, and 7. Vieta's formulas are not true if, say, $r_{1}=1$ and $r_{2}=3$, because $P(x)\neq (x-1)(x-3)$. However, $P(x)$ does factor as $(x-1)(x-7)$ and also as $(x-3)(x-5)$, and Vieta's formulas hold if we set either $r_{1}=1$ and $r_{2}=7$ or $r_{1}=3$ and $r_{2}=5$.
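The modulo 8 example is small enough to check by brute force. The Python sketch below is illustrative only (the names `MOD`, `p` and `factors_as` are assumptions introduced here): it lists the roots of $x^{2}-1$ in the integers modulo 8 and then finds which pairs of roots actually give a factorization.

```python
MOD = 8


def p(x):
    """Value of x^2 - 1 in the integers modulo 8."""
    return (x * x - 1) % MOD


roots = [x for x in range(MOD) if p(x) == 0]
print(roots)  # [1, 3, 5, 7]


def factors_as(r1, r2):
    """True if x^2 - 1 equals (x - r1)(x - r2) coefficient by coefficient mod 8."""
    # (x - r1)(x - r2) = x^2 - (r1 + r2) x + r1 * r2
    linear_matches = (-(r1 + r2)) % MOD == 0
    constant_matches = (r1 * r2) % MOD == (-1) % MOD
    return linear_matches and constant_matches


pairs = [(r1, r2) for r1 in roots for r2 in roots
         if r1 <= r2 and factors_as(r1, r2)]
print(pairs)  # [(1, 7), (3, 5)]
```

Only the pairs (1, 7) and (3, 5) pass the check, matching the two factorizations named in the text.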

Example

Vieta's formulas applied to quadratic and cubic polynomials:

The roots $r_{1},r_{2}$ of the quadratic polynomial $P(x)=ax^{2}+bx+c$ satisfy
$$r_{1}+r_{2}=-\frac{b}{a},\quad r_{1}r_{2}=\frac{c}{a}.$$

The first of these equations can be used to find the minimum (or maximum) of P; see Quadratic equation § Vieta's formulas.
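One way to see this is by completing the square; the short derivation below is a sketch added for illustration, and the symbol $x_{\text{ext}}$ for the location of the extremum is notation introduced here. The extremum lies midway between the two roots, so its position follows from the sum of the roots alone.

```latex
P(x) = a x^{2} + b x + c
     = a\left(x + \frac{b}{2a}\right)^{2} + c - \frac{b^{2}}{4a},
\qquad
x_{\text{ext}} = -\frac{b}{2a} = \frac{r_{1} + r_{2}}{2}.
```

The extremum is a minimum when a > 0 and a maximum when a < 0.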

The roots $r_{1},r_{2},r_{3}$ of the cubic polynomial $P(x)=ax^{3}+bx^{2}+cx+d$ satisfy
$$r_{1}+r_{2}+r_{3}=-\frac{b}{a},\quad r_{1}r_{2}+r_{1}r_{3}+r_{2}r_{3}=\frac{c}{a},\quad r_{1}r_{2}r_{3}=-\frac{d}{a}.$$
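These relations allow symmetric expressions in the roots to be evaluated from the coefficients alone. The Python sketch below is illustrative (the function name `cubic_symmetric_functions` and the sample cubic are assumptions): it uses exact rational arithmetic to obtain the sum of the squares and the sum of the reciprocals of the roots without solving the cubic.

```python
from fractions import Fraction


def cubic_symmetric_functions(a, b, c, d):
    """Return (e1, e2, e3) for the roots of a x^3 + b x^2 + c x + d via Vieta."""
    a, b, c, d = (Fraction(v) for v in (a, b, c, d))
    return -b / a, c / a, -d / a


# Illustrative example (an assumption): P(x) = x^3 - 6x^2 + 11x - 6
e1, e2, e3 = cubic_symmetric_functions(1, -6, 11, -6)

# r1^2 + r2^2 + r3^2 = e1^2 - 2 e2
sum_of_squares = e1 ** 2 - 2 * e2
# 1/r1 + 1/r2 + 1/r3 = e2 / e3, valid only when d != 0 (no zero root)
sum_of_reciprocals = e2 / e3

print(sum_of_squares, sum_of_reciprocals)  # 14 11/6
```

The roots of this particular cubic happen to be 1, 2 and 3, so the results can be confirmed by hand: 1 + 4 + 9 = 14 and 1 + 1/2 + 1/3 = 11/6.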

Proof

Direct proof

Vieta's formulas can be proved by expanding the equality
$$a_{n}x^{n}+a_{n-1}x^{n-1}+\cdots +a_{1}x+a_{0}=a_{n}(x-r_{1})(x-r_{2})\cdots (x-r_{n})$$
(which is true since $r_{1},r_{2},\dots ,r_{n}$ are all the roots of this polynomial), multiplying the factors on the right-hand side, and identifying the coefficients of each power of $x$.

Formally, if one expands $(x-r_{1})(x-r_{2})\cdots (x-r_{n}),$ the terms are precisely $(-1)^{n-k}r_{1}^{b_{1}}\cdots r_{n}^{b_{n}}x^{k},$ where $b_{i}$ is either 0 or 1, according to whether $r_{i}$ is included in the product or not, and $k$ is the number of factors from which $x$ is taken instead, so that $n-k$ of the $r_{i}$ are included and the total number of factors in the product is n (counting $x^{k}$ with multiplicity k). As there are n binary choices (include $r_{i}$ or $x$), there are $2^{n}$ terms; geometrically, these can be understood as the vertices of a hypercube. Grouping these terms by degree yields the elementary symmetric polynomials in $r_{i}$: the coefficient of $x^{k}$ is, up to the sign $(-1)^{n-k}$, the sum of all distinct $(n-k)$-fold products of the $r_{i}$.
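The counting argument can be mirrored in code. The following Python sketch is illustrative only (the function name `expand_by_choices` and the sample roots are assumptions): it walks through all $2^{n}$ binary choices, one per factor, and accumulates each resulting term under the power of $x$ it carries.

```python
from itertools import product
from math import prod


def expand_by_choices(roots):
    """Coefficients of (x - r_1)...(x - r_n), indexed by the power of x."""
    n = len(roots)
    coeff = [0] * (n + 1)
    # bits[i] == 1 means "take -r_i from factor i", 0 means "take x"
    for bits in product((0, 1), repeat=n):
        picked = [r for r, b in zip(roots, bits) if b]
        k = n - len(picked)  # power of x contributed by this choice
        coeff[k] += (-1) ** len(picked) * prod(picked)
    return coeff


# Illustrative example (an assumption): (x - 1)(x - 2)(x - 3)
print(expand_by_choices([1, 2, 3]))  # [-6, 11, -6, 1]
```

Reading the result from the highest power down gives $x^{3}-6x^{2}+11x-6$, whose coefficients are the signed elementary symmetric polynomials of 1, 2 and 3.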

As an example, consider the quadratic
$$f(x)=a_{2}x^{2}+a_{1}x+a_{0}=a_{2}(x-r_{1})(x-r_{2})=a_{2}(x^{2}-x(r_{1}+r_{2})+r_{1}r_{2}).$$

Comparing identical powers of $x$, we find $a_{2}=a_{2}$, $a_{1}=-a_{2}(r_{1}+r_{2})$ and $a_{0}=a_{2}(r_{1}r_{2})$, from which we obtain $r_{1}+r_{2}=-a_{1}/a_{2}$ and $r_{1}r_{2}=a_{0}/a_{2}$, which are Vieta's formulas for $n=2$.

Proof by mathematical induction

Vieta's formulas can also be proven by induction as shown below.

Inductive hypothesis:

Let $P(x)$ be a polynomial of degree $n$, with complex roots $r_{1},r_{2},\dots ,r_{n}$ and complex coefficients $a_{0},a_{1},\dots ,a_{n}$ where $a_{n}\neq 0$. Then the inductive hypothesis is that
$$P(x)=a_{n}x^{n}+a_{n-1}x^{n-1}+\cdots +a_{1}x+a_{0}=a_{n}x^{n}-a_{n}(r_{1}+r_{2}+\cdots +r_{n})x^{n-1}+\cdots +(-1)^{n}a_{n}(r_{1}r_{2}\cdots r_{n})$$

Base case, $n=2$ (quadratic):

Let $a_{2},a_{1}$ be the coefficients of the quadratic and $a_{0}$ be the constant term. Similarly, let $r_{1},r_{2}$ be the roots of the quadratic:
$$a_{2}x^{2}+a_{1}x+a_{0}=a_{2}(x-r_{1})(x-r_{2})$$
Expand the right side using the distributive property:
$$a_{2}x^{2}+a_{1}x+a_{0}=a_{2}(x^{2}-r_{1}x-r_{2}x+r_{1}r_{2})$$
Collect like terms:
$$a_{2}x^{2}+a_{1}x+a_{0}=a_{2}(x^{2}-(r_{1}+r_{2})x+r_{1}r_{2})$$
Apply the distributive property again:
$$a_{2}x^{2}+a_{1}x+a_{0}=a_{2}x^{2}-a_{2}(r_{1}+r_{2})x+a_{2}(r_{1}r_{2})$$
The inductive hypothesis has now been proven true for $n=2$.

Induction step:

Assume the inductive hypothesis holds for some $n\geqslant 2$; we show that it then holds for $n+1$. Let
$$P(x)=a_{n+1}x^{n+1}+a_{n}x^{n}+\cdots +a_{1}x+a_{0}$$
By the factor theorem, $(x-r_{n+1})$ can be factored out of $P(x)$ leaving a 0 remainder. Note that the roots of the polynomial in the square brackets are $r_{1},r_{2},\dots ,r_{n}$:
$$P(x)=(x-r_{n+1})\left[\frac{a_{n+1}x^{n+1}+a_{n}x^{n}+\cdots +a_{1}x+a_{0}}{x-r_{n+1}}\right]$$
Factor out $a_{n+1}$, the leading coefficient of $P(x)$, from the polynomial in the square brackets:
$$P(x)=a_{n+1}(x-r_{n+1})\left[\frac{x^{n+1}+\frac{a_{n}}{a_{n+1}}x^{n}+\cdots +\frac{a_{1}}{a_{n+1}}x+\frac{a_{0}}{a_{n+1}}}{x-r_{n+1}}\right]$$
For simplicity, let the coefficients and constant term of the polynomial in the square brackets be denoted by $\zeta$:
$$P(x)=a_{n+1}(x-r_{n+1})\left[x^{n}+\zeta _{n-1}x^{n-1}+\cdots +\zeta _{0}\right]$$
Using the inductive hypothesis (applied to this monic polynomial of degree $n$), the polynomial in the square brackets can be rewritten as:
$$P(x)=a_{n+1}(x-r_{n+1})\left[x^{n}-(r_{1}+r_{2}+\cdots +r_{n})x^{n-1}+\cdots +(-1)^{n}(r_{1}r_{2}\cdots r_{n})\right]$$
Using the distributive property:
$$P(x)=a_{n+1}\left(x\left[x^{n}-(r_{1}+r_{2}+\cdots +r_{n})x^{n-1}+\cdots +(-1)^{n}(r_{1}r_{2}\cdots r_{n})\right]-r_{n+1}\left[x^{n}-(r_{1}+r_{2}+\cdots +r_{n})x^{n-1}+\cdots +(-1)^{n}(r_{1}r_{2}\cdots r_{n})\right]\right)$$
After expanding and collecting like terms:
$$P(x)=a_{n+1}x^{n+1}-a_{n+1}(r_{1}+r_{2}+\cdots +r_{n}+r_{n+1})x^{n}+\cdots +(-1)^{n+1}a_{n+1}(r_{1}r_{2}\cdots r_{n}r_{n+1})$$
The inductive hypothesis therefore holds for $n+1$, and hence for every $n\in \mathbb{N}$ with $n\geqslant 2$.

Conclusion:
$$a_{n}x^{n}+a_{n-1}x^{n-1}+\cdots +a_{1}x+a_{0}=a_{n}x^{n}-a_{n}(r_{1}+r_{2}+\cdots +r_{n})x^{n-1}+\cdots +(-1)^{n}a_{n}(r_{1}r_{2}\cdots r_{n})$$
Comparing the coefficients of each power of $x$ on both sides and dividing by $a_{n}$ yields Vieta's formulas.
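The induction step corresponds to a simple computation: multiplying the polynomial built from the first n roots by one more factor $(x-r_{n+1})$. The Python sketch below is illustrative only (the function name `poly_from_roots` and the sample values are assumptions). It builds the coefficients one root at a time, using the same split into $x\,[\cdots ]-r_{n+1}[\cdots ]$ as the proof.

```python
def poly_from_roots(leading, roots):
    """Coefficients [a_n, ..., a_0] of leading * (x - r_1)...(x - r_n)."""
    coeffs = [1]  # the constant polynomial 1 (empty product)
    for r in roots:
        # Multiply the current monic polynomial Q(x) by (x - r):
        shifted = coeffs + [0]                   # x * Q(x)
        scaled = [0] + [r * c for c in coeffs]   # r * Q(x)
        coeffs = [s - t for s, t in zip(shifted, scaled)]
    return [leading * c for c in coeffs]


# Illustrative example (an assumption): 2(x - 1)(x - 2)(x - 3)
print(poly_from_roots(2, [1, 2, 3]))  # [2, -12, 22, -12]
```

Dividing the result by the leading coefficient 2 gives 1, -6, 11, -6, that is, the signed elementary symmetric polynomials of the roots 1, 2 and 3.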

History

As reflected in the name, the formulas were discovered by the 16th-century French mathematician François Viète, for the case of positive roots.

In the opinion of the 18th-century British mathematician Charles Hutton, as quoted by Funkhouser, the general principle (not restricted to positive real roots) was first understood by the 17th-century French mathematician Albert Girard:

... the first person who understood the general doctrine of the formation of the coefficients of the powers from the sum of the roots and their products. He was the first who discovered the rules for summing the powers of the roots of any equation.

References

  1. Weisstein, Eric W. (2024-06-22). "Vieta's Formulas". MathWorld--A Wolfram Web Resource.
  2. (Funkhouser 1930)