
Function of several real variables

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Mathematical function with multiple real-number arguments

In mathematical analysis and its applications, a function of several real variables or real multivariate function is a function with more than one argument, with all arguments being real variables. This concept extends the idea of a function of a real variable to several variables. The "input" variables take real values, while the "output", also called the "value of the function", may be real or complex. However, the study of the complex-valued functions may be easily reduced to the study of the real-valued functions, by considering the real and imaginary parts of the complex function; therefore, unless explicitly specified, only real-valued functions will be considered in this article.

The domain of a function of n variables is the subset of $\mathbb{R}^n$ for which the function is defined. As usual, the domain of a function of several real variables is supposed to contain a nonempty open subset of $\mathbb{R}^n$.

General definition

[Figure: graphs of functions f(x1, x2, …, xn) of n = 1, 2, 3 variables, plotted in the space $\mathbb{R}^{n+1}$; the domains are the red n-dimensional regions and the images are the purple n-dimensional curves.]

A real-valued function of n real variables is a function that takes as input n real numbers, commonly represented by the variables x1, x2, …, xn, and produces another real number, the value of the function, commonly denoted f(x1, x2, …, xn). For simplicity, in this article a real-valued function of several real variables will be simply called a function. To avoid any ambiguity, the other types of functions that may occur will be explicitly specified.

Some functions are defined for all real values of the variables (one says that they are everywhere defined), but other functions are defined only if the values of the variables are taken in a subset X of $\mathbb{R}^n$, the domain of the function, which is always supposed to contain an open subset of $\mathbb{R}^n$. In other words, a real-valued function of n real variables is a function

$$f:X\to \mathbb {R}$$

such that its domain X is a subset of $\mathbb{R}^n$ that contains a nonempty open set.

An element of X being an n-tuple (x1, x2, …, xn) (usually delimited by parentheses), the general notation for denoting functions would be f((x1, x2, …, xn)). The common usage, much older than the general definition of functions between sets, is to not use double parentheses and to simply write f(x1, x2, …, xn).

It is also common to abbreviate the n-tuple (x1, x2, …, xn) by using a notation similar to that for vectors, like boldface x, underline x, or overarrow xβ†’. This article will use bold.

A simple example of a function in two variables could be:

$${\begin{aligned}&V:X\to \mathbb {R} \\&X=\left\{(A,h)\in \mathbb {R} ^{2}\mid A>0,h>0\right\}\\&V(A,h)={\frac {1}{3}}Ah\end{aligned}}$$

which is the volume V of a cone with base area A and height h measured perpendicularly from the base. The domain restricts all variables to be positive since lengths and areas must be positive.
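
A minimal Python sketch of this cone-volume function, with the domain restriction made explicit (the function name and the error handling are illustrative choices, not part of the article):

    def cone_volume(A: float, h: float) -> float:
        """Volume of a cone with base area A and height h: V = (1/3) A h."""
        if A <= 0 or h <= 0:
            # (A, h) lies outside the domain X = {(A, h) : A > 0, h > 0}
            raise ValueError("(A, h) is outside the domain of V")
        return A * h / 3.0

    print(cone_volume(6.0, 2.0))  # 4.0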

Another example of a function in two variables:

$${\begin{aligned}&z:\mathbb {R} ^{2}\to \mathbb {R} \\&z(x,y)=ax+by\end{aligned}}$$

where a and b are real non-zero constants. Using the three-dimensional Cartesian coordinate system, where the xy-plane is the domain $\mathbb{R}^2$ and the z axis is the codomain $\mathbb{R}$, one can visualize the graph as a two-dimensional plane with a slope of a in the positive x direction and a slope of b in the positive y direction. The function is defined at all points (x, y) in $\mathbb{R}^2$. The previous example can be extended easily to higher dimensions:

$${\begin{aligned}&z:\mathbb {R} ^{p}\to \mathbb {R} \\&z(x_{1},x_{2},\ldots ,x_{p})=a_{1}x_{1}+a_{2}x_{2}+\cdots +a_{p}x_{p}\end{aligned}}$$

for p non-zero real constants a1, a2, …, ap, which describes a p-dimensional hyperplane.

The Euclidean norm:

$$f({\boldsymbol {x}})=\|{\boldsymbol {x}}\|={\sqrt {x_{1}^{2}+\cdots +x_{n}^{2}}}$$

is also a function of n variables which is everywhere defined, while

$$g({\boldsymbol {x}})={\frac {1}{f({\boldsymbol {x}})}}$$

is defined only for x β‰  (0, 0, …, 0).
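
A short Python sketch contrasting the two domains (names are illustrative): the norm is evaluated at any point, while its reciprocal rejects the origin.

    import math

    def norm(x):
        """Euclidean norm f(x) = sqrt(x1^2 + ... + xn^2), defined for every x in R^n."""
        return math.sqrt(sum(xi * xi for xi in x))

    def reciprocal_norm(x):
        """g(x) = 1 / f(x); its domain excludes the origin, where f(x) = 0."""
        f = norm(x)
        if f == 0.0:
            raise ValueError("g is not defined at the origin")
        return 1.0 / f

    print(norm((3.0, 4.0)))             # 5.0
    print(reciprocal_norm((3.0, 4.0)))  # 0.2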

For a non-linear example of a function in two variables:

$${\begin{aligned}&z:X\to \mathbb {R} \\&X=\left\{(x,y)\in \mathbb {R} ^{2}\,:\,x^{2}+y^{2}\leq 8\,,\,x\neq 0\,,\,y\neq 0\right\}\\&z(x,y)={\frac {1}{2xy}}{\sqrt {x^{2}+y^{2}}}\end{aligned}}$$

which takes in all points in X, a disk of radius √8 in the plane $\mathbb{R}^2$ with the points on the coordinate axes (where x = 0 or y = 0) removed, and returns a point in $\mathbb{R}$. These points are excluded because z would be ill-defined there, as the denominator 2xy vanishes. Using a 3D Cartesian coordinate system with the xy-plane as the domain $\mathbb{R}^2$ and the z axis as the codomain $\mathbb{R}$, the graph can be visualized as a curved surface.

The function can be evaluated at the point (x, y) = (2, √3) in X:

$$z\left(2,{\sqrt {3}}\right)={\frac {1}{2\cdot 2\cdot {\sqrt {3}}}}{\sqrt {\left(2\right)^{2}+\left({\sqrt {3}}\right)^{2}}}={\frac {1}{4{\sqrt {3}}}}{\sqrt {7}}\,,$$

However, the function cannot be evaluated at, say,

$$(x,y)=(65,{\sqrt {10}})\ \Rightarrow \ x^{2}+y^{2}=(65)^{2}+({\sqrt {10}})^{2}>8$$

since these values of x and y do not satisfy the domain's rule.
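
The same domain check can be written directly in code; a small Python sketch (illustrative only), which reproduces the evaluation at (2, √3) and rejects (65, √10):

    import math

    def z(x: float, y: float) -> float:
        """z(x, y) = sqrt(x^2 + y^2) / (2 x y) on the disk x^2 + y^2 <= 8 with x != 0, y != 0."""
        if x * x + y * y > 8 or x == 0 or y == 0:
            raise ValueError("(x, y) is outside the domain X")
        return math.sqrt(x * x + y * y) / (2 * x * y)

    print(z(2.0, math.sqrt(3.0)))        # ~0.3819, i.e. sqrt(7) / (4 sqrt(3))
    # z(65.0, math.sqrt(10.0)) would raise ValueError, since 65^2 + 10 > 8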

Image

The image of a function f(x1, x2, …, xn) is the set of all values of f when the n-tuple (x1, x2, …, xn) runs in the whole domain of f. For a continuous (see below for a definition) real-valued function which has a connected domain, the image is either an interval or a single value. In the latter case, the function is a constant function.

The preimage of a given real number c is called a level set. It is the set of the solutions of the equation f(x1, x2, …, xn) = c.

Domain


The domain of a function of several real variables is a subset of $\mathbb{R}^n$ that is sometimes, but not always, explicitly defined. In fact, if one restricts the domain X of a function f to a subset Y ⊂ X, one gets formally a different function, the restriction of f to Y, which is denoted $f|_Y$. In practice, it is often (but not always) harmless to identify f and $f|_Y$, and to omit the restrictor $|_Y$.

Conversely, it is sometimes possible to enlarge naturally the domain of a given function, for example by continuity or by analytic continuation.

Moreover, many functions are defined in such a way that it is difficult to specify their domain explicitly. For example, given a function f, it may be difficult to specify the domain of the function $g(\boldsymbol{x}) = 1/f(\boldsymbol{x})$. If f is a multivariate polynomial (which has $\mathbb{R}^n$ as its domain), it is even difficult to test whether the domain of g is also $\mathbb{R}^n$. This is equivalent to testing whether a polynomial is always positive, and is the object of an active research area (see Positive polynomial).

Algebraic structure

The usual operations of arithmetic on the reals may be extended to real-valued functions of several real variables in the following way:

  • For every real number r, the constant function $(x_1,\ldots,x_n)\mapsto r$ is everywhere defined.
  • For every real number r and every function f, the function $rf:(x_1,\ldots,x_n)\mapsto rf(x_1,\ldots,x_n)$ has the same domain as f (or is everywhere defined if r = 0).
  • If f and g are two functions of respective domains X and Y such that X ∩ Y contains a nonempty open subset of $\mathbb{R}^n$, then $f+g:(x_1,\ldots,x_n)\mapsto f(x_1,\ldots,x_n)+g(x_1,\ldots,x_n)$ and $f\,g:(x_1,\ldots,x_n)\mapsto f(x_1,\ldots,x_n)\,g(x_1,\ldots,x_n)$ are functions that have a domain containing X ∩ Y.

It follows that the functions of n variables that are everywhere defined and the functions of n variables that are defined in some neighbourhood of a given point both form commutative algebras over the reals (R-algebras). This is a prototypical example of a function space.

One may similarly define

$$1/f:(x_{1},\ldots ,x_{n})\mapsto 1/f(x_{1},\ldots ,x_{n}),$$

which is a function only if the set of the points (x1, …, xn) in the domain of f such that f(x1, …, xn) ≠ 0 contains an open subset of $\mathbb{R}^n$. This constraint implies that the above two algebras are not fields.

Univariable functions associated with a multivariable function

One can easily obtain a function in one real variable by giving a constant value to all but one of the variables. For example, if (a1, …, an) is a point of the interior of the domain of the function f, we can fix the values of x2, …, xn to a2, …, an respectively, to get a univariable function

$$x\mapsto f(x,a_{2},\ldots ,a_{n}),$$

whose domain contains an interval centered at a1. This function may also be viewed as the restriction of the function f to the line defined by the equations xi = ai for i = 2, …, n.

Other univariable functions may be defined by restricting f to any line passing through (a1, …, an). These are the functions

$$x\mapsto f(a_{1}+c_{1}x,a_{2}+c_{2}x,\ldots ,a_{n}+c_{n}x),$$

where the ci are real numbers that are not all zero.
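
A small Python sketch of these restrictions, building one-variable functions from a multivariable f either by freezing all variables but one or by restricting f to a line through a point a with direction c (the helper names and the example f are illustrative choices):

    def restrict_to_axis(f, a, i):
        """One-variable function x -> f(a_1, ..., a_{i-1}, x, a_{i+1}, ..., a_n)."""
        def g(x):
            point = list(a)
            point[i] = x
            return f(*point)
        return g

    def restrict_to_line(f, a, c):
        """One-variable function x -> f(a_1 + c_1 x, ..., a_n + c_n x)."""
        def g(x):
            return f(*(ai + ci * x for ai, ci in zip(a, c)))
        return g

    f = lambda x, y, z: x * y + z                          # an everywhere defined example
    g1 = restrict_to_axis(f, (1.0, 2.0, 3.0), 0)           # vary the first variable only
    g2 = restrict_to_line(f, (1.0, 2.0, 3.0), (1.0, 0.0, -1.0))
    print(g1(5.0), g2(2.0))                                # 13.0 7.0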

In the next section, we will show that if the multivariable function is continuous, then so are all these univariable functions; the converse, however, is not necessarily true.

Continuity and limit

Until the second half of the 19th century, only continuous functions were considered by mathematicians. At that time, the notion of continuity was elaborated for functions of one or several real variables, a rather long time before the formal definition of a topological space and of a continuous map between topological spaces. As continuous functions of several real variables are ubiquitous in mathematics, it is worth defining this notion without reference to the general notion of continuous maps between topological spaces.

For defining the continuity, it is useful to consider the distance function on $\mathbb{R}^n$, which is an everywhere defined function of 2n real variables:

$$d({\boldsymbol {x}},{\boldsymbol {y}})=d(x_{1},\ldots ,x_{n},y_{1},\ldots ,y_{n})={\sqrt {(x_{1}-y_{1})^{2}+\cdots +(x_{n}-y_{n})^{2}}}$$

A function f is continuous at a point a = (a1, …, an) which is interior to its domain if, for every positive real number ε, there is a positive real number φ such that |f(x) − f(a)| < ε for all x such that d(x, a) < φ. In other words, φ may be chosen small enough for the image under f of the ball of radius φ centered at a to be contained in the interval of length 2ε centered at f(a). A function is continuous if it is continuous at every point of its domain.

If a function is continuous at a, then all the univariate functions that are obtained by fixing all the variables xi except one at the value ai are continuous at ai. The converse is false: these univariate functions may all be continuous at ai while the function itself is not continuous at a. For an example, consider the function f such that f(0, 0) = 0, and that is otherwise defined by

$$f(x,y)={\frac {x^{2}y}{x^{4}+y^{2}}}.$$

The functions x ↦ f(x, 0) and y ↦ f(0, y) are both constant and equal to zero, and are therefore continuous. The function f is not continuous at (0, 0), because, if ε < 1/2 and y = x² ≠ 0, we have f(x, y) = 1/2, even if |x| is very small. Although not continuous, this function has the further property that all the univariate functions obtained by restricting it to a line passing through (0, 0) are also continuous. In fact, we have

$$f(x,\lambda x)={\frac {\lambda x}{x^{2}+\lambda ^{2}}}$$

for Ξ» β‰  0.

The limit at a point of a real-valued function of several real variables is defined as follows. Let a = (a1, a2, …, an) be a point in the topological closure of the domain X of the function f. The function f has a limit L when x tends toward a, denoted

$$L=\lim _{{\boldsymbol {x}}\to {\boldsymbol {a}}}f({\boldsymbol {x}}),$$

if the following condition is satisfied: For every positive real number Ξ΅ > 0, there is a positive real number Ξ΄ > 0 such that

$$|f({\boldsymbol {x}})-L|<\varepsilon$$

for all x in the domain such that

$$d({\boldsymbol {x}},{\boldsymbol {a}})<\delta .$$

If the limit exists, it is unique. If a is in the interior of the domain, the limit exists if and only if the function is continuous at a. In this case, we have

$$f({\boldsymbol {a}})=\lim _{{\boldsymbol {x}}\to {\boldsymbol {a}}}f({\boldsymbol {x}}).$$

When a is in the boundary of the domain of f, and if f has a limit at a, the latter formula allows one to "extend by continuity" the domain of f to a.

Symmetry

A symmetric function is a function f that is unchanged when two variables xi and xj are interchanged:

$$f(\ldots ,x_{i},\ldots ,x_{j},\ldots )=f(\ldots ,x_{j},\ldots ,x_{i},\ldots )$$

where i and j are each one of 1, 2, …, n. For example:

$$f(x,y,z,t)=t^{2}-x^{2}-y^{2}-z^{2}$$

is symmetric in x, y, z since interchanging any pair of x, y, z leaves f unchanged, but is not symmetric in all of x, y, z, t, since interchanging t with x or y or z gives a different function.
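
A brief Python check of this example, interchanging each pair of arguments at one sample point (the helper name swapped is an illustrative choice):

    from itertools import combinations

    def f(x, y, z, t):
        return t * t - x * x - y * y - z * z

    def swapped(g, i, j):
        """Return g with its i-th and j-th arguments interchanged."""
        def h(*args):
            a = list(args)
            a[i], a[j] = a[j], a[i]
            return g(*a)
        return h

    point = (1.0, 2.0, 3.0, 4.0)
    for i, j in combinations(range(4), 2):
        print((i, j), f(*point) == swapped(f, i, j)(*point))
    # swaps among x, y, z (indices 0, 1, 2) leave the value unchanged; swaps involving t do not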

Function composition

Suppose the functions

$$\xi _{1}=\xi _{1}(x_{1},x_{2},\ldots ,x_{n}),\quad \xi _{2}=\xi _{2}(x_{1},x_{2},\ldots ,x_{n}),\quad \ldots ,\quad \xi _{m}=\xi _{m}(x_{1},x_{2},\ldots ,x_{n}),$$

or more compactly ΞΎ = ΞΎ(x), are all defined on a domain X. As the n-tuple x = (x1, x2, …, xn) varies in X, a subset of R, the m-tuple ΞΎ = (ΞΎ1, ΞΎ2, …, ΞΎm) varies in another region Ξ a subset of R. To restate this:

$${\boldsymbol {\xi }}:X\to \Xi .$$

Then, a function ΢ of the functions ξ(x) defined on Ξ,

$${\begin{aligned}&\zeta :\Xi \to \mathbb {R} ,\\&\zeta =\zeta (\xi _{1},\xi _{2},\ldots ,\xi _{m}),\end{aligned}}$$

is a composite function defined on X; in other terms, the mapping

$${\begin{aligned}&\zeta :X\to \mathbb {R} ,\\&\zeta =\zeta (\xi _{1},\xi _{2},\ldots ,\xi _{m})=f(x_{1},x_{2},\ldots ,x_{n}).\end{aligned}}$$

Note the numbers m and n do not need to be equal.

For example, the function

$$f(x,y)=e^{xy}\left[\sin 3(x-y)\,\cos 2(x+y)\right]$$

defined everywhere on $\mathbb{R}^2$ can be rewritten by introducing

$$(\alpha ,\beta ,\gamma )=(\alpha (x,y),\beta (x,y),\gamma (x,y))=(xy,\ x-y,\ x+y)$$

which is also everywhere defined on $\mathbb{R}^2$, to obtain

$$f(x,y)=\zeta (\alpha (x,y),\beta (x,y),\gamma (x,y))=\zeta (\alpha ,\beta ,\gamma )=e^{\alpha }\left[\sin(3\beta )\cos(2\gamma )\right].$$

Function composition can be used to simplify functions, which is useful for carrying out multiple integrals and solving partial differential equations.
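
A short Python sketch of this particular composition (the names inner and zeta mirror the symbols above and are illustrative):

    import math

    def inner(x: float, y: float):
        """(alpha, beta, gamma) = (x y, x - y, x + y)."""
        return x * y, x - y, x + y

    def zeta(alpha: float, beta: float, gamma: float) -> float:
        return math.exp(alpha) * math.sin(3 * beta) * math.cos(2 * gamma)

    def f(x: float, y: float) -> float:
        """f as the composition zeta(inner(x, y))."""
        return zeta(*inner(x, y))

    x, y = 0.3, -0.7
    direct = math.exp(x * y) * math.sin(3 * (x - y)) * math.cos(2 * (x + y))
    print(abs(f(x, y) - direct) < 1e-12)   # True: composed and direct forms agree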

Calculus

Elementary calculus is the calculus of real-valued functions of one real variable, and the principal ideas of differentiation and integration of such functions can be extended to functions of more than one real variable; this extension is multivariable calculus.

Partial derivatives

Main article: Partial derivative

Partial derivatives can be defined with respect to each variable:

$${\frac {\partial }{\partial x_{1}}}f(x_{1},x_{2},\ldots ,x_{n})\,,\quad {\frac {\partial }{\partial x_{2}}}f(x_{1},x_{2},\ldots ,x_{n})\,,\quad \ldots ,\quad {\frac {\partial }{\partial x_{n}}}f(x_{1},x_{2},\ldots ,x_{n}).$$

Partial derivatives themselves are functions, each of which represents the rate of change of f parallel to one of the x1, x2, …, xn axes at all points in the domain (if the derivatives exist and are continuousβ€”see also below). A first derivative is positive if the function increases along the direction of the relevant axis, negative if it decreases, and zero if there is no increase or decrease. Evaluating a partial derivative at a particular point in the domain gives the rate of change of the function at that point in the direction parallel to a particular axis, a real number.
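
Partial derivatives can be approximated numerically; a generic Python sketch using central finite differences (the step size h and the example f are arbitrary illustrative choices, not a method prescribed by the article):

    def partial(f, i, point, h=1e-6):
        """Approximate the partial derivative of f with respect to x_i at the given point."""
        p_plus, p_minus = list(point), list(point)
        p_plus[i] += h
        p_minus[i] -= h
        return (f(*p_plus) - f(*p_minus)) / (2 * h)

    f = lambda x, y: x * x * y + y ** 3
    print(partial(f, 0, (1.0, 2.0)))   # ~4.0,  since df/dx = 2 x y
    print(partial(f, 1, (1.0, 2.0)))   # ~13.0, since df/dy = x^2 + 3 y^2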

For a real-valued function of a real variable, y = f(x), the ordinary derivative dy/dx is geometrically the gradient (slope) of the tangent line to the curve y = f(x) at each point in the domain. Partial derivatives extend this idea to tangent hyperplanes to a hypersurface, the graph of the function.

The second order partial derivatives can be calculated for every pair of variables:

$${\frac {\partial ^{2}}{\partial x_{1}^{2}}}f(x_{1},x_{2},\ldots ,x_{n})\,,\quad {\frac {\partial ^{2}}{\partial x_{1}\,\partial x_{2}}}f(x_{1},x_{2},\ldots ,x_{n})\,,\quad \ldots ,\quad {\frac {\partial ^{2}}{\partial x_{n}^{2}}}f(x_{1},x_{2},\ldots ,x_{n}).$$

Geometrically, they are related to the local curvature of the function's graph at all points in the domain. At any point where the function is well-defined, the function could be increasing along some axes, decreasing along other axes, and neither increasing nor decreasing along the remaining axes.

This leads to a variety of possible stationary points: global or local maxima, global or local minima, and saddle pointsβ€”the multidimensional analogue of inflection points for real functions of one real variable. The Hessian matrix is a matrix of all the second order partial derivatives, which are used to investigate the stationary points of the function, important for mathematical optimization.

In general, partial derivatives of higher order p have the form:

$${\frac {\partial ^{p}}{\partial x_{1}^{p_{1}}\partial x_{2}^{p_{2}}\cdots \partial x_{n}^{p_{n}}}}f(x_{1},x_{2},\ldots ,x_{n})\equiv {\frac {\partial ^{p_{1}}}{\partial x_{1}^{p_{1}}}}{\frac {\partial ^{p_{2}}}{\partial x_{2}^{p_{2}}}}\cdots {\frac {\partial ^{p_{n}}}{\partial x_{n}^{p_{n}}}}f(x_{1},x_{2},\ldots ,x_{n})$$

where p1, p2, …, pn are each integers between 0 and p such that p1 + p2 + β‹― + pn = p, using the definitions of zeroth partial derivatives as identity operators:

$${\frac {\partial ^{0}}{\partial x_{1}^{0}}}f(x_{1},x_{2},\ldots ,x_{n})=f(x_{1},x_{2},\ldots ,x_{n})\,,\quad \ldots ,\quad {\frac {\partial ^{0}}{\partial x_{n}^{0}}}f(x_{1},x_{2},\ldots ,x_{n})=f(x_{1},x_{2},\ldots ,x_{n})\,.$$

The number of possible partial derivatives increases with p, although some mixed partial derivatives (those with respect to more than one variable) are superfluous, because of the symmetry of second order partial derivatives. This reduces the number of partial derivatives to calculate for some p.

Multivariable differentiability

Main article: differentiable function

A function f(x) is differentiable at a point a in the interior of its domain if there is an n-tuple of numbers, in general dependent on a, A(a) = (A1(a), A2(a), …, An(a)), such that, for all x in a neighborhood of a:

$$f({\boldsymbol {x}})=f({\boldsymbol {a}})+{\boldsymbol {A}}({\boldsymbol {a}})\cdot ({\boldsymbol {x}}-{\boldsymbol {a}})+\alpha ({\boldsymbol {x}})|{\boldsymbol {x}}-{\boldsymbol {a}}|$$

where $\alpha ({\boldsymbol {x}})\to 0$ as $|{\boldsymbol {x}}-{\boldsymbol {a}}|\to 0$. This means that if f is differentiable at a point a, then f is continuous at a, although the converse is not true: continuity at a point does not imply differentiability there. If f is differentiable at a, then the first order partial derivatives exist at a and:

$$\left.{\frac {\partial f({\boldsymbol {x}})}{\partial x_{i}}}\right|_{{\boldsymbol {x}}={\boldsymbol {a}}}=A_{i}({\boldsymbol {a}})$$

for i = 1, 2, …, n, which can be found from the definitions of the individual partial derivatives, so the partial derivatives of f exist.

Assuming an n-dimensional analogue of a rectangular Cartesian coordinate system, these partial derivatives can be used to form a vectorial linear differential operator, called the gradient (also known as "nabla" or "del") in this coordinate system:

$$\nabla f({\boldsymbol {x}})=\left({\frac {\partial }{\partial x_{1}}},{\frac {\partial }{\partial x_{2}}},\ldots ,{\frac {\partial }{\partial x_{n}}}\right)f({\boldsymbol {x}})$$

used extensively in vector calculus, because it is useful for constructing other differential operators and compactly formulating theorems in vector calculus.

Then substituting the gradient βˆ‡f (evaluated at x = a) with a slight rearrangement gives:

$$f({\boldsymbol {x}})-f({\boldsymbol {a}})=\nabla f({\boldsymbol {a}})\cdot ({\boldsymbol {x}}-{\boldsymbol {a}})+\alpha |{\boldsymbol {x}}-{\boldsymbol {a}}|$$

where Β· denotes the dot product. This equation represents the best linear approximation of the function f at all points x within a neighborhood of a. For infinitesimal changes in f and x as x β†’ a:

$$df=\left.{\frac {\partial f({\boldsymbol {x}})}{\partial x_{1}}}\right|_{{\boldsymbol {x}}={\boldsymbol {a}}}dx_{1}+\left.{\frac {\partial f({\boldsymbol {x}})}{\partial x_{2}}}\right|_{{\boldsymbol {x}}={\boldsymbol {a}}}dx_{2}+\dots +\left.{\frac {\partial f({\boldsymbol {x}})}{\partial x_{n}}}\right|_{{\boldsymbol {x}}={\boldsymbol {a}}}dx_{n}=\nabla f({\boldsymbol {a}})\cdot d{\boldsymbol {x}}$$

which is defined as the total differential, or simply differential, of f, at a. This expression corresponds to the total infinitesimal change of f, by adding all the infinitesimal changes of f in all the xi directions. Also, df can be construed as a covector with basis vectors as the infinitesimals dxi in each direction and partial derivatives of f as the components.

Geometrically βˆ‡f is perpendicular to the level sets of f, given by f(x) = c which for some constant c describes an (n βˆ’ 1)-dimensional hypersurface. The differential of a constant is zero:

$$df=(\nabla f)\cdot d{\boldsymbol {x}}=0$$

in which dx is an infinitesimal change in x in the hypersurface f(x) = c, and since the dot product of βˆ‡f and dx is zero, this means βˆ‡f is perpendicular to dx.

In arbitrary curvilinear coordinate systems in n dimensions, the explicit expression for the gradient would not be so simple - there would be scale factors in terms of the metric tensor for that coordinate system. For the above case used throughout this article, the metric is just the Kronecker delta and the scale factors are all 1.
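
A numeric Python sketch of the gradient and of the linear approximation f(x) ≈ f(a) + ∇f(a)·(x − a) (finite differences again; the example f and the step size are illustrative assumptions):

    import math

    def grad(f, point, h=1e-6):
        """Approximate gradient of f at the given point by central differences."""
        g = []
        for i in range(len(point)):
            p_plus, p_minus = list(point), list(point)
            p_plus[i] += h
            p_minus[i] -= h
            g.append((f(*p_plus) - f(*p_minus)) / (2 * h))
        return g

    f = lambda x, y: math.exp(x) * math.sin(y)
    a, x = (0.5, 1.0), (0.51, 1.02)
    ga = grad(f, a)
    linear = f(*a) + sum(gi * (xi - ai) for gi, xi, ai in zip(ga, x, a))
    print(abs(f(*x) - linear))   # small: the error term is alpha(x) |x - a|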

Differentiability classes

If all first order partial derivatives evaluated at a point a in the domain:

$$\left.{\frac {\partial }{\partial x_{1}}}f({\boldsymbol {x}})\right|_{{\boldsymbol {x}}={\boldsymbol {a}}}\,,\quad \left.{\frac {\partial }{\partial x_{2}}}f({\boldsymbol {x}})\right|_{{\boldsymbol {x}}={\boldsymbol {a}}}\,,\quad \ldots ,\quad \left.{\frac {\partial }{\partial x_{n}}}f({\boldsymbol {x}})\right|_{{\boldsymbol {x}}={\boldsymbol {a}}}$$

exist and are continuous for all a in the domain, f has differentiability class $C^1$. In general, if all order p partial derivatives evaluated at a point a:

$$\left.{\frac {\partial ^{p}}{\partial x_{1}^{p_{1}}\partial x_{2}^{p_{2}}\cdots \partial x_{n}^{p_{n}}}}f({\boldsymbol {x}})\right|_{{\boldsymbol {x}}={\boldsymbol {a}}}$$

exist and are continuous, where p1, p2, …, pn, and p are as above, for all a in the domain, then f is differentiable to order p throughout the domain and has differentiability class $C^p$.

If f is of differentiability class $C^\infty$, f has continuous partial derivatives of all orders and is called smooth. If f is an analytic function and equals its Taylor series about any point in the domain, the notation $C^\omega$ denotes this differentiability class.

Multiple integration

Main article: Multiple integration

Definite integration can be extended to multiple integration over several real variables, with the notation:

$$\int _{R_{n}}\cdots \int _{R_{2}}\int _{R_{1}}f(x_{1},x_{2},\ldots ,x_{n})\,dx_{1}\,dx_{2}\cdots dx_{n}\equiv \int _{R}f({\boldsymbol {x}})\,d^{n}{\boldsymbol {x}}$$

where each region R1, R2, …, Rn is a subset of or all of the real line:

$$R_{1}\subseteq \mathbb {R} \,,\quad R_{2}\subseteq \mathbb {R} \,,\quad \ldots ,\quad R_{n}\subseteq \mathbb {R} ,$$

and their Cartesian product gives the region to integrate over as a single set:

$$R=R_{1}\times R_{2}\times \dots \times R_{n}\,,\quad R\subseteq \mathbb {R} ^{n}\,,$$

an n-dimensional hypervolume. When evaluated, a definite integral is a real number if the integral converges in the region R of integration (the result may diverge to infinity for a given region; in such cases the integral remains ill-defined). The variables are treated as "dummy" or "bound" variables which are substituted for numbers in the process of integration.

The integral of a real-valued function of a real variable y = f(x) with respect to x has geometric interpretation as the area bounded by the curve y = f(x) and the x-axis. Multiple integrals extend the dimensionality of this concept: assuming an n-dimensional analogue of a rectangular Cartesian coordinate system, the above definite integral has the geometric interpretation as the n-dimensional hypervolume bounded by f(x) and the x1, x2, …, xn axes, which may be positive, negative, or zero, depending on the function being integrated (if the integral is convergent).

While bounded hypervolume is a useful insight, the more important idea of definite integrals is that they represent total quantities within space. This has significance in applied mathematics and physics: if f is some scalar density field and x are the position vector coordinates, i.e. some scalar quantity per unit n-dimensional hypervolume, then integrating over the region R gives the total amount of the quantity in R. The more formal notion of hypervolume is the subject of measure theory. Above we used the Lebesgue measure; see Lebesgue integration for more on this topic.
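
A crude Python sketch of a double integral approximated by a midpoint Riemann sum over a rectangle (purely illustrative; in practice one would use a finer rule or a library routine such as scipy.integrate.nquad):

    def midpoint_double_integral(f, ax, bx, ay, by, n=200):
        """Approximate the integral of f(x, y) over the rectangle [ax, bx] x [ay, by]."""
        hx, hy = (bx - ax) / n, (by - ay) / n
        total = 0.0
        for i in range(n):
            x = ax + (i + 0.5) * hx
            for j in range(n):
                y = ay + (j + 0.5) * hy
                total += f(x, y)
        return total * hx * hy

    # The integral of x*y over the unit square is exactly 1/4.
    print(midpoint_double_integral(lambda x, y: x * y, 0.0, 1.0, 0.0, 1.0))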

Theorems

With the definitions of multiple integration and partial derivatives, key theorems can be formulated, including the fundamental theorem of calculus in several real variables (namely Stokes' theorem), integration by parts in several real variables, the symmetry of higher partial derivatives, and Taylor's theorem for multivariable functions. Evaluating a mixture of integrals and partial derivatives can be done using differentiation under the integral sign (the Leibniz integral rule).

Vector calculus

One can collect a number of functions each of several real variables, say

$$y_{1}=f_{1}(x_{1},x_{2},\ldots ,x_{n})\,,\quad y_{2}=f_{2}(x_{1},x_{2},\ldots ,x_{n})\,,\quad \ldots ,\quad y_{m}=f_{m}(x_{1},x_{2},\ldots ,x_{n})$$

into an m-tuple, or sometimes as a column vector or row vector, respectively:

$$(y_{1},y_{2},\ldots ,y_{m})\leftrightarrow {\begin{bmatrix}f_{1}(x_{1},x_{2},\ldots ,x_{n})\\f_{2}(x_{1},x_{2},\ldots ,x_{n})\\\vdots \\f_{m}(x_{1},x_{2},\ldots ,x_{n})\end{bmatrix}}\leftrightarrow {\begin{bmatrix}f_{1}(x_{1},x_{2},\ldots ,x_{n})&f_{2}(x_{1},x_{2},\ldots ,x_{n})&\cdots &f_{m}(x_{1},x_{2},\ldots ,x_{n})\end{bmatrix}}$$

all treated on the same footing as an m-component vector field, and use whichever form is convenient. All the above notations have a common compact notation y = f(x). The calculus of such vector fields is vector calculus. For more on the treatment of row vectors and column vectors of multivariable functions, see matrix calculus.

Implicit functions

A real-valued implicit function of several real variables is not written in the form "y = f(…)". Instead, the mapping is from the space $\mathbb{R}^{n+1}$ to the zero element in $\mathbb{R}$ (just the ordinary zero 0):

$${\begin{aligned}&\phi :\mathbb {R} ^{n+1}\to \{0\}\\&\phi (x_{1},x_{2},\ldots ,x_{n},y)=0\end{aligned}}$$

is an equation in all the variables. Implicit functions are a more general way to represent functions, since if:

$$y=f(x_{1},x_{2},\ldots ,x_{n})$$

then we can always define:

$$\phi (x_{1},x_{2},\ldots ,x_{n},y)=y-f(x_{1},x_{2},\ldots ,x_{n})=0$$

but the converse is not always possible, i.e. not all implicit functions have an explicit form.

For example, using interval notation, let

$${\begin{aligned}&\phi :X\to \{0\}\\&\phi (x,y,z)=\left({\frac {x}{a}}\right)^{2}+\left({\frac {y}{b}}\right)^{2}+\left({\frac {z}{c}}\right)^{2}-1=0\\&X=[-a,a]\times [-b,b]\times [-c,c]=\left\{(x,y,z)\in \mathbb {R} ^{3}\,:\,-a\leq x\leq a,\ -b\leq y\leq b,\ -c\leq z\leq c\right\}.\end{aligned}}$$

Choosing a 3-dimensional (3D) Cartesian coordinate system, this function describes the surface of a 3D ellipsoid centered at the origin (x, y, z) = (0, 0, 0) with constant semi-axes a, b, c along the positive x, y and z axes respectively. In the case a = b = c = r, we have a sphere of radius r centered at the origin. Other quadric surfaces which can be described similarly include the hyperboloid and paraboloid; more generally, so can any 2D surface in 3D Euclidean space. The above example can be solved for x, y or z; however it is much tidier to write it in an implicit form.

For a more sophisticated example:

$${\begin{aligned}&\phi :\mathbb {R} ^{4}\to \{0\}\\&\phi (t,x,y,z)=Ctze^{tx-yz}+A\sin(3\omega t)\left(x^{2}z-By^{6}\right)=0\end{aligned}}$$

for non-zero real constants A, B, C, Ο‰, this function is well-defined for all (t, x, y, z), but it cannot be solved explicitly for these variables and written as "t =", "x =", etc.

The implicit function theorem for more than two real variables deals with the continuity and differentiability of the function, as follows. Let ϕ(x1, x2, …, xn, y) be a continuous function with continuous first order partial derivatives, and let ϕ evaluated at a point (a, b) = (a1, a2, …, an, b) be zero:

$$\phi ({\boldsymbol {a}},b)=0;$$

and let the first partial derivative of Ο• with respect to y evaluated at (a, b) be non-zero:

$$\left.{\frac {\partial \phi ({\boldsymbol {x}},y)}{\partial y}}\right|_{({\boldsymbol {x}},y)=({\boldsymbol {a}},b)}\neq 0.$$

Then, there is an interval containing b, and a region R containing (a, b), such that for every x in R there is exactly one value of y in that interval satisfying ϕ(x, y) = 0; moreover, y is a continuous function of x, so that ϕ(x, y(x)) = 0. The total differentials of the functions are:

$$dy={\frac {\partial y}{\partial x_{1}}}dx_{1}+{\frac {\partial y}{\partial x_{2}}}dx_{2}+\dots +{\frac {\partial y}{\partial x_{n}}}dx_{n};$$

$$d\phi ={\frac {\partial \phi }{\partial x_{1}}}dx_{1}+{\frac {\partial \phi }{\partial x_{2}}}dx_{2}+\dots +{\frac {\partial \phi }{\partial x_{n}}}dx_{n}+{\frac {\partial \phi }{\partial y}}dy.$$

Substituting dy into the latter differential and equating coefficients of the differentials gives the first order partial derivatives of y with respect to xi in terms of the derivatives of the original function, each as a solution of the linear equation

$${\frac {\partial \phi }{\partial x_{i}}}+{\frac {\partial \phi }{\partial y}}{\frac {\partial y}{\partial x_{i}}}=0$$

for i = 1, 2, …, n.
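
A numeric Python sketch of this relation, ∂y/∂xi = −(∂ϕ/∂xi)/(∂ϕ/∂y), checked on the ellipsoid example with z playing the role of y (the semi-axes and the sample point are arbitrary choices for the check):

    import math

    def phi(x, y, z, a=2.0, b=3.0, c=4.0):
        return (x / a) ** 2 + (y / b) ** 2 + (z / c) ** 2 - 1.0

    def d(f, i, point, h=1e-6):
        """Central-difference partial derivative of f in its i-th argument."""
        p_plus, p_minus = list(point), list(point)
        p_plus[i] += h
        p_minus[i] -= h
        return (f(*p_plus) - f(*p_minus)) / (2 * h)

    # A point on the upper half of the ellipsoid, found by solving phi = 0 for z explicitly.
    x0, y0 = 0.5, 1.0
    z_explicit = lambda x, y: 4.0 * math.sqrt(1.0 - (x / 2.0) ** 2 - (y / 3.0) ** 2)
    p = (x0, y0, z_explicit(x0, y0))

    dz_dx_implicit = -d(phi, 0, p) / d(phi, 2, p)
    dz_dx_direct = (z_explicit(x0 + 1e-6, y0) - z_explicit(x0 - 1e-6, y0)) / 2e-6
    print(dz_dx_implicit, dz_dx_direct)   # the two values agree closely (about -0.55)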

Complex-valued function of several real variables

A complex-valued function of several real variables may be defined by relaxing, in the definition of the real-valued functions, the restriction of the codomain to the real numbers, and allowing complex values.

If f(x1, …, xn) is such a complex valued function, it may be decomposed as

$$f(x_{1},\ldots ,x_{n})=g(x_{1},\ldots ,x_{n})+ih(x_{1},\ldots ,x_{n}),$$

where g and h are real-valued functions. In other words, the study of the complex valued functions reduces easily to the study of the pairs of real valued functions.

This reduction works for the general properties. However, for an explicitly given function, such as:

$$z(x,y,\alpha ,a,q)={\frac {q}{2\pi }}\left[\ln \left(x+iy-ae^{i\alpha }\right)-\ln \left(x+iy+ae^{i\alpha }\right)\right]$$

the computation of the real and the imaginary part may be difficult.
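
Even when closed forms for the real and imaginary parts are awkward, they are easy to evaluate numerically; a small Python sketch using the standard cmath module (parameter values are arbitrary):

    import cmath

    def z(x, y, alpha, a, q):
        w = complex(x, y)
        return (q / (2 * cmath.pi)) * (cmath.log(w - a * cmath.exp(1j * alpha))
                                       - cmath.log(w + a * cmath.exp(1j * alpha)))

    val = z(1.0, 2.0, 0.3, 0.5, 1.0)
    g, h = val.real, val.imag   # the real-valued functions g and h of the decomposition above
    print(g, h)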

Applications

Multivariable functions of real variables arise inevitably in engineering and physics, because observable physical quantities are real numbers (with associated units and dimensions), and any one physical quantity will generally depend on a number of other quantities.

Examples of real-valued functions of several real variables

Examples in continuum mechanics include the local mass density ρ of a mass distribution, a scalar field which depends on the spatial position coordinates (here Cartesian to exemplify), r = (x, y, z), and time t:

$$\rho =\rho (\mathbf {r} ,t)=\rho (x,y,z,t)$$

Similarly for electric charge density for electrically charged objects, and numerous other scalar potential fields.

Another example is the velocity field, a vector field, which has components of velocity v = (vx, vy, vz) that are each multivariable functions of spatial coordinates and time similarly:

$$\mathbf {v} (\mathbf {r} ,t)=\mathbf {v} (x,y,z,t)=\left[v_{x}(x,y,z,t),\,v_{y}(x,y,z,t),\,v_{z}(x,y,z,t)\right]$$

Similarly for other physical vector fields such as electric fields and magnetic fields, and vector potential fields.

Another important example is the equation of state in thermodynamics, an equation relating the pressure P, temperature T, and volume V of a fluid; in general it has an implicit form:

$$f(P,V,T)=0$$

The simplest example is the ideal gas law:

$$f(P,V,T)=PV-nRT=0$$

where n is the number of moles, constant for a fixed amount of substance, and R the gas constant. Much more complicated equations of state have been empirically derived, but they all have the above implicit form.
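
A tiny Python sketch of the ideal gas law in both its implicit and its explicit form (SI units; the numerical values are arbitrary):

    R = 8.314  # gas constant, J/(mol K)

    def f(P, V, T, n=1.0):
        """Implicit form f(P, V, T) = P V - n R T, which vanishes on the equation-of-state surface."""
        return P * V - n * R * T

    def pressure(V, T, n=1.0):
        """Explicit solution P(V, T) obtained from the implicit form."""
        return n * R * T / V

    V, T = 0.024, 298.0
    P = pressure(V, T)
    print(P, f(P, V, T))   # f(P, V, T) is zero (up to rounding) at this state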

Real-valued functions of several real variables appear pervasively in economics. In the underpinnings of consumer theory, utility is expressed as a function of the amounts of various goods consumed, each amount being an argument of the utility function. The result of maximizing utility is a set of demand functions, each expressing the amount demanded of a particular good as a function of the prices of the various goods and of income or wealth. In producer theory, a firm is usually assumed to maximize profit as a function of the quantities of various goods produced and of the quantities of various factors of production employed. The result of the optimization is a set of demand functions for the various factors of production and a set of supply functions for the various products; each of these functions has as its arguments the prices of the goods and of the factors of production.

Examples of complex-valued functions of several real variables

Some "physical quantities" may be actually complex valued - such as complex impedance, complex permittivity, complex permeability, and complex refractive index. These are also functions of real variables, such as frequency or time, as well as temperature.

In two-dimensional fluid mechanics, specifically in the theory of the potential flows used to describe fluid motion in 2d, the complex potential

$$F(x,y,\ldots )=\varphi (x,y,\ldots )+i\psi (x,y,\ldots )$$

is a complex valued function of the two spatial coordinates x and y, and other real variables associated with the system. The real part is the velocity potential and the imaginary part is the stream function.

The spherical harmonics occur in physics and engineering as the solution to Laplace's equation, as well as the eigenfunctions of the z-component angular momentum operator, which are complex-valued functions of real-valued spherical polar angles:

$$Y_{\ell }^{m}=Y_{\ell }^{m}(\theta ,\phi )$$

In quantum mechanics, the wavefunction is necessarily complex-valued, but is a function of real spatial coordinates (or momentum components), as well as time t:

$$\Psi =\Psi (\mathbf {r} ,t)=\Psi (x,y,z,t)\,,\quad \Phi =\Phi (\mathbf {p} ,t)=\Phi (p_{x},p_{y},p_{z},t)$$

where Ψ and Φ are related to each other by a Fourier transform.
