Misplaced Pages

Kramers–Moyal expansion

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.

In stochastic processes, Kramers–Moyal expansion refers to a Taylor series expansion of the master equation, named after Hans Kramers and José Enrique Moyal. In many textbooks, the expansion is used only to derive the Fokker–Planck equation, and never used again. In general, continuous stochastic processes are essentially all Markovian, and so Fokker–Planck equations are sufficient for studying them. The higher-order Kramers–Moyal expansion only come into play when the process is jumpy. This usually means it is a Poisson-like process.

For a real stochastic process, one can compute its central moment functions from experimental data on the process, from which one can then compute its Kramers–Moyal coefficients, and thus empirically measure its Kolmogorov forward and backward equations. This is implemented as a python package

Statement

Start with the integro-differential master equation

p ( x , t ) t = p ( x , t | x 0 , t 0 ) p ( x 0 , t 0 ) d x 0 {\displaystyle {\frac {\partial p(x,t)}{\partial t}}=\int p(x,t|x_{0},t_{0})p(x_{0},t_{0})dx_{0}}

where p ( x , t | x 0 , t 0 ) {\displaystyle p(x,t|x_{0},t_{0})} is the transition probability function, and p ( x , t ) {\displaystyle p(x,t)} is the probability density at time t {\displaystyle t} . The Kramers–Moyal expansion transforms the above to an infinite order partial differential equation

t p ( x , t ) = n = 1 ( x ) n [ D n ( x , t ) p ( x , t ) ] {\displaystyle \partial _{t}p(x,t)=\sum _{n=1}^{\infty }(-\partial _{x})^{n}}

and also t p ( x , t | x 0 , t 0 ) = n = 1 ( x ) n [ D n ( x , t ) p ( x , t | x 0 , t 0 ) ] {\displaystyle \partial _{t}p(x,t|x_{0},t_{0})=\sum _{n=1}^{\infty }(-\partial _{x})^{n}}

where D n ( x , t ) {\displaystyle D_{n}(x,t)} are the Kramers–Moyal coefficients, defined by D n ( x , t ) = 1 n ! lim τ 0 1 τ μ n ( t | x , t τ ) {\displaystyle D_{n}(x,t)={\frac {1}{n!}}\lim _{\tau \to 0}{\frac {1}{\tau }}\mu _{n}(t|x,t-\tau )} and μ n {\displaystyle \mu _{n}} are the central moment functions, defined by

μ n ( t | x , t ) = ( x x ) n p ( x , t x , t )   d x . {\displaystyle \mu _{n}(t'|x,t)=\int _{-\infty }^{\infty }(x'-x)^{n}p(x',t'\mid x,t)\ dx'.}

The Fokker–Planck equation is obtained by keeping only the first two terms of the series in which D 1 {\displaystyle D_{1}} is the drift and D 2 {\displaystyle D_{2}} is the diffusion coefficient.

Also, the moments, assuming they exist, evolves as

t x n = k = 1 n n ! ( n k ) ! x n k D ( k ) ( x , t ) {\displaystyle {\frac {\partial }{\partial t}}\left\langle x^{n}\right\rangle =\sum _{k=1}^{n}{\frac {n!}{(n-k)!}}\left\langle x^{n-k}D^{(k)}(x,t)\right\rangle } where angled brackets mean taking the expectation: f = f ( x ) p ( x , t ) d x {\displaystyle \left\langle f\right\rangle =\int f(x)p(x,t)dx} .

n-dimensional version

The above version is the one-dimensional version. It generalizes to n-dimensions. (Section 4.7 )

Proof

In usual probability, where the probability density does not change, the moments of a probability density function determines the probability density itself by a Fourier transform (details may be found at the characteristic function page): p ( x ) = 1 2 π e i k x p ~ ( k ) d k = n = 0 ( 1 ) n n ! δ ( n ) ( x ) μ n {\displaystyle p(x)={\frac {1}{2\pi }}\int e^{-ikx}{\tilde {p}}(k)dk=\sum _{n=0}^{\infty }{\frac {(-1)^{n}}{n!}}\delta ^{(n)}(x)\mu _{n}} p ~ ( k ) = e i k x p ( x ) d x = n = 0 ( i k ) n n ! μ n {\displaystyle {\tilde {p}}(k)=\int e^{ikx}p(x)dx=\sum _{n=0}^{\infty }{\frac {(ik)^{n}}{n!}}\mu _{n}} Similarly, p ( x , t | x 0 , t 0 ) = n = 0 ( 1 ) n n ! δ ( n ) ( x x 0 ) μ n ( t | x 0 , t 0 ) {\displaystyle p(x,t|x_{0},t_{0})=\sum _{n=0}^{\infty }{\frac {(-1)^{n}}{n!}}\delta ^{(n)}(x-x_{0})\mu _{n}(t|x_{0},t_{0})} Now we need to integrate away the Dirac delta function. Fixing a small τ > 0 {\displaystyle \tau >0} , we have by the Chapman-Kolmogorov equation, p ( x , t ) = p ( x , t | x , t τ ) p ( x , t τ ) d x = n = 0 ( 1 ) n n ! p ( x , t τ ) δ ( n ) ( x x ) μ n ( t | x , t τ ) d x = n = 0 ( 1 ) n n ! x n ( p ( x , t τ ) μ n ( t | x , t τ ) ) {\displaystyle {\begin{aligned}p(x,t)&=\int p(x,t|x',t-\tau )p(x',t-\tau )dx'\\&=\sum _{n=0}^{\infty }{\frac {(-1)^{n}}{n!}}\int p(x',t-\tau )\delta ^{(n)}(x-x')\mu _{n}(t|x',t-\tau )dx'\\&=\sum _{n=0}^{\infty }{\frac {(-1)^{n}}{n!}}\partial _{x}^{n}(p(x,t-\tau )\mu _{n}(t|x,t-\tau ))\end{aligned}}} The n = 0 {\displaystyle n=0} term is just p ( x , t τ ) {\displaystyle p(x,t-\tau )} , so taking derivative with respect to time, t p ( x , t ) = lim τ 0 + 1 τ n = 1 ( 1 ) n n ! x n ( p ( x , t τ ) μ n ( t | x , t τ ) ) = n = 1 ( x ) n ( p ( x , t ) D n ( x , t ) ) {\displaystyle \partial _{t}p(x,t)=\lim _{\tau \to 0^{+}}{\frac {1}{\tau }}\sum _{n=1}^{\infty }{\frac {(-1)^{n}}{n!}}\partial _{x}^{n}(p(x,t-\tau )\mu _{n}(t|x,t-\tau ))=\sum _{n=1}^{\infty }(-\partial _{x})^{n}(p(x,t)D_{n}(x,t))}

The same computation with p ( x , t | x 0 , t 0 ) {\displaystyle p(x,t|x_{0},t_{0})} gives the other equation.

Forward and backward equations

The equation can be recast into a linear operator form, using the idea of infinitesimal generator. Define the linear operator A f := n = 1 ( x ) n [ D n ( x , t ) f ( x , t ) ] {\displaystyle {\mathcal {A}}f:=\sum _{n=1}^{\infty }(-\partial _{x})^{n}} then the equation above states t p ( x , t ) = A p ( x , t ) t p ( x , t | x 0 , t 0 ) = A p ( x , t | x 0 , t 0 ) {\displaystyle {\begin{aligned}\partial _{t}p(x,t)&={\mathcal {A}}p(x,t)\\\partial _{t}p(x,t|x_{0},t_{0})&={\mathcal {A}}p(x,t|x_{0},t_{0})\end{aligned}}} In this form, the equations are precisely in the form of a general Kolmogorov forward equation. The backward equation then states that t p ( x 1 , t 1 | x , t ) = A p ( x 1 , t 1 | x , t ) {\displaystyle \partial _{t}p(x_{1},t_{1}|x,t)=-{\mathcal {A}}^{\dagger }p(x_{1},t_{1}|x,t)} where A f := n = 1 D n ( x , t ) x n [ f ( x , t ) ] {\displaystyle {\mathcal {A}}^{\dagger }f:=\sum _{n=1}^{\infty }D_{n}(x,t)\partial _{x}^{n}} is the Hermitian adjoint of A {\displaystyle {\mathcal {A}}} .

Computing the Kramers–Moyal coefficients

By definition, D n ( x , t ) = 1 n ! lim τ 0 1 τ μ n ( t | x , t τ ) {\displaystyle D_{n}(x,t)={\frac {1}{n!}}\lim _{\tau \to 0}{\frac {1}{\tau }}\mu _{n}(t|x,t-\tau )} This definition works because μ n ( t | x , t ) = 0 {\displaystyle \mu _{n}(t|x,t)=0} , as those are the central moments of the Dirac delta function. Since the even central moments are nonnegative, we have D 2 n 0 {\displaystyle D_{2n}\geq 0} for all n 1 {\displaystyle n\geq 1} . When the stochastic process is the Markov process d X = b d t + σ d W t {\displaystyle dX=bdt+\sigma dW_{t}} , we can directly solve for p ( x , t | x , t τ ) {\displaystyle p(x,t|x,t-\tau )} as approximated by a normal distribution with mean x + b ( x ) τ {\displaystyle x+b(x)\tau } and variance σ 2 τ {\displaystyle \sigma ^{2}\tau } . This then allows us to compute the central moments, and so D 1 = b , D 2 = 1 2 σ 2 , D 3 = D 4 = = 0 {\displaystyle D_{1}=b,\quad D_{2}={\frac {1}{2}}\sigma ^{2},\quad D_{3}=D_{4}=\cdots =0} This then gives us the 1-dimensional Fokker–Planck equation: t p = x ( b p ) + 1 2 x 2 ( σ 2 p ) {\displaystyle \partial _{t}p=-\partial _{x}(bp)+{\frac {1}{2}}\partial _{x}^{2}(\sigma ^{2}p)}

Pawula theorem

Pawula theorem states that either the sequence D 1 , D 2 , D 3 , . . . {\displaystyle D_{1},D_{2},D_{3},...} becomes zero at the third term, or all its even terms are positive.

Proof

By Cauchy–Schwarz inequality, the central moment functions satisfy μ n + m 2 μ 2 n μ 2 m {\displaystyle \mu _{n+m}^{2}\leq \mu _{2n}\mu _{2m}} . So, taking the limit, we have D n + m 2 ( 2 n ) ! ( 2 m ) ! ( n + m ) ! 2 D 2 n D 2 m {\displaystyle D_{n+m}^{2}\leq {\frac {(2n)!(2m)!}{(n+m)!^{2}}}D_{2n}D_{2m}} . If some D 2 + n 0 {\displaystyle D_{2+n}\neq 0} for some n 1 {\displaystyle n\geq 1} , then D 2 D 2 + 2 n > 0 {\displaystyle D_{2}D_{2+2n}>0} . In particular, D 2 + n , D 2 + 2 n , D 2 + 4 n , . . . > 0 {\displaystyle D_{2+n},D_{2+2n},D_{2+4n},...>0} . So the existence of any nonzero coefficient of order 3 {\displaystyle \geq 3} implies the existence of nonzero coefficients of arbitrarily large order. Also, if D n 0 {\displaystyle D_{n}\neq 0} , then D 2 D 2 n 2 > 0 , D 4 D 2 n 4 > 0 , . . . {\displaystyle D_{2}D_{2n-2}>0,D_{4}D_{2n-4}>0,...} . So the existence of any nonzero coefficient of order n {\displaystyle n} implies all coefficients of order 2 , 4 , . . . , 2 n 2 {\displaystyle 2,4,...,2n-2} are positive.

Interpretation

Let the operator A m {\displaystyle {\mathcal {A}}_{m}} be defined such A m f := n = 1 m ( x ) n [ D n ( x , t ) f ( x , t ) ] {\displaystyle {\mathcal {A}}_{m}f:=\sum _{n=1}^{m}(-\partial _{x})^{n}} . The probability density evolves by t ρ A m ρ {\displaystyle \partial _{t}\rho \approx {\mathcal {A}}_{m}\rho } . Different order of m {\displaystyle m} gives different level of approximation.

  • m = 0 {\displaystyle m=0} : the probability density does not evolve
  • m = 1 {\displaystyle m=1} : it evolves by deterministic drift only.
  • m = 2 {\displaystyle m=2} : it evolves by drift and Brownian motion (Fokker-Planck equation)
  • m = {\displaystyle m=\infty } : the fully exact equation.

Pawula theorem means that if truncating to the second term is not exact, that is, A 2 A {\displaystyle {\mathcal {A}}_{2}\neq {\mathcal {A}}} , then truncating to any term is still not exact. Usually, this means that for any truncation A m {\displaystyle {\mathcal {A}}_{m}} , there exists a probability density function ρ {\displaystyle \rho } that can become negative during its evolution t ρ A m ρ {\displaystyle \partial _{t}\rho \approx {\mathcal {A}}_{m}\rho } (and thus fail to be a probability density function). However, this doesn't mean that Kramers-Moyal expansions truncated at other choices of m {\displaystyle m} is useless. Though the solution must have negative values at least for sufficiently small times, the resulting approximation probability density may still be better than the m = 2 {\displaystyle m=2} approximation.

References

  1. Kramers, H. A. (1940). "Brownian motion in a field of force and the diffusion model of chemical reactions". Physica. 7 (4): 284–304. Bibcode:1940Phy.....7..284K. doi:10.1016/S0031-8914(40)90098-2. S2CID 33337019.
  2. Moyal, J. E. (1949). "Stochastic processes and statistical physics". Journal of the Royal Statistical Society. Series B (Methodological). 11 (2): 150–210. doi:10.1111/j.2517-6161.1949.tb00030.x. JSTOR 2984076.
  3. Risken, Hannes (6 December 2012). The Fokker-Planck Equation: Methods of Solution and Applications. Springer. ISBN 9783642968075.
  4. Tabar, M. Reza Rahimi (2019), Rahimi Tabar, M. Reza (ed.), "Stochastic Processes with Jumps and Non-vanishing Higher-Order Kramers–Moyal Coefficients", Analysis and Data-Based Reconstruction of Complex Nonlinear Dynamical Systems: Using the Methods of Stochastic Processes, Understanding Complex Systems, Cham: Springer International Publishing, pp. 99–110, doi:10.1007/978-3-030-18472-8_11, ISBN 978-3-030-18472-8, retrieved 2023-06-09
  5. Spinney, Richard; Ford, Ian (2013). "Fluctuation relations: a pedagogical overview". In Klages, Rainer; Just, Wolfram; Jarzynski, Christopher (eds.). Nonequilibrium Statistical Physics of Small Systems: Fluctuation relations and beyond. Reviews of Nonlinear Dynamics and Complexity. Weinheim: Wiley-VCH. pp. 3–56. arXiv:1201.6381. doi:10.1002/9783527658701.ch1. ISBN 978-3-527-41094-1. MR 3308060.
  6. Rydin Gorjão, L.; Meirinhos, F. (2019). "kramersmoyal: Kramers--Moyal coefficients for stochastic processes". Journal of Open Source Software. 4 (44): 1693. arXiv:1912.09737. Bibcode:2019JOSS....4.1693G. doi:10.21105/joss.01693.
  7. Gardiner, C. (2009). Stochastic Methods (4th ed.). Berlin: Springer. ISBN 978-3-642-08962-6.
  8. Van Kampen, N. G. (1992). Stochastic Processes in Physics and Chemistry. Elsevier. ISBN 0-444-89349-0.
  9. ^ Risken, H. (1996). The Fokker–Planck Equation. Berlin, Heidelberg: Springer. pp. 63–95. ISBN 3-540-61530-X.
  10. Paul, Wolfgang; Baschnagel, Jörg (2013). "A Brief Survey of the Mathematics of Probability Theory". Stochastic Processes. Springer. pp. 17–61 . doi:10.1007/978-3-319-00327-6_2. ISBN 978-3-319-00326-9.
  11. Tabar, M. Reza Rahimi (2019), Rahimi Tabar, M. Reza (ed.), "Kramers–Moyal Expansion and Fokker–Planck Equation", Analysis and Data-Based Reconstruction of Complex Nonlinear Dynamical Systems: Using the Methods of Stochastic Processes, Understanding Complex Systems, Cham: Springer International Publishing, pp. 19–29, doi:10.1007/978-3-030-18472-8_3, ISBN 978-3-030-18472-8, retrieved 2023-06-09
  12. Pawula, R. F. (1967). "Generalizations and extensions of the Fokker–Planck–Kolmogorov equations" (PDF). IEEE Transactions on Information Theory. 13 (1): 33–41. doi:10.1109/TIT.1967.1053955.
  13. Pawula, R. F. (1967). "Approximation of the linear Boltzmann equation by the Fokker–Planck equation". Physical Review. 162 (1): 186–188. Bibcode:1967PhRv..162..186P. doi:10.1103/PhysRev.162.186.
Categories: