Misplaced Pages

Marsaglia polar method

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Method for generating pseudo-random numbers

The Marsaglia polar method is a pseudo-random number sampling method for generating a pair of independent standard normal random variables.

Standard normal random variables are frequently used in computer science, computational statistics, and in particular, in applications of the Monte Carlo method.

The polar method works by choosing random points (xy) in the square −1 < x < 1, −1 < y < 1 until

0 < s = x 2 + y 2 < 1 , {\displaystyle 0<s=x^{2}+y^{2}<1,\,}

and then returning the required pair of normal random variables as

x 2 ln ( s ) s ,     y 2 ln ( s ) s , {\displaystyle x{\sqrt {\frac {-2\ln(s)}{s}}}\,,\ \ y{\sqrt {\frac {-2\ln(s)}{s}}},}

or, equivalently,

x s 2 ln ( s ) ,     y s 2 ln ( s ) , {\displaystyle {\frac {x}{\sqrt {s}}}{\sqrt {-2\ln(s)}}\,,\ \ {\frac {y}{\sqrt {s}}}{\sqrt {-2\ln(s)}},}

where x / s {\displaystyle x/{\sqrt {s}}} and y / s {\displaystyle y/{\sqrt {s}}} represent the cosine and sine of the angle that the vector (x, y) makes with x axis.

Theoretical basis

The underlying theory may be summarized as follows:

If u is uniformly distributed in the interval 0 ≤ u < 1, then the point (cos(2πu), sin(2πu)) is uniformly distributed on the unit circumference x + y = 1, and multiplying that point by an independent random variable ρ whose distribution is

Pr ( ρ < a ) = 0 a r e r 2 / 2 d r {\displaystyle \Pr(\rho <a)=\int _{0}^{a}re^{-r^{2}/2}\,dr}

will produce a point

( ρ cos ( 2 π u ) , ρ sin ( 2 π u ) ) {\displaystyle \left(\rho \cos(2\pi u),\rho \sin(2\pi u)\right)}

whose coordinates are jointly distributed as two independent standard normal random variables.

History

This idea dates back to Laplace, whom Gauss credits with finding the above

I = e x 2 / 2 d x {\displaystyle I=\int _{-\infty }^{\infty }e^{-x^{2}/2}\,dx}

by taking the square root of

I 2 = e ( x 2 + y 2 ) / 2 d x d y = 0 2 π 0 r e r 2 / 2 d r d θ . {\displaystyle I^{2}=\int _{-\infty }^{\infty }\int _{-\infty }^{\infty }e^{-(x^{2}+y^{2})/2}\,dx\,dy=\int _{0}^{2\pi }\int _{0}^{\infty }re^{-r^{2}/2}\,dr\,d\theta .}

The transformation to polar coordinates makes evident that θ is uniformly distributed (constant density) from 0 to 2π, and that the radial distance r has density

r e r 2 / 2 . {\displaystyle re^{-r^{2}/2}.\,}

(r has the appropriate chi square distribution.)

This method of producing a pair of independent standard normal variates by radially projecting a random point on the unit circumference to a distance given by the square root of a chi-square-2 variate is called the polar method for generating a pair of normal random variables,

Practical considerations

A direct application of this idea,

x = 2 ln ( u 1 ) cos ( 2 π u 2 ) , y = 2 ln ( u 1 ) sin ( 2 π u 2 ) {\displaystyle x={\sqrt {-2\ln(u_{1})}}\cos(2\pi u_{2}),\quad y={\sqrt {-2\ln(u_{1})}}\sin(2\pi u_{2})}

is called the Box–Muller transform, in which the chi variate is usually generated as

2 ln ( u 1 ) , {\displaystyle {\sqrt {-2\ln(u_{1})}},}

but that transform requires logarithm, square root, sine and cosine functions. On some processors, the cosine and sine of the same argument can be calculated in parallel using a single instruction. Notably for Intel-based machines, one can use fsincos assembler instruction or the expi instruction (available e.g. in D), to calculate complex

expi ( z ) = e i z = cos ( z ) + i sin ( z ) , {\displaystyle \operatorname {expi} (z)=e^{iz}=\cos(z)+i\sin(z),\,}

and just separate the real and imaginary parts.

Note: To explicitly calculate the complex-polar form use the following substitutions in the general form,

Let r = 2 ln ( u 1 ) {\displaystyle r={\sqrt {-2\ln(u_{1})}}} and z = 2 π u 2 . {\displaystyle z=2\pi u_{2}.} Then

  r e i z = 2 ln ( u 1 ) e i 2 π u 2 = 2 ln ( u 1 ) [ cos ( 2 π u 2 ) + i sin ( 2 π u 2 ) ] . {\displaystyle \ re^{iz}={\sqrt {-2\ln(u_{1})}}e^{i2\pi u_{2}}={\sqrt {-2\ln(u_{1})}}\left.}

In contrast, the polar method here removes the need to calculate a cosine and sine. Instead, by solving for a point on the unit circle, these two functions can be replaced with the x and y coordinates normalized to the x 2 + y 2 {\displaystyle {\sqrt {x^{2}+y^{2}}}} radius. In particular, a random point (xy) inside the unit circle is projected onto the unit circumference by setting s = x 2 + y 2 {\displaystyle s=x^{2}+y^{2}} and forming the point

( x s , y s ) , {\displaystyle \left({\frac {x}{\sqrt {s}}},{\frac {y}{\sqrt {s}}}\right),\,}

which is a faster procedure than calculating the cosine and sine. Some researchers argue that the conditional if instruction (for rejecting a point outside of the unit circle), can make programs slower on modern processors equipped with pipelining and branch prediction. Also this procedure requires about 27% more evaluations of the underlying random number generator (only π / 4 79 % {\displaystyle \pi /4\approx 79\%} of generated points lie inside of unit circle).

That random point on the circumference is then radially projected the required random distance by means of

2 ln ( s ) , {\displaystyle {\sqrt {-2\ln(s)}},\,}

using the same s because that s is independent of the random point on the circumference and is itself uniformly distributed from 0 to 1.

Implementation

Python

A simple implementation in Python:

import math
import random
def marsaglia_sample():
    while True:
        U1 = random.uniform(-1, 1)
        U2 = random.uniform(-1, 1)
        if (w := U1**2 + U2**2) < 1:
            break
    Z1 = U1 * math.sqrt(-2 * math.log(w) / w)
    Z2 = U2 * math.sqrt(-2 * math.log(w) / w)
    return Z1, Z2

Java

Simple implementation in Java using the mean and standard deviation:

private static double spare;
private static boolean hasSpare = false;
public static synchronized double generateGaussian(double mean, double stdDev) {
    if (hasSpare) {
        hasSpare = false;
        return spare * stdDev + mean;
    } else {
        double u, v, s;
        do {
            u = Math.random() * 2 - 1;
            v = Math.random() * 2 - 1;
            s = u * u + v * v;
        } while (s >= 1 || s == 0);
        s = Math.sqrt(-2.0 * Math.log(s) / s);
        spare = v * s;
        hasSpare = true;
        return mean + stdDev * u * s;
    }
}

C++

A non-thread safe implementation in C++ using the mean and standard deviation:

double generateGaussian(double mean, double stdDev) {
    static double spare;
    static bool hasSpare = false;
    if (hasSpare) {
        hasSpare = false;
        return spare * stdDev + mean;
    } else {
        double u, v, s;
        do {
            u = (rand() / ((double)RAND_MAX)) * 2.0 - 1.0;
            v = (rand() / ((double)RAND_MAX)) * 2.0 - 1.0;
            s = u * u + v * v;
        } while (s >= 1.0 || s == 0.0);
        s = sqrt(-2.0 * log(s) / s);
        spare = v * s;
        hasSpare = true;
        return mean + stdDev * u * s;
    }
}

C++11 GNU GCC libstdc++'s implementation of std::normal_distribution uses the Marsaglia polar method, as quoted from herein.

Julia

A simple Julia implementation:

"""
    marsagliasample(N)
Generate `2N` samples from the standard normal distribution using the Marsaglia method.
"""
function marsagliasample(N)
    z = Array{Float64}(undef,N,2);
    for i in axes(z,1)
        s = Inf;
        while s > 1
            z .= 2rand(2) .- 1;
            s = sum(abs2.(z))
        end
        z .*= sqrt(-2log(s)/s);
    end
    vec(z)
end
"""
    marsagliasample(n,μ,σ)
Generate `n` samples from the normal distribution with mean `μ` and standard deviation `σ` using the Marsaglia method.
"""
function marsagliasample(n,μ,σ)
    μ .+ σ*marsagliasample(cld(n,2));
end

The for loop can be parallelized by using the Threads.@threads macro.

References

  1. Marsaglia, G.; Bray, T. A. (1964). "A Convenient Method for Generating Normal Variables". SIAM Review. 6 (3): 260–264. Bibcode:1964SIAMR...6..260M. doi:10.1137/1006063. JSTOR 2027592.
  2. Peter E. Kloeden Eckhard Platen Henri Schurz, Numerical Solution of SDE Through Computer Experiments, Springer, 1994.
  3. Kanter, David. "Intel's Ivy Bridge Graphics Architecture". Real World Tech. Retrieved 8 April 2013.
  4. This effect can be heightened in a GPU generating many variates in parallel, where a rejection on one processor can slow down many other processors. See section 7 of Thomas, David B.; Howes, Lee W.; Luk, Wayne (2009), "A comparison of CPUs, GPUs, FPGAs, and massively parallel processor arrays for random number generation", in Chow, Paul; Cheung, Peter Y. K. (eds.), Proceedings of the ACM/SIGDA 17th International Symposium on Field Programmable Gate Arrays, FPGA 2009, Monterey, California, USA, February 22–24, 2009, Association for Computing Machinery, pp. 63–72, CiteSeerX 10.1.1.149.6066, doi:10.1145/1508128.1508139, ISBN 9781605584102, S2CID 465785.
Categories: