Statistics - (Normal|Gaussian) Distribution - Bell Curve

1 - About

A normal distribution is one of underlying assumptions of a lot of statistical procedures.

In nature, every outcome that depends on the sum of many independent events will approximate the Gaussian distribution after some time, if respected the assumptions of the Central limit theorem.

Because of the Central limit theorem, the normal distribution plays a fundamental role in probability theory and statistics.

The normal distribution is commonly denoted as <math>N(0,1)</math> .

3 - Properties

The properties of a normal distribution are well-known:

See density for the function

4 - Explication

Considering the classic bean machine (Galtonboard, Galtonbrett Simulation, quincunx or Galton).

The Galtonboard is a device invented by Sir Francis Galton to demonstrate the central limit theorem, in particular that the normal distribution is a good approximate to the binomial distribution.

There's only one way for a ball to reach the first column
There are four ways to reach the second column
There are six ways to reach the third column
Because the machine is symetrical, after some time it will look like a gaussian distribution

5 - Function

5.1 - Density

The Gaussian density function has the form:

<MATH> f(x) = \frac{1}{\sqrt{2\pi}\sigma} e^{\displaystyle -\frac{1}{2} \left (\frac{x-\mu}{\sigma} \right )^2} </MATH>


  • <math>\mu</math> is the mean
  • <math>\sigma</math> is the variance

As notated on the figure, the probabilities of intervals of values correspond to the area under the curve.

5.2 - Cumulative

The cumulative distribution function (CDF) is noted <math>\Phi(z)</math>.


  • <math>\mu</math> is the mean
  • <math>\sigma</math> is the variance

5.3 - Approximation

Trigonometry - (Cosine|Cosinus) <MATH> f(x) = \frac{1+cos(x)}{2\pi} </MATH> This approximation can be integrated in closed form

6 - Documentation / Reference

data_mining/normal_distribution.txt · Last modified: 2017/09/13 16:04 by gerardnico