Statistics / Probability - Distribution - (Function)


About

This section is about the term distribution, also known as probability distribution, where you get:

  • on the y axis, the probability
  • on the x axis, the event

The events can be seen as the possible outcomes of a single experiment.

The term “probability” asserts that each value in the set of possible values has its own probability of being seen when observing a random variable.

A probability distribution is a mathematical function that provides the probabilities of occurrence of different possible outcomes in an experiment.

In more technical terms, a probability distribution is a mathematical description of a random phenomenon (a random variable) in terms of the probabilities of events.
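As a minimal sketch of this definition, the snippet below builds the distribution of a hypothetical experiment (the sum of two fair six-sided dice, an example not taken from this page) as a mapping from each outcome to its probability.

```python
from fractions import Fraction
from itertools import product

# Hypothetical experiment: roll two fair six-sided dice and add them up.
outcomes = [a + b for a, b in product(range(1, 7), repeat=2)]

# The distribution maps each possible sum (x axis) to its probability (y axis).
pmf = {}
for total in outcomes:
    pmf[total] = pmf.get(total, Fraction(0)) + Fraction(1, len(outcomes))

for total in sorted(pmf):
    print(total, pmf[total])
```

Exact fractions are used so that the probabilities can be checked to add up to exactly 1.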

Many distributions are normal, but not all of them. A histogram can help to identify the type of distribution.

A box plot is a good summary of a distribution.
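A box plot is usually drawn from the five-number summary (minimum, first quartile, median, third quartile, maximum). The sketch below computes it for a made-up sample; the data and the choice of the "inclusive" quartile method are assumptions for illustration, and other quartile conventions give slightly different values.

```python
import statistics

# Hypothetical sample (not from this page).
sample = [2.1, 3.4, 3.9, 4.2, 4.8, 5.0, 5.5, 6.1, 7.3, 12.0]

# Quartiles split the sorted sample into four parts; q2 is the median.
q1, q2, q3 = statistics.quantiles(sample, n=4, method="inclusive")

five_number_summary = (min(sample), q1, q2, q3, max(sample))
iqr = q3 - q1  # interquartile range: the height of the box

print(five_number_summary, iqr)
```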

Discrete / Continuous

Discrete

There are two representations of a discrete distribution:

  • the Bayesian representation: a discrete distribution maps discrete values to probabilities such that the probabilities add up to 1.
  • the frequentist representation: an infinite list of values such that, as the number of samples n gets larger, sampling from the list and counting the frequency of each element approximates the Bayesian representation of the distribution.
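The link between the two representations can be sketched as follows. The loaded coin and the helper name `empirical_frequencies` are illustrative assumptions, not something defined on this page.

```python
import random
from collections import Counter

# Bayesian representation: values mapped to probabilities that add up to 1.
# Hypothetical example: a loaded coin.
pmf = {"heads": 0.7, "tails": 0.3}

def empirical_frequencies(pmf, n, seed=0):
    """Draw n samples from pmf and return the relative frequency of each value."""
    rng = random.Random(seed)
    values = list(pmf)
    weights = [pmf[v] for v in values]
    counts = Counter(rng.choices(values, weights=weights, k=n))
    return {v: counts[v] / n for v in values}

# Frequentist representation: the frequencies approach the pmf as n grows.
for n in (10, 1_000, 100_000):
    print(n, empirical_frequencies(pmf, n))
```

With a small n the frequencies can be far from 0.7 and 0.3; with a large n they should settle close to them.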

Continuous

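Standard continuous distributions include the Gaussian, beta, and uniform distributions. The binomial distribution, often used alongside them, is discrete.

Some pairs of distributions have convenient algebraic properties and are called conjugate: the prior and the posterior belong to the same family. For example, a uniform prior (which is the beta distribution with both parameters equal to 1) combined with a binomial likelihood results in a beta posterior.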
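The uniform/binomial example can be checked numerically. The sketch below multiplies a uniform prior by a binomial likelihood on a grid, normalizes the result, and compares it with the Beta(k + 1, n - k + 1) density. The data (7 successes in 10 trials), the grid size, and the function names are assumptions chosen for illustration.

```python
import math

def beta_pdf(theta, a, b):
    """Density of the Beta(a, b) distribution at theta."""
    norm = math.gamma(a) * math.gamma(b) / math.gamma(a + b)
    return theta ** (a - 1) * (1 - theta) ** (b - 1) / norm

def grid_posterior(k, n, steps=10_000):
    """Uniform prior times binomial likelihood, normalized on a grid over [0, 1]."""
    thetas = [(i + 0.5) / steps for i in range(steps)]  # grid midpoints
    prior = 1.0                                         # uniform density on [0, 1]
    unnorm = [prior * math.comb(n, k) * t ** k * (1 - t) ** (n - k) for t in thetas]
    total = sum(unnorm) / steps                         # midpoint-rule integral
    return thetas, [u / total for u in unnorm]

k, n = 7, 10  # hypothetical data: 7 successes in 10 trials
thetas, posterior = grid_posterior(k, n)

# Largest difference between the grid posterior and the closed-form beta density.
max_gap = max(abs(p - beta_pdf(t, k + 1, n - k + 1))
              for t, p in zip(thetas, posterior))
print(max_gap)
```

The gap should be tiny, limited only by the numerical integration, which is what conjugacy predicts.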

Function

A distribution can be specified by supplying:

Possible duplicate: Mathematics - Probability distribution function

Characteristics

  • Mode: for a discrete random variable, the value with highest probability (the location at which the probability mass function has its peak); for a continuous random variable, the location at which the probability density function has its peak.
  • Support: the smallest closed set whose complement has probability zero.
  • Head: the range of values where the pmf or pdf is relatively high.
  • Tail: the complement of the head within the support; the large set of values where the pmf or pdf is relatively low.
  • Expected value or mean: the weighted average of the possible values, using their probabilities as their weights; or the continuous analog thereof.
  • Median: the value such that the set of values less than the median has a probability of one-half.
  • Variance (also called dispersion or mean square, MS): the second moment of the pmf or pdf about the mean; an important measure of the dispersion of the distribution.
  • Standard deviation: the square root of the variance, and hence another measure of dispersion.
  • Symmetry: a property of some distributions in which the portion of the distribution to the left of a specific value is a mirror image of the portion to its right.
  • Skewness: a measure of the extent to which a pmf or pdf “leans” to one side of its mean.
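Several of the characteristics above can be computed directly from a pmf. The sketch below uses a made-up discrete distribution. For the median it takes the smallest value whose cumulative probability reaches one-half, which is one common convention for discrete variables and an assumption here.

```python
import math

# Hypothetical pmf of a discrete random variable (value -> probability).
pmf = {0: 0.1, 1: 0.3, 2: 0.35, 3: 0.2, 4: 0.05}

# Support: the values with non-zero probability.
support = sorted(x for x, p in pmf.items() if p > 0)

# Mode: the value where the pmf peaks.
mode = max(pmf, key=pmf.get)

# Expected value: probability-weighted average of the values.
mean = sum(x * p for x, p in pmf.items())

# Variance: second moment about the mean; standard deviation is its square root.
variance = sum((x - mean) ** 2 * p for x, p in pmf.items())
std_dev = math.sqrt(variance)

# Skewness: third moment about the mean, scaled by the cube of the standard deviation.
skewness = sum((x - mean) ** 3 * p for x, p in pmf.items()) / std_dev ** 3

def median(pmf):
    """Smallest value whose cumulative probability reaches one-half."""
    cumulative = 0.0
    for x in sorted(pmf):
        cumulative += pmf[x]
        if cumulative >= 0.5:
            return x

print(support, mode, mean, median(pmf), variance, std_dev, skewness)
```

A positive skewness here means the pmf has a longer tail to the right of its mean.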

Type

Management

Comparison

A Q-Q (quantile-quantile) plot compares two distributions by plotting their quantiles against each other.

Example with ggplot (current/stat_qq.html):

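library(ggplot2)
# One Q-Q series per presentation: sample quantiles of TOTAL_TIME_SEC against theoretical normal quantiles
ggplot(res_succes, aes(sample = TOTAL_TIME_SEC, colour = factor(PRESENTATION_NAME))) +
  geom_point(stat = "qq", size = 0.75)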
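To show what a Q-Q plot actually draws, here is a small sketch that computes the points for two samples without any plotting library. Unlike the ggplot example, which compares each group against a theoretical normal distribution, it compares two samples with each other. The function name `qq_points` and the simulated data are assumptions for illustration.

```python
import random
import statistics

def qq_points(sample_a, sample_b, n=20):
    """Pair up the same n - 1 quantiles of two samples (the points of a Q-Q plot)."""
    qa = statistics.quantiles(sample_a, n=n)
    qb = statistics.quantiles(sample_b, n=n)
    return list(zip(qa, qb))

rng = random.Random(0)
a = [rng.gauss(0, 1) for _ in range(5_000)]
b = [rng.gauss(0, 1) for _ in range(5_000)]        # same distribution as a
c = [rng.expovariate(1.0) for _ in range(5_000)]   # different shape

same = qq_points(a, b)       # points lie close to the line y = x
different = qq_points(a, c)  # points bend away from the line y = x

print(max(abs(x - y) for x, y in same))
print(max(abs(x - y) for x, y in different))
```

When the two samples come from the same distribution the points fall near the diagonal; a systematic departure from it signals a difference in location, spread, or shape.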

Visualization

Track

Monitoring Metrics - Distribution Summary

Documentation / Reference




