Joint distribution 🌿

For two random variables , their joint probability distribution is a probability distribution that gives the probability of each possible pair of outcomes for and . It is denoted as or simply for discrete variables, and for the density in continuous cases.

Notation

: Probability mass function (discrete) - gives actual probability values
: Probability density function (continuous) - must be integrated to get probabilities
: Marginal density of
: Conditional density of given

Probability from density

The probability of being in some 2D area is the integral of the PDF over that area:

Marginal density

The marginal density of a variable is obtained by integrating the joint density over the other variable:

Here we’re integrating out (averaging over all possible values of) , to get the marginal density of , i.e. the density of regardless of .
This also work with discrete distributions:

Conditional probability

The conditional probability of given is given by:

This is almost identical to the standard conditional probability (probability of AND divided by the probability of ), but more general, as it’s a conditional density function of all values of and .
This is the same formula but using density notation - denotes the conditional density function of given :