Law of Large Numbers
Given a sequence of iid random variables , the sample mean will converge to as .
Formally:Note: The variances don’t actually have to be identical.
This law holds for many but not all distributions, e.g. the cauchy distribution with undefined mean, or distributions with infinite mean, like the pareto distribution with .
The Central Limit Theorem limit theorem is stronger, also describing the shape the distribution will take.
Proof, handwavy:
As , .
→ Averaging doesn’t change the mean, but it reduces noise.
Proof, rigorous:
By chebyshev inequality, for any :
Strong Law of Large Numbers
The strong version of the law holds even if the variables are not identically distributed, as long as they are independent/uncorrelated and satisfy → The sample mean converges to the true mean almost surely (with probability 1):